moonshotai's Collections
Moonlight-A3B
Updated 1 day ago
Moonshot's compute-efficient MoE LLM, the first scale-up of the Muon optimizer
moonshotai/Moonlight-16B-A3B-Instruct • Text Generation • 16B • Updated Mar 3, 2025 • 36.6k • 186
moonshotai/Moonlight-16B-A3B • Text Generation • 16B • Updated Feb 26, 2025 • 45.7k • 102
Muon is Scalable for LLM Training • Paper • 2502.16982 • Published Feb 24, 2025 • 10