Nemotron-Cascade 2 Collection Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated about 4 hours ago • 18
view article Article **Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding** 1 day ago • 34
Mistral Small 4 Collection A state-of-the-art model, open-weight, with a granular Mixture-of-Experts architecture that fuses instruct, reasoning and agentic skills. • 3 items • Updated 4 days ago • 54
Bielik-11B-v3.0 Collection A collection of models based on Bielik-11B-v3.0 - instruct and quantized versions. • 5 items • Updated 2 days ago • 8
view changelog Hugging Face Changelog Introducing Buckets: S3-like storage on the Hub 10 days ago • 177
Heterogeneous Agent Collaborative Reinforcement Learning Paper • 2603.02604 • Published 18 days ago • 186
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders Paper • 2603.06569 • Published 14 days ago • 113
Helios Collection Helios: 14B Real-Time Long Video Generation Model can be Cheaper, Faster but Keep Stronger than 1.3B ones • 7 items • Updated 5 days ago • 22
DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference Paper • 2602.21548 • Published 24 days ago • 46
Discovering Multiagent Learning Algorithms with Large Language Models Paper • 2602.16928 • Published 30 days ago • 16