Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 18 minutes ago • 48
Sparse Auto-Encoders (SAEs) for Mechanistic Interpretability Collection A compilation of sparse auto-encoders trained on large language models. • 37 items • Updated Dec 16, 2025 • 21
The Smol Training Playbook The secrets to building world-class LLMs • 2.89k
Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge Feb 7, 2025 • 271
Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 1.16k