Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders Article • Jul 9 • 722
Announcing NeurIPS 2025 E2LM Competition: Early Training Evaluation of Language Models Article • Jul 4 • 10
nanoVLM: The simplest repository to train your VLM in pure PyTorch Article • May 21 • 234
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers Paper • 2504.20752 • Published Apr 29 • 92
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge Article • Feb 7 • 253
Topic 33: Slim Attention, KArAt, XAttention and Multi-Token Attention Explained – What’s Really Changing in Transformers? Article • Apr 4 • 15
Introducing the Synthetic Data Generator - Build Datasets with Natural Language Article • Dec 16, 2024 • 150
Introducing RWKV - An RNN with the advantages of a transformer Article • May 15, 2023 • 23
FFN Fusion: Rethinking Sequential Computation in Large Language Models Paper • 2503.18908 • Published Mar 24 • 19