Article Mixture of Experts Explained +4 osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 1.13k
Article TRL v1.0: Post-Training Library Built to Move with the Field +2 qgallouedec, stevhliu, pcuenq, sergiopaniego • Mar 31 • 52
Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 895
OsakanaTeishoku/Qwen3-4B-Thinking-2507-reasoning-ja-20260329 Text Generation • 4B • Updated Mar 29 • 50
OsakanaTeishoku/Qwen3-4B-Thinking-2507-reasoning-ja-20260328 Text Generation • 4B • Updated Mar 28 • 23