Post 39 ICYMI, great blog by @kashif and @stas on Ulysses Sequence Parallelism: train with million-token contexts; on 4×H100s: 12x longer sequences, 3.7x throughput; learn how to integrate it with Accelerate, Transformers, and TRL ⤵️ https://huggingface.co/blog/ulysses-sp
Bringing Autonomous Driving RL to OpenEnv and TRL resources
Blog: https://huggingface.co/blog/sergiopaniego/bringing-carla-to-openenv-trl/
RL CARLA Environment Server 🚗 (Runtime error): Control a Carla driving simulation with custom actions
RL CARLA Environment Server 🚗 (Runtime error): Control a CARLA car simulation via custom actions
Carla Grpo Trolley 🚀 (Sleeping): Visualize your program’s I/O activity in real time
sergiopaniego/Qwen3-0.6B-carla-trolley-escape • 0.8B • Updated 16 days ago • 144
📝 Research & Long-Form Blog Posts
In-depth technical articles and research pieces published by Hugging Face
The Ultra-Scale Playbook 🌌 (Running, 3.74k): The ultimate guide to training LLM on large GPU Clusters
The Smol Training Playbook 📚 (Running on CPU Upgrade, Featured, 3.04k): The secrets to building world-class LLMs
Evaluation Guidebook 📝 (Running, 285): Explore LLM benchmark trends over time
FineVision: Open Data is All You Need 📝 (Running, 219): A new open-source dataset for training VLMs
pinned Running on Zero Featured 114 VLM Object Understanding 🦀 Explore object detection, visual grounding, keypoint Detecti
sergiopaniego/browsergym-grpo-functiongemma-270m-it-dataset Viewer • Updated about 1 hour ago • 105 • 12.7k