view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 23 days ago • 93
orbit-ai/orbit-4b-ablation-training-mix-124-v0.1 Text Generation • 4B • Updated about 15 hours ago • 131
ORBIT-v0.1 Collection A collection of ORBIT training datasets and search agents • 3 items • Updated about 17 hours ago
🤏 Smol-Data Collection Tried and tested mixes for strong pretraining. Inspired by https://huggingface.co/blog/codelion/optimal-dataset-mixing • 14 items • Updated about 1 month ago • 12