SuperBPE Collection SuperBPE tokenizers and models trained with them • 9 items • Updated 19 days ago • 17
TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments Paper • 2510.01179 • Published Oct 1 • 25
VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL Paper • 2505.23977 • Published May 29 • 10
TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning Paper • 2505.14625 • Published May 20 • 13
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens Paper • 2504.07096 • Published Apr 9 • 76
SuperBPE Collection SuperBPE tokenizers and models trained with them • 9 items • Updated 19 days ago • 17
Safeguarding Vision-Language Models: Mitigating Vulnerabilities to Gaussian Noise in Perturbation-based Attacks Paper • 2504.01308 • Published Apr 2 • 14
SuperBPE Collection SuperBPE tokenizers and models trained with them • 9 items • Updated 19 days ago • 17
SuperBPE Collection SuperBPE tokenizers and models trained with them • 9 items • Updated 19 days ago • 17