MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19, 2025 • 49
The Fine-Tuning Paradox: Boosting Translation Quality Without Sacrificing LLM Abilities Paper • 2405.20089 • Published May 30, 2024 • 1
Can LLMs Really Learn to Translate a Low-Resource Language from One Grammar Book? Paper • 2409.19151 • Published Sep 27, 2024
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published Oct 28, 2025 • 24
How Far Can 100 Samples Go? Unlocking Overall Zero-Shot Multilingual Translation via Tiny Multi-Parallel Data Paper • 2401.12413 • Published Jan 22, 2024 • 1
On Subquadratic Architectures: From Applications to Principles Paper • 2606.12364 • Published 7 days ago • 23
Data Contamination Report from the 2024 CONDA Shared Task Paper • 2407.21530 • Published Jul 31, 2024 • 10
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks Paper • 2204.07705 • Published Apr 16, 2022 • 2
Towards a general purpose machine translation system for Sranantongo Paper • 2212.06383 • Published Dec 13, 2022