SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper β’ 2503.11576 β’ Published Mar 14 β’ 117
HelpSteer2-Preference: Complementing Ratings with Preferences Paper β’ 2410.01257 β’ Published Oct 2, 2024 β’ 24
Llama-3.1-Nemotron-70B Collection SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. β’ 6 items β’ Updated 4 days ago β’ 155
NaturalFunctions Collection LLMs fine tuned for function calling π€ β’ 2 items β’ Updated Jan 28, 2024 β’ 3
πͺ SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos β’ 12 items β’ Updated May 5 β’ 239
view article Article π§ββοΈ "Replacing Judges with Juries" using distilabel May 3, 2024 β’ 17
view article Article CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models +14 May 24, 2024 β’ 23
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts Paper β’ 2401.04081 β’ Published Jan 8, 2024 β’ 73
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs Paper β’ 2307.16789 β’ Published Jul 31, 2023 β’ 101