Definition modeling Collection Models to generate contextualized word definitions • 22 items • Updated Nov 4 • 1
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated 6 days ago • 116
Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated 6 days ago • 70
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published 30 days ago • 128
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 17 items • Updated 21 days ago • 51
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization Paper • 2411.10442 • Published Nov 15, 2024 • 86
The Impact of Copyrighted Material on Large Language Models: A Norwegian Perspective Paper • 2412.09460 • Published Dec 12, 2024 • 9
Comparative analysis of optical character recognition methods for Sámi texts from the National Library of Norway Paper • 2501.07300 • Published Jan 13 • 1
NorEval: A Norwegian Language Understanding and Generation Evaluation Benchmark Paper • 2504.07749 • Published Apr 10 • 1