Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs? Paper • 2603.24472 • Published 17 days ago • 52 • 7
Jina-VLM: Small Multilingual Vision Language Model Paper • 2512.04032 • Published Dec 3, 2025 • 15 • 4
MemMamba: Rethinking Memory Patterns in State Space Model Paper • 2510.03279 • Published Sep 28, 2025 • 74 • 3