OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM • Paper • 2510.15870 • Published Oct 17 • 89
How Do Large Language Models Acquire Factual Knowledge During Pretraining? • Paper • 2406.11813 • Published Jun 17, 2024 • 31
Minerva LLMs • Collection • The first family of LLMs pretrained from scratch on Italian. • 6 items • Updated Dec 7, 2024 • 38
Chronos Models & Datasets • Collection • Collection of artifacts related to Chronos pretrained models for time series forecasting. • 16 items • Updated 26 days ago • 52
Learning to Generate Instruction Tuning Datasets for Zero-Shot Task Adaptation • Paper • 2402.18334 • Published Feb 28, 2024 • 12
OLMo Suite • Collection • Artifacts for the first set of OLMo models. • 18 items • Updated 6 days ago • 74