Paper: Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling (arXiv:2304.01373)
This repository contains versions of Pythia-160M [Biderman et al., 2023] fine-tuned on the AG News dataset.
These checkpoints are provided for teaching purposes only and should not be used outside this scope. They illustrate concepts in natural language processing and machine learning and are not intended for production or deployment.
© 2025–present Laboratoire d'Informatique de l'École Polytechnique