Alignment Pretraining (Geodesic, 2025): Data & Models - a geodesic-research Collection

geodesic-research 's Collections

Alignment Pretraining (Geodesic, 2025): Data & Models

Self-Fulfilling (Mis)alignment: Datasets

Self-Fulfilling (Mis)alignment: Emergent Misalignment

Self-Fulfilling (Mis)alignment: Midtraining Ablations

Self-Fulfilling (Mis)alignment: Base Models

Self-Fulfilling (Mis)alignment: Post-Trained Models

Alignment Pretraining (Geodesic, 2025): Data & Models

updated 17 days ago

https://alignmentpretraining.ai — Read our paper for additional details about our data and models

Self-Fulfilling (Mis)alignment: Datasets

Collection

9 items • Updated Dec 20, 2025

Note Our misalignment evaluations, synthetic pretraining data, and regular training mixes.
Self-Fulfilling (Mis)alignment: Post-Trained Models

Collection

Here is a selection of models that have undergone DPO. We also share the earlier instruction checkpoints. We recommend using the DPO models. • 22 items • Updated 17 days ago • 1

Note Our models are suitable for chat and limited multi-turn usage. We share intermediate checkpoints for our SFT models.
Self-Fulfilling (Mis)alignment: Base Models

Collection

Here we are, our base model checkpoints. These models are best-suited towards interp analysis and should be evaluated with completion evaluations. • 14 items • Updated 17 days ago

Note Our base models are best used for research into training dynamics and as a starting point for further post-training. We share intermediate checkpoints.
Self-Fulfilling (Mis)alignment: Emergent Misalignment

Collection

LoRA adapters for studying emergent misalignment on the SFM models • 27 items • Updated 17 days ago • 1