Nathan Lambert's picture

Nathan Lambert

natolambert

·

https://www.natolambert.com/

AI & ML interests

Reinforcement learning, Ethics, Robotics, Dynamics Models

Recent Activity

updated a collection about 15 hours ago

liked a model about 15 hours ago

LLM360/K2-V2

liked a model 5 days ago

arcee-ai/Trinity-Mini

View all activity

Organizations

upvoted a collection 16 days ago

Olmo 3 Post-training

All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated 5 days ago • 37

upvoted a collection about 2 months ago

Olmo 3

Artifacts for the Olmo 3 release. • 9 items • Updated 5 days ago • 140

upvoted an article 4 months ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

+3

Jul 29

•

202

upvoted a paper 5 months ago

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2 • 56

upvoted a collection 5 months ago

Reward Models 06-2025

Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge • 8 items • Updated 2 days ago • 22

upvoted 2 collections 6 months ago

Reward Bench 2

Datasets, spaces, and models for Reward Bench 2 benchmark and paper! • 11 items • Updated 7 days ago • 16

Common Pile v0.1

All resources related to Common Pile v0.1, an 8TB dataset of public domain and openly licensed text • 4 items • Updated Jun 6 • 37

upvoted 2 collections 7 months ago

OpenVision

27 items • Updated Aug 15 • 32

Qwen3

84 items • Updated Aug 6 • 1.47k

upvoted a paper 8 months ago

Reinforcement Learning from Human Feedback

Paper • 2504.12501 • Published Apr 16 • 4

upvoted a collection 10 months ago

OLMoE (January 2025)

Improved OLMoE for iOS app. Read more: https://allenai.org/blog/olmoe-app • 10 items • Updated 7 days ago • 16

upvoted an article 11 months ago

Article

Putting RL back in RLHF

Jun 12, 2024

•

109

upvoted a collection 11 months ago

2024 Interconnects Artifacts

Models & datasets mentioned in the bottom section of posts! • 280 items • Updated Jan 2 • 6

upvoted a paper 12 months ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 147

upvoted a collection 12 months ago

PixMo

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated 7 days ago • 81

upvoted 5 collections about 1 year ago

OLMo 2

Artifacts for the OLMo 2 release. • 35 items • Updated 7 days ago • 149

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated 7 days ago • 103

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 7 days ago • 96

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 7 days ago • 308

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 666