Feng
VandeeeFeng
AI & ML interests
None yet
Organizations
None yet
papers
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
  Paper • 2402.03300 • Published • 145
- SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
  Paper • 2501.17161 • Published • 125
- The Ultra-Scale Playbook
  Running • 🌌 3.83k • The ultimate guide to training LLMs on large GPU clusters
- The Ultimate Guide to LLM Training | The Ultra-Scale Playbook
  Running • 🔥 265 • Learn about every aspect of LLM training
models
datasets 0
None public yet