Repo for paper Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability.
Qihan Ren
jasonrqh
AI & ML interests
XAI, LLM reasoning & safety, Coding agent
Recent Activity
upvoted a paper about 22 hours ago
FreeStyle: Free Control of Style-Content Dual-Reference Generation from Community LoRA Mining liked a model 8 days ago
MiniMaxAI/MiniMax-M3