Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models Article • Published Jul 10, 2025
DAG-Math: Graph-Guided Mathematical Reasoning in LLMs Paper • 2510.19842 • Published Oct 19, 2025
LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently Paper • 2502.01235 • Published Feb 3, 2025