Yuanhe Zhang

liminho123

https://warwick.ac.uk/fac/sci/statistics/staff/research_students/yuanhezhang

yuanhe-zhang

AI & ML interests

Theoretical foundation and algorithmic development of fine-tuning for LLM

Recent Activity

liked a dataset 23 days ago

nvidia/Nemotron-Math-v2

liked a dataset about 2 months ago

tasksource/leandojo

liked a dataset about 2 months ago

JohnYang88/lean-dojo-mathlib4

View all activity

Organizations

None yet

liked a dataset 23 days ago

nvidia/Nemotron-Math-v2

Preview • Updated 11 days ago • 6.52k • 130

liked 2 datasets about 2 months ago

tasksource/leandojo

Viewer • Updated Jun 28, 2023 • 91.8k • 125 • 8

JohnYang88/lean-dojo-mathlib4

Viewer • Updated Dec 11, 2023 • 103k • 48 • 1

liked 2 datasets 2 months ago

interstellarninja/hermes_reasoning_tool_use

Viewer • Updated 23 days ago • 51k • 664 • 151

neulab/agent-data-collection

Preview • Updated 10 days ago • 1.32k • 106

upvoted an article 3 months ago

Article

Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models

Jul 10, 2025

•

liked a model 3 months ago

Goedel-LM/Goedel-Formalizer-V2-32B

33B • Updated Jul 22, 2025 • 226 • 8

liked 2 datasets 3 months ago

florath/coq-facts-props-proofs-gen0-v1

Viewer • Updated Mar 20, 2024 • 270k • 67 • 8

internlm/Lean-Workbook

Viewer • Updated Oct 9, 2024 • 25.2k • 671 • 48

authored a paper 3 months ago

DAG-Math: Graph-Guided Mathematical Reasoning in LLMs

Paper • 2510.19842 • Published Oct 19, 2025

updated a dataset 3 months ago

liminho123/DAG-MATH-Formatted-CoT

Viewer • Updated Oct 19, 2025 • 2.89k • 19

published a dataset 3 months ago

liminho123/DAG-MATH-Formatted-CoT

Viewer • Updated Oct 19, 2025 • 2.89k • 19

liked a Space 5 months ago

Scaling test-time compute

📈

589

Implement test-time compute scaling for math problems

liked 3 datasets 6 months ago

upvoted a collection 8 months ago

DeepSeek-R1

Collection

10 items • Updated Nov 27, 2025 • 829

authored a paper 8 months ago

LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently

Paper • 2502.01235 • Published Feb 3, 2025

liked a Space 9 months ago

FLUX.1 Studio Ghibli LoRA

🖼

Generate Studio Ghibli-style images from text prompts

liked a dataset 9 months ago

Nechintosh/ghibli

Viewer • Updated Jan 4, 2025 • 810 • 482 • 10