arxiv:2512.03704
Yijun Liao
YijunLiao
·
AI & ML interests
LLM, RLHF & Test-Time Compute
Recent Activity
updated
a model
23 days ago
YijunLiao/DZ-TDPO-Phi-3.5-mini-instruct
updated
a model
23 days ago
YijunLiao/DZ-TDPO-Qwen2.5-7B-Instruct
new activity
about 1 month ago
YijunLiao/DZ-TDPO-Phi-3.5-mini-instruct:Improve model card: Add pipeline tag, library name, and GitHub link
Organizations
None yet