Reg-DPO: SFT-Regularized Direct Preference Optimization with GT-Pair for Improving Video Generation Paper • 2511.01450 • Published Nov 3, 2025