yeyang's picture

2 14 3

yeyang

sysuyy

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback

upvoted a paper about 2 months ago

DocReward: A Document Reward Model for Structuring and Stylizing

upvoted a paper 3 months ago

Do You Need Proprioceptive States in Visuomotor Policies?

View all activity

Organizations

None yet

upvoted 2 papers about 2 months ago

Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback

Paper • 2510.16888 • Published Oct 19 • 21

DocReward: A Document Reward Model for Structuring and Stylizing

Paper • 2510.11391 • Published Oct 13 • 27

upvoted a paper 3 months ago

Do You Need Proprioceptive States in Visuomotor Policies?

Paper • 2509.18644 • Published Sep 23 • 49

upvoted a paper 5 months ago

Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling

Paper • 2507.07982 • Published Jul 10 • 33

upvoted 3 papers 6 months ago

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Paper • 2506.03147 • Published Jun 3 • 58

ImgEdit: A Unified Image Editing Dataset and Benchmark

Paper • 2505.20275 • Published May 26 • 18

OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation

Paper • 2505.20292 • Published May 26 • 52

upvoted 3 papers 8 months ago

GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation

Paper • 2504.02782 • Published Apr 3 • 57

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Paper • 2504.12626 • Published Apr 17 • 51

MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft

Paper • 2504.08388 • Published Apr 11 • 42

upvoted 2 collections about 1 year ago

MagicTime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators • 4 items • Updated Nov 29, 2024 • 13

ChronoMagic-Bench

ChronoMagic-Bench : A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation • 6 items • Updated Jul 26 • 10

upvoted a paper about 1 year ago

Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Paper • 2411.17440 • Published Nov 26, 2024 • 37

upvoted a collection about 1 year ago

ConsisID

Identity-Preserving Text-to-Video Generation by Frequency Decomposition • 4 items • Updated Dec 3, 2024 • 12