Yuxuan Wang
ColorfulAI
AI & ML interests
Multimodal Learning
Recent Activity
authored a paper 11 days ago
Qwen3-Omni Technical Report
authored a paper 11 days ago
OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs
authored a paper 11 days ago
V-HUB: A Visual-Centric Humor Understanding Benchmark for Video LLMs