Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment
Yuhao Dong PRO
THUdyh
Recent Activity
upvoted a paper about 14 hours ago
DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation
upvoted a paper 2 days ago
AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning
upvoted a paper 5 days ago
VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents