InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision Paper • 2512.01342 • Published 7 days ago • 14
UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions Paper • 2511.03334 • Published Nov 5 • 51
OpenGVLab/VideoChat-Flash-Qwen2_5-7B_InternVideo2-1B Video-Text-to-Text • 9B • Updated May 16 • 825 • 5