Lance 🎬 Space • Running on Zero • Agents • Featured • 85 • Generate, edit, and understand images and videos with Lance!
Lance: Unified Multimodal Modeling by Multi-Task Synergy Paper • 2605.18678 • Published 15 days ago • 77
KVPO: ODE-Native GRPO for Autoregressive Video Alignment via KV Semantic Exploration Paper • 2605.14278 • Published 19 days ago • 37
Composing Concepts from Images and Videos via Concept-prompt Binding Paper • 2512.09824 • Published Dec 10, 2025 • 28
MIND-V: Hierarchical Video Generation for Long-Horizon Robotic Manipulation with RL-based Physical Alignment Paper • 2512.06628 • Published Dec 7, 2025 • 13
AnyTalker: Scaling Multi-Person Talking Video Generation with Interactivity Refinement Paper • 2511.23475 • Published Nov 28, 2025 • 43
Hyper-Bagel: A Unified Acceleration Framework for Multimodal Understanding and Generation Paper • 2509.18824 • Published Sep 23, 2025 • 23
pyannote/speaker-diarization-3.1 Automatic Speech Recognition • Updated May 10, 2024 • 9.59M • 2.1k
deepseek-ai/DeepSeek-Prover-V2-671B Text Generation • 685B • Updated Apr 30, 2025 • 718 • 829