view article Article Unlocking Longer Generation with Key-Value Cache Quantization RaushanTurganbay • May 16, 2024 • 57
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer Paper • 2605.15178 • Published 12 days ago • 81
Aligning Latent Geometry for Spherical Flow Matching in Image Generation Paper • 2605.15193 • Published 12 days ago • 7
nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16 Any-to-Any • 33B • Updated 17 days ago • 432k • 312
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors Paper • 2605.00658 • Published 25 days ago • 84
DeVI: Physics-based Dexterous Human-Object Interaction via Synthetic Video Imitation Paper • 2604.20841 • Published Apr 22 • 24