DynaFLIP: Rethinking Robotics Perception via Tri-Modal-Dynamics Guided Representation Paper • 2605.30350 • Published 10 days ago • 13
Contrastive Distribution Matching for Amortized Sequential Monte Carlo in Discrete Diffusion Paper • 2605.23346 • Published 16 days ago
optimize_anything: A Universal API for Optimizing any Text Parameter Paper • 2605.19633 • Published 19 days ago • 6
MultiGen: Level-Design for Editable Multiplayer Worlds in Diffusion Game Engines Paper • 2603.06679 • Published Mar 30 • 6
AVO: Agentic Variation Operators for Autonomous Evolutionary Search Paper • 2603.24517 • Published Mar 25 • 11
V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising Paper • 2603.16792 • Published Mar 17 • 3
Any to Full: Prompting Depth Anything for Depth Completion in One Stage Paper • 2603.05711 • Published Mar 5 • 2
SE-Bench: Benchmarking Self-Evolution with Knowledge Internalization Paper • 2602.04811 • Published Feb 4 • 2
UniAudio 2.0: A Unified Audio Language Model with Text-Aligned Factorized Audio Tokenization Paper • 2602.04683 • Published Feb 4 • 3
HeartMuLa: A Family of Open Sourced Music Foundation Models Paper • 2601.10547 • Published Jan 15 • 49
HeartMuLa: A Family of Open Sourced Music Foundation Models Paper • 2601.10547 • Published Jan 15 • 49
UM-Text: A Unified Multimodal Model for Image Understanding Paper • 2601.08321 • Published Jan 13 • 21
ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation Paper • 2601.03955 • Published Jan 7 • 3
FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation Paper • 2512.24724 • Published Dec 31, 2025 • 9