EvalVerse: Pipeline-Aware and Expert-Calibrated Benchmarking for Professional Cinematic Video Generation Paper • 2605.23271 • Published 13 days ago • 79
PLANA3R: Zero-shot Metric Planar 3D Reconstruction via Feed-Forward Planar Splatting Paper • 2510.18714 • Published Oct 21, 2025 • 2
Seeing through Satellite Images at Street Views Paper • 2505.17001 • Published May 22, 2025 • 1
Sat3DGen: Comprehensive Street-Level 3D Scene Generation from Single Satellite Image Paper • 2605.14984 • Published 21 days ago • 5
Efficient Image Synthesis with Sphere Latent Encoder Paper • 2605.15592 • Published 20 days ago • 8
FFAvatar: Few-Shot, Feed-Forward, and Generalizable Avatar Reconstruction Paper • 2605.15320 • Published 21 days ago • 7
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors Paper • 2605.00658 • Published May 1 • 84
BARRED: Synthetic Training of Custom Policy Guardrails via Asymmetric Debate Paper • 2604.25203 • Published Apr 28 • 8
TRACER: Trace-Based Adaptive Cost-Efficient Routing for LLM Classification Paper • 2604.14531 • Published Apr 16 • 7
On Semiotic-Grounded Interpretive Evaluation of Generative Art Paper • 2604.08641 • Published Apr 9 • 4
Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos Paper • 2603.25645 • Published Mar 26 • 4
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models Paper • 2311.16933 • Published Nov 28, 2023 • 1
Automated Conversion of Music Videos into Lyric Videos Paper • 2308.14922 • Published Aug 28, 2023
Light-A-Video: Training-free Video Relighting via Progressive Light Fusion Paper • 2502.08590 • Published Feb 12, 2025 • 43
Light of Normals: Unified Feature Representation for Universal Photometric Stereo Paper • 2506.18882 • Published Jun 23, 2025 • 89
Mindalogue: LLM-Powered Nonlinear Interaction for Effective Learning and Task Exploration Paper • 2410.10570 • Published Oct 14, 2024
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers Paper • 2305.17455 • Published May 27, 2023