PRISM: Demystifying Retention and Interaction in Mid-Training Paper • 2603.17074 • Published 1 day ago
MosaicMem: Hybrid Spatial Memory for Controllable Video World Models Paper • 2603.17117 • Published 1 day ago • 60
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild Paper • 2603.17187 • Published 1 day ago • 74
LaDe: Unified Multi-Layered Graphic Media Generation and Decomposition Paper • 2603.17965 • Published about 19 hours ago • 3
Unified Spatio-Temporal Token Scoring for Efficient Video VLMs Paper • 2603.18004 • Published about 19 hours ago • 3
MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification Paper • 2603.15726 • Published 3 days ago • 159
Running on Zero Featured 180 LTX 2.3 Distilled 📚 180 Generate cinematic videos from text prompts and images
Running on Zero MCP Featured 329 FireRed Image Edit 1.0 Fast 🌖 329 FireRed-Image-Edit × Qwen-Image-Edit-Rapid (Transformers)
MolmoB0T: Large-Scale Simulation Enables Zero-Shot Manipulation Paper • 2603.16861 • Published 2 days ago • 2
WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation Paper • 2603.16871 • Published 2 days ago • 51
SegviGen: Repurposing 3D Generative Model for Part Segmentation Paper • 2603.16869 • Published 2 days ago • 16
OneWorld: Taming Scene Generation with 3D Unified Representation Autoencoder Paper • 2603.16099 • Published 2 days ago • 1
WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation Paper • 2603.16871 • Published 2 days ago • 51