-
Continuous Latent Diffusion Language Model
Paper • 2605.06548 • Published • 75 -
Scaling Latent Reasoning via Looped Language Models
Paper • 2510.25741 • Published • 229 -
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
Paper • 2502.05171 • Published • 155 -
Pretraining Language Models to Ponder in Continuous Space
Paper • 2505.20674 • Published • 3
Collections
Discover the best community collections!
Collections including paper arxiv:2604.18486
-
VOID: Video Object and Interaction Deletion
Paper • 2604.02296 • Published • 54 -
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation
Paper • 2604.18486 • Published • 93 -
WildDet3D: Scaling Promptable 3D Detection in the Wild
Paper • 2604.08626 • Published • 245 -
UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling
Paper • 2604.19734 • Published • 31
-
Visual Spatial Tuning
Paper • 2511.05491 • Published • 53 -
Adam's Law: Textual Frequency Law on Large Language Models
Paper • 2604.02176 • Published • 503 -
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation
Paper • 2604.10098 • Published • 81 -
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
Paper • 2604.13016 • Published • 94
-
Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel Reasoning
Paper • 2604.16029 • Published • 23 -
Qwen3.5-Omni Technical Report
Paper • 2604.15804 • Published • 58 -
REFRAG: Rethinking RAG based Decoding
Paper • 2509.01092 • Published • 9 -
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation
Paper • 2604.18486 • Published • 93
-
AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation
Paper • 2602.17100 • Published • 4 -
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant
Paper • 2603.01059 • Published • 1 -
Multi-Domain Riemannian Graph Gluing for Building Graph Foundation Models
Paper • 2603.00618 • Published -
Heterogeneous Agent Collaborative Reinforcement Learning
Paper • 2603.02604 • Published • 195
-
Continuous Latent Diffusion Language Model
Paper • 2605.06548 • Published • 75 -
Scaling Latent Reasoning via Looped Language Models
Paper • 2510.25741 • Published • 229 -
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
Paper • 2502.05171 • Published • 155 -
Pretraining Language Models to Ponder in Continuous Space
Paper • 2505.20674 • Published • 3
-
Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel Reasoning
Paper • 2604.16029 • Published • 23 -
Qwen3.5-Omni Technical Report
Paper • 2604.15804 • Published • 58 -
REFRAG: Rethinking RAG based Decoding
Paper • 2509.01092 • Published • 9 -
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation
Paper • 2604.18486 • Published • 93
-
VOID: Video Object and Interaction Deletion
Paper • 2604.02296 • Published • 54 -
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation
Paper • 2604.18486 • Published • 93 -
WildDet3D: Scaling Promptable 3D Detection in the Wild
Paper • 2604.08626 • Published • 245 -
UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling
Paper • 2604.19734 • Published • 31
-
AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation
Paper • 2602.17100 • Published • 4 -
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant
Paper • 2603.01059 • Published • 1 -
Multi-Domain Riemannian Graph Gluing for Building Graph Foundation Models
Paper • 2603.00618 • Published -
Heterogeneous Agent Collaborative Reinforcement Learning
Paper • 2603.02604 • Published • 195
-
Visual Spatial Tuning
Paper • 2511.05491 • Published • 53 -
Adam's Law: Textual Frequency Law on Large Language Models
Paper • 2604.02176 • Published • 503 -
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation
Paper • 2604.10098 • Published • 81 -
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
Paper • 2604.13016 • Published • 94