Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs Paper • 2601.08763 • Published 8 days ago • 134
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning Paper • 2601.09667 • Published 7 days ago • 80
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published Nov 14, 2025 • 184
ConfTuner: Training Large Language Models to Express Their Confidence Verbally Paper • 2508.18847 • Published Aug 26, 2025 • 2
XtraGPT: LLMs for Human-AI Collaboration on Controllable Academic Paper Revision Paper • 2505.11336 • Published May 16, 2025 • 7
Efficient Inference for Large Reasoning Models: A Survey Paper • 2503.23077 • Published Mar 29, 2025 • 46
Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation Paper • 2503.19622 • Published Mar 25, 2025 • 31