Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering Paper • 2604.08224 • Published 3 days ago • 33
Can LLMs Learn to Reason Robustly under Noisy Supervision? Paper • 2604.03993 • Published 7 days ago • 38
A Survey of On-Policy Distillation for Large Language Models Paper • 2604.00626 • Published 10 days ago • 9
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published 22 days ago • 330
EpochX: Building the Infrastructure for an Emergent Agent Civilization Paper • 2603.27304 • Published 14 days ago • 46
Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills Paper • 2603.25158 • Published 16 days ago • 49
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild Paper • 2603.17187 • Published 25 days ago • 136
EvoClaw: Evaluating AI Agents on Continuous Software Evolution Paper • 2603.13428 • Published 29 days ago • 21
IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse Paper • 2603.12201 • Published 30 days ago • 53
Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making Paper • 2602.06570 • Published Feb 6 • 61
MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments Paper • 2602.06075 • Published Feb 3 • 13
HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding Paper • 2601.14724 • Published Jan 21 • 75