Agent Bazaar: Enabling Economic Alignment in Multi-Agent Marketplaces Paper • 2605.17698 • Published 7 days ago • 6
Continual Harness: Online Adaptation for Self-Improving Foundation Agents Paper • 2605.09998 • Published 13 days ago • 17
Odysseus: Scaling VLMs to 100+ Turn Decision-Making in Games via Reinforcement Learning Paper • 2605.00347 • Published 23 days ago • 16
The PokeAgent Challenge: Competitive and Long-Context Learning at Scale Paper • 2603.15563 • Published Mar 16 • 11
Ego4D: Around the World in 3,000 Hours of Egocentric Video Paper • 2110.07058 • Published Oct 13, 2021 • 1
ICONS: Influence Consensus for Vision-Language Data Selection Paper • 2501.00654 • Published Dec 31, 2024
SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains? Paper • 2410.03859 • Published Oct 4, 2024 • 1