Agent Reward Bench Leaderboard
🥇
3
Leaderboard for AgentRewardBench
computational linguistics, natural language processing
Structured Distillation of Web Agent Capabilities Enables Generalization
LLM2Vec-Gen: Generative Embeddings from Large Language Models