Monthly-SWEBench

UnipatAI 's Collections

updated 3 days ago

A continuously updated benchmark evaluating AI coding agents on real-world software engineering tasks from GitHub issues.