ReportBench: Evaluating Deep Research Agents via Academic Survey Tasks Paper • 2508.15804 • Published Aug 14 • 15 • 3