From Proof to Program: Characterizing Tool-Induced Reasoning Hallucinations in Large Language Models Paper • 2511.10899 • Published Nov 14, 2025 • 3 • 2
Less is More for Long Document Summary Evaluation by LLMs Paper • 2309.07382 • Published Sep 14, 2023 • 1