Is this Idea Novel? An Automated Benchmark for Judgment of Research Ideas
Paper • 2603.10303 • Published
FINER: MLLMs Hallucinate under Fine-grained Negative Queries
sebis at ArchEHR-QA 2026: How Much Can You Do Locally? Evaluating Grounded EHR QA on a Single Notebook