OCR - a shoaibmohd Collection

shoaibmohd 's Collections

Memory

NBA/Recommenders

Computer Use Agent

Learning from examples - training/inference

OCR

Data Analysis Papers

OCR

updated 10 days ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26 • 136
CommonForms: A Large, Diverse Dataset for Form Field Detection

Paper • 2509.16506 • Published Sep 20 • 19
Automated Structured Radiology Report Generation with Rich Clinical Context

Paper • 2510.00428 • Published Oct 1 • 7
Extract-0: A Specialized Language Model for Document Information Extraction

Paper • 2509.22906 • Published Sep 26
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Paper • 2510.14528 • Published Oct 16 • 105
RL makes MLLMs see better than SFT

Paper • 2510.16333 • Published Oct 18 • 48
NVIDIA Nemotron Parse 1.1

Paper • 2511.20478 • Published 14 days ago • 20
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published 19 days ago • 91