Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
shoaibmohd 's Collections
Datasets
Memory
NBA/Recommenders
Voice models
Tab models
Computer Use Agent
Learning from examples - training/inference
OCR
Data Analysis Papers

OCR

updated 10 days ago
Upvote
-

  • MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

    Paper • 2509.22186 • Published Sep 26 • 136

  • CommonForms: A Large, Diverse Dataset for Form Field Detection

    Paper • 2509.16506 • Published Sep 20 • 19

  • Automated Structured Radiology Report Generation with Rich Clinical Context

    Paper • 2510.00428 • Published Oct 1 • 7

  • Extract-0: A Specialized Language Model for Document Information Extraction

    Paper • 2509.22906 • Published Sep 26

  • PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

    Paper • 2510.14528 • Published Oct 16 • 105

  • RL makes MLLMs see better than SFT

    Paper • 2510.16333 • Published Oct 18 • 48

  • NVIDIA Nemotron Parse 1.1

    Paper • 2511.20478 • Published 14 days ago • 20

  • OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

    Paper • 2511.16334 • Published 19 days ago • 91
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs