AI & ML interests
None defined yet.
Recent Activity
Papers
GutenOCR: A Grounded Vision-Language Front-End for Documents
PubMed-OCR: PMC Open Access OCR Annotations
Organization Card
Data and models for optical character recognition
-
PubMed-OCR: PMC Open Access OCR Annotations
Paper • 2601.11425 • Published • 11 -
GutenOCR: A Grounded Vision-Language Front-End for Documents
Paper • 2601.14490 • Published • 36 -
rootsautomation/TABMEpp
Viewer • Updated • 122k • 62 • 5 -
rootsautomation/pubmed-ocr
Viewer • Updated • 1.55M • 3.63k • 68
Data and models for optical character recognition
-
PubMed-OCR: PMC Open Access OCR Annotations
Paper • 2601.11425 • Published • 11 -
GutenOCR: A Grounded Vision-Language Front-End for Documents
Paper • 2601.14490 • Published • 36 -
rootsautomation/TABMEpp
Viewer • Updated • 122k • 62 • 5 -
rootsautomation/pubmed-ocr
Viewer • Updated • 1.55M • 3.63k • 68
datasets
13
rootsautomation/pubmed-ocr
Viewer
•
Updated
•
1.55M
•
3.63k
•
68
rootsautomation/TABMEpp
Viewer
•
Updated
•
122k
•
62
•
5
rootsautomation/websrc-test
Viewer
•
Updated
•
40.4k
•
6
rootsautomation/websrc
Viewer
•
Updated
•
360k
•
804
•
7
rootsautomation/RICO-ScreenAnnotation
Viewer
•
Updated
•
22.1k
•
40
•
12
rootsautomation/RICO-ScreenAnnotation-f
Viewer
•
Updated
•
22.1k
•
26
•
7
rootsautomation/RICO-ScreenQA-Complex
Viewer
•
Updated
•
11.8k
•
166
•
16
rootsautomation/RICO-ScreenQA-Short
Viewer
•
Updated
•
86k
•
224
•
4
rootsautomation/RICO-ScreenQA
Viewer
•
Updated
•
86k
•
111
•
11
rootsautomation/RICO-Screen2Words
Viewer
•
Updated
•
22.4k
•
105
•
9