Garnet-OCR-3B-0422-GGUF

The Garnet-OCR-3B-0422 model is a fine-tuned and optimized evolution of Megalodon-OCR-Sync-0713, built on top of the Qwen2.5-VL-3B-Instruct architecture. This version is specifically designed for high-precision mathematical formula extraction, structured markdown generation, and accurate table reconstruction, making it highly effective for technical, scientific, and structured documents. Trained on an enhanced mixture of document-centric datasets, including large-scale OCR-caption pairs and structured document corpora, the model improves layout fidelity, symbolic reasoning, and content structuring across diverse document types such as research papers, scanned PDFs, handwritten equations, and analytical reports.

Model Files

| File Name | Quant Type | File Size | File Link |
|---|---|---|---|
| Garnet-OCR-3B-0422.BF16.gguf | BF16 | 6.8 GB | Download |
| Garnet-OCR-3B-0422.F16.gguf | F16 | 6.8 GB | Download |
| Garnet-OCR-3B-0422.F32.gguf | F32 | 13.6 GB | Download |
| Garnet-OCR-3B-0422.Q8_0.gguf | Q8_0 | 3.62 GB | Download |
| Garnet-OCR-3B-0422.mmproj-bf16.gguf | mmproj-bf16 | 1.34 GB | Download |
| Garnet-OCR-3B-0422.mmproj-f16.gguf | mmproj-f16 | 1.34 GB | Download |
| Garnet-OCR-3B-0422.mmproj-f32.gguf | mmproj-f32 | 2.67 GB | Download |
| Garnet-OCR-3B-0422.mmproj-q8_0.gguf | mmproj-q8_0 | 848 MB | Download |
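
The sizes of the main model files are consistent with simple bits-per-weight arithmetic. The sketch below is only a rough check under stated assumptions: the weight count of about 3.4 billion is inferred from the 6.8 GB 16-bit file rather than stated in this card, 1 GB is taken as 10^9 bytes, and file metadata and any tensors stored at a different precision are ignored. The 8.5 bits per weight for Q8_0 follows from its block layout (32 int8 weights plus one 16-bit scale per block). The name `estimate_size_gb` is a hypothetical helper, not part of any library.

```python
# Bits per weight for each quant type listed in the table.
# Q8_0 stores blocks of 32 int8 weights plus one 16-bit scale:
# (32 * 8 + 16) / 32 = 8.5 bits per weight.
BITS_PER_WEIGHT = {"F32": 32.0, "F16": 16.0, "BF16": 16.0, "Q8_0": 8.5}

# Assumption: weight count inferred from the 6.8 GB F16 file
# (6.8e9 bytes / 2 bytes per weight), not taken from the card.
ASSUMED_PARAMS = 3.4e9


def estimate_size_gb(quant: str, n_params: float = ASSUMED_PARAMS) -> float:
    """Estimate file size in GB (1e9 bytes), ignoring metadata overhead."""
    return n_params * BITS_PER_WEIGHT[quant] / 8 / 1e9


for quant in ("BF16", "F16", "F32", "Q8_0"):
    print(f"{quant}: ~{estimate_size_gb(quant):.2f} GB")
```

Under these assumptions the Q8_0 estimate comes out at roughly 3.61 GB, close to the 3.62 GB listed above. The mmproj files need not follow the same arithmetic as closely, since some projector tensors may be kept at higher precision.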

Quants Usage

(Sorted by size, not necessarily by quality. IQ quants are often preferable to similarly sized non-IQ quants.)

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

[Figure: graph by ikawrakow comparing quant types]

Format: GGUF
Model size: 3B params
Architecture: qwen2vl