Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
1
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
Reset Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
sentence-transformers
Safetensors
ONNX
GGUF
Transformers.js
MLX
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 12
Inference Providers
Groq
Novita
Nebius AI
Cerebras
SambaNova
Nscale
fal
Hyperbolic
+ 11
Apply filters
Models
9,021
Full-text search
Inference Available
Edit filters
Sort: Trending
Active filters:
image-to-text, transformers
Clear all
stepfun-ai/GELab-Zero-4B-preview
Image-to-Text
•
4B
•
Updated
6 days ago
•
673
•
90
datalab-to/chandra
Image-to-Text
•
9B
•
Updated
Oct 21
•
89.3k
•
406
lightonai/LightOnOCR-1B-1025
Image-to-Text
•
Updated
13 days ago
•
15.2k
•
179
Salesforce/blip-image-captioning-base
Image-to-Text
•
Updated
Feb 3
•
2.38M
•
821
XiaomiMiMo/MiMo-Embodied-7B
Image-to-Text
•
8B
•
Updated
16 days ago
•
982
•
47
allenai/olmOCR-2-7B-1025-FP8
Image-to-Text
•
8B
•
Updated
Oct 22
•
536k
•
153
thesby/Qwen3-VL-8B-NSFW-Caption-V4.5
Image-to-Text
•
9B
•
Updated
30 days ago
•
15.5k
•
41
VLM2Vec/VLM2Vec-V2.0
Image-to-Text
•
Updated
Jul 13
•
10.1k
•
19
allenai/olmOCR-2-7B-1025
Image-to-Text
•
8B
•
Updated
Oct 22
•
31.8k
•
88
shkb/MemeLeak
Image-to-Text
•
9B
•
Updated
4 days ago
•
100
•
2
team-lucid/trocr-small-korean
Image-to-Text
•
54.5M
•
Updated
Jul 1, 2023
•
563
•
18
SawanStack/gpt2-image-captioning-onnx
Image-to-Text
•
Updated
Nov 13, 2023
•
8
•
1
OleehyO/TexTeller
Image-to-Text
•
0.3B
•
Updated
Jun 22, 2024
•
7.3k
•
38
breezedeus/pix2text-mfr
Image-to-Text
•
Updated
May 5, 2024
•
164k
•
47
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-to-Text
•
6B
•
Updated
Dec 10, 2024
•
512k
•
80
unsloth/Llama-3.2-11B-Vision-Instruct
Image-to-Text
•
11B
•
Updated
Dec 10, 2024
•
22.1k
•
86
Vikhrmodels/Vikhr-2-VL-2b-Instruct-experimental
Image-to-Text
•
2B
•
Updated
Nov 3, 2024
•
30
•
20
HuggingFaceTB/SmolVLM-256M-Base
Image-to-Text
•
0.3B
•
Updated
Jan 20
•
7.8k
•
18
enalis/scold
Image-to-Text
•
Updated
Oct 29
•
49
•
7
sbintuitions/sarashina2-vision-8b
Image-to-Text
•
8B
•
Updated
Mar 27
•
7.9k
•
10
infly/INF-AZ-7B-0524
Image-to-Text
•
8B
•
Updated
May 25
•
31
•
3
helizac/dots.ocr-4bit
Image-to-Text
•
2B
•
Updated
Aug 6
•
515
•
28
allenai/olmOCR-7B-0825
Image-to-Text
•
8B
•
Updated
Oct 22
•
1.1k
•
60
mradermacher/dunhuang-qwen2.5-vl-7b-GGUF
Image-to-Text
•
8B
•
Updated
Sep 28
•
187
•
1
sbintuitions/sarashina2.2-vision-3b
Image-to-Text
•
4B
•
Updated
18 days ago
•
2.33k
•
13
Float16-cloud/typhoon-ocr1.5-2b-int8
Image-to-Text
•
Updated
14 days ago
•
36
•
2
suv11235/olmOCR-7B-grpo-v3
Image-to-Text
•
8B
•
Updated
7 days ago
•
17
•
1
prithivMLmods/LightOnOCR-1B-1025-AIO-GGUF
Image-to-Text
•
0.8B
•
Updated
about 1 hour ago
•
1
thesby/Qwen3-VL-8B-NSFW-Caption-V4
Image-to-Text
•
9B
•
Updated
Oct 23
•
957
•
20
thesby/Qwen2.5-VL-7B-NSFW-Caption-V3
Image-to-Text
•
8B
•
Updated
Jun 17
•
402
•
85
Previous
1
2
3
...
100
Next