Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 12
Inference Providers
Novita
Cerebras
Nebius AI
Featherless AI
Together AI
Fireworks
Groq
Hyperbolic
+ 6
Apply filters
Models
8,007
Full-text search
Edit filters
Sort: Trending
Active filters:
image-to-text
Clear all
allenai/olmOCR-7B-0725
Image-to-Text
•
8B
•
Updated
7 days ago
•
982
•
28
ChatDOC/OCRFlux-3B
Image-to-Text
•
4B
•
Updated
22 days ago
•
19.3k
•
319
Salesforce/blip-image-captioning-base
Image-to-Text
•
Updated
Feb 3
•
1.92M
•
761
reducto/RolmOCR
Image-to-Text
•
8B
•
Updated
Apr 2
•
188k
•
457
microsoft/trocr-base-handwritten
Image-to-Text
•
0.3B
•
Updated
Feb 11
•
473k
•
420
microsoft/trocr-base-printed
Image-to-Text
•
0.3B
•
Updated
May 27, 2024
•
153k
•
182
naver-clova-ix/donut-base
Image-to-Text
•
Updated
Aug 13, 2022
•
119k
•
225
Salesforce/blip-image-captioning-large
Image-to-Text
•
0.5B
•
Updated
Feb 3
•
1.67M
•
1.38k
Ertugrul/Qwen2-VL-7B-Captioner-Relaxed
Image-to-Text
•
8B
•
Updated
Sep 26, 2024
•
1.28k
•
56
allenai/olmOCR-7B-0225-preview
Image-to-Text
•
8B
•
Updated
Feb 25
•
178k
•
687
ibm-granite/granite-vision-3.2-2b
Image-to-Text
•
3B
•
Updated
Jun 12
•
5.25k
•
100
BCCard/Qwen2.5-VL-32B-Instruct-FP8-Dynamic
Image-to-Text
•
33B
•
Updated
Jun 20
•
174k
•
4
smolagents/Qwen2.5-VL-3B-Instruct-Agentic
Image-to-Text
•
4B
•
Updated
6 days ago
•
73
•
4
allenai/olmOCR-7B-0725-FP8
Image-to-Text
•
8B
•
Updated
7 days ago
•
4.9k
•
5
microsoft/trocr-large-handwritten
Image-to-Text
•
Updated
May 27, 2024
•
58.8k
•
124
microsoft/trocr-small-handwritten
Image-to-Text
•
Updated
May 27, 2024
•
95.1k
•
54
microsoft/trocr-small-printed
Image-to-Text
•
0.1B
•
Updated
May 27, 2024
•
22k
•
42
nlpconnect/vit-gpt2-image-captioning
Image-to-Text
•
Updated
Feb 27, 2023
•
688k
•
903
xiaolv/ocr-captcha
Image-to-Text
•
Updated
Aug 22, 2023
•
40
qantev/trocr-small-spanish
Image-to-Text
•
Updated
Jul 18, 2024
•
81
•
8
hoang-quoc-trung/sumen-base
Image-to-Text
•
0.3B
•
Updated
Apr 17, 2024
•
54
•
5
cyberagent/llava-calm2-siglip
Image-to-Text
•
7B
•
Updated
Jun 12, 2024
•
1.04k
•
26
breezedeus/pix2text-mfd
Image-to-Text
•
Updated
Jul 10, 2024
•
23k
•
5
ahmed-masry/chartgemma
Image-to-Text
•
3B
•
Updated
Jul 27, 2024
•
1.8k
•
44
unsloth/Llama-3.2-11B-Vision-Instruct
Image-to-Text
•
11B
•
Updated
Dec 10, 2024
•
15.5k
•
82
prithivMLmods/Florence-2-VLM-Doc-VQA
Image-to-Text
•
0.3B
•
Updated
Oct 26, 2024
•
10
•
5
xiangjx/musk
Image-to-Text
•
Updated
Jan 19
•
33
Bllossom/llama-3.2-Korean-Bllossom-AICA-5B
Image-to-Text
•
5B
•
Updated
Mar 14
•
750k
•
92
HuggingFaceTB/SmolVLM-256M-Base
Image-to-Text
•
0.3B
•
Updated
Jan 20
•
4.24k
•
14
HuggingFaceTB/SmolVLM-500M-Base
Image-to-Text
•
0.5B
•
Updated
Jan 20
•
975
•
10
Previous
1
2
3
...
100
Next