Hugging Face
Models
7,834
Sort: Trending
Active filters: image-text-to-text
AIDC-AI/Ovis2-1B • Image-Text-to-Text • Updated 17 days ago • 6.29k • 75
AIDC-AI/Ovis2-4B • Image-Text-to-Text • Updated 17 days ago • 5.56k • 49
Qwen/Qwen2.5-VL-3B-Instruct-AWQ • Image-Text-to-Text • Updated 27 days ago • 24.9k • 24
Qwen/Qwen2.5-VL-72B-Instruct-AWQ • Image-Text-to-Text • Updated 9 days ago • 171k • 40
convergence-ai/proxy-lite-3b • Image-Text-to-Text • Updated 7 days ago • 4.57k • 124
unsloth/gemma-3-27b-it • Image-Text-to-Text • Updated 3 days ago • 1.16k • 4
microsoft/trocr-base-handwritten • Image-to-Text • Updated Feb 11 • 164k • 390
microsoft/trocr-large-handwritten • Image-to-Text • Updated May 27, 2024 • 68.7k • 110
microsoft/kosmos-2-patch14-224 • Image-to-Text • Updated Nov 28, 2023 • 156k • 162
deepseek-ai/deepseek-vl-7b-chat • Image-Text-to-Text • Updated Mar 15, 2024 • 56.1k • 247
openvla/openvla-7b • Image-Text-to-Text • Updated Sep 16, 2024 • 109k • 101
Qwen/Qwen2-VL-7B • Image-Text-to-Text • Updated Jan 12 • 68.6k • 48
mistralai/Pixtral-12B-2409 • Image-Text-to-Text • Updated Dec 26, 2024 • 622
HuggingFaceTB/SmolVLM-Instruct • Image-Text-to-Text • Updated 10 days ago • 62.9k • 408
OpenGVLab/InternVL2_5-8B • Image-Text-to-Text • Updated Feb 5 • 72.7k • 84
stepfun-ai/GOT-OCR-2.0-hf • Image-Text-to-Text • Updated Jan 31 • 188k • 173
Bllossom/llama-3.2-Korean-Bllossom-AICA-5B • Image-Text-to-Text • Updated 2 days ago • 439k • 68
prithivMLmods/Qwen2-VL-OCR-2B-Instruct • Image-Text-to-Text • Updated Jan 11 • 45.5k • 55
HuggingFaceTB/SmolVLM-500M-Instruct • Image-Text-to-Text • Updated 10 days ago • 19.2k • 114
bytedance-research/UI-TARS-72B-DPO • Image-Text-to-Text • Updated Jan 25 • 25.6k • 98
ibm-granite/granite-vision-3.1-2b-preview • Image-Text-to-Text • Updated 17 days ago • 13.2k • 93
zhibinlan/LLaVE-0.5B • Image-Text-to-Text • Updated 2 days ago • 163 • 3
zhibinlan/LLaVE-7B • Image-Text-to-Text • Updated 2 days ago • 120 • 3
AIDC-AI/Ovis2-8B • Image-Text-to-Text • Updated 17 days ago • 7.21k • 54
HuggingFaceTB/SmolVLM2-256M-Video-Instruct • Image-Text-to-Text • Updated 10 days ago • 5.45k • 41
huihui-ai/Qwen2.5-VL-3B-Instruct-abliterated • Image-Text-to-Text • Updated 6 days ago • 1.32k • 8
huihui-ai/Qwen2.5-VL-7B-Instruct-abliterated • Image-Text-to-Text • Updated 6 days ago • 22.7k • 7
Gen-Verse/HermesFlow • Image-Text-to-Text • Updated 22 days ago • 103 • 3
unsloth/gemma-3-27b-pt • Image-Text-to-Text • Updated 3 days ago • 155 • 3
unsloth/gemma-3-12b-pt • Image-Text-to-Text • Updated 3 days ago • 633 • 3