Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Nebius AI Studio
Cerebras
Nscale
Cohere
fal
Fireworks
Novita
Replicate
Hyperbolic
Together AI
SambaNova
HF Inference API
Misc
Reset Misc
Inference Endpoints
text-generation-inference
image-text-to-text
custom_code
4-bit precision
Merge
8-bit precision
Eval Results
Mixture of Experts
Carbon Emissions
Misc with no match
text-embeddings-inference
Apply filters
Models
10,987
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
HuggingFaceTB/SmolVLM-Instruct
Image-Text-to-Text
•
Updated
Apr 8
•
129k
•
476
google/shieldgemma-2-4b-it
Image-Text-to-Text
•
Updated
Apr 4
•
27.8k
•
105
xlangai/Jedi-3B-1080p
Image-Text-to-Text
•
Updated
14 days ago
•
913
•
9
scb10x/typhoon-ocr-7b
Image-Text-to-Text
•
Updated
14 days ago
•
10.8k
•
40
mlabonne/gemma-3-12b-it-abliterated-v2-GGUF
Image-Text-to-Text
•
Updated
6 days ago
•
3.41k
•
7
Qwen/Qwen2.5-VL-72B-Instruct
Image-Text-to-Text
•
Updated
Mar 23
•
194k
•
•
473
mlabonne/gemma-3-27b-it-abliterated-GGUF
Image-Text-to-Text
•
Updated
Apr 1
•
14.1k
•
91
Qwen/Qwen2.5-VL-32B-Instruct
Image-Text-to-Text
•
Updated
Apr 14
•
480k
•
•
380
Salesforce/blip-image-captioning-base
Image-to-Text
•
Updated
Feb 3
•
2.66M
•
724
Salesforce/blip-image-captioning-large
Image-to-Text
•
Updated
Feb 3
•
1.89M
•
1.35k
Salesforce/blip2-opt-2.7b
Image-Text-to-Text
•
Updated
Feb 3
•
895k
•
378
Qwen/Qwen2.5-VL-7B-Instruct-AWQ
Image-Text-to-Text
•
Updated
Apr 6
•
153k
•
71
ibm-granite/granite-vision-3.2-2b
Image-Text-to-Text
•
Updated
Apr 14
•
5.52k
•
95
microsoft/Magma-8B
Image-Text-to-Text
•
Updated
22 days ago
•
4.35k
•
397
Tesslate/Synthia-S1-27b
Image-Text-to-Text
•
Updated
Apr 9
•
489
•
•
75
meta-llama/Llama-4-Scout-17B-16E
Image-Text-to-Text
•
Updated
Apr 9
•
34.2k
•
173
meta-llama/Llama-Guard-4-12B
Image-Text-to-Text
•
Updated
Apr 29
•
64.4k
•
40
rp-yu/Dimple-7B
Image-Text-to-Text
•
Updated
9 days ago
•
624
•
6
ngxson/Devstral-Small-Vision-2505-GGUF
Image-Text-to-Text
•
Updated
14 days ago
•
18.3k
•
23
mlabonne/gemma-3-12b-it-qat-abliterated
Image-Text-to-Text
•
Updated
6 days ago
•
74
•
5
mlabonne/gemma-3-27b-it-qat-abliterated-GGUF
Image-Text-to-Text
•
Updated
5 days ago
•
1.2k
•
5
llava-hf/LLaVA-NeXT-Video-7B-hf
Video-Text-to-Text
•
Updated
Jan 27
•
152k
•
101
microsoft/Florence-2-large
Image-Text-to-Text
•
Updated
Dec 8, 2024
•
673k
•
1.56k
HuggingFaceTB/SmolVLM-500M-Instruct
Image-Text-to-Text
•
Updated
Apr 8
•
40k
•
152
HuggingFaceTB/SmolVLM2-2.2B-Instruct
Image-Text-to-Text
•
Updated
Apr 8
•
65.4k
•
196
google/gemma-3-4b-pt
Image-Text-to-Text
•
Updated
Mar 21
•
50.3k
•
83
unsloth/gemma-3-4b-it-GGUF
Image-Text-to-Text
•
Updated
23 days ago
•
54.2k
•
102
unsloth/gemma-3-12b-it-GGUF
Image-Text-to-Text
•
Updated
23 days ago
•
53.2k
•
77
unsloth/Llama-4-Maverick-17B-128E-Instruct-GGUF
Image-Text-to-Text
•
Updated
6 days ago
•
43.1k
•
23
moonshotai/Kimi-VL-A3B-Thinking
Image-Text-to-Text
•
Updated
Apr 20
•
48.7k
•
409
Previous
1
2
3
4
...
100
Next