Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
1
Libraries
Datasets
Languages
Licenses
Other
Reset Tasks
Multimodal
Audio-Text-to-Text
Image-Text-to-Text
Visual Question Answering
Document Question Answering
Video-Text-to-Text
Visual Document Retrieval
Any-to-Any
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Keypoint Detection
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Text Ranking
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Models
10,166
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
google/gemma-3n-E4B-it-litert-preview
Image-Text-to-Text
•
Updated
15 days ago
•
1.05k
Hcompany/Holo1-7B
Image-Text-to-Text
•
Updated
5 days ago
•
1.11k
•
112
google/gemma-3n-E2B-it-litert-preview
Image-Text-to-Text
•
Updated
20 days ago
•
350
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1
Image-Text-to-Text
•
Updated
about 11 hours ago
•
3.6k
•
86
Hcompany/Holo1-3B
Image-Text-to-Text
•
Updated
5 days ago
•
1.54k
•
69
google/medgemma-4b-it
Image-Text-to-Text
•
Updated
19 days ago
•
71.7k
•
362
XiaomiMiMo/MiMo-VL-7B-RL
Image-Text-to-Text
•
Updated
3 days ago
•
6.78k
•
129
Qwen/Qwen2.5-VL-7B-Instruct
Image-Text-to-Text
•
Updated
Apr 6
•
2.4M
•
•
949
ByteDance/Dolphin
Image-Text-to-Text
•
Updated
14 days ago
•
4.15k
•
263
stockmark/Stockmark-2-VL-100B-beta
Image-Text-to-Text
•
Updated
7 days ago
•
505
•
17
bharatgenai/patram-7b-instruct
Image-Text-to-Text
•
Updated
2 days ago
•
192
•
17
google/gemma-3-4b-it
Image-Text-to-Text
•
Updated
Mar 21
•
990k
•
598
google/gemma-3-27b-it
Image-Text-to-Text
•
Updated
Mar 21
•
402k
•
•
1.42k
google/gemma-3-12b-it
Image-Text-to-Text
•
Updated
Mar 21
•
393k
•
•
404
fancyfeast/llama-joycaption-beta-one-hf-llava
Image-Text-to-Text
•
Updated
25 days ago
•
60k
•
109
Qwen/Qwen2.5-VL-3B-Instruct
Image-Text-to-Text
•
Updated
Apr 6
•
3.1M
•
400
microsoft/GUI-Actor-7B-Qwen2-VL
Image-Text-to-Text
•
Updated
34 minutes ago
•
219
•
14
qingy2024/GRMR-V3-G4B
Image-Text-to-Text
•
Updated
4 days ago
•
72
•
12
mistralai/Mistral-Small-3.1-24B-Instruct-2503
Image-Text-to-Text
•
Updated
May 9
•
115k
•
•
1.27k
google/medgemma-4b-pt
Image-Text-to-Text
•
Updated
19 days ago
•
4.78k
•
87
vikhyatk/moondream2
Image-Text-to-Text
•
Updated
Apr 14
•
347k
•
1.15k
microsoft/GUI-Actor-Verifier-2B
Image-Text-to-Text
•
Updated
32 minutes ago
•
48
•
10
microsoft/GUI-Actor-2B-Qwen2-VL
Image-Text-to-Text
•
Updated
34 minutes ago
•
169
•
9
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text
•
Updated
Dec 4, 2024
•
820k
•
•
1.45k
meta-llama/Llama-4-Scout-17B-16E-Instruct
Image-Text-to-Text
•
Updated
18 days ago
•
433k
•
•
939
unsloth/medgemma-27b-text-it-GGUF
Image-Text-to-Text
•
Updated
20 days ago
•
26.7k
•
32
ds4sd/SmolDocling-256M-preview
Image-Text-to-Text
•
Updated
25 days ago
•
344k
•
1.42k
unsloth/gemma-3-4b-it-GGUF
Image-Text-to-Text
•
Updated
29 days ago
•
63.5k
•
109
google/gemma-3-4b-it-qat-q4_0-gguf
Image-Text-to-Text
•
Updated
Apr 11
•
7.91k
•
164
Qwen/Qwen2.5-VL-72B-Instruct
Image-Text-to-Text
•
Updated
4 days ago
•
644k
•
•
477
Previous
1
2
3
...
100
Next