Hugging Face
Models: 6,150
Sort: Trending
Active filters: image-text-to-text
ctranslate2-4you/InternVL2_5-1B • Image-Text-to-Text • 0.9B • Updated Feb 28 • 4
ctranslate2-4you/InternVL2_5-4B • Image-Text-to-Text • 4B • Updated Feb 28 • 6
turningpoint-ai/VisualThinker-R1-Zero • Image-Text-to-Text • 2B • Updated Apr 15 • 1.06k • 6
ctranslate2-4you/Molmo-7B-D-0924 • Image-Text-to-Text • 8B • Updated Mar 2 • 4
ljnlonoljpiljm/florence-2-base-ft-region-proposal • Image-Text-to-Text • 0.3B • Updated Mar 11 • 7
saim1212/qwen2_2b_git2 • Image-Text-to-Text • 2B • Updated Mar 1 • 8
adityaghai07/Qwen2-VL-2B-Instruct-Q4_K_M-GGUF • Image-Text-to-Text • 2B • Updated Mar 1
Captaint2004/Qwen2-VL-2B-Instruct-Q4_K_M-GGUF • Image-Text-to-Text • 2B • Updated Mar 1 • 8
assentian1970/mplug3_dsd • Image-Text-to-Text • 8B • Updated Mar 2 • 3
mlx-community/UI-TARS-7B-SFT-4bit • Image-Text-to-Text • 2B • Updated Mar 3 • 11
mlx-community/UI-TARS-7B-DPO-4bit • Image-Text-to-Text • 2B • Updated Mar 3 • 8
mlx-community/UI-TARS-7B-SFT-6bit • Image-Text-to-Text • 2B • Updated Mar 3 • 9
mlx-community/UI-TARS-7B-DPO-6bit • Image-Text-to-Text • 2B • Updated Mar 3 • 10
mlx-community/UI-TARS-7B-SFT-8bit • Image-Text-to-Text • 3B • Updated Mar 3 • 10
mlx-community/UI-TARS-7B-SFT-bf16 • Image-Text-to-Text • 8B • Updated Mar 3 • 3
mlx-community/UI-TARS-7B-DPO-8bit • Image-Text-to-Text • 3B • Updated Mar 3 • 15 • 1
OpenGVLab/InternVL2_5-Pretrain-Models • Image-Text-to-Text • Updated Mar 25 • 6
mlx-community/UI-TARS-7B-DPO-bf16 • Image-Text-to-Text • 8B • Updated Mar 3 • 12
egeozsoy/MM-OR • Image-Text-to-Text • Updated about 16 hours ago
rootonchair/InternVL2_5-4B-AWQ • Image-Text-to-Text • 1B • Updated Mar 3 • 5 • 2
mlx-community/UI-TARS-72B-SFT-4bit • Image-Text-to-Text • 12B • Updated Mar 3 • 117
mlx-community/UI-TARS-72B-SFT-6bit • Image-Text-to-Text • 17B • Updated Mar 3 • 4
mlx-community/UI-TARS-72B-SFT-8bit • Image-Text-to-Text • 21B • Updated Mar 3 • 111
mlx-community/UI-TARS-72B-SFT-bf16 • Image-Text-to-Text • 73B • Updated Mar 3 • 7 • 1
mradermacher/ToriiGate-v0.4-7B-i1-GGUF • Image-Text-to-Text • 8B • Updated Jul 11 • 243 • 1
SenseLLM/SpiritSight-Agent-8B • Image-Text-to-Text • Updated Apr 21 • 8
FriendliAI/Molmo-7B-D-0924 • Image-Text-to-Text • 8B • Updated Mar 4 • 13
FriendliAI/Molmo-72B-0924 • Image-Text-to-Text • 73B • Updated Mar 4 • 7
FriendliAI/Molmo-7B-O-0924 • Image-Text-to-Text • 8B • Updated Mar 4 • 5
FriendliAI/Phi-3.5-vision-instruct • Image-Text-to-Text • 4B • Updated Mar 4 • 9