Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 13
Inference Providers
Fireworks
Cerebras
Novita
Nebius AI
Together AI
Groq
fal
Cohere
+ 6
Apply filters
Models
6,151
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
magistermilitum/Tridis_HTR_MiniCPM
Image-Text-to-Text
•
Updated
Mar 19
•
3
zwt123home123/InternVL2_5-8B
Image-Text-to-Text
•
8B
•
Updated
Feb 19
•
5
MIL-UT/Asagi-8B
Image-Text-to-Text
•
8B
•
Updated
Feb 24
•
9
•
4
rp-yu/Qwen2-VL-7b-VPT-CLIP
Image-Text-to-Text
•
8B
•
Updated
Jul 7
•
268
•
1
rp-yu/Qwen2-VL-2b-VPT-Seg
Image-Text-to-Text
•
3B
•
Updated
Jul 14
•
19
•
1
rp-yu/Qwen2-VL-2b-VPT-CLIP
Image-Text-to-Text
•
Updated
Mar 11
•
26
•
1
rp-yu/Qwen2-VL-2b-VPT-Det
Image-Text-to-Text
•
Updated
Mar 11
•
19
rp-yu/Qwen2-VL-2b-VPT-Det-NoPrompt
Image-Text-to-Text
•
Updated
Mar 11
•
16
rp-yu/Qwen2-VL-2b-VPT-Seg-Alignment
Image-Text-to-Text
•
Updated
Mar 11
•
15
rp-yu/Qwen2-VL-2b-VPT-Det-Alignment
Image-Text-to-Text
•
Updated
Mar 11
•
21
2dameneko/MiniCPM-o-2_6-nf4
Image-Text-to-Text
•
5B
•
Updated
Feb 19
•
9
AXERA-TECH/InternVL2_5-1B
Image-Text-to-Text
•
Updated
Apr 4
•
4
•
1
moot20/paligemma2-3b-mix-224-MLX-4bits
Image-Text-to-Text
•
0.6B
•
Updated
Feb 19
•
20
moot20/paligemma2-3b-mix-224-MLX-6bits
Image-Text-to-Text
•
0.8B
•
Updated
Feb 19
•
16
moot20/paligemma2-3b-mix-224-MLX-8bits
Image-Text-to-Text
•
0.9B
•
Updated
Feb 19
•
19
moot20/paligemma2-3b-mix-448-MLX-4bits
Image-Text-to-Text
•
0.6B
•
Updated
Feb 19
•
15
moot20/paligemma2-3b-mix-448-MLX-8bits
Image-Text-to-Text
•
1.0B
•
Updated
Feb 19
•
14
moot20/paligemma2-3b-mix-448-MLX-6bits
Image-Text-to-Text
•
0.8B
•
Updated
Feb 19
•
17
moot20/paligemma2-10b-mix-224-MLX-4bits
Image-Text-to-Text
•
2B
•
Updated
Feb 19
•
15
moot20/paligemma2-10b-mix-224-MLX-6bits
Image-Text-to-Text
•
2B
•
Updated
Feb 19
•
17
moot20/paligemma2-10b-mix-224-MLX-8bits
Image-Text-to-Text
•
3B
•
Updated
Feb 19
•
15
moot20/paligemma2-10b-mix-448-MLX-4bits
Image-Text-to-Text
•
2B
•
Updated
Feb 19
•
20
moot20/paligemma2-10b-mix-448-MLX-6bits
Image-Text-to-Text
•
2B
•
Updated
Feb 19
•
15
moot20/paligemma2-10b-mix-448-MLX-8bits
Image-Text-to-Text
•
3B
•
Updated
Feb 19
•
18
moot20/paligemma2-28b-mix-224-MLX-4bits
Image-Text-to-Text
•
4B
•
Updated
Feb 19
•
23
moot20/paligemma2-28b-mix-224-MLX-6bits
Image-Text-to-Text
•
6B
•
Updated
Feb 19
•
16
moot20/paligemma2-28b-mix-224-MLX-8bits
Image-Text-to-Text
•
8B
•
Updated
Feb 19
•
18
mlx-community/paligemma2-3b-mix-224-4bit
Image-Text-to-Text
•
0.8B
•
Updated
Feb 19
•
18
mlx-community/paligemma2-3b-mix-224-3bit
Image-Text-to-Text
•
0.7B
•
Updated
Feb 19
•
16
mlx-community/paligemma2-3b-mix-224-6bit
Image-Text-to-Text
•
1.0B
•
Updated
Feb 19
•
16
Previous
1
...
78
79
80
81
82
...
100
Next