Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 13
Inference Providers
Fireworks
Cerebras
Nebius AI
Novita
Together AI
Groq
fal
Hyperbolic
+ 6
Apply filters
Models
6,086
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
yifanzhang114/MM-RLHF-Reward-7B-llava-ov-qwen
Image-Text-to-Text
•
8B
•
Updated
Mar 3
•
71
•
1
kairavishal37/LLava-med-api
Image-Text-to-Text
•
8B
•
Updated
Feb 1
•
3
moot20/Qwen2.5-VL-7B-Instruct-MLX-4bits
Image-Text-to-Text
•
1B
•
Updated
Feb 19
•
23
moot20/Qwen2.5-VL-7B-Instruct-MLX-6bits
Image-Text-to-Text
•
2B
•
Updated
Feb 19
•
13
ljnlonoljpiljm/florence-2-base-phrases
Image-Text-to-Text
•
0.3B
•
Updated
Feb 9
•
5
moot20/Qwen2.5-VL-7B-Instruct-MLX-8bits
Image-Text-to-Text
•
2B
•
Updated
Feb 19
•
16
moot20/Qwen2.5-VL-3B-Instruct-MLX-4bits
Image-Text-to-Text
•
0.7B
•
Updated
Feb 19
•
150
moot20/Qwen2.5-VL-3B-Instruct-MLX-6bits
Image-Text-to-Text
•
0.9B
•
Updated
Feb 19
•
7
moot20/Qwen2.5-VL-3B-Instruct-MLX-8bits
Image-Text-to-Text
•
1B
•
Updated
Feb 19
•
17
•
1
Aitrepreneur/Florence-2-base
Image-Text-to-Text
•
Updated
Feb 1
•
6
Aitrepreneur/Florence-2-large
Image-Text-to-Text
•
Updated
Feb 1
•
8
Triangle104/LatexMind-2B-Codec-Q4_K_S-GGUF
Image-Text-to-Text
•
2B
•
Updated
Feb 2
•
4
Triangle104/LatexMind-2B-Codec-Q4_K_M-GGUF
Image-Text-to-Text
•
2B
•
Updated
Feb 2
•
5
Triangle104/LatexMind-2B-Codec-Q5_K_S-GGUF
Image-Text-to-Text
•
2B
•
Updated
Feb 2
•
3
Triangle104/LatexMind-2B-Codec-Q5_K_M-GGUF
Image-Text-to-Text
•
2B
•
Updated
Feb 2
•
2
Triangle104/LatexMind-2B-Codec-Q6_K-GGUF
Image-Text-to-Text
•
2B
•
Updated
Feb 2
•
13
Triangle104/LatexMind-2B-Codec-Q8_0-GGUF
Image-Text-to-Text
•
2B
•
Updated
Feb 2
•
4
pauljmorris/UI-TARS-7B-DPO
Image-Text-to-Text
•
Updated
Feb 2
mukulp/Qwen2.5-VL-72B-Instruct-bf16
Image-Text-to-Text
•
73B
•
Updated
Feb 2
•
29
mehmetkeremturkcan/DeepSeek-LLaVA-Instruct
Image-Text-to-Text
•
Updated
Feb 2
•
1
pedalnomica/InternVL2_5-78B-MPO-AWQ
Image-Text-to-Text
•
18B
•
Updated
Feb 2
•
6
ilpa-user/mp3_nc_vision
Image-Text-to-Text
•
Updated
Feb 3
•
6
billatsectorflow/Qwen2-VL-7B-Instruct-GPTQ-Int4
Image-Text-to-Text
•
3B
•
Updated
Feb 3
•
5
moot20/SmolVLM-500M-Instruct-MLX-4bits
Image-Text-to-Text
•
0.1B
•
Updated
Feb 19
•
4
moot20/SmolVLM-500M-Instruct-MLX-6bits
Image-Text-to-Text
•
0.1B
•
Updated
Feb 19
•
4
moot20/SmolVLM-500M-Instruct-MLX-8bits
Image-Text-to-Text
•
0.1B
•
Updated
Feb 19
•
4
moot20/SmolVLM-256M-Instruct-MLX-4bits
Image-Text-to-Text
•
0.0B
•
Updated
Feb 19
•
12
moot20/SmolVLM-256M-Instruct-MLX-6bits
Image-Text-to-Text
•
0.1B
•
Updated
Feb 19
•
6
moot20/SmolVLM-256M-Instruct-MLX-8bits
Image-Text-to-Text
•
0.1B
•
Updated
Feb 19
•
5
moot20/SmolVLM-256M-Instruct-MLX
Image-Text-to-Text
•
0.3B
•
Updated
Feb 19
•
7
Previous
1
...
72
73
74
75
76
...
100
Next