Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 12
Inference Providers
Cerebras
Novita
Nebius AI
Featherless AI
Fireworks
Together AI
Groq
Hyperbolic
+ 6
Apply filters
Models
4,904
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
Intel/llava-llama-3-8b
Image-Text-to-Text
•
8B
•
Updated
Jul 1, 2024
•
1.35k
•
17
openbmb/MiniCPM-Llama3-V-2_5
Image-Text-to-Text
•
9B
•
Updated
Jan 15
•
36.4k
•
1.4k
FreedomIntelligence/HuatuoGPT-Vision-34B
Image-Text-to-Text
•
35B
•
Updated
Jul 3, 2024
•
10.5k
•
25
llava-hf/llava-interleave-qwen-0.5b-hf
Image-Text-to-Text
•
0.9B
•
Updated
Jan 27
•
36.9k
•
34
llava-hf/llava-next-110b-hf
Image-Text-to-Text
•
112B
•
Updated
Jan 27
•
856
•
6
openbmb/MiniCPM-V-2_6
Image-Text-to-Text
•
8B
•
Updated
Jun 13
•
66k
•
991
llava-hf/llava-onevision-qwen2-0.5b-si-hf
Image-Text-to-Text
•
0.9B
•
Updated
Jun 18
•
1.85k
•
11
llava-hf/llava-onevision-qwen2-7b-ov-hf
Image-Text-to-Text
•
8B
•
Updated
Jun 18
•
40.8k
•
32
TheFinAI/FinLLaVA
Image-Text-to-Text
•
8B
•
Updated
Aug 28, 2024
•
132
•
14
Qwen/Qwen2-VL-7B-Instruct
Image-Text-to-Text
•
8B
•
Updated
Feb 6
•
703k
•
•
1.22k
Qwen/Qwen2-VL-2B-Instruct-AWQ
Image-Text-to-Text
•
1B
•
Updated
Sep 21, 2024
•
1.75k
•
23
openvla/openvla-7b-finetuned-libero-spatial
Image-Text-to-Text
•
8B
•
Updated
Oct 9, 2024
•
9.5k
•
2
Qwen/Qwen2-VL-7B
Image-Text-to-Text
•
8B
•
Updated
Jan 12
•
5k
•
56
Qwen/Qwen2-VL-72B-Instruct
Image-Text-to-Text
•
73B
•
Updated
Feb 6
•
5.62k
•
•
305
AIDC-AI/Ovis1.6-Gemma2-9B
Image-Text-to-Text
•
10B
•
Updated
Feb 26
•
11.5k
•
275
meta-llama/Llama-3.2-11B-Vision
Image-Text-to-Text
•
11B
•
Updated
Sep 27, 2024
•
20.4k
•
536
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
8B
•
Updated
Apr 4
•
44.6k
•
536
rhymes-ai/Aria
Image-Text-to-Text
•
25B
•
Updated
Apr 23
•
22.2k
•
633
nvidia/NVLM-D-72B
Image-Text-to-Text
•
79B
•
Updated
Jan 14
•
45.8k
•
771
OpenGVLab/Mono-InternVL-2B
Image-Text-to-Text
•
3B
•
Updated
8 days ago
•
4.65k
•
36
latent-action-pretraining/LAPA-7B-openx
Image-Text-to-Text
•
Updated
Nov 22, 2024
•
12
AIDC-AI/Ovis1.6-Llama3.2-3B
Image-Text-to-Text
•
4B
•
Updated
Feb 26
•
13.7k
•
49
PULSE-ECG/PULSE-7B
Image-Text-to-Text
•
7B
•
Updated
Oct 28, 2024
•
8.76k
•
21
calcuis/llava-gguf
Image-Text-to-Text
•
7B
•
Updated
Nov 2, 2024
•
493
•
2
OpenGVLab/InternVL2-8B-MPO
Image-Text-to-Text
•
8B
•
Updated
Dec 20, 2024
•
249
•
36
OpenGVLab/InternVL2_5-8B
Image-Text-to-Text
•
8B
•
Updated
Mar 25
•
69.9k
•
94
OpenGVLab/InternVL2_5-1B
Image-Text-to-Text
•
0.9B
•
Updated
Mar 25
•
15.9k
•
60
OpenGVLab/InternVL2_5-4B
Image-Text-to-Text
•
4B
•
Updated
Mar 25
•
49.6k
•
52
unsloth/llava-v1.6-mistral-7b-hf-bnb-4bit
Image-Text-to-Text
•
4B
•
Updated
Feb 13
•
5.46k
•
8
google/paligemma2-3b-pt-224
Image-Text-to-Text
•
3B
•
Updated
Dec 5, 2024
•
176k
•
154
Previous
1
...
3
4
5
6
7
...
100
Next