Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Novita
Replicate
fal
Fireworks
Cerebras
Together AI
SambaNova
Hyperbolic
Nebius AI Studio
HF Inference API
Misc
Reset Misc
multimodal
Inference Endpoints
text-generation-inference
custom_code
AutoTrain Compatible
4-bit precision
Eval Results
Merge
8-bit precision
Mixture of Experts
Misc with no match
text-embeddings-inference
Carbon Emissions
Apply filters
Models
627
Full-text search
Edit filters
Sort: Trending
Active filters:
multimodal
Clear all
Sci-fi-vy/Llama-3.2-11B-Vision-Instruct-finetuned
Image-Text-to-Text
•
Updated
Jan 26
•
19
•
1
mlx-community/Qwen2.5-VL-3B-Instruct-8bit
Image-Text-to-Text
•
Updated
16 days ago
•
764
•
7
mlx-community/Qwen2.5-VL-3B-Instruct-bf16
Image-Text-to-Text
•
Updated
16 days ago
•
134
•
2
mlx-community/Qwen2.5-VL-7B-Instruct-6bit
Image-Text-to-Text
•
Updated
17 days ago
•
279
•
3
mlx-community/Qwen2.5-VL-7B-Instruct-8bit
Image-Text-to-Text
•
Updated
17 days ago
•
1.7k
•
12
mlx-community/Qwen2.5-VL-7B-Instruct-bf16
Image-Text-to-Text
•
Updated
16 days ago
•
349
•
3
jarvisvasu/Qwen2.5-VL-3B-Instruct-4bit
Image-Text-to-Text
•
Updated
Jan 29
•
275
•
3
mlx-community/Qwen2.5-VL-72B-Instruct-3bit
Image-Text-to-Text
•
Updated
17 days ago
•
135
•
4
mlx-community/Qwen2.5-VL-72B-Instruct-8bit
Image-Text-to-Text
•
Updated
17 days ago
•
222
•
1
unsloth/Qwen2.5-VL-7B-Instruct
Image-Text-to-Text
•
Updated
5 days ago
•
3.62k
•
10
unsloth/Qwen2.5-VL-72B-Instruct-bnb-4bit
Image-Text-to-Text
•
Updated
5 days ago
•
6.92k
•
11
Benasd/Qwen2.5-VL-7B-Instruct-AWQ
Image-Text-to-Text
•
Updated
Feb 8
•
1.64k
•
6
Hibernates/Hibernates-JP-1.3b-Max
Updated
Feb 9
•
30
•
2
Sci-fi-vy/Llama-3.2-11B-Vision-Instruct-GGUF
Updated
15 days ago
•
720
•
1
ordis-co-ltd/Qwen2.5-VL-72B-Instruct_exl2_6.0bpw
Image-Text-to-Text
•
Updated
Feb 10
•
67
•
1
Benasd/Qwen2.5-VL-72B-Instruct-AWQ
Image-Text-to-Text
•
Updated
22 days ago
•
24.4k
•
6
OpenGVLab/VideoChat-Flash-Qwen2_5-7B-1M_res224
Video-Text-to-Text
•
Updated
10 days ago
•
132
•
1
YuchengShi/llava-med-v1.5-mistral-7b-chest-xray
Image-Text-to-Text
•
Updated
20 days ago
•
174
•
1
hfl/Qwen2.5-VL-3B-Instruct-GPTQ-Int4
Image-Text-to-Text
•
Updated
17 days ago
•
372
•
1
hfl/Qwen2.5-VL-7B-Instruct-GPTQ-Int4
Image-Text-to-Text
•
Updated
17 days ago
•
998
•
2
lmms-lab/EgoGPT-7b-EgoIT-EgoLife
Updated
7 days ago
•
97
•
2
Benasd/Qwen2.5-VL-72B-Instruct-AWQ-fix
Image-Text-to-Text
•
Updated
12 days ago
•
1.51k
•
1
mlx-community/UI-TARS-7B-DPO-8bit
Image-Text-to-Text
•
Updated
11 days ago
•
51
•
1
mlx-community/UI-TARS-72B-SFT-bf16
Image-Text-to-Text
•
Updated
11 days ago
•
13
•
1
mlx-community/UI-TARS-72B-DPO-bf16
Image-Text-to-Text
•
Updated
8 days ago
•
5
•
1
sujitpal/clip-imageclef
Zero-Shot Image Classification
•
Updated
Oct 31, 2023
•
150
•
3
waybarrios/guidance-based-video-grounding
Updated
Apr 1, 2023
MonoHime/mosei-senti-intermodal
Feature Extraction
•
Updated
May 18, 2023
•
14
MonoHime/mosei-emo-intermodal
Feature Extraction
•
Updated
May 18, 2023
•
11
MonoHime/iemocap-emo-intermodal
Feature Extraction
•
Updated
May 18, 2023
•
13
Previous
1
...
4
5
6
7
8
...
21
Next