Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Nebius AI Studio
Fireworks
Cerebras
Replicate
Novita
Hyperbolic
SambaNova
fal
Together AI
HF Inference API
Misc
Reset Misc
multimodal
Inference Endpoints
text-generation-inference
custom_code
AutoTrain Compatible
4-bit precision
Eval Results
Merge
8-bit precision
Mixture of Experts
Misc with no match
text-embeddings-inference
Carbon Emissions
Apply filters
Models
625
Full-text search
Edit filters
Sort: Trending
Active filters:
multimodal
Clear all
nvidia/NVLM-D-72B-mcore
Image-Text-to-Text
•
Updated
Jan 14
•
7
mlx-community/Llama-3.2-90B-Vision-Instruct-4bit
Image-Text-to-Text
•
Updated
Dec 21, 2024
•
139
•
3
osunlp/UGround-V1-7B
Image-Text-to-Text
•
Updated
25 days ago
•
1.35k
•
11
mradermacher/UGround-V1-7B-GGUF
Updated
Jan 4
•
255
•
1
osunlp/UGround-V1-72B-Preview
Image-Text-to-Text
•
Updated
Jan 12
•
18
•
2
nintwentydo/Razorback-12B-v0.1
Image-Text-to-Text
•
Updated
Jan 10
•
18
•
2
nintwentydo/Razorback-12B-v0.2
Image-Text-to-Text
•
Updated
Jan 10
•
58
•
3
erax-ai/EraX-VL-7B-V2.0-Preview
Visual Question Answering
•
Updated
Jan 21
•
2.07k
•
21
OpenGVLab/VideoChat-Flash-Qwen2_5-2B_res448
Video-Text-to-Text
•
Updated
10 days ago
•
937
•
14
OpenGVLab/VideoChat-Flash-Qwen2-7B_res448
Video-Text-to-Text
•
Updated
10 days ago
•
3.56k
•
9
osunlp/UGround-V1-72B
Image-Text-to-Text
•
Updated
Jan 23
•
74
•
4
tahamajs/plamma
Updated
Feb 9
•
16
•
2
bytedance-research/UI-TARS-7B-SFT
Image-Text-to-Text
•
Updated
Jan 25
•
2.69k
•
155
bytedance-research/UI-TARS-72B-SFT
Image-Text-to-Text
•
Updated
Jan 25
•
339
•
14
mradermacher/UI-TARS-2B-SFT-i1-GGUF
Updated
Jan 21
•
193
•
1
mradermacher/UI-TARS-7B-DPO-GGUF
Updated
Jan 21
•
1.74k
•
8
mradermacher/UI-TARS-7B-DPO-i1-GGUF
Updated
Jan 21
•
495
•
3
OpenGVLab/InternVideo2_5_Chat_8B
Video-Text-to-Text
•
Updated
24 days ago
•
16.1k
•
44
OpenGVLab/InternVL_2_5_HiCo_R16
Video-Text-to-Text
•
Updated
28 days ago
•
900
•
3
OpenGVLab/InternVL_2_5_HiCo_R64
Video-Text-to-Text
•
Updated
28 days ago
•
207
•
1
bartowski/UI-TARS-72B-DPO-GGUF
Image-Text-to-Text
•
Updated
Jan 23
•
1.61k
•
2
lmstudio-community/UI-TARS-7B-DPO-GGUF
Image-Text-to-Text
•
Updated
Jan 23
•
930
•
6
bartowski/UI-TARS-7B-DPO-GGUF
Image-Text-to-Text
•
Updated
Jan 23
•
2.08k
•
6
lmstudio-community/UI-TARS-2B-SFT-GGUF
Image-Text-to-Text
•
Updated
Jan 23
•
372
•
3
bartowski/UI-TARS-2B-SFT-GGUF
Image-Text-to-Text
•
Updated
Jan 23
•
1.13k
•
1
bartowski/UI-TARS-7B-SFT-GGUF
Image-Text-to-Text
•
Updated
Jan 24
•
2.38k
•
3
lmstudio-community/UI-TARS-72B-DPO-GGUF
Image-Text-to-Text
•
Updated
Jan 23
•
238
•
1
3ib0n/Qwen2-VL-2B-rkllm
Image-Text-to-Text
•
Updated
Jan 23
•
2
vincentamato/ARIA
Updated
Jan 25
•
2
Sci-fi-vy/Llama-3.2-11B-Vision-Instruct-finetuned
Image-Text-to-Text
•
Updated
Jan 26
•
19
•
1
Previous
1
...
3
4
5
6
7
...
21
Next