carlizor's Collections
Image Vision
Updated 5 days ago
Salesforce/xgen-mm-phi3-mini-instruct-r-v1 • Image-Text-to-Text • Updated Feb 3 • 1.01k • 186
AIDC-AI/Ovis1.6-Gemma2-9B • Image-Text-to-Text • Updated 26 days ago • 4.79k • 269
nvidia/NVLM-D-72B • Image-Text-to-Text • Updated Jan 14 • 20.4k • 766
microsoft/OmniParser • Image-Text-to-Text • Updated Dec 2, 2024 • 1.67k • 1.65k
deepseek-ai/Janus-1.3B • Any-to-Any • Updated Jan 27 • 11.5k • 583
deepseek-ai/JanusFlow-1.3B • Any-to-Any • Updated Jan 27 • 4.25k • 142
NexaAIDev/OmniVLM-968M • Updated Dec 17, 2024 • 1.26k • 513
vikhyatk/moondream2 • Image-Text-to-Text • Updated Jan 9 • 156k • 1.08k
stepfun-ai/GOT-OCR2_0 • Image-Text-to-Text • Updated Feb 4 • 86.8k • 1.43k
jiuhai/florence-vl-8b-sft • Updated Dec 3, 2024 • 41 • 19
AI-Safeguard/Ivy-VL-llava • Visual Question Answering • Updated Dec 31, 2024 • 825 • 63
OpenGVLab/InternVL2_5-78B • Image-Text-to-Text • Updated Feb 5 • 4.38k • 185
Qwen/QVQ-72B-Preview • Image-Text-to-Text • Updated Jan 12 • 149k • 571
deepseek-ai/deepseek-vl2 • Image-Text-to-Text • Updated Dec 18, 2024 • 12.5k • 305
allenai/Molmo-7B-D-0924 • Image-Text-to-Text • Updated Oct 10, 2024 • 409k • 517
prithivMLmods/Qwen2-VL-OCR-2B-Instruct • Image-Text-to-Text • Updated Jan 11 • 59.2k • 57
ByteDance/Sa2VA-1B • Image-Text-to-Text • Updated 6 days ago • 1.78k • 20
HuggingFaceTB/SmolVLM-500M-Instruct • Image-Text-to-Text • Updated 18 days ago • 22.9k • 116
Qwen/Qwen2.5-VL-72B-Instruct • Image-Text-to-Text • Updated 1 day ago • 374k • 400
Qwen/Qwen2.5-VL-7B-Instruct • Image-Text-to-Text • Updated 1 day ago • 3.19M • 717
OpenGVLab/InternVideo2_5_Chat_8B • Video-Text-to-Text • Updated Feb 18 • 6.97k • 50
nvidia/Eagle2-9B • Image-Text-to-Text • Updated Jan 28 • 5.59k • 45
stepfun-ai/GOT-OCR-2.0-hf • Image-Text-to-Text • Updated Jan 31 • 168k • 177
allenai/olmOCR-7B-0225-preview • Image-Text-to-Text • Updated 28 days ago • 541k • 575
microsoft/Magma-8B • Image-Text-to-Text • Updated 19 days ago • 14.6k • 339
marco/mcdse-2b-v1 • Updated Oct 29, 2024 • 6.11k • 54
CohereForAI/aya-vision-8b • Image-Text-to-Text • Updated 20 days ago • 150k • 265
Skywork/Skywork-R1V-38B • Image-Text-to-Text • Updated 6 days ago • 2.94k • 97
ds4sd/SmolDocling-256M-preview • Image-Text-to-Text • Updated 1 day ago • 27.9k • 856