Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
1
Libraries
1
Datasets
Languages
Licenses
Other
Reset Tasks
Multimodal
Image-Text-to-Text
Visual Question Answering
Document Question Answering
Video-Text-to-Text
Any-to-Any
Audio-Text-to-Text
Computer Vision
Image Classification
Object Detection
Video Classification
Image Segmentation
Image-to-Text
Zero-Shot Image Classification
Image Feature Extraction
Mask Generation
Text-to-Image
Depth Estimation
Zero-Shot Object Detection
Unconditional Image Generation
Image-to-Image
Keypoint Detection
Image-to-3D
Text-to-Video
Text-to-3D
Image-to-Video
Natural Language Processing
Text Generation
Text Classification
Text2Text Generation
Token Classification
Fill-Mask
Question Answering
Feature Extraction
Translation
Sentence Similarity
Summarization
Zero-Shot Classification
Table Question Answering
Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Text-to-Speech
Text-to-Audio
Voice Activity Detection
Tabular
Tabular Classification
Time Series Forecasting
Tabular Regression
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Tasks with no match
Multimodal
Visual Document Retrieval
Apply filters
Models
40
Full-text search
Edit filters
Sort: Trending
Active filters:
video-text-to-text, transformers
Clear all
OpenGVLab/InternVideo2_5_Chat_8B
Video-Text-to-Text
•
Updated
about 16 hours ago
•
2.1k
•
13
lmms-lab/LLaVA-Video-7B-Qwen2
Video-Text-to-Text
•
Updated
Oct 25, 2024
•
70.6k
•
71
Neleac/SpaceTimeGPT
Video-Text-to-Text
•
Updated
22 days ago
•
980
•
32
Chat-UniVi/Chat-UniVi
Video-Text-to-Text
•
Updated
Oct 22, 2024
•
64.2k
•
14
lmms-lab/LLaVA-NeXT-Video-7B
Video-Text-to-Text
•
Updated
Aug 25, 2024
•
434
•
42
llava-hf/LLaVA-NeXT-Video-7B-hf
Video-Text-to-Text
•
Updated
16 days ago
•
47.6k
•
63
llava-hf/LLaVA-NeXT-Video-34B-hf
Video-Text-to-Text
•
Updated
16 days ago
•
812
•
7
KangarooGroup/kangaroo
Video-Text-to-Text
•
Updated
Nov 13, 2024
•
184
•
12
lmms-lab/LLaVA-NeXT-Video-32B-Qwen
Video-Text-to-Text
•
Updated
Oct 4, 2024
•
2.76k
•
15
OpenGVLab/InternVideo2-Chat-8B
Video-Text-to-Text
•
Updated
Oct 10, 2024
•
1.94k
•
22
THUDM/cogvlm2-llama3-caption
Video-Text-to-Text
•
Updated
21 days ago
•
7.1k
•
81
GoodiesHere/Apollo-LMMs-Apollo-7B-t32
Video-Text-to-Text
•
Updated
Dec 18, 2024
•
417
•
50
OpenGVLab/VideoChat-TPO
Video-Text-to-Text
•
Updated
Jan 2
•
83
•
3
OpenGVLab/VideoChat-Flash-Qwen2_5-2B_res448
Video-Text-to-Text
•
Updated
23 days ago
•
2.14k
•
12
OpenGVLab/VideoChat-Flash-Qwen2-7B_res224
Video-Text-to-Text
•
Updated
23 days ago
•
947
•
3
OpenGVLab/VideoChat-Flash-Qwen2-7B_res448
Video-Text-to-Text
•
Updated
23 days ago
•
1.18k
•
8
ruili0/LongVA-7B-TPO
Video-Text-to-Text
•
Updated
18 days ago
•
144
•
1
ruili0/LLaVA-Video-7B-Qwen2-TPO
Video-Text-to-Text
•
Updated
18 days ago
•
504
•
1
OpenGVLab/InternVL_2_5_HiCo_R16
Video-Text-to-Text
•
Updated
20 days ago
•
262
•
2
Chat-UniVi/Chat-UniVi-13B
Video-Text-to-Text
•
Updated
Dec 7, 2024
•
650
•
9
Chat-UniVi/Chat-UniVi-7B-v1.5
Video-Text-to-Text
•
Updated
Dec 7, 2024
•
51
•
2
lmms-lab/LLaVA-NeXT-Video-7B-DPO
Video-Text-to-Text
•
Updated
Aug 25, 2024
•
12k
•
25
lmms-lab/LLaVA-NeXT-Video-34B-DPO
Video-Text-to-Text
•
Updated
Aug 25, 2024
•
32
•
10
lmms-lab/LLaVA-NeXT-Video-7B-32K
Video-Text-to-Text
•
Updated
Aug 25, 2024
•
45
•
7
llava-hf/LLaVA-NeXT-Video-7B-DPO-hf
Video-Text-to-Text
•
Updated
16 days ago
•
1.5k
•
9
Mutonix/Vriptor-STLLM
Video-Text-to-Text
•
Updated
Aug 5, 2024
•
11
•
3
ColorfulAI/videollamb-llava-1.5-7b
Video-Text-to-Text
•
Updated
Sep 9, 2024
•
30
•
4
LeroyDyer/_Spydaz_Web_AI_LlavaNextVideo
Video-Text-to-Text
•
Updated
Sep 19, 2024
•
48
•
1
kiddobellamy/Llama_Vision
Video-Text-to-Text
•
Updated
Sep 28, 2024
•
4
•
1
jadechoghari/LongVU_Qwen2_7B
Video-Text-to-Text
•
Updated
Oct 31, 2024
•
47
•
1
Previous
1
2
Next