Add model card
#2
by
nielsr
HF staff
- opened
README.md
ADDED
@@ -0,0 +1,6 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
pipeline_tag: video-text-to-text
|
3 |
+
---
|
4 |
+
This repository contains the model of the paper [VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction](https://huggingface.co/papers/2501.01957).
|
5 |
+
|
6 |
+
Code: https://github.com/VITA-MLLM/VITA
|