VITA-MLLM
/

VITA-1.5

Video-Text-to-Text

Model card Files Files and versions Community

VITA-1.5 / README.md

lxysl's picture

Add model card (#2)

e42b47f verified 27 days ago

|

history blame contribute delete

245 Bytes

	---
	pipeline_tag: video-text-to-text
	---
	This repository contains the model of the paper [VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction](https://huggingface.co/papers/2501.01957).

	Code: https://github.com/VITA-MLLM/VITA