A collection of EMOVA models (https://emova-ollm.github.io/)
AI & ML interests
Omni-modal Large Language Models, Multi-modal Large Language Models (MLLMs), Emotional spoken dialogue
Recent Activity
Organization Card
👋 Welcome to EMOVA! We are a team focusing on fully open-sourced omni-modal foundational models with visual, textual, and speech capabilities. EMOVA (EMotionally Omni-present Voice Assistant) is a novel Omni-modal Large Language Model with end-to-end speech capabilities while maintaining state-of-the-art vision-language performance. We wish to promote the development of omni-modal human interactions with intelligent models!
models
13

Emova-ollm/Qwen2.5-7B-Instruct_add_speech_token_4096_nostrip
Feature Extraction
•
7B
•
Updated
•
21

Emova-ollm/emova-qwen-2-5-72b-hf
Feature Extraction
•
74B
•
Updated
•
2
•
2

Emova-ollm/emova-qwen-2-5-72b
Text Generation
•
74B
•
Updated
•
2
•
1

Emova-ollm/emova-qwen-2-5-7b-hf
Feature Extraction
•
8B
•
Updated
•
64
•
2

Emova-ollm/emova-qwen-2-5-7b
Text Generation
•
8B
•
Updated
•
5
•
1

Emova-ollm/emova-qwen-2-5-3b-hf
Feature Extraction
•
4B
•
Updated
•
11
•
5

Emova-ollm/emova-qwen-2-5-3b
Text Generation
•
4B
•
Updated
•
4
•
2

Emova-ollm/qwen2vit600m
Feature Extraction
•
0.7B
•
Updated
•
1.97k

Emova-ollm/Meta-Llama-3.1-8B-Instruct_add_speech_token_4096_nostrip-2
Feature Extraction
•
8B
•
Updated
•
2

Emova-ollm/Qwen2.5-3B-Instruct_add_speech_token_4096_nostrip
Text Generation
•
3B
•
Updated
•
203
datasets
5
Emova-ollm/emova-alignment-7m
Viewer
•
Updated
•
6.18M
•
1.25k
•
4
Emova-ollm/emova-sft-speech-eval
Viewer
•
Updated
•
3.76k
•
12
Emova-ollm/emova-asr-tts-eval
Viewer
•
Updated
•
5.24k
•
14
Emova-ollm/emova-sft-speech-231k
Viewer
•
Updated
•
231k
•
100
•
2
Emova-ollm/emova-sft-4m
Viewer
•
Updated
•
4.31M
•
512
•
3