---
license: llama2
pipeline_tag: text-generation
tags:
- cortex.cpp
- multimodal
- vicuna
- vision-language
---

## Overview

**LLaVA** (Large Language and Vision Assistant) is an open-source chatbot trained to handle multimodal instruction-following tasks. It is a fine-tuned **Vicuna-7B** model designed to process both **text and image** inputs. This auto-regressive language model builds on the **transformer architecture** to handle vision-language tasks, making it useful for research in **computer vision, natural language processing, machine learning, and artificial intelligence**.

LLaVA-v1.6-Vicuna-7B is the latest iteration, trained in **December 2023** and optimized for improved instruction-following performance in multimodal settings.

## Variants

| No | Variant | Cortex CLI command |
| --- | --- | --- |
| 1 | [llava-v1.6-vicuna-7b-f16](https://huggingface.co/cortexso/llava-v1.6/tree/gguf-f16) | `cortex run llava-v1.6:gguf-f16` |
| 2 | [llava-v1.6-vicuna-7b-q4_km](https://huggingface.co/cortexso/llava-v1.6/tree/gguf-q4-km) | `cortex run llava-v1.6:gguf-q4-km` |
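
Both variants are GGUF builds of the same model: `gguf-q4-km` is 4-bit quantized, so it typically downloads faster and needs less RAM than the full-precision `gguf-f16` build, at some cost in output quality. For example, to run the quantized build directly:

```bash
# Run the 4-bit quantized variant; Cortex should fetch it on first use.
cortex run llava-v1.6:gguf-q4-km
```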

## Use it with Jan (UI)

1. Install **Jan** using the [Quickstart](https://jan.ai/docs/quickstart)
2. In the Jan Model Hub, search for:
```bash
cortexso/llava-v1.6
```

## Use it with Cortex (CLI)

1. Install **Cortex** using the [Quickstart](https://cortex.jan.ai/docs/quickstart)
2. Run the model with the command:
```bash
cortex run llava-v1.6
```
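
Once the model is running, Cortex also exposes an OpenAI-compatible HTTP API you can script against. Below is a minimal sketch assuming the default local server address and port (`127.0.0.1:39281`); check your Cortex configuration if the server runs elsewhere.

```bash
# Minimal sketch: send a chat request to the locally running model
# through Cortex's OpenAI-compatible endpoint.
# The host and port below are assumed defaults; adjust to your setup.
curl http://127.0.0.1:39281/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llava-v1.6",
    "messages": [
      {"role": "user", "content": "Describe LLaVA in one sentence."}
    ]
  }'
```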

## Credits

- **Author:** LLaVA Research Team
- **Converter:** [Homebrew](https://www.homebrew.ltd/)
- **Original License:** [LLAMA 2 Community License](https://github.com/facebookresearch/llama/blob/main/LICENSE)
- **Paper:** [LLaVA-v1.6: Enhancing Large Multimodal Models](https://llava-vl.github.io/)