Update README.md
Browse files
README.md
CHANGED
@@ -8,7 +8,7 @@ base_model:
|
|
8 |
pipeline_tag: visual-question-answering
|
9 |
---
|
10 |
|
11 |
-
# Pathumma-llm-vision-
|
12 |
|
13 |
## Model Overview
|
14 |
Pathumma-llm-vision-2.0.0-preview is a multi-modal language model fine-tuned for Visual Question Answering (VQA) and Image Captioning tasks. It contains 8 billion parameters and leverages both image and text processing to understand and generate multi-modal content.
|
|
|
8 |
pipeline_tag: visual-question-answering
|
9 |
---
|
10 |
|
11 |
+
# Pathumma-llm-vision-2.0.0-preview
|
12 |
|
13 |
## Model Overview
|
14 |
Pathumma-llm-vision-2.0.0-preview is a multi-modal language model fine-tuned for Visual Question Answering (VQA) and Image Captioning tasks. It contains 8 billion parameters and leverages both image and text processing to understand and generate multi-modal content.
|