Update README.md
Browse files
README.md
CHANGED
@@ -21,11 +21,12 @@ tags:
|
|
21 |
|
22 |
This model is 4bit quantized of [glm-4v-9b](https://huggingface.co/THUDM/glm-4v-9b) Model (Less than 9G).
|
23 |
|
24 |
-
It
|
25 |
|
26 |
Some part of the original Model changed and It can excute on free version of google colab.
|
27 |
# Try it: [](https://colab.research.google.com/drive/1aZGX9f5Yw1WbiOrS3TpvPk_UJUP_yYQU?usp=sharing)
|
28 |
|
|
|
29 |
### About GLM-4V-9B
|
30 |
|
31 |
GLM-4V-9B is a multimodal language model with visual understanding capabilities. The evaluation results of its related classic tasks are as follows:
|
|
|
21 |
|
22 |
This model is 4bit quantized of [glm-4v-9b](https://huggingface.co/THUDM/glm-4v-9b) Model (Less than 9G).
|
23 |
|
24 |
+
It excels in document, image, chart questioning answering and delivers superior performance over GPT-4-turbo-2024-04-09, Gemini 1.0 Pro, Qwen-VL-Max, and Claude 3 Opus.
|
25 |
|
26 |
Some part of the original Model changed and It can excute on free version of google colab.
|
27 |
# Try it: [](https://colab.research.google.com/drive/1aZGX9f5Yw1WbiOrS3TpvPk_UJUP_yYQU?usp=sharing)
|
28 |
|
29 |
+
Note: For optimal performance with document and image understanding, please use English or Chinese. The model can still handle chat in any supported language.
|
30 |
### About GLM-4V-9B
|
31 |
|
32 |
GLM-4V-9B is a multimodal language model with visual understanding capabilities. The evaluation results of its related classic tasks are as follows:
|