Update README.md
README.md
CHANGED
@@ -21,10 +21,9 @@ InternVL scales up the ViT to _**6B parameters**_ and aligns it with LLM.
 ## Model Details
 - **Model Type:** multimodal large language model (MLLM)
 - **Model Stats:**
-  - Architecture: [InternViT-6B-448px](https://huggingface.co/OpenGVLab/InternViT-6B-448px) + MLP + LLaMA2-13B (
+  - Architecture: [InternViT-6B-448px](https://huggingface.co/OpenGVLab/InternViT-6B-448px) + MLP + LLaMA2-13B (Our internal SFT versions)
+  - Image size: 448 x 448 (256 tokens)
   - Params: 19B
-  - Image size: 448 x 448
-  - Number of visual tokens: 256
 
 - **Training Strategy:**
   - Pretraining Stage