Update README.md
README.md
CHANGED
@@ -34,19 +34,19 @@ We release InternViT-6B-448px-V1-0, which is integrated into [InternVL-Chat-V1-1
 ### Vision Foundation model
 | Model | Date | Download | Note |
 | ----------------------- | ---------- | ---------------------------------------------------------------------- | -------------------------------- |
-| InternViT-6B-448px-V1
-| InternViT-6B-448px-V1
-| InternViT-6B-448px-V1
+| InternViT-6B-448px-V1-5 | 2024.04.20 | 🤗 [HF link](https://huggingface.co/OpenGVLab/InternViT-6B-448px-V1-5) | support dynamic resolution, super strong OCR (🔥new) |
+| InternViT-6B-448px-V1-2 | 2024.02.11 | 🤗 [HF link](https://huggingface.co/OpenGVLab/InternViT-6B-448px-V1-2) | 448 resolution |
+| InternViT-6B-448px-V1-0 | 2024.01.30 | 🤗 [HF link](https://huggingface.co/OpenGVLab/InternViT-6B-448px-V1-0) | 448 resolution |
 | InternViT-6B-224px | 2023.12.22 | 🤗 [HF link](https://huggingface.co/OpenGVLab/InternViT-6B-224px) | vision foundation model |
 | InternVL-14B-224px | 2023.12.22 | 🤗 [HF link](https://huggingface.co/OpenGVLab/InternVL-14B-224px) | vision-language foundation model |
 
 ### Multimodal Large Language Model (MLLM)
 | Model | Date | Download | Note |
 | ----------------------- | ---------- | --------------------------------------------------------------------------- | ---------------------------------- |
-| InternVL-Chat-V1
-| InternVL-Chat-V1
-| InternVL-Chat-V1
-| InternVL-Chat-V1
+| InternVL-Chat-V1-5 | 2024.04.18 | 🤗 [HF link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-5) | support 4K image; super strong OCR; Approaching the performance of GPT-4V and Gemini Pro on various benchmarks like MMMU, DocVQA, ChartQA, MathVista, etc. (🔥new)|
+| InternVL-Chat-V1-2-Plus | 2024.02.21 | 🤗 [HF link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-2-Plus) | more SFT data and stronger |
+| InternVL-Chat-V1-2 | 2024.02.11 | 🤗 [HF link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-2) | scaling up LLM to 34B |
+| InternVL-Chat-V1-1 | 2024.01.24 | 🤗 [HF link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-1) | support Chinese and stronger OCR |
 
 
 ## Model Usage (Image Embeddings)