Update README.md
Browse files
README.md
CHANGED
|
@@ -6,12 +6,11 @@ language:
|
|
| 6 |
- en
|
| 7 |
tags:
|
| 8 |
- colpali
|
| 9 |
-
- vidore
|
| 10 |
- vidore-experimental
|
| 11 |
---
|
| 12 |
# ColQwen2: Visual Retriever based on Qwen2-VL-2B-Instruct with ColBERT strategy
|
| 13 |
|
| 14 |
-
### This is the base version trained with batch_size 256 instead of 32
|
| 15 |
|
| 16 |
ColQwen is a model based on a novel model architecture and training strategy based on Vision Language Models (VLMs) to efficiently index documents from their visual features.
|
| 17 |
It is a [Qwen2-VL-2B](https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct) extension that generates [ColBERT](https://arxiv.org/abs/2004.12832)- style multi-vector representations of text and images.
|
|
|
|
| 6 |
- en
|
| 7 |
tags:
|
| 8 |
- colpali
|
|
|
|
| 9 |
- vidore-experimental
|
| 10 |
---
|
| 11 |
# ColQwen2: Visual Retriever based on Qwen2-VL-2B-Instruct with ColBERT strategy
|
| 12 |
|
| 13 |
+
### This is the base version trained with batch_size 256 instead of 32 for 1 epoch
|
| 14 |
|
| 15 |
ColQwen is a model based on a novel model architecture and training strategy based on Vision Language Models (VLMs) to efficiently index documents from their visual features.
|
| 16 |
It is a [Qwen2-VL-2B](https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct) extension that generates [ColBERT](https://arxiv.org/abs/2004.12832)- style multi-vector representations of text and images.
|