Update README.md
README.md
@@ -43,7 +43,7 @@ This version is trained with `colpali-engine==0.3.7`.
 We train models using low-rank adapters ([LoRA](https://arxiv.org/abs/2106.09685))
 with `alpha=128` and `r=128` on the transformer layers from the language model,
 as well as the final randomly initialized projection layer, and use a `paged_adamw_8bit` optimizer.
-We train on an
+We train on an 8xA100 GPU setup with distributed data parallelism (via `accelerate`), a learning rate of 2e-4 with linear decay and 1% warmup steps, a per-device batch size of 32, gradient accumulation over 2 steps, and `bfloat16` precision.

 ## Installation
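For readers reproducing this setup, here is a minimal sketch of how the hyperparameters in the updated paragraph could map onto a `peft` + `transformers` configuration. The `target_modules` list, `output_dir`, and training-script name are illustrative assumptions; the README does not specify them, and colpali-engine's own training code may differ:

```python
# Minimal sketch of the training configuration described in the diff above.
# Values not stated in the README (target modules, output path) are placeholders.
from peft import LoraConfig
from transformers import TrainingArguments

# LoRA on the language model's transformer layers: alpha=128, r=128.
lora_config = LoraConfig(
    r=128,
    lora_alpha=128,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # assumed module names
)

# 2e-4 learning rate with linear decay and 1% warmup, per-device batch size 32,
# gradient accumulation over 2 steps, bfloat16, and a paged 8-bit AdamW optimizer.
training_args = TrainingArguments(
    output_dir="./train_out",  # placeholder path
    learning_rate=2e-4,
    lr_scheduler_type="linear",
    warmup_ratio=0.01,
    per_device_train_batch_size=32,
    gradient_accumulation_steps=2,
    bf16=True,
    optim="paged_adamw_8bit",
)
```

Running such a script via `accelerate launch --num_processes 8 train.py` (script name hypothetical) would correspond to the 8xA100 distributed data-parallel setup described in the commit.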