Upload README.md
Browse files
README.md
CHANGED
@@ -25,9 +25,9 @@ tags:
|
|
25 |
|
26 |
π [Paper](https://arxiv.org/abs/2410.17241) | π [Home](https://github.com/ai4colonoscopy/IntelliScope)
|
27 |
|
28 |
-
> This is the merged weights of [ColonGPT-v1-phi1.5-siglip-lora-stg2](https://drive.google.com/file/d/
|
29 |
|
30 |
-
Our ColonGPT is a standard multimodal language model, which contains four basic components: a language tokenizer, an visual encoder (π€ [SigLIP-SO](https://huggingface.co/google/siglip-so400m-patch14-384)), a multimodal connector, and a language model (π€ [Phi1.5](https://huggingface.co/microsoft/phi-1_5)). In this huggingface page, we provide a quick start for convenient of new users. For further details about ColonGPT, we highly recommend visiting our [homepage](https://github.com/
|
31 |
|
32 |
|
33 |
# Quick start
|
|
|
25 |
|
26 |
π [Paper](https://arxiv.org/abs/2410.17241) | π [Home](https://github.com/ai4colonoscopy/IntelliScope)
|
27 |
|
28 |
+
> This is the merged weights of [ColonGPT-v1-phi1.5-siglip-lora-stg2](https://drive.google.com/file/d/10_1UrMxpThnn61jZmvKx6f8h7lb0hx-W/view?usp=drive_link), including vision encoder (siglip) + language model (phi-1.5), and other fine-tuned weights on our ColonINST.
|
29 |
|
30 |
+
Our ColonGPT is a standard multimodal language model, which contains four basic components: a language tokenizer, an visual encoder (π€ [SigLIP-SO](https://huggingface.co/google/siglip-so400m-patch14-384)), a multimodal connector, and a language model (π€ [Phi1.5](https://huggingface.co/microsoft/phi-1_5)). In this huggingface page, we provide a quick start for convenient of new users. For further details about ColonGPT, we highly recommend visiting our [homepage](https://github.com/ai4colonoscopy/IntelliScope). There, you'll find comprehensive usage instructions for our model and the latest advancements in intelligent colonoscopy technology.
|
31 |
|
32 |
|
33 |
# Quick start
|