Update README.md
@@ -17,7 +17,7 @@ Haoyu Lu*, Wen Liu*, Bo Zhang**, Bingxuan Wang, Kai Dong, Bo Liu, Jingxiang Sun,
 ### 2. Model Summary

-DeepSeek-VL-1.3b is a tiny vision-language model. It uses [SigLIP-L](https://huggingface.co/timm/ViT-L-16-SigLIP-384) as the vision encoder supporting 384 x 384 image input
+DeepSeek-VL-1.3b-base is a tiny vision-language model. It uses [SigLIP-L](https://huggingface.co/timm/ViT-L-16-SigLIP-384) as the vision encoder supporting 384 x 384 image input
 and is constructed based on the DeepSeek-LLM-1.3b-base which is trained on an approximate corpus of 500B text tokens. The whole DeepSeek-VL-1.3b-base model is finally trained around 400B vision-language tokens.

 ## 3. Quick Start
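As a back-of-the-envelope check on the encoder described in the hunk above: a minimal sketch, assuming the SigLIP-L checkpoint (named ViT-L-16-SigLIP-384) tiles its 384 x 384 input into 16 x 16 patches, of how many patch tokens each image contributes to the language model:

```python
# Assumption: "ViT-L-16" means a ViT-Large backbone with 16x16-pixel patches.
image_size = 384  # input resolution supported by the vision encoder
patch_size = 16   # patch edge length implied by the checkpoint name

patches_per_side = image_size // patch_size  # 24 patches along each edge
num_vision_tokens = patches_per_side ** 2    # 576 patch tokens per image

print(patches_per_side, num_vision_tokens)
```

Under that assumption each image costs 576 tokens of the model's context, which is one reason small VLMs cap the input resolution.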