qc903113684 commited on
Commit
18fd315
·
verified ·
1 Parent(s): ba34ea5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +38 -5
README.md CHANGED
@@ -1,5 +1,38 @@
1
- ---
2
- license: other
3
- license_name: aplux-model-farm-license
4
- license_link: https://aiot.aidlux.com/api/v1/files/license/model_farm_license_en.pdf
5
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ license_name: aplux-model-farm-license
4
+ license_link: https://aiot.aidlux.com/api/v1/files/license/model_farm_license_en.pdf
5
+ pipeline_tag: image-classification
6
+ tags:
7
+ - AIoT
8
+ - QNN
9
+ ---
10
+
11
+ ![](https://aiot.aidlux.com/_next/image?url=%2Fapi%2Fv1%2Ffiles%2Fmodel%2Fcover%2F20250320025932_%25E5%259B%25BE1(11).png&w=640&q=75)
12
+
13
+ ## SigLIP-base: lmage Captioning
14
+
15
+ SigLIP-base is a medium-sized multimodal model developed by Google, built on the SoViT (Shape-optimized Vision Transformer) architecture and trained using Sigmoid Loss instead of the contrastive loss used in CLIP. This training approach improves performance in small-batch settings and enhances robustness to negative samples. SigLIP-base achieves strong results in tasks such as image-text retrieval and zero-shot image classification. With solid inference efficiency and scalability, it is well-suited for multilingual and multitask vision-language applications.
16
+
17
+ ### Source model
18
+
19
+ - Input shape: [1x3x384x384], [1x64]
20
+ - Number of parameters: 88.86M, 105.16M
21
+ - Model size: 359.10M, 424.01M
22
+ - Output shape: [1x768], [1x768]
23
+
24
+ The source model can be found [here](https://huggingface.co/google/siglip-base-patch16-384)
25
+
26
+ ## Performance Reference
27
+
28
+ Please search model by model name in [Model Farm](https://aiot.aidlux.com/en/models)
29
+
30
+ ## Inference & Model Conversion
31
+
32
+ Please search model by model name in [Model Farm](https://aiot.aidlux.com/en/models)
33
+
34
+ ## License
35
+
36
+ - Source Model: [APACHE-2.0](https://github.com/google-research/big_vision/blob/main/LICENSE)
37
+
38
+ - Deployable Model: [APLUX-MODEL-FARM-LICENSE](https://aiot.aidlux.com/api/v1/files/license/model_farm_license_en.pdf)