Update README.md
Browse files
README.md
CHANGED
@@ -635,16 +635,16 @@ This is the quantized (INT8) ONNX variant of the [bge-small-en-v1.5](https://hug
|
|
635 |
|
636 |
Current list of sparse and quantized bge ONNX models:
|
637 |
|
638 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
639 |
|
640 |
-
[zeroshot/bge-large-en-v1.5-quant](https://huggingface.co/zeroshot/bge-large-en-v1.5-quant)
|
641 |
|
642 |
-
[
|
643 |
|
644 |
-
[
|
645 |
-
|
646 |
-
[zeroshot/bge-small-en-v1.5-sparse](https://huggingface.co/zeroshot/bge-small-en-v1.5-sparse)
|
647 |
-
|
648 |
-
[zeroshot/bge-small-en-v1.5-quant](https://huggingface.co/zeroshot/bge-small-en-v1.5-quant)
|
649 |
-
|
650 |
-
For general questions on these models and sparsification methods, reach out to the engineering team on our [community Slack](https://join.slack.com/t/discuss-neuralmagic/shared_invite/zt-q1a1cnvo-YBoICSIw3L1dmQpjBeDurQ).
|
|
|
635 |
|
636 |
Current list of sparse and quantized bge ONNX models:
|
637 |
|
638 |
+
| Links | Sparsification Method |
|
639 |
+
| --------------------------------------------------------------------------------------------------- | ---------------------- |
|
640 |
+
| [zeroshot/bge-large-en-v1.5-sparse](https://huggingface.co/zeroshot/bge-large-en-v1.5-sparse) | Quantization (INT8) & 50% Pruning |
|
641 |
+
| [zeroshot/bge-large-en-v1.5-quant](https://huggingface.co/zeroshot/bge-large-en-v1.5-quant) | Quantization (INT8) |
|
642 |
+
| [zeroshot/bge-base-en-v1.5-sparse](https://huggingface.co/zeroshot/bge-base-en-v1.5-sparse) | Quantization (INT8) & 50% Pruning |
|
643 |
+
| [zeroshot/bge-base-en-v1.5-quant](https://huggingface.co/zeroshot/bge-base-en-v1.5-quant) | Quantization (INT8) |
|
644 |
+
| [zeroshot/bge-small-en-v1.5-sparse](https://huggingface.co/zeroshot/bge-small-en-v1.5-sparse) | Quantization (INT8) & 50% Pruning |
|
645 |
+
| [zeroshot/bge-small-en-v1.5-quant](https://huggingface.co/zeroshot/bge-small-en-v1.5-quant) | Quantization (INT8) |
|
646 |
|
|
|
647 |
|
648 |
+
For general questions on these models and sparsification methods, reach out to the engineering team on our [community Slack](https://join.slack.com/t/discuss-neuralmagic/shared_invite/zt-q1a1cnvo-YBoICSIw3L1dmQpjBeDurQ).
|
649 |
|
650 |
+

|
|
|
|
|
|
|
|
|
|
|
|