Update README.md
README.md
CHANGED
@@ -6,9 +6,9 @@ language:
 - en
 ---
 
-# gte-
+# gte-large-large
 
-This is the quantized (INT8) ONNX variant of the [gte-large](https://huggingface.co/thenlper/gte-
+This is the quantized (INT8) ONNX variant of the [gte-large](https://huggingface.co/thenlper/gte-large) embeddings model created with [DeepSparse Optimum](https://github.com/neuralmagic/optimum-deepsparse) for ONNX export/inference and Neural Magic's [Sparsify](https://github.com/neuralmagic/sparsify) for one-shot quantization.
 
 Current list of sparse and quantized gte ONNX models:
 