zeroshot/gte-large-quant

Feature Extraction

sparse sparsity quantized onnx embeddings int8

text-embeddings-inference


zeroshot committed on Oct 15, 2023

Commit 00f763a · 1 Parent(s): 37916da

Update README.md

Files changed (1)

README.md +2 -2


@@ -6,9 +6,9 @@ language:
 - en
 ---
-# gte-base-large
-This is the quantized (INT8) ONNX variant of the [gte-large](https://huggingface.co/thenlper/gte-base) embeddings model created with [DeepSparse Optimum](https://github.com/neuralmagic/optimum-deepsparse) for ONNX export/inference and Neural Magic's [Sparsify](https://github.com/neuralmagic/sparsify) for one-shot quantization.
+# gte-large-large
+This is the quantized (INT8) ONNX variant of the [gte-large](https://huggingface.co/thenlper/gte-large) embeddings model created with [DeepSparse Optimum](https://github.com/neuralmagic/optimum-deepsparse) for ONNX export/inference and Neural Magic's [Sparsify](https://github.com/neuralmagic/sparsify) for one-shot quantization.
 Current list of sparse and quantized gte ONNX models: