cdtmc commited on
Commit
c315f90
·
verified ·
1 Parent(s): fa97f6e

Add exported openvino model 'openvino_model_qint8_quantized.xml'

Browse files

Hello!

*This pull request has been automatically generated from the [`export_static_quantized_openvino_model`](https://sbert.net/docs/package_reference/util.html#sentence_transformers.backend.export_static_quantized_openvino_model) function from the Sentence Transformers library.*

## Config
```python
OVQuantizationConfig(
quant_method=<OVQuantizationMethod.DEFAULT: 'default'>
)
```

## Tip:
Consider testing this pull request before merging by loading the model from this PR with the `revision` argument:
```python
from sentence_transformers import SentenceTransformer

# TODO: Fill in the PR number
pr_number = 2
model = SentenceTransformer(
"sentence-transformers/all-MiniLM-L6-v2",
revision=f"refs/pr/{pr_number}",
backend="openvino",
model_kwargs={"file_name": "openvino_model_qint8_quantized.xml"},
)

# Verify that everything works as expected
embeddings = model.encode(["The weather is lovely today.", "It's so sunny outside!", "He drove to the stadium."])
print(embeddings.shape)

similarities = model.similarity(embeddings, embeddings)
print(similarities)
```

openvino/openvino_model_qint8_quantized.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c92ea4af3c6bc7b4a0f3b3d61b147c850f4dbdd7c9e7beee0c0c70dc12da289b
3
  size 22933664
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1c150052e4a28afa9a14577e6c4c72efab8dbf777fef624f7021742562c3065c
3
  size 22933664
openvino/openvino_model_qint8_quantized.xml CHANGED
The diff for this file is too large to render. See raw diff