prompt_template: |
  <|im_start|>user
  {prompt}<|im_end|>
  <|im_start|>assistant
quantized_by: mgoin
tags:
- deepsparse
---

# OpenHermes 2.5 Mistral 7B - DeepSparse

This repo contains model files for [Teknium's OpenHermes 2.5 Mistral 7B](https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B), optimized for [DeepSparse](https://github.com/neuralmagic/deepsparse), a sparsity-aware CPU inference runtime.

This model was quantized and pruned with [SparseGPT](https://arxiv.org/abs/2301.00774), using [SparseML](https://github.com/neuralmagic/sparseml).

## Inference

Install [DeepSparse LLM](https://github.com/neuralmagic/deepsparse):

```
pip install deepsparse-nightly[llm]
```
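The `prompt_template` in the front matter above is ChatML: user turns are wrapped in `<|im_start|>`/`<|im_end|>` markers before being handed to the pipeline. A minimal sketch of filling that template with plain Python (the example question is illustrative, not from this card):

```python
# Fill the ChatML prompt template from the front matter above.
# The question below is an illustrative placeholder.
prompt = "How do I make banana bread?"

formatted_prompt = (
    "<|im_start|>user\n"
    f"{prompt}<|im_end|>\n"
    "<|im_start|>assistant\n"
)

print(formatted_prompt)
```

Ending the string after `<|im_start|>assistant` leaves the model to generate the assistant turn.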

Run in a [Python pipeline](https://github.com/neuralmagic/deepsparse/blob/main/docs/llms/text-generation-pipeline.md):

```python
from deepsparse import TextGeneration
system_message = ""