helenai committed
Commit b7ed62f · verified · 1 Parent(s): 19aade4

Update README.md

Files changed (1): README.md (+4 -2)
README.md CHANGED
@@ -1,4 +1,4 @@
- This is the [Deepseek-R1-Distill-Qwen-7B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B) model, converted to OpenVINO with INT4 weight compression.
+ This is the [Deepseek-R1-Distill-Qwen-7B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B) model, converted to OpenVINO with INT4 weight compression. This model is optimized for CPU and GPU. See [helenai/DeepSeek-R1-Distill-Qwen-7B-ov-int4-npu](https://huggingface.co/helenai/DeepSeek-R1-Distill-Qwen-7B-ov-int4-npu) for a version that works on NPU.

To run inference on this model, install openvino-genai (`pip install openvino-genai`) and run [llm_chat_deepseek.py](https://gist.github.com/helena-intel/554fba91f380df590ecc9245abdad33f)

@@ -13,4 +13,6 @@ python llm_chat_deepseek.py DeepSeek-R1-Distill-Qwen-7B-ov-int4 GPU
```

> [!NOTE]
- > The last line specifies the device to run inference. GPU is recommended for recent Intel laptops with integrated graphics, or for Intel discrete graphics. Change to CPU if you do not have an Intel GPU.
+ > The last line specifies the device to run inference. GPU is recommended for recent Intel laptops with integrated graphics, or for Intel discrete graphics. Change to CPU if you do not have an Intel GPU.
+
+ Gradio chatbot notebook using this model: https://gist.github.com/helena-intel/69e1c2921a2bcb618fdd7cdfb0bd0202
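
For context, the command in the README wraps the basic openvino-genai pipeline API. The snippet below is a minimal sketch of that usage, not the linked gist itself: the prompt and token limit are illustrative, and the model directory name is taken from the README command (adjust to your local path).

```python
import openvino_genai

# OpenVINO model folder (name from the README command; assumed to be in the working directory).
model_dir = "DeepSeek-R1-Distill-Qwen-7B-ov-int4"

# The second argument selects the inference device: "GPU" for Intel integrated or discrete
# graphics, "CPU" if no Intel GPU is available (see the note in the diff above).
pipe = openvino_genai.LLMPipeline(model_dir, "GPU")

# Illustrative single-turn generation; the linked llm_chat_deepseek.py gist implements a chat loop.
print(pipe.generate("Why is the sky blue?", max_new_tokens=256))
```

Swapping "GPU" for "CPU" in the pipeline constructor is the only change needed to run on systems without an Intel GPU.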