AXERA-TECH/DeepSeek-R1-Distill-Qwen-1.5B-GPTQ-Int4

qqc1989 committed 9 days ago

Commit 7cea081 · verified · 1 Parent(s): 936ce0f

Update README.md

Files changed (1)

README.md +11 -1

@@ -11,7 +11,10 @@ This model has been optimized with the following LoRA:
 Compatible with Pulsar2 version: 3.4(Not released yet)
-## Useful links:
+## Convert tools links:
+For those who are interested in model conversion, you can try to export axmodel through the original repo : https://huggingface.co/jakiAJK/DeepSeek-R1-Distill-Qwen-1.5B_GPTQ-int4
 [Pulsar2 Link, How to Convert LLM from Huggingface to axmodel](https://pulsar2-docs.readthedocs.io/en/latest/appendix/build_llm.html)
 [AXera NPU LLM Runtime](https://github.com/AXERA-TECH/ax-llm)
@@ -28,3 +31,10 @@ Compatible with Pulsar2 version: 3.4(Not released yet)
 |Chips|w8a16|w4a16|
 |--|--|--|
 |AX650| 11 tokens/sec|19 tokens/sec|
+## How to use
+### AX650 Host
+### AX650 M.2