Update README.md
README.md
CHANGED
@@ -7,6 +7,7 @@ base_model:
 ## Model Details
 
 This model is an int4 model with group_size 64 and symmetric quantization of [deepseek-ai/DeepSeek-V2-Lite](https://huggingface.co/deepseek-ai/DeepSeek-V2-Lite) generated by [intel/auto-round](https://github.com/intel/auto-round). Please follow the license of the origin model
+Load the model with revision="90bb8ef" to use AutoGPTQ format.
 
 ### INT4 Inference(CPU/HPU/CUDA)
 
@@ -22,7 +23,9 @@ tokenizer = AutoTokenizer.from_pretrained(quantized_model_dir, trust_remote_code
 model = AutoModelForCausalLM.from_pretrained(quantized_model_dir,
 trust_remote_code=True,
 torch_dtype=torch.float16,
-device_map="auto"
+device_map="auto",
+##revision="90bb8ef" ## autogptq format
+)
 model.generation_config = GenerationConfig.from_pretrained(quantized_model_dir)
 model.generation_config.pad_token_id = model.generation_config.eos_token_id
 prompt = "There is a girl who likes adventure,"
@@ -134,7 +137,7 @@ Determined to save his people and restore the kingdom to its former glory, Princ
 pip3 install lm-eval==0.4.5.
 
 ```bash
-auto-round --model "OPEA/
+auto-round --model "OPEA/DeepSeek-V2-Lite-int4-sym-inc" --eval --eval_bs 16 --tasks leaderboard_ifeval,leaderboard_mmlu_pro,gsm8k,lambada_openai,hellaswag,piqa,winogrande,truthfulqa_mc1,openbookqa,boolq,arc_easy,arc_challenge,cmmlu,ceval-valid
 ```
 
 | Metric | BF16 | INT4 |
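
The README change leaves the `revision` argument commented out, so the default branch is loaded unless the reader edits the call. A minimal sketch of making that choice explicit follows. It assumes `quantized_model_dir` is the repository ID used in the evaluation command, which the diff does not confirm. `build_load_kwargs` and `load_model` are hypothetical helpers, not part of the README.

```python
from typing import Any, Dict, Optional

# Assumption: the repository ID from the evaluation command is also the value
# of `quantized_model_dir` in the README snippet.
MODEL_ID = "OPEA/DeepSeek-V2-Lite-int4-sym-inc"
AUTOGPTQ_REVISION = "90bb8ef"  # revision named in the README change


def build_load_kwargs(revision: Optional[str] = None) -> Dict[str, Any]:
    """Return kwargs shared by the tokenizer and model loaders (hypothetical helper)."""
    kwargs: Dict[str, Any] = {"trust_remote_code": True}
    if revision is not None:
        # Pin a specific commit or branch, e.g. the AutoGPTQ-format revision.
        kwargs["revision"] = revision
    return kwargs


def load_model(revision: Optional[str] = None):
    """Load tokenizer and model. This downloads weights, so call it deliberately."""
    # Imported lazily so the kwargs helper can be used without these packages.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    kwargs = build_load_kwargs(revision)
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID, **kwargs)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.float16,
        device_map="auto",
        **kwargs,
    )
    return tokenizer, model
```

Calling `load_model()` would load the default branch, while `load_model(AUTOGPTQ_REVISION)` would request the revision the README associates with the AutoGPTQ format.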