cicdatopea commited on
Commit
ab0561e
·
verified ·
1 Parent(s): c10c898

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -2
README.md CHANGED
@@ -7,6 +7,7 @@ base_model:
7
  ## Model Details
8
 
9
  This model is an int4 model with group_size 64 and symmetric quantization of [deepseek-ai/DeepSeek-V2-Lite](https://huggingface.co/deepseek-ai/DeepSeek-V2-Lite) generated by [intel/auto-round](https://github.com/intel/auto-round). Please follow the license of the origin model
 
10
 
11
  ### INT4 Inference(CPU/HPU/CUDA)
12
 
@@ -22,7 +23,9 @@ tokenizer = AutoTokenizer.from_pretrained(quantized_model_dir, trust_remote_code
22
  model = AutoModelForCausalLM.from_pretrained(quantized_model_dir,
23
  trust_remote_code=True,
24
  torch_dtype=torch.float16,
25
- device_map="auto")
 
 
26
  model.generation_config = GenerationConfig.from_pretrained(quantized_model_dir)
27
  model.generation_config.pad_token_id = model.generation_config.eos_token_id
28
  prompt = "There is a girl who likes adventure,"
@@ -134,7 +137,7 @@ Determined to save his people and restore the kingdom to its former glory, Princ
134
  pip3 install lm-eval==0.4.5.
135
 
136
  ```bash
137
- auto-round --model "OPEA/internlm2_5-7b-chat-int4-sym-inc" --eval --eval_bs 16 --tasks leaderboard_ifeval,leaderboard_mmlu_pro,gsm8k,lambada_openai,hellaswag,piqa,winogrande,truthfulqa_mc1,openbookqa,boolq,arc_easy,arc_challenge,cmmlu,ceval-valid
138
  ```
139
 
140
  | Metric | BF16 | INT4 |
 
7
  ## Model Details
8
 
9
  This model is an int4 model with group_size 64 and symmetric quantization of [deepseek-ai/DeepSeek-V2-Lite](https://huggingface.co/deepseek-ai/DeepSeek-V2-Lite) generated by [intel/auto-round](https://github.com/intel/auto-round). Please follow the license of the origin model
10
+ Load the model with revision="90bb8ef" to use AutoGPTQ format.
11
 
12
  ### INT4 Inference(CPU/HPU/CUDA)
13
 
 
23
  model = AutoModelForCausalLM.from_pretrained(quantized_model_dir,
24
  trust_remote_code=True,
25
  torch_dtype=torch.float16,
26
+ device_map="auto",
27
+ ##revision="90bb8ef" ## autogptq format
28
+ )
29
  model.generation_config = GenerationConfig.from_pretrained(quantized_model_dir)
30
  model.generation_config.pad_token_id = model.generation_config.eos_token_id
31
  prompt = "There is a girl who likes adventure,"
 
137
  pip3 install lm-eval==0.4.5.
138
 
139
  ```bash
140
+ auto-round --model "OPEA/DeepSeek-V2-Lite-int4-sym-inc" --eval --eval_bs 16 --tasks leaderboard_ifeval,leaderboard_mmlu_pro,gsm8k,lambada_openai,hellaswag,piqa,winogrande,truthfulqa_mc1,openbookqa,boolq,arc_easy,arc_challenge,cmmlu,ceval-valid
141
  ```
142
 
143
  | Metric | BF16 | INT4 |