OPEA
/

Llama-3.3-70B-Instruct-int2-sym-inc

Safetensors

llama

2-bit

intel/auto-round

Model card Files Files and versions Community

cicdatopea commited on Feb 8

Commit

e881b39

verified ·

1 Parent(s): cea9de7

update accuracy

Browse files

Files changed (1) hide show

README.md +17 -18

README.md CHANGED Viewed

@@ -103,25 +103,24 @@ pip3 install lm-eval==0.4.7
 we found lm-eval is very unstable for this model. Please set `add_bos_token=True `to align with the origin model. Please use autogptq format
 ```bash
-lm-eval --model hf --model_args pretrained=OPEA/Llama-3.3-70B-Instruct-int2-sym-inc,add_bos_token=True   --tasks mmlu --batch_size 16
 ```
-|           Metric           |   BF16(lm-eval==0.4.5)   | W2G32 With BOS |      WO BOS       |
-| :------------------------: | :----------------------: | -------------- | :---------------: |
-|            avg             |          0.7023          |                |                   |
-| leaderboard_mmlu_pro 5shot |          0.5484          |                |      0.4384       |
-|            mmlu            |          0.8195          | 0.7606         |      0.7142       |
-|       lambada_openai       |          0.7528          | 0.7413         |      0.7013       |
-|         hellaswag          |          0.6575          |                |      0.5576       |
-|         winogrande         |          0.7869          |                |      0.7080       |
-|            piqa            |          0.8303          |                |      0.7797       |
-|       truthfulqa_mc1       |          0.4284          |                |      0.3586       |
-|         openbookqa         |          0.3720          |                |      0.3000       |
-|           boolq            |          0.8865          |                |      0.8339       |
-|          arc_easy          |          0.8624          |                |      0.8064       |
-|       arc_challenge        |          0.6109          |                |      0.5188       |
-|     leaderboard_ifeval     | 0.6661=(0.7110+0.6211)/2 |                | (0.5959+0.4603)/2 |
-| gsm8k(5shot) strict match  |          0.9083          |                |                   |
 ## Generate the model

 we found lm-eval is very unstable for this model. Please set `add_bos_token=True `to align with the origin model. Please use autogptq format
 ```bash
+lm-eval --model hf --model_args pretrained=OPEA/Llama-3.3-70B-Instruct-int3-sym-inc,add_bos_token=True   --tasks leaderboard_mmlu_pro,leaderboard_ifeval,lambada_openai,hellaswag,piqa,winogrande,truthfulqa_mc1,openbookqa,boolq,arc_easy,arc_challenge,mmlu,gsm8k --batch_size 16
 ```
+|           Metric           |   BF16(lm-eval==0.4.5)   | W2G32 With BOS            | BF16(lm-eval==0.4.7 with BOS) |      WO BOS       |
+| :------------------------: | :----------------------: | ------------------------- | ----------------------------- | :---------------: |
+|            avg             |          0.7023          | 0.6606                    |                               |                   |
+| leaderboard_mmlu_pro 5shot |          0.5484          | 0.4461                    |                               |      0.4384       |
+|            mmlu            |          0.8195          | 0.7606                    | 0.8229                        |      0.7142       |
+|       lambada_openai       |          0.7528          | 0.7413                    |                               |      0.7013       |
+|         hellaswag          |          0.6575          | 0.6056                    |                               |      0.5576       |
+|         winogrande         |          0.7869          | 0.7727                    |                               |      0.7080       |
+|            piqa            |          0.8303          | 0.8069                    |                               |      0.7797       |
+|       truthfulqa_mc1       |          0.4284          | 0.3647                    |                               |      0.3586       |
+|         openbookqa         |          0.3720          | 0.3540                    |                               |      0.3000       |
+|           boolq            |          0.8865          | 0.8716                    |                               |      0.8339       |
+|          arc_easy          |          0.8624          | 0.8367                    |                               |      0.8064       |
+|     leaderboard_ifeval     | 0.6661=(0.7110+0.6211)/2 | 0.61235=(0.6739+0.5508)/2 |                               | (0.5959+0.4603)/2 |
+|       arc_challenge        |          0.6109          | 0.5580                    |                               |      0.5188       |
+| gsm8k(5shot) strict match  |          0.9083          | 0.8575                    |                               |                   |
 ## Generate the model