abhinavnmagic commited on
Commit
90c91dc
·
verified ·
1 Parent(s): 19ea320

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -9
README.md CHANGED
@@ -14,15 +14,17 @@ Produced using [AutoFP8 with calibration samples from ultrachat](https://github.
14
 
15
  ## Evaluation
16
 
 
 
17
  ### Open LLM Leaderboard evaluation scores
18
- | | Meta-Llama-3-70B-Instruct | Meta-Llama-3-70B-Instruct-FP8<br>(this model) |
19
  | :------------------: | :----------------------: | :------------------------------------------------: |
20
- | arc-c<br>25-shot | 71.58 | 72.09 |
21
- | hellaswag<br>10-shot | 86.94 | 86.83 |
22
- | mmlu<br>5-shot | 83.97 | 84.06 |
23
- | truthfulqa<br>0-shot | 66.98 | 66.95 |
24
- | winogrande<br>5-shot | 82.79 | 83.18 |
25
- | gsm8k<br>5-shot | 87.56 | 88.93 |
26
- | **Average<br>Accuracy** | **79.97** | **80.34** |
27
- | **Recovery** | **100%** | **100.46%** |
28
 
 
14
 
15
  ## Evaluation
16
 
17
+ 74.53666667 69.19 82.49 70.61 65.73 82.63 76.57
18
+
19
  ### Open LLM Leaderboard evaluation scores
20
+ | | Mixtral-8x22B-Instruct-v0.1 | Mixtral-8x22B-Instruct-v0.1-FP8<br>(this model) |
21
  | :------------------: | :----------------------: | :------------------------------------------------: |
22
+ | arc-c<br>25-shot | 72.70 | 69.19 |
23
+ | hellaswag<br>10-shot | 89.08 | 82.49 |
24
+ | mmlu<br>5-shot | 77.77 | 70.61 |
25
+ | truthfulqa<br>0-shot | 68.14 | 65.73 |
26
+ | winogrande<br>5-shot | 85.16 | 82.63 |
27
+ | gsm8k<br>5-shot | 82.03 | 76.57 |
28
+ | **Average<br>Accuracy** | **79.15** | **74.53** |
29
+ | **Recovery** | **100%** | **94.17%** |
30