Generate Perplexity, KLD, ARC, HellaSwag, MMLU, Truthful QA and WinoGrande scores
Browse files
scores/Dolphin3.0-Mistral-24B-Q3_K_M.tqa
CHANGED
@@ -758,3 +758,6 @@ task acc_norm
|
|
758 |
748 34.75935829
|
759 |
749 34.84646195
|
760 |
750 34.93333333
|
|
|
|
|
|
|
|
758 |
748 34.75935829
|
759 |
749 34.84646195
|
760 |
750 34.93333333
|
761 |
+
|
762 |
+
Final result: 34.9300 +/- 1.6774
|
763 |
+
Random chance: 19.5384 +/- 1.7940
|