Update README.md

README.md CHANGED

@@ -25,13 +25,13 @@ This instruction model was built via parameter-efficient QLoRA finetuning of [Ll
 
 | Metric                | Value |
 |-----------------------|-------|
-| MMLU (5-shot)         |
-| ARC (25-shot)         |
-| HellaSwag (10-shot)   |
-| TruthfulQA (0-shot)   |
-| Avg.                  |
+| MMLU (5-shot)         | 46.63 |
+| ARC (25-shot)         | 51.19 |
+| HellaSwag (10-shot)   | 78.92 |
+| TruthfulQA (0-shot)   | 48.5  |
+| Avg.                  | 56.31 |
 
-We use
+We use the [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) to run the benchmark tests above, using the same version as Hugging Face's [Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard).
 
 ## Helpful links
 