|
--- |
|
license: apache-2.0 |
|
tags: |
|
- uncensored |
|
--- |
|
This is an experimental model. |
|
|
|
Finetuned on dataset [toxic-dpo-v0.1-NoWarning-alpaca](https://huggingface.co/datasets/diffnamehard/toxic-dpo-v0.1-NoWarning-alpaca) using model [Mistral-CatMacaroni-slerp-7B](https://huggingface.co/diffnamehard/Mistral-CatMacaroni-slerp-7B) |
|
|
|
| Metric | Value | |
|
| --- | --- | |
|
| Avg. | 67.28 | |
|
| ARC (25-shot) | 64.25 | |
|
| HellaSwag (10-shot) | 84.09 | |
|
| MMLU (5-shot) | 62.66 | |
|
| TruthfulQA (0-shot) | 56.87 | |
|
| Winogrande (5-shot) | 79.72 | |
|
| GSM8K (5-shot) | 56.1 | |
|
|