Adding Evaluation Results

#1
by T145 - opened
Files changed (1)
  1. README.md +114 -1
README.md CHANGED
@@ -4,6 +4,105 @@ tags:
  - mergekit
  - merge
  base_model: []
  ---

  <h2>L3.1-Dark-Planet-SpinFire-Uncensored-8B</h2>
@@ -49,4 +148,18 @@ For full information about this model, including:

  Please go to:

- [ https://huggingface.co/DavidAU/L3.1-Dark-Planet-SpinFire-Uncensored-8B-gguf ]
  - mergekit
  - merge
  base_model: []
+ model-index:
+ - name: L3.1-Dark-Planet-SpinFire-Uncensored-8B
+   results:
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: IFEval (0-Shot)
+       type: wis-k/instruction-following-eval
+       split: train
+       args:
+         num_few_shot: 0
+     metrics:
+     - type: inst_level_strict_acc and prompt_level_strict_acc
+       value: 70.43
+       name: averaged accuracy
+     source:
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?search=DavidAU%2FL3.1-Dark-Planet-SpinFire-Uncensored-8B
+       name: Open LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: BBH (3-Shot)
+       type: SaylorTwift/bbh
+       split: test
+       args:
+         num_few_shot: 3
+     metrics:
+     - type: acc_norm
+       value: 32.46
+       name: normalized accuracy
+     source:
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?search=DavidAU%2FL3.1-Dark-Planet-SpinFire-Uncensored-8B
+       name: Open LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: MATH Lvl 5 (4-Shot)
+       type: lighteval/MATH-Hard
+       split: test
+       args:
+         num_few_shot: 4
+     metrics:
+     - type: exact_match
+       value: 9.29
+       name: exact match
+     source:
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?search=DavidAU%2FL3.1-Dark-Planet-SpinFire-Uncensored-8B
+       name: Open LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: GPQA (0-shot)
+       type: Idavidrein/gpqa
+       split: train
+       args:
+         num_few_shot: 0
+     metrics:
+     - type: acc_norm
+       value: 3.91
+       name: acc_norm
+     source:
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?search=DavidAU%2FL3.1-Dark-Planet-SpinFire-Uncensored-8B
+       name: Open LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: MuSR (0-shot)
+       type: TAUR-Lab/MuSR
+       args:
+         num_few_shot: 0
+     metrics:
+     - type: acc_norm
+       value: 2.5
+       name: acc_norm
+     source:
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?search=DavidAU%2FL3.1-Dark-Planet-SpinFire-Uncensored-8B
+       name: Open LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: MMLU-PRO (5-shot)
+       type: TIGER-Lab/MMLU-Pro
+       config: main
+       split: test
+       args:
+         num_few_shot: 5
+     metrics:
+     - type: acc
+       value: 29.67
+       name: accuracy
+     source:
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?search=DavidAU%2FL3.1-Dark-Planet-SpinFire-Uncensored-8B
+       name: Open LLM Leaderboard
  ---

  <h2>L3.1-Dark-Planet-SpinFire-Uncensored-8B</h2>
 

  Please go to:

+ [ https://huggingface.co/DavidAU/L3.1-Dark-Planet-SpinFire-Uncensored-8B-gguf ]
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/DavidAU__L3.1-Dark-Planet-SpinFire-Uncensored-8B-details)!
+ Summarized results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/contents/viewer/default/train?q=DavidAU%2FL3.1-Dark-Planet-SpinFire-Uncensored-8B&sort[column]=Average%20%E2%AC%86%EF%B8%8F&sort[direction]=desc)!
+
+ |      Metric       |Value (%)|
+ |-------------------|--------:|
+ |**Average**        |    24.71|
+ |IFEval (0-Shot)    |    70.43|
+ |BBH (3-Shot)       |    32.46|
+ |MATH Lvl 5 (4-Shot)|     9.29|
+ |GPQA (0-shot)      |     3.91|
+ |MuSR (0-shot)      |     2.50|
+ |MMLU-PRO (5-shot)  |    29.67|
+
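The **Average** row in the added table can be sanity-checked against the six benchmark rows. The sketch below assumes the average is a plain unweighted mean of those six values, which the PR does not state explicitly; the function name `leaderboard_average` is hypothetical and used only for illustration.

```python
# Scores copied from the results table added in this PR (values in percent).
scores = {
    "IFEval (0-Shot)": 70.43,
    "BBH (3-Shot)": 32.46,
    "MATH Lvl 5 (4-Shot)": 9.29,
    "GPQA (0-shot)": 3.91,
    "MuSR (0-shot)": 2.50,
    "MMLU-PRO (5-shot)": 29.67,
}


def leaderboard_average(results: dict[str, float]) -> float:
    """Unweighted mean of the benchmark scores, rounded to two decimals.

    Assumption: the table's Average row is a simple mean of its other rows.
    """
    return round(sum(results.values()) / len(results), 2)


average = leaderboard_average(scores)
print(f"Average: {average:.2f}")
```

Under that assumption the computed mean agrees with the 24.71 reported in the table, so the summary row and the per-benchmark `value` fields in the `model-index` metadata are consistent with each other.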