fredericowieser commited on
Commit
0878a3a
·
verified ·
1 Parent(s): d9187d0

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +79 -0
README.md ADDED
@@ -0,0 +1,79 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ datasets:
5
+ - mindchain/wikitext2
6
+ - yahma/alpaca-cleaned
7
+ metrics:
8
+ - perplexity
9
+ - accuracy
10
+ base_model:
11
+ - TinyLlama/TinyLlama_v1.1
12
+ model-index:
13
+ - name: TinyLlama_v1.1_1bit_BitDistiller
14
+ results:
15
+ - task:
16
+ type: multiple-choice
17
+ name: QA Benchmarking
18
+ dataset:
19
+ type: allenai/arc
20
+ name: ARC-Challenge
21
+ config: challenge
22
+ split: test
23
+ metrics:
24
+ - type: accuracy
25
+ name: Accuracy
26
+ value: 0.2150170648464164
27
+ - type: accuracy
28
+ name: Normalized Accuracy
29
+ value: 0.24744027303754265
30
+ - task:
31
+ type: multiple-choice
32
+ name: QA Benchmarking
33
+ dataset:
34
+ type: hellaswag
35
+ name: HellaSwag
36
+ split: test
37
+ metrics:
38
+ - type: accuracy
39
+ name: Accuracy
40
+ value: 0.2568213503286198
41
+ - type: accuracy
42
+ name: Normalized Accuracy
43
+ value: 0.253359888468433
44
+ - task:
45
+ type: multiple-choice
46
+ name: QA Benchmarking
47
+ dataset:
48
+ type: piqa
49
+ name: PIQA
50
+ split: validation
51
+ metrics:
52
+ - type: accuracy
53
+ name: Accuracy
54
+ value: 0.5282916213275299
55
+ - type: accuracy
56
+ name: Normalized Accuracy
57
+ value: 0.5027203482845702
58
+ - task:
59
+ type: multiple-choice
60
+ name: QA Benchmarking
61
+ dataset:
62
+ type: winogrande
63
+ name: Winogrande
64
+ split: test
65
+ metrics:
66
+ - type: accuracy
67
+ name: Accuracy
68
+ value: 0.5122336227308603
69
+ - task:
70
+ type: multiple-choice
71
+ name: QA Benchmarking
72
+ dataset:
73
+ type: aggregated
74
+ name: QA-Avg
75
+ metrics:
76
+ - type: accuracy
77
+ name: QA Average
78
+ value: 0.3780991480835666
79
+ ---