hawei committed on
Commit
50abae0
·
verified ·
1 Parent(s): 1549e51

Update model card with metrics

Files changed (1)
  1. README.md +71 -2
README.md CHANGED
@@ -1,2 +1,71 @@
- # Control-LLM-Llama3.1-8B-Math16
- This is a fine-tuned model of Llama-3.1-8B-Instruct for mathematical tasks on OpenMath2 dataset.
+ ---
+ license: llama3.1
+ datasets:
+ - nvidia/OpenMathInstruct-2
+ language:
+ - en
+ base_model:
+ - meta-llama/Llama-3.1-8B-Instruct
+ model-index:
+ - name: Control-LLM-Llama3.1-8B-Math16
+   results:
+   - task:
+       type: math-evaluation
+     dataset:
+       type: nvidia/OpenMathInstruct-2
+       name: OpenMathInstruct
+     metrics:
+     - name: exact_match,none
+       type: exact_match
+       value: 0.6327358367133324
+       stderr: 0.0052245703347459605
+       verified: false
+     - name: exact_match,none (gsm8k_0shot_instruct)
+       type: exact_match
+       value: 0.9052312357846853
+       stderr: 0.008067791560015407
+       verified: false
+     - name: exact_match,none (meta_math_0shot_instruct)
+       type: exact_match
+       value: 0.6276
+       stderr: 0.006837616441401548
+       verified: false
+     - name: exact_match,none (meta_math_hard_0shot_instruct)
+       type: exact_match
+       value: 0.3806646525679758
+       stderr: 0.013349170720370741
+       verified: false
+   - task:
+       type: original-capability
+     dataset:
+       type: meta/arc-dataset
+       name: Meta-ARC Dataset
+     metrics:
+     - name: exact_match,strict-match
+       type: exact_match
+       value: 0.5723263625528227
+       stderr: 0.002858377993520894
+       verified: false
+     - name: exact_match,strict-match (meta_arc_0shot_instruct)
+       type: exact_match
+       value: 0.7974248927038626
+       stderr: 0.01178043813618557
+       verified: false
+     - name: exact_match,strict-match (meta_gpqa_0shot_cot_instruct)
+       type: exact_match
+       value: 0.25223214285714285
+       stderr: 0.02054139101648797
+       verified: false
+     - name: exact_match,strict-match (meta_mmlu_0shot_instruct)
+       type: exact_match
+       value: 0.6837345107534539
+       stderr: 0.0039243761987253515
+       verified: false
+     - name: exact_match,strict-match (meta_mmlu_pro_5shot_instruct)
+       type: exact_match
+       value: 0.4324301861702128
+       stderr: 0.004516653585262379
+       verified: false
+ ---
+ # Control-LLM-Llama3.1-8B-Math16
+ This is a fine-tuned model of Llama-3.1-8B-Instruct for mathematical tasks on OpenMath2 dataset.
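
Each metric in the added metadata reports a `value` together with a `stderr`. As a rough reading aid, the sketch below turns a few of those pairs into approximate 95% intervals. It assumes that `stderr` is the standard error of the mean score and that a normal approximation is acceptable; neither assumption is stated in the commit. The helper `confidence_interval` and the constant `Z_95` are hypothetical names introduced only for this illustration.

```python
# Illustrative sketch only: not part of the model card or of any library.
# Assumes `stderr` is the standard error of the mean exact-match score and
# that a normal approximation is reasonable for these sample sizes.

Z_95 = 1.96  # two-sided 95% quantile of the standard normal distribution


def confidence_interval(value, stderr, z=Z_95):
    """Return (low, high) for value +/- z * stderr, clipped to [0, 1]."""
    half_width = z * stderr
    low = max(0.0, value - half_width)
    high = min(1.0, value + half_width)
    return low, high


# (value, stderr) pairs copied from the model-index metadata in this commit.
metrics = {
    "gsm8k_0shot_instruct": (0.9052312357846853, 0.008067791560015407),
    "meta_math_0shot_instruct": (0.6276, 0.006837616441401548),
    "meta_math_hard_0shot_instruct": (0.3806646525679758, 0.013349170720370741),
    "meta_mmlu_0shot_instruct": (0.6837345107534539, 0.0039243761987253515),
}

for name, (value, stderr) in metrics.items():
    low, high = confidence_interval(value, stderr)
    print(f"{name}: {value:.4f} (approx. 95% CI {low:.4f} to {high:.4f})")
```

The clipping to [0, 1] is a design choice for proportions such as exact match; for scores very close to 0 or 1 a Wilson interval would be a better fit than this normal approximation.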