tsunemoto commited on
Commit
eff300e
·
1 Parent(s): 7c1f00b

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +58 -0
README.md ADDED
@@ -0,0 +1,58 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ![](https://cdn-uploads.huggingface.co/production/uploads/6468ce47e134d050a58aa89c/ddzjZ1irvtLcDRCWei9vQ.png)
2
+
3
+ ##GGUF's of Seraph-7B from Weyaxi
4
+
5
+ https://huggingface.co/Weyaxi/Seraph-7B
6
+
7
+ ##Original Model Card:
8
+
9
+ Seraph-7B
10
+ This is the model for Seraph-7B. I used mergekit to merge models.
11
+
12
+ Prompt Templates
13
+ You can use these prompt templates, but I recommend using ChatML.
14
+
15
+ ChatML:
16
+ <|im_start|>system
17
+ {system}<|im_end|>
18
+ <|im_start|>user
19
+ {user}<|im_end|>
20
+ <|im_start|>assistant
21
+ {asistant}<|im_end|>
22
+
23
+ System, User, Asistant Alpaca Style:
24
+ ### System:
25
+ {system}
26
+ ### User:
27
+ {user}
28
+ ### Assistant:
29
+
30
+ Yaml Config
31
+ slices:
32
+ - sources:
33
+ - model: Weyaxi/OpenHermes-2.5-neural-chat-v3-3-Slerp
34
+ layer_range: [0, 32]
35
+ - model: Q-bert/MetaMath-Cybertron-Starling
36
+ layer_range: [0, 32]
37
+ merge_method: slerp
38
+ base_model: mistralai/Mistral-7B-v0.1
39
+ parameters:
40
+ t:
41
+ - filter: self_attn
42
+ value: [0, 0.5, 0.3, 0.7, 1]
43
+ - filter: mlp
44
+ value: [1, 0.5, 0.7, 0.3, 0]
45
+ - value: 0.5 # fallback for rest of tensors
46
+ dtype: bfloat16
47
+
48
+ Open LLM Leaderboard Evaluation Results
49
+ Detailed results can be found here
50
+
51
+ Metric Value
52
+ Avg. 71.86
53
+ ARC (25-shot) 67.83
54
+ HellaSwag (10-shot) 86.22
55
+ MMLU (5-shot) 65.07
56
+ TruthfulQA (0-shot) 59.49
57
+ Winogrande (5-shot) 80.66
58
+ GSM8K (5-shot) 71.87