sleepdeprived3 commited on
Commit
bf192cd
·
verified ·
1 Parent(s): 045ca5e

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +51 -0
README.md ADDED
@@ -0,0 +1,51 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - TheDrummer/UnslopSmall-22B-v1
4
+ - ReadyArt/Forgotten-Safeword-v1.1-22B
5
+ library_name: transformers
6
+ tags:
7
+ - mergekit
8
+ - merge
9
+
10
+ ---
11
+ # merge
12
+
13
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
14
+
15
+ ## Merge Details
16
+ ### Merge Method
17
+
18
+ This model was merged using the [DARE TIES](https://arxiv.org/abs/2311.03099) merge method using [ReadyArt/Forgotten-Safeword-v1.1-22B](https://huggingface.co/ReadyArt/Forgotten-Safeword-v1.1-22B) as a base.
19
+
20
+ ### Models Merged
21
+
22
+ The following models were included in the merge:
23
+ * [TheDrummer/UnslopSmall-22B-v1](https://huggingface.co/TheDrummer/UnslopSmall-22B-v1)
24
+
25
+ ### Configuration
26
+
27
+ The following YAML configuration was used to produce this model:
28
+
29
+ ```yaml
30
+ merge_method: dare_ties
31
+ base_model: ReadyArt/Forgotten-Safeword-v1.1-22B
32
+ models:
33
+ - model: ReadyArt/Forgotten-Safeword-v1.1-22B
34
+ parameters:
35
+ weight: 0.7
36
+ density: 0.8
37
+ - model: TheDrummer/UnslopSmall-22B-v1
38
+ parameters:
39
+ weight: 0.3
40
+ density: 0.4
41
+ parameters:
42
+ int8_mask: true
43
+ normalize: true
44
+ lambda: 0.85
45
+ filter:
46
+ - pattern: "model.layers.*.self_attn"
47
+ value: -0.15 # Direct numeric value instead of parameters dict
48
+ dtype: bfloat16
49
+ tokenizer_source: union
50
+
51
+ ```