aashish1904 commited on
Commit
2a3d464
·
verified ·
1 Parent(s): 3449839

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +74 -0
README.md ADDED
@@ -0,0 +1,74 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+
4
+ base_model:
5
+ - TheDrummer/Rocinante-12B-v1.1
6
+ - ArliAI/Mistral-Nemo-12B-ArliAI-RPMax-v1.1
7
+ library_name: transformers
8
+ tags:
9
+ - mergekit
10
+ - merge
11
+
12
+
13
+ ---
14
+
15
+ [![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory)
16
+
17
+
18
+ # QuantFactory/Roci_Maxx_v2-GGUF
19
+ This is quantized version of [mergekit-community/Roci_Maxx_v2](https://huggingface.co/mergekit-community/Roci_Maxx_v2) created using llama.cpp
20
+
21
+ # Original Model Card
22
+
23
+ # merge
24
+
25
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
26
+
27
+ ## Merge Details
28
+ ### Merge Method
29
+
30
+ This model was merged using the SLERP merge method.
31
+
32
+ ### Models Merged
33
+
34
+ The following models were included in the merge:
35
+ * [TheDrummer/Rocinante-12B-v1.1](https://huggingface.co/TheDrummer/Rocinante-12B-v1.1)
36
+ * [ArliAI/Mistral-Nemo-12B-ArliAI-RPMax-v1.1](https://huggingface.co/ArliAI/Mistral-Nemo-12B-ArliAI-RPMax-v1.1)
37
+
38
+ ### Configuration
39
+
40
+ The following YAML configuration was used to produce this model:
41
+
42
+ ```yaml
43
+ slices:
44
+ - sources:
45
+ - model: TheDrummer/Rocinante-12B-v1.1
46
+ layer_range:
47
+ - 0
48
+ - 33
49
+ - model: ArliAI/Mistral-Nemo-12B-ArliAI-RPMax-v1.1
50
+ layer_range:
51
+ - 0
52
+ - 33
53
+ merge_method: slerp
54
+ base_model: TheDrummer/Rocinante-12B-v1.1
55
+ parameters:
56
+ t:
57
+ - filter: self_attn
58
+ value:
59
+ - 0
60
+ - 0.5
61
+ - 0.3
62
+ - 0.7
63
+ - 1
64
+ - filter: mlp
65
+ value:
66
+ - 1
67
+ - 0.5
68
+ - 0.7
69
+ - 0.3
70
+ - 0
71
+ - value: 0.5
72
+ dtype: bfloat16
73
+ ```
74
+