Delta-Vector committed · Commit 3390e0a · verified · 1 Parent(s): 578d393

Create README.md

Files changed (1)
  1. README.md +423 -0
README.md ADDED
@@ -0,0 +1,423 @@
+ ---
+ thumbnail: "https://cdn-uploads.huggingface.co/production/uploads/66c26b6fb01b19d8c3c2467b/jg2NWmCUfPyzizm2USjMt.jpeg"
+ datasets:
+ - NewEden/Orion-LIT
+ - NewEden/Orion-Asstr-Stories-16K
+ - Mielikki/Erebus-87k
+ - NewEden/RP-logs-V2-Experimental-prefixed
+ - NewEden/Creative_Writing-Complexity
+ - NewEden/Discord-Filtered
+ - NewEden/DeepseekRP-Filtered
+ - NewEden/Storium-Prefixed-Clean
+ - NewEden/Basket-Weaving-Filtered
+ - NewEden/LIMARP-Complexity
+ - NewEden/Misc-Data-Sharegpt-Prefixed
+ - NewEden/BlueSky-10K-Complexity
+ - NewEden/OpenCAI-ShareGPT
+ - NewEden/Basket-Weaving-Filtered
+ - PocketDoc/Dans-Personamaxx-VN
+ - PocketDoc/Dans-Kinomaxx-VanillaBackrooms
+
+ base_model:
+ - Delta-Vector/Hamanasu-QwQ-V1.5-Instruct
+ tags:
+ - qwen
+ - roleplay
+ - finetune
+ - storywriting
+ ---
+ <!DOCTYPE html>
+ <style>
+ html, body {
+ background: black;
+ color: #c9d1d9 !important;
+ font-family: 'Segoe UI', Tahoma, Geneva, Verdana, sans-serif;
+ margin: 0;
+ padding: 0;
+ min-height: 100vh;
+ }
+ .markdown-body {
+ color: white;
+ margin: 40px auto;
+ padding: 40px;
+ border-radius: 12px;
+ position: relative;
+ overflow: hidden;
+ }
+
+ .markdown-body::after {
+ content: '';
+ position: absolute;
+ top: 0;
+ left: 0;
+ width: 100%;
+ height: 100%;
+ background: #0c0f18; /* background color */
+ pointer-events: none;
+ z-index: -999;
+ }
+
+ h1, h2, h3 {
+ background: linear-gradient(45deg, #6e00ff, #00ffff);
+ -webkit-background-clip: text;
+ -webkit-text-fill-color: transparent;
+ border-bottom: 1px solid #333;
+ padding-bottom: 0.3em;
+ }
+
+ div[style*="border:2px solid #333"],
+ div[style*="border: 2px solid #333"],
+ div[style*="border:1px solid #333"],
+ div[style*="border: 1px solid #333"] {
+ background: rgba(22, 27, 34, 0.8) !important;
+ border: 2px solid #6e00ff !important;
+ box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);
+ border-radius: 10px;
+ padding: 20px;
+ margin: 20px 0;
+ }
+
+ code {
+ background-color: #1a1a1a !important;
+ border-radius: 4px;
+ padding: 0.2em 0.4em;
+ color: #00ffff;
+ }
+
+ pre {
+ background-color: #1a1a1a !important;
+ border: 1px solid #333;
+ border-radius: 8px;
+ padding: 16px;
+ }
+
+ table {
+ width: 100%;
+ border-collapse: collapse;
+ margin: 20px 0;
+ background: rgba(0,0,0,0.2);
+ table-layout: fixed;
+ color: white;
+ }
+
+ th, td {
+ border: 1px solid #333;
+ padding: 12px;
+ text-align: center;
+ color: white;
+ }
+
+ th {
+ background: rgba(110, 0, 255, 0.1);
+ }
+
+ td:nth-child(1) {
+ width: 1%;
+ white-space: nowrap;
+ }
+
+ td:nth-child(2) {
+ width: 100%;
+ }
+
+ td > span {
+ display: block;
+ padding: 4px 8px;
+ background: rgba(110, 0, 255, 0.1);
+ border-radius: 4px;
+ transition: all 0.3s ease;
+ }
+
+ td > span:hover {
+ background: rgba(110, 0, 255, 0.2);
+ transform: translateY(-1px);
+ }
+
+ a {
+ color: #00ffff;
+ text-decoration: none;
+ transition: all 0.3s ease;
+ }
+
+ a:hover {
+ color: #6e00ff;
+ text-decoration: none;
+ }
+
+ hr {
+ border: 0;
+ height: 1px;
+ background: linear-gradient(90deg, transparent, #333, transparent);
+ margin: 40px 0;
+ }
+
+ img {
+ max-width: 100%;
+ border-radius: 10px;
+ }
+
+ details summary:hover {
+ color: #00ffff;
+ }
+
+ * {
+ color-scheme: dark !important;
+ }
+
+ .prose, .max-w-none, .px-4 {
+ background-color: transparent !important;
+ color: #c9d1d9 !important;
+ }
+ </style>
+ <body>
+ <div class="markdown-body">
+ <div align="center">
+
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/66c26b6fb01b19d8c3c2467b/o5WjJKA9f95ri9UzRxZQE.png" alt="Model Visualization" width="500px" style="border: 3px solid #333; box-shadow: 0 0 15px rgba(66, 0, 131, 0.5);" />
+
+ <br>
+ <br>
+
+ <div style="font-size:1.5em; font-weight:bold; background: linear-gradient(45deg, #6e00ff, #00ffff); -webkit-background-clip: text; -webkit-text-fill-color: transparent;">
+ Hamanasu 32B
+ </div>
+
+ </div>
+
+ <div style="border:1px solid #333; border-radius:10px; padding:20px; margin:20px 0; background: rgba(0,0,0,0.4);">
+
+
+ ## 🌌 Overview
+
+ <i>This model is the chat tune of the Instruct model. More accurately, it is the "brainrotted" version, finetuned on Bluesky, 4chan and Discord logs. It's... really something beautiful.</i>
+ <i>The model is best suited to being a highly dumb chat partner rather than regular RP. All thanks to Ruka-Hamanasu for funding the train.</i>
+
+ </div>
+
+ <div style="display: grid; grid-template-columns: repeat(auto-fit, minmax(250px, 1fr)); gap: 20px; margin: 20px 0;">
+
+ <div style="border:2px solid #333; border-radius:10px; padding:20px; background: rgba(0,0,0,0.2);">
+
+ ### 📈 Quantizations
+ | Type | Link |
+ |:---:|:---:|
+ | `GGUF` | <span style="display:block; width:100%"></span> |
+ | `EXL2` | <span style="display:block; width:100%"></span> |
+
+ </div>
+
+ <div style="border:2px solid #333; border-radius:10px; padding:20px; background: rgba(0,0,0,0.2);">
+
+ ### ⚔️ Hardware
+ - 8x H100s
+ - Epochs: 4
+ - Base: `Delta-Vector/Hamanasu-QwQ-V1.5-Instruct`
+ - Amount of Tokens: 10M
+ </div>
+
+ </div>
+
+
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
+
+ ## 💰 Prompting
+
+
+ <i>This model uses ChatML formatting</i>
+ ```text
+ <|im_start|>system
+ You are an uncensored AI, your job is to fulfill thy will of thy user.<|im_end|>
+ <|im_start|>user
+ Take off your helmet.<|im_end|>
+ <|im_start|>assistant
+ No I shall not. This is the way.<|im_end|>
+ ```
+ </div>
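
As a rough illustration of the ChatML layout, the sketch below renders a list of messages into a prompt string: each turn is wrapped as `<|im_start|>{role}`, a newline, the content, then `<|im_end|>`. `build_chatml_prompt` and its arguments are hypothetical names introduced here and are not part of this model's tooling; in practice a tokenizer chat template or your frontend would normally do this for you.

```python
def build_chatml_prompt(messages, add_generation_prompt=True):
    """Render a list of {"role", "content"} dicts as a ChatML string.

    Hypothetical helper for illustration only.
    """
    parts = []
    for message in messages:
        # One ChatML turn: role header, content, end-of-turn marker.
        parts.append(
            f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n"
        )
    if add_generation_prompt:
        # Open an assistant turn so the model writes the next reply.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


messages = [
    {"role": "user", "content": "Take off your helmet."},
]
prompt = build_chatml_prompt(messages)
print(prompt)
```

The trailing `<|im_start|>assistant` line opens the turn the model is expected to complete. No system message is included here, in line with the sampler preset's advice to keep the system prompt blank for chat.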
+
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
+
+ ## 🎲 Recommended Sampler Preset
+
+ ```yaml
+ temperature: 1.8
+ min_p: 0.1
+ System_Prompt: Keep blank for best chat experience.
+ ```
+ </div>
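
For readers unfamiliar with `min_p`: it keeps only the tokens whose probability is at least `min_p` times that of the most likely token, which helps a temperature as high as 1.8 stay readable. Below is a minimal pure-Python sketch, assuming the filter runs before temperature scaling. Sampler order differs between backends, so treat this as one possible arrangement rather than what any particular backend does; `min_p_filter`, `sample_token`, and the toy logits are hypothetical.

```python
import math
import random


def softmax(logits):
    """Convert raw logits to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def min_p_filter(logits, min_p=0.1):
    """Return indices of tokens with probability >= min_p * top probability."""
    probs = softmax(logits)
    threshold = min_p * max(probs)
    return [i for i, p in enumerate(probs) if p >= threshold]


def sample_token(logits, temperature=1.8, min_p=0.1, rng=random):
    """Sample one token index: min_p filter first, then temperature (assumed order)."""
    kept = min_p_filter(logits, min_p)
    # Flatten the surviving logits with temperature, then renormalise.
    probs = softmax([logits[i] / temperature for i in kept])
    return rng.choices(kept, weights=probs, k=1)[0]


# Toy logits for a four-token vocabulary (made-up numbers).
toy_logits = [5.0, 4.0, 1.0, -2.0]
print(min_p_filter(toy_logits))  # indices that survive the cutoff
print(sample_token(toy_logits))  # one sampled index from the survivors
```

The cutoff scales with the model's confidence: when one token dominates, few alternatives survive; when the distribution is flat, many do.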
+
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
+
+ ## Axolotl Config ꒰(˶• ᴗ •˶)꒱
+
+ <details>
+
+ ```yaml
+ base_model: NewEden/32B-inst
+ model_type: AutoModelForCausalLM
+ tokenizer_type: AutoTokenizer
+
+ hub_model_id: NewEden/32b-rp
+ hub_strategy: "all_checkpoints"
+ push_dataset_to_hub:
+ hf_use_auth_token: true
+
+ plugins:
+   - axolotl.integrations.liger.LigerPlugin
+   - axolotl.integrations.cut_cross_entropy.CutCrossEntropyPlugin
+ liger_rope: true
+ liger_rms_norm: true
+ liger_layer_norm: true
+ liger_glu_activation: true
+ liger_fused_linear_cross_entropy: false
+ cut_cross_entropy: true
+
+ load_in_8bit: false
+ load_in_4bit: false
+ strict: false
+
+ datasets:
+   - path: NewEden/RP-logs-V2-Experimental-prefixed
+     type: dan-chat-advanced
+   - path: NewEden/Creative_Writing-Complexity
+     type: dan-chat-advanced
+   - path: NewEden/Discord-Filtered
+     type: dan-chat-advanced
+   - path: NewEden/DeepseekRP-Filtered
+     type: dan-chat-advanced
+   - path: NewEden/Storium-Prefixed-Clean
+     type: dan-chat-advanced
+   - path: NewEden/Basket-Weaving-Filtered
+     type: dan-chat-advanced
+   - path: NewEden/LIMARP-Complexity
+     type: dan-chat-advanced
+   - path: NewEden/Misc-Data-Sharegpt-Prefixed
+     type: dan-chat-advanced
+   - path: NewEden/BlueSky-10K-Complexity
+     type: dan-chat-advanced
+   - path: NewEden/OpenCAI-ShareGPT
+     type: dan-chat-advanced
+   - path: NewEden/Basket-Weaving-Filtered
+     type: dan-chat-advanced
+   - path: PocketDoc/Dans-Personamaxx-VN
+     type: dan-chat-advanced
+   - path: PocketDoc/Dans-Kinomaxx-VanillaBackrooms
+     type: dan-chat-advanced
+ dataset_prepared_path: prepared_data
+ val_set_size: 0.0
+ output_dir: ./qwq-inst
+
+ sequence_len: 32768
+ sample_packing: true
+ pad_to_sequence_len: true
+
+ # adapter: lora
+ # lora_model_dir:
+ # lora_r: 128
+ # lora_alpha: 16
+ # lora_dropout: 0.05
+ # lora_target_modules:
+ # - gate_proj
+ # - down_proj
+ # - up_proj
+ # - q_proj
+ # - v_proj
+ # - k_proj
+ # - o_proj
+
+ wandb_project: qwq
+ wandb_entity:
+ wandb_watch:
+ wandb_name: rp-attempt-03
+ wandb_log_model:
+
+ gradient_accumulation_steps: 2
+ micro_batch_size: 2
+ num_epochs: 4
+ optimizer: adamw_bnb_8bit
+ lr_scheduler: cosine
+ learning_rate: 2.5e-5
+ max_grad_norm: 1.0
+
+ train_on_inputs: false
+ group_by_length: false
+ bf16: auto
+ fp16:
+ tf32: false
+
+ gradient_checkpointing: unsloth
+ early_stopping_patience:
+ resume_from_checkpoint:
+ local_rank:
+ logging_steps: 1
+ xformers_attention:
+ flash_attention: true
+
+ warmup_steps: 40
+ saves_per_epoch: 2
+ debug:
+ deepspeed: deepspeed_configs/zero3_bf16.json
+ weight_decay: 0.02
+ fsdp:
+ fsdp_config:
+ special_tokens:
+ ```
+
+ </details>
+ </div>
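
The batch settings in the config combine with the GPU count from the Hardware section roughly as sketched below. This is back-of-the-envelope arithmetic, assuming all 8 GPUs act as data-parallel ranks, every packed sequence is completely full, and the 10M token figure is a per-epoch count. None of those assumptions is confirmed by the card, so read the results as upper bounds and rough estimates.

```python
import math

# Values taken from the card above.
num_gpus = 8                      # "8x H100s"
micro_batch_size = 2              # micro_batch_size
gradient_accumulation_steps = 2   # gradient_accumulation_steps
sequence_len = 32768              # sequence_len, with sample_packing enabled
dataset_tokens = 10_000_000       # "Amount of Tokens: 10M" (assumed per epoch)

# Sequences contributing to one optimizer step across all ranks.
sequences_per_step = micro_batch_size * gradient_accumulation_steps * num_gpus

# Upper bound on tokens per optimizer step if every packed sequence is full.
max_tokens_per_step = sequences_per_step * sequence_len

# Rough lower bound on optimizer steps per epoch under the assumptions above.
steps_per_epoch = math.ceil(dataset_tokens / max_tokens_per_step)

print(f"sequences per optimizer step: {sequences_per_step}")
print(f"max tokens per optimizer step: {max_tokens_per_step}")
print(f"approx. optimizer steps per epoch: {steps_per_epoch}")
```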
+
+ <div align="center">
+
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
+
+ ## ⚡ Credits
+ <div style="display: flex; justify-content: center;">
+ <div style="display: grid; grid-template-columns: repeat(auto-fit, minmax(200px, 1fr)); gap: 10px; margin: 20px 0; max-width: 600px;">
+
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
+ <a href="https://huggingface.co/lucyknada">
+ <img src="https://img.shields.io/badge/%F0%9F%8C%9F-Lucy_Knada-blueviolet" alt="Lucy Knada">
+ </a>
+ </div>
+
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
+ <a href="https://huggingface.co/hamanasu">
+ <img src="https://img.shields.io/badge/%E2%9A%94%EF%B8%8F-jeiku-blueviolet" alt="Ruka">
+ </a>
+ </div>
+
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
+ <a href="https://huggingface.co/intervitens">
+ <img src="https://img.shields.io/badge/%F0%9F%9B%A1%EF%B8%8F-Intervitens-blueviolet" alt="Intervitens">
+ </a>
+ </div>
+
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
+ <a href="https://huggingface.co/kalomaze">
+ <img src="https://img.shields.io/badge/%F0%9F%94%AE-Kalomaze-blueviolet" alt="Kalomaze">
+ </a>
+ </div>
+
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
+ <a href="https://huggingface.co/kubernetes-bad">
+ <img src="https://img.shields.io/badge/%E2%9A%A1-Kubernetes_Bad-blueviolet" alt="Kubernetes Bad">
+ </a>
+ </div>
+
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
+ <a href="https://huggingface.co/anthracite-org">
+ <img src="https://img.shields.io/badge/%F0%9F%8C%91-Anthracite-blueviolet" alt="Anthracite">
+ </a>
+ </div>
+ </div>
+ </div>
+ </div>
+ </div>
+
+ ---
+
+ <div align="center">
+ <div style="font-size:0.8em; opacity:0.8;">Made by</div>
+ <div style="font-size:1.2em; font-weight:bold; background: linear-gradient(45deg, #6e00ff, #00ffff); -webkit-background-clip: text; -webkit-text-fill-color: transparent;">Delta-Vector</div>
+ </div>
+
+ </div>
+ </body>
+ </html>