suayptalha commited on
Commit
683b2b2
·
verified ·
1 Parent(s): b29048b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +626 -3
README.md CHANGED
@@ -1,3 +1,626 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ datasets:
4
+ - ServiceNow-AI/R1-Distill-SFT
5
+ language:
6
+ - en
7
+ base_model:
8
+ - deepseek-ai/DeepSeek-R1-Distill-Llama-8B
9
+ pipeline_tag: text-generation
10
+ library_name: transformers
11
+ ---
12
+
13
+
14
+ <!DOCTYPE html>
15
+ <style>
16
+ ebody {
17
+ font-family: 'Quicksand', sans-serif;
18
+ background: linear-gradient(135deg, #FF69B4 0%, #800080 100%);
19
+ color: #FFFFFF;
20
+ margin: 0;
21
+ padding: 0;
22
+ font-size: 16px;
23
+ min-height: 100vh;
24
+ }
25
+
26
+ .container {
27
+ margin: 20px;
28
+ background-color: rgba(28, 14, 36, 0.95);
29
+ padding: 20px;
30
+ border-radius: 12px;
31
+ box-shadow: 0 4px 20px rgba(255, 105, 180, 0.4);
32
+ border: 1px solid rgba(255, 105, 180, 0.4);
33
+ outline: 1px solid rgba(255, 105, 180, 0.7);
34
+ outline-offset: -1px;
35
+ position: relative;
36
+ backdrop-filter: blur(10px);
37
+ }
38
+
39
+ .container::before {
40
+ content: '';
41
+ position: absolute;
42
+ top: -1px;
43
+ left: -1px;
44
+ right: -1px;
45
+ bottom: -1px;
46
+ border: 1px solid rgba(255, 105, 180, 0.98);
47
+ border-radius: 12px;
48
+ pointer-events: none;
49
+ animation: borderGlow 2s ease-in-out infinite;
50
+ }
51
+
52
+ @keyframes borderGlow {
53
+ 0% {
54
+ box-shadow: 0 0 5px rgba(255, 105, 180, 0.98);
55
+ }
56
+ 50% {
57
+ box-shadow: 0 0 20px rgba(255, 105, 180, 0.98);
58
+ }
59
+ 100% {
60
+ box-shadow: 0 0 5px rgba(255, 105, 180, 0.98);
61
+ }
62
+ }
63
+
64
+ .header h1 {
65
+ font-size: 28px;
66
+ color: #FF69B4;
67
+ margin: 0 0 20px 0;
68
+ text-shadow: 0 0 15px rgba(255, 105, 180, 0.8);
69
+ letter-spacing: 1px;
70
+ }
71
+
72
+ .update-section {
73
+ margin-top: 30px;
74
+ }
75
+
76
+ .update-section h2, h2 {
77
+ font-size: 24px;
78
+ color: #FF69B4;
79
+ text-shadow: 0 0 15px rgba(255, 105, 180, 0.8);
80
+ letter-spacing: 0.5px;
81
+ }
82
+
83
+ .update-section p {
84
+ font-size: 16px;
85
+ line-height: 1.6;
86
+ color: #FFE1FF;
87
+ }
88
+
89
+ .info p {
90
+ color: #FFE1FF;
91
+ line-height: 1.6;
92
+ font-size: 16px;
93
+ }
94
+
95
+ .info img {
96
+ width: 100%;
97
+ border-radius: 10px;
98
+ margin-bottom: 15px;
99
+ box-shadow: 0 0 30px rgba(255, 105, 180, 0.5);
100
+ border: 1px solid rgba(255, 105, 180, 0.4);
101
+ outline: 1px solid rgba(255, 105, 180, 0.7);
102
+ outline-offset: -1px;
103
+ transition: transform 0.3s ease, box-shadow 0.3s ease;
104
+ }
105
+
106
+ .info img:hover {
107
+ transform: scale(1.01);
108
+ box-shadow: 0 0 40px rgba(255, 105, 180, 0.6);
109
+ }
110
+
111
+ a {
112
+ color: #00FFEE;
113
+ text-decoration: none;
114
+ transition: color 0.3s ease;
115
+ }
116
+
117
+ a:hover {
118
+ color: #FF1493;
119
+ }
120
+
121
+ .button {
122
+ display: inline-block;
123
+ background: linear-gradient(45deg, rgba(255, 105, 180, 0.9), rgba(128, 0, 128, 0.9));
124
+ color: #FFFFFF;
125
+ padding: 12px 24px;
126
+ border-radius: 5px;
127
+ cursor: pointer;
128
+ text-decoration: none;
129
+ transition: all 0.3s ease;
130
+ border: 1px solid rgba(255, 105, 180, 0.4);
131
+ }
132
+
133
+ .button:hover {
134
+ background: linear-gradient(45deg, rgba(255, 105, 180, 1), rgba(128, 0, 128, 1));
135
+ box-shadow: 0 0 20px rgba(255, 105, 180, 0.7);
136
+ transform: translateY(-2px);
137
+ }
138
+
139
+ pre {
140
+ background-color: rgba(28, 14, 36, 0.95);
141
+ padding: 15px;
142
+ border-radius: 5px;
143
+ overflow-x: auto;
144
+ border: 1px solid rgba(255, 20, 147, 0.3);
145
+ outline: 1px solid rgba(255, 20, 147, 0.6);
146
+ outline-offset: -1px;
147
+ }
148
+
149
+ code {
150
+ font-family: 'Courier New', monospace;
151
+ color: #FFE1FF;
152
+ }
153
+
154
+ .info-grid {
155
+ display: grid;
156
+ grid-template-columns: repeat(3, 1fr);
157
+ gap: 15px;
158
+ }
159
+
160
+ .creator-section {
161
+ margin: 20px 0;
162
+ }
163
+
164
+ .creator-badge {
165
+ display: inline-flex;
166
+ align-items: center;
167
+ background: rgba(28, 14, 36, 0.95);
168
+ border: 1px solid rgba(255, 20, 147, 0.3);
169
+ border-radius: 8px;
170
+ padding: 10px 15px;
171
+ }
172
+
173
+ .creator-label {
174
+ color: #FFE1FF;
175
+ font-size: 14px;
176
+ margin-right: 8px;
177
+ }
178
+
179
+ .creator-link {
180
+ display: flex;
181
+ align-items: center;
182
+ gap: 5px;
183
+ color: #00FFEE;
184
+ text-decoration: none;
185
+ transition: all 0.3s ease;
186
+ }
187
+
188
+ .creator-name {
189
+ font-weight: 600;
190
+ }
191
+
192
+ .creator-arrow {
193
+ font-size: 16px;
194
+ transition: transform 0.3s ease;
195
+ }
196
+
197
+ .creator-link:hover {
198
+ color: #FF1493;
199
+ }
200
+
201
+ .creator-link:hover .creator-arrow {
202
+ transform: translateX(3px);
203
+ }
204
+
205
+ .model-info {
206
+ margin-top: 30px;
207
+ }
208
+
209
+ .name-legend {
210
+ background: rgba(28, 14, 36, 0.95);
211
+ border: 1px solid rgba(255, 20, 147, 0.3);
212
+ border-radius: 8px;
213
+ padding: 20px;
214
+ margin: 20px 0;
215
+ }
216
+
217
+ .name-legend h3 {
218
+ color: #FF1493;
219
+ font-size: 18px;
220
+ margin: 0 0 15px 0;
221
+ }
222
+
223
+ .legend-grid {
224
+ display: grid;
225
+ gap: 12px;
226
+ }
227
+
228
+ .legend-item {
229
+ display: flex;
230
+ align-items: baseline;
231
+ gap: 10px;
232
+ }
233
+
234
+ .legend-key {
235
+ color: #00FFEE;
236
+ font-weight: 600;
237
+ min-width: 80px;
238
+ }
239
+
240
+ .legend-value {
241
+ color: #FFE1FF;
242
+ }
243
+
244
+ .model-description {
245
+ background: rgba(28, 14, 36, 0.95);
246
+ border: 1px solid rgba(255, 20, 147, 0.3);
247
+ border-radius: 8px;
248
+ padding: 20px;
249
+ }
250
+
251
+ .model-description p {
252
+ margin: 0 0 15px 0;
253
+ line-height: 1.6;
254
+ }
255
+
256
+ .model-description p:last-child {
257
+ margin-bottom: 0;
258
+ }
259
+
260
+ .section-container {
261
+ margin: 40px 0;
262
+ }
263
+
264
+ .info-card {
265
+ background: rgba(28, 14, 36, 0.95);
266
+ border: 1px solid rgba(255, 20, 147, 0.3);
267
+ border-radius: 8px;
268
+ overflow: hidden;
269
+ }
270
+
271
+ .info-header {
272
+ background: rgba(255, 20, 147, 0.1);
273
+ padding: 20px;
274
+ border-bottom: 1px solid rgba(255, 20, 147, 0.3);
275
+ }
276
+
277
+ .info-header h3 {
278
+ color: #FF1493;
279
+ margin: 0 0 10px 0;
280
+ font-size: 20px;
281
+ text-shadow: 0 0 5px rgba(255, 20, 147, 0.3);
282
+ }
283
+
284
+ .model-tags {
285
+ display: flex;
286
+ gap: 8px;
287
+ flex-wrap: wrap;
288
+ }
289
+
290
+ .model-tag {
291
+ background: rgba(0, 255, 238, 0.1);
292
+ color: #00FFEE;
293
+ padding: 4px 8px;
294
+ border-radius: 4px;
295
+ font-size: 12px;
296
+ border: 1px solid rgba(0, 255, 238, 0.2);
297
+ }
298
+
299
+ .model-composition {
300
+ padding: 20px;
301
+ border-bottom: 1px solid rgba(255, 20, 147, 0.3);
302
+ }
303
+
304
+ .model-composition h4 {
305
+ color: #FF1493;
306
+ margin: 0 0 15px 0;
307
+ font-size: 16px;
308
+ }
309
+
310
+ .composition-list {
311
+ list-style: none;
312
+ padding: 0;
313
+ margin: 0;
314
+ display: grid;
315
+ gap: 10px;
316
+ }
317
+
318
+ .composition-list li {
319
+ color: #FFE1FF;
320
+ display: flex;
321
+ align-items: baseline;
322
+ gap: 8px;
323
+ }
324
+
325
+ .model-component {
326
+ color: #00FFEE;
327
+ font-weight: 500;
328
+ min-width: 120px;
329
+ }
330
+
331
+ .template-card {
332
+ background: rgba(28, 14, 36, 0.95);
333
+ border: 1px solid rgba(255, 20, 147, 0.3);
334
+ border-radius: 8px;
335
+ padding: 15px;
336
+ }
337
+
338
+ .template-item {
339
+ display: flex;
340
+ align-items: center;
341
+ gap: 12px;
342
+ }
343
+
344
+ .template-icon {
345
+ width: 24px;
346
+ height: 24px;
347
+ opacity: 0.8;
348
+ }
349
+
350
+ .template-content {
351
+ display: flex;
352
+ align-items: baseline;
353
+ gap: 8px;
354
+ }
355
+
356
+ .template-link {
357
+ color: #00FFEE;
358
+ text-decoration: none;
359
+ font-weight: 500;
360
+ display: flex;
361
+ align-items: center;
362
+ gap: 5px;
363
+ }
364
+
365
+ .template-author {
366
+ color: rgba(255, 225, 255, 0.7);
367
+ font-size: 14px;
368
+ }
369
+
370
+ .quantized-container {
371
+ display: grid;
372
+ gap: 20px;
373
+ }
374
+
375
+ .quantized-section {
376
+ background: rgba(28, 14, 36, 0.95);
377
+ border: 1px solid rgba(255, 20, 147, 0.3);
378
+ border-radius: 8px;
379
+ padding: 20px;
380
+ }
381
+
382
+ .quantized-section h3 {
383
+ color: #FF1493;
384
+ font-size: 18px;
385
+ margin: 0 0 15px 0;
386
+ }
387
+
388
+ .quantized-items {
389
+ display: grid;
390
+ gap: 12px;
391
+ }
392
+
393
+ .quantized-item {
394
+ display: flex;
395
+ align-items: baseline;
396
+ gap: 10px;
397
+ }
398
+
399
+ .quantized-item .author {
400
+ color: rgba(255, 225, 255, 0.7);
401
+ min-width: 100px;
402
+ }
403
+
404
+ .multi-links {
405
+ display: flex;
406
+ align-items: center;
407
+ gap: 8px;
408
+ }
409
+
410
+ .separator {
411
+ color: rgba(255, 225, 255, 0.5);
412
+ }
413
+
414
+ .config-container {
415
+ background: rgba(28, 14, 36, 0.95);
416
+ border: 1px solid rgba(255, 20, 147, 0.3);
417
+ border-radius: 8px;
418
+ overflow: hidden;
419
+ }
420
+
421
+ .config-header {
422
+ background: rgba(255, 20, 147, 0.1);
423
+ padding: 15px 20px;
424
+ border-bottom: 1px solid rgba(255, 20, 147, 0.3);
425
+ }
426
+
427
+ .model-name {
428
+ color: #FF1493;
429
+ font-weight: 600;
430
+ }
431
+
432
+ .config-content {
433
+ padding: 20px;
434
+ }
435
+
436
+ .config-item {
437
+ display: flex;
438
+ flex-direction: column;
439
+ gap: 5px;
440
+ margin-bottom: 15px;
441
+ }
442
+
443
+ .config-label {
444
+ color: #00FFEE;
445
+ font-size: 14px;
446
+ font-weight: 500;
447
+ }
448
+
449
+ .config-value {
450
+ color: #FFE1FF;
451
+ font-family: 'Courier New', monospace;
452
+ }
453
+
454
+ .config-models {
455
+ margin-top: 20px;
456
+ }
457
+
458
+ .model-list {
459
+ list-style: none;
460
+ padding: 0;
461
+ margin: 10px 0 0 0;
462
+ }
463
+
464
+ .model-list li {
465
+ color: #FFE1FF;
466
+ font-family: 'Courier New', monospace;
467
+ padding: 5px 0;
468
+ padding-left: 20px;
469
+ position: relative;
470
+ }
471
+
472
+ .model-list li::before {
473
+ content: '-';
474
+ position: absolute;
475
+ left: 0;
476
+ color: #00FFEE;
477
+ }
478
+
479
+ .link-arrow {
480
+ display: inline-block;
481
+ transition: transform 0.3s ease;
482
+ }
483
+
484
+ a:hover .link-arrow {
485
+ transform: translateX(3px);
486
+ }
487
+
488
+ .benchmark-notification {
489
+ background: rgba(255, 20, 147, 0.15);
490
+ border: 1px solid rgba(255, 20, 147, 0.3);
491
+ border-radius: 8px;
492
+ margin-bottom: 20px;
493
+ padding: 12px;
494
+ animation: glowPulse 2s infinite;
495
+ }
496
+
497
+ .notification-content {
498
+ display: flex;
499
+ align-items: center;
500
+ justify-content: center;
501
+ gap: 10px;
502
+ text-align: center;
503
+ }
504
+
505
+ .notification-icon {
506
+ font-size: 20px;
507
+ }
508
+
509
+ .notification-text {
510
+ color: #FFE1FF;
511
+ font-size: 16px;
512
+ font-weight: 500;
513
+ display: flex;
514
+ flex-direction: column;
515
+ align-items: center;
516
+ gap: 5px;
517
+ }
518
+
519
+ .benchmark-link {
520
+ color: #00FFEE;
521
+ text-decoration: none;
522
+ font-size: 14px;
523
+ padding: 4px 8px;
524
+ border-radius: 4px;
525
+ transition: all 0.3s ease;
526
+ border: 1px solid rgba(0, 255, 238, 0.3);
527
+ }
528
+
529
+ .benchmark-link:hover {
530
+ background: rgba(0, 255, 238, 0.1);
531
+ border-color: rgba(0, 255, 238, 0.5);
532
+ color: #00FFEE;
533
+ text-shadow: 0 0 5px rgba(0, 255, 238, 0.5);
534
+ }
535
+
536
+ @keyframes glowPulse {
537
+ 0% {
538
+ box-shadow: 0 0 5px rgba(255, 20, 147, 0.3);
539
+ }
540
+ 50% {
541
+ box-shadow: 0 0 15px rgba(255, 20, 147, 0.5);
542
+ }
543
+ 100% {
544
+ box-shadow: 0 0 5px rgba(255, 20, 147, 0.3);
545
+ }
546
+ }
547
+
548
+ .review-card {
549
+ background: rgba(28, 14, 36, 0.95);
550
+ border: 1px solid rgba(255, 20, 147, 0.3);
551
+ border-radius: 8px;
552
+ padding: 15px;
553
+ margin-bottom: 15px;
554
+ }
555
+
556
+ .review-card:last-child {
557
+ margin-bottom: 0;
558
+ }
559
+ </style>
560
+
561
+ <html lang="en">
562
+ <head>
563
+ <meta charset="UTF-8">
564
+ <meta name="viewport" content="width=device-width, initial-scale=1.0">
565
+ <title>Maestro-R1-Llama-8B</title>
566
+ <link href="https://fonts.googleapis.com/css2?family=Quicksand:wght@400;500;600&display=swap" rel="stylesheet">
567
+ <link href="styles.css" rel="stylesheet">
568
+ </head>
569
+ <body>
570
+ <div class="container">
571
+ <div class="header">
572
+ <h1>Maestro-R1-Llama-8B</h1>
573
+ </div>
574
+ <div class="info">
575
+ <img src="Maestro-Logo.png" alt="Model banner">
576
+ <div class="creator-section">
577
+ <div class="creator-badge">
578
+ <span class="creator-label">Created by</span>
579
+ <a href="https://huggingface.co/suayptalha" target="_blank" class="creator-link">
580
+ <span class="creator-name">suayptalha</span>
581
+ <span class="creator-arrow">→</span>
582
+ </a>
583
+ </div>
584
+ </div>
585
+ <div class="model-info">
586
+ <h2>Model Information</h2>
587
+ <div class="info-card">
588
+ <div class="info-header">
589
+ <h3>Maestro-R1-Llama-8B</h3>
590
+ <div class="model-tags">
591
+ <span class="model-tag">Maestro-R1-Llama-8B</span>
592
+ <span class="model-tag">deepseek-ai/DeepSeek-R1-Distill-Llama-8B</span>
593
+ <span class="model-tag">8B Parameters</span>
594
+ </div>
595
+ </div>
596
+ <div class="model-composition">
597
+ <h4>Base Model</h4>
598
+ <ul class="composition-list">
599
+ <li><span class="model-component"><a href="deepseek-ai/DeepSeek-R1-Distill-Llama-8B" target="_blank">DeepSeek-R1-Distill-Llama-8B</a></span></li>
600
+ </ul>
601
+ </div>
602
+ <div class="model-description">
603
+ Maestro-R1-Llama-8B is a powerful language model fine-tuned from DeepSeek-R1-Distill-Llama-8B, a distilled model based on the Llama-3 architecture. DeepSeek-R1-Distill-Llama-8B itself is derived from the Llama-3 architecture, with a distillation process from DeepSeek-v3, utilizing a large corpus of diverse data. This distillation enables the model to retain strong reasoning capabilities while maintaining a smaller parameter count.
604
+ <br>
605
+ Maestro-R1-Llama-8B builds on this foundation, further enhancing its performance through fine-tuning on the ServiceNow-AI/R1-Distill-SFT dataset. This fine-tuning step sharpens the model's ability to handle specialized tasks and improves its reasoning, problem-solving, and code generation capabilities. The combination of the distilled base model and domain-specific fine-tuning makes Maestro-R1-Llama-8B an efficient and robust model, excelling across a wide range of language tasks.
606
+ </div>
607
+ <div class="model-composition">
608
+ <h4>Loss Graph</h4>
609
+ <img src="loss.png" alt="Model banner">
610
+ </div>
611
+ </div>
612
+ <div class="support-section">
613
+ <h2>Support & Community:</h2>
614
+ <div class="support-buttons">
615
+ <a href="https://buymeacoffee.com/suayptalha" target="_blank" class="button">
616
+ Buy me a coffee
617
+ </a>
618
+ <a href="https://discord.com/users/suaypt" target="_blank" class="button">
619
+ suayptalha - Discord
620
+ </a>
621
+ </div>
622
+ </div>
623
+ </div>
624
+ </div>
625
+ </body>
626
+ </html>