parameters guide
samplers guide
model generation
role play settings
quant selection
arm quants
iq quants vs q quants
optimal model setting
gibberish fixes
coherence
instructing following
quality generation
chat settings
quality settings
llamacpp server
llamacpp
lmstudio
sillytavern
koboldcpp
backyard
ollama
model generation steering
steering
model generation fixes
text generation webui
ggufs
exl2
full precision
quants
imatrix
neo imatrix
Update README.md
Browse files
README.md
CHANGED
@@ -30,6 +30,8 @@ tags:
|
|
30 |
|
31 |
<h3>Maximizing Model Performance for All Quants Types And Full-Precision using Samplers, Advance Samplers and Parameters Guide</h3>
|
32 |
|
|
|
|
|
33 |
This document includes detailed information, references, and notes for general parameters, samplers and
|
34 |
advanced samplers to get the most out of your model's abilities including notes / settings for the most popular AI/LLM app in use (LLAMACPP, KoboldCPP, Text-Generation-WebUI, LMStudio, Sillytavern, Ollama and others).
|
35 |
|
@@ -86,11 +88,16 @@ You will get higher quality operation overall - stronger prose, better answers,
|
|
86 |
|
87 |
Review quant(s) information to select quant(s) to download, then review "Class 1,2,3..." for specific information on models followed by "Source Files...APPS to run LLMs/AIs".
|
88 |
|
|
|
|
|
|
|
|
|
89 |
"Quick reference" will state the best parameter settings for each "Class" of model(s) to get the best operation and/or good defaults to use to get started. If you came to this page from a repo card on my repo -DavidAU- the "class" of the model would have been stated just before you came to this page.
|
90 |
|
91 |
The detailed sections about parameters - Section 1 a,b,c and section 2 will help tune the model(s) operation.
|
92 |
|
93 |
-
The "DETAILED NOTES ON PARAMETERS, SAMPLERS and ADVANCED SAMPLERS" section after this covers and links to more information about "tuning" your model(s). These cover theory, hints, tips and tricks, and observations
|
|
|
94 |
|
95 |
All information about parameters, samplers and advanced samplers applies to ALL models, regardless of repo(s) you download them from.
|
96 |
|
@@ -112,9 +119,15 @@ SOURCE FILES for my Models / APPS to Run LLMs / AIs:
|
|
112 |
- SILLYTAVERN
|
113 |
- OTHER PROGRAMS
|
114 |
|
115 |
-
TESTING / Generation Example PARAMETERS AND SAMPLERS
|
|
|
|
|
|
|
|
|
|
|
116 |
|
117 |
Quick Reference Table - Parameters, Samplers, Advanced Samplers
|
|
|
118 |
|
119 |
Section 1a : PRIMARY PARAMETERS - ALL APPS
|
120 |
Section 1b : PENALITY SAMPLERS - ALL APPS
|
@@ -499,6 +512,138 @@ These parameters/settings are considered both safe and default and in most cases
|
|
499 |
Note for Class 3/Class 4 models (discussed below) "repeat-last-n" is a CRITICAL setting.
|
500 |
|
501 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
502 |
|
503 |
---
|
504 |
|
|
|
30 |
|
31 |
<h3>Maximizing Model Performance for All Quants Types And Full-Precision using Samplers, Advance Samplers and Parameters Guide</h3>
|
32 |
|
33 |
+
(Updated: "INDEX", and added "Generation Steering" section)
|
34 |
+
|
35 |
This document includes detailed information, references, and notes for general parameters, samplers and
|
36 |
advanced samplers to get the most out of your model's abilities including notes / settings for the most popular AI/LLM app in use (LLAMACPP, KoboldCPP, Text-Generation-WebUI, LMStudio, Sillytavern, Ollama and others).
|
37 |
|
|
|
88 |
|
89 |
Review quant(s) information to select quant(s) to download, then review "Class 1,2,3..." for specific information on models followed by "Source Files...APPS to run LLMs/AIs".
|
90 |
|
91 |
+
"TESTING / Default / Generation Example PARAMETERS AND SAMPLERS" are the basic defaults for parameters, and samplers - the bare minimums. You should always set these first.
|
92 |
+
|
93 |
+
The optional section "Generational Control And Steering of a Model / Fixing Model Issues on the Fly" covers methods to manually steer / edit / modify generation (as well as fixes) for any model.
|
94 |
+
|
95 |
"Quick reference" will state the best parameter settings for each "Class" of model(s) to get the best operation and/or good defaults to use to get started. If you came to this page from a repo card on my repo -DavidAU- the "class" of the model would have been stated just before you came to this page.
|
96 |
|
97 |
The detailed sections about parameters - Section 1 a,b,c and section 2 will help tune the model(s) operation.
|
98 |
|
99 |
+
The "DETAILED NOTES ON PARAMETERS, SAMPLERS and ADVANCED SAMPLERS" section after this covers and links to more information about "tuning" your model(s). These cover theory, hints, tips and tricks, and observations
|
100 |
+
and how to fine control CLASS 3/4 models directly.
|
101 |
|
102 |
All information about parameters, samplers and advanced samplers applies to ALL models, regardless of repo(s) you download them from.
|
103 |
|
|
|
119 |
- SILLYTAVERN
|
120 |
- OTHER PROGRAMS
|
121 |
|
122 |
+
TESTING / Default / Generation Example PARAMETERS AND SAMPLERS
|
123 |
+
|
124 |
+
Generational Control And Steering of a Model / Fixing Model Issues on the Fly
|
125 |
+
- Multiple Methods to Steer Generation on the fly
|
126 |
+
- On the fly Class 3/4 Steering / Generational Issues and Fixes (also for any model/type)
|
127 |
+
- Advanced Steering / Fixing Issues (any model, any type) and "sequenced" parameter/sampler change(s)
|
128 |
|
129 |
Quick Reference Table - Parameters, Samplers, Advanced Samplers
|
130 |
+
- Quick setup for all model classes for automated control / smooth operation.
|
131 |
|
132 |
Section 1a : PRIMARY PARAMETERS - ALL APPS
|
133 |
Section 1b : PENALITY SAMPLERS - ALL APPS
|
|
|
512 |
Note for Class 3/Class 4 models (discussed below) "repeat-last-n" is a CRITICAL setting.
|
513 |
|
514 |
|
515 |
+
---
|
516 |
+
|
517 |
+
<H2>Generational Control And Steering of a Model / Fixing Model Issues on the Fly</h2>
|
518 |
+
|
519 |
+
---
|
520 |
+
|
521 |
+
<B>Multiple Methods to Steer Generation on the fly</B>
|
522 |
+
|
523 |
+
Now that you have the basic parameters and samplers from the previous section, I will cover Generational Control and Steering.
|
524 |
+
|
525 |
+
This section will also cover how to deal with Class 3/4 issues directly, as well as general issues than can happen with any "class" of model during generation.
|
526 |
+
|
527 |
+
There is a very important concept that must be covered first:
|
528 |
+
|
529 |
+
The output/generation/answer to your prompt/instructions BECOMES part of your "prompt" after you click STOP, and then click on "CONTINUE".
|
530 |
+
|
531 |
+
Likewise is true in multi-turn chat, role play, or in a "chat window" so to speak.
|
532 |
+
|
533 |
+
Your prompts AND the model's "answers"/"generation" all become part of the "ROADMAP" for the model to use in whatever journey you are on.
|
534 |
+
|
535 |
+
When you hit "REGEN" this nullifies only the last "generation" - not the prompt before it, nor the prompt(s)/generation(s) in the same chat.
|
536 |
+
|
537 |
+
The part I will cover here is once a generation has started, from a single prompt (no other prompts/generations in the chat).
|
538 |
+
|
539 |
+
So we have a prompt:
|
540 |
+
|
541 |
+
Start a 1000 word scene (vivid horror, 1st person, include thoughts) with: The sky scraper swayed, as she watched the window in front of her on the 21 floor explode...
|
542 |
+
|
543 |
+
Generation starts ... and then ends.
|
544 |
+
|
545 |
+
Then you hit regen however many times to get a "good" generation.
|
546 |
+
|
547 |
+
There is a better way.
|
548 |
+
|
549 |
+
Generation starts... 200 words in you think... this is not going in the right direction.
|
550 |
+
|
551 |
+
Do you hit stop? Then regen?
|
552 |
+
|
553 |
+
There are a lot more options:
|
554 |
+
|
555 |
+
1 - Hit Stop.
|
556 |
+
|
557 |
+
2 - Select "EDIT" -> Edit out the part(s) you don't want AND/OR add in STEERING "text" (statement, phrase, paragraph, even a single word) (anywhere in the "generation" text).
|
558 |
+
|
559 |
+
3 - Hit Continue.
|
560 |
+
|
561 |
+
Once you hit "continue" the change(s) you made will now steer the models choices.
|
562 |
+
|
563 |
+
The LAST edit (bottom of the generation) will have the most impact. However ALL EDITS will affect generation as these become part of the generational "ROADMAP".
|
564 |
+
|
565 |
+
You can repeat this process at will.
|
566 |
+
|
567 |
+
Eventually the model will come to a "natural" stopping point.
|
568 |
+
|
569 |
+
If you want to model to continue past this model, delete a few lines AND "steer" it.
|
570 |
+
|
571 |
+
These methods apply to all generation types - not just a "scene" or "story", but "programming code", "article", "conclusions", "analytics", ... you name it.
|
572 |
+
|
573 |
+
Notes:
|
574 |
+
- For Text Generation Webui, you can transfer your "chat" to "notebook" for easy Stop/Edit/Continue function.
|
575 |
+
- For Silly Tavern -> This is built in.
|
576 |
+
- For LMStudio -> This is built in.
|
577 |
+
- For API (direct control) you have to send the "chat" elements back to the "server" with the "edits" (send the whole "revised" chat as a json payload).
|
578 |
+
|
579 |
+
<B>On the fly Class 3/4 Steering / Generational Issues and Fixes (also for any model/type):</B>
|
580 |
+
|
581 |
+
Generational issues can occur such as letter(s), word(s), phrase(s), paragraph repeat(s), "rants" etc etc which can occur at any point during generation.
|
582 |
+
|
583 |
+
This can happen to ANY model, any type ; however with Class 3/4 models there is a higher chance this will occur because of how these models operate.
|
584 |
+
|
585 |
+
The "Quick Reference" and Detailed Parameters, Samplers and Advanced Samplers (below) cover how to set the model "controls" to do this automatically.
|
586 |
+
|
587 |
+
However, sometimes these settings MAY trim too much (ie creativity, "madness", nuance, emotion, even the "right answer(s) etc etc) sometimes, so I will show you how to address these issues directly.
|
588 |
+
|
589 |
+
If you have a letter(s) and/or word(s) repeat:
|
590 |
+
|
591 |
+
- Stop generation, edit out this, and back ONE OR TWO lines (delete)
|
592 |
+
- Hit continue.
|
593 |
+
- Better: Do these steps, and add "steering" (last line -> word, phrase, sentence)
|
594 |
+
|
595 |
+
If you have single or multiple paragraph repeat(s):
|
596 |
+
|
597 |
+
- Stop generation, edit out all the paragraph(s), and back ONE OR TWO lines OR last NON repeating paragraph (delete)
|
598 |
+
- Hit continue.
|
599 |
+
- Better: Do these steps, and add "steering" (last line -> word, phrase, sentence or paragraph)
|
600 |
+
|
601 |
+
<B>Advanced Steering / Fixing Issues (any model, any type) and "sequenced" parameter/sampler change(s)</B>
|
602 |
+
|
603 |
+
This will drastically (depending on changes you make) change up "Continue(d)" generation(s):
|
604 |
+
|
605 |
+
- Do the edits above (steering and/or "steering fixes"), but before you click "Continue" (after your "Edit(s)"), adjust the parameter(s), sampler(s) and advanced sampler(s) settings.
|
606 |
+
- Once you do this BEFORE hitting "Continue" your new settings will be applied to all generation from your new "Continue" point.
|
607 |
+
- You can repeat this process at will.
|
608 |
+
- You can also hit "stop", make NO EDIT(S), adjust the parameter(s), sampler(s) and advanced sampler(s) settings and hit "Continue" and the new settings will take effect from the "stop point" going forward.
|
609 |
+
|
610 |
+
<B>"Cold" Editing/Generation</B>
|
611 |
+
|
612 |
+
Let say you have a generation, but you want to edit it later IN A NEW CHAT.
|
613 |
+
|
614 |
+
Sometimes you can just copy/paste the generation and the model MAY get the "IDEA" and continue the generation without a prompt or direction.
|
615 |
+
|
616 |
+
However this does not always work.
|
617 |
+
|
618 |
+
So you need something along these lines (adjust accordingly):
|
619 |
+
|
620 |
+
Instructions: Continue this scene, using vivid and graphic details.
|
621 |
+
|
622 |
+
SCENE:
|
623 |
+
|
624 |
+
(previous generation)
|
625 |
+
|
626 |
+
Note the structure, layout and spacing.
|
627 |
+
|
628 |
+
If it was programming code:
|
629 |
+
|
630 |
+
Instructions: Continue this javascript, [critical instructions here for "code" goals]
|
631 |
+
|
632 |
+
JAVASCRIPT:
|
633 |
+
|
634 |
+
(previous generation)
|
635 |
+
|
636 |
+
You may want to include the ENTIRE prior prompt (with some modifications) used in the first generation:
|
637 |
+
|
638 |
+
Instructions: Continue the scene below (vivid horror, 1st person, include thoughts) with: The sky scraper swayed, as she watched the window in front of her on the 21 floor explode...
|
639 |
+
|
640 |
+
SCENE:
|
641 |
+
|
642 |
+
(previous generation)
|
643 |
+
|
644 |
+
NOTE:
|
645 |
+
|
646 |
+
You may want to modify the instructions to provide a "steering" continue point and/or "goal" for the generation to the model has some idea how to proceed.
|
647 |
|
648 |
---
|
649 |
|