Update README.md
Browse files
README.md
CHANGED
@@ -4,7 +4,9 @@ A GPT-2 M 355M parameter model for usage with [ai-msgbot](https://github.com/psz
|
|
4 |
|
5 |
This model was fine-tuned on a parsed version of The Wizard of Wikipedia dataset for 10,000 steps. 20/24 layers were frozen for the fine-tuning process.
|
6 |
|
7 |
-
##
|
|
|
|
|
8 |
|
9 |
`script_speaker_name` = `person alpha`
|
10 |
|
|
|
4 |
|
5 |
This model was fine-tuned on a parsed version of The Wizard of Wikipedia dataset for 10,000 steps. 20/24 layers were frozen for the fine-tuning process.
|
6 |
|
7 |
+
## conversation data
|
8 |
+
|
9 |
+
The dataset was tokenized and fed to the model as a conversation between two speakers, whose names are below. this is relevant for writing prompts and filtering/extracting text from responses.
|
10 |
|
11 |
`script_speaker_name` = `person alpha`
|
12 |
|