dfurman
/

Mistral-7B-Instruct-v0.1

Text Generation

Model card Files Files and versions Community

dfurman commited on Nov 13, 2023

Commit

dff6033

·

1 Parent(s): 344ca4f

Update README.md

Files changed (1) hide show

README.md +4 -2

README.md CHANGED Viewed

@@ -49,7 +49,7 @@ We use Eleuther.AI's [Language Model Evaluation Harness](https://github.com/Eleu
 It took ~1 hour to train 1 epoch on 1x A100.
 Prompt format:
-This model (and all my future releases) use [ChatML](https://huggingface.co/docs/transformers/chat_templating#what-template-should-i-use) prompt format.
 ```
 <|im_start|>system
@@ -57,9 +57,11 @@ You are a helpful assistant.<|im_end|>
 <|im_start|>user
 {prompt}<|im_end|>
 <|im_start|>assistant
-### Training Hyperparameters
 ```
 We use the [SFTTrainer](https://huggingface.co/docs/trl/main/en/sft_trainer) from `trl` to fine-tune llms on instruction-following datasets.
 The following `TrainingArguments` config was used:

 It took ~1 hour to train 1 epoch on 1x A100.
 Prompt format:
+This model (and all my future releases) uses the [ChatML](https://huggingface.co/docs/transformers/chat_templating#what-template-should-i-use) prompt format, which was developed by OpenAI.
 ```
 <|im_start|>system
 <|im_start|>user
 {prompt}<|im_end|>
 <|im_start|>assistant
 ```
+### Training Hyperparameters
 We use the [SFTTrainer](https://huggingface.co/docs/trl/main/en/sft_trainer) from `trl` to fine-tune llms on instruction-following datasets.
 The following `TrainingArguments` config was used: