Commit b7eea4b (parent: 12df6af)
dfurman committed

Update README.md

Files changed (1): README.md (+11 -3)
README.md CHANGED

````diff
@@ -166,8 +166,16 @@ Shake all ingredients in a shaker filled with ice until well chilled and strain
 
 It took ~3 hours to train 3 epochs on 1x A100 (40 GB SXM).
 
-Prompt format:
-This model uses the same prompt format as [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) and does **not** expect a system prompt. This format is available as a [chat template](https://huggingface.co/docs/transformers/main/chat_templating) via the `apply_chat_template()` method. Here's an illustrative example:
+### Prompt Format
+
+This model was finetuned with the following format:
+
+```python
+tokenizer.chat_template = "{{ bos_token }}{% for message in messages %}{% if (message['role'] == 'user') != (loop.index0 % 2 == 0) %}{{ raise_exception('Conversation roles must alternate user/assistant/user/assistant/...') }}{% endif %}{% if message['role'] == 'user' %}{{ ' [INST] ' + message['content'] + ' [/INST] ' }}{% elif message['role'] == 'assistant' %}{{ message['content'] + eos_token + ' ' }}{% else %}{{ raise_exception('Only user and assistant roles are supported!') }}{% endif %}{% endfor %}"
+```
+
+
+This format is available as a [chat template](https://huggingface.co/docs/transformers/main/chat_templating) via the `apply_chat_template()` method. Here's an illustrative example:
 
 ```python
 messages = [
@@ -188,7 +196,7 @@ print(prompt)
 ```
 </details>
 
-## Training Hyperparameters
+### Training Hyperparameters
 
 
 We use the [SFTTrainer](https://huggingface.co/docs/trl/main/en/sft_trainer) from `trl` to fine-tune LLMs on instruction-following datasets.
````
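For context, here is a minimal sketch of what the README's `apply_chat_template()` example resolves to under the template added above. It assumes the fine-tuned checkpoint ships this template; the repo id and the message content are placeholders, since neither appears in this hunk.

```python
from transformers import AutoTokenizer

# Placeholder repo id: the fine-tuned model's id isn't shown in this diff,
# so the base model's tokenizer stands in and the template is set by hand.
tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-Instruct-v0.1")
tokenizer.chat_template = "{{ bos_token }}{% for message in messages %}{% if (message['role'] == 'user') != (loop.index0 % 2 == 0) %}{{ raise_exception('Conversation roles must alternate user/assistant/user/assistant/...') }}{% endif %}{% if message['role'] == 'user' %}{{ ' [INST] ' + message['content'] + ' [/INST] ' }}{% elif message['role'] == 'assistant' %}{{ message['content'] + eos_token + ' ' }}{% else %}{{ raise_exception('Only user and assistant roles are supported!') }}{% endif %}{% endfor %}"

# Roles must alternate user/assistant, or the template raises an exception.
messages = [
    {"role": "user", "content": "Tell me a recipe for a mai tai."},
]

# tokenize=False returns the formatted prompt string rather than token ids.
prompt = tokenizer.apply_chat_template(messages, tokenize=False)
print(prompt)
# -> "<s> [INST] Tell me a recipe for a mai tai. [/INST] "
```

Note there is no system-prompt slot: only `user` and `assistant` roles render, matching the old README's statement that the format does **not** expect a system prompt.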
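The `SFTTrainer` context line is the only training detail visible in this hunk. As a rough sketch only, a trl-0.7-era invocation looks like the following; the dataset, output directory, and sequence length are invented placeholders, and newer trl releases move `dataset_text_field` and `max_seq_length` into `SFTConfig`.

```python
from datasets import load_dataset
from transformers import TrainingArguments
from trl import SFTTrainer

# Placeholder dataset: the card's actual training data isn't shown in this hunk.
dataset = load_dataset("timdettmers/openassistant-guanaco", split="train")

trainer = SFTTrainer(
    model="mistralai/Mistral-7B-Instruct-v0.1",  # base-model stand-in, loaded via AutoModelForCausalLM
    train_dataset=dataset,
    args=TrainingArguments(output_dir="sft-out", num_train_epochs=3),  # 3 epochs per the README
    dataset_text_field="text",  # dataset column holding the formatted prompt/response text
    max_seq_length=1024,        # placeholder value
)
trainer.train()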