Update README.md
README.md CHANGED
```diff
@@ -19,11 +19,11 @@ base_model: mistralai/Mistral-7B-v0.1
 
 # Mistral-7B-Instruct-v0.1
 
-General instruction-following
+General instruction-following LLM finetuned from [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1).
 
 ## Model Details
 
-This instruction-following
+This instruction-following LLM was built via parameter-efficient QLoRA finetuning of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the first 5k rows of [ehartford/dolphin](https://huggingface.co/datasets/ehartford/dolphin). Finetuning was executed on 1x A100 (40 GB SXM) for roughly 1 hour on Google Colab.
 
 - **Developed by:** Daniel Furman
 - **Model type:** Decoder-only
```
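The QLoRA description added above maps to a fairly standard `peft` + `bitsandbytes` setup. A minimal sketch, assuming illustrative hyperparameters; the rank, alpha, dropout, and target modules below are assumptions, not values recorded in this diff:

```python
# Hypothetical QLoRA setup: 4-bit quantized frozen base model + LoRA adapters.
# All hyperparameter values below are illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                       # QLoRA: base weights quantized to 4-bit
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mistral-7B-v0.1",
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

lora_config = LoraConfig(
    r=16,                                    # assumed rank
    lora_alpha=32,                           # assumed scaling
    lora_dropout=0.05,                       # assumed
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # assumed
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()           # only the LoRA adapters train
```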
```diff
@@ -154,7 +154,7 @@ You are a helpful assistant.<|im_end|>
 ## Training Hyperparameters
 
 
-We use the [SFTTrainer](https://huggingface.co/docs/trl/main/en/sft_trainer) from `trl` to fine-tune
+We use the [SFTTrainer](https://huggingface.co/docs/trl/main/en/sft_trainer) from `trl` to fine-tune LLMs on instruction-following datasets.
 
 The following `TrainingArguments` config was used:
 
```
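The `TrainingArguments` config itself sits below this hunk and is not part of the diff. As a rough sketch of how `SFTTrainer` is typically wired up for a run like this; every hyperparameter, the dataset loading call, and the text field name below are assumptions rather than the card's recorded values:

```python
# Hypothetical SFTTrainer wiring; hyperparameters are assumptions,
# not the values recorded in the model card.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer, TrainingArguments
from trl import SFTTrainer

# First 5k rows of dolphin, per the card (exact loading call is an assumption;
# the dataset ships as JSONL files, so data_files may be needed).
dataset = load_dataset("ehartford/dolphin", split="train[:5000]")

tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1")
# In practice this would be the QLoRA-wrapped model from the sketch above.
model = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-v0.1")

training_args = TrainingArguments(
    output_dir="mistral-7b-instruct-v0.1",
    num_train_epochs=1,                  # assumed
    per_device_train_batch_size=4,       # assumed
    gradient_accumulation_steps=4,       # assumed
    learning_rate=2e-4,                  # assumed
    bf16=True,                           # A100 supports bfloat16
    logging_steps=10,
)

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    args=training_args,
    train_dataset=dataset,
    dataset_text_field="text",  # assumed; dolphin rows need formatting into one text field
    max_seq_length=1024,        # assumed (newer trl versions move these into SFTConfig)
)
trainer.train()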