Update README.md
README.md
@@ -60,7 +60,7 @@ You are a helpful assistant.<|im_end|>
 ### Training Hyperparameters
 ```
 
-We use the [
+We use the [SFTTrainer](https://huggingface.co/docs/trl/main/en/sft_trainer) from `trl` to fine-tune llms on instruction-following datasets.
 
 The following `TrainingArguments` config was used:
 