Update README.md
Browse files
README.md
CHANGED
|
@@ -60,7 +60,7 @@ You are a helpful assistant.<|im_end|>
|
|
| 60 |
### Training Hyperparameters
|
| 61 |
```
|
| 62 |
|
| 63 |
-
We use the [
|
| 64 |
|
| 65 |
The following `TrainingArguments` config was used:
|
| 66 |
|
|
|
|
| 60 |
### Training Hyperparameters
|
| 61 |
```
|
| 62 |
|
| 63 |
+
We use the [SFTTrainer](https://huggingface.co/docs/trl/main/en/sft_trainer) from `trl` to fine-tune llms on instruction-following datasets.
|
| 64 |
|
| 65 |
The following `TrainingArguments` config was used:
|
| 66 |
|