license: mit | |
datasets: | |
- jaredjoss/jigsaw-long-2000 | |
language: | |
- en | |
lomahony/eleuther-pythia410m-hh-sft model fine-tuned on the jaredjoss/jigsaw-long-2000 dataset using RLHF. | |
The following parameters were used to train the model; | |
<figure style="width:16em"> | |
| Parameter | Value | | |
| --------------------: | ---------: | | |
| Size | 410m | | |
| learning rate | 8e-7 | | |
| steps | 12000 | | |
</figure> |