Update Hypers
README.md
@@ -180,15 +180,12 @@ Custom synthetic
 
 The following hyperparameters were used during training:
 - learning_rate: 2e-04
-- train_batch_size:
-- eval_batch_size:
+- train_batch_size: 10
+- eval_batch_size: 3
 - distributed_type: multi-GPU
 - num_devices: 2
-- total_train_batch_size: 100
-- total_eval_batch_size: 10
 - optimizer: Adam 8bit
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 10
 - num_epochs: 3
 
 ### Training results
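Because this change drops the `total_train_batch_size` and `total_eval_batch_size` entries, a reader may want to re-derive them from the updated values. The following is a minimal sketch, not the author's training code. It assumes that `train_batch_size` and `eval_batch_size` are per-device values and that gradient accumulation is 1; neither is stated in the README. The helper name `effective_batch_size` and the `grad_accum_steps` parameter are hypothetical.

```python
# Updated hyperparameters as listed on the "+" side of the diff.
hparams = {
    "learning_rate": 2e-4,
    "train_batch_size": 10,  # assumed to be per device
    "eval_batch_size": 3,    # assumed to be per device
    "num_devices": 2,
    "num_epochs": 3,
}


def effective_batch_size(per_device: int, num_devices: int, grad_accum_steps: int = 1) -> int:
    """Samples processed per step across all devices.

    grad_accum_steps is an assumption; the README does not list it.
    """
    return per_device * num_devices * grad_accum_steps


total_train = effective_batch_size(hparams["train_batch_size"], hparams["num_devices"])
total_eval = effective_batch_size(hparams["eval_batch_size"], hparams["num_devices"])
print(total_train, total_eval)
```

If the run actually used gradient accumulation, pass that value as `grad_accum_steps` for the training figure; the evaluation figure is unaffected by it.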