brad1141 commited on
Commit
bdea700
·
1 Parent(s): 9f233f1

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -3
README.md CHANGED
@@ -31,12 +31,15 @@ More information needed
31
 
32
  The following hyperparameters were used during training:
33
  - learning_rate: 5e-05
34
- - train_batch_size: 8
35
- - eval_batch_size: 8
36
  - seed: 42
 
 
37
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
38
  - lr_scheduler_type: linear
39
- - num_epochs: 3.0
 
40
 
41
  ### Framework versions
42
 
 
31
 
32
  The following hyperparameters were used during training:
33
  - learning_rate: 5e-05
34
+ - train_batch_size: 1
35
+ - eval_batch_size: 1
36
  - seed: 42
37
+ - gradient_accumulation_steps: 8
38
+ - total_train_batch_size: 8
39
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
40
  - lr_scheduler_type: linear
41
+ - lr_scheduler_warmup_ratio: 0.1
42
+ - num_epochs: 1
43
 
44
  ### Framework versions
45