neavo commited on
Commit
1a0b784
·
verified ·
1 Parent(s): 6cfea74

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -17,7 +17,7 @@ license: apache-2.0
17
  - Trained for approximately `100` hours on `L40*7` devices, with a training volume of about `60B` tokens.
18
  - Main training parameters:
19
  - Batch Size: 1792
20
- - Learning Rate: 4e-05
21
  - Maximum Sequence Length: 512
22
  - Optimizer: adamw_torch
23
  - LR Scheduler: warmup_stable_decay
 
17
  - Trained for approximately `100` hours on `L40*7` devices, with a training volume of about `60B` tokens.
18
  - Main training parameters:
19
  - Batch Size: 1792
20
+ - Learning Rate: 5e-04
21
  - Maximum Sequence Length: 512
22
  - Optimizer: adamw_torch
23
  - LR Scheduler: warmup_stable_decay