Marko Tasic commited on
Commit
b0c036d
·
2 Parent(s): 554dc41 91d88a3

Merge branch 'main' of hf.co:tangledgroup/tangled-alpha-0.9-core

Browse files
Files changed (1) hide show
  1. README.md +19 -0
README.md CHANGED
@@ -218,3 +218,22 @@ litgpt convert_pretrained_checkpoint ../out/pretrain-core-0/final ../out/pretrai
218
  ```bash
219
  CUDA_VISIBLE_DEVICES=0 CUDA_LAUNCH_BLOCKING=0 PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True litgpt pretrain --config pretrain_core_model_1.yaml
220
  ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
218
  ```bash
219
  CUDA_VISIBLE_DEVICES=0 CUDA_LAUNCH_BLOCKING=0 PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True litgpt pretrain --config pretrain_core_model_1.yaml
220
  ```
221
+
222
+ ```
223
+ Seed set to 23
224
+ Time to instantiate model: 0.32 seconds.
225
+ Total parameters: 234,914,304
226
+ Validating ...
227
+ Measured TFLOPs: 27760.47
228
+ Epoch 1 | iter 128 step 1 | loss train: 3.205, val: 3.457 | iter time: 450.87 ms (step) remaining time: 10:13:33
229
+ Epoch 1 | iter 256 step 2 | loss train: 3.175, val: 3.457 | iter time: 386.03 ms (step) remaining time: 9:00:10
230
+ Epoch 1 | iter 384 step 3 | loss train: 3.144, val: 3.457 | iter time: 388.88 ms (step) remaining time: 8:35:53
231
+ Epoch 1 | iter 512 step 4 | loss train: 3.260, val: 3.457 | iter time: 390.66 ms (step) remaining time: 8:24:17
232
+ Epoch 1 | iter 640 step 5 | loss train: 3.247, val: 3.457 | iter time: 392.99 ms (step) remaining time: 8:17:54
233
+ Epoch 1 | iter 768 step 6 | loss train: 3.264, val: 3.457 | iter time: 397.01 ms (step) remaining time: 8:13:51
234
+ Epoch 1 | iter 896 step 7 | loss train: 3.232, val: 3.457 | iter time: 396.00 ms (step) remaining time: 8:11:00
235
+ Epoch 1 | iter 1024 step 8 | loss train: 3.287, val: 3.457 | iter time: 396.93 ms (step) remaining time: 8:08:43
236
+ Epoch 1 | iter 1152 step 9 | loss train: 3.236, val: 3.457 | iter time: 398.67 ms (step) remaining time: 8:06:49
237
+ Epoch 1 | iter 1280 step 10 | loss train: 3.274, val: 3.457 | iter time: 399.49 ms (step) remaining time: 8:05:09
238
+ # ...
239
+ ```