Commit 6e22284 (verified), committed by marijajolovic
1 Parent(s): fb0779c

End of training

Files changed (1)
  1. README.md +8 -1
README.md CHANGED
@@ -15,6 +15,8 @@ should probably proofread and complete it, then remove this comment. -->
 # starcoder-llm-7b-base
 
 This model is a fine-tuned version of [bigcode/starcoderbase-7b](https://huggingface.co/bigcode/starcoderbase-7b) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.9413
 
 ## Model description
 
@@ -43,10 +45,15 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.1
 - lr_scheduler_warmup_steps: 30
-- training_steps: 2
+- training_steps: 375
 
 ### Training results
 
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:------:|:----:|:---------------:|
+| 0.5705 | 0.2667 | 100 | 0.9178 |
+| 0.8332 | 0.5333 | 200 | 0.9455 |
+| 1.6848 | 0.8 | 300 | 0.9413 |
 
 
 ### Framework versions
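
Reviewer note: the hyperparameters and results table in this diff can be sanity-checked with a little arithmetic. The sketch below is not taken from the training script. It assumes the usual shape of a cosine schedule with linear warmup (linear ramp to the peak, then half a cosine period down to zero) and uses the `lr_scheduler_warmup_steps: 30` and `training_steps: 375` values from the card. The function names are hypothetical, and the peak learning rate is not visible in this hunk, so only the multiplier applied to it is computed.

```python
import math


def cosine_warmup_multiplier(step: int, warmup_steps: int = 30, total_steps: int = 375) -> float:
    """Factor applied to the peak learning rate at a given optimizer step.

    Assumed shape: linear warmup from 0 to 1 over `warmup_steps`,
    then cosine decay from 1 to 0 over the remaining steps.
    """
    if step < warmup_steps:
        return step / max(1, warmup_steps)
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return max(0.0, 0.5 * (1.0 + math.cos(math.pi * progress)))


def estimated_steps_per_epoch(step: int, epoch_fraction: float) -> int:
    """Estimate steps per epoch from one (Step, Epoch) row of the results table."""
    return round(step / epoch_fraction)


if __name__ == "__main__":
    # Rows taken from the "Training results" table in the diff.
    for step, epoch in [(100, 0.2667), (200, 0.5333), (300, 0.8)]:
        print(step, estimated_steps_per_epoch(step, epoch), cosine_warmup_multiplier(step))
```

Each table row implies roughly 375 steps per epoch, so `training_steps: 375` corresponds to about one pass over the training data, and the last logged evaluation (step 300) happens while the schedule is still decaying.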