NowaBwagel0 commited on
Commit
01421e5
·
1 Parent(s): cc49935

End of training

Browse files
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [JackFram/llama-68m](https://huggingface.co/JackFram/llama-68m) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 3.7769
19
 
20
  ## Model description
21
 
@@ -38,15 +38,19 @@ The following hyperparameters were used during training:
38
  - train_batch_size: 4
39
  - eval_batch_size: 4
40
  - seed: 42
 
 
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
- - num_epochs: 0.1
44
 
45
  ### Training results
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
- | 3.9217 | 0.1 | 82 | 3.7769 |
 
 
50
 
51
 
52
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [JackFram/llama-68m](https://huggingface.co/JackFram/llama-68m) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 3.3252
19
 
20
  ## Model description
21
 
 
38
  - train_batch_size: 4
39
  - eval_batch_size: 4
40
  - seed: 42
41
+ - gradient_accumulation_steps: 16
42
+ - total_train_batch_size: 64
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: linear
45
+ - num_epochs: 3
46
 
47
  ### Training results
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:----:|:---------------:|
51
+ | 3.6617 | 0.99 | 50 | 3.5323 |
52
+ | 3.4191 | 1.99 | 101 | 3.3614 |
53
+ | 3.3489 | 2.96 | 150 | 3.3252 |
54
 
55
 
56
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c2cfb8d24ef0503d16481c5866c32c0a555b202357204aacca5b081ff0fbe4b1
3
  size 272123144
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f42ba90052ed9d888fd8682bf2a55ad3791cd0c27b7ab168f73e1310636e65a1
3
  size 272123144
runs/Jan03_13-21-43_Noah-Desktop/events.out.tfevents.1704309706.Noah-Desktop.16252.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:27bf7d45006c1abf8042fcceb550c3e974018445a9ae02e93fcf8dab9ff0bfb7
3
- size 24279
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:28ac263c0d91f3fdb68bba8bec90006eee277e5db912bdecf5cb0aaf28856cec
3
+ size 28047
runs/Jan03_13-35-04_Noah-Desktop/events.out.tfevents.1704310507.Noah-Desktop.7340.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:939a222610b40082448d02cdd56e48c758878ad9719f03fe7e4dce62560ad5b2
3
+ size 4582
runs/Jan03_13-37-14_Noah-Desktop/events.out.tfevents.1704310637.Noah-Desktop.7180.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0fd4515a0a6f72c86ebf267e396efb1e9c18fa3ee196a6ca86c2ea7fdcc2a835
3
+ size 8364
runs/Jan03_13-37-14_Noah-Desktop/events.out.tfevents.1704312119.Noah-Desktop.7180.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e1ec53dfb151d4643b5d14132d2031f33f15393f026f43e1b698dfe71d5d05ca
3
+ size 359
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:78b0e530aa2c85daa07d82e229d9790727de32ec678d644f0e7efac2f65a93d9
3
  size 4728
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3839dd655a5e3e6f1b65e848f8604b036ccd54f404d4e04b63d8d37d7d745dc7
3
  size 4728