End of training

Files changed (7)

README.md CHANGED

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [JackFram/llama-68m](https://huggingface.co/JackFram/llama-68m) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.7769
 ## Model description
@@ -38,15 +38,19 @@ The following hyperparameters were used during training:
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 0.1
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.9217        | 0.1   | 82   | 3.7769          |
 ### Framework versions

 This model is a fine-tuned version of [JackFram/llama-68m](https://huggingface.co/JackFram/llama-68m) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.3252
 ## Model description
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42
+- gradient_accumulation_steps: 16
+- total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 3
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 3.6617        | 0.99  | 50   | 3.5323          |
+| 3.4191        | 1.99  | 101  | 3.3614          |
+| 3.3489        | 2.96  | 150  | 3.3252          |
 ### Framework versions
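
The two added hyperparameter lines are related by simple arithmetic: `total_train_batch_size` is the per-device batch size multiplied by the number of accumulation steps (and by the device count, which the README does not list). A minimal sketch of that relationship, assuming a single device; the function name `effective_batch_size` and the `num_devices` parameter are illustrative and not part of any library:

```python
def effective_batch_size(train_batch_size: int,
                         gradient_accumulation_steps: int,
                         num_devices: int = 1) -> int:
    """Number of examples consumed per optimizer step.

    num_devices=1 is an assumption; the README does not state the device count.
    """
    return train_batch_size * gradient_accumulation_steps * num_devices


# Values taken from the updated README hyperparameters.
total = effective_batch_size(train_batch_size=4, gradient_accumulation_steps=16)
print(total)
```

This plausibly also explains the step counts: the earlier run logged 82 steps for 0.1 epoch at batch size 4, while the new run logs roughly 50 optimizer steps per epoch at an effective batch size of 64.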

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c2cfb8d24ef0503d16481c5866c32c0a555b202357204aacca5b081ff0fbe4b1
 size 272123144

 version https://git-lfs.github.com/spec/v1
+oid sha256:f42ba90052ed9d888fd8682bf2a55ad3791cd0c27b7ab168f73e1310636e65a1
 size 272123144
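
The binary files in this commit appear only as Git LFS pointer files: three `key value` lines giving the spec version, the content hash, and the size in bytes. Here the `oid` changes while `size` stays at 272123144, which is what one would expect when weights are retrained without changing the architecture. A small sketch of reading such a pointer; `parse_lfs_pointer` is a hypothetical helper written for illustration, not a Git LFS or Hugging Face API:

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is a key, one space, then the value.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid has the form '<hash algorithm>:<hex digest>'.
    hash_algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the new side of the model.safetensors diff.
new_pointer = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:f42ba90052ed9d888fd8682bf2a55ad3791cd0c27b7ab168f73e1310636e65a1
size 272123144
""")
print(new_pointer["hash_algo"], new_pointer["size"])
```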

runs/Jan03_13-21-43_Noah-Desktop/events.out.tfevents.1704309706.Noah-Desktop.16252.0 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:27bf7d45006c1abf8042fcceb550c3e974018445a9ae02e93fcf8dab9ff0bfb7
-size 24279

 version https://git-lfs.github.com/spec/v1
+oid sha256:28ac263c0d91f3fdb68bba8bec90006eee277e5db912bdecf5cb0aaf28856cec
+size 28047

runs/Jan03_13-35-04_Noah-Desktop/events.out.tfevents.1704310507.Noah-Desktop.7340.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:939a222610b40082448d02cdd56e48c758878ad9719f03fe7e4dce62560ad5b2
+size 4582

runs/Jan03_13-37-14_Noah-Desktop/events.out.tfevents.1704310637.Noah-Desktop.7180.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:0fd4515a0a6f72c86ebf267e396efb1e9c18fa3ee196a6ca86c2ea7fdcc2a835
+size 8364

runs/Jan03_13-37-14_Noah-Desktop/events.out.tfevents.1704312119.Noah-Desktop.7180.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:e1ec53dfb151d4643b5d14132d2031f33f15393f026f43e1b698dfe71d5d05ca
+size 359

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:78b0e530aa2c85daa07d82e229d9790727de32ec678d644f0e7efac2f65a93d9
 size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:3839dd655a5e3e6f1b65e848f8604b036ccd54f404d4e04b63d8d37d7d745dc7
 size 4728