End of training

Files changed (9)

README.md +28 -9
model.safetensors +1 -1
runs/Mar25_17-17-44_supermicro/events.out.tfevents.1711358266.supermicro.3476667.0 +3 -0
runs/Mar25_17-19-08_supermicro/events.out.tfevents.1711358351.supermicro.3477082.0 +3 -0
runs/Mar25_17-21-29_supermicro/events.out.tfevents.1711358491.supermicro.3477633.0 +3 -0
runs/Mar25_17-23-20_supermicro/events.out.tfevents.1711358602.supermicro.3477976.0 +3 -0
runs/Mar25_17-47-44_supermicro/events.out.tfevents.1711360076.supermicro.3481966.0 +3 -0
runs/Mar25_20-07-49_supermicro/events.out.tfevents.1711368471.supermicro.3520570.0 +3 -0
training_args.bin +1 -1

README.md CHANGED

@@ -23,13 +23,13 @@ model-index:
     metrics:
     - name: Precision
       type: precision
-      value: 0.6153846153498767
     - name: Recall
       type: recall
-      value: 0.804091266599492
     - name: F1
       type: f1
-      value: 0.6971945083321585
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -39,10 +39,10 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google-bert/bert-base-chinese](https://huggingface.co/google-bert/bert-base-chinese) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1990
-- Precision: 0.6154
-- Recall: 0.8041
-- F1: 0.6972
 ## Model description
@@ -67,13 +67,32 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 1
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|
-| 0.2157        | 1.0   | 416  | 0.1990          | 0.6154    | 0.8041 | 0.6972 |
 ### Framework versions

     metrics:
     - name: Precision
       type: precision
+      value: 0.901610712050607
     - name: Recall
       type: recall
+      value: 0.8982985303950894
     - name: F1
       type: f1
+      value: 0.8999515736949341
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [google-bert/bert-base-chinese](https://huggingface.co/google-bert/bert-base-chinese) on the generator dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.5195
+- Precision: 0.9016
+- Recall: 0.8983
+- F1: 0.9000
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 20
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|
+| 0.2099        | 1.0   | 416  | 0.1940          | 0.8281    | 0.8152 | 0.8216 |
+| 0.1658        | 2.0   | 832  | 0.1799          | 0.8464    | 0.8590 | 0.8527 |
+| 0.1276        | 3.0   | 1248 | 0.1821          | 0.8795    | 0.8639 | 0.8716 |
+| 0.1076        | 4.0   | 1664 | 0.1961          | 0.8903    | 0.8788 | 0.8845 |
+| 0.0792        | 5.0   | 2080 | 0.2277          | 0.8787    | 0.8869 | 0.8828 |
+| 0.054         | 6.0   | 2496 | 0.2395          | 0.9084    | 0.8701 | 0.8888 |
+| 0.0433        | 7.0   | 2912 | 0.2991          | 0.8999    | 0.8915 | 0.8957 |
+| 0.0288        | 8.0   | 3328 | 0.3374          | 0.8919    | 0.8935 | 0.8927 |
+| 0.022         | 9.0   | 3744 | 0.3752          | 0.9054    | 0.8921 | 0.8987 |
+| 0.0211        | 10.0  | 4160 | 0.4105          | 0.8952    | 0.8985 | 0.8968 |
+| 0.0147        | 11.0  | 4576 | 0.4084          | 0.9013    | 0.9004 | 0.9009 |
+| 0.0095        | 12.0  | 4992 | 0.4542          | 0.9047    | 0.8952 | 0.8999 |
+| 0.01          | 13.0  | 5408 | 0.4516          | 0.9086    | 0.8896 | 0.8990 |
+| 0.0087        | 14.0  | 5824 | 0.4521          | 0.9025    | 0.8935 | 0.8980 |
+| 0.0069        | 15.0  | 6240 | 0.4878          | 0.9034    | 0.9022 | 0.9028 |
+| 0.0042        | 16.0  | 6656 | 0.5097          | 0.9021    | 0.8997 | 0.9009 |
+| 0.006         | 17.0  | 7072 | 0.5195          | 0.9054    | 0.9008 | 0.9031 |
+| 0.0043        | 18.0  | 7488 | 0.5032          | 0.9009    | 0.8977 | 0.8993 |
+| 0.0029        | 19.0  | 7904 | 0.5155          | 0.9003    | 0.8962 | 0.8983 |
+| 0.0034        | 20.0  | 8320 | 0.5195          | 0.9016    | 0.8983 | 0.9000 |
 ### Framework versions
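
The added training table can be sanity-checked with a few lines of Python. The sketch below is not part of the commit: it copies the twenty rows from the diff above, checks that each reported F1 is close to the harmonic mean of the rounded precision and recall, and finds the epochs with the lowest validation loss and the highest F1. The names `ROWS` and `f1_from` are made up for this illustration.

```python
# Rows copied from the added "Training results" table in this diff:
# (training_loss, epoch, step, validation_loss, precision, recall, f1)
ROWS = [
    (0.2099, 1, 416, 0.1940, 0.8281, 0.8152, 0.8216),
    (0.1658, 2, 832, 0.1799, 0.8464, 0.8590, 0.8527),
    (0.1276, 3, 1248, 0.1821, 0.8795, 0.8639, 0.8716),
    (0.1076, 4, 1664, 0.1961, 0.8903, 0.8788, 0.8845),
    (0.0792, 5, 2080, 0.2277, 0.8787, 0.8869, 0.8828),
    (0.0540, 6, 2496, 0.2395, 0.9084, 0.8701, 0.8888),
    (0.0433, 7, 2912, 0.2991, 0.8999, 0.8915, 0.8957),
    (0.0288, 8, 3328, 0.3374, 0.8919, 0.8935, 0.8927),
    (0.0220, 9, 3744, 0.3752, 0.9054, 0.8921, 0.8987),
    (0.0211, 10, 4160, 0.4105, 0.8952, 0.8985, 0.8968),
    (0.0147, 11, 4576, 0.4084, 0.9013, 0.9004, 0.9009),
    (0.0095, 12, 4992, 0.4542, 0.9047, 0.8952, 0.8999),
    (0.0100, 13, 5408, 0.4516, 0.9086, 0.8896, 0.8990),
    (0.0087, 14, 5824, 0.4521, 0.9025, 0.8935, 0.8980),
    (0.0069, 15, 6240, 0.4878, 0.9034, 0.9022, 0.9028),
    (0.0042, 16, 6656, 0.5097, 0.9021, 0.8997, 0.9009),
    (0.0060, 17, 7072, 0.5195, 0.9054, 0.9008, 0.9031),
    (0.0043, 18, 7488, 0.5032, 0.9009, 0.8977, 0.8993),
    (0.0029, 19, 7904, 0.5155, 0.9003, 0.8962, 0.8983),
    (0.0034, 20, 8320, 0.5195, 0.9016, 0.8983, 0.9000),
]


def f1_from(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


# Largest gap between a reported F1 and the harmonic mean of the rounded P and R.
max_gap = max(abs(f1_from(p, r) - f1) for _, _, _, _, p, r, f1 in ROWS)

# Epochs that minimise validation loss and maximise F1, respectively.
best_by_loss = min(ROWS, key=lambda row: row[3])
best_by_f1 = max(ROWS, key=lambda row: row[6])

# Optimizer steps: the table logs 416 steps per epoch over 20 epochs.
steps_per_epoch = ROWS[0][2]
total_steps = steps_per_epoch * len(ROWS)

print(f"max |F1 - harmonic mean| = {max_gap:.5f}")
print(f"lowest validation loss: epoch {best_by_loss[1]} ({best_by_loss[3]})")
print(f"highest F1: epoch {best_by_f1[1]} ({best_by_f1[6]})")
print(f"total steps: {total_steps}")
```

On these rows the lowest validation loss is at epoch 2 (0.1799), while the highest F1 is at epoch 17 (0.9031); the model card reports the final epoch-20 numbers. Validation loss rises steadily after epoch 2 as training loss approaches zero, which is a common overfitting pattern even when F1 keeps improving slightly. `transformers.TrainingArguments` offers `load_best_model_at_end` and `metric_for_best_model` for keeping the best checkpoint rather than the last one; the diff does not say whether this run used them.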

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c3b723e13ec07df97483a0b641e084364bed29a7fa2e3209840b2449c4b4f697
 size 406740756

 version https://git-lfs.github.com/spec/v1
+oid sha256:eaca2c3e15c3803c6c28e9e3329fe545137b3efa4fd04479028e241bc0395385
 size 406740756
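
The `model.safetensors` entry above is a Git LFS pointer, not the weights themselves: a small text file with `version`, `oid`, and `size` lines. As a minimal sketch (the helper name `parse_lfs_pointer` is hypothetical and not part of any library), such a pointer can be parsed like this, using the new pointer from this commit as input:

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line has the form "<key> <value>", separated by a single space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# New pointer for model.safetensors, copied from the diff above.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:eaca2c3e15c3803c6c28e9e3329fe545137b3efa4fd04479028e241bc0395385
size 406740756
"""

pointer = parse_lfs_pointer(NEW_POINTER)
print(pointer["hash_algo"], pointer["digest"][:12], pointer["size"])
```

The `size` is identical before and after (406740756 bytes) while the `oid` differs, which is what one would expect when the same architecture is retrained: the file contents change but the stored tensors presumably keep the same shapes and dtypes.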

runs/Mar25_17-17-44_supermicro/events.out.tfevents.1711358266.supermicro.3476667.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:67af6b62cb9c59dbbd7fae4d0a546b543edb5a98ad3061c17ecd966e23597978
+size 6059

runs/Mar25_17-19-08_supermicro/events.out.tfevents.1711358351.supermicro.3477082.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:fd138fd39e0f742ba5e67f54859bb152bf8e07b65d88fc72d0667fe47312ffb3
+size 6059

runs/Mar25_17-21-29_supermicro/events.out.tfevents.1711358491.supermicro.3477633.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:dff08e3e8ed7cfe5e03d5294f0f2bf4b9995da0978188b98c343470a61b8f230
+size 6059

runs/Mar25_17-23-20_supermicro/events.out.tfevents.1711358602.supermicro.3477976.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e27a9b6420bcccdbe4047d172b0e41e3e2920ea876b7d1942116ffd93c81d99a
+size 5420

runs/Mar25_17-47-44_supermicro/events.out.tfevents.1711360076.supermicro.3481966.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:badbfd51f88692b775675e3a459f4dcbd80f45527312e501848a0012d5060f8f
+size 5705

runs/Mar25_20-07-49_supermicro/events.out.tfevents.1711368471.supermicro.3520570.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c2ade37db64814b35db210512ead2f1975dccb0d913ef8e419a34e503c6692b0
+size 26443

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:570d20239598d150a8bdbcd3499fdf848353350942d86069d1db832c8c52e109
 size 4283

 version https://git-lfs.github.com/spec/v1
+oid sha256:c43b79607ac73360dfeb4bec006e8c76146ab430f38f63edf84e3d26d9e0603c
 size 4283