End of training
Browse files- README.md +28 -9
- model.safetensors +1 -1
- runs/Mar25_17-17-44_supermicro/events.out.tfevents.1711358266.supermicro.3476667.0 +3 -0
- runs/Mar25_17-19-08_supermicro/events.out.tfevents.1711358351.supermicro.3477082.0 +3 -0
- runs/Mar25_17-21-29_supermicro/events.out.tfevents.1711358491.supermicro.3477633.0 +3 -0
- runs/Mar25_17-23-20_supermicro/events.out.tfevents.1711358602.supermicro.3477976.0 +3 -0
- runs/Mar25_17-47-44_supermicro/events.out.tfevents.1711360076.supermicro.3481966.0 +3 -0
- runs/Mar25_20-07-49_supermicro/events.out.tfevents.1711368471.supermicro.3520570.0 +3 -0
- training_args.bin +1 -1
README.md
CHANGED
@@ -23,13 +23,13 @@ model-index:
|
|
23 |
metrics:
|
24 |
- name: Precision
|
25 |
type: precision
|
26 |
-
value: 0.
|
27 |
- name: Recall
|
28 |
type: recall
|
29 |
-
value: 0.
|
30 |
- name: F1
|
31 |
type: f1
|
32 |
-
value: 0.
|
33 |
---
|
34 |
|
35 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
@@ -39,10 +39,10 @@ should probably proofread and complete it, then remove this comment. -->
|
|
39 |
|
40 |
This model is a fine-tuned version of [google-bert/bert-base-chinese](https://huggingface.co/google-bert/bert-base-chinese) on the generator dataset.
|
41 |
It achieves the following results on the evaluation set:
|
42 |
-
- Loss: 0.
|
43 |
-
- Precision: 0.
|
44 |
-
- Recall: 0.
|
45 |
-
- F1: 0.
|
46 |
|
47 |
## Model description
|
48 |
|
@@ -67,13 +67,32 @@ The following hyperparameters were used during training:
|
|
67 |
- seed: 42
|
68 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
69 |
- lr_scheduler_type: linear
|
70 |
-
- num_epochs:
|
71 |
|
72 |
### Training results
|
73 |
|
74 |
| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 |
|
75 |
|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|
|
76 |
-
| 0.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
77 |
|
78 |
|
79 |
### Framework versions
|
|
|
23 |
metrics:
|
24 |
- name: Precision
|
25 |
type: precision
|
26 |
+
value: 0.901610712050607
|
27 |
- name: Recall
|
28 |
type: recall
|
29 |
+
value: 0.8982985303950894
|
30 |
- name: F1
|
31 |
type: f1
|
32 |
+
value: 0.8999515736949341
|
33 |
---
|
34 |
|
35 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
|
39 |
|
40 |
This model is a fine-tuned version of [google-bert/bert-base-chinese](https://huggingface.co/google-bert/bert-base-chinese) on the generator dataset.
|
41 |
It achieves the following results on the evaluation set:
|
42 |
+
- Loss: 0.5195
|
43 |
+
- Precision: 0.9016
|
44 |
+
- Recall: 0.8983
|
45 |
+
- F1: 0.9000
|
46 |
|
47 |
## Model description
|
48 |
|
|
|
67 |
- seed: 42
|
68 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
69 |
- lr_scheduler_type: linear
|
70 |
+
- num_epochs: 20
|
71 |
|
72 |
### Training results
|
73 |
|
74 |
| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 |
|
75 |
|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|
|
76 |
+
| 0.2099 | 1.0 | 416 | 0.1940 | 0.8281 | 0.8152 | 0.8216 |
|
77 |
+
| 0.1658 | 2.0 | 832 | 0.1799 | 0.8464 | 0.8590 | 0.8527 |
|
78 |
+
| 0.1276 | 3.0 | 1248 | 0.1821 | 0.8795 | 0.8639 | 0.8716 |
|
79 |
+
| 0.1076 | 4.0 | 1664 | 0.1961 | 0.8903 | 0.8788 | 0.8845 |
|
80 |
+
| 0.0792 | 5.0 | 2080 | 0.2277 | 0.8787 | 0.8869 | 0.8828 |
|
81 |
+
| 0.054 | 6.0 | 2496 | 0.2395 | 0.9084 | 0.8701 | 0.8888 |
|
82 |
+
| 0.0433 | 7.0 | 2912 | 0.2991 | 0.8999 | 0.8915 | 0.8957 |
|
83 |
+
| 0.0288 | 8.0 | 3328 | 0.3374 | 0.8919 | 0.8935 | 0.8927 |
|
84 |
+
| 0.022 | 9.0 | 3744 | 0.3752 | 0.9054 | 0.8921 | 0.8987 |
|
85 |
+
| 0.0211 | 10.0 | 4160 | 0.4105 | 0.8952 | 0.8985 | 0.8968 |
|
86 |
+
| 0.0147 | 11.0 | 4576 | 0.4084 | 0.9013 | 0.9004 | 0.9009 |
|
87 |
+
| 0.0095 | 12.0 | 4992 | 0.4542 | 0.9047 | 0.8952 | 0.8999 |
|
88 |
+
| 0.01 | 13.0 | 5408 | 0.4516 | 0.9086 | 0.8896 | 0.8990 |
|
89 |
+
| 0.0087 | 14.0 | 5824 | 0.4521 | 0.9025 | 0.8935 | 0.8980 |
|
90 |
+
| 0.0069 | 15.0 | 6240 | 0.4878 | 0.9034 | 0.9022 | 0.9028 |
|
91 |
+
| 0.0042 | 16.0 | 6656 | 0.5097 | 0.9021 | 0.8997 | 0.9009 |
|
92 |
+
| 0.006 | 17.0 | 7072 | 0.5195 | 0.9054 | 0.9008 | 0.9031 |
|
93 |
+
| 0.0043 | 18.0 | 7488 | 0.5032 | 0.9009 | 0.8977 | 0.8993 |
|
94 |
+
| 0.0029 | 19.0 | 7904 | 0.5155 | 0.9003 | 0.8962 | 0.8983 |
|
95 |
+
| 0.0034 | 20.0 | 8320 | 0.5195 | 0.9016 | 0.8983 | 0.9000 |
|
96 |
|
97 |
|
98 |
### Framework versions
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 406740756
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:eaca2c3e15c3803c6c28e9e3329fe545137b3efa4fd04479028e241bc0395385
|
3 |
size 406740756
|
runs/Mar25_17-17-44_supermicro/events.out.tfevents.1711358266.supermicro.3476667.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:67af6b62cb9c59dbbd7fae4d0a546b543edb5a98ad3061c17ecd966e23597978
|
3 |
+
size 6059
|
runs/Mar25_17-19-08_supermicro/events.out.tfevents.1711358351.supermicro.3477082.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:fd138fd39e0f742ba5e67f54859bb152bf8e07b65d88fc72d0667fe47312ffb3
|
3 |
+
size 6059
|
runs/Mar25_17-21-29_supermicro/events.out.tfevents.1711358491.supermicro.3477633.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:dff08e3e8ed7cfe5e03d5294f0f2bf4b9995da0978188b98c343470a61b8f230
|
3 |
+
size 6059
|
runs/Mar25_17-23-20_supermicro/events.out.tfevents.1711358602.supermicro.3477976.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:e27a9b6420bcccdbe4047d172b0e41e3e2920ea876b7d1942116ffd93c81d99a
|
3 |
+
size 5420
|
runs/Mar25_17-47-44_supermicro/events.out.tfevents.1711360076.supermicro.3481966.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:badbfd51f88692b775675e3a459f4dcbd80f45527312e501848a0012d5060f8f
|
3 |
+
size 5705
|
runs/Mar25_20-07-49_supermicro/events.out.tfevents.1711368471.supermicro.3520570.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:c2ade37db64814b35db210512ead2f1975dccb0d913ef8e419a34e503c6692b0
|
3 |
+
size 26443
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4283
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:c43b79607ac73360dfeb4bec006e8c76146ab430f38f63edf84e3d26d9e0603c
|
3 |
size 4283
|