dung6903 commited on
Commit
bc3d26c
·
verified ·
1 Parent(s): 0f2d258

End of training

Browse files
README.md CHANGED
@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
21
 
22
  This model is a fine-tuned version of [vinai/phobert-base](https://huggingface.co/vinai/phobert-base) on the None dataset.
23
  It achieves the following results on the evaluation set:
24
- - Loss: 0.4711
25
- - Accuracy: 0.8232
26
- - F1: 0.8232
27
- - Precision: 0.8234
28
- - Recall: 0.8232
29
 
30
  ## Model description
31
 
@@ -45,8 +45,8 @@ More information needed
45
 
46
  The following hyperparameters were used during training:
47
  - learning_rate: 2e-05
48
- - train_batch_size: 16
49
- - eval_batch_size: 16
50
  - seed: 42
51
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
52
  - lr_scheduler_type: linear
@@ -56,16 +56,16 @@ The following hyperparameters were used during training:
56
 
57
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
58
  |:-------------:|:-----:|:-----:|:---------------:|:--------:|:------:|:---------:|:------:|
59
- | 0.4875 | 1.0 | 2813 | 0.4647 | 0.7732 | 0.7712 | 0.7832 | 0.7732 |
60
- | 0.426 | 2.0 | 5626 | 0.4987 | 0.7736 | 0.7685 | 0.7999 | 0.7736 |
61
- | 0.3703 | 3.0 | 8439 | 0.4647 | 0.8012 | 0.7999 | 0.8094 | 0.8012 |
62
- | 0.3185 | 4.0 | 11252 | 0.4711 | 0.8232 | 0.8232 | 0.8234 | 0.8232 |
63
- | 0.2738 | 5.0 | 14065 | 0.5240 | 0.8176 | 0.8169 | 0.8228 | 0.8176 |
64
- | 0.2279 | 6.0 | 16878 | 0.5875 | 0.8164 | 0.8164 | 0.8164 | 0.8164 |
65
- | 0.1935 | 7.0 | 19691 | 0.7180 | 0.8196 | 0.8194 | 0.8209 | 0.8196 |
66
- | 0.1671 | 8.0 | 22504 | 0.8426 | 0.8208 | 0.8205 | 0.8232 | 0.8208 |
67
- | 0.1357 | 9.0 | 25317 | 0.8800 | 0.8216 | 0.8216 | 0.8216 | 0.8216 |
68
- | 0.1073 | 10.0 | 28130 | 0.9236 | 0.8148 | 0.8148 | 0.8150 | 0.8148 |
69
 
70
 
71
  ### Framework versions
 
21
 
22
  This model is a fine-tuned version of [vinai/phobert-base](https://huggingface.co/vinai/phobert-base) on the None dataset.
23
  It achieves the following results on the evaluation set:
24
+ - Loss: 0.6822
25
+ - Accuracy: 0.8252
26
+ - F1: 0.8250
27
+ - Precision: 0.8265
28
+ - Recall: 0.8252
29
 
30
  ## Model description
31
 
 
45
 
46
  The following hyperparameters were used during training:
47
  - learning_rate: 2e-05
48
+ - train_batch_size: 32
49
+ - eval_batch_size: 32
50
  - seed: 42
51
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
52
  - lr_scheduler_type: linear
 
56
 
57
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
58
  |:-------------:|:-----:|:-----:|:---------------:|:--------:|:------:|:---------:|:------:|
59
+ | 0.5383 | 1.0 | 1407 | 0.4695 | 0.7768 | 0.7757 | 0.7825 | 0.7768 |
60
+ | 0.4349 | 2.0 | 2814 | 0.5131 | 0.7584 | 0.7493 | 0.8022 | 0.7584 |
61
+ | 0.3775 | 3.0 | 4221 | 0.4097 | 0.8132 | 0.8132 | 0.8133 | 0.8132 |
62
+ | 0.3281 | 4.0 | 5628 | 0.4962 | 0.8148 | 0.8137 | 0.8227 | 0.8148 |
63
+ | 0.2803 | 5.0 | 7035 | 0.4936 | 0.8076 | 0.8060 | 0.8183 | 0.8076 |
64
+ | 0.2293 | 6.0 | 8442 | 0.5015 | 0.824 | 0.8237 | 0.8265 | 0.824 |
65
+ | 0.1999 | 7.0 | 9849 | 0.5298 | 0.8224 | 0.8224 | 0.8224 | 0.8224 |
66
+ | 0.1646 | 8.0 | 11256 | 0.5862 | 0.8232 | 0.8230 | 0.8247 | 0.8232 |
67
+ | 0.1359 | 9.0 | 12663 | 0.6431 | 0.8248 | 0.8245 | 0.8268 | 0.8248 |
68
+ | 0.1154 | 10.0 | 14070 | 0.6822 | 0.8252 | 0.8250 | 0.8265 | 0.8252 |
69
 
70
 
71
  ### Framework versions
runs/Nov13_03-27-12_66a0168ab93c/events.out.tfevents.1731468433.66a0168ab93c.30.1 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:42a7642fe8220a1ceec10770e3916f0875bdbc5fb3727c95d503fefc68d70d18
3
- size 15161
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ffd41ddcafded5891daca31c095094a623a183285fcacf36d7549132c92889e7
3
+ size 15987
runs/Nov13_03-27-12_66a0168ab93c/events.out.tfevents.1731475420.66a0168ab93c.30.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:89ebc22cbec8da0bfbdcd18d5477c358005e78888d2bd9f19a52ff46d812e02d
3
+ size 560