hawalurahman commited on
Commit
c36fc30
·
verified ·
1 Parent(s): 039ea29

End of training

Browse files
Files changed (2) hide show
  1. README.md +13 -13
  2. model.safetensors +1 -1
README.md CHANGED
@@ -19,13 +19,13 @@ should probably proofread and complete it, then remove this comment. -->
19
 
20
  This model is a fine-tuned version of [google/mt5-base](https://huggingface.co/google/mt5-base) on the None dataset.
21
  It achieves the following results on the evaluation set:
22
- - Loss: 1.9476
23
- - Rouge1: 0.6394
24
- - Rouge2: 0.3540
25
- - Rougel: 0.6391
26
- - Rougelsum: 0.6397
27
- - Bleu: 0.4246
28
- - Exact Match: 0.4133
29
 
30
  ## Model description
31
 
@@ -44,7 +44,7 @@ More information needed
44
  ### Training hyperparameters
45
 
46
  The following hyperparameters were used during training:
47
- - learning_rate: 0.0003
48
  - train_batch_size: 8
49
  - eval_batch_size: 8
50
  - seed: 42
@@ -56,11 +56,11 @@ The following hyperparameters were used during training:
56
 
57
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Bleu | Exact Match |
58
  |:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:---------:|:------:|:-----------:|
59
- | 0.5058 | 1.0 | 2000 | 1.1567 | 0.6147 | 0.3280 | 0.6147 | 0.6144 | 0.3739 | 0.3912 |
60
- | 0.1688 | 2.0 | 4000 | 1.3225 | 0.6180 | 0.3428 | 0.6180 | 0.6184 | 0.4093 | 0.375 |
61
- | 0.0799 | 3.0 | 6000 | 1.6096 | 0.6397 | 0.3591 | 0.6399 | 0.6402 | 0.4213 | 0.404 |
62
- | 0.0298 | 4.0 | 8000 | 1.8350 | 0.6431 | 0.3549 | 0.6428 | 0.6430 | 0.4250 | 0.406 |
63
- | 0.0113 | 5.0 | 10000 | 1.9476 | 0.6394 | 0.3540 | 0.6391 | 0.6397 | 0.4246 | 0.4133 |
64
 
65
 
66
  ### Framework versions
 
19
 
20
  This model is a fine-tuned version of [google/mt5-base](https://huggingface.co/google/mt5-base) on the None dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 1.5600
23
+ - Rouge1: 0.7136
24
+ - Rouge2: 0.4062
25
+ - Rougel: 0.7131
26
+ - Rougelsum: 0.7129
27
+ - Bleu: 0.4978
28
+ - Exact Match: 0.4803
29
 
30
  ## Model description
31
 
 
44
  ### Training hyperparameters
45
 
46
  The following hyperparameters were used during training:
47
+ - learning_rate: 0.0001
48
  - train_batch_size: 8
49
  - eval_batch_size: 8
50
  - seed: 42
 
56
 
57
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Bleu | Exact Match |
58
  |:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:---------:|:------:|:-----------:|
59
+ | 0.5439 | 1.0 | 2000 | 0.9280 | 0.6925 | 0.4012 | 0.6919 | 0.6921 | 0.4703 | 0.4422 |
60
+ | 0.2443 | 2.0 | 4000 | 1.0939 | 0.6986 | 0.3915 | 0.6984 | 0.6984 | 0.4537 | 0.4525 |
61
+ | 0.1263 | 3.0 | 6000 | 1.2665 | 0.7005 | 0.3898 | 0.7004 | 0.7005 | 0.4569 | 0.4723 |
62
+ | 0.0769 | 4.0 | 8000 | 1.5002 | 0.7159 | 0.4065 | 0.7158 | 0.7157 | 0.4987 | 0.4828 |
63
+ | 0.0507 | 5.0 | 10000 | 1.5600 | 0.7136 | 0.4062 | 0.7131 | 0.7129 | 0.4978 | 0.4803 |
64
 
65
 
66
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c378e7db0420fe4e6e5edd5bed0128d5c3d05ca761be157b3f4151ae627a65e9
3
  size 2329638768
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:53f1656807ae27f58454b28586583abf0d8dafd2dd67f53e0bde2d4d26ec630a
3
  size 2329638768