EngLip commited on
Commit
3af2faa
·
1 Parent(s): 5071174

End of training

Browse files
README.md CHANGED
@@ -6,23 +6,23 @@ tags:
6
  metrics:
7
  - rouge
8
  model-index:
9
- - name: flan-t5-base-fineTuned
10
  results: []
11
  ---
12
 
13
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
  should probably proofread and complete it, then remove this comment. -->
15
 
16
- # flan-t5-base-fineTuned
17
 
18
  This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 0.1995
21
- - Rouge1: 93.7396
22
- - Rouge2: 85.7064
23
- - Rougel: 93.7508
24
- - Rougelsum: 93.861
25
- - Gen Len: 11.7872
26
 
27
  ## Model description
28
 
@@ -53,16 +53,16 @@ The following hyperparameters were used during training:
53
 
54
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
55
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
56
- | No log | 1.0 | 30 | 0.3947 | 88.6458 | 75.4031 | 88.1675 | 88.2718 | 11.0851 |
57
- | No log | 2.0 | 60 | 0.2608 | 92.2792 | 81.347 | 92.3561 | 92.3951 | 11.7234 |
58
- | No log | 3.0 | 90 | 0.2242 | 93.0917 | 83.4787 | 93.1643 | 93.1916 | 11.7660 |
59
- | No log | 4.0 | 120 | 0.2056 | 93.7531 | 86.045 | 93.8218 | 93.8514 | 11.8298 |
60
- | No log | 5.0 | 150 | 0.1995 | 93.7396 | 85.7064 | 93.7508 | 93.861 | 11.7872 |
61
- | No log | 6.0 | 180 | 0.2021 | 93.5965 | 85.2921 | 93.6819 | 93.7096 | 11.8298 |
62
- | No log | 7.0 | 210 | 0.2089 | 93.5965 | 85.2921 | 93.6819 | 93.7096 | 11.8298 |
63
- | No log | 8.0 | 240 | 0.2073 | 93.8289 | 85.7832 | 93.8623 | 93.9475 | 11.8298 |
64
- | No log | 9.0 | 270 | 0.2083 | 93.8289 | 85.7832 | 93.8623 | 93.9475 | 11.8298 |
65
- | No log | 10.0 | 300 | 0.2087 | 93.8289 | 85.7832 | 93.8623 | 93.9475 | 11.8298 |
66
 
67
 
68
  ### Framework versions
 
6
  metrics:
7
  - rouge
8
  model-index:
9
+ - name: flan-t5-sentence-generator
10
  results: []
11
  ---
12
 
13
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
  should probably proofread and complete it, then remove this comment. -->
15
 
16
+ # flan-t5-sentence-generator
17
 
18
  This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 0.3271
21
+ - Rouge1: 92.6712
22
+ - Rouge2: 82.7566
23
+ - Rougel: 92.6246
24
+ - Rougelsum: 92.5733
25
+ - Gen Len: 12.6809
26
 
27
  ## Model description
28
 
 
53
 
54
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
55
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
56
+ | No log | 1.0 | 38 | 0.4665 | 87.5888 | 72.8489 | 87.0237 | 87.1042 | 11.5745 |
57
+ | No log | 2.0 | 76 | 0.3577 | 90.7662 | 79.8453 | 90.443 | 90.4784 | 12.2340 |
58
+ | No log | 3.0 | 114 | 0.3342 | 92.0014 | 81.8411 | 91.999 | 91.9489 | 12.4468 |
59
+ | No log | 4.0 | 152 | 0.3343 | 92.3868 | 81.5074 | 92.2937 | 92.2943 | 12.5319 |
60
+ | No log | 5.0 | 190 | 0.3517 | 92.7314 | 83.1921 | 92.7259 | 92.6681 | 12.7660 |
61
+ | No log | 6.0 | 228 | 0.3271 | 92.6712 | 82.7566 | 92.6246 | 92.5733 | 12.6809 |
62
+ | No log | 7.0 | 266 | 0.3285 | 92.7106 | 82.4425 | 92.7382 | 92.6212 | 12.6809 |
63
+ | No log | 8.0 | 304 | 0.3379 | 92.9469 | 83.0373 | 92.9539 | 92.8683 | 12.6596 |
64
+ | No log | 9.0 | 342 | 0.3318 | 93.217 | 83.9024 | 93.1868 | 93.1101 | 12.7234 |
65
+ | No log | 10.0 | 380 | 0.3336 | 93.0582 | 83.3947 | 93.053 | 92.9652 | 12.7021 |
66
 
67
 
68
  ### Framework versions
logs/events.out.tfevents.1700925733.dfb6af92617b.149.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7505139ce47d94ad3517e15ce72d6d638d8a05c41c3b5250b6143c167d70122c
3
+ size 10849
logs/events.out.tfevents.1700926022.dfb6af92617b.149.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a526d3b7089af51dc9c7f08f080b85bb5f840b19f69d0bd6cc87e0d643de7e6b
3
+ size 613
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a087cfd2d0a4f7abf462446420d5f54adab7c04a273012b869140cbd6e07193d
3
  size 990345064
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a6d445005957e65a0582b1f1fea53b7c4fe1c0f1f2369597c2c64bfa4ee019a1
3
  size 990345064
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d421d9df02370b2b2f680163ae8799d67598e42c2c4aed6d5e11518315339a9b
3
  size 4728
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:01bd10ceec4acca8e4abdd450a6e9f260449d04786bd428881b0727fdca9371a
3
  size 4728