nandavikas16 committed on
Commit cb0f142 · verified · 1 Parent(s): ef5ee1a

Model save

README.md CHANGED
@@ -17,11 +17,11 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [facebook/bart-large-cnn](https://huggingface.co/facebook/bart-large-cnn) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.0570
- - Rouge1: 73.0523
- - Rouge2: 57.189
- - Rougel: 63.7863
- - Rougelsum: 63.6277
+ - Loss: 0.0320
+ - Rouge1: 68.9308
+ - Rouge2: 57.4053
+ - Rougel: 66.8076
+ - Rougelsum: 66.8081
 
  ## Model description
 
@@ -52,26 +52,26 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
- | 0.9009 | 1.0 | 43 | 0.7132 | 37.6307 | 19.9878 | 28.3308 | 28.3651 |
- | 0.6239 | 2.0 | 86 | 0.6394 | 39.1948 | 21.1172 | 29.1431 | 29.2046 |
- | 0.5428 | 3.0 | 129 | 0.5885 | 57.4199 | 37.8978 | 43.0583 | 43.0034 |
- | 0.4857 | 4.0 | 172 | 0.5204 | 57.0063 | 36.0008 | 43.8824 | 43.6754 |
- | 0.4436 | 5.0 | 215 | 0.4770 | 60.0159 | 41.1676 | 45.8763 | 45.767 |
- | 0.3918 | 6.0 | 258 | 0.4266 | 62.6838 | 43.0127 | 48.1281 | 47.8622 |
- | 0.3526 | 7.0 | 301 | 0.3420 | 63.2047 | 44.3474 | 49.3164 | 49.1797 |
- | 0.3138 | 8.0 | 344 | 0.3241 | 63.0339 | 43.0567 | 48.2804 | 48.2779 |
- | 0.2655 | 9.0 | 387 | 0.2640 | 66.8968 | 47.6739 | 54.0001 | 53.9609 |
- | 0.2215 | 10.0 | 430 | 0.2101 | 66.4286 | 48.4482 | 54.3566 | 54.5065 |
- | 0.1707 | 11.0 | 473 | 0.1497 | 67.4439 | 50.1131 | 57.1864 | 57.168 |
- | 0.1351 | 12.0 | 516 | 0.1275 | 70.2057 | 53.5349 | 59.2034 | 59.207 |
- | 0.1033 | 13.0 | 559 | 0.1161 | 70.6876 | 52.8255 | 59.3882 | 59.1475 |
- | 0.0764 | 14.0 | 602 | 0.0798 | 71.8239 | 54.9631 | 61.0645 | 60.7883 |
- | 0.0544 | 15.0 | 645 | 0.0784 | 73.1602 | 56.4301 | 62.3908 | 62.4026 |
- | 0.0433 | 16.0 | 688 | 0.0627 | 71.4275 | 55.8176 | 61.6217 | 61.6363 |
- | 0.0313 | 17.0 | 731 | 0.0573 | 73.1143 | 57.4068 | 65.0764 | 64.9105 |
- | 0.0248 | 18.0 | 774 | 0.0583 | 73.2775 | 57.1978 | 64.5147 | 64.5203 |
- | 0.0188 | 19.0 | 817 | 0.0600 | 74.0462 | 59.1939 | 65.6102 | 65.4091 |
- | 0.0166 | 20.0 | 860 | 0.0570 | 73.0523 | 57.189 | 63.7863 | 63.6277 |
+ | 0.5951 | 1.0 | 43 | 0.3629 | 55.5555 | 36.8881 | 50.5214 | 50.3938 |
+ | 0.355 | 2.0 | 86 | 0.3255 | 56.5617 | 38.8383 | 52.1122 | 52.1338 |
+ | 0.3034 | 3.0 | 129 | 0.2861 | 62.6103 | 39.5617 | 58.1436 | 58.0926 |
+ | 0.2739 | 4.0 | 172 | 0.2438 | 59.8522 | 42.3079 | 56.2315 | 56.1528 |
+ | 0.2419 | 5.0 | 215 | 0.2209 | 60.6664 | 42.1228 | 56.5815 | 56.5675 |
+ | 0.2181 | 6.0 | 258 | 0.1923 | 66.3468 | 45.9889 | 60.7055 | 60.7051 |
+ | 0.1954 | 7.0 | 301 | 0.1744 | 62.4993 | 46.8847 | 57.8644 | 57.9921 |
+ | 0.1652 | 8.0 | 344 | 0.1448 | 63.1495 | 46.7447 | 58.7703 | 58.7584 |
+ | 0.1351 | 9.0 | 387 | 0.1211 | 61.9121 | 48.057 | 58.2305 | 58.3135 |
+ | 0.0984 | 10.0 | 430 | 0.1181 | 64.3015 | 49.9335 | 60.9836 | 61.0381 |
+ | 0.0701 | 11.0 | 473 | 0.0729 | 69.2083 | 55.1905 | 66.5834 | 66.5658 |
+ | 0.0515 | 12.0 | 516 | 0.0518 | 67.9952 | 54.2532 | 65.1305 | 65.071 |
+ | 0.0361 | 13.0 | 559 | 0.0590 | 62.9846 | 51.6252 | 61.847 | 61.8582 |
+ | 0.0266 | 14.0 | 602 | 0.0481 | 67.7713 | 53.66 | 65.2245 | 65.087 |
+ | 0.0194 | 15.0 | 645 | 0.0461 | 67.7427 | 54.1529 | 64.7487 | 64.5543 |
+ | 0.0129 | 16.0 | 688 | 0.0247 | 68.6572 | 56.1596 | 65.8355 | 65.7232 |
+ | 0.008 | 17.0 | 731 | 0.0357 | 67.562 | 53.9853 | 64.5544 | 64.4777 |
+ | 0.0062 | 18.0 | 774 | 0.0435 | 68.3624 | 55.6939 | 65.9531 | 65.7978 |
+ | 0.0052 | 19.0 | 817 | 0.0373 | 65.4457 | 53.3074 | 62.8389 | 63.0017 |
+ | 0.0043 | 20.0 | 860 | 0.0320 | 68.9308 | 57.4053 | 66.8076 | 66.8081 |
 
 
  ### Framework versions
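
Review note on the updated README.md table: the headline metrics (Loss 0.0320, Rouge1 68.9308) are the epoch-20 row, which is not the best row in the table on either measure. The sketch below is one way to check this from the table values alone. The names `RESULTS`, `best_by_loss`, and `best_by_rouge1` are hypothetical helpers written for this note, not part of the commit or of any library; only three of the table's columns are copied in.

```python
# (epoch, validation_loss, rouge1) copied from the "+" rows of the
# updated training-results table in README.md. Hypothetical helper data.
RESULTS = [
    (1, 0.3629, 55.5555),
    (2, 0.3255, 56.5617),
    (3, 0.2861, 62.6103),
    (4, 0.2438, 59.8522),
    (5, 0.2209, 60.6664),
    (6, 0.1923, 66.3468),
    (7, 0.1744, 62.4993),
    (8, 0.1448, 63.1495),
    (9, 0.1211, 61.9121),
    (10, 0.1181, 64.3015),
    (11, 0.0729, 69.2083),
    (12, 0.0518, 67.9952),
    (13, 0.0590, 62.9846),
    (14, 0.0481, 67.7713),
    (15, 0.0461, 67.7427),
    (16, 0.0247, 68.6572),
    (17, 0.0357, 67.562),
    (18, 0.0435, 68.3624),
    (19, 0.0373, 65.4457),
    (20, 0.0320, 68.9308),
]


def best_by_loss(rows):
    """Return the row with the lowest validation loss."""
    return min(rows, key=lambda row: row[1])


def best_by_rouge1(rows):
    """Return the row with the highest Rouge1 score."""
    return max(rows, key=lambda row: row[2])


if __name__ == "__main__":
    print("final epoch:      ", RESULTS[-1])
    print("lowest val. loss: ", best_by_loss(RESULTS))
    print("highest Rouge1:   ", best_by_rouge1(RESULTS))
```

On these values the lowest validation loss is at epoch 16 (0.0247) and the highest Rouge1 is at epoch 11 (69.2083), so the saved final checkpoint may not be the strongest one from this run.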
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:1df05ce8b479d733c9abaaf6b70f18d7668e339185be6215604ad0b1fbcff60b
+ oid sha256:dc1d27ebdce9a47f657fb8eb9d57561de0a522284d89217faa9a3aff5b67520a
  size 1625422896
runs/Mar07_12-16-27_nvjep2ob1l/events.out.tfevents.1709814145.nvjep2ob1l.294.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:f204fae8c2d088c8704f6594866f00cc071ad8bf286f8412175d1aee7f9f468b
- size 13305
+ oid sha256:74515a25e09738cb37e63dcb94d32b0a749ca2238ff70f20c534a5f4d17cd307
+ size 19824
runs/Mar07_12-16-27_nvjep2ob1l/events.out.tfevents.1709817267.nvjep2ob1l.294.1 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:da4ce5afd22c171d562ed43dbbeafc98b8e2bbbe9a18735a3675b0260b3f55b0
+ size 514