nandavikas16 committed · Commit dfab990 · verified · 1 Parent(s): bf7c37f

Model save

Files changed (2)
  1. README.md +16 -16
  2. generation_config.json +1 -1
README.md CHANGED
@@ -17,11 +17,11 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [facebook/bart-large-cnn](https://huggingface.co/facebook/bart-large-cnn) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.1165
- - Rouge1: 55.7822
- - Rouge2: 41.9699
- - Rougel: 47.2427
- - Rougelsum: 47.1372
+ - Loss: 0.1466
+ - Rouge1: 53.6452
+ - Rouge2: 39.2506
+ - Rougel: 44.3673
+ - Rougelsum: 44.3521
 
  ## Model description
 
@@ -52,21 +52,21 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
- | 0.613 | 1.0 | 35 | 0.2146 | 42.8394 | 24.2808 | 31.8024 | 31.7805 |
- | 0.1987 | 2.0 | 70 | 0.1989 | 46.2492 | 30.694 | 36.6481 | 36.5135 |
- | 0.1939 | 3.0 | 105 | 0.1792 | 47.7166 | 31.8667 | 38.5667 | 38.526 |
- | 0.1663 | 4.0 | 140 | 0.1642 | 49.6835 | 34.7278 | 39.5294 | 39.4623 |
- | 0.1711 | 5.0 | 175 | 0.1555 | 51.6538 | 35.9915 | 40.1589 | 40.1665 |
- | 0.1577 | 6.0 | 210 | 0.1443 | 50.4306 | 35.9713 | 40.4836 | 40.4492 |
- | 0.1511 | 7.0 | 245 | 0.1367 | 55.8887 | 43.2295 | 49.0124 | 48.9803 |
- | 0.1425 | 8.0 | 280 | 0.1306 | 56.2433 | 41.4182 | 45.9078 | 45.9027 |
- | 0.1255 | 9.0 | 315 | 0.1191 | 57.3464 | 43.7543 | 47.335 | 47.3058 |
- | 0.1299 | 10.0 | 350 | 0.1165 | 55.7822 | 41.9699 | 47.2427 | 47.1372 |
+ | 0.5077 | 1.0 | 36 | 0.2435 | 43.0788 | 24.6647 | 32.4588 | 32.5662 |
+ | 0.2391 | 2.0 | 72 | 0.2190 | 44.1416 | 26.995 | 33.4471 | 33.3775 |
+ | 0.2086 | 3.0 | 108 | 0.2094 | 47.6567 | 31.6759 | 37.2761 | 37.2626 |
+ | 0.1896 | 4.0 | 144 | 0.1986 | 49.8613 | 33.0333 | 38.6723 | 38.6662 |
+ | 0.1868 | 5.0 | 180 | 0.1945 | 49.5133 | 33.8958 | 38.6023 | 38.6402 |
+ | 0.1726 | 6.0 | 216 | 0.1745 | 52.5006 | 36.3466 | 41.5471 | 41.5067 |
+ | 0.1656 | 7.0 | 252 | 0.1705 | 52.6514 | 38.6746 | 43.3252 | 43.3525 |
+ | 0.1515 | 8.0 | 288 | 0.1580 | 53.659 | 37.2505 | 42.5857 | 42.6415 |
+ | 0.1561 | 9.0 | 324 | 0.1492 | 54.5155 | 39.0607 | 44.2114 | 44.3034 |
+ | 0.1363 | 10.0 | 360 | 0.1466 | 53.6452 | 39.2506 | 44.3673 | 44.3521 |
 
 
  ### Framework versions
 
- - Transformers 4.42.4
+ - Transformers 4.43.2
  - Pytorch 2.3.1+cu121
  - Datasets 2.20.0
  - Tokenizers 0.19.1
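
The evaluation results reported in the updated README match the epoch-10 row of the new training table, that is, the final checkpoint rather than the epoch with the highest Rouge1. A minimal sketch for checking this from the table itself, assuming the rows are pasted in as text (the names `TABLE`, `COLUMNS`, and `parse_rows` are illustrative and not part of the repository):

```python
# New-run rows copied from the training results table in this commit.
TABLE = """
| 0.5077 | 1.0 | 36 | 0.2435 | 43.0788 | 24.6647 | 32.4588 | 32.5662 |
| 0.2391 | 2.0 | 72 | 0.2190 | 44.1416 | 26.995 | 33.4471 | 33.3775 |
| 0.2086 | 3.0 | 108 | 0.2094 | 47.6567 | 31.6759 | 37.2761 | 37.2626 |
| 0.1896 | 4.0 | 144 | 0.1986 | 49.8613 | 33.0333 | 38.6723 | 38.6662 |
| 0.1868 | 5.0 | 180 | 0.1945 | 49.5133 | 33.8958 | 38.6023 | 38.6402 |
| 0.1726 | 6.0 | 216 | 0.1745 | 52.5006 | 36.3466 | 41.5471 | 41.5067 |
| 0.1656 | 7.0 | 252 | 0.1705 | 52.6514 | 38.6746 | 43.3252 | 43.3525 |
| 0.1515 | 8.0 | 288 | 0.1580 | 53.659 | 37.2505 | 42.5857 | 42.6415 |
| 0.1561 | 9.0 | 324 | 0.1492 | 54.5155 | 39.0607 | 44.2114 | 44.3034 |
| 0.1363 | 10.0 | 360 | 0.1466 | 53.6452 | 39.2506 | 44.3673 | 44.3521 |
"""

# Column names as they appear in the table header.
COLUMNS = [
    "Training Loss", "Epoch", "Step", "Validation Loss",
    "Rouge1", "Rouge2", "Rougel", "Rougelsum",
]


def parse_rows(table):
    """Turn pipe-delimited table rows into a list of {column: float} dicts."""
    rows = []
    for line in table.strip().splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        rows.append(dict(zip(COLUMNS, map(float, cells))))
    return rows


rows = parse_rows(TABLE)
best_rouge1 = max(rows, key=lambda r: r["Rouge1"])
lowest_loss = min(rows, key=lambda r: r["Validation Loss"])

print("Best Rouge1 at epoch", best_rouge1["Epoch"], "->", best_rouge1["Rouge1"])
print("Lowest validation loss at epoch", lowest_loss["Epoch"], "->", lowest_loss["Validation Loss"])
```

In this run the lowest validation loss falls on the last epoch while Rouge1 peaks one epoch earlier, so which checkpoint counts as "best" depends on the metric chosen.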
generation_config.json CHANGED
@@ -11,6 +11,6 @@
  "no_repeat_ngram_size": 3,
  "num_beams": 4,
  "pad_token_id": 1,
- "transformers_version": "4.42.4",
+ "transformers_version": "4.43.2",
  "use_cache": false
  }
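
The unchanged `"no_repeat_ngram_size": 3` setting asks generation to avoid producing any 3-gram more than once. The sketch below illustrates that idea in plain Python: given the tokens generated so far, it lists the next tokens that would complete an already-seen n-gram. It is a simplified illustration, not the Transformers implementation, and the function name `banned_next_tokens` is hypothetical.

```python
def banned_next_tokens(generated, ngram_size=3):
    """Return the tokens that would repeat an existing n-gram if appended.

    Simplified illustration of the no-repeat-n-gram constraint; the
    function name and signature are assumptions made for this example.
    """
    if ngram_size <= 0 or len(generated) + 1 < ngram_size:
        return set()

    # The last (n - 1) tokens form the prefix of the n-gram about to be completed.
    prefix_len = ngram_size - 1
    current_prefix = tuple(generated[len(generated) - prefix_len:])

    banned = set()
    # Scan every n-gram already present in the sequence.
    for i in range(len(generated) - ngram_size + 1):
        ngram = generated[i:i + ngram_size]
        if tuple(ngram[:-1]) == current_prefix:
            banned.add(ngram[-1])
    return banned


# Token ids are arbitrary integers chosen for the example.
history = [5, 6, 7, 5, 6]
print(banned_next_tokens(history, ngram_size=3))
```

Here the sequence already contains the 3-gram `(5, 6, 7)` and currently ends in `(5, 6)`, so emitting `7` next would repeat it. In beam search (`"num_beams": 4` above), a constraint of this kind is applied to each beam's own history.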