Miranda/t5-small-train

@@ -17,11 +17,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.3391
-- Rouge1: 41.7833
-- Rouge2: 19.9755
-- Rougel: 37.2121
-- Rougelsum: 37.8536
 ## Model description
@@ -40,9 +40,9 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 6.6e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -52,14 +52,14 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
-| 3.1181        | 1.0   | 45   | 2.6259          | 36.4131 | 16.4288 | 31.7464 | 32.35     |
-| 2.5513        | 2.0   | 90   | 2.4453          | 38.9936 | 17.574  | 34.739  | 35.1928   |
-| 2.3397        | 3.0   | 135  | 2.3846          | 39.9271 | 18.2479 | 35.2491 | 35.8645   |
-| 2.2917        | 4.0   | 180  | 2.3631          | 40.9718 | 19.1011 | 36.2473 | 37.0154   |
-| 2.2108        | 5.0   | 225  | 2.3485          | 41.3635 | 19.8269 | 36.632  | 37.3936   |
-| 2.1416        | 6.0   | 270  | 2.3430          | 41.5211 | 20.003  | 36.9579 | 37.6769   |
-| 2.0931        | 7.0   | 315  | 2.3402          | 42.1374 | 20.4328 | 37.5093 | 38.2262   |
-| 2.1076        | 8.0   | 360  | 2.3391          | 41.7833 | 19.9755 | 37.2121 | 37.8536   |
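
The reported evaluation results in this earlier revision match the epoch 8 row, which has the lowest validation loss, while every ROUGE column peaks at epoch 7. The following is a minimal sketch of that comparison, with the rows copied from the table above. The helper name `best_epoch` and the tuple layout are illustrative assumptions, not part of the model card or of any library.

```python
# Rows copied from the table above:
# (epoch, validation_loss, rouge1, rouge2, rougeL, rougeLsum)
ROWS = [
    (1, 2.6259, 36.4131, 16.4288, 31.7464, 32.35),
    (2, 2.4453, 38.9936, 17.574, 34.739, 35.1928),
    (3, 2.3846, 39.9271, 18.2479, 35.2491, 35.8645),
    (4, 2.3631, 40.9718, 19.1011, 36.2473, 37.0154),
    (5, 2.3485, 41.3635, 19.8269, 36.632, 37.3936),
    (6, 2.3430, 41.5211, 20.003, 36.9579, 37.6769),
    (7, 2.3402, 42.1374, 20.4328, 37.5093, 38.2262),
    (8, 2.3391, 41.7833, 19.9755, 37.2121, 37.8536),
]


def best_epoch(rows, column, higher_is_better=True):
    """Return the epoch whose value in `column` is best (hypothetical helper)."""
    pick = max if higher_is_better else min
    return pick(rows, key=lambda row: row[column])[0]


# Column 1 is validation loss (lower is better); columns 2-5 are ROUGE scores.
epoch_by_loss = best_epoch(ROWS, 1, higher_is_better=False)
epoch_by_rouge1 = best_epoch(ROWS, 2)
print(epoch_by_loss, epoch_by_rouge1)
```

The diff does not say how the final checkpoint was chosen. If ROUGE is the metric that matters, `TrainingArguments` in Transformers accepts `load_best_model_at_end` together with `metric_for_best_model`, which may be worth considering for a later run.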
 ### Framework versions

 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.3177
+- Rouge1: 42.593
+- Rouge2: 20.4119
+- Rougel: 37.9869
+- Rougelsum: 38.7572
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 4.6e-05
+- train_batch_size: 9
+- eval_batch_size: 9
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
+| 2.11          | 1.0   | 40   | 2.3312          | 41.6952 | 20.5706 | 37.2314 | 38.0437   |
+| 2.0322        | 2.0   | 80   | 2.3218          | 42.0754 | 20.614  | 37.4971 | 38.3932   |
+| 1.9806        | 3.0   | 120  | 2.3205          | 42.257  | 20.6839 | 37.7397 | 38.4623   |
+| 1.9656        | 4.0   | 160  | 2.3229          | 42.2486 | 20.0638 | 37.5846 | 38.4475   |
+| 1.9416        | 5.0   | 200  | 2.3208          | 42.764  | 20.588  | 38.0969 | 38.9356   |
+| 1.9139        | 6.0   | 240  | 2.3187          | 42.6413 | 20.7776 | 38.0375 | 38.8627   |
+| 1.8388        | 7.0   | 280  | 2.3181          | 43.0338 | 21.0632 | 38.3891 | 39.1829   |
+| 1.9099        | 8.0   | 320  | 2.3177          | 42.593  | 20.4119 | 37.9869 | 38.7572   |
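
The step counts in the two tables are consistent with a single training set: 45 steps per epoch at batch size 8 in the earlier revision, and 40 steps per epoch at batch size 9 here. The sketch below bounds the number of training examples from those figures. It assumes one optimizer step per batch (no gradient accumulation), a single device, a retained final partial batch, and the same training set in both runs. None of these is stated in the diff, and `dataset_size_bounds` is a hypothetical helper.

```python
def dataset_size_bounds(steps_per_epoch, batch_size):
    """Inclusive bounds on N such that ceil(N / batch_size) == steps_per_epoch.

    Assumes one optimizer step per batch and that the last, possibly
    smaller, batch is kept rather than dropped.
    """
    low = (steps_per_epoch - 1) * batch_size + 1
    high = steps_per_epoch * batch_size
    return low, high


old_run = dataset_size_bounds(45, 8)   # earlier revision: 45 steps, batch size 8
new_run = dataset_size_bounds(40, 9)   # this revision: 40 steps, batch size 9

# If both runs used the same training set, N must satisfy both ranges.
combined = (max(old_run[0], new_run[0]), min(old_run[1], new_run[1]))
print(old_run, new_run, combined)
```

Under those assumptions the ranges overlap, so the change in step count is explained by the batch size alone and does not by itself indicate a different dataset.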
 ### Framework versions