Miranda/t5-small-train

text2text-generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Miranda committed on Apr 25, 2022

Commit 977a3d0 · 1 Parent(s): 701b07d

update model card README.md

Files changed (1)

README.md +25 -11

@@ -3,6 +3,8 @@ license: apache-2.0
 tags:
 - summarization
 - generated_from_trainer
+metrics:
+- rouge
 model-index:
 - name: t5-small-train
   results: []
@@ -15,15 +17,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- eval_loss: 4.1125
-- eval_rouge1: 30.7616
-- eval_rouge2: 12.0164
-- eval_rougeL: 26.1888
-- eval_rougeLsum: 27.0841
-- eval_runtime: 11.6689
-- eval_samples_per_second: 13.026
-- eval_steps_per_second: 1.371
-- step: 0
+- Loss: 2.4110
+- Rouge1: 41.006
+- Rouge2: 18.9406
+- Rougel: 35.7319
+- Rougelsum: 35.9987
 ## Model description
@@ -43,13 +41,29 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5.6e-05
-- train_batch_size: 10
-- eval_batch_size: 10
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 10
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
+| 2.5097        | 1.0   | 45   | 2.5765          | 36.9095 | 15.7531 | 32.4588 | 32.7501   |
+| 2.39          | 2.0   | 90   | 2.4823          | 39.1984 | 17.2602 | 34.5018 | 34.8303   |
+| 2.2862        | 3.0   | 135  | 2.4521          | 39.9179 | 18.2643 | 35.4775 | 35.7854   |
+| 2.2011        | 4.0   | 180  | 2.4314          | 40.1014 | 18.3646 | 35.274  | 35.5883   |
+| 2.1335        | 5.0   | 225  | 2.4240          | 40.1053 | 18.406  | 35.0905 | 35.3427   |
+| 2.0803        | 6.0   | 270  | 2.4178          | 41.1202 | 18.5746 | 35.5454 | 35.7857   |
+| 2.0662        | 7.0   | 315  | 2.4129          | 40.7965 | 18.5148 | 35.5866 | 35.8591   |
+| 2.0291        | 8.0   | 360  | 2.4103          | 40.7121 | 18.8736 | 35.6646 | 35.9392   |
+| 1.9807        | 9.0   | 405  | 2.4112          | 40.9464 | 18.9815 | 35.8468 | 36.1114   |
+| 1.9702        | 10.0  | 450  | 2.4110          | 41.006  | 18.9406 | 35.7319 | 35.9987   |
 ### Framework versions
 - Transformers 4.18.0
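
The headline metrics added in this commit (Loss 2.4110, Rouge1 41.006) match the final epoch-10 row of the "Training results" table rather than the best row for each metric. The sketch below re-reads the table to check this and to estimate the training set size from the 45 steps per epoch. The row values are copied from the table. The names `ROWS` and `train_size_bounds` are illustrative, and the size estimate assumes a single device, no gradient accumulation, and that the last partial batch is kept, none of which the model card states.

```python
# Rows copied from the "Training results" table:
# (epoch, step, validation_loss, rouge1, rouge2, rougeL, rougeLsum)
ROWS = [
    (1, 45, 2.5765, 36.9095, 15.7531, 32.4588, 32.7501),
    (2, 90, 2.4823, 39.1984, 17.2602, 34.5018, 34.8303),
    (3, 135, 2.4521, 39.9179, 18.2643, 35.4775, 35.7854),
    (4, 180, 2.4314, 40.1014, 18.3646, 35.274, 35.5883),
    (5, 225, 2.4240, 40.1053, 18.406, 35.0905, 35.3427),
    (6, 270, 2.4178, 41.1202, 18.5746, 35.5454, 35.7857),
    (7, 315, 2.4129, 40.7965, 18.5148, 35.5866, 35.8591),
    (8, 360, 2.4103, 40.7121, 18.8736, 35.6646, 35.9392),
    (9, 405, 2.4112, 40.9464, 18.9815, 35.8468, 36.1114),
    (10, 450, 2.4110, 41.006, 18.9406, 35.7319, 35.9987),
]


def train_size_bounds(steps_per_epoch, batch_size):
    """Dataset sizes N for which ceil(N / batch_size) == steps_per_epoch.

    Assumption: one optimizer step per batch on a single device,
    with the final partial batch kept.
    """
    low = (steps_per_epoch - 1) * batch_size + 1
    high = steps_per_epoch * batch_size
    return low, high


# Epoch with the lowest validation loss and the highest Rouge1.
best_loss = min(ROWS, key=lambda r: r[2])
best_rouge1 = max(ROWS, key=lambda r: r[3])

# Steps per epoch, taken from the first row (step 45 at epoch 1).
steps_per_epoch = ROWS[0][1] // ROWS[0][0]
size_low, size_high = train_size_bounds(steps_per_epoch, batch_size=8)

print(f"lowest validation loss: epoch {best_loss[0]} ({best_loss[2]})")
print(f"highest Rouge1: epoch {best_rouge1[0]} ({best_rouge1[3]})")
print(f"estimated training examples: {size_low} to {size_high}")
```

Under these assumptions the lowest validation loss falls at epoch 8 and the highest Rouge1 at epoch 6, so the reported numbers describe the last checkpoint, not a selected best one. The differences are small (2.4103 vs 2.4110 for loss), so this mostly matters when comparing against runs that do select the best checkpoint.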