duarteocarmo's picture
update model card README.md
a7bf1a4
|
raw
history blame
1.95 kB
metadata
license: apache-2.0
tags:
  - generated_from_trainer
metrics:
  - rouge
model-index:
  - name: flan-t5-small-tigger
    results: []

flan-t5-small-tigger

This model is a fine-tuned version of google/flan-t5-small on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 2.1549
  • Rouge1: 17.5468
  • Rouge2: 10.1476
  • Rougel: 17.4231
  • Rougelsum: 17.4344
  • Gen Len: 16.6461

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 5

Training results

Training Loss Epoch Step Validation Loss Rouge1 Rouge2 Rougel Rougelsum Gen Len
2.4678 1.0 1800 2.2583 19.1824 13.4043 19.1866 19.1587 10.7492
2.3099 2.0 3600 2.1954 17.3144 10.368 17.1994 17.2103 15.2175
2.2551 3.0 5400 2.1692 17.8406 10.6106 17.7207 17.7554 16.9789
2.2125 4.0 7200 2.1569 17.5768 10.199 17.4462 17.4731 16.4942
2.1944 5.0 9000 2.1549 17.5468 10.1476 17.4231 17.4344 16.6461

Framework versions

  • Transformers 4.29.1
  • Pytorch 2.0.0+cu118
  • Datasets 2.12.0
  • Tokenizers 0.13.3