trichter
/

t5-DistillingSbS-ABSA

Text Generation

text2text-generation

text-generation-inference

Model card Files Files and versions Community

trichter commited on Sep 26, 2024

Commit

86880a4

·

verified ·

1 Parent(s): 6f6923f

Update README.md

Files changed (1) hide show

README.md +5 -0

README.md CHANGED Viewed

@@ -27,8 +27,13 @@ Hyperparameters
 Some of the key hyperparameters used for fine-tuning:
 Batch Size: 3
 Gradient Accumulation Steps: 12
 Optimizer: AdamW
 Learning Rate: 1e-4
 Epochs: 5
 Max Sequence Length: 512

 Some of the key hyperparameters used for fine-tuning:
 Batch Size: 3
 Gradient Accumulation Steps: 12
 Optimizer: AdamW
 Learning Rate: 1e-4
 Epochs: 5
 Max Sequence Length: 512