hawalurahman
/

mt5-base-qa_v1

@@ -19,13 +19,13 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/mt5-base](https://huggingface.co/google/mt5-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.9476
-- Rouge1: 0.6394
-- Rouge2: 0.3540
-- Rougel: 0.6391
-- Rougelsum: 0.6397
-- Bleu: 0.4246
-- Exact Match: 0.4133
 ## Model description
@@ -44,7 +44,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0003
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
@@ -56,11 +56,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Bleu   | Exact Match |
 |:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:---------:|:------:|:-----------:|
-| 0.5058        | 1.0   | 2000  | 1.1567          | 0.6147 | 0.3280 | 0.6147 | 0.6144    | 0.3739 | 0.3912      |
-| 0.1688        | 2.0   | 4000  | 1.3225          | 0.6180 | 0.3428 | 0.6180 | 0.6184    | 0.4093 | 0.375       |
-| 0.0799        | 3.0   | 6000  | 1.6096          | 0.6397 | 0.3591 | 0.6399 | 0.6402    | 0.4213 | 0.404       |
-| 0.0298        | 4.0   | 8000  | 1.8350          | 0.6431 | 0.3549 | 0.6428 | 0.6430    | 0.4250 | 0.406       |
-| 0.0113        | 5.0   | 10000 | 1.9476          | 0.6394 | 0.3540 | 0.6391 | 0.6397    | 0.4246 | 0.4133      |
 ### Framework versions

 This model is a fine-tuned version of [google/mt5-base](https://huggingface.co/google/mt5-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.5600
+- Rouge1: 0.7136
+- Rouge2: 0.4062
+- Rougel: 0.7131
+- Rougelsum: 0.7129
+- Bleu: 0.4978
+- Exact Match: 0.4803
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0001
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 | Training Loss | Epoch | Step  | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Bleu   | Exact Match |
 |:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:---------:|:------:|:-----------:|
+| 0.5439        | 1.0   | 2000  | 0.9280          | 0.6925 | 0.4012 | 0.6919 | 0.6921    | 0.4703 | 0.4422      |
+| 0.2443        | 2.0   | 4000  | 1.0939          | 0.6986 | 0.3915 | 0.6984 | 0.6984    | 0.4537 | 0.4525      |
+| 0.1263        | 3.0   | 6000  | 1.2665          | 0.7005 | 0.3898 | 0.7004 | 0.7005    | 0.4569 | 0.4723      |
+| 0.0769        | 4.0   | 8000  | 1.5002          | 0.7159 | 0.4065 | 0.7158 | 0.7157    | 0.4987 | 0.4828      |
+| 0.0507        | 5.0   | 10000 | 1.5600          | 0.7136 | 0.4062 | 0.7131 | 0.7129    | 0.4978 | 0.4803      |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c378e7db0420fe4e6e5edd5bed0128d5c3d05ca761be157b3f4151ae627a65e9
 size 2329638768

 version https://git-lfs.github.com/spec/v1
+oid sha256:53f1656807ae27f58454b28586583abf0d8dafd2dd67f53e0bde2d4d26ec630a
 size 2329638768