End of training

Browse files

Files changed (5) hide show

README.md +18 -18
logs/events.out.tfevents.1700925733.dfb6af92617b.149.0 +3 -0
logs/events.out.tfevents.1700926022.dfb6af92617b.149.1 +3 -0
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -6,23 +6,23 @@ tags:
 metrics:
 - rouge
 model-index:
-- name: flan-t5-base-fineTuned
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# flan-t5-base-fineTuned
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1995
-- Rouge1: 93.7396
-- Rouge2: 85.7064
-- Rougel: 93.7508
-- Rougelsum: 93.861
-- Gen Len: 11.7872
 ## Model description
@@ -53,16 +53,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
-| No log        | 1.0   | 30   | 0.3947          | 88.6458 | 75.4031 | 88.1675 | 88.2718   | 11.0851 |
-| No log        | 2.0   | 60   | 0.2608          | 92.2792 | 81.347  | 92.3561 | 92.3951   | 11.7234 |
-| No log        | 3.0   | 90   | 0.2242          | 93.0917 | 83.4787 | 93.1643 | 93.1916   | 11.7660 |
-| No log        | 4.0   | 120  | 0.2056          | 93.7531 | 86.045  | 93.8218 | 93.8514   | 11.8298 |
-| No log        | 5.0   | 150  | 0.1995          | 93.7396 | 85.7064 | 93.7508 | 93.861    | 11.7872 |
-| No log        | 6.0   | 180  | 0.2021          | 93.5965 | 85.2921 | 93.6819 | 93.7096   | 11.8298 |
-| No log        | 7.0   | 210  | 0.2089          | 93.5965 | 85.2921 | 93.6819 | 93.7096   | 11.8298 |
-| No log        | 8.0   | 240  | 0.2073          | 93.8289 | 85.7832 | 93.8623 | 93.9475   | 11.8298 |
-| No log        | 9.0   | 270  | 0.2083          | 93.8289 | 85.7832 | 93.8623 | 93.9475   | 11.8298 |
-| No log        | 10.0  | 300  | 0.2087          | 93.8289 | 85.7832 | 93.8623 | 93.9475   | 11.8298 |
 ### Framework versions

 metrics:
 - rouge
 model-index:
+- name: flan-t5-sentence-generator
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# flan-t5-sentence-generator
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3271
+- Rouge1: 92.6712
+- Rouge2: 82.7566
+- Rougel: 92.6246
+- Rougelsum: 92.5733
+- Gen Len: 12.6809
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
+| No log        | 1.0   | 38   | 0.4665          | 87.5888 | 72.8489 | 87.0237 | 87.1042   | 11.5745 |
+| No log        | 2.0   | 76   | 0.3577          | 90.7662 | 79.8453 | 90.443  | 90.4784   | 12.2340 |
+| No log        | 3.0   | 114  | 0.3342          | 92.0014 | 81.8411 | 91.999  | 91.9489   | 12.4468 |
+| No log        | 4.0   | 152  | 0.3343          | 92.3868 | 81.5074 | 92.2937 | 92.2943   | 12.5319 |
+| No log        | 5.0   | 190  | 0.3517          | 92.7314 | 83.1921 | 92.7259 | 92.6681   | 12.7660 |
+| No log        | 6.0   | 228  | 0.3271          | 92.6712 | 82.7566 | 92.6246 | 92.5733   | 12.6809 |
+| No log        | 7.0   | 266  | 0.3285          | 92.7106 | 82.4425 | 92.7382 | 92.6212   | 12.6809 |
+| No log        | 8.0   | 304  | 0.3379          | 92.9469 | 83.0373 | 92.9539 | 92.8683   | 12.6596 |
+| No log        | 9.0   | 342  | 0.3318          | 93.217  | 83.9024 | 93.1868 | 93.1101   | 12.7234 |
+| No log        | 10.0  | 380  | 0.3336          | 93.0582 | 83.3947 | 93.053  | 92.9652   | 12.7021 |
 ### Framework versions

logs/events.out.tfevents.1700925733.dfb6af92617b.149.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:7505139ce47d94ad3517e15ce72d6d638d8a05c41c3b5250b6143c167d70122c
+size 10849

logs/events.out.tfevents.1700926022.dfb6af92617b.149.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a526d3b7089af51dc9c7f08f080b85bb5f840b19f69d0bd6cc87e0d643de7e6b
+size 613

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a087cfd2d0a4f7abf462446420d5f54adab7c04a273012b869140cbd6e07193d
 size 990345064

 version https://git-lfs.github.com/spec/v1
+oid sha256:a6d445005957e65a0582b1f1fea53b7c4fe1c0f1f2369597c2c64bfa4ee019a1
 size 990345064

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d421d9df02370b2b2f680163ae8799d67598e42c2c4aed6d5e11518315339a9b
 size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:01bd10ceec4acca8e4abdd450a6e9f260449d04786bd428881b0727fdca9371a
 size 4728