bbunijieun
/

seq2seq_huggingface_mix_results

Transformers

Safetensors

transformer

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

bbunijieun commited on Jun 2, 2024

Commit

d8f3f34

verified ·

1 Parent(s): e0fee29

Initial model training

Browse files

Files changed (2) hide show

README.md +22 -64
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 7.0175
 ## Model description
@@ -41,75 +41,33 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 10.5072       | 0.0480 | 10   | 10.4491         |
-| 10.3574       | 0.0959 | 20   | 10.1991         |
-| 10.0831       | 0.1439 | 30   | 9.8790          |
-| 9.7946        | 0.1918 | 40   | 9.5780          |
-| 9.5118        | 0.2398 | 50   | 9.3344          |
-| 9.3333        | 0.2878 | 60   | 9.1722          |
-| 9.1888        | 0.3357 | 70   | 9.0610          |
-| 9.0913        | 0.3837 | 80   | 8.9742          |
-| 9.0007        | 0.4317 | 90   | 8.9005          |
-| 8.9134        | 0.4796 | 100  | 8.8328          |
-| 8.8583        | 0.5276 | 110  | 8.7615          |
-| 8.7722        | 0.5755 | 120  | 8.6873          |
-| 8.7092        | 0.6235 | 130  | 8.6137          |
-| 8.6223        | 0.6715 | 140  | 8.5340          |
-| 8.5312        | 0.7194 | 150  | 8.4538          |
-| 8.4582        | 0.7674 | 160  | 8.3681          |
-| 8.3748        | 0.8153 | 170  | 8.2801          |
-| 8.2637        | 0.8633 | 180  | 8.1936          |
-| 8.1704        | 0.9113 | 190  | 8.1001          |
-| 8.0697        | 0.9592 | 200  | 8.0079          |
-| 7.9792        | 1.0072 | 210  | 7.9126          |
-| 7.9           | 1.0552 | 220  | 7.8175          |
-| 7.8134        | 1.1031 | 230  | 7.7236          |
-| 7.7153        | 1.1511 | 240  | 7.6328          |
-| 7.6087        | 1.1990 | 250  | 7.5477          |
-| 7.5328        | 1.2470 | 260  | 7.4634          |
-| 7.4347        | 1.2950 | 270  | 7.3862          |
-| 7.3531        | 1.3429 | 280  | 7.3179          |
-| 7.3059        | 1.3909 | 290  | 7.2513          |
-| 7.2403        | 1.4388 | 300  | 7.1955          |
-| 7.2128        | 1.4868 | 310  | 7.1506          |
-| 7.1508        | 1.5348 | 320  | 7.1105          |
-| 7.1104        | 1.5827 | 330  | 7.0835          |
-| 7.067         | 1.6307 | 340  | 7.0655          |
-| 7.0594        | 1.6787 | 350  | 7.0558          |
-| 7.0591        | 1.7266 | 360  | 7.0411          |
-| 7.0129        | 1.7746 | 370  | 7.0381          |
-| 7.0107        | 1.8225 | 380  | 7.0344          |
-| 7.0549        | 1.8705 | 390  | 7.0268          |
-| 7.0358        | 1.9185 | 400  | 7.0249          |
-| 7.0395        | 1.9664 | 410  | 7.0242          |
-| 7.0105        | 2.0144 | 420  | 7.0215          |
-| 7.0113        | 2.0624 | 430  | 7.0259          |
-| 6.9985        | 2.1103 | 440  | 7.0213          |
-| 7.0218        | 2.1583 | 450  | 7.0218          |
-| 6.9735        | 2.2062 | 460  | 7.0275          |
-| 7.0132        | 2.2542 | 470  | 7.0254          |
-| 7.0241        | 2.3022 | 480  | 7.0219          |
-| 7.0127        | 2.3501 | 490  | 7.0238          |
-| 6.9644        | 2.3981 | 500  | 7.0249          |
-| 7.0103        | 2.4460 | 510  | 7.0259          |
-| 7.006         | 2.4940 | 520  | 7.0266          |
-| 6.9882        | 2.5420 | 530  | 7.0235          |
-| 7.0016        | 2.5899 | 540  | 7.0235          |
-| 7.002         | 2.6379 | 550  | 7.0217          |
-| 6.9782        | 2.6859 | 560  | 7.0196          |
-| 6.9833        | 2.7338 | 570  | 7.0198          |
-| 6.9967        | 2.7818 | 580  | 7.0202          |
-| 6.9644        | 2.8297 | 590  | 7.0196          |
-| 6.9825        | 2.8777 | 600  | 7.0199          |
-| 7.0097        | 2.9257 | 610  | 7.0178          |
-| 6.9909        | 2.9736 | 620  | 7.0175          |
 ### Framework versions

 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 7.9457
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- num_epochs: 1
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 10.7021       | 0.0480 | 10   | 10.6458         |
+| 10.5611       | 0.0959 | 20   | 10.4143         |
+| 10.2935       | 0.1439 | 30   | 10.0786         |
+| 9.9673        | 0.1918 | 40   | 9.7462          |
+| 9.6468        | 0.2398 | 50   | 9.4724          |
+| 9.4303        | 0.2878 | 60   | 9.2583          |
+| 9.2452        | 0.3357 | 70   | 9.1136          |
+| 9.1357        | 0.3837 | 80   | 9.0100          |
+| 9.0307        | 0.4317 | 90   | 8.9296          |
+| 8.9363        | 0.4796 | 100  | 8.8591          |
+| 8.8781        | 0.5276 | 110  | 8.7821          |
+| 8.7907        | 0.5755 | 120  | 8.7088          |
+| 8.7214        | 0.6235 | 130  | 8.6329          |
+| 8.6375        | 0.6715 | 140  | 8.5511          |
+| 8.5439        | 0.7194 | 150  | 8.4701          |
+| 8.4715        | 0.7674 | 160  | 8.3835          |
+| 8.389         | 0.8153 | 170  | 8.2962          |
+| 8.2787        | 0.8633 | 180  | 8.2072          |
+| 8.1826        | 0.9113 | 190  | 8.1146          |
+| 8.0848        | 0.9592 | 200  | 8.0225          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5c4d450c24368b42271e65a1f934af6e1fb78361eb03ead224038471e180c732
 size 310717424

 version https://git-lfs.github.com/spec/v1
+oid sha256:5bbf477e86932cb8f6a5d9e8fc32017cd44e35d8a891033d4cd65fa1cc1cdcb6
 size 310717424