Commit
·
04a0a6f
1
Parent(s):
de1c0a2
End of training
README.md
CHANGED
@@ -17,9 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
|
|
17 |
|
18 |
This model is a fine-tuned version of [youdiniplays/filipinolingo_model](https://huggingface.co/youdiniplays/filipinolingo_model) on an unknown dataset.
|
19 |
It achieves the following results on the evaluation set:
|
20 |
-
- Loss: 3.
|
21 |
-
- Bleu:
|
22 |
-
- Gen Len:
|
23 |
|
24 |
## Model description
|
25 |
|
@@ -38,39 +38,319 @@ More information needed
|
|
38 |
### Training hyperparameters
|
39 |
|
40 |
The following hyperparameters were used during training:
|
41 |
-
- learning_rate:
|
42 |
- train_batch_size: 16
|
43 |
- eval_batch_size: 16
|
44 |
- seed: 42
|
45 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
46 |
- lr_scheduler_type: linear
|
47 |
-
- num_epochs:
|
48 |
- mixed_precision_training: Native AMP
|
49 |
|
50 |
### Training results
|
51 |
|
52 |
-
| Training Loss | Epoch | Step | Validation Loss | Bleu
|
53 |
-
|
54 |
-
| No log | 1.0 | 4 |
|
55 |
-
| No log | 2.0 | 8 |
|
56 |
-
| No log | 3.0 | 12 |
|
57 |
-
| No log | 4.0 | 16 |
|
58 |
-
| No log | 5.0 | 20 |
|
59 |
-
| No log | 6.0 | 24 |
|
60 |
-
| No log | 7.0 | 28 |
|
61 |
-
| No log | 8.0 | 32 |
|
62 |
-
| No log | 9.0 | 36 |
|
63 |
-
| No log | 10.0 | 40 |
|
64 |
-
| No log | 11.0 | 44 | 3.
|
65 |
-
| No log | 12.0 | 48 | 3.
|
66 |
-
| No log | 13.0 | 52 | 3.
|
67 |
-
| No log | 14.0 | 56 | 3.
|
68 |
-
| No log | 15.0 | 60 | 3.
|
69 |
-
| No log | 16.0 | 64 | 3.
|
70 |
-
| No log | 17.0 | 68 | 3.
|
71 |
-
| No log | 18.0 | 72 | 3.
|
72 |
-
| No log | 19.0 | 76 | 3.
|
73 |
-
| No log | 20.0 | 80 | 3.
|
74 |
|
75 |
|
76 |
### Framework versions
|
|
|
17 |
|
18 |
This model is a fine-tuned version of [youdiniplays/filipinolingo_model](https://huggingface.co/youdiniplays/filipinolingo_model) on an unknown dataset.
|
19 |
It achieves the following results on the evaluation set:
|
20 |
+
- Loss: 3.6597
|
21 |
+
- Bleu: 11.8044
|
22 |
+
- Gen Len: 14.75
|
23 |
|
24 |
## Model description
|
25 |
|
|
|
38 |
### Training hyperparameters
|
39 |
|
40 |
The following hyperparameters were used during training:
|
41 |
+
- learning_rate: 0.001
|
42 |
- train_batch_size: 16
|
43 |
- eval_batch_size: 16
|
44 |
- seed: 42
|
45 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
46 |
- lr_scheduler_type: linear
|
47 |
+
- num_epochs: 300
|
48 |
- mixed_precision_training: Native AMP
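As a rough illustration of how `lr_scheduler_type: linear` combines with the values listed above, the sketch below computes the learning rate at a given optimizer step. It assumes no warmup (the card lists none) and 1200 total steps (300 epochs at 4 steps per epoch, matching the Step column of the training results table). The function name `linear_lr` is hypothetical and not part of any library.

```python
def linear_lr(step: int,
              base_lr: float = 1e-3,      # learning_rate from the card
              total_steps: int = 1200,    # 300 epochs x 4 steps per epoch
              warmup_steps: int = 0) -> float:  # ASSUMPTION: no warmup
    """Linear warmup (if any) followed by linear decay to zero."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Print the rate at a few points of the run.
for step in (0, 300, 600, 900, 1200):
    print(f"step {step:4d}: lr = {linear_lr(step):.6f}")
```

Under these assumptions the rate falls steadily from 0.001 at step 0 to zero at step 1200, so the final epochs train with a very small learning rate.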
|
49 |
|
50 |
### Training results
|
51 |
|
52 |
+
| Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
|
53 |
+
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
|
54 |
+
| No log | 1.0 | 4 | 2.6992 | 3.5276 | 13.75 |
|
55 |
+
| No log | 2.0 | 8 | 2.3483 | 6.8767 | 14.1875 |
|
56 |
+
| No log | 3.0 | 12 | 2.2289 | 8.4749 | 14.5625 |
|
57 |
+
| No log | 4.0 | 16 | 2.2552 | 8.537 | 14.375 |
|
58 |
+
| No log | 5.0 | 20 | 2.3404 | 9.3451 | 13.875 |
|
59 |
+
| No log | 6.0 | 24 | 2.5126 | 15.043 | 14.0625 |
|
60 |
+
| No log | 7.0 | 28 | 2.7072 | 14.9624 | 14.125 |
|
61 |
+
| No log | 8.0 | 32 | 2.8633 | 14.8092 | 14.3125 |
|
62 |
+
| No log | 9.0 | 36 | 2.9499 | 15.0385 | 14.125 |
|
63 |
+
| No log | 10.0 | 40 | 2.9954 | 9.0751 | 14.1875 |
|
64 |
+
| No log | 11.0 | 44 | 3.0306 | 8.321 | 14.125 |
|
65 |
+
| No log | 12.0 | 48 | 3.0640 | 8.5338 | 14.0625 |
|
66 |
+
| No log | 13.0 | 52 | 3.0869 | 8.5302 | 14.0625 |
|
67 |
+
| No log | 14.0 | 56 | 3.1138 | 8.3647 | 14.125 |
|
68 |
+
| No log | 15.0 | 60 | 3.1254 | 8.5765 | 13.9375 |
|
69 |
+
| No log | 16.0 | 64 | 3.1203 | 8.5302 | 14.0625 |
|
70 |
+
| No log | 17.0 | 68 | 3.1250 | 12.0182 | 14.1875 |
|
71 |
+
| No log | 18.0 | 72 | 3.1192 | 12.0182 | 14.1875 |
|
72 |
+
| No log | 19.0 | 76 | 3.1231 | 8.5338 | 14.1875 |
|
73 |
+
| No log | 20.0 | 80 | 3.1155 | 11.9388 | 13.875 |
|
74 |
+
| No log | 21.0 | 84 | 3.1176 | 11.9402 | 13.875 |
|
75 |
+
| No log | 22.0 | 88 | 3.1295 | 11.9402 | 13.875 |
|
76 |
+
| No log | 23.0 | 92 | 3.1487 | 11.9402 | 13.875 |
|
77 |
+
| No log | 24.0 | 96 | 3.1673 | 12.1489 | 13.875 |
|
78 |
+
| No log | 25.0 | 100 | 3.1859 | 16.2159 | 13.875 |
|
79 |
+
| No log | 26.0 | 104 | 3.2061 | 15.9711 | 13.8125 |
|
80 |
+
| No log | 27.0 | 108 | 3.2147 | 15.9711 | 13.8125 |
|
81 |
+
| No log | 28.0 | 112 | 3.2223 | 15.9711 | 13.8125 |
|
82 |
+
| No log | 29.0 | 116 | 3.2345 | 16.2159 | 13.8125 |
|
83 |
+
| No log | 30.0 | 120 | 3.2414 | 16.1289 | 13.8125 |
|
84 |
+
| No log | 31.0 | 124 | 3.2448 | 16.1261 | 13.8125 |
|
85 |
+
| No log | 32.0 | 128 | 3.2446 | 16.1261 | 13.8125 |
|
86 |
+
| No log | 33.0 | 132 | 3.2307 | 15.8836 | 13.75 |
|
87 |
+
| No log | 34.0 | 136 | 3.2247 | 15.8803 | 13.75 |
|
88 |
+
| No log | 35.0 | 140 | 3.2364 | 15.8803 | 13.75 |
|
89 |
+
| No log | 36.0 | 144 | 3.2507 | 16.1261 | 13.8125 |
|
90 |
+
| No log | 37.0 | 148 | 3.2608 | 16.1261 | 13.8125 |
|
91 |
+
| No log | 38.0 | 152 | 3.2893 | 16.536 | 13.8125 |
|
92 |
+
| No log | 39.0 | 156 | 3.3026 | 16.3582 | 13.8125 |
|
93 |
+
| No log | 40.0 | 160 | 3.2786 | 16.3582 | 13.9375 |
|
94 |
+
| No log | 41.0 | 164 | 3.2607 | 16.3548 | 14.0 |
|
95 |
+
| No log | 42.0 | 168 | 3.2557 | 16.4428 | 14.0 |
|
96 |
+
| No log | 43.0 | 172 | 3.2648 | 16.1734 | 14.1875 |
|
97 |
+
| No log | 44.0 | 176 | 3.2455 | 12.2013 | 14.375 |
|
98 |
+
| No log | 45.0 | 180 | 3.2444 | 12.2013 | 14.375 |
|
99 |
+
| No log | 46.0 | 184 | 3.2416 | 12.2013 | 14.375 |
|
100 |
+
| No log | 47.0 | 188 | 3.2412 | 11.8127 | 14.375 |
|
101 |
+
| No log | 48.0 | 192 | 3.2656 | 16.2611 | 14.3125 |
|
102 |
+
| No log | 49.0 | 196 | 3.2998 | 16.0785 | 15.1875 |
|
103 |
+
| No log | 50.0 | 200 | 3.3196 | 16.0785 | 14.6875 |
|
104 |
+
| No log | 51.0 | 204 | 3.3304 | 15.9095 | 15.0 |
|
105 |
+
| No log | 52.0 | 208 | 3.3312 | 16.0125 | 15.0 |
|
106 |
+
| No log | 53.0 | 212 | 3.3265 | 16.0956 | 14.5 |
|
107 |
+
| No log | 54.0 | 216 | 3.3282 | 16.2714 | 14.0625 |
|
108 |
+
| No log | 55.0 | 220 | 3.3316 | 16.2714 | 14.0625 |
|
109 |
+
| No log | 56.0 | 224 | 3.3312 | 16.2714 | 14.0625 |
|
110 |
+
| No log | 57.0 | 228 | 3.3262 | 15.8593 | 14.5 |
|
111 |
+
| No log | 58.0 | 232 | 3.3327 | 15.8672 | 14.5 |
|
112 |
+
| No log | 59.0 | 236 | 3.3157 | 15.6948 | 14.9375 |
|
113 |
+
| No log | 60.0 | 240 | 3.2849 | 15.8593 | 15.0 |
|
114 |
+
| No log | 61.0 | 244 | 3.2707 | 15.8593 | 15.0 |
|
115 |
+
| No log | 62.0 | 248 | 3.2732 | 15.8593 | 15.0625 |
|
116 |
+
| No log | 63.0 | 252 | 3.2781 | 18.4173 | 15.1875 |
|
117 |
+
| No log | 64.0 | 256 | 3.2990 | 18.6185 | 15.0 |
|
118 |
+
| No log | 65.0 | 260 | 3.3277 | 18.6185 | 14.9375 |
|
119 |
+
| No log | 66.0 | 264 | 3.3475 | 15.1975 | 14.8125 |
|
120 |
+
| No log | 67.0 | 268 | 3.3274 | 15.2762 | 14.6875 |
|
121 |
+
| No log | 68.0 | 272 | 3.3065 | 15.5165 | 14.75 |
|
122 |
+
| No log | 69.0 | 276 | 3.3111 | 18.6185 | 14.625 |
|
123 |
+
| No log | 70.0 | 280 | 3.3575 | 18.2583 | 14.6875 |
|
124 |
+
| No log | 71.0 | 284 | 3.4089 | 18.5319 | 14.875 |
|
125 |
+
| No log | 72.0 | 288 | 3.3937 | 18.6269 | 14.8125 |
|
126 |
+
| No log | 73.0 | 292 | 3.3043 | 18.6269 | 14.8125 |
|
127 |
+
| No log | 74.0 | 296 | 3.2596 | 18.7252 | 14.8125 |
|
128 |
+
| No log | 75.0 | 300 | 3.2515 | 12.9228 | 15.125 |
|
129 |
+
| No log | 76.0 | 304 | 3.2995 | 13.0338 | 15.125 |
|
130 |
+
| No log | 77.0 | 308 | 3.3457 | 12.7784 | 15.25 |
|
131 |
+
| No log | 78.0 | 312 | 3.3949 | 12.5078 | 15.375 |
|
132 |
+
| No log | 79.0 | 316 | 3.4148 | 12.5862 | 14.625 |
|
133 |
+
| No log | 80.0 | 320 | 3.4307 | 12.3785 | 14.75 |
|
134 |
+
| No log | 81.0 | 324 | 3.4095 | 11.6247 | 14.5 |
|
135 |
+
| No log | 82.0 | 328 | 3.3948 | 11.6247 | 14.5625 |
|
136 |
+
| No log | 83.0 | 332 | 3.3857 | 11.6247 | 14.4375 |
|
137 |
+
| No log | 84.0 | 336 | 3.3724 | 11.4452 | 13.875 |
|
138 |
+
| No log | 85.0 | 340 | 3.3688 | 11.4377 | 13.8125 |
|
139 |
+
| No log | 86.0 | 344 | 3.3656 | 11.4377 | 13.8125 |
|
140 |
+
| No log | 87.0 | 348 | 3.3839 | 11.4295 | 13.8125 |
|
141 |
+
| No log | 88.0 | 352 | 3.4168 | 11.1357 | 13.8125 |
|
142 |
+
| No log | 89.0 | 356 | 3.4694 | 11.1357 | 13.8125 |
|
143 |
+
| No log | 90.0 | 360 | 3.4992 | 10.5869 | 13.8125 |
|
144 |
+
| No log | 91.0 | 364 | 3.5087 | 10.5869 | 13.8125 |
|
145 |
+
| No log | 92.0 | 368 | 3.4923 | 11.0784 | 14.125 |
|
146 |
+
| No log | 93.0 | 372 | 3.4931 | 14.544 | 14.5 |
|
147 |
+
| No log | 94.0 | 376 | 3.5046 | 14.544 | 14.625 |
|
148 |
+
| No log | 95.0 | 380 | 3.5058 | 14.1526 | 14.375 |
|
149 |
+
| No log | 96.0 | 384 | 3.5057 | 13.9259 | 14.8125 |
|
150 |
+
| No log | 97.0 | 388 | 3.5107 | 13.9259 | 14.75 |
|
151 |
+
| No log | 98.0 | 392 | 3.5173 | 11.0784 | 14.25 |
|
152 |
+
| No log | 99.0 | 396 | 3.5231 | 11.0887 | 14.3125 |
|
153 |
+
| No log | 100.0 | 400 | 3.5289 | 11.2541 | 13.75 |
|
154 |
+
| No log | 101.0 | 404 | 3.5357 | 11.2541 | 13.75 |
|
155 |
+
| No log | 102.0 | 408 | 3.5417 | 11.1254 | 14.125 |
|
156 |
+
| No log | 103.0 | 412 | 3.5468 | 11.3608 | 14.25 |
|
157 |
+
| No log | 104.0 | 416 | 3.5430 | 11.3023 | 14.625 |
|
158 |
+
| No log | 105.0 | 420 | 3.5337 | 10.9245 | 14.875 |
|
159 |
+
| No log | 106.0 | 424 | 3.5247 | 10.9783 | 14.8125 |
|
160 |
+
| No log | 107.0 | 428 | 3.5199 | 10.9783 | 14.8125 |
|
161 |
+
| No log | 108.0 | 432 | 3.5172 | 10.9783 | 14.8125 |
|
162 |
+
| No log | 109.0 | 436 | 3.5164 | 11.3128 | 14.9375 |
|
163 |
+
| No log | 110.0 | 440 | 3.5167 | 11.3128 | 14.9375 |
|
164 |
+
| No log | 111.0 | 444 | 3.5178 | 11.3128 | 14.9375 |
|
165 |
+
| No log | 112.0 | 448 | 3.5201 | 11.3128 | 14.9375 |
|
166 |
+
| No log | 113.0 | 452 | 3.5232 | 11.5924 | 14.9375 |
|
167 |
+
| No log | 114.0 | 456 | 3.5264 | 11.5924 | 14.9375 |
|
168 |
+
| No log | 115.0 | 460 | 3.5210 | 11.5924 | 14.9375 |
|
169 |
+
| No log | 116.0 | 464 | 3.5163 | 11.3128 | 14.6875 |
|
170 |
+
| No log | 117.0 | 468 | 3.5180 | 11.3706 | 14.625 |
|
171 |
+
| No log | 118.0 | 472 | 3.5237 | 11.3706 | 14.625 |
|
172 |
+
| No log | 119.0 | 476 | 3.5285 | 11.6792 | 14.875 |
|
173 |
+
| No log | 120.0 | 480 | 3.5299 | 11.9509 | 14.875 |
|
174 |
+
| No log | 121.0 | 484 | 3.5301 | 11.9509 | 14.875 |
|
175 |
+
| No log | 122.0 | 488 | 3.5318 | 11.9509 | 14.875 |
|
176 |
+
| No log | 123.0 | 492 | 3.5342 | 11.9509 | 14.875 |
|
177 |
+
| No log | 124.0 | 496 | 3.5355 | 11.9509 | 14.875 |
|
178 |
+
| 0.0683 | 125.0 | 500 | 3.5385 | 11.9509 | 14.6875 |
|
179 |
+
| 0.0683 | 126.0 | 504 | 3.5422 | 11.9509 | 14.6875 |
|
180 |
+
| 0.0683 | 127.0 | 508 | 3.5454 | 11.9509 | 14.6875 |
|
181 |
+
| 0.0683 | 128.0 | 512 | 3.5490 | 11.9509 | 14.875 |
|
182 |
+
| 0.0683 | 129.0 | 516 | 3.5494 | 11.9509 | 14.6875 |
|
183 |
+
| 0.0683 | 130.0 | 520 | 3.5500 | 11.9509 | 14.6875 |
|
184 |
+
| 0.0683 | 131.0 | 524 | 3.5513 | 11.6107 | 14.6875 |
|
185 |
+
| 0.0683 | 132.0 | 528 | 3.5545 | 11.8824 | 14.6875 |
|
186 |
+
| 0.0683 | 133.0 | 532 | 3.5571 | 11.8202 | 14.6875 |
|
187 |
+
| 0.0683 | 134.0 | 536 | 3.5597 | 11.8202 | 14.875 |
|
188 |
+
| 0.0683 | 135.0 | 540 | 3.5611 | 11.8824 | 14.5625 |
|
189 |
+
| 0.0683 | 136.0 | 544 | 3.5629 | 11.8824 | 14.5625 |
|
190 |
+
| 0.0683 | 137.0 | 548 | 3.5666 | 11.8824 | 14.5625 |
|
191 |
+
| 0.0683 | 138.0 | 552 | 3.5715 | 11.8824 | 14.5625 |
|
192 |
+
| 0.0683 | 139.0 | 556 | 3.5762 | 11.8824 | 14.5625 |
|
193 |
+
| 0.0683 | 140.0 | 560 | 3.5789 | 11.8824 | 14.5625 |
|
194 |
+
| 0.0683 | 141.0 | 564 | 3.5807 | 11.8824 | 14.5625 |
|
195 |
+
| 0.0683 | 142.0 | 568 | 3.5858 | 11.8824 | 14.5625 |
|
196 |
+
| 0.0683 | 143.0 | 572 | 3.5902 | 11.8202 | 14.875 |
|
197 |
+
| 0.0683 | 144.0 | 576 | 3.5886 | 11.5499 | 14.875 |
|
198 |
+
| 0.0683 | 145.0 | 580 | 3.5877 | 11.5499 | 14.875 |
|
199 |
+
| 0.0683 | 146.0 | 584 | 3.5866 | 11.6107 | 14.875 |
|
200 |
+
| 0.0683 | 147.0 | 588 | 3.5875 | 11.6107 | 14.875 |
|
201 |
+
| 0.0683 | 148.0 | 592 | 3.5892 | 11.6107 | 14.875 |
|
202 |
+
| 0.0683 | 149.0 | 596 | 3.5951 | 11.6792 | 14.875 |
|
203 |
+
| 0.0683 | 150.0 | 600 | 3.6008 | 11.6792 | 14.875 |
|
204 |
+
| 0.0683 | 151.0 | 604 | 3.6067 | 11.6792 | 14.875 |
|
205 |
+
| 0.0683 | 152.0 | 608 | 3.5964 | 11.6107 | 14.875 |
|
206 |
+
| 0.0683 | 153.0 | 612 | 3.5930 | 11.6107 | 14.875 |
|
207 |
+
| 0.0683 | 154.0 | 616 | 3.5945 | 11.5499 | 15.125 |
|
208 |
+
| 0.0683 | 155.0 | 620 | 3.5948 | 11.5499 | 15.125 |
|
209 |
+
| 0.0683 | 156.0 | 624 | 3.5953 | 11.6107 | 14.875 |
|
210 |
+
| 0.0683 | 157.0 | 628 | 3.5990 | 11.6107 | 14.875 |
|
211 |
+
| 0.0683 | 158.0 | 632 | 3.6028 | 11.6107 | 14.875 |
|
212 |
+
| 0.0683 | 159.0 | 636 | 3.6059 | 11.6026 | 14.875 |
|
213 |
+
| 0.0683 | 160.0 | 640 | 3.6090 | 11.6026 | 14.875 |
|
214 |
+
| 0.0683 | 161.0 | 644 | 3.6104 | 11.6026 | 14.875 |
|
215 |
+
| 0.0683 | 162.0 | 648 | 3.6114 | 11.6026 | 14.875 |
|
216 |
+
| 0.0683 | 163.0 | 652 | 3.6129 | 11.6026 | 14.875 |
|
217 |
+
| 0.0683 | 164.0 | 656 | 3.6135 | 11.6026 | 14.875 |
|
218 |
+
| 0.0683 | 165.0 | 660 | 3.6145 | 11.6026 | 14.875 |
|
219 |
+
| 0.0683 | 166.0 | 664 | 3.6152 | 11.6026 | 14.875 |
|
220 |
+
| 0.0683 | 167.0 | 668 | 3.6175 | 11.6026 | 14.875 |
|
221 |
+
| 0.0683 | 168.0 | 672 | 3.6140 | 11.6026 | 14.875 |
|
222 |
+
| 0.0683 | 169.0 | 676 | 3.6140 | 11.6026 | 14.875 |
|
223 |
+
| 0.0683 | 170.0 | 680 | 3.6159 | 11.3715 | 14.875 |
|
224 |
+
| 0.0683 | 171.0 | 684 | 3.6162 | 11.3715 | 14.875 |
|
225 |
+
| 0.0683 | 172.0 | 688 | 3.6174 | 11.3715 | 14.875 |
|
226 |
+
| 0.0683 | 173.0 | 692 | 3.6192 | 11.3715 | 14.875 |
|
227 |
+
| 0.0683 | 174.0 | 696 | 3.6209 | 11.3715 | 14.875 |
|
228 |
+
| 0.0683 | 175.0 | 700 | 3.6219 | 11.3715 | 14.875 |
|
229 |
+
| 0.0683 | 176.0 | 704 | 3.6239 | 11.3715 | 14.875 |
|
230 |
+
| 0.0683 | 177.0 | 708 | 3.6266 | 11.3715 | 14.875 |
|
231 |
+
| 0.0683 | 178.0 | 712 | 3.6308 | 11.3715 | 14.875 |
|
232 |
+
| 0.0683 | 179.0 | 716 | 3.6316 | 11.3715 | 14.875 |
|
233 |
+
| 0.0683 | 180.0 | 720 | 3.6321 | 11.6026 | 14.875 |
|
234 |
+
| 0.0683 | 181.0 | 724 | 3.6322 | 11.6026 | 14.875 |
|
235 |
+
| 0.0683 | 182.0 | 728 | 3.6319 | 11.8757 | 14.875 |
|
236 |
+
| 0.0683 | 183.0 | 732 | 3.6319 | 11.6577 | 14.875 |
|
237 |
+
| 0.0683 | 184.0 | 736 | 3.6293 | 11.8757 | 14.875 |
|
238 |
+
| 0.0683 | 185.0 | 740 | 3.6229 | 11.8757 | 14.875 |
|
239 |
+
| 0.0683 | 186.0 | 744 | 3.6186 | 11.8757 | 14.875 |
|
240 |
+
| 0.0683 | 187.0 | 748 | 3.6166 | 11.8757 | 14.875 |
|
241 |
+
| 0.0683 | 188.0 | 752 | 3.6165 | 11.8757 | 14.875 |
|
242 |
+
| 0.0683 | 189.0 | 756 | 3.6193 | 11.8757 | 14.875 |
|
243 |
+
| 0.0683 | 190.0 | 760 | 3.6216 | 11.8757 | 14.875 |
|
244 |
+
| 0.0683 | 191.0 | 764 | 3.6239 | 11.8757 | 14.875 |
|
245 |
+
| 0.0683 | 192.0 | 768 | 3.6265 | 11.8757 | 14.875 |
|
246 |
+
| 0.0683 | 193.0 | 772 | 3.6284 | 11.8757 | 14.875 |
|
247 |
+
| 0.0683 | 194.0 | 776 | 3.6301 | 11.8684 | 14.8125 |
|
248 |
+
| 0.0683 | 195.0 | 780 | 3.6319 | 11.8684 | 14.8125 |
|
249 |
+
| 0.0683 | 196.0 | 784 | 3.6341 | 11.8684 | 14.8125 |
|
250 |
+
| 0.0683 | 197.0 | 788 | 3.6364 | 11.8684 | 14.8125 |
|
251 |
+
| 0.0683 | 198.0 | 792 | 3.6386 | 11.8684 | 14.8125 |
|
252 |
+
| 0.0683 | 199.0 | 796 | 3.6418 | 11.8757 | 14.8125 |
|
253 |
+
| 0.0683 | 200.0 | 800 | 3.6447 | 11.8757 | 14.8125 |
|
254 |
+
| 0.0683 | 201.0 | 804 | 3.6463 | 12.1401 | 14.8125 |
|
255 |
+
| 0.0683 | 202.0 | 808 | 3.6476 | 12.1401 | 14.8125 |
|
256 |
+
| 0.0683 | 203.0 | 812 | 3.6496 | 11.9402 | 14.5625 |
|
257 |
+
| 0.0683 | 204.0 | 816 | 3.6518 | 12.0061 | 14.1875 |
|
258 |
+
| 0.0683 | 205.0 | 820 | 3.6544 | 12.0061 | 14.1875 |
|
259 |
+
| 0.0683 | 206.0 | 824 | 3.6561 | 12.0061 | 14.1875 |
|
260 |
+
| 0.0683 | 207.0 | 828 | 3.6574 | 12.206 | 14.3125 |
|
261 |
+
| 0.0683 | 208.0 | 832 | 3.6588 | 12.1401 | 14.6875 |
|
262 |
+
| 0.0683 | 209.0 | 836 | 3.6603 | 12.1401 | 14.6875 |
|
263 |
+
| 0.0683 | 210.0 | 840 | 3.6612 | 12.1401 | 14.6875 |
|
264 |
+
| 0.0683 | 211.0 | 844 | 3.6620 | 12.1401 | 14.6875 |
|
265 |
+
| 0.0683 | 212.0 | 848 | 3.6628 | 12.1401 | 14.6875 |
|
266 |
+
| 0.0683 | 213.0 | 852 | 3.6628 | 12.1401 | 14.6875 |
|
267 |
+
| 0.0683 | 214.0 | 856 | 3.6633 | 11.8757 | 14.6875 |
|
268 |
+
| 0.0683 | 215.0 | 860 | 3.6648 | 11.8757 | 14.6875 |
|
269 |
+
| 0.0683 | 216.0 | 864 | 3.6665 | 11.8757 | 14.6875 |
|
270 |
+
| 0.0683 | 217.0 | 868 | 3.6678 | 11.8044 | 14.75 |
|
271 |
+
| 0.0683 | 218.0 | 872 | 3.6690 | 11.8044 | 14.75 |
|
272 |
+
| 0.0683 | 219.0 | 876 | 3.6699 | 11.8044 | 14.75 |
|
273 |
+
| 0.0683 | 220.0 | 880 | 3.6693 | 11.8044 | 14.75 |
|
274 |
+
| 0.0683 | 221.0 | 884 | 3.6689 | 11.8757 | 14.6875 |
|
275 |
+
| 0.0683 | 222.0 | 888 | 3.6687 | 11.8757 | 14.8125 |
|
276 |
+
| 0.0683 | 223.0 | 892 | 3.6687 | 11.8757 | 14.8125 |
|
277 |
+
| 0.0683 | 224.0 | 896 | 3.6690 | 11.8757 | 14.8125 |
|
278 |
+
| 0.0683 | 225.0 | 900 | 3.6662 | 11.8757 | 14.8125 |
|
279 |
+
| 0.0683 | 226.0 | 904 | 3.6609 | 11.8757 | 14.8125 |
|
280 |
+
| 0.0683 | 227.0 | 908 | 3.6561 | 11.8757 | 14.8125 |
|
281 |
+
| 0.0683 | 228.0 | 912 | 3.6536 | 11.8757 | 14.8125 |
|
282 |
+
| 0.0683 | 229.0 | 916 | 3.6522 | 11.8757 | 14.8125 |
|
283 |
+
| 0.0683 | 230.0 | 920 | 3.6515 | 11.8757 | 14.8125 |
|
284 |
+
| 0.0683 | 231.0 | 924 | 3.6526 | 11.8757 | 14.8125 |
|
285 |
+
| 0.0683 | 232.0 | 928 | 3.6532 | 11.8757 | 14.8125 |
|
286 |
+
| 0.0683 | 233.0 | 932 | 3.6537 | 11.8757 | 14.8125 |
|
287 |
+
| 0.0683 | 234.0 | 936 | 3.6536 | 11.8757 | 14.8125 |
|
288 |
+
| 0.0683 | 235.0 | 940 | 3.6540 | 11.8757 | 14.8125 |
|
289 |
+
| 0.0683 | 236.0 | 944 | 3.6540 | 11.8757 | 14.8125 |
|
290 |
+
| 0.0683 | 237.0 | 948 | 3.6540 | 11.8757 | 14.8125 |
|
291 |
+
| 0.0683 | 238.0 | 952 | 3.6545 | 11.8757 | 14.8125 |
|
292 |
+
| 0.0683 | 239.0 | 956 | 3.6553 | 11.8757 | 14.8125 |
|
293 |
+
| 0.0683 | 240.0 | 960 | 3.6557 | 11.8757 | 14.8125 |
|
294 |
+
| 0.0683 | 241.0 | 964 | 3.6563 | 11.8757 | 14.8125 |
|
295 |
+
| 0.0683 | 242.0 | 968 | 3.6573 | 11.8757 | 14.8125 |
|
296 |
+
| 0.0683 | 243.0 | 972 | 3.6579 | 11.8757 | 14.8125 |
|
297 |
+
| 0.0683 | 244.0 | 976 | 3.6583 | 11.8757 | 14.8125 |
|
298 |
+
| 0.0683 | 245.0 | 980 | 3.6594 | 11.8757 | 14.8125 |
|
299 |
+
| 0.0683 | 246.0 | 984 | 3.6599 | 11.8757 | 14.8125 |
|
300 |
+
| 0.0683 | 247.0 | 988 | 3.6606 | 11.8757 | 14.8125 |
|
301 |
+
| 0.0683 | 248.0 | 992 | 3.6513 | 11.8757 | 14.8125 |
|
302 |
+
| 0.0683 | 249.0 | 996 | 3.6454 | 11.8757 | 14.8125 |
|
303 |
+
| 0.0005 | 250.0 | 1000 | 3.6429 | 11.8757 | 14.8125 |
|
304 |
+
| 0.0005 | 251.0 | 1004 | 3.6415 | 11.8757 | 14.8125 |
|
305 |
+
| 0.0005 | 252.0 | 1008 | 3.6403 | 11.8757 | 14.8125 |
|
306 |
+
| 0.0005 | 253.0 | 1012 | 3.6400 | 11.8757 | 14.8125 |
|
307 |
+
| 0.0005 | 254.0 | 1016 | 3.6410 | 11.8757 | 14.8125 |
|
308 |
+
| 0.0005 | 255.0 | 1020 | 3.6418 | 11.8757 | 14.8125 |
|
309 |
+
| 0.0005 | 256.0 | 1024 | 3.6430 | 11.8044 | 14.75 |
|
310 |
+
| 0.0005 | 257.0 | 1028 | 3.6441 | 11.8044 | 14.75 |
|
311 |
+
| 0.0005 | 258.0 | 1032 | 3.6455 | 11.8044 | 14.75 |
|
312 |
+
| 0.0005 | 259.0 | 1036 | 3.6463 | 11.8044 | 14.75 |
|
313 |
+
| 0.0005 | 260.0 | 1040 | 3.6471 | 11.8044 | 14.75 |
|
314 |
+
| 0.0005 | 261.0 | 1044 | 3.6478 | 11.8044 | 14.75 |
|
315 |
+
| 0.0005 | 262.0 | 1048 | 3.6487 | 11.8044 | 14.75 |
|
316 |
+
| 0.0005 | 263.0 | 1052 | 3.6499 | 11.8044 | 14.75 |
|
317 |
+
| 0.0005 | 264.0 | 1056 | 3.6509 | 11.8044 | 14.75 |
|
318 |
+
| 0.0005 | 265.0 | 1060 | 3.6516 | 11.8044 | 14.75 |
|
319 |
+
| 0.0005 | 266.0 | 1064 | 3.6518 | 11.8044 | 14.75 |
|
320 |
+
| 0.0005 | 267.0 | 1068 | 3.6522 | 11.8044 | 14.75 |
|
321 |
+
| 0.0005 | 268.0 | 1072 | 3.6524 | 11.8044 | 14.75 |
|
322 |
+
| 0.0005 | 269.0 | 1076 | 3.6533 | 11.8044 | 14.75 |
|
323 |
+
| 0.0005 | 270.0 | 1080 | 3.6535 | 11.8044 | 14.75 |
|
324 |
+
| 0.0005 | 271.0 | 1084 | 3.6543 | 11.8044 | 14.75 |
|
325 |
+
| 0.0005 | 272.0 | 1088 | 3.6551 | 11.8044 | 14.75 |
|
326 |
+
| 0.0005 | 273.0 | 1092 | 3.6554 | 11.8044 | 14.75 |
|
327 |
+
| 0.0005 | 274.0 | 1096 | 3.6559 | 11.8044 | 14.75 |
|
328 |
+
| 0.0005 | 275.0 | 1100 | 3.6558 | 11.8044 | 14.75 |
|
329 |
+
| 0.0005 | 276.0 | 1104 | 3.6563 | 11.8044 | 14.75 |
|
330 |
+
| 0.0005 | 277.0 | 1108 | 3.6567 | 11.8044 | 14.75 |
|
331 |
+
| 0.0005 | 278.0 | 1112 | 3.6568 | 11.8044 | 14.75 |
|
332 |
+
| 0.0005 | 279.0 | 1116 | 3.6570 | 11.8044 | 14.75 |
|
333 |
+
| 0.0005 | 280.0 | 1120 | 3.6573 | 11.8044 | 14.75 |
|
334 |
+
| 0.0005 | 281.0 | 1124 | 3.6575 | 11.8044 | 14.75 |
|
335 |
+
| 0.0005 | 282.0 | 1128 | 3.6575 | 11.8044 | 14.75 |
|
336 |
+
| 0.0005 | 283.0 | 1132 | 3.6574 | 11.8044 | 14.75 |
|
337 |
+
| 0.0005 | 284.0 | 1136 | 3.6574 | 11.8044 | 14.75 |
|
338 |
+
| 0.0005 | 285.0 | 1140 | 3.6580 | 11.8044 | 14.75 |
|
339 |
+
| 0.0005 | 286.0 | 1144 | 3.6579 | 11.8044 | 14.75 |
|
340 |
+
| 0.0005 | 287.0 | 1148 | 3.6583 | 11.8044 | 14.75 |
|
341 |
+
| 0.0005 | 288.0 | 1152 | 3.6583 | 11.8044 | 14.75 |
|
342 |
+
| 0.0005 | 289.0 | 1156 | 3.6589 | 11.8044 | 14.75 |
|
343 |
+
| 0.0005 | 290.0 | 1160 | 3.6588 | 11.8044 | 14.75 |
|
344 |
+
| 0.0005 | 291.0 | 1164 | 3.6587 | 11.8044 | 14.75 |
|
345 |
+
| 0.0005 | 292.0 | 1168 | 3.6588 | 11.8044 | 14.75 |
|
346 |
+
| 0.0005 | 293.0 | 1172 | 3.6592 | 11.8044 | 14.75 |
|
347 |
+
| 0.0005 | 294.0 | 1176 | 3.6590 | 11.8044 | 14.75 |
|
348 |
+
| 0.0005 | 295.0 | 1180 | 3.6592 | 11.8044 | 14.75 |
|
349 |
+
| 0.0005 | 296.0 | 1184 | 3.6593 | 11.8044 | 14.75 |
|
350 |
+
| 0.0005 | 297.0 | 1188 | 3.6593 | 11.8044 | 14.75 |
|
351 |
+
| 0.0005 | 298.0 | 1192 | 3.6598 | 11.8044 | 14.75 |
|
352 |
+
| 0.0005 | 299.0 | 1196 | 3.6597 | 11.8044 | 14.75 |
|
353 |
+
| 0.0005 | 300.0 | 1200 | 3.6597 | 11.8044 | 14.75 |
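The reported evaluation numbers come from the last row (epoch 300), yet the table's lowest validation loss (2.2289) appears at epoch 3 and its highest BLEU (18.7252) at epoch 74. A minimal sketch for locating such rows programmatically follows. It parses a Markdown table of the same shape; `parse_results` and `SAMPLE` are hypothetical names, and `SAMPLE` holds only three rows excerpted from the table above.

```python
from typing import Dict, List

# Three rows copied from the training results table (excerpt only).
SAMPLE = """\
| Training Loss | Epoch | Step | Validation Loss | Bleu    | Gen Len |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
| No log        | 3.0   | 12   | 2.2289          | 8.4749  | 14.5625 |
| No log        | 74.0  | 296  | 3.2596          | 18.7252 | 14.8125 |
| 0.0005        | 300.0 | 1200 | 3.6597          | 11.8044 | 14.75   |
"""


def parse_results(table: str) -> List[Dict[str, float]]:
    """Parse a Markdown results table into one dict of floats per row."""
    lines = [ln.strip() for ln in table.splitlines() if ln.strip().startswith("|")]
    header = [c.strip() for c in lines[0].strip("|").split("|")]
    rows = []
    for ln in lines[2:]:  # skip the header row and the alignment row
        cells = [c.strip() for c in ln.strip("|").split("|")]
        row = {}
        for name, cell in zip(header, cells):
            if name == "Training Loss":
                continue  # this column may hold the text "No log"
            row[name] = float(cell)
        rows.append(row)
    return rows


rows = parse_results(SAMPLE)
best_loss = min(rows, key=lambda r: r["Validation Loss"])
best_bleu = max(rows, key=lambda r: r["Bleu"])
print("lowest validation loss at epoch", best_loss["Epoch"])
print("highest BLEU at epoch", best_bleu["Epoch"])
```

Passing the full table text instead of `SAMPLE` applies the same selection to all 300 epochs.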
|
354 |
|
355 |
|
356 |
### Framework versions
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 242041896
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:1d37d09f96de4fe47d53609d29fe3756a3133356c80a96d54d23ec7825b82e44
|
3 |
size 242041896
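The three lines above form a Git LFS pointer: a spec version, an object ID with its hash algorithm, and the size in bytes of the real file. The sketch below splits such a pointer into fields, assuming one `key value` pair per line as shown here. `parse_lfs_pointer` is a hypothetical helper, not part of the Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer file into its version, oid, and size fields."""
    # Each non-empty line is "<key> <value>", separated by the first space.
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer content copied from the new side of the model.safetensors diff.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:1d37d09f96de4fe47d53609d29fe3756a3133356c80a96d54d23ec7825b82e44
size 242041896
"""

info = parse_lfs_pointer(POINTER)
print(info["hash_algo"], info["size"])
```

Because the size is 242041896 on both sides of the diff, only the `oid` line changes when the weights are retrained.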
|
runs/Jan06_17-30-38_9039a63683b7/events.out.tfevents.1704562239.9039a63683b7.4951.1
ADDED
@@ -0,0 +1,3 @@
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5fc19c1b7407e0a2d4814d2c4c6b90e6b565f458ebe37bcef945a0883f5ff96f
|
3 |
+
size 116743
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4792
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:68ad0a8143f67158af5d1a9f9ae5810a9694089669796406adfe86056b50c338
|
3 |
size 4792
|