youdiniplays commited on
Commit
04a0a6f
·
1 Parent(s): de1c0a2

End of training

Browse files
README.md CHANGED
@@ -17,9 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [youdiniplays/filipinolingo_model](https://huggingface.co/youdiniplays/filipinolingo_model) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 3.6977
21
- - Bleu: 3.3165
22
- - Gen Len: 13.6875
23
 
24
  ## Model description
25
 
@@ -38,39 +38,319 @@ More information needed
38
  ### Training hyperparameters
39
 
40
  The following hyperparameters were used during training:
41
- - learning_rate: 2e-05
42
  - train_batch_size: 16
43
  - eval_batch_size: 16
44
  - seed: 42
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: linear
47
- - num_epochs: 20
48
  - mixed_precision_training: Native AMP
49
 
50
  ### Training results
51
 
52
- | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
53
- |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
54
- | No log | 1.0 | 4 | 4.1415 | 3.2885 | 13.8125 |
55
- | No log | 2.0 | 8 | 4.1011 | 3.2885 | 13.8125 |
56
- | No log | 3.0 | 12 | 4.0519 | 3.2885 | 13.8125 |
57
- | No log | 4.0 | 16 | 4.0073 | 3.2885 | 13.875 |
58
- | No log | 5.0 | 20 | 3.9667 | 3.2885 | 13.875 |
59
- | No log | 6.0 | 24 | 3.9285 | 3.2885 | 13.875 |
60
- | No log | 7.0 | 28 | 3.8955 | 3.2885 | 13.875 |
61
- | No log | 8.0 | 32 | 3.8654 | 3.2885 | 13.875 |
62
- | No log | 9.0 | 36 | 3.8379 | 3.2885 | 13.875 |
63
- | No log | 10.0 | 40 | 3.8128 | 3.3165 | 13.875 |
64
- | No log | 11.0 | 44 | 3.7902 | 3.3165 | 13.9375 |
65
- | No log | 12.0 | 48 | 3.7710 | 3.3165 | 13.6875 |
66
- | No log | 13.0 | 52 | 3.7543 | 3.3165 | 13.6875 |
67
- | No log | 14.0 | 56 | 3.7402 | 3.3165 | 13.6875 |
68
- | No log | 15.0 | 60 | 3.7279 | 3.3165 | 13.6875 |
69
- | No log | 16.0 | 64 | 3.7177 | 3.3165 | 13.6875 |
70
- | No log | 17.0 | 68 | 3.7094 | 3.3165 | 13.6875 |
71
- | No log | 18.0 | 72 | 3.7033 | 3.3165 | 13.6875 |
72
- | No log | 19.0 | 76 | 3.6994 | 3.3165 | 13.6875 |
73
- | No log | 20.0 | 80 | 3.6977 | 3.3165 | 13.6875 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
74
 
75
 
76
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [youdiniplays/filipinolingo_model](https://huggingface.co/youdiniplays/filipinolingo_model) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 3.6597
21
+ - Bleu: 11.8044
22
+ - Gen Len: 14.75
23
 
24
  ## Model description
25
 
 
38
  ### Training hyperparameters
39
 
40
  The following hyperparameters were used during training:
41
+ - learning_rate: 0.001
42
  - train_batch_size: 16
43
  - eval_batch_size: 16
44
  - seed: 42
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: linear
47
+ - num_epochs: 300
48
  - mixed_precision_training: Native AMP
49
 
50
  ### Training results
51
 
52
+ | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
53
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
54
+ | No log | 1.0 | 4 | 2.6992 | 3.5276 | 13.75 |
55
+ | No log | 2.0 | 8 | 2.3483 | 6.8767 | 14.1875 |
56
+ | No log | 3.0 | 12 | 2.2289 | 8.4749 | 14.5625 |
57
+ | No log | 4.0 | 16 | 2.2552 | 8.537 | 14.375 |
58
+ | No log | 5.0 | 20 | 2.3404 | 9.3451 | 13.875 |
59
+ | No log | 6.0 | 24 | 2.5126 | 15.043 | 14.0625 |
60
+ | No log | 7.0 | 28 | 2.7072 | 14.9624 | 14.125 |
61
+ | No log | 8.0 | 32 | 2.8633 | 14.8092 | 14.3125 |
62
+ | No log | 9.0 | 36 | 2.9499 | 15.0385 | 14.125 |
63
+ | No log | 10.0 | 40 | 2.9954 | 9.0751 | 14.1875 |
64
+ | No log | 11.0 | 44 | 3.0306 | 8.321 | 14.125 |
65
+ | No log | 12.0 | 48 | 3.0640 | 8.5338 | 14.0625 |
66
+ | No log | 13.0 | 52 | 3.0869 | 8.5302 | 14.0625 |
67
+ | No log | 14.0 | 56 | 3.1138 | 8.3647 | 14.125 |
68
+ | No log | 15.0 | 60 | 3.1254 | 8.5765 | 13.9375 |
69
+ | No log | 16.0 | 64 | 3.1203 | 8.5302 | 14.0625 |
70
+ | No log | 17.0 | 68 | 3.1250 | 12.0182 | 14.1875 |
71
+ | No log | 18.0 | 72 | 3.1192 | 12.0182 | 14.1875 |
72
+ | No log | 19.0 | 76 | 3.1231 | 8.5338 | 14.1875 |
73
+ | No log | 20.0 | 80 | 3.1155 | 11.9388 | 13.875 |
74
+ | No log | 21.0 | 84 | 3.1176 | 11.9402 | 13.875 |
75
+ | No log | 22.0 | 88 | 3.1295 | 11.9402 | 13.875 |
76
+ | No log | 23.0 | 92 | 3.1487 | 11.9402 | 13.875 |
77
+ | No log | 24.0 | 96 | 3.1673 | 12.1489 | 13.875 |
78
+ | No log | 25.0 | 100 | 3.1859 | 16.2159 | 13.875 |
79
+ | No log | 26.0 | 104 | 3.2061 | 15.9711 | 13.8125 |
80
+ | No log | 27.0 | 108 | 3.2147 | 15.9711 | 13.8125 |
81
+ | No log | 28.0 | 112 | 3.2223 | 15.9711 | 13.8125 |
82
+ | No log | 29.0 | 116 | 3.2345 | 16.2159 | 13.8125 |
83
+ | No log | 30.0 | 120 | 3.2414 | 16.1289 | 13.8125 |
84
+ | No log | 31.0 | 124 | 3.2448 | 16.1261 | 13.8125 |
85
+ | No log | 32.0 | 128 | 3.2446 | 16.1261 | 13.8125 |
86
+ | No log | 33.0 | 132 | 3.2307 | 15.8836 | 13.75 |
87
+ | No log | 34.0 | 136 | 3.2247 | 15.8803 | 13.75 |
88
+ | No log | 35.0 | 140 | 3.2364 | 15.8803 | 13.75 |
89
+ | No log | 36.0 | 144 | 3.2507 | 16.1261 | 13.8125 |
90
+ | No log | 37.0 | 148 | 3.2608 | 16.1261 | 13.8125 |
91
+ | No log | 38.0 | 152 | 3.2893 | 16.536 | 13.8125 |
92
+ | No log | 39.0 | 156 | 3.3026 | 16.3582 | 13.8125 |
93
+ | No log | 40.0 | 160 | 3.2786 | 16.3582 | 13.9375 |
94
+ | No log | 41.0 | 164 | 3.2607 | 16.3548 | 14.0 |
95
+ | No log | 42.0 | 168 | 3.2557 | 16.4428 | 14.0 |
96
+ | No log | 43.0 | 172 | 3.2648 | 16.1734 | 14.1875 |
97
+ | No log | 44.0 | 176 | 3.2455 | 12.2013 | 14.375 |
98
+ | No log | 45.0 | 180 | 3.2444 | 12.2013 | 14.375 |
99
+ | No log | 46.0 | 184 | 3.2416 | 12.2013 | 14.375 |
100
+ | No log | 47.0 | 188 | 3.2412 | 11.8127 | 14.375 |
101
+ | No log | 48.0 | 192 | 3.2656 | 16.2611 | 14.3125 |
102
+ | No log | 49.0 | 196 | 3.2998 | 16.0785 | 15.1875 |
103
+ | No log | 50.0 | 200 | 3.3196 | 16.0785 | 14.6875 |
104
+ | No log | 51.0 | 204 | 3.3304 | 15.9095 | 15.0 |
105
+ | No log | 52.0 | 208 | 3.3312 | 16.0125 | 15.0 |
106
+ | No log | 53.0 | 212 | 3.3265 | 16.0956 | 14.5 |
107
+ | No log | 54.0 | 216 | 3.3282 | 16.2714 | 14.0625 |
108
+ | No log | 55.0 | 220 | 3.3316 | 16.2714 | 14.0625 |
109
+ | No log | 56.0 | 224 | 3.3312 | 16.2714 | 14.0625 |
110
+ | No log | 57.0 | 228 | 3.3262 | 15.8593 | 14.5 |
111
+ | No log | 58.0 | 232 | 3.3327 | 15.8672 | 14.5 |
112
+ | No log | 59.0 | 236 | 3.3157 | 15.6948 | 14.9375 |
113
+ | No log | 60.0 | 240 | 3.2849 | 15.8593 | 15.0 |
114
+ | No log | 61.0 | 244 | 3.2707 | 15.8593 | 15.0 |
115
+ | No log | 62.0 | 248 | 3.2732 | 15.8593 | 15.0625 |
116
+ | No log | 63.0 | 252 | 3.2781 | 18.4173 | 15.1875 |
117
+ | No log | 64.0 | 256 | 3.2990 | 18.6185 | 15.0 |
118
+ | No log | 65.0 | 260 | 3.3277 | 18.6185 | 14.9375 |
119
+ | No log | 66.0 | 264 | 3.3475 | 15.1975 | 14.8125 |
120
+ | No log | 67.0 | 268 | 3.3274 | 15.2762 | 14.6875 |
121
+ | No log | 68.0 | 272 | 3.3065 | 15.5165 | 14.75 |
122
+ | No log | 69.0 | 276 | 3.3111 | 18.6185 | 14.625 |
123
+ | No log | 70.0 | 280 | 3.3575 | 18.2583 | 14.6875 |
124
+ | No log | 71.0 | 284 | 3.4089 | 18.5319 | 14.875 |
125
+ | No log | 72.0 | 288 | 3.3937 | 18.6269 | 14.8125 |
126
+ | No log | 73.0 | 292 | 3.3043 | 18.6269 | 14.8125 |
127
+ | No log | 74.0 | 296 | 3.2596 | 18.7252 | 14.8125 |
128
+ | No log | 75.0 | 300 | 3.2515 | 12.9228 | 15.125 |
129
+ | No log | 76.0 | 304 | 3.2995 | 13.0338 | 15.125 |
130
+ | No log | 77.0 | 308 | 3.3457 | 12.7784 | 15.25 |
131
+ | No log | 78.0 | 312 | 3.3949 | 12.5078 | 15.375 |
132
+ | No log | 79.0 | 316 | 3.4148 | 12.5862 | 14.625 |
133
+ | No log | 80.0 | 320 | 3.4307 | 12.3785 | 14.75 |
134
+ | No log | 81.0 | 324 | 3.4095 | 11.6247 | 14.5 |
135
+ | No log | 82.0 | 328 | 3.3948 | 11.6247 | 14.5625 |
136
+ | No log | 83.0 | 332 | 3.3857 | 11.6247 | 14.4375 |
137
+ | No log | 84.0 | 336 | 3.3724 | 11.4452 | 13.875 |
138
+ | No log | 85.0 | 340 | 3.3688 | 11.4377 | 13.8125 |
139
+ | No log | 86.0 | 344 | 3.3656 | 11.4377 | 13.8125 |
140
+ | No log | 87.0 | 348 | 3.3839 | 11.4295 | 13.8125 |
141
+ | No log | 88.0 | 352 | 3.4168 | 11.1357 | 13.8125 |
142
+ | No log | 89.0 | 356 | 3.4694 | 11.1357 | 13.8125 |
143
+ | No log | 90.0 | 360 | 3.4992 | 10.5869 | 13.8125 |
144
+ | No log | 91.0 | 364 | 3.5087 | 10.5869 | 13.8125 |
145
+ | No log | 92.0 | 368 | 3.4923 | 11.0784 | 14.125 |
146
+ | No log | 93.0 | 372 | 3.4931 | 14.544 | 14.5 |
147
+ | No log | 94.0 | 376 | 3.5046 | 14.544 | 14.625 |
148
+ | No log | 95.0 | 380 | 3.5058 | 14.1526 | 14.375 |
149
+ | No log | 96.0 | 384 | 3.5057 | 13.9259 | 14.8125 |
150
+ | No log | 97.0 | 388 | 3.5107 | 13.9259 | 14.75 |
151
+ | No log | 98.0 | 392 | 3.5173 | 11.0784 | 14.25 |
152
+ | No log | 99.0 | 396 | 3.5231 | 11.0887 | 14.3125 |
153
+ | No log | 100.0 | 400 | 3.5289 | 11.2541 | 13.75 |
154
+ | No log | 101.0 | 404 | 3.5357 | 11.2541 | 13.75 |
155
+ | No log | 102.0 | 408 | 3.5417 | 11.1254 | 14.125 |
156
+ | No log | 103.0 | 412 | 3.5468 | 11.3608 | 14.25 |
157
+ | No log | 104.0 | 416 | 3.5430 | 11.3023 | 14.625 |
158
+ | No log | 105.0 | 420 | 3.5337 | 10.9245 | 14.875 |
159
+ | No log | 106.0 | 424 | 3.5247 | 10.9783 | 14.8125 |
160
+ | No log | 107.0 | 428 | 3.5199 | 10.9783 | 14.8125 |
161
+ | No log | 108.0 | 432 | 3.5172 | 10.9783 | 14.8125 |
162
+ | No log | 109.0 | 436 | 3.5164 | 11.3128 | 14.9375 |
163
+ | No log | 110.0 | 440 | 3.5167 | 11.3128 | 14.9375 |
164
+ | No log | 111.0 | 444 | 3.5178 | 11.3128 | 14.9375 |
165
+ | No log | 112.0 | 448 | 3.5201 | 11.3128 | 14.9375 |
166
+ | No log | 113.0 | 452 | 3.5232 | 11.5924 | 14.9375 |
167
+ | No log | 114.0 | 456 | 3.5264 | 11.5924 | 14.9375 |
168
+ | No log | 115.0 | 460 | 3.5210 | 11.5924 | 14.9375 |
169
+ | No log | 116.0 | 464 | 3.5163 | 11.3128 | 14.6875 |
170
+ | No log | 117.0 | 468 | 3.5180 | 11.3706 | 14.625 |
171
+ | No log | 118.0 | 472 | 3.5237 | 11.3706 | 14.625 |
172
+ | No log | 119.0 | 476 | 3.5285 | 11.6792 | 14.875 |
173
+ | No log | 120.0 | 480 | 3.5299 | 11.9509 | 14.875 |
174
+ | No log | 121.0 | 484 | 3.5301 | 11.9509 | 14.875 |
175
+ | No log | 122.0 | 488 | 3.5318 | 11.9509 | 14.875 |
176
+ | No log | 123.0 | 492 | 3.5342 | 11.9509 | 14.875 |
177
+ | No log | 124.0 | 496 | 3.5355 | 11.9509 | 14.875 |
178
+ | 0.0683 | 125.0 | 500 | 3.5385 | 11.9509 | 14.6875 |
179
+ | 0.0683 | 126.0 | 504 | 3.5422 | 11.9509 | 14.6875 |
180
+ | 0.0683 | 127.0 | 508 | 3.5454 | 11.9509 | 14.6875 |
181
+ | 0.0683 | 128.0 | 512 | 3.5490 | 11.9509 | 14.875 |
182
+ | 0.0683 | 129.0 | 516 | 3.5494 | 11.9509 | 14.6875 |
183
+ | 0.0683 | 130.0 | 520 | 3.5500 | 11.9509 | 14.6875 |
184
+ | 0.0683 | 131.0 | 524 | 3.5513 | 11.6107 | 14.6875 |
185
+ | 0.0683 | 132.0 | 528 | 3.5545 | 11.8824 | 14.6875 |
186
+ | 0.0683 | 133.0 | 532 | 3.5571 | 11.8202 | 14.6875 |
187
+ | 0.0683 | 134.0 | 536 | 3.5597 | 11.8202 | 14.875 |
188
+ | 0.0683 | 135.0 | 540 | 3.5611 | 11.8824 | 14.5625 |
189
+ | 0.0683 | 136.0 | 544 | 3.5629 | 11.8824 | 14.5625 |
190
+ | 0.0683 | 137.0 | 548 | 3.5666 | 11.8824 | 14.5625 |
191
+ | 0.0683 | 138.0 | 552 | 3.5715 | 11.8824 | 14.5625 |
192
+ | 0.0683 | 139.0 | 556 | 3.5762 | 11.8824 | 14.5625 |
193
+ | 0.0683 | 140.0 | 560 | 3.5789 | 11.8824 | 14.5625 |
194
+ | 0.0683 | 141.0 | 564 | 3.5807 | 11.8824 | 14.5625 |
195
+ | 0.0683 | 142.0 | 568 | 3.5858 | 11.8824 | 14.5625 |
196
+ | 0.0683 | 143.0 | 572 | 3.5902 | 11.8202 | 14.875 |
197
+ | 0.0683 | 144.0 | 576 | 3.5886 | 11.5499 | 14.875 |
198
+ | 0.0683 | 145.0 | 580 | 3.5877 | 11.5499 | 14.875 |
199
+ | 0.0683 | 146.0 | 584 | 3.5866 | 11.6107 | 14.875 |
200
+ | 0.0683 | 147.0 | 588 | 3.5875 | 11.6107 | 14.875 |
201
+ | 0.0683 | 148.0 | 592 | 3.5892 | 11.6107 | 14.875 |
202
+ | 0.0683 | 149.0 | 596 | 3.5951 | 11.6792 | 14.875 |
203
+ | 0.0683 | 150.0 | 600 | 3.6008 | 11.6792 | 14.875 |
204
+ | 0.0683 | 151.0 | 604 | 3.6067 | 11.6792 | 14.875 |
205
+ | 0.0683 | 152.0 | 608 | 3.5964 | 11.6107 | 14.875 |
206
+ | 0.0683 | 153.0 | 612 | 3.5930 | 11.6107 | 14.875 |
207
+ | 0.0683 | 154.0 | 616 | 3.5945 | 11.5499 | 15.125 |
208
+ | 0.0683 | 155.0 | 620 | 3.5948 | 11.5499 | 15.125 |
209
+ | 0.0683 | 156.0 | 624 | 3.5953 | 11.6107 | 14.875 |
210
+ | 0.0683 | 157.0 | 628 | 3.5990 | 11.6107 | 14.875 |
211
+ | 0.0683 | 158.0 | 632 | 3.6028 | 11.6107 | 14.875 |
212
+ | 0.0683 | 159.0 | 636 | 3.6059 | 11.6026 | 14.875 |
213
+ | 0.0683 | 160.0 | 640 | 3.6090 | 11.6026 | 14.875 |
214
+ | 0.0683 | 161.0 | 644 | 3.6104 | 11.6026 | 14.875 |
215
+ | 0.0683 | 162.0 | 648 | 3.6114 | 11.6026 | 14.875 |
216
+ | 0.0683 | 163.0 | 652 | 3.6129 | 11.6026 | 14.875 |
217
+ | 0.0683 | 164.0 | 656 | 3.6135 | 11.6026 | 14.875 |
218
+ | 0.0683 | 165.0 | 660 | 3.6145 | 11.6026 | 14.875 |
219
+ | 0.0683 | 166.0 | 664 | 3.6152 | 11.6026 | 14.875 |
220
+ | 0.0683 | 167.0 | 668 | 3.6175 | 11.6026 | 14.875 |
221
+ | 0.0683 | 168.0 | 672 | 3.6140 | 11.6026 | 14.875 |
222
+ | 0.0683 | 169.0 | 676 | 3.6140 | 11.6026 | 14.875 |
223
+ | 0.0683 | 170.0 | 680 | 3.6159 | 11.3715 | 14.875 |
224
+ | 0.0683 | 171.0 | 684 | 3.6162 | 11.3715 | 14.875 |
225
+ | 0.0683 | 172.0 | 688 | 3.6174 | 11.3715 | 14.875 |
226
+ | 0.0683 | 173.0 | 692 | 3.6192 | 11.3715 | 14.875 |
227
+ | 0.0683 | 174.0 | 696 | 3.6209 | 11.3715 | 14.875 |
228
+ | 0.0683 | 175.0 | 700 | 3.6219 | 11.3715 | 14.875 |
229
+ | 0.0683 | 176.0 | 704 | 3.6239 | 11.3715 | 14.875 |
230
+ | 0.0683 | 177.0 | 708 | 3.6266 | 11.3715 | 14.875 |
231
+ | 0.0683 | 178.0 | 712 | 3.6308 | 11.3715 | 14.875 |
232
+ | 0.0683 | 179.0 | 716 | 3.6316 | 11.3715 | 14.875 |
233
+ | 0.0683 | 180.0 | 720 | 3.6321 | 11.6026 | 14.875 |
234
+ | 0.0683 | 181.0 | 724 | 3.6322 | 11.6026 | 14.875 |
235
+ | 0.0683 | 182.0 | 728 | 3.6319 | 11.8757 | 14.875 |
236
+ | 0.0683 | 183.0 | 732 | 3.6319 | 11.6577 | 14.875 |
237
+ | 0.0683 | 184.0 | 736 | 3.6293 | 11.8757 | 14.875 |
238
+ | 0.0683 | 185.0 | 740 | 3.6229 | 11.8757 | 14.875 |
239
+ | 0.0683 | 186.0 | 744 | 3.6186 | 11.8757 | 14.875 |
240
+ | 0.0683 | 187.0 | 748 | 3.6166 | 11.8757 | 14.875 |
241
+ | 0.0683 | 188.0 | 752 | 3.6165 | 11.8757 | 14.875 |
242
+ | 0.0683 | 189.0 | 756 | 3.6193 | 11.8757 | 14.875 |
243
+ | 0.0683 | 190.0 | 760 | 3.6216 | 11.8757 | 14.875 |
244
+ | 0.0683 | 191.0 | 764 | 3.6239 | 11.8757 | 14.875 |
245
+ | 0.0683 | 192.0 | 768 | 3.6265 | 11.8757 | 14.875 |
246
+ | 0.0683 | 193.0 | 772 | 3.6284 | 11.8757 | 14.875 |
247
+ | 0.0683 | 194.0 | 776 | 3.6301 | 11.8684 | 14.8125 |
248
+ | 0.0683 | 195.0 | 780 | 3.6319 | 11.8684 | 14.8125 |
249
+ | 0.0683 | 196.0 | 784 | 3.6341 | 11.8684 | 14.8125 |
250
+ | 0.0683 | 197.0 | 788 | 3.6364 | 11.8684 | 14.8125 |
251
+ | 0.0683 | 198.0 | 792 | 3.6386 | 11.8684 | 14.8125 |
252
+ | 0.0683 | 199.0 | 796 | 3.6418 | 11.8757 | 14.8125 |
253
+ | 0.0683 | 200.0 | 800 | 3.6447 | 11.8757 | 14.8125 |
254
+ | 0.0683 | 201.0 | 804 | 3.6463 | 12.1401 | 14.8125 |
255
+ | 0.0683 | 202.0 | 808 | 3.6476 | 12.1401 | 14.8125 |
256
+ | 0.0683 | 203.0 | 812 | 3.6496 | 11.9402 | 14.5625 |
257
+ | 0.0683 | 204.0 | 816 | 3.6518 | 12.0061 | 14.1875 |
258
+ | 0.0683 | 205.0 | 820 | 3.6544 | 12.0061 | 14.1875 |
259
+ | 0.0683 | 206.0 | 824 | 3.6561 | 12.0061 | 14.1875 |
260
+ | 0.0683 | 207.0 | 828 | 3.6574 | 12.206 | 14.3125 |
261
+ | 0.0683 | 208.0 | 832 | 3.6588 | 12.1401 | 14.6875 |
262
+ | 0.0683 | 209.0 | 836 | 3.6603 | 12.1401 | 14.6875 |
263
+ | 0.0683 | 210.0 | 840 | 3.6612 | 12.1401 | 14.6875 |
264
+ | 0.0683 | 211.0 | 844 | 3.6620 | 12.1401 | 14.6875 |
265
+ | 0.0683 | 212.0 | 848 | 3.6628 | 12.1401 | 14.6875 |
266
+ | 0.0683 | 213.0 | 852 | 3.6628 | 12.1401 | 14.6875 |
267
+ | 0.0683 | 214.0 | 856 | 3.6633 | 11.8757 | 14.6875 |
268
+ | 0.0683 | 215.0 | 860 | 3.6648 | 11.8757 | 14.6875 |
269
+ | 0.0683 | 216.0 | 864 | 3.6665 | 11.8757 | 14.6875 |
270
+ | 0.0683 | 217.0 | 868 | 3.6678 | 11.8044 | 14.75 |
271
+ | 0.0683 | 218.0 | 872 | 3.6690 | 11.8044 | 14.75 |
272
+ | 0.0683 | 219.0 | 876 | 3.6699 | 11.8044 | 14.75 |
273
+ | 0.0683 | 220.0 | 880 | 3.6693 | 11.8044 | 14.75 |
274
+ | 0.0683 | 221.0 | 884 | 3.6689 | 11.8757 | 14.6875 |
275
+ | 0.0683 | 222.0 | 888 | 3.6687 | 11.8757 | 14.8125 |
276
+ | 0.0683 | 223.0 | 892 | 3.6687 | 11.8757 | 14.8125 |
277
+ | 0.0683 | 224.0 | 896 | 3.6690 | 11.8757 | 14.8125 |
278
+ | 0.0683 | 225.0 | 900 | 3.6662 | 11.8757 | 14.8125 |
279
+ | 0.0683 | 226.0 | 904 | 3.6609 | 11.8757 | 14.8125 |
280
+ | 0.0683 | 227.0 | 908 | 3.6561 | 11.8757 | 14.8125 |
281
+ | 0.0683 | 228.0 | 912 | 3.6536 | 11.8757 | 14.8125 |
282
+ | 0.0683 | 229.0 | 916 | 3.6522 | 11.8757 | 14.8125 |
283
+ | 0.0683 | 230.0 | 920 | 3.6515 | 11.8757 | 14.8125 |
284
+ | 0.0683 | 231.0 | 924 | 3.6526 | 11.8757 | 14.8125 |
285
+ | 0.0683 | 232.0 | 928 | 3.6532 | 11.8757 | 14.8125 |
286
+ | 0.0683 | 233.0 | 932 | 3.6537 | 11.8757 | 14.8125 |
287
+ | 0.0683 | 234.0 | 936 | 3.6536 | 11.8757 | 14.8125 |
288
+ | 0.0683 | 235.0 | 940 | 3.6540 | 11.8757 | 14.8125 |
289
+ | 0.0683 | 236.0 | 944 | 3.6540 | 11.8757 | 14.8125 |
290
+ | 0.0683 | 237.0 | 948 | 3.6540 | 11.8757 | 14.8125 |
291
+ | 0.0683 | 238.0 | 952 | 3.6545 | 11.8757 | 14.8125 |
292
+ | 0.0683 | 239.0 | 956 | 3.6553 | 11.8757 | 14.8125 |
293
+ | 0.0683 | 240.0 | 960 | 3.6557 | 11.8757 | 14.8125 |
294
+ | 0.0683 | 241.0 | 964 | 3.6563 | 11.8757 | 14.8125 |
295
+ | 0.0683 | 242.0 | 968 | 3.6573 | 11.8757 | 14.8125 |
296
+ | 0.0683 | 243.0 | 972 | 3.6579 | 11.8757 | 14.8125 |
297
+ | 0.0683 | 244.0 | 976 | 3.6583 | 11.8757 | 14.8125 |
298
+ | 0.0683 | 245.0 | 980 | 3.6594 | 11.8757 | 14.8125 |
299
+ | 0.0683 | 246.0 | 984 | 3.6599 | 11.8757 | 14.8125 |
300
+ | 0.0683 | 247.0 | 988 | 3.6606 | 11.8757 | 14.8125 |
301
+ | 0.0683 | 248.0 | 992 | 3.6513 | 11.8757 | 14.8125 |
302
+ | 0.0683 | 249.0 | 996 | 3.6454 | 11.8757 | 14.8125 |
303
+ | 0.0005 | 250.0 | 1000 | 3.6429 | 11.8757 | 14.8125 |
304
+ | 0.0005 | 251.0 | 1004 | 3.6415 | 11.8757 | 14.8125 |
305
+ | 0.0005 | 252.0 | 1008 | 3.6403 | 11.8757 | 14.8125 |
306
+ | 0.0005 | 253.0 | 1012 | 3.6400 | 11.8757 | 14.8125 |
307
+ | 0.0005 | 254.0 | 1016 | 3.6410 | 11.8757 | 14.8125 |
308
+ | 0.0005 | 255.0 | 1020 | 3.6418 | 11.8757 | 14.8125 |
309
+ | 0.0005 | 256.0 | 1024 | 3.6430 | 11.8044 | 14.75 |
310
+ | 0.0005 | 257.0 | 1028 | 3.6441 | 11.8044 | 14.75 |
311
+ | 0.0005 | 258.0 | 1032 | 3.6455 | 11.8044 | 14.75 |
312
+ | 0.0005 | 259.0 | 1036 | 3.6463 | 11.8044 | 14.75 |
313
+ | 0.0005 | 260.0 | 1040 | 3.6471 | 11.8044 | 14.75 |
314
+ | 0.0005 | 261.0 | 1044 | 3.6478 | 11.8044 | 14.75 |
315
+ | 0.0005 | 262.0 | 1048 | 3.6487 | 11.8044 | 14.75 |
316
+ | 0.0005 | 263.0 | 1052 | 3.6499 | 11.8044 | 14.75 |
317
+ | 0.0005 | 264.0 | 1056 | 3.6509 | 11.8044 | 14.75 |
318
+ | 0.0005 | 265.0 | 1060 | 3.6516 | 11.8044 | 14.75 |
319
+ | 0.0005 | 266.0 | 1064 | 3.6518 | 11.8044 | 14.75 |
320
+ | 0.0005 | 267.0 | 1068 | 3.6522 | 11.8044 | 14.75 |
321
+ | 0.0005 | 268.0 | 1072 | 3.6524 | 11.8044 | 14.75 |
322
+ | 0.0005 | 269.0 | 1076 | 3.6533 | 11.8044 | 14.75 |
323
+ | 0.0005 | 270.0 | 1080 | 3.6535 | 11.8044 | 14.75 |
324
+ | 0.0005 | 271.0 | 1084 | 3.6543 | 11.8044 | 14.75 |
325
+ | 0.0005 | 272.0 | 1088 | 3.6551 | 11.8044 | 14.75 |
326
+ | 0.0005 | 273.0 | 1092 | 3.6554 | 11.8044 | 14.75 |
327
+ | 0.0005 | 274.0 | 1096 | 3.6559 | 11.8044 | 14.75 |
328
+ | 0.0005 | 275.0 | 1100 | 3.6558 | 11.8044 | 14.75 |
329
+ | 0.0005 | 276.0 | 1104 | 3.6563 | 11.8044 | 14.75 |
330
+ | 0.0005 | 277.0 | 1108 | 3.6567 | 11.8044 | 14.75 |
331
+ | 0.0005 | 278.0 | 1112 | 3.6568 | 11.8044 | 14.75 |
332
+ | 0.0005 | 279.0 | 1116 | 3.6570 | 11.8044 | 14.75 |
333
+ | 0.0005 | 280.0 | 1120 | 3.6573 | 11.8044 | 14.75 |
334
+ | 0.0005 | 281.0 | 1124 | 3.6575 | 11.8044 | 14.75 |
335
+ | 0.0005 | 282.0 | 1128 | 3.6575 | 11.8044 | 14.75 |
336
+ | 0.0005 | 283.0 | 1132 | 3.6574 | 11.8044 | 14.75 |
337
+ | 0.0005 | 284.0 | 1136 | 3.6574 | 11.8044 | 14.75 |
338
+ | 0.0005 | 285.0 | 1140 | 3.6580 | 11.8044 | 14.75 |
339
+ | 0.0005 | 286.0 | 1144 | 3.6579 | 11.8044 | 14.75 |
340
+ | 0.0005 | 287.0 | 1148 | 3.6583 | 11.8044 | 14.75 |
341
+ | 0.0005 | 288.0 | 1152 | 3.6583 | 11.8044 | 14.75 |
342
+ | 0.0005 | 289.0 | 1156 | 3.6589 | 11.8044 | 14.75 |
343
+ | 0.0005 | 290.0 | 1160 | 3.6588 | 11.8044 | 14.75 |
344
+ | 0.0005 | 291.0 | 1164 | 3.6587 | 11.8044 | 14.75 |
345
+ | 0.0005 | 292.0 | 1168 | 3.6588 | 11.8044 | 14.75 |
346
+ | 0.0005 | 293.0 | 1172 | 3.6592 | 11.8044 | 14.75 |
347
+ | 0.0005 | 294.0 | 1176 | 3.6590 | 11.8044 | 14.75 |
348
+ | 0.0005 | 295.0 | 1180 | 3.6592 | 11.8044 | 14.75 |
349
+ | 0.0005 | 296.0 | 1184 | 3.6593 | 11.8044 | 14.75 |
350
+ | 0.0005 | 297.0 | 1188 | 3.6593 | 11.8044 | 14.75 |
351
+ | 0.0005 | 298.0 | 1192 | 3.6598 | 11.8044 | 14.75 |
352
+ | 0.0005 | 299.0 | 1196 | 3.6597 | 11.8044 | 14.75 |
353
+ | 0.0005 | 300.0 | 1200 | 3.6597 | 11.8044 | 14.75 |
354
 
355
 
356
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8437f56bcb80e6f0724144dce25880581deab2ea8109fa8f3b8b2300b46c4206
3
  size 242041896
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1d37d09f96de4fe47d53609d29fe3756a3133356c80a96d54d23ec7825b82e44
3
  size 242041896
runs/Jan06_17-30-38_9039a63683b7/events.out.tfevents.1704562239.9039a63683b7.4951.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5fc19c1b7407e0a2d4814d2c4c6b90e6b565f458ebe37bcef945a0883f5ff96f
3
+ size 116743
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ebb848166c57ab88b6c0d903f5cfb04adefeba6704bc6b91c200bb2c2532763c
3
  size 4792
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:68ad0a8143f67158af5d1a9f9ae5810a9694089669796406adfe86056b50c338
3
  size 4792