2025-01-16 01:11:54,549 - INFO - Total Parameters: 124,439,808
2025-01-16 01:11:54,549 - INFO - Trainable Parameters: 124,439,808
2025-01-16 01:12:01,389 - INFO - step 0, loss: 10.984674, best loss: 10.984674
2025-01-16 01:12:05,623 - INFO - step 1, loss: 9.399015, best loss: 9.399015
2025-01-16 01:12:09,738 - INFO - step 2, loss: 9.224066, best loss: 9.224066
2025-01-16 01:12:13,778 - INFO - step 3, loss: 8.766793, best loss: 8.766793
2025-01-16 01:12:13,929 - INFO - step 4, loss: 8.890295, best loss: 8.766793
2025-01-16 01:12:18,021 - INFO - step 5, loss: 8.354116, best loss: 8.354116
2025-01-16 01:12:21,378 - INFO - step 6, loss: 8.216693, best loss: 8.216693
2025-01-16 01:12:24,818 - INFO - step 7, loss: 8.000173, best loss: 8.000173
2025-01-16 01:12:28,132 - INFO - step 8, loss: 7.911955, best loss: 7.911955
2025-01-16 01:12:31,363 - INFO - step 9, loss: 7.701931, best loss: 7.701931
2025-01-16 01:12:35,178 - INFO - step 10, loss: 7.317376, best loss: 7.317376
2025-01-16 01:12:35,329 - INFO - step 11, loss: 7.622540, best loss: 7.317376
2025-01-16 01:12:41,149 - INFO - step 12, loss: 7.002343, best loss: 7.002343
2025-01-16 01:12:41,298 - INFO - step 13, loss: 7.173810, best loss: 7.002343
2025-01-16 01:12:47,386 - INFO - step 14, loss: 6.924718, best loss: 6.924718
2025-01-16 01:12:53,515 - INFO - step 15, loss: 6.831016, best loss: 6.831016
2025-01-16 01:12:53,678 - INFO - step 16, loss: 11.426704, best loss: 6.831016
2025-01-16 01:13:00,008 - INFO - step 17, loss: 6.602736, best loss: 6.602736
2025-01-16 01:13:06,726 - INFO - step 18, loss: 6.444612, best loss: 6.444612
2025-01-16 01:13:06,876 - INFO - step 19, loss: 6.573782, best loss: 6.444612
2025-01-16 01:13:12,791 - INFO - step 20, loss: 6.155459, best loss: 6.155459
2025-01-16 01:13:12,941 - INFO - step 21, loss: 6.713335, best loss: 6.155459
2025-01-16 01:13:40,202 - INFO - step 22, loss: 6.105377, best loss: 6.105377
2025-01-16 01:13:43,701 - INFO - step 23, loss: 5.863136, best loss: 5.863136
2025-01-16 01:13:43,851 - INFO - step 24, loss: 6.168331, best loss: 5.863136
2025-01-16 01:13:44,001 - INFO - step 25, loss: 6.184391, best loss: 5.863136
2025-01-16 01:13:44,151 - INFO - step 26, loss: 6.233185, best loss: 5.863136
2025-01-16 01:13:44,301 - INFO - step 27, loss: 6.035625, best loss: 5.863136
2025-01-16 01:13:44,451 - INFO - step 28, loss: 6.212886, best loss: 5.863136
2025-01-16 01:13:44,601 - INFO - step 29, loss: 6.159973, best loss: 5.863136
2025-01-16 01:13:44,750 - INFO - step 30, loss: 6.092636, best loss: 5.863136
2025-01-16 01:13:44,900 - INFO - step 31, loss: 6.076606, best loss: 5.863136
2025-01-16 01:13:45,051 - INFO - step 32, loss: 6.480474, best loss: 5.863136
2025-01-16 01:13:45,200 - INFO - step 33, loss: 6.406061, best loss: 5.863136
2025-01-16 01:13:45,350 - INFO - step 34, loss: 6.302889, best loss: 5.863136
2025-01-16 01:13:48,890 - INFO - step 35, loss: 5.833194, best loss: 5.833194
2025-01-16 01:13:54,782 - INFO - step 36, loss: 5.814800, best loss: 5.814800
2025-01-16 01:13:54,933 - INFO - step 37, loss: 6.271455, best loss: 5.814800
2025-01-16 01:13:58,092 - INFO - step 38, loss: 5.711025, best loss: 5.711025
2025-01-16 01:13:58,242 - INFO - step 39, loss: 6.367735, best loss: 5.711025
2025-01-16 01:13:58,392 - INFO - step 40, loss: 6.292561, best loss: 5.711025
2025-01-16 01:13:58,542 - INFO - step 41, loss: 6.402388, best loss: 5.711025
2025-01-16 01:13:58,692 - INFO - step 42, loss: 6.180318, best loss: 5.711025
2025-01-16 01:13:58,842 - INFO - step 43, loss: 6.264365, best loss: 5.711025
2025-01-16 01:13:58,992 - INFO - step 44, loss: 6.280950, best loss: 5.711025
2025-01-16 01:13:59,142 - INFO - step 45, loss: 6.189835, best loss: 5.711025
2025-01-16 01:13:59,291 - INFO - step 46, loss: 7.166827, best loss: 5.711025
2025-01-16 01:13:59,441 - INFO - step 47, loss: 6.857326, best loss: 5.711025
2025-01-16 01:13:59,591 - INFO - step 48, loss: 7.069935, best loss: 5.711025
2025-01-16 01:13:59,740 - INFO - step 49, loss: 6.912243, best loss: 5.711025
2025-01-16 01:13:59,890 - INFO - step 50, loss: 6.710613, best loss: 5.711025
2025-01-16 01:14:00,048 - INFO - step 51, loss: 6.703650, best loss: 5.711025
2025-01-16 01:14:00,198 - INFO - step 52, loss: 6.842998, best loss: 5.711025
2025-01-16 01:14:00,348 - INFO - step 53, loss: 6.597635, best loss: 5.711025
2025-01-16 01:14:00,498 - INFO - step 54, loss: 6.692338, best loss: 5.711025
2025-01-16 01:14:00,648 - INFO - step 55, loss: 6.560492, best loss: 5.711025
2025-01-16 01:14:00,798 - INFO - step 56, loss: 6.349058, best loss: 5.711025
2025-01-16 01:14:00,948 - INFO - step 57, loss: 6.638471, best loss: 5.711025
2025-01-16 01:14:01,098 - INFO - step 58, loss: 6.290427, best loss: 5.711025
2025-01-16 01:14:01,247 - INFO - step 59, loss: 6.137897, best loss: 5.711025
2025-01-16 01:14:01,397 - INFO - step 60, loss: 6.209739, best loss: 5.711025
2025-01-16 01:14:01,547 - INFO - step 61, loss: 6.320157, best loss: 5.711025
2025-01-16 01:14:01,704 - INFO - step 62, loss: 6.253428, best loss: 5.711025
2025-01-16 01:14:01,853 - INFO - step 63, loss: 6.369972, best loss: 5.711025
2025-01-16 01:14:02,003 - INFO - step 64, loss: 5.968691, best loss: 5.711025
2025-01-16 01:14:02,153 - INFO - step 65, loss: 6.440616, best loss: 5.711025
2025-01-16 01:14:02,303 - INFO - step 66, loss: 6.229952, best loss: 5.711025
2025-01-16 01:14:02,452 - INFO - step 67, loss: 5.767194, best loss: 5.711025
2025-01-16 01:14:02,602 - INFO - step 68, loss: 5.963345, best loss: 5.711025
2025-01-16 01:14:02,751 - INFO - step 69, loss: 6.038363, best loss: 5.711025
2025-01-16 01:14:02,901 - INFO - step 70, loss: 5.822574, best loss: 5.711025
2025-01-16 01:14:03,051 - INFO - step 71, loss: 5.818875, best loss: 5.711025
2025-01-16 01:14:03,218 - INFO - step 72, loss: 5.981959, best loss: 5.711025
2025-01-16 01:14:03,368 - INFO - step 73, loss: 6.197146, best loss: 5.711025
2025-01-16 01:14:03,517 - INFO - step 74, loss: 5.951148, best loss: 5.711025
2025-01-16 01:14:03,667 - INFO - step 75, loss: 6.121564, best loss: 5.711025
2025-01-16 01:14:03,817 - INFO - step 76, loss: 5.910117, best loss: 5.711025
2025-01-16 01:14:03,967 - INFO - step 77, loss: 6.061495, best loss: 5.711025
2025-01-16 01:14:04,116 - INFO - step 78, loss: 6.260435, best loss: 5.711025
2025-01-16 01:14:04,266 - INFO - step 79, loss: 6.151994, best loss: 5.711025
2025-01-16 01:14:04,416 - INFO - step 80, loss: 6.620976, best loss: 5.711025
2025-01-16 01:14:04,566 - INFO - step 81, loss: 6.320269, best loss: 5.711025
2025-01-16 01:14:04,716 - INFO - step 82, loss: 6.189922, best loss: 5.711025
2025-01-16 01:14:04,866 - INFO - step 83, loss: 6.181428, best loss: 5.711025
2025-01-16 01:14:05,016 - INFO - step 84, loss: 5.854453, best loss: 5.711025
2025-01-16 01:14:05,166 - INFO - step 85, loss: 5.996202, best loss: 5.711025
2025-01-16 01:14:05,316 - INFO - step 86, loss: 5.918901, best loss: 5.711025
2025-01-16 01:14:05,466 - INFO - step 87, loss: 5.821301, best loss: 5.711025
2025-01-16 01:14:05,616 - INFO - step 88, loss: 6.319996, best loss: 5.711025
2025-01-16 01:14:05,766 - INFO - step 89, loss: 6.374994, best loss: 5.711025
2025-01-16 01:14:05,916 - INFO - step 90, loss: 6.084029, best loss: 5.711025
2025-01-16 01:14:06,066 - INFO - step 91, loss: 6.172344, best loss: 5.711025
2025-01-16 01:14:06,216 - INFO - step 92, loss: 6.027640, best loss: 5.711025
2025-01-16 01:14:06,365 - INFO - step 93, loss: 6.396429, best loss: 5.711025
2025-01-16 01:14:06,515 - INFO - step 94, loss: 6.546192, best loss: 5.711025
2025-01-16 01:14:06,665 - INFO - step 95, loss: 6.443549, best loss: 5.711025
2025-01-16 01:14:06,815 - INFO - step 96, loss: 6.432400, best loss: 5.711025
2025-01-16 01:14:06,965 - INFO - step 97, loss: 6.305459, best loss: 5.711025
2025-01-16 01:14:07,115 - INFO - step 98, loss: 6.553704, best loss: 5.711025
2025-01-16 01:14:07,265 - INFO - step 99, loss: 6.307276, best loss: 5.711025
2025-01-16 01:14:07,415 - INFO - step 100, loss: 6.291646, best loss: 5.711025
2025-01-16 01:14:07,565 - INFO - step 101, loss: 6.362191, best loss: 5.711025
2025-01-16 01:14:07,714 - INFO - step 102, loss: 6.377411, best loss: 5.711025
2025-01-16 01:14:07,864 - INFO - step 103, loss: 6.086195, best loss: 5.711025
2025-01-16 01:14:08,014 - INFO - step 104, loss: 6.232041, best loss: 5.711025
2025-01-16 01:14:08,165 - INFO - step 105, loss: 6.468611, best loss: 5.711025
2025-01-16 01:14:08,315 - INFO - step 106, loss: 5.984085, best loss: 5.711025
2025-01-16 01:14:08,464 - INFO - step 107, loss: 6.173664, best loss: 5.711025
2025-01-16 01:14:08,614 - INFO - step 108, loss: 6.030838, best loss: 5.711025
2025-01-16 01:14:08,764 - INFO - step 109, loss: 6.089221, best loss: 5.711025
2025-01-16 01:14:08,914 - INFO - step 110, loss: 6.185270, best loss: 5.711025
2025-01-16 01:14:09,064 - INFO - step 111, loss: 6.375745, best loss: 5.711025
2025-01-16 01:14:09,213 - INFO - step 112, loss: 6.355259, best loss: 5.711025
2025-01-16 01:14:09,363 - INFO - step 113, loss: 6.021276, best loss: 5.711025
2025-01-16 01:14:09,513 - INFO - step 114, loss: 6.177767, best loss: 5.711025
2025-01-16 01:14:09,662 - INFO - step 115, loss: 5.866199, best loss: 5.711025
2025-01-16 01:14:09,812 - INFO - step 116, loss: 6.203371, best loss: 5.711025
2025-01-16 01:14:09,962 - INFO - step 117, loss: 5.873975, best loss: 5.711025
2025-01-16 01:14:10,112 - INFO - step 118, loss: 6.051735, best loss: 5.711025
2025-01-16 01:14:10,262 - INFO - step 119, loss: 5.757576, best loss: 5.711025
2025-01-16 01:14:10,412 - INFO - step 120, loss: 5.874238, best loss: 5.711025
2025-01-16 01:14:13,962 - INFO - step 121, loss: 5.647548, best loss: 5.647548
2025-01-16 01:14:14,112 - INFO - step 122, loss: 5.864003, best loss: 5.647548
2025-01-16 01:14:17,621 - INFO - step 123, loss: 5.589639, best loss: 5.589639
2025-01-16 01:14:20,846 - INFO - step 124, loss: 5.341327, best loss: 5.341327
2025-01-16 01:14:20,997 - INFO - step 125, loss: 5.508380, best loss: 5.341327
2025-01-16 01:14:21,147 - INFO - step 126, loss: 5.588662, best loss: 5.341327
2025-01-16 01:14:21,296 - INFO - step 127, loss: 6.226934, best loss: 5.341327
2025-01-16 01:14:21,446 - INFO - step 128, loss: 5.915997, best loss: 5.341327
2025-01-16 01:14:21,596 - INFO - step 129, loss: 6.289304, best loss: 5.341327
2025-01-16 01:14:21,746 - INFO - step 130, loss: 6.496845, best loss: 5.341327
2025-01-16 01:14:21,896 - INFO - step 131, loss: 6.110124, best loss: 5.341327
2025-01-16 01:14:22,046 - INFO - step 132, loss: 6.490894, best loss: 5.341327
2025-01-16 01:14:22,196 - INFO - step 133, loss: 6.385570, best loss: 5.341327
2025-01-16 01:14:22,346 - INFO - step 134, loss: 6.266291, best loss: 5.341327
2025-01-16 01:14:22,496 - INFO - step 135, loss: 6.267380, best loss: 5.341327
2025-01-16 01:14:22,646 - INFO - step 136, loss: 6.817994, best loss: 5.341327
2025-01-16 01:14:22,795 - INFO - step 137, loss: 6.060211, best loss: 5.341327
2025-01-16 01:14:22,945 - INFO - step 138, loss: 5.825859, best loss: 5.341327
2025-01-16 01:14:23,095 - INFO - step 139, loss: 6.225033, best loss: 5.341327
2025-01-16 01:14:23,245 - INFO - step 140, loss: 5.939146, best loss: 5.341327
2025-01-16 01:14:23,394 - INFO - step 141, loss: 5.732660, best loss: 5.341327
2025-01-16 01:14:23,544 - INFO - step 142, loss: 6.505282, best loss: 5.341327
2025-01-16 01:14:23,694 - INFO - step 143, loss: 6.403181, best loss: 5.341327
2025-01-16 01:14:23,844 - INFO - step 144, loss: 5.841729, best loss: 5.341327
2025-01-16 01:14:23,994 - INFO - step 145, loss: 5.770128, best loss: 5.341327
2025-01-16 01:14:24,144 - INFO - step 146, loss: 5.792890, best loss: 5.341327
2025-01-16 01:14:24,293 - INFO - step 147, loss: 5.999312, best loss: 5.341327
2025-01-16 01:14:24,443 - INFO - step 148, loss: 6.048155, best loss: 5.341327
2025-01-16 01:14:24,593 - INFO - step 149, loss: 5.836678, best loss: 5.341327
2025-01-16 01:14:24,743 - INFO - step 150, loss: 6.246448, best loss: 5.341327
2025-01-16 01:14:24,893 - INFO - step 151, loss: 5.921693, best loss: 5.341327
2025-01-16 01:14:25,043 - INFO - step 152, loss: 6.010100, best loss: 5.341327
2025-01-16 01:14:25,193 - INFO - step 153, loss: 5.775085, best loss: 5.341327
2025-01-16 01:14:25,342 - INFO - step 154, loss: 5.818711, best loss: 5.341327
2025-01-16 01:14:25,493 - INFO - step 155, loss: 5.625128, best loss: 5.341327
2025-01-16 01:14:25,643 - INFO - step 156, loss: 5.484381, best loss: 5.341327
2025-01-16 01:14:25,793 - INFO - step 157, loss: 5.767538, best loss: 5.341327
2025-01-16 01:14:25,942 - INFO - step 158, loss: 5.374424, best loss: 5.341327
2025-01-16 01:14:26,092 - INFO - step 159, loss: 5.782289, best loss: 5.341327
2025-01-16 01:14:26,242 - INFO - step 160, loss: 5.724983, best loss: 5.341327
2025-01-16 01:14:26,393 - INFO - step 161, loss: 5.693506, best loss: 5.341327
2025-01-16 01:14:26,542 - INFO - step 162, loss: 5.486349, best loss: 5.341327
2025-01-16 01:14:26,692 - INFO - step 163, loss: 6.171004, best loss: 5.341327
2025-01-16 01:14:26,842 - INFO - step 164, loss: 6.156045, best loss: 5.341327
2025-01-16 01:14:26,992 - INFO - step 165, loss: 5.921999, best loss: 5.341327
2025-01-16 01:14:27,142 - INFO - step 166, loss: 5.823175, best loss: 5.341327
2025-01-16 01:14:27,292 - INFO - step 167, loss: 5.570381, best loss: 5.341327
2025-01-16 01:14:27,441 - INFO - step 168, loss: 5.803085, best loss: 5.341327
2025-01-16 01:14:27,591 - INFO - step 169, loss: 5.828316, best loss: 5.341327
2025-01-16 01:14:27,741 - INFO - step 170, loss: 6.063859, best loss: 5.341327
2025-01-16 01:14:27,891 - INFO - step 171, loss: 5.963082, best loss: 5.341327
2025-01-16 01:14:28,041 - INFO - step 172, loss: 5.946830, best loss: 5.341327
2025-01-16 01:14:28,191 - INFO - step 173, loss: 5.691726, best loss: 5.341327
2025-01-16 01:14:28,341 - INFO - step 174, loss: 5.652229, best loss: 5.341327
2025-01-16 01:14:28,491 - INFO - step 175, loss: 6.194779, best loss: 5.341327
2025-01-16 01:14:28,641 - INFO - step 176, loss: 6.043158, best loss: 5.341327
2025-01-16 01:14:28,791 - INFO - step 177, loss: 5.964791, best loss: 5.341327
2025-01-16 01:14:28,940 - INFO - step 178, loss: 6.086370, best loss: 5.341327
2025-01-16 01:14:29,090 - INFO - step 179, loss: 5.961589, best loss: 5.341327
2025-01-16 01:14:29,240 - INFO - step 180, loss: 5.848120, best loss: 5.341327
2025-01-16 01:14:29,390 - INFO - step 181, loss: 5.477336, best loss: 5.341327
2025-01-16 01:14:29,540 - INFO - step 182, loss: 5.838650, best loss: 5.341327
2025-01-16 01:14:29,690 - INFO - step 183, loss: 6.005394, best loss: 5.341327
2025-01-16 01:14:29,840 - INFO - step 184, loss: 5.680823, best loss: 5.341327
2025-01-16 01:14:29,990 - INFO - step 185, loss: 5.825180, best loss: 5.341327
2025-01-16 01:14:30,141 - INFO - step 186, loss: 5.754776, best loss: 5.341327
2025-01-16 01:14:30,291 - INFO - step 187, loss: 5.367939, best loss: 5.341327
2025-01-16 01:14:33,828 - INFO - step 188, loss: 5.297245, best loss: 5.297245
2025-01-16 01:14:33,990 - INFO - step 189, loss: 5.662703, best loss: 5.297245
2025-01-16 01:14:34,143 - INFO - step 190, loss: 6.028159, best loss: 5.297245
2025-01-16 01:14:34,293 - INFO - step 191, loss: 5.893387, best loss: 5.297245
2025-01-16 01:14:34,442 - INFO - step 192, loss: 5.749903, best loss: 5.297245
2025-01-16 01:14:34,592 - INFO - step 193, loss: 5.468061, best loss: 5.297245
2025-01-16 01:14:34,742 - INFO - step 194, loss: 5.617097, best loss: 5.297245
2025-01-16 01:14:34,892 - INFO - step 195, loss: 5.633214, best loss: 5.297245
2025-01-16 01:14:35,042 - INFO - step 196, loss: 5.445663, best loss: 5.297245
2025-01-16 01:14:35,192 - INFO - step 197, loss: 5.639896, best loss: 5.297245
2025-01-16 01:14:35,342 - INFO - step 198, loss: 5.422250, best loss: 5.297245
2025-01-16 01:14:35,492 - INFO - step 199, loss: 5.413107, best loss: 5.297245
2025-01-16 01:14:35,641 - INFO - step 200, loss: 5.750402, best loss: 5.297245
2025-01-16 01:14:39,157 - INFO - step 201, loss: 5.228148, best loss: 5.228148
2025-01-16 01:14:39,307 - INFO - step 202, loss: 5.523131, best loss: 5.228148
2025-01-16 01:14:39,457 - INFO - step 203, loss: 5.846604, best loss: 5.228148
2025-01-16 01:14:39,607 - INFO - step 204, loss: 5.478476, best loss: 5.228148
2025-01-16 01:14:43,136 - INFO - step 205, loss: 5.056911, best loss: 5.056911
2025-01-16 01:14:43,286 - INFO - step 206, loss: 5.752746, best loss: 5.056911
2025-01-16 01:14:43,436 - INFO - step 207, loss: 6.014012, best loss: 5.056911
2025-01-16 01:14:43,586 - INFO - step 208, loss: 6.466284, best loss: 5.056911
2025-01-16 01:14:43,736 - INFO - step 209, loss: 6.239737, best loss: 5.056911
2025-01-16 01:14:43,885 - INFO - step 210, loss: 6.208727, best loss: 5.056911
2025-01-16 01:14:44,036 - INFO - step 211, loss: 6.310506, best loss: 5.056911
2025-01-16 01:14:44,186 - INFO - step 212, loss: 6.393189, best loss: 5.056911
2025-01-16 01:14:44,336 - INFO - step 213, loss: 6.020179, best loss: 5.056911
2025-01-16 01:14:44,486 - INFO - step 214, loss: 5.912087, best loss: 5.056911
2025-01-16 01:14:44,636 - INFO - step 215, loss: 6.055243, best loss: 5.056911
2025-01-16 01:14:44,786 - INFO - step 216, loss: 5.834484, best loss: 5.056911
2025-01-16 01:14:44,936 - INFO - step 217, loss: 6.079072, best loss: 5.056911
2025-01-16 01:14:45,086 - INFO - step 218, loss: 5.742022, best loss: 5.056911
2025-01-16 01:14:45,236 - INFO - step 219, loss: 5.715427, best loss: 5.056911
2025-01-16 01:14:45,386 - INFO - step 220, loss: 5.792460, best loss: 5.056911
2025-01-16 01:14:45,536 - INFO - step 221, loss: 5.916228, best loss: 5.056911
2025-01-16 01:14:45,685 - INFO - step 222, loss: 5.608758, best loss: 5.056911
2025-01-16 01:14:45,835 - INFO - step 223, loss: 6.051811, best loss: 5.056911
2025-01-16 01:14:45,985 - INFO - step 224, loss: 6.083393, best loss: 5.056911
2025-01-16 01:14:46,135 - INFO - step 225, loss: 6.319884, best loss: 5.056911
2025-01-16 01:14:46,285 - INFO - step 226, loss: 6.322484, best loss: 5.056911
2025-01-16 01:14:46,436 - INFO - step 227, loss: 6.449049, best loss: 5.056911
2025-01-16 01:14:46,586 - INFO - step 228, loss: 6.109381, best loss: 5.056911
2025-01-16 01:14:46,735 - INFO - step 229, loss: 6.098219, best loss: 5.056911
2025-01-16 01:14:46,885 - INFO - step 230, loss: 6.026084, best loss: 5.056911
2025-01-16 01:14:47,035 - INFO - step 231, loss: 6.405728, best loss: 5.056911
2025-01-16 01:14:47,185 - INFO - step 232, loss: 5.916165, best loss: 5.056911
2025-01-16 01:14:47,335 - INFO - step 233, loss: 5.677113, best loss: 5.056911
2025-01-16 01:14:47,485 - INFO - step 234, loss: 5.705913, best loss: 5.056911
2025-01-16 01:14:47,635 - INFO - step 235, loss: 5.637208, best loss: 5.056911
2025-01-16 01:14:47,785 - INFO - step 236, loss: 6.033350, best loss: 5.056911
2025-01-16 01:14:47,935 - INFO - step 237, loss: 5.671475, best loss: 5.056911
2025-01-16 01:14:48,085 - INFO - step 238, loss: 5.525915, best loss: 5.056911
2025-01-16 01:14:48,235 - INFO - step 239, loss: 5.942299, best loss: 5.056911
2025-01-16 01:14:48,385 - INFO - step 240, loss: 5.520181, best loss: 5.056911
2025-01-16 01:14:48,535 - INFO - step 241, loss: 5.717176, best loss: 5.056911
2025-01-16 01:14:48,685 - INFO - step 242, loss: 5.490720, best loss: 5.056911
2025-01-16 01:14:48,836 - INFO - step 243, loss: 6.384752, best loss: 5.056911
2025-01-16 01:14:48,985 - INFO - step 244, loss: 5.975294, best loss: 5.056911
2025-01-16 01:14:49,136 - INFO - step 245, loss: 5.449126, best loss: 5.056911
2025-01-16 01:14:49,286 - INFO - step 246, loss: 5.423995, best loss: 5.056911
2025-01-16 01:14:49,436 - INFO - step 247, loss: 6.024109, best loss: 5.056911
2025-01-16 01:14:49,586 - INFO - step 248, loss: 5.875146, best loss: 5.056911
2025-01-16 01:14:49,736 - INFO - step 249, loss: 6.165745, best loss: 5.056911
2025-01-16 01:14:49,886 - INFO - step 250, loss: 6.189351, best loss: 5.056911
2025-01-16 01:14:50,036 - INFO - step 251, loss: 6.137371, best loss: 5.056911
2025-01-16 01:14:50,186 - INFO - step 252, loss: 6.022383, best loss: 5.056911
2025-01-16 01:14:50,336 - INFO - step 253, loss: 5.775570, best loss: 5.056911
2025-01-16 01:14:50,486 - INFO - step 254, loss: 5.620009, best loss: 5.056911
2025-01-16 01:14:50,636 - INFO - step 255, loss: 5.850831, best loss: 5.056911
2025-01-16 01:14:50,785 - INFO - step 256, loss: 5.417816, best loss: 5.056911
2025-01-16 01:14:50,936 - INFO - step 257, loss: 5.319973, best loss: 5.056911
2025-01-16 01:14:51,086 - INFO - step 258, loss: 5.837712, best loss: 5.056911
2025-01-16 01:14:51,236 - INFO - step 259, loss: 5.583446, best loss: 5.056911
2025-01-16 01:14:51,386 - INFO - step 260, loss: 5.440937, best loss: 5.056911
2025-01-16 01:14:51,536 - INFO - step 261, loss: 5.602666, best loss: 5.056911
2025-01-16 01:14:51,686 - INFO - step 262, loss: 5.732263, best loss: 5.056911
2025-01-16 01:14:51,836 - INFO - step 263, loss: 5.773939, best loss: 5.056911
2025-01-16 01:14:51,986 - INFO - step 264, loss: 5.844690, best loss: 5.056911
2025-01-16 01:14:52,136 - INFO - step 265, loss: 5.970802, best loss: 5.056911
2025-01-16 01:14:52,286 - INFO - step 266, loss: 5.444706, best loss: 5.056911
2025-01-16 01:14:52,437 - INFO - step 267, loss: 5.443188, best loss: 5.056911
2025-01-16 01:14:52,587 - INFO - step 268, loss: 5.681134, best loss: 5.056911
2025-01-16 01:14:52,736 - INFO - step 269, loss: 5.836062, best loss: 5.056911
2025-01-16 01:14:52,886 - INFO - step 270, loss: 5.383400, best loss: 5.056911
2025-01-16 01:14:53,036 - INFO - step 271, loss: 5.183482, best loss: 5.056911
2025-01-16 01:14:53,186 - INFO - step 272, loss: 5.357909, best loss: 5.056911
2025-01-16 01:14:53,336 - INFO - step 273, loss: 5.799757, best loss: 5.056911
2025-01-16 01:14:53,486 - INFO - step 274, loss: 5.138873, best loss: 5.056911
2025-01-16 01:14:53,636 - INFO - step 275, loss: 5.455534, best loss: 5.056911
2025-01-16 01:14:53,786 - INFO - step 276, loss: 5.395452, best loss: 5.056911
2025-01-16 01:14:53,935 - INFO - step 277, loss: 5.071115, best loss: 5.056911
2025-01-16 01:14:54,086 - INFO - step 278, loss: 5.184526, best loss: 5.056911
2025-01-16 01:14:54,236 - INFO - step 279, loss: 5.400198, best loss: 5.056911
2025-01-16 01:14:57,591 - INFO - step 280, loss: 4.905932, best loss: 4.905932
2025-01-16 01:14:57,741 - INFO - step 281, loss: 5.301545, best loss: 4.905932
2025-01-16 01:15:01,380 - INFO - step 282, loss: 4.852422, best loss: 4.852422
2025-01-16 01:15:01,530 - INFO - step 283, loss: 5.551692, best loss: 4.852422
2025-01-16 01:15:01,681 - INFO - step 284, loss: 5.876320, best loss: 4.852422
2025-01-16 01:15:01,831 - INFO - step 285, loss: 6.540989, best loss: 4.852422
2025-01-16 01:15:01,980 - INFO - step 286, loss: 5.590770, best loss: 4.852422
2025-01-16 01:15:02,130 - INFO - step 287, loss: 6.153898, best loss: 4.852422
2025-01-16 01:15:02,280 - INFO - step 288, loss: 6.074806, best loss: 4.852422
2025-01-16 01:15:02,430 - INFO - step 289, loss: 5.679489, best loss: 4.852422
2025-01-16 01:15:02,580 - INFO - step 290, loss: 5.729318, best loss: 4.852422
2025-01-16 01:15:02,730 - INFO - step 291, loss: 6.075615, best loss: 4.852422
2025-01-16 01:15:02,880 - INFO - step 292, loss: 5.832588, best loss: 4.852422
2025-01-16 01:15:03,030 - INFO - step 293, loss: 5.362818, best loss: 4.852422
2025-01-16 01:15:03,180 - INFO - step 294, loss: 5.693813, best loss: 4.852422
2025-01-16 01:15:03,330 - INFO - step 295, loss: 5.574802, best loss: 4.852422
2025-01-16 01:15:03,480 - INFO - step 296, loss: 5.742521, best loss: 4.852422
2025-01-16 01:15:03,630 - INFO - step 297, loss: 5.200696, best loss: 4.852422
2025-01-16 01:15:03,780 - INFO - step 298, loss: 5.529074, best loss: 4.852422
2025-01-16 01:15:03,930 - INFO - step 299, loss: 5.778002, best loss: 4.852422
2025-01-16 01:15:04,080 - INFO - step 300, loss: 5.875923, best loss: 4.852422
2025-01-16 01:15:04,230 - INFO - step 301, loss: 5.571141, best loss: 4.852422
2025-01-16 01:15:04,380 - INFO - step 302, loss: 5.614904, best loss: 4.852422
2025-01-16 01:15:04,530 - INFO - step 303, loss: 5.489616, best loss: 4.852422
2025-01-16 01:15:04,680 - INFO - step 304, loss: 5.130790, best loss: 4.852422
2025-01-16 01:15:04,830 - INFO - step 305, loss: 5.717614, best loss: 4.852422
2025-01-16 01:15:04,980 - INFO - step 306, loss: 5.436563, best loss: 4.852422
2025-01-16 01:15:05,130 - INFO - step 307, loss: 5.314315, best loss: 4.852422
2025-01-16 01:15:05,279 - INFO - step 308, loss: 5.069864, best loss: 4.852422
2025-01-16 01:15:05,429 - INFO - step 309, loss: 5.017853, best loss: 4.852422
2025-01-16 01:15:05,579 - INFO - step 310, loss: 5.148254, best loss: 4.852422
2025-01-16 01:15:05,729 - INFO - step 311, loss: 5.041708, best loss: 4.852422
2025-01-16 01:15:05,879 - INFO - step 312, loss: 5.008966, best loss: 4.852422
2025-01-16 01:15:06,029 - INFO - step 313, loss: 5.043231, best loss: 4.852422
2025-01-16 01:15:06,179 - INFO - step 314, loss: 5.030823, best loss: 4.852422
2025-01-16 01:15:06,329 - INFO - step 315, loss: 4.961367, best loss: 4.852422
2025-01-16 01:15:09,834 - INFO - step 316, loss: 4.692222, best loss: 4.692222
2025-01-16 01:15:13,356 - INFO - step 317, loss: 4.428338, best loss: 4.428338
2025-01-16 01:15:13,506 - INFO - step 318, loss: 5.235705, best loss: 4.428338
2025-01-16 01:15:13,656 - INFO - step 319, loss: 5.774701, best loss: 4.428338
2025-01-16 01:15:13,805 - INFO - step 320, loss: 6.072133, best loss: 4.428338
2025-01-16 01:15:13,955 - INFO - step 321, loss: 6.300399, best loss: 4.428338
2025-01-16 01:15:14,105 - INFO - step 322, loss: 6.301980, best loss: 4.428338
2025-01-16 01:15:14,255 - INFO - step 323, loss: 6.022117, best loss: 4.428338
2025-01-16 01:15:14,405 - INFO - step 324, loss: 6.194532, best loss: 4.428338
2025-01-16 01:15:14,554 - INFO - step 325, loss: 6.093456, best loss: 4.428338
2025-01-16 01:15:14,704 - INFO - step 326, loss: 5.617487, best loss: 4.428338
2025-01-16 01:15:14,854 - INFO - step 327, loss: 5.627713, best loss: 4.428338
2025-01-16 01:15:15,003 - INFO - step 328, loss: 5.689122, best loss: 4.428338
2025-01-16 01:15:15,153 - INFO - step 329, loss: 5.576690, best loss: 4.428338
2025-01-16 01:15:15,303 - INFO - step 330, loss: 6.038289, best loss: 4.428338
2025-01-16 01:15:15,453 - INFO - step 331, loss: 5.719393, best loss: 4.428338
2025-01-16 01:15:15,603 - INFO - step 332, loss: 6.023388, best loss: 4.428338
2025-01-16 01:15:15,752 - INFO - step 333, loss: 5.833945, best loss: 4.428338
2025-01-16 01:15:15,902 - INFO - step 334, loss: 5.990437, best loss: 4.428338
2025-01-16 01:15:16,052 - INFO - step 335, loss: 5.601676, best loss: 4.428338
2025-01-16 01:15:16,202 - INFO - step 336, loss: 5.689137, best loss: 4.428338
2025-01-16 01:15:16,351 - INFO - step 337, loss: 5.607321, best loss: 4.428338
2025-01-16 01:15:16,501 - INFO - step 338, loss: 5.721552, best loss: 4.428338
2025-01-16 01:15:16,651 - INFO - step 339, loss: 5.562581, best loss: 4.428338
2025-01-16 01:15:16,801 - INFO - step 340, loss: 5.454736, best loss: 4.428338
2025-01-16 01:15:16,951 - INFO - step 341, loss: 6.050483, best loss: 4.428338
2025-01-16 01:15:17,101 - INFO - step 342, loss: 5.429643, best loss: 4.428338
2025-01-16 01:15:17,250 - INFO - step 343, loss: 5.685795, best loss: 4.428338
2025-01-16 01:15:17,400 - INFO - step 344, loss: 5.600162, best loss: 4.428338
2025-01-16 01:15:17,550 - INFO - step 345, loss: 5.625694, best loss: 4.428338
2025-01-16 01:15:17,700 - INFO - step 346, loss: 5.512758, best loss: 4.428338
2025-01-16 01:15:17,849 - INFO - step 347, loss: 5.366059, best loss: 4.428338
2025-01-16 01:15:17,999 - INFO - step 348, loss: 5.224154, best loss: 4.428338
2025-01-16 01:15:18,149 - INFO - step 349, loss: 5.329496, best loss: 4.428338
2025-01-16 01:15:18,299 - INFO - step 350, loss: 4.857948, best loss: 4.428338
2025-01-16 01:15:18,449 - INFO - step 351, loss: 5.579833, best loss: 4.428338
2025-01-16 01:15:18,598 - INFO - step 352, loss: 4.857267, best loss: 4.428338
2025-01-16 01:15:18,748 - INFO - step 353, loss: 4.707117, best loss: 4.428338
2025-01-16 01:15:18,898 - INFO - step 354, loss: 5.146955, best loss: 4.428338
2025-01-16 01:15:19,048 - INFO - step 355, loss: 5.127914, best loss: 4.428338
2025-01-16 01:15:19,198 - INFO - step 356, loss: 5.189980, best loss: 4.428338
2025-01-16 01:15:19,347 - INFO - step 357, loss: 4.868447, best loss: 4.428338
2025-01-16 01:15:19,497 - INFO - step 358, loss: 5.099505, best loss: 4.428338
2025-01-16 01:15:19,647 - INFO - step 359, loss: 5.135512, best loss: 4.428338
2025-01-16 01:15:19,797 - INFO - step 360, loss: 5.211339, best loss: 4.428338
2025-01-16 01:15:19,947 - INFO - step 361, loss: 5.060120, best loss: 4.428338
2025-01-16 01:15:20,097 - INFO - step 362, loss: 5.459232, best loss: 4.428338
2025-01-16 01:15:20,246 - INFO - step 363, loss: 5.538853, best loss: 4.428338
2025-01-16 01:15:20,396 - INFO - step 364, loss: 5.371754, best loss: 4.428338
2025-01-16 01:15:20,546 - INFO - step 365, loss: 4.930281, best loss: 4.428338
2025-01-16 01:15:20,695 - INFO - step 366, loss: 4.963799, best loss: 4.428338
2025-01-16 01:15:20,845 - INFO - step 367, loss: 5.364643, best loss: 4.428338
2025-01-16 01:15:20,995 - INFO - step 368, loss: 4.817084, best loss: 4.428338
2025-01-16 01:15:21,145 - INFO - step 369, loss: 5.572270, best loss: 4.428338
2025-01-16 01:15:21,295 - INFO - step 370, loss: 5.423930, best loss: 4.428338
2025-01-16 01:15:21,444 - INFO - step 371, loss: 5.441885, best loss: 4.428338
2025-01-16 01:15:21,594 - INFO - step 372, loss: 5.347143, best loss: 4.428338
2025-01-16 01:15:21,744 - INFO - step 373, loss: 5.373116, best loss: 4.428338
2025-01-16 01:15:21,894 - INFO - step 374, loss: 5.365183, best loss: 4.428338
2025-01-16 01:15:22,043 - INFO - step 375, loss: 5.229572, best loss: 4.428338
2025-01-16 01:15:22,193 - INFO - step 376, loss: 6.067573, best loss: 4.428338
2025-01-16 01:15:22,343 - INFO - step 377, loss: 5.733449, best loss: 4.428338
2025-01-16 01:15:22,493 - INFO - step 378, loss: 6.029624, best loss: 4.428338
2025-01-16 01:15:22,642 - INFO - step 379, loss: 5.618134, best loss: 4.428338
2025-01-16 01:15:22,792 - INFO - step 380, loss: 5.512797, best loss: 4.428338
2025-01-16 01:15:22,941 - INFO - step 381, loss: 5.591210, best loss: 4.428338
2025-01-16 01:15:23,091 - INFO - step 382, loss: 5.619900, best loss: 4.428338
2025-01-16 01:15:23,241 - INFO - step 383, loss: 5.347808, best loss: 4.428338
2025-01-16 01:15:23,391 - INFO - step 384, loss: 5.504454, best loss: 4.428338
2025-01-16 01:15:23,541 - INFO - step 385, loss: 5.334104, best loss: 4.428338
2025-01-16 01:15:23,690 - INFO - step 386, loss: 5.312442, best loss: 4.428338
2025-01-16 01:15:23,840 - INFO - step 387, loss: 5.709073, best loss: 4.428338
2025-01-16 01:15:23,990 - INFO - step 388, loss: 5.354908, best loss: 4.428338
2025-01-16 01:15:24,140 - INFO - step 389, loss: 5.159586, best loss: 4.428338
2025-01-16 01:15:24,290 - INFO - step 390, loss: 5.236647, best loss: 4.428338
2025-01-16 01:15:24,439 - INFO - step 391, loss: 5.318797, best loss: 4.428338
2025-01-16 01:15:24,589 - INFO - step 392, loss: 5.269147, best loss: 4.428338
2025-01-16 01:15:24,739 - INFO - step 393, loss: 5.369607, best loss: 4.428338
2025-01-16 01:15:24,888 - INFO - step 394, loss: 5.075601, best loss: 4.428338
2025-01-16 01:15:25,038 - INFO - step 395, loss: 5.150060, best loss: 4.428338
2025-01-16 01:15:25,188 - INFO - step 396, loss: 5.282524, best loss: 4.428338
2025-01-16 01:15:25,338 - INFO - step 397, loss: 4.633216, best loss: 4.428338
2025-01-16 01:15:25,487 - INFO - step 398, loss: 5.012975, best loss: 4.428338
2025-01-16 01:15:25,637 - INFO - step 399, loss: 5.204732, best loss: 4.428338
2025-01-16 01:15:25,787 - INFO - step 400, loss: 4.996494, best loss: 4.428338
2025-01-16 01:15:25,937 - INFO - step 401, loss: 4.958020, best loss: 4.428338
2025-01-16 01:15:26,087 - INFO - step 402, loss: 5.186386, best loss: 4.428338
2025-01-16 01:15:26,236 - INFO - step 403, loss: 5.438526, best loss: 4.428338
2025-01-16 01:15:26,386 - INFO - step 404, loss: 5.232860, best loss: 4.428338
2025-01-16 01:15:26,536 - INFO - step 405, loss: 5.420398, best loss: 4.428338
2025-01-16 01:15:26,686 - INFO - step 406, loss: 5.059898, best loss: 4.428338
2025-01-16 01:15:26,836 - INFO - step 407, loss: 5.230422, best loss: 4.428338
2025-01-16 01:15:26,985 - INFO - step 408, loss: 5.186315, best loss: 4.428338
2025-01-16 01:15:27,135 - INFO - step 409, loss: 5.009678, best loss: 4.428338
2025-01-16 01:15:27,285 - INFO - step 410, loss: 5.708469, best loss: 4.428338
2025-01-16 01:15:27,434 - INFO - step 411, loss: 5.526350, best loss: 4.428338
2025-01-16 01:15:27,584 - INFO - step 412, loss: 5.223800, best loss: 4.428338
2025-01-16 01:15:27,734 - INFO - step 413, loss: 5.175985, best loss: 4.428338
2025-01-16 01:15:27,884 - INFO - step 414, loss: 4.851195, best loss: 4.428338
2025-01-16 01:15:28,034 - INFO - step 415, loss: 5.006116, best loss: 4.428338
2025-01-16 01:15:28,184 - INFO - step 416, loss: 4.962808, best loss: 4.428338
2025-01-16 01:15:28,333 - INFO - step 417, loss: 4.930802, best loss: 4.428338
2025-01-16 01:15:28,483 - INFO - step 418, loss: 5.575535, best loss: 4.428338
2025-01-16 01:15:28,633 - INFO - step 419, loss: 5.544423, best loss: 4.428338
2025-01-16 01:15:28,782 - INFO - step 420, loss: 5.289223, best loss: 4.428338
2025-01-16 01:15:28,932 - INFO - step 421, loss: 5.462655, best loss: 4.428338
2025-01-16 01:15:29,082 - INFO - step 422, loss: 5.217792, best loss: 4.428338
2025-01-16 01:15:29,231 - INFO - step 423, loss: 5.656689, best loss: 4.428338
2025-01-16 01:15:29,381 - INFO - step 424, loss: 5.810602, best loss: 4.428338
2025-01-16 01:15:29,531 - INFO - step 425, loss: 5.707852, best loss: 4.428338
2025-01-16 01:15:29,681 - INFO - step 426, loss: 5.782586, best loss: 4.428338
2025-01-16 01:15:29,831 - INFO - step 427, loss: 5.570039, best loss: 4.428338
2025-01-16 01:15:29,980 - INFO - step 428, loss: 5.830788, best loss: 4.428338
2025-01-16 01:15:30,130 - INFO - step 429, loss: 5.573044, best loss: 4.428338
2025-01-16 01:15:30,280 - INFO - step 430, loss: 5.571815, best loss: 4.428338
2025-01-16 01:15:30,430 - INFO - step 431, loss: 5.723540, best loss: 4.428338
2025-01-16 01:15:30,580 - INFO - step 432, loss: 5.739876, best loss: 4.428338
2025-01-16 01:15:30,730 - INFO - step 433, loss: 5.440271, best loss: 4.428338
2025-01-16 01:15:30,879 - INFO - step 434, loss: 5.485482, best loss: 4.428338
2025-01-16 01:15:31,029 - INFO - step 435, loss: 5.769653, best loss: 4.428338
2025-01-16 01:15:31,179 - INFO - step 436, loss: 5.401524, best loss: 4.428338
2025-01-16 01:15:31,329 - INFO - step 437, loss: 5.455580, best loss: 4.428338
2025-01-16 01:15:31,479 - INFO - step 438, loss: 5.265343, best loss: 4.428338
2025-01-16 01:15:31,629 - INFO - step 439, loss: 5.459846, best loss: 4.428338
2025-01-16 01:15:31,779 - INFO - step 440, loss: 5.491034, best loss: 4.428338
2025-01-16 01:15:31,929 - INFO - step 441, loss: 5.754066, best loss: 4.428338
2025-01-16 01:15:32,078 - INFO - step 442, loss: 5.662181, best loss: 4.428338
2025-01-16 01:15:32,228 - INFO - step 443, loss: 5.279775, best loss: 4.428338
2025-01-16 01:15:32,378 - INFO - step 444, loss: 5.567875, best loss: 4.428338
2025-01-16 01:15:32,528 - INFO - step 445, loss: 5.199014, best loss: 4.428338
2025-01-16 01:15:32,678 - INFO - step 446, loss: 5.628713, best loss: 4.428338
2025-01-16 01:15:32,828 - INFO - step 447, loss: 5.196372, best loss: 4.428338
2025-01-16 01:15:32,978 - INFO - step 448, loss: 5.379093, best loss: 4.428338
2025-01-16 01:15:33,128 - INFO - step 449, loss: 5.175889, best loss: 4.428338
2025-01-16 01:15:33,278 - INFO - step 450, loss: 5.249819, best loss: 4.428338
2025-01-16 01:15:33,427 - INFO - step 451, loss: 5.112347, best loss: 4.428338
2025-01-16 01:15:33,577 - INFO - step 452, loss: 5.341799, best loss: 4.428338
2025-01-16 01:15:33,727 - INFO - step 453, loss: 5.007740, best loss: 4.428338
2025-01-16 01:15:33,877 - INFO - step 454, loss: 4.708673, best loss: 4.428338
2025-01-16 01:15:34,027 - INFO - step 455, loss: 4.846386, best loss: 4.428338
2025-01-16 01:15:34,177 - INFO - step 456, loss: 5.037615, best loss: 4.428338
2025-01-16 01:15:34,327 - INFO - step 457, loss: 5.678431, best loss: 4.428338
2025-01-16 01:15:34,476 - INFO - step 458, loss: 5.270485, best loss: 4.428338
2025-01-16 01:15:34,626 - INFO - step 459, loss: 5.605327, best loss: 4.428338
2025-01-16 01:15:34,776 - INFO - step 460, loss: 5.736541, best loss: 4.428338
2025-01-16 01:15:34,925 - INFO - step 461, loss: 5.294149, best loss: 4.428338
2025-01-16 01:15:35,075 - INFO - step 462, loss: 5.803459, best loss: 4.428338
2025-01-16 01:15:35,224 - INFO - step 463, loss: 5.503345, best loss: 4.428338
2025-01-16 01:15:35,374 - INFO - step 464, loss: 5.571827, best loss: 4.428338
2025-01-16 01:15:35,524 - INFO - step 465, loss: 5.398186, best loss: 4.428338
2025-01-16 01:15:35,674 - INFO - step 466, loss: 6.069377, best loss: 4.428338
2025-01-16 01:15:35,824 - INFO - step 467, loss: 5.334764, best loss: 4.428338
2025-01-16 01:15:35,974 - INFO - step 468, loss: 5.147666, best loss: 4.428338
2025-01-16 01:15:36,123 - INFO - step 469, loss: 5.529024, best loss: 4.428338
2025-01-16 01:15:36,273 - INFO - step 470, loss: 5.242214, best loss: 4.428338
2025-01-16 01:15:36,423 - INFO - step 471, loss: 5.013799, best loss: 4.428338
2025-01-16 01:15:36,572 - INFO - step 472, loss: 5.866437, best loss: 4.428338
2025-01-16 01:15:36,722 - INFO - step 473, loss: 5.539916, best loss: 4.428338
2025-01-16 01:15:36,871 - INFO - step 474, loss: 4.960258, best loss: 4.428338
2025-01-16 01:15:37,021 - INFO - step 475, loss: 5.093252, best loss: 4.428338
2025-01-16 01:15:37,171 - INFO - step 476, loss: 5.267919, best loss: 4.428338
2025-01-16 01:15:37,321 - INFO - step 477, loss: 5.328834, best loss: 4.428338
2025-01-16 01:15:37,470 - INFO - step 478, loss: 5.316064, best loss: 4.428338
2025-01-16 01:15:37,620 - INFO - step 479, loss:
5.129792, best loss: 4.428338 2025-01-16 01:15:37,770 - INFO - step 480, loss: 5.694792, best loss: 4.428338 2025-01-16 01:15:37,919 - INFO - step 481, loss: 5.370588, best loss: 4.428338 2025-01-16 01:15:38,069 - INFO - step 482, loss: 5.384638, best loss: 4.428338 2025-01-16 01:15:38,219 - INFO - step 483, loss: 5.128279, best loss: 4.428338 2025-01-16 01:15:38,369 - INFO - step 484, loss: 5.270810, best loss: 4.428338 2025-01-16 01:15:38,519 - INFO - step 485, loss: 5.084486, best loss: 4.428338 2025-01-16 01:15:38,668 - INFO - step 486, loss: 4.867311, best loss: 4.428338 2025-01-16 01:15:38,818 - INFO - step 487, loss: 5.215781, best loss: 4.428338 2025-01-16 01:15:38,968 - INFO - step 488, loss: 4.793370, best loss: 4.428338 2025-01-16 01:15:39,117 - INFO - step 489, loss: 5.219522, best loss: 4.428338 2025-01-16 01:15:39,267 - INFO - step 490, loss: 5.129004, best loss: 4.428338 2025-01-16 01:15:39,418 - INFO - step 491, loss: 5.154034, best loss: 4.428338 2025-01-16 01:15:39,568 - INFO - step 492, loss: 4.996566, best loss: 4.428338 2025-01-16 01:15:39,717 - INFO - step 493, loss: 5.572796, best loss: 4.428338 2025-01-16 01:15:39,867 - INFO - step 494, loss: 5.648782, best loss: 4.428338 2025-01-16 01:15:40,017 - INFO - step 495, loss: 5.376532, best loss: 4.428338 2025-01-16 01:15:40,167 - INFO - step 496, loss: 5.372405, best loss: 4.428338 2025-01-16 01:15:40,316 - INFO - step 497, loss: 5.022101, best loss: 4.428338 2025-01-16 01:15:40,466 - INFO - step 498, loss: 5.260891, best loss: 4.428338 2025-01-16 01:15:40,616 - INFO - step 499, loss: 5.291424, best loss: 4.428338 2025-01-16 01:15:40,766 - INFO - step 500, loss: 5.384350, best loss: 4.428338 2025-01-16 01:15:40,915 - INFO - step 501, loss: 5.234388, best loss: 4.428338 2025-01-16 01:15:41,065 - INFO - step 502, loss: 5.267787, best loss: 4.428338 2025-01-16 01:15:41,215 - INFO - step 503, loss: 5.137744, best loss: 4.428338 2025-01-16 01:15:41,365 - INFO - step 504, loss: 5.198113, best loss: 
4.428338 2025-01-16 01:15:41,514 - INFO - step 505, loss: 5.633555, best loss: 4.428338 2025-01-16 01:15:41,664 - INFO - step 506, loss: 5.533216, best loss: 4.428338 2025-01-16 01:15:41,814 - INFO - step 507, loss: 5.443839, best loss: 4.428338 2025-01-16 01:15:41,964 - INFO - step 508, loss: 5.539847, best loss: 4.428338 2025-01-16 01:15:42,113 - INFO - step 509, loss: 5.418295, best loss: 4.428338 2025-01-16 01:15:42,263 - INFO - step 510, loss: 5.327545, best loss: 4.428338 2025-01-16 01:15:42,413 - INFO - step 511, loss: 4.913204, best loss: 4.428338 2025-01-16 01:15:42,562 - INFO - step 512, loss: 5.339007, best loss: 4.428338 2025-01-16 01:15:42,712 - INFO - step 513, loss: 5.577472, best loss: 4.428338 2025-01-16 01:15:42,862 - INFO - step 514, loss: 5.182582, best loss: 4.428338 2025-01-16 01:15:43,012 - INFO - step 515, loss: 5.351979, best loss: 4.428338 2025-01-16 01:15:43,162 - INFO - step 516, loss: 5.341078, best loss: 4.428338 2025-01-16 01:15:43,312 - INFO - step 517, loss: 4.853027, best loss: 4.428338 2025-01-16 01:15:43,462 - INFO - step 518, loss: 4.689891, best loss: 4.428338 2025-01-16 01:15:43,612 - INFO - step 519, loss: 5.173126, best loss: 4.428338 2025-01-16 01:15:43,762 - INFO - step 520, loss: 5.466821, best loss: 4.428338 2025-01-16 01:15:43,911 - INFO - step 521, loss: 5.435303, best loss: 4.428338 2025-01-16 01:15:44,061 - INFO - step 522, loss: 5.274607, best loss: 4.428338 2025-01-16 01:15:44,211 - INFO - step 523, loss: 4.968715, best loss: 4.428338 2025-01-16 01:15:44,360 - INFO - step 524, loss: 5.175912, best loss: 4.428338 2025-01-16 01:15:44,510 - INFO - step 525, loss: 5.207887, best loss: 4.428338 2025-01-16 01:15:44,660 - INFO - step 526, loss: 4.979901, best loss: 4.428338 2025-01-16 01:15:44,810 - INFO - step 527, loss: 5.141584, best loss: 4.428338 2025-01-16 01:15:44,960 - INFO - step 528, loss: 5.035440, best loss: 4.428338 2025-01-16 01:15:45,110 - INFO - step 529, loss: 4.944641, best loss: 4.428338 2025-01-16 
01:15:45,259 - INFO - step 530, loss: 5.320227, best loss: 4.428338 2025-01-16 01:15:45,409 - INFO - step 531, loss: 4.756113, best loss: 4.428338 2025-01-16 01:15:45,559 - INFO - step 532, loss: 5.069548, best loss: 4.428338 2025-01-16 01:15:45,709 - INFO - step 533, loss: 5.429130, best loss: 4.428338 2025-01-16 01:15:45,858 - INFO - step 534, loss: 5.051551, best loss: 4.428338 2025-01-16 01:15:46,008 - INFO - step 535, loss: 4.578081, best loss: 4.428338 2025-01-16 01:15:46,158 - INFO - step 536, loss: 5.229028, best loss: 4.428338 2025-01-16 01:15:46,307 - INFO - step 537, loss: 5.483756, best loss: 4.428338 2025-01-16 01:15:46,457 - INFO - step 538, loss: 5.883965, best loss: 4.428338 2025-01-16 01:15:46,606 - INFO - step 539, loss: 5.704124, best loss: 4.428338 2025-01-16 01:15:46,756 - INFO - step 540, loss: 5.689432, best loss: 4.428338 2025-01-16 01:15:46,905 - INFO - step 541, loss: 5.695269, best loss: 4.428338 2025-01-16 01:15:47,055 - INFO - step 542, loss: 5.859981, best loss: 4.428338 2025-01-16 01:15:47,205 - INFO - step 543, loss: 5.350374, best loss: 4.428338 2025-01-16 01:15:47,354 - INFO - step 544, loss: 5.421849, best loss: 4.428338 2025-01-16 01:15:47,504 - INFO - step 545, loss: 5.502777, best loss: 4.428338 2025-01-16 01:15:47,654 - INFO - step 546, loss: 5.380705, best loss: 4.428338 2025-01-16 01:15:47,803 - INFO - step 547, loss: 5.513247, best loss: 4.428338 2025-01-16 01:15:47,953 - INFO - step 548, loss: 5.212142, best loss: 4.428338 2025-01-16 01:15:48,103 - INFO - step 549, loss: 5.169964, best loss: 4.428338 2025-01-16 01:15:48,252 - INFO - step 550, loss: 5.280539, best loss: 4.428338 2025-01-16 01:15:48,402 - INFO - step 551, loss: 5.467798, best loss: 4.428338 2025-01-16 01:15:48,552 - INFO - step 552, loss: 5.162021, best loss: 4.428338 2025-01-16 01:15:48,701 - INFO - step 553, loss: 5.585567, best loss: 4.428338 2025-01-16 01:15:48,851 - INFO - step 554, loss: 5.618466, best loss: 4.428338 2025-01-16 01:15:49,000 - INFO - 
step 555, loss: 5.841139, best loss: 4.428338 2025-01-16 01:15:49,150 - INFO - step 556, loss: 5.811703, best loss: 4.428338 2025-01-16 01:15:49,300 - INFO - step 557, loss: 5.881956, best loss: 4.428338 2025-01-16 01:15:49,450 - INFO - step 558, loss: 5.494566, best loss: 4.428338 2025-01-16 01:15:49,600 - INFO - step 559, loss: 5.581909, best loss: 4.428338 2025-01-16 01:15:49,750 - INFO - step 560, loss: 5.560174, best loss: 4.428338 2025-01-16 01:15:49,900 - INFO - step 561, loss: 5.875561, best loss: 4.428338 2025-01-16 01:15:50,049 - INFO - step 562, loss: 5.402814, best loss: 4.428338 2025-01-16 01:15:50,199 - INFO - step 563, loss: 5.302403, best loss: 4.428338 2025-01-16 01:15:50,349 - INFO - step 564, loss: 5.267731, best loss: 4.428338 2025-01-16 01:15:50,498 - INFO - step 565, loss: 5.161955, best loss: 4.428338 2025-01-16 01:15:50,648 - INFO - step 566, loss: 5.525698, best loss: 4.428338 2025-01-16 01:15:50,798 - INFO - step 567, loss: 5.092316, best loss: 4.428338 2025-01-16 01:15:50,947 - INFO - step 568, loss: 5.053370, best loss: 4.428338 2025-01-16 01:15:51,097 - INFO - step 569, loss: 5.510680, best loss: 4.428338 2025-01-16 01:15:51,247 - INFO - step 570, loss: 5.121903, best loss: 4.428338 2025-01-16 01:15:51,396 - INFO - step 571, loss: 5.300208, best loss: 4.428338 2025-01-16 01:15:51,546 - INFO - step 572, loss: 5.121206, best loss: 4.428338 2025-01-16 01:15:51,696 - INFO - step 573, loss: 5.896414, best loss: 4.428338 2025-01-16 01:15:51,846 - INFO - step 574, loss: 5.544085, best loss: 4.428338 2025-01-16 01:15:51,995 - INFO - step 575, loss: 5.059411, best loss: 4.428338 2025-01-16 01:15:52,145 - INFO - step 576, loss: 5.010740, best loss: 4.428338 2025-01-16 01:15:52,295 - INFO - step 577, loss: 5.517527, best loss: 4.428338 2025-01-16 01:15:52,444 - INFO - step 578, loss: 5.249890, best loss: 4.428338 2025-01-16 01:15:52,594 - INFO - step 579, loss: 5.416392, best loss: 4.428338 2025-01-16 01:15:52,744 - INFO - step 580, loss: 
5.510307, best loss: 4.428338 2025-01-16 01:15:52,893 - INFO - step 581, loss: 5.496721, best loss: 4.428338 2025-01-16 01:15:53,043 - INFO - step 582, loss: 5.421468, best loss: 4.428338 2025-01-16 01:15:53,193 - INFO - step 583, loss: 5.176512, best loss: 4.428338 2025-01-16 01:15:53,342 - INFO - step 584, loss: 4.951984, best loss: 4.428338 2025-01-16 01:15:53,492 - INFO - step 585, loss: 5.104851, best loss: 4.428338 2025-01-16 01:15:53,642 - INFO - step 586, loss: 4.895884, best loss: 4.428338 2025-01-16 01:15:53,791 - INFO - step 587, loss: 4.798601, best loss: 4.428338 2025-01-16 01:15:53,941 - INFO - step 588, loss: 5.325092, best loss: 4.428338 2025-01-16 01:15:54,091 - INFO - step 589, loss: 5.070796, best loss: 4.428338 2025-01-16 01:15:54,240 - INFO - step 590, loss: 4.962511, best loss: 4.428338 2025-01-16 01:15:54,390 - INFO - step 591, loss: 5.137301, best loss: 4.428338 2025-01-16 01:15:54,540 - INFO - step 592, loss: 5.210496, best loss: 4.428338 2025-01-16 01:15:54,690 - INFO - step 593, loss: 5.197955, best loss: 4.428338 2025-01-16 01:15:54,839 - INFO - step 594, loss: 5.300178, best loss: 4.428338 2025-01-16 01:15:54,990 - INFO - step 595, loss: 5.528227, best loss: 4.428338 2025-01-16 01:15:55,139 - INFO - step 596, loss: 5.014107, best loss: 4.428338 2025-01-16 01:15:55,289 - INFO - step 597, loss: 4.991662, best loss: 4.428338 2025-01-16 01:15:55,439 - INFO - step 598, loss: 5.233101, best loss: 4.428338 2025-01-16 01:15:55,589 - INFO - step 599, loss: 5.382183, best loss: 4.428338 2025-01-16 01:15:55,738 - INFO - step 600, loss: 4.974504, best loss: 4.428338 2025-01-16 01:15:55,888 - INFO - step 601, loss: 4.765510, best loss: 4.428338 2025-01-16 01:15:56,038 - INFO - step 602, loss: 4.974107, best loss: 4.428338 2025-01-16 01:15:56,187 - INFO - step 603, loss: 5.319493, best loss: 4.428338 2025-01-16 01:15:56,337 - INFO - step 604, loss: 4.712119, best loss: 4.428338 2025-01-16 01:15:56,487 - INFO - step 605, loss: 5.084779, best loss: 
4.428338 2025-01-16 01:15:56,636 - INFO - step 606, loss: 5.028487, best loss: 4.428338 2025-01-16 01:15:56,786 - INFO - step 607, loss: 4.772764, best loss: 4.428338 2025-01-16 01:15:56,936 - INFO - step 608, loss: 4.762350, best loss: 4.428338 2025-01-16 01:15:57,085 - INFO - step 609, loss: 5.019977, best loss: 4.428338 2025-01-16 01:15:57,235 - INFO - step 610, loss: 4.560853, best loss: 4.428338 2025-01-16 01:15:57,385 - INFO - step 611, loss: 4.895277, best loss: 4.428338 2025-01-16 01:15:57,534 - INFO - step 612, loss: 4.497416, best loss: 4.428338 2025-01-16 01:15:57,684 - INFO - step 613, loss: 5.141530, best loss: 4.428338 2025-01-16 01:15:57,834 - INFO - step 614, loss: 5.527625, best loss: 4.428338 2025-01-16 01:15:57,983 - INFO - step 615, loss: 6.067667, best loss: 4.428338 2025-01-16 01:15:58,133 - INFO - step 616, loss: 5.214848, best loss: 4.428338 2025-01-16 01:15:58,283 - INFO - step 617, loss: 5.573472, best loss: 4.428338 2025-01-16 01:15:58,433 - INFO - step 618, loss: 5.443247, best loss: 4.428338 2025-01-16 01:15:58,583 - INFO - step 619, loss: 5.210669, best loss: 4.428338 2025-01-16 01:15:58,733 - INFO - step 620, loss: 5.157336, best loss: 4.428338 2025-01-16 01:15:58,882 - INFO - step 621, loss: 5.555627, best loss: 4.428338 2025-01-16 01:15:59,032 - INFO - step 622, loss: 5.148558, best loss: 4.428338 2025-01-16 01:15:59,182 - INFO - step 623, loss: 4.776489, best loss: 4.428338 2025-01-16 01:15:59,332 - INFO - step 624, loss: 5.093853, best loss: 4.428338 2025-01-16 01:15:59,482 - INFO - step 625, loss: 5.005960, best loss: 4.428338 2025-01-16 01:15:59,632 - INFO - step 626, loss: 5.236046, best loss: 4.428338 2025-01-16 01:15:59,782 - INFO - step 627, loss: 4.552772, best loss: 4.428338 2025-01-16 01:15:59,932 - INFO - step 628, loss: 4.985428, best loss: 4.428338 2025-01-16 01:16:00,081 - INFO - step 629, loss: 5.225046, best loss: 4.428338 2025-01-16 01:16:00,231 - INFO - step 630, loss: 5.343008, best loss: 4.428338 2025-01-16 
01:16:00,380 - INFO - step 631, loss: 4.995236, best loss: 4.428338 2025-01-16 01:16:00,530 - INFO - step 632, loss: 5.048373, best loss: 4.428338 2025-01-16 01:16:00,680 - INFO - step 633, loss: 5.037751, best loss: 4.428338 2025-01-16 01:16:00,830 - INFO - step 634, loss: 4.654148, best loss: 4.428338 2025-01-16 01:16:00,980 - INFO - step 635, loss: 5.307111, best loss: 4.428338 2025-01-16 01:16:01,129 - INFO - step 636, loss: 4.994147, best loss: 4.428338 2025-01-16 01:16:01,279 - INFO - step 637, loss: 4.959507, best loss: 4.428338 2025-01-16 01:16:01,429 - INFO - step 638, loss: 4.651282, best loss: 4.428338 2025-01-16 01:16:01,578 - INFO - step 639, loss: 4.607708, best loss: 4.428338 2025-01-16 01:16:01,728 - INFO - step 640, loss: 4.757521, best loss: 4.428338 2025-01-16 01:16:01,878 - INFO - step 641, loss: 4.639140, best loss: 4.428338 2025-01-16 01:16:02,028 - INFO - step 642, loss: 4.615286, best loss: 4.428338 2025-01-16 01:16:02,178 - INFO - step 643, loss: 4.649825, best loss: 4.428338 2025-01-16 01:16:02,328 - INFO - step 644, loss: 4.619293, best loss: 4.428338 2025-01-16 01:16:02,478 - INFO - step 645, loss: 4.554084, best loss: 4.428338 2025-01-16 01:16:05,991 - INFO - step 646, loss: 4.270548, best loss: 4.270548 2025-01-16 01:16:09,462 - INFO - step 647, loss: 4.080480, best loss: 4.080480 2025-01-16 01:16:09,614 - INFO - step 648, loss: 4.876622, best loss: 4.080480 2025-01-16 01:16:09,764 - INFO - step 649, loss: 5.349923, best loss: 4.080480 2025-01-16 01:16:09,914 - INFO - step 650, loss: 5.560199, best loss: 4.080480 2025-01-16 01:16:10,064 - INFO - step 651, loss: 5.786558, best loss: 4.080480 2025-01-16 01:16:10,214 - INFO - step 652, loss: 5.790942, best loss: 4.080480 2025-01-16 01:16:10,364 - INFO - step 653, loss: 5.482889, best loss: 4.080480 2025-01-16 01:16:10,514 - INFO - step 654, loss: 5.620522, best loss: 4.080480 2025-01-16 01:16:10,663 - INFO - step 655, loss: 5.598651, best loss: 4.080480 2025-01-16 01:16:10,813 - INFO - 
step 656, loss: 5.040321, best loss: 4.080480 2025-01-16 01:16:10,963 - INFO - step 657, loss: 5.036730, best loss: 4.080480 2025-01-16 01:16:11,113 - INFO - step 658, loss: 5.216171, best loss: 4.080480 2025-01-16 01:16:11,263 - INFO - step 659, loss: 5.081857, best loss: 4.080480 2025-01-16 01:16:11,413 - INFO - step 660, loss: 5.634674, best loss: 4.080480 2025-01-16 01:16:11,563 - INFO - step 661, loss: 5.388659, best loss: 4.080480 2025-01-16 01:16:11,713 - INFO - step 662, loss: 5.661983, best loss: 4.080480 2025-01-16 01:16:11,863 - INFO - step 663, loss: 5.410127, best loss: 4.080480 2025-01-16 01:16:12,012 - INFO - step 664, loss: 5.590026, best loss: 4.080480 2025-01-16 01:16:12,162 - INFO - step 665, loss: 5.166814, best loss: 4.080480 2025-01-16 01:16:12,312 - INFO - step 666, loss: 5.366580, best loss: 4.080480 2025-01-16 01:16:12,462 - INFO - step 667, loss: 5.184182, best loss: 4.080480 2025-01-16 01:16:12,612 - INFO - step 668, loss: 5.326493, best loss: 4.080480 2025-01-16 01:16:12,762 - INFO - step 669, loss: 5.155212, best loss: 4.080480 2025-01-16 01:16:12,912 - INFO - step 670, loss: 5.033990, best loss: 4.080480 2025-01-16 01:16:13,061 - INFO - step 671, loss: 5.708227, best loss: 4.080480 2025-01-16 01:16:13,211 - INFO - step 672, loss: 5.000329, best loss: 4.080480 2025-01-16 01:16:13,361 - INFO - step 673, loss: 5.357898, best loss: 4.080480 2025-01-16 01:16:13,511 - INFO - step 674, loss: 5.245154, best loss: 4.080480 2025-01-16 01:16:13,661 - INFO - step 675, loss: 5.262615, best loss: 4.080480 2025-01-16 01:16:13,811 - INFO - step 676, loss: 5.186070, best loss: 4.080480 2025-01-16 01:16:13,961 - INFO - step 677, loss: 4.974766, best loss: 4.080480 2025-01-16 01:16:14,110 - INFO - step 678, loss: 4.840732, best loss: 4.080480 2025-01-16 01:16:14,260 - INFO - step 679, loss: 5.011436, best loss: 4.080480 2025-01-16 01:16:14,410 - INFO - step 680, loss: 4.506796, best loss: 4.080480 2025-01-16 01:16:14,560 - INFO - step 681, loss: 
5.270014, best loss: 4.080480 2025-01-16 01:16:14,710 - INFO - step 682, loss: 4.480711, best loss: 4.080480 2025-01-16 01:16:14,860 - INFO - step 683, loss: 4.322324, best loss: 4.080480 2025-01-16 01:16:15,009 - INFO - step 684, loss: 4.843074, best loss: 4.080480 2025-01-16 01:16:15,159 - INFO - step 685, loss: 4.819137, best loss: 4.080480 2025-01-16 01:16:15,309 - INFO - step 686, loss: 4.866986, best loss: 4.080480 2025-01-16 01:16:15,459 - INFO - step 687, loss: 4.521440, best loss: 4.080480 2025-01-16 01:16:15,609 - INFO - step 688, loss: 4.801044, best loss: 4.080480 2025-01-16 01:16:15,758 - INFO - step 689, loss: 4.814404, best loss: 4.080480 2025-01-16 01:16:15,908 - INFO - step 690, loss: 4.855151, best loss: 4.080480 2025-01-16 01:16:16,058 - INFO - step 691, loss: 4.709825, best loss: 4.080480 2025-01-16 01:16:16,208 - INFO - step 692, loss: 5.122548, best loss: 4.080480 2025-01-16 01:16:16,358 - INFO - step 693, loss: 5.159893, best loss: 4.080480 2025-01-16 01:16:16,508 - INFO - step 694, loss: 5.063462, best loss: 4.080480 2025-01-16 01:16:16,657 - INFO - step 695, loss: 4.614593, best loss: 4.080480 2025-01-16 01:16:16,807 - INFO - step 696, loss: 4.699930, best loss: 4.080480 2025-01-16 01:16:16,957 - INFO - step 697, loss: 5.106384, best loss: 4.080480 2025-01-16 01:16:17,106 - INFO - step 698, loss: 4.568625, best loss: 4.080480 2025-01-16 01:16:17,256 - INFO - step 699, loss: 5.249374, best loss: 4.080480 2025-01-16 01:16:17,406 - INFO - step 700, loss: 5.126807, best loss: 4.080480 2025-01-16 01:16:17,555 - INFO - step 701, loss: 5.147311, best loss: 4.080480 2025-01-16 01:16:17,705 - INFO - step 702, loss: 5.046470, best loss: 4.080480 2025-01-16 01:16:17,855 - INFO - step 703, loss: 5.074219, best loss: 4.080480 2025-01-16 01:16:18,004 - INFO - step 704, loss: 5.063146, best loss: 4.080480 2025-01-16 01:16:18,154 - INFO - step 705, loss: 4.901693, best loss: 4.080480 2025-01-16 01:16:18,304 - INFO - step 706, loss: 5.690214, best loss: 
4.080480 2025-01-16 01:16:18,454 - INFO - step 707, loss: 5.250329, best loss: 4.080480 2025-01-16 01:16:18,603 - INFO - step 708, loss: 5.609388, best loss: 4.080480 2025-01-16 01:16:18,753 - INFO - step 709, loss: 4.993277, best loss: 4.080480 2025-01-16 01:16:18,903 - INFO - step 710, loss: 4.883810, best loss: 4.080480 2025-01-16 01:16:19,052 - INFO - step 711, loss: 5.183537, best loss: 4.080480 2025-01-16 01:16:19,202 - INFO - step 712, loss: 5.220796, best loss: 4.080480 2025-01-16 01:16:19,352 - INFO - step 713, loss: 4.950719, best loss: 4.080480 2025-01-16 01:16:19,502 - INFO - step 714, loss: 5.128646, best loss: 4.080480 2025-01-16 01:16:19,651 - INFO - step 715, loss: 4.975901, best loss: 4.080480 2025-01-16 01:16:19,801 - INFO - step 716, loss: 4.946092, best loss: 4.080480 2025-01-16 01:16:19,951 - INFO - step 717, loss: 5.326333, best loss: 4.080480 2025-01-16 01:16:20,100 - INFO - step 718, loss: 4.922956, best loss: 4.080480 2025-01-16 01:16:20,250 - INFO - step 719, loss: 4.810004, best loss: 4.080480 2025-01-16 01:16:20,400 - INFO - step 720, loss: 4.917301, best loss: 4.080480 2025-01-16 01:16:20,550 - INFO - step 721, loss: 5.005477, best loss: 4.080480 2025-01-16 01:16:20,700 - INFO - step 722, loss: 4.963332, best loss: 4.080480 2025-01-16 01:16:20,850 - INFO - step 723, loss: 5.095877, best loss: 4.080480 2025-01-16 01:16:21,000 - INFO - step 724, loss: 4.819242, best loss: 4.080480 2025-01-16 01:16:21,150 - INFO - step 725, loss: 4.767730, best loss: 4.080480 2025-01-16 01:16:21,299 - INFO - step 726, loss: 4.984897, best loss: 4.080480 2025-01-16 01:16:21,449 - INFO - step 727, loss: 4.312194, best loss: 4.080480 2025-01-16 01:16:21,599 - INFO - step 728, loss: 4.716308, best loss: 4.080480 2025-01-16 01:16:21,748 - INFO - step 729, loss: 4.871198, best loss: 4.080480 2025-01-16 01:16:21,898 - INFO - step 730, loss: 4.713090, best loss: 4.080480 2025-01-16 01:16:22,048 - INFO - step 731, loss: 4.672776, best loss: 4.080480 2025-01-16 
01:16:22,198 - INFO - step 732, loss: 4.933290, best loss: 4.080480 2025-01-16 01:16:22,348 - INFO - step 733, loss: 5.148425, best loss: 4.080480 2025-01-16 01:16:22,497 - INFO - step 734, loss: 4.909829, best loss: 4.080480 2025-01-16 01:16:22,647 - INFO - step 735, loss: 5.140857, best loss: 4.080480 2025-01-16 01:16:22,797 - INFO - step 736, loss: 4.775753, best loss: 4.080480 2025-01-16 01:16:22,947 - INFO - step 737, loss: 4.931338, best loss: 4.080480 2025-01-16 01:16:23,096 - INFO - step 738, loss: 4.826108, best loss: 4.080480 2025-01-16 01:16:23,246 - INFO - step 739, loss: 4.593310, best loss: 4.080480 2025-01-16 01:16:23,396 - INFO - step 740, loss: 5.395525, best loss: 4.080480 2025-01-16 01:16:23,546 - INFO - step 741, loss: 5.256794, best loss: 4.080480 2025-01-16 01:16:23,696 - INFO - step 742, loss: 4.912005, best loss: 4.080480 2025-01-16 01:16:23,846 - INFO - step 743, loss: 4.821586, best loss: 4.080480 2025-01-16 01:16:23,996 - INFO - step 744, loss: 4.515374, best loss: 4.080480 2025-01-16 01:16:24,146 - INFO - step 745, loss: 4.669507, best loss: 4.080480 2025-01-16 01:16:24,295 - INFO - step 746, loss: 4.612502, best loss: 4.080480 2025-01-16 01:16:24,445 - INFO - step 747, loss: 4.570557, best loss: 4.080480 2025-01-16 01:16:24,595 - INFO - step 748, loss: 5.284494, best loss: 4.080480 2025-01-16 01:16:24,745 - INFO - step 749, loss: 5.234517, best loss: 4.080480 2025-01-16 01:16:24,894 - INFO - step 750, loss: 5.000631, best loss: 4.080480 2025-01-16 01:16:25,044 - INFO - step 751, loss: 5.191642, best loss: 4.080480 2025-01-16 01:16:25,194 - INFO - step 752, loss: 4.914756, best loss: 4.080480 2025-01-16 01:16:25,344 - INFO - step 753, loss: 5.338488, best loss: 4.080480 2025-01-16 01:16:25,494 - INFO - step 754, loss: 5.454067, best loss: 4.080480 2025-01-16 01:16:25,643 - INFO - step 755, loss: 5.397747, best loss: 4.080480 2025-01-16 01:16:25,793 - INFO - step 756, loss: 5.496190, best loss: 4.080480 2025-01-16 01:16:25,943 - INFO - 
step 757, loss: 5.210244, best loss: 4.080480 2025-01-16 01:16:26,093 - INFO - step 758, loss: 5.477654, best loss: 4.080480 2025-01-16 01:16:26,243 - INFO - step 759, loss: 5.246481, best loss: 4.080480 2025-01-16 01:16:26,393 - INFO - step 760, loss: 5.194191, best loss: 4.080480 2025-01-16 01:16:26,542 - INFO - step 761, loss: 5.404346, best loss: 4.080480 2025-01-16 01:16:26,692 - INFO - step 762, loss: 5.425000, best loss: 4.080480 2025-01-16 01:16:26,841 - INFO - step 763, loss: 5.145512, best loss: 4.080480 2025-01-16 01:16:26,992 - INFO - step 764, loss: 5.095736, best loss: 4.080480 2025-01-16 01:16:27,142 - INFO - step 765, loss: 5.406224, best loss: 4.080480 2025-01-16 01:16:27,291 - INFO - step 766, loss: 5.096498, best loss: 4.080480 2025-01-16 01:16:27,441 - INFO - step 767, loss: 5.048913, best loss: 4.080480 2025-01-16 01:16:27,590 - INFO - step 768, loss: 4.939571, best loss: 4.080480 2025-01-16 01:16:27,741 - INFO - step 769, loss: 5.143640, best loss: 4.080480 2025-01-16 01:16:27,890 - INFO - step 770, loss: 5.201542, best loss: 4.080480 2025-01-16 01:16:28,040 - INFO - step 771, loss: 5.438045, best loss: 4.080480 2025-01-16 01:16:28,190 - INFO - step 772, loss: 5.352585, best loss: 4.080480 2025-01-16 01:16:28,340 - INFO - step 773, loss: 4.958022, best loss: 4.080480 2025-01-16 01:16:28,489 - INFO - step 774, loss: 5.286680, best loss: 4.080480 2025-01-16 01:16:28,639 - INFO - step 775, loss: 4.875582, best loss: 4.080480 2025-01-16 01:16:28,789 - INFO - step 776, loss: 5.348083, best loss: 4.080480 2025-01-16 01:16:28,939 - INFO - step 777, loss: 4.907395, best loss: 4.080480 2025-01-16 01:16:29,089 - INFO - step 778, loss: 5.062779, best loss: 4.080480 2025-01-16 01:16:29,238 - INFO - step 779, loss: 4.943466, best loss: 4.080480 2025-01-16 01:16:29,388 - INFO - step 780, loss: 4.963500, best loss: 4.080480 2025-01-16 01:16:29,538 - INFO - step 781, loss: 4.791018, best loss: 4.080480 2025-01-16 01:16:29,688 - INFO - step 782, loss: 
5.038814, best loss: 4.080480 2025-01-16 01:16:29,837 - INFO - step 783, loss: 4.628893, best loss: 4.080480 2025-01-16 01:16:29,987 - INFO - step 784, loss: 4.319673, best loss: 4.080480 2025-01-16 01:16:30,137 - INFO - step 785, loss: 4.497497, best loss: 4.080480 2025-01-16 01:16:30,287 - INFO - step 786, loss: 4.735953, best loss: 4.080480 2025-01-16 01:16:30,437 - INFO - step 787, loss: 5.431400, best loss: 4.080480 2025-01-16 01:16:30,587 - INFO - step 788, loss: 4.967247, best loss: 4.080480 2025-01-16 01:16:30,736 - INFO - step 789, loss: 5.219985, best loss: 4.080480 2025-01-16 01:16:30,886 - INFO - step 790, loss: 5.412382, best loss: 4.080480 2025-01-16 01:16:31,036 - INFO - step 791, loss: 4.960702, best loss: 4.080480 2025-01-16 01:16:31,186 - INFO - step 792, loss: 5.522036, best loss: 4.080480 2025-01-16 01:16:31,336 - INFO - step 793, loss: 5.099906, best loss: 4.080480 2025-01-16 01:16:31,486 - INFO - step 794, loss: 5.202086, best loss: 4.080480 2025-01-16 01:16:31,636 - INFO - step 795, loss: 5.040035, best loss: 4.080480 2025-01-16 01:16:31,785 - INFO - step 796, loss: 5.704251, best loss: 4.080480 2025-01-16 01:16:31,935 - INFO - step 797, loss: 4.966858, best loss: 4.080480 2025-01-16 01:16:32,085 - INFO - step 798, loss: 4.800507, best loss: 4.080480 2025-01-16 01:16:32,235 - INFO - step 799, loss: 5.221467, best loss: 4.080480 2025-01-16 01:16:32,384 - INFO - step 800, loss: 4.913869, best loss: 4.080480 2025-01-16 01:16:32,534 - INFO - step 801, loss: 4.712706, best loss: 4.080480 2025-01-16 01:16:32,684 - INFO - step 802, loss: 5.514788, best loss: 4.080480 2025-01-16 01:16:32,834 - INFO - step 803, loss: 5.136098, best loss: 4.080480 2025-01-16 01:16:32,984 - INFO - step 804, loss: 4.631621, best loss: 4.080480 2025-01-16 01:16:33,133 - INFO - step 805, loss: 4.782257, best loss: 4.080480 2025-01-16 01:16:33,283 - INFO - step 806, loss: 5.003664, best loss: 4.080480 2025-01-16 01:16:33,433 - INFO - step 807, loss: 4.986492, best loss: 
4.080480 2025-01-16 01:16:33,583 - INFO - step 808, loss: 5.007578, best loss: 4.080480 2025-01-16 01:16:33,732 - INFO - step 809, loss: 4.819805, best loss: 4.080480 2025-01-16 01:16:33,882 - INFO - step 810, loss: 5.423862, best loss: 4.080480 2025-01-16 01:16:34,032 - INFO - step 811, loss: 5.084391, best loss: 4.080480 2025-01-16 01:16:34,182 - INFO - step 812, loss: 5.060486, best loss: 4.080480 2025-01-16 01:16:34,332 - INFO - step 813, loss: 4.833265, best loss: 4.080480 2025-01-16 01:16:34,482 - INFO - step 814, loss: 5.030151, best loss: 4.080480 2025-01-16 01:16:34,632 - INFO - step 815, loss: 4.838317, best loss: 4.080480 2025-01-16 01:16:34,781 - INFO - step 816, loss: 4.542569, best loss: 4.080480 2025-01-16 01:16:34,931 - INFO - step 817, loss: 4.946984, best loss: 4.080480 2025-01-16 01:16:35,081 - INFO - step 818, loss: 4.534706, best loss: 4.080480 2025-01-16 01:16:35,231 - INFO - step 819, loss: 4.969966, best loss: 4.080480 2025-01-16 01:16:35,381 - INFO - step 820, loss: 4.848241, best loss: 4.080480 2025-01-16 01:16:35,531 - INFO - step 821, loss: 4.911795, best loss: 4.080480 2025-01-16 01:16:35,681 - INFO - step 822, loss: 4.713475, best loss: 4.080480 2025-01-16 01:16:35,831 - INFO - step 823, loss: 5.230385, best loss: 4.080480 2025-01-16 01:16:35,981 - INFO - step 824, loss: 5.402587, best loss: 4.080480 2025-01-16 01:16:36,131 - INFO - step 825, loss: 5.107879, best loss: 4.080480 2025-01-16 01:16:36,281 - INFO - step 826, loss: 5.130954, best loss: 4.080480 2025-01-16 01:16:36,431 - INFO - step 827, loss: 4.735476, best loss: 4.080480 2025-01-16 01:16:36,581 - INFO - step 828, loss: 5.052818, best loss: 4.080480 2025-01-16 01:16:36,731 - INFO - step 829, loss: 5.066325, best loss: 4.080480 2025-01-16 01:16:36,880 - INFO - step 830, loss: 5.074569, best loss: 4.080480 2025-01-16 01:16:37,030 - INFO - step 831, loss: 4.877312, best loss: 4.080480 2025-01-16 01:16:37,180 - INFO - step 832, loss: 4.917575, best loss: 4.080480 2025-01-16 
01:16:37,330 - INFO - step 833, loss: 4.800828, best loss: 4.080480 2025-01-16 01:16:37,480 - INFO - step 834, loss: 4.906640, best loss: 4.080480 2025-01-16 01:16:37,630 - INFO - step 835, loss: 5.323066, best loss: 4.080480 2025-01-16 01:16:37,780 - INFO - step 836, loss: 5.316807, best loss: 4.080480 2025-01-16 01:16:37,930 - INFO - step 837, loss: 5.146011, best loss: 4.080480 2025-01-16 01:16:38,079 - INFO - step 838, loss: 5.242501, best loss: 4.080480 2025-01-16 01:16:38,229 - INFO - step 839, loss: 5.132389, best loss: 4.080480 2025-01-16 01:16:38,379 - INFO - step 840, loss: 5.039059, best loss: 4.080480 2025-01-16 01:16:38,528 - INFO - step 841, loss: 4.599503, best loss: 4.080480 2025-01-16 01:16:38,678 - INFO - step 842, loss: 5.044328, best loss: 4.080480 2025-01-16 01:16:38,828 - INFO - step 843, loss: 5.343104, best loss: 4.080480 2025-01-16 01:16:38,978 - INFO - step 844, loss: 4.921278, best loss: 4.080480 2025-01-16 01:16:39,127 - INFO - step 845, loss: 5.076612, best loss: 4.080480 2025-01-16 01:16:39,277 - INFO - step 846, loss: 5.032590, best loss: 4.080480 2025-01-16 01:16:39,427 - INFO - step 847, loss: 4.571560, best loss: 4.080480 2025-01-16 01:16:39,577 - INFO - step 848, loss: 4.334148, best loss: 4.080480 2025-01-16 01:16:39,727 - INFO - step 849, loss: 4.872010, best loss: 4.080480 2025-01-16 01:16:39,876 - INFO - step 850, loss: 5.123252, best loss: 4.080480 2025-01-16 01:16:40,026 - INFO - step 851, loss: 5.193604, best loss: 4.080480 2025-01-16 01:16:40,176 - INFO - step 852, loss: 4.992482, best loss: 4.080480 2025-01-16 01:16:40,326 - INFO - step 853, loss: 4.688320, best loss: 4.080480 2025-01-16 01:16:40,476 - INFO - step 854, loss: 4.892126, best loss: 4.080480 2025-01-16 01:16:40,625 - INFO - step 855, loss: 4.938465, best loss: 4.080480 2025-01-16 01:16:40,775 - INFO - step 856, loss: 4.701191, best loss: 4.080480 2025-01-16 01:16:40,925 - INFO - step 857, loss: 4.870609, best loss: 4.080480 2025-01-16 01:16:41,075 - INFO - 
step 858, loss: 4.789864, best loss: 4.080480 2025-01-16 01:16:41,224 - INFO - step 859, loss: 4.642659, best loss: 4.080480 2025-01-16 01:16:41,374 - INFO - step 860, loss: 5.068631, best loss: 4.080480 2025-01-16 01:16:41,524 - INFO - step 861, loss: 4.501361, best loss: 4.080480 2025-01-16 01:16:41,674 - INFO - step 862, loss: 4.832747, best loss: 4.080480 2025-01-16 01:16:41,824 - INFO - step 863, loss: 5.160292, best loss: 4.080480 2025-01-16 01:16:41,974 - INFO - step 864, loss: 4.773212, best loss: 4.080480 2025-01-16 01:16:42,124 - INFO - step 865, loss: 4.307539, best loss: 4.080480 2025-01-16 01:16:42,274 - INFO - step 866, loss: 4.940269, best loss: 4.080480 2025-01-16 01:16:42,424 - INFO - step 867, loss: 5.168776, best loss: 4.080480 2025-01-16 01:16:42,574 - INFO - step 868, loss: 5.521045, best loss: 4.080480 2025-01-16 01:16:42,723 - INFO - step 869, loss: 5.339331, best loss: 4.080480 2025-01-16 01:16:42,873 - INFO - step 870, loss: 5.361271, best loss: 4.080480 2025-01-16 01:16:43,023 - INFO - step 871, loss: 5.283003, best loss: 4.080480 2025-01-16 01:16:43,173 - INFO - step 872, loss: 5.523750, best loss: 4.080480 2025-01-16 01:16:43,323 - INFO - step 873, loss: 4.994528, best loss: 4.080480 2025-01-16 01:16:43,473 - INFO - step 874, loss: 5.059849, best loss: 4.080480 2025-01-16 01:16:43,623 - INFO - step 875, loss: 5.129005, best loss: 4.080480 2025-01-16 01:16:43,773 - INFO - step 876, loss: 5.078990, best loss: 4.080480 2025-01-16 01:16:43,923 - INFO - step 877, loss: 5.163375, best loss: 4.080480 2025-01-16 01:16:44,072 - INFO - step 878, loss: 4.894025, best loss: 4.080480 2025-01-16 01:16:44,222 - INFO - step 879, loss: 4.854370, best loss: 4.080480 2025-01-16 01:16:44,372 - INFO - step 880, loss: 5.017024, best loss: 4.080480 2025-01-16 01:16:44,522 - INFO - step 881, loss: 5.157976, best loss: 4.080480 2025-01-16 01:16:44,671 - INFO - step 882, loss: 4.863299, best loss: 4.080480 2025-01-16 01:16:44,821 - INFO - step 883, loss: 
5.299779, best loss: 4.080480 2025-01-16 01:16:44,971 - INFO - step 884, loss: 5.335347, best loss: 4.080480 2025-01-16 01:16:45,121 - INFO - step 885, loss: 5.569180, best loss: 4.080480 2025-01-16 01:16:45,270 - INFO - step 886, loss: 5.533322, best loss: 4.080480 2025-01-16 01:16:45,420 - INFO - step 887, loss: 5.605604, best loss: 4.080480 2025-01-16 01:16:45,570 - INFO - step 888, loss: 5.142260, best loss: 4.080480 2025-01-16 01:16:45,720 - INFO - step 889, loss: 5.326534, best loss: 4.080480 2025-01-16 01:16:45,870 - INFO - step 890, loss: 5.292044, best loss: 4.080480 2025-01-16 01:16:46,019 - INFO - step 891, loss: 5.550233, best loss: 4.080480 2025-01-16 01:16:46,169 - INFO - step 892, loss: 5.099695, best loss: 4.080480 2025-01-16 01:16:46,319 - INFO - step 893, loss: 5.026720, best loss: 4.080480 2025-01-16 01:16:46,469 - INFO - step 894, loss: 5.012986, best loss: 4.080480 2025-01-16 01:16:46,619 - INFO - step 895, loss: 4.922065, best loss: 4.080480 2025-01-16 01:16:46,769 - INFO - step 896, loss: 5.243456, best loss: 4.080480 2025-01-16 01:16:46,918 - INFO - step 897, loss: 4.767737, best loss: 4.080480 2025-01-16 01:16:47,068 - INFO - step 898, loss: 4.753106, best loss: 4.080480 2025-01-16 01:16:47,218 - INFO - step 899, loss: 5.225980, best loss: 4.080480 2025-01-16 01:16:47,368 - INFO - step 900, loss: 4.901911, best loss: 4.080480 2025-01-16 01:16:47,517 - INFO - step 901, loss: 5.100704, best loss: 4.080480 2025-01-16 01:16:47,667 - INFO - step 902, loss: 4.895215, best loss: 4.080480 2025-01-16 01:16:47,817 - INFO - step 903, loss: 5.591096, best loss: 4.080480 2025-01-16 01:16:47,967 - INFO - step 904, loss: 5.280798, best loss: 4.080480 2025-01-16 01:16:48,118 - INFO - step 905, loss: 4.836327, best loss: 4.080480 2025-01-16 01:16:48,268 - INFO - step 906, loss: 4.783313, best loss: 4.080480 2025-01-16 01:16:48,418 - INFO - step 907, loss: 5.300454, best loss: 4.080480 2025-01-16 01:16:48,568 - INFO - step 908, loss: 4.972814, best loss: 
4.080480 2025-01-16 01:16:48,718 - INFO - step 909, loss: 5.067006, best loss: 4.080480 2025-01-16 01:16:48,868 - INFO - step 910, loss: 5.252483, best loss: 4.080480 2025-01-16 01:16:49,017 - INFO - step 911, loss: 5.205988, best loss: 4.080480 2025-01-16 01:16:49,167 - INFO - step 912, loss: 5.140587, best loss: 4.080480 2025-01-16 01:16:49,317 - INFO - step 913, loss: 4.866863, best loss: 4.080480 2025-01-16 01:16:49,467 - INFO - step 914, loss: 4.682424, best loss: 4.080480 2025-01-16 01:16:49,617 - INFO - step 915, loss: 4.797035, best loss: 4.080480 2025-01-16 01:16:49,767 - INFO - step 916, loss: 4.583345, best loss: 4.080480 2025-01-16 01:16:49,916 - INFO - step 917, loss: 4.524280, best loss: 4.080480 2025-01-16 01:16:50,066 - INFO - step 918, loss: 5.059742, best loss: 4.080480 2025-01-16 01:16:50,216 - INFO - step 919, loss: 4.756054, best loss: 4.080480 2025-01-16 01:16:50,366 - INFO - step 920, loss: 4.698910, best loss: 4.080480 2025-01-16 01:16:50,516 - INFO - step 921, loss: 4.870677, best loss: 4.080480 2025-01-16 01:16:50,665 - INFO - step 922, loss: 4.930135, best loss: 4.080480 2025-01-16 01:16:50,815 - INFO - step 923, loss: 4.884700, best loss: 4.080480 2025-01-16 01:16:50,965 - INFO - step 924, loss: 5.011014, best loss: 4.080480 2025-01-16 01:16:51,115 - INFO - step 925, loss: 5.266817, best loss: 4.080480 2025-01-16 01:16:51,264 - INFO - step 926, loss: 4.690986, best loss: 4.080480 2025-01-16 01:16:51,414 - INFO - step 927, loss: 4.705637, best loss: 4.080480 2025-01-16 01:16:51,564 - INFO - step 928, loss: 4.953834, best loss: 4.080480 2025-01-16 01:16:51,714 - INFO - step 929, loss: 5.137105, best loss: 4.080480 2025-01-16 01:16:51,864 - INFO - step 930, loss: 4.663776, best loss: 4.080480 2025-01-16 01:16:52,014 - INFO - step 931, loss: 4.518529, best loss: 4.080480 2025-01-16 01:16:52,164 - INFO - step 932, loss: 4.762403, best loss: 4.080480 2025-01-16 01:16:52,314 - INFO - step 933, loss: 5.004446, best loss: 4.080480 2025-01-16 
01:16:52,464 - INFO - step 934, loss: 4.426937, best loss: 4.080480 2025-01-16 01:16:52,614 - INFO - step 935, loss: 4.843480, best loss: 4.080480 2025-01-16 01:16:52,763 - INFO - step 936, loss: 4.730632, best loss: 4.080480 2025-01-16 01:16:52,913 - INFO - step 937, loss: 4.516703, best loss: 4.080480 2025-01-16 01:16:53,062 - INFO - step 938, loss: 4.517571, best loss: 4.080480 2025-01-16 01:16:53,212 - INFO - step 939, loss: 4.751542, best loss: 4.080480 2025-01-16 01:16:53,362 - INFO - step 940, loss: 4.297875, best loss: 4.080480 2025-01-16 01:16:53,512 - INFO - step 941, loss: 4.605875, best loss: 4.080480 2025-01-16 01:16:53,662 - INFO - step 942, loss: 4.230160, best loss: 4.080480 2025-01-16 01:16:53,812 - INFO - step 943, loss: 4.898784, best loss: 4.080480 2025-01-16 01:16:53,962 - INFO - step 944, loss: 5.243511, best loss: 4.080480 2025-01-16 01:16:54,112 - INFO - step 945, loss: 5.752703, best loss: 4.080480 2025-01-16 01:16:54,261 - INFO - step 946, loss: 4.991428, best loss: 4.080480 2025-01-16 01:16:54,411 - INFO - step 947, loss: 5.331942, best loss: 4.080480 2025-01-16 01:16:54,561 - INFO - step 948, loss: 5.227653, best loss: 4.080480 2025-01-16 01:16:54,711 - INFO - step 949, loss: 4.929822, best loss: 4.080480 2025-01-16 01:16:54,860 - INFO - step 950, loss: 4.853251, best loss: 4.080480 2025-01-16 01:16:55,010 - INFO - step 951, loss: 5.259534, best loss: 4.080480 2025-01-16 01:16:55,160 - INFO - step 952, loss: 4.865597, best loss: 4.080480 2025-01-16 01:16:55,310 - INFO - step 953, loss: 4.468994, best loss: 4.080480 2025-01-16 01:16:55,460 - INFO - step 954, loss: 4.794074, best loss: 4.080480 2025-01-16 01:16:55,610 - INFO - step 955, loss: 4.711510, best loss: 4.080480 2025-01-16 01:16:55,760 - INFO - step 956, loss: 4.932762, best loss: 4.080480 2025-01-16 01:16:55,909 - INFO - step 957, loss: 4.240738, best loss: 4.080480 2025-01-16 01:16:56,059 - INFO - step 958, loss: 4.693844, best loss: 4.080480 2025-01-16 01:16:56,209 - INFO - 
step 959, loss: 4.937788, best loss: 4.080480 2025-01-16 01:16:56,359 - INFO - step 960, loss: 5.025978, best loss: 4.080480 2025-01-16 01:16:56,509 - INFO - step 961, loss: 4.688865, best loss: 4.080480 2025-01-16 01:16:56,658 - INFO - step 962, loss: 4.803966, best loss: 4.080480 2025-01-16 01:16:56,808 - INFO - step 963, loss: 4.759771, best loss: 4.080480 2025-01-16 01:16:56,958 - INFO - step 964, loss: 4.381271, best loss: 4.080480 2025-01-16 01:16:57,108 - INFO - step 965, loss: 5.029188, best loss: 4.080480 2025-01-16 01:16:57,258 - INFO - step 966, loss: 4.770003, best loss: 4.080480 2025-01-16 01:16:57,408 - INFO - step 967, loss: 4.723969, best loss: 4.080480 2025-01-16 01:16:57,558 - INFO - step 968, loss: 4.379445, best loss: 4.080480 2025-01-16 01:16:57,708 - INFO - step 969, loss: 4.385056, best loss: 4.080480 2025-01-16 01:16:57,858 - INFO - step 970, loss: 4.511448, best loss: 4.080480 2025-01-16 01:16:58,007 - INFO - step 971, loss: 4.413971, best loss: 4.080480 2025-01-16 01:16:58,157 - INFO - step 972, loss: 4.426960, best loss: 4.080480 2025-01-16 01:16:58,307 - INFO - step 973, loss: 4.423867, best loss: 4.080480 2025-01-16 01:16:58,457 - INFO - step 974, loss: 4.377290, best loss: 4.080480 2025-01-16 01:16:58,607 - INFO - step 975, loss: 4.277027, best loss: 4.080480 2025-01-16 01:17:01,865 - INFO - step 976, loss: 4.060049, best loss: 4.060049 2025-01-16 01:17:09,089 - INFO - step 977, loss: 3.822340, best loss: 3.822340 2025-01-16 01:17:09,239 - INFO - step 978, loss: 4.649113, best loss: 3.822340 2025-01-16 01:17:09,391 - INFO - step 979, loss: 5.129494, best loss: 3.822340 2025-01-16 01:17:09,541 - INFO - step 980, loss: 5.295849, best loss: 3.822340 2025-01-16 01:17:09,691 - INFO - step 981, loss: 5.512109, best loss: 3.822340 2025-01-16 01:17:09,841 - INFO - step 982, loss: 5.515702, best loss: 3.822340 2025-01-16 01:17:09,991 - INFO - step 983, loss: 5.181946, best loss: 3.822340 2025-01-16 01:17:10,141 - INFO - step 984, loss: 
5.293642, best loss: 3.822340 2025-01-16 01:17:10,291 - INFO - step 985, loss: 5.319295, best loss: 3.822340 2025-01-16 01:17:10,441 - INFO - step 986, loss: 4.811839, best loss: 3.822340 2025-01-16 01:17:10,591 - INFO - step 987, loss: 4.700165, best loss: 3.822340 2025-01-16 01:17:10,740 - INFO - step 988, loss: 4.898229, best loss: 3.822340 2025-01-16 01:17:10,890 - INFO - step 989, loss: 4.789296, best loss: 3.822340 2025-01-16 01:17:11,040 - INFO - step 990, loss: 5.340391, best loss: 3.822340 2025-01-16 01:17:11,191 - INFO - step 991, loss: 5.121665, best loss: 3.822340 2025-01-16 01:17:11,340 - INFO - step 992, loss: 5.323699, best loss: 3.822340 2025-01-16 01:17:11,490 - INFO - step 993, loss: 5.076300, best loss: 3.822340 2025-01-16 01:17:11,641 - INFO - step 994, loss: 5.318087, best loss: 3.822340 2025-01-16 01:17:11,790 - INFO - step 995, loss: 4.894315, best loss: 3.822340 2025-01-16 01:17:11,940 - INFO - step 996, loss: 5.082086, best loss: 3.822340 2025-01-16 01:17:12,091 - INFO - step 997, loss: 4.847385, best loss: 3.822340 2025-01-16 01:17:12,240 - INFO - step 998, loss: 5.071432, best loss: 3.822340 2025-01-16 01:17:12,390 - INFO - step 999, loss: 4.830506, best loss: 3.822340 2025-01-16 01:17:12,540 - INFO - step 1000, loss: 4.717095, best loss: 3.822340 2025-01-16 01:17:12,690 - INFO - step 1001, loss: 5.474697, best loss: 3.822340 2025-01-16 01:17:12,840 - INFO - step 1002, loss: 4.695499, best loss: 3.822340 2025-01-16 01:17:12,990 - INFO - step 1003, loss: 5.094129, best loss: 3.822340 2025-01-16 01:17:13,140 - INFO - step 1004, loss: 4.980521, best loss: 3.822340 2025-01-16 01:17:13,290 - INFO - step 1005, loss: 5.018208, best loss: 3.822340 2025-01-16 01:17:13,440 - INFO - step 1006, loss: 4.955667, best loss: 3.822340 2025-01-16 01:17:13,590 - INFO - step 1007, loss: 4.668766, best loss: 3.822340 2025-01-16 01:17:13,740 - INFO - step 1008, loss: 4.589791, best loss: 3.822340 2025-01-16 01:17:13,889 - INFO - step 1009, loss: 4.751706, best 
loss: 3.822340 2025-01-16 01:17:14,039 - INFO - step 1010, loss: 4.202228, best loss: 3.822340 2025-01-16 01:17:14,189 - INFO - step 1011, loss: 5.000613, best loss: 3.822340 2025-01-16 01:17:14,339 - INFO - step 1012, loss: 4.222201, best loss: 3.822340 2025-01-16 01:17:14,489 - INFO - step 1013, loss: 4.080025, best loss: 3.822340 2025-01-16 01:17:14,639 - INFO - step 1014, loss: 4.598041, best loss: 3.822340 2025-01-16 01:17:14,788 - INFO - step 1015, loss: 4.598392, best loss: 3.822340 2025-01-16 01:17:14,938 - INFO - step 1016, loss: 4.620738, best loss: 3.822340 2025-01-16 01:17:15,088 - INFO - step 1017, loss: 4.267956, best loss: 3.822340 2025-01-16 01:17:15,238 - INFO - step 1018, loss: 4.556950, best loss: 3.822340 2025-01-16 01:17:15,387 - INFO - step 1019, loss: 4.621089, best loss: 3.822340 2025-01-16 01:17:15,538 - INFO - step 1020, loss: 4.660874, best loss: 3.822340 2025-01-16 01:17:15,687 - INFO - step 1021, loss: 4.480968, best loss: 3.822340 2025-01-16 01:17:15,837 - INFO - step 1022, loss: 4.859280, best loss: 3.822340 2025-01-16 01:17:15,987 - INFO - step 1023, loss: 4.904871, best loss: 3.822340 2025-01-16 01:17:16,137 - INFO - step 1024, loss: 4.837596, best loss: 3.822340 2025-01-16 01:17:16,287 - INFO - step 1025, loss: 4.378281, best loss: 3.822340 2025-01-16 01:17:16,436 - INFO - step 1026, loss: 4.514217, best loss: 3.822340 2025-01-16 01:17:16,586 - INFO - step 1027, loss: 4.894362, best loss: 3.822340 2025-01-16 01:17:16,736 - INFO - step 1028, loss: 4.348118, best loss: 3.822340 2025-01-16 01:17:16,885 - INFO - step 1029, loss: 5.008468, best loss: 3.822340 2025-01-16 01:17:17,036 - INFO - step 1030, loss: 4.964454, best loss: 3.822340 2025-01-16 01:17:17,185 - INFO - step 1031, loss: 4.929890, best loss: 3.822340 2025-01-16 01:17:17,335 - INFO - step 1032, loss: 4.836927, best loss: 3.822340 2025-01-16 01:17:17,485 - INFO - step 1033, loss: 4.856476, best loss: 3.822340 2025-01-16 01:17:17,635 - INFO - step 1034, loss: 4.803213, best 
loss: 3.822340 2025-01-16 01:17:17,785 - INFO - step 1035, loss: 4.652755, best loss: 3.822340 2025-01-16 01:17:17,935 - INFO - step 1036, loss: 5.433598, best loss: 3.822340 2025-01-16 01:17:18,084 - INFO - step 1037, loss: 4.980661, best loss: 3.822340 2025-01-16 01:17:18,234 - INFO - step 1038, loss: 5.321484, best loss: 3.822340 2025-01-16 01:17:18,384 - INFO - step 1039, loss: 4.583469, best loss: 3.822340 2025-01-16 01:17:18,533 - INFO - step 1040, loss: 4.471177, best loss: 3.822340 2025-01-16 01:17:18,683 - INFO - step 1041, loss: 4.904380, best loss: 3.822340 2025-01-16 01:17:18,833 - INFO - step 1042, loss: 4.929317, best loss: 3.822340 2025-01-16 01:17:18,982 - INFO - step 1043, loss: 4.620378, best loss: 3.822340 2025-01-16 01:17:19,132 - INFO - step 1044, loss: 4.790607, best loss: 3.822340 2025-01-16 01:17:19,282 - INFO - step 1045, loss: 4.644658, best loss: 3.822340 2025-01-16 01:17:19,432 - INFO - step 1046, loss: 4.671895, best loss: 3.822340 2025-01-16 01:17:19,583 - INFO - step 1047, loss: 5.068351, best loss: 3.822340 2025-01-16 01:17:19,733 - INFO - step 1048, loss: 4.609327, best loss: 3.822340 2025-01-16 01:17:19,883 - INFO - step 1049, loss: 4.524060, best loss: 3.822340 2025-01-16 01:17:20,032 - INFO - step 1050, loss: 4.645939, best loss: 3.822340 2025-01-16 01:17:20,182 - INFO - step 1051, loss: 4.773569, best loss: 3.822340 2025-01-16 01:17:20,332 - INFO - step 1052, loss: 4.733419, best loss: 3.822340 2025-01-16 01:17:20,482 - INFO - step 1053, loss: 4.851995, best loss: 3.822340 2025-01-16 01:17:20,631 - INFO - step 1054, loss: 4.632754, best loss: 3.822340 2025-01-16 01:17:20,781 - INFO - step 1055, loss: 4.447622, best loss: 3.822340 2025-01-16 01:17:20,931 - INFO - step 1056, loss: 4.754843, best loss: 3.822340 2025-01-16 01:17:21,081 - INFO - step 1057, loss: 4.039084, best loss: 3.822340 2025-01-16 01:17:21,231 - INFO - step 1058, loss: 4.424966, best loss: 3.822340 2025-01-16 01:17:21,381 - INFO - step 1059, loss: 4.609906, best 
loss: 3.822340 2025-01-16 01:17:21,530 - INFO - step 1060, loss: 4.501684, best loss: 3.822340 2025-01-16 01:17:21,680 - INFO - step 1061, loss: 4.414448, best loss: 3.822340 2025-01-16 01:17:21,830 - INFO - step 1062, loss: 4.704134, best loss: 3.822340 2025-01-16 01:17:21,980 - INFO - step 1063, loss: 4.957699, best loss: 3.822340 2025-01-16 01:17:22,130 - INFO - step 1064, loss: 4.687964, best loss: 3.822340 2025-01-16 01:17:22,280 - INFO - step 1065, loss: 4.963849, best loss: 3.822340 2025-01-16 01:17:22,429 - INFO - step 1066, loss: 4.557508, best loss: 3.822340 2025-01-16 01:17:22,579 - INFO - step 1067, loss: 4.680689, best loss: 3.822340 2025-01-16 01:17:22,729 - INFO - step 1068, loss: 4.591299, best loss: 3.822340 2025-01-16 01:17:22,879 - INFO - step 1069, loss: 4.358174, best loss: 3.822340 2025-01-16 01:17:23,028 - INFO - step 1070, loss: 5.146415, best loss: 3.822340 2025-01-16 01:17:23,178 - INFO - step 1071, loss: 4.991949, best loss: 3.822340 2025-01-16 01:17:23,328 - INFO - step 1072, loss: 4.660058, best loss: 3.822340 2025-01-16 01:17:23,478 - INFO - step 1073, loss: 4.552716, best loss: 3.822340 2025-01-16 01:17:23,628 - INFO - step 1074, loss: 4.206006, best loss: 3.822340 2025-01-16 01:17:23,778 - INFO - step 1075, loss: 4.415969, best loss: 3.822340 2025-01-16 01:17:23,928 - INFO - step 1076, loss: 4.356648, best loss: 3.822340 2025-01-16 01:17:24,078 - INFO - step 1077, loss: 4.343517, best loss: 3.822340 2025-01-16 01:17:24,228 - INFO - step 1078, loss: 5.087547, best loss: 3.822340 2025-01-16 01:17:24,378 - INFO - step 1079, loss: 4.992053, best loss: 3.822340 2025-01-16 01:17:24,527 - INFO - step 1080, loss: 4.768847, best loss: 3.822340 2025-01-16 01:17:24,677 - INFO - step 1081, loss: 5.021774, best loss: 3.822340 2025-01-16 01:17:24,827 - INFO - step 1082, loss: 4.702421, best loss: 3.822340 2025-01-16 01:17:24,977 - INFO - step 1083, loss: 5.096456, best loss: 3.822340 2025-01-16 01:17:25,127 - INFO - step 1084, loss: 5.231545, best 
loss: 3.822340 2025-01-16 01:17:25,277 - INFO - step 1085, loss: 5.159089, best loss: 3.822340 2025-01-16 01:17:25,427 - INFO - step 1086, loss: 5.243127, best loss: 3.822340 2025-01-16 01:17:25,577 - INFO - step 1087, loss: 4.920466, best loss: 3.822340 2025-01-16 01:17:25,726 - INFO - step 1088, loss: 5.225648, best loss: 3.822340 2025-01-16 01:17:25,876 - INFO - step 1089, loss: 4.989342, best loss: 3.822340 2025-01-16 01:17:26,026 - INFO - step 1090, loss: 4.850152, best loss: 3.822340 2025-01-16 01:17:26,176 - INFO - step 1091, loss: 5.194167, best loss: 3.822340 2025-01-16 01:17:26,326 - INFO - step 1092, loss: 5.187849, best loss: 3.822340 2025-01-16 01:17:26,476 - INFO - step 1093, loss: 4.917020, best loss: 3.822340 2025-01-16 01:17:26,625 - INFO - step 1094, loss: 4.920861, best loss: 3.822340 2025-01-16 01:17:26,775 - INFO - step 1095, loss: 5.211720, best loss: 3.822340 2025-01-16 01:17:26,925 - INFO - step 1096, loss: 4.884522, best loss: 3.822340 2025-01-16 01:17:27,075 - INFO - step 1097, loss: 4.756593, best loss: 3.822340 2025-01-16 01:17:27,224 - INFO - step 1098, loss: 4.622079, best loss: 3.822340 2025-01-16 01:17:27,374 - INFO - step 1099, loss: 4.961782, best loss: 3.822340 2025-01-16 01:17:27,524 - INFO - step 1100, loss: 5.019433, best loss: 3.822340 2025-01-16 01:17:27,674 - INFO - step 1101, loss: 5.200605, best loss: 3.822340 2025-01-16 01:17:27,824 - INFO - step 1102, loss: 5.143195, best loss: 3.822340 2025-01-16 01:17:27,974 - INFO - step 1103, loss: 4.735417, best loss: 3.822340 2025-01-16 01:17:28,123 - INFO - step 1104, loss: 5.081779, best loss: 3.822340 2025-01-16 01:17:28,273 - INFO - step 1105, loss: 4.636673, best loss: 3.822340 2025-01-16 01:17:28,423 - INFO - step 1106, loss: 5.122087, best loss: 3.822340 2025-01-16 01:17:28,573 - INFO - step 1107, loss: 4.671114, best loss: 3.822340 2025-01-16 01:17:28,722 - INFO - step 1108, loss: 4.844568, best loss: 3.822340 2025-01-16 01:17:28,872 - INFO - step 1109, loss: 4.754574, best 
loss: 3.822340 2025-01-16 01:17:29,023 - INFO - step 1110, loss: 4.779024, best loss: 3.822340 2025-01-16 01:17:29,173 - INFO - step 1111, loss: 4.574785, best loss: 3.822340 2025-01-16 01:17:29,323 - INFO - step 1112, loss: 4.818474, best loss: 3.822340 2025-01-16 01:17:29,472 - INFO - step 1113, loss: 4.410142, best loss: 3.822340 2025-01-16 01:17:29,623 - INFO - step 1114, loss: 4.112535, best loss: 3.822340 2025-01-16 01:17:29,772 - INFO - step 1115, loss: 4.274867, best loss: 3.822340 2025-01-16 01:17:29,922 - INFO - step 1116, loss: 4.524991, best loss: 3.822340 2025-01-16 01:17:30,072 - INFO - step 1117, loss: 5.235142, best loss: 3.822340 2025-01-16 01:17:30,222 - INFO - step 1118, loss: 4.772668, best loss: 3.822340 2025-01-16 01:17:30,371 - INFO - step 1119, loss: 4.995851, best loss: 3.822340 2025-01-16 01:17:30,522 - INFO - step 1120, loss: 5.172804, best loss: 3.822340 2025-01-16 01:17:30,671 - INFO - step 1121, loss: 4.685729, best loss: 3.822340 2025-01-16 01:17:30,821 - INFO - step 1122, loss: 5.283796, best loss: 3.822340 2025-01-16 01:17:30,971 - INFO - step 1123, loss: 4.780731, best loss: 3.822340 2025-01-16 01:17:31,121 - INFO - step 1124, loss: 4.939324, best loss: 3.822340 2025-01-16 01:17:31,271 - INFO - step 1125, loss: 4.774073, best loss: 3.822340 2025-01-16 01:17:31,421 - INFO - step 1126, loss: 5.432964, best loss: 3.822340 2025-01-16 01:17:31,570 - INFO - step 1127, loss: 4.736975, best loss: 3.822340 2025-01-16 01:17:31,720 - INFO - step 1128, loss: 4.528766, best loss: 3.822340 2025-01-16 01:17:31,870 - INFO - step 1129, loss: 4.994067, best loss: 3.822340 2025-01-16 01:17:32,020 - INFO - step 1130, loss: 4.653758, best loss: 3.822340 2025-01-16 01:17:32,170 - INFO - step 1131, loss: 4.452290, best loss: 3.822340 2025-01-16 01:17:32,320 - INFO - step 1132, loss: 5.318018, best loss: 3.822340 2025-01-16 01:17:32,469 - INFO - step 1133, loss: 4.891570, best loss: 3.822340 2025-01-16 01:17:32,619 - INFO - step 1134, loss: 4.355906, best 
loss: 3.822340 2025-01-16 01:17:32,769 - INFO - step 1135, loss: 4.602037, best loss: 3.822340 2025-01-16 01:17:32,919 - INFO - step 1136, loss: 4.871978, best loss: 3.822340 2025-01-16 01:17:33,069 - INFO - step 1137, loss: 4.803718, best loss: 3.822340 2025-01-16 01:17:33,218 - INFO - step 1138, loss: 4.787079, best loss: 3.822340 2025-01-16 01:17:33,369 - INFO - step 1139, loss: 4.565081, best loss: 3.822340 2025-01-16 01:17:33,518 - INFO - step 1140, loss: 5.211781, best loss: 3.822340 2025-01-16 01:17:33,668 - INFO - step 1141, loss: 4.953745, best loss: 3.822340 2025-01-16 01:17:33,818 - INFO - step 1142, loss: 4.921594, best loss: 3.822340 2025-01-16 01:17:33,968 - INFO - step 1143, loss: 4.596416, best loss: 3.822340 2025-01-16 01:17:34,118 - INFO - step 1144, loss: 4.802131, best loss: 3.822340 2025-01-16 01:17:34,268 - INFO - step 1145, loss: 4.644826, best loss: 3.822340 2025-01-16 01:17:34,418 - INFO - step 1146, loss: 4.303538, best loss: 3.822340 2025-01-16 01:17:34,567 - INFO - step 1147, loss: 4.787142, best loss: 3.822340 2025-01-16 01:17:34,717 - INFO - step 1148, loss: 4.352244, best loss: 3.822340 2025-01-16 01:17:34,867 - INFO - step 1149, loss: 4.754063, best loss: 3.822340 2025-01-16 01:17:35,017 - INFO - step 1150, loss: 4.581022, best loss: 3.822340 2025-01-16 01:17:35,167 - INFO - step 1151, loss: 4.689787, best loss: 3.822340 2025-01-16 01:17:35,317 - INFO - step 1152, loss: 4.515761, best loss: 3.822340 2025-01-16 01:17:35,467 - INFO - step 1153, loss: 5.008175, best loss: 3.822340 2025-01-16 01:17:35,617 - INFO - step 1154, loss: 5.190750, best loss: 3.822340 2025-01-16 01:17:35,767 - INFO - step 1155, loss: 4.867358, best loss: 3.822340 2025-01-16 01:17:35,917 - INFO - step 1156, loss: 4.969733, best loss: 3.822340 2025-01-16 01:17:36,067 - INFO - step 1157, loss: 4.537405, best loss: 3.822340 2025-01-16 01:17:36,216 - INFO - step 1158, loss: 4.839571, best loss: 3.822340 2025-01-16 01:17:36,366 - INFO - step 1159, loss: 4.876422, best 
loss: 3.822340 2025-01-16 01:17:36,516 - INFO - step 1160, loss: 4.807392, best loss: 3.822340 2025-01-16 01:17:36,666 - INFO - step 1161, loss: 4.595146, best loss: 3.822340 2025-01-16 01:17:36,816 - INFO - step 1162, loss: 4.581009, best loss: 3.822340 2025-01-16 01:17:36,966 - INFO - step 1163, loss: 4.532821, best loss: 3.822340 2025-01-16 01:17:37,116 - INFO - step 1164, loss: 4.685805, best loss: 3.822340 2025-01-16 01:17:37,266 - INFO - step 1165, loss: 5.049771, best loss: 3.822340 2025-01-16 01:17:37,416 - INFO - step 1166, loss: 5.095854, best loss: 3.822340 2025-01-16 01:17:37,565 - INFO - step 1167, loss: 4.920158, best loss: 3.822340 2025-01-16 01:17:37,715 - INFO - step 1168, loss: 5.027310, best loss: 3.822340 2025-01-16 01:17:37,865 - INFO - step 1169, loss: 4.859958, best loss: 3.822340 2025-01-16 01:17:38,015 - INFO - step 1170, loss: 4.810522, best loss: 3.822340 2025-01-16 01:17:38,165 - INFO - step 1171, loss: 4.356511, best loss: 3.822340 2025-01-16 01:17:38,315 - INFO - step 1172, loss: 4.841954, best loss: 3.822340 2025-01-16 01:17:38,465 - INFO - step 1173, loss: 5.161623, best loss: 3.822340 2025-01-16 01:17:38,615 - INFO - step 1174, loss: 4.759130, best loss: 3.822340 2025-01-16 01:17:38,765 - INFO - step 1175, loss: 4.873509, best loss: 3.822340 2025-01-16 01:17:38,915 - INFO - step 1176, loss: 4.820889, best loss: 3.822340 2025-01-16 01:17:39,064 - INFO - step 1177, loss: 4.352328, best loss: 3.822340 2025-01-16 01:17:39,214 - INFO - step 1178, loss: 4.089184, best loss: 3.822340 2025-01-16 01:17:39,364 - INFO - step 1179, loss: 4.680466, best loss: 3.822340 2025-01-16 01:17:39,514 - INFO - step 1180, loss: 4.962836, best loss: 3.822340 2025-01-16 01:17:39,664 - INFO - step 1181, loss: 5.010825, best loss: 3.822340 2025-01-16 01:17:39,814 - INFO - step 1182, loss: 4.800069, best loss: 3.822340 2025-01-16 01:17:39,964 - INFO - step 1183, loss: 4.513885, best loss: 3.822340 2025-01-16 01:17:40,114 - INFO - step 1184, loss: 4.724462, best 
loss: 3.822340 2025-01-16 01:17:40,264 - INFO - step 1185, loss: 4.793240, best loss: 3.822340 2025-01-16 01:17:40,414 - INFO - step 1186, loss: 4.550538, best loss: 3.822340 2025-01-16 01:17:40,564 - INFO - step 1187, loss: 4.709866, best loss: 3.822340 2025-01-16 01:17:40,713 - INFO - step 1188, loss: 4.648468, best loss: 3.822340 2025-01-16 01:17:40,863 - INFO - step 1189, loss: 4.479922, best loss: 3.822340 2025-01-16 01:17:41,013 - INFO - step 1190, loss: 4.935442, best loss: 3.822340 2025-01-16 01:17:41,162 - INFO - step 1191, loss: 4.389118, best loss: 3.822340 2025-01-16 01:17:41,312 - INFO - step 1192, loss: 4.670057, best loss: 3.822340 2025-01-16 01:17:41,462 - INFO - step 1193, loss: 4.987532, best loss: 3.822340 2025-01-16 01:17:41,612 - INFO - step 1194, loss: 4.645941, best loss: 3.822340 2025-01-16 01:17:41,762 - INFO - step 1195, loss: 4.167989, best loss: 3.822340 2025-01-16 01:17:41,913 - INFO - step 1196, loss: 4.748029, best loss: 3.822340 2025-01-16 01:17:42,063 - INFO - step 1197, loss: 4.970248, best loss: 3.822340 2025-01-16 01:17:42,213 - INFO - step 1198, loss: 5.270022, best loss: 3.822340 2025-01-16 01:17:42,363 - INFO - step 1199, loss: 5.053975, best loss: 3.822340 2025-01-16 01:17:42,512 - INFO - step 1200, loss: 5.157451, best loss: 3.822340 2025-01-16 01:17:42,662 - INFO - step 1201, loss: 5.044639, best loss: 3.822340 2025-01-16 01:17:42,812 - INFO - step 1202, loss: 5.282798, best loss: 3.822340 2025-01-16 01:17:42,962 - INFO - step 1203, loss: 4.734299, best loss: 3.822340 2025-01-16 01:17:43,112 - INFO - step 1204, loss: 4.829934, best loss: 3.822340 2025-01-16 01:17:43,262 - INFO - step 1205, loss: 4.928779, best loss: 3.822340 2025-01-16 01:17:43,412 - INFO - step 1206, loss: 4.886291, best loss: 3.822340 2025-01-16 01:17:43,562 - INFO - step 1207, loss: 4.938731, best loss: 3.822340 2025-01-16 01:17:43,712 - INFO - step 1208, loss: 4.702260, best loss: 3.822340 2025-01-16 01:17:43,862 - INFO - step 1209, loss: 4.650768, best 
loss: 3.822340 2025-01-16 01:17:44,012 - INFO - step 1210, loss: 4.836262, best loss: 3.822340 2025-01-16 01:17:44,162 - INFO - step 1211, loss: 5.006073, best loss: 3.822340 2025-01-16 01:17:44,312 - INFO - step 1212, loss: 4.716239, best loss: 3.822340 2025-01-16 01:17:44,462 - INFO - step 1213, loss: 5.137732, best loss: 3.822340 2025-01-16 01:17:44,611 - INFO - step 1214, loss: 5.139927, best loss: 3.822340 2025-01-16 01:17:44,761 - INFO - step 1215, loss: 5.336618, best loss: 3.822340 2025-01-16 01:17:44,911 - INFO - step 1216, loss: 5.338019, best loss: 3.822340 2025-01-16 01:17:45,061 - INFO - step 1217, loss: 5.393841, best loss: 3.822340 2025-01-16 01:17:45,211 - INFO - step 1218, loss: 4.833293, best loss: 3.822340 2025-01-16 01:17:45,361 - INFO - step 1219, loss: 5.129786, best loss: 3.822340 2025-01-16 01:17:45,511 - INFO - step 1220, loss: 5.101919, best loss: 3.822340 2025-01-16 01:17:45,660 - INFO - step 1221, loss: 5.319942, best loss: 3.822340 2025-01-16 01:17:45,810 - INFO - step 1222, loss: 4.814179, best loss: 3.822340 2025-01-16 01:17:45,960 - INFO - step 1223, loss: 4.862331, best loss: 3.822340 2025-01-16 01:17:46,110 - INFO - step 1224, loss: 4.833065, best loss: 3.822340 2025-01-16 01:17:46,260 - INFO - step 1225, loss: 4.743778, best loss: 3.822340 2025-01-16 01:17:46,409 - INFO - step 1226, loss: 5.048407, best loss: 3.822340 2025-01-16 01:17:46,560 - INFO - step 1227, loss: 4.560546, best loss: 3.822340 2025-01-16 01:17:46,709 - INFO - step 1228, loss: 4.595232, best loss: 3.822340 2025-01-16 01:17:46,859 - INFO - step 1229, loss: 5.034108, best loss: 3.822340 2025-01-16 01:17:47,009 - INFO - step 1230, loss: 4.787831, best loss: 3.822340 2025-01-16 01:17:47,158 - INFO - step 1231, loss: 4.959406, best loss: 3.822340 2025-01-16 01:17:47,308 - INFO - step 1232, loss: 4.732430, best loss: 3.822340 2025-01-16 01:17:47,458 - INFO - step 1233, loss: 5.418541, best loss: 3.822340 2025-01-16 01:17:47,608 - INFO - step 1234, loss: 5.108908, best 
loss: 3.822340 2025-01-16 01:17:47,758 - INFO - step 1235, loss: 4.631943, best loss: 3.822340 2025-01-16 01:17:47,908 - INFO - step 1236, loss: 4.585319, best loss: 3.822340 2025-01-16 01:17:48,059 - INFO - step 1237, loss: 5.105906, best loss: 3.822340 2025-01-16 01:17:48,209 - INFO - step 1238, loss: 4.834917, best loss: 3.822340 2025-01-16 01:17:48,358 - INFO - step 1239, loss: 4.883843, best loss: 3.822340 2025-01-16 01:17:48,508 - INFO - step 1240, loss: 5.068125, best loss: 3.822340 2025-01-16 01:17:48,658 - INFO - step 1241, loss: 5.038887, best loss: 3.822340 2025-01-16 01:17:48,807 - INFO - step 1242, loss: 5.002731, best loss: 3.822340 2025-01-16 01:17:48,957 - INFO - step 1243, loss: 4.668684, best loss: 3.822340 2025-01-16 01:17:49,107 - INFO - step 1244, loss: 4.493148, best loss: 3.822340 2025-01-16 01:17:49,257 - INFO - step 1245, loss: 4.613707, best loss: 3.822340 2025-01-16 01:17:49,407 - INFO - step 1246, loss: 4.409328, best loss: 3.822340 2025-01-16 01:17:49,557 - INFO - step 1247, loss: 4.352019, best loss: 3.822340 2025-01-16 01:17:49,707 - INFO - step 1248, loss: 4.869640, best loss: 3.822340 2025-01-16 01:17:49,857 - INFO - step 1249, loss: 4.677898, best loss: 3.822340 2025-01-16 01:17:50,006 - INFO - step 1250, loss: 4.533389, best loss: 3.822340 2025-01-16 01:17:50,156 - INFO - step 1251, loss: 4.688638, best loss: 3.822340 2025-01-16 01:17:50,306 - INFO - step 1252, loss: 4.704052, best loss: 3.822340 2025-01-16 01:17:50,456 - INFO - step 1253, loss: 4.661780, best loss: 3.822340 2025-01-16 01:17:50,606 - INFO - step 1254, loss: 4.791490, best loss: 3.822340 2025-01-16 01:17:50,756 - INFO - step 1255, loss: 5.055339, best loss: 3.822340 2025-01-16 01:17:50,906 - INFO - step 1256, loss: 4.476814, best loss: 3.822340 2025-01-16 01:17:51,055 - INFO - step 1257, loss: 4.458859, best loss: 3.822340 2025-01-16 01:17:51,205 - INFO - step 1258, loss: 4.771104, best loss: 3.822340 2025-01-16 01:17:51,355 - INFO - step 1259, loss: 4.916130, best 
loss: 3.822340 2025-01-16 01:17:51,505 - INFO - step 1260, loss: 4.486631, best loss: 3.822340 2025-01-16 01:17:51,655 - INFO - step 1261, loss: 4.334374, best loss: 3.822340 2025-01-16 01:17:51,805 - INFO - step 1262, loss: 4.571632, best loss: 3.822340 2025-01-16 01:17:51,955 - INFO - step 1263, loss: 4.772620, best loss: 3.822340 2025-01-16 01:17:52,105 - INFO - step 1264, loss: 4.252944, best loss: 3.822340 2025-01-16 01:17:52,254 - INFO - step 1265, loss: 4.667770, best loss: 3.822340 2025-01-16 01:17:52,404 - INFO - step 1266, loss: 4.529250, best loss: 3.822340 2025-01-16 01:17:52,554 - INFO - step 1267, loss: 4.325837, best loss: 3.822340 2025-01-16 01:17:52,704 - INFO - step 1268, loss: 4.288313, best loss: 3.822340 2025-01-16 01:17:52,854 - INFO - step 1269, loss: 4.558618, best loss: 3.822340 2025-01-16 01:17:53,003 - INFO - step 1270, loss: 4.114383, best loss: 3.822340 2025-01-16 01:17:53,153 - INFO - step 1271, loss: 4.411471, best loss: 3.822340 2025-01-16 01:17:53,303 - INFO - step 1272, loss: 4.075590, best loss: 3.822340 2025-01-16 01:17:53,453 - INFO - step 1273, loss: 4.735438, best loss: 3.822340 2025-01-16 01:17:53,603 - INFO - step 1274, loss: 5.044685, best loss: 3.822340 2025-01-16 01:17:53,752 - INFO - step 1275, loss: 5.543852, best loss: 3.822340 2025-01-16 01:17:53,902 - INFO - step 1276, loss: 4.795752, best loss: 3.822340 2025-01-16 01:17:54,052 - INFO - step 1277, loss: 5.069489, best loss: 3.822340 2025-01-16 01:17:54,201 - INFO - step 1278, loss: 4.929065, best loss: 3.822340 2025-01-16 01:17:54,351 - INFO - step 1279, loss: 4.740695, best loss: 3.822340 2025-01-16 01:17:54,501 - INFO - step 1280, loss: 4.623075, best loss: 3.822340 2025-01-16 01:17:54,651 - INFO - step 1281, loss: 4.993383, best loss: 3.822340 2025-01-16 01:17:54,801 - INFO - step 1282, loss: 4.627156, best loss: 3.822340 2025-01-16 01:17:54,950 - INFO - step 1283, loss: 4.293150, best loss: 3.822340 2025-01-16 01:17:55,100 - INFO - step 1284, loss: 4.583559, best 
loss: 3.822340 2025-01-16 01:17:55,250 - INFO - step 1285, loss: 4.508961, best loss: 3.822340 2025-01-16 01:17:55,400 - INFO - step 1286, loss: 4.731554, best loss: 3.822340 2025-01-16 01:17:55,550 - INFO - step 1287, loss: 3.902894, best loss: 3.822340 2025-01-16 01:17:55,700 - INFO - step 1288, loss: 4.476564, best loss: 3.822340 2025-01-16 01:17:55,850 - INFO - step 1289, loss: 4.723794, best loss: 3.822340 2025-01-16 01:17:55,999 - INFO - step 1290, loss: 4.834489, best loss: 3.822340 2025-01-16 01:17:56,149 - INFO - step 1291, loss: 4.530269, best loss: 3.822340 2025-01-16 01:17:56,299 - INFO - step 1292, loss: 4.634365, best loss: 3.822340 2025-01-16 01:17:56,449 - INFO - step 1293, loss: 4.607801, best loss: 3.822340 2025-01-16 01:17:56,599 - INFO - step 1294, loss: 4.237238, best loss: 3.822340 2025-01-16 01:17:56,749 - INFO - step 1295, loss: 4.833084, best loss: 3.822340 2025-01-16 01:17:56,899 - INFO - step 1296, loss: 4.587777, best loss: 3.822340 2025-01-16 01:17:57,049 - INFO - step 1297, loss: 4.561473, best loss: 3.822340 2025-01-16 01:17:57,199 - INFO - step 1298, loss: 4.197370, best loss: 3.822340 2025-01-16 01:17:57,349 - INFO - step 1299, loss: 4.252676, best loss: 3.822340 2025-01-16 01:17:57,499 - INFO - step 1300, loss: 4.392962, best loss: 3.822340 2025-01-16 01:17:57,649 - INFO - step 1301, loss: 4.269036, best loss: 3.822340 2025-01-16 01:17:57,799 - INFO - step 1302, loss: 4.253004, best loss: 3.822340 2025-01-16 01:17:57,948 - INFO - step 1303, loss: 4.260985, best loss: 3.822340 2025-01-16 01:17:58,098 - INFO - step 1304, loss: 4.228646, best loss: 3.822340 2025-01-16 01:17:58,248 - INFO - step 1305, loss: 4.085654, best loss: 3.822340 2025-01-16 01:17:58,398 - INFO - step 1306, loss: 3.882917, best loss: 3.822340 2025-01-16 01:18:01,869 - INFO - step 1307, loss: 3.643105, best loss: 3.643105 2025-01-16 01:18:02,030 - INFO - step 1308, loss: 4.530385, best loss: 3.643105 2025-01-16 01:18:02,181 - INFO - step 1309, loss: 4.996289, best 
loss: 3.643105 2025-01-16 01:18:02,331 - INFO - step 1310, loss: 5.115501, best loss: 3.643105 2025-01-16 01:18:02,481 - INFO - step 1311, loss: 5.317307, best loss: 3.643105 2025-01-16 01:18:02,631 - INFO - step 1312, loss: 5.314745, best loss: 3.643105 2025-01-16 01:18:02,782 - INFO - step 1313, loss: 5.015072, best loss: 3.643105 2025-01-16 01:18:02,931 - INFO - step 1314, loss: 5.072314, best loss: 3.643105 2025-01-16 01:18:03,082 - INFO - step 1315, loss: 5.088732, best loss: 3.643105 2025-01-16 01:18:03,232 - INFO - step 1316, loss: 4.590903, best loss: 3.643105 2025-01-16 01:18:03,382 - INFO - step 1317, loss: 4.429594, best loss: 3.643105 2025-01-16 01:18:03,532 - INFO - step 1318, loss: 4.665611, best loss: 3.643105 2025-01-16 01:18:03,681 - INFO - step 1319, loss: 4.524466, best loss: 3.643105 2025-01-16 01:18:03,831 - INFO - step 1320, loss: 5.114454, best loss: 3.643105 2025-01-16 01:18:03,981 - INFO - step 1321, loss: 4.942633, best loss: 3.643105 2025-01-16 01:18:04,131 - INFO - step 1322, loss: 5.183571, best loss: 3.643105 2025-01-16 01:18:04,281 - INFO - step 1323, loss: 4.933654, best loss: 3.643105 2025-01-16 01:18:04,431 - INFO - step 1324, loss: 5.156769, best loss: 3.643105 2025-01-16 01:18:04,581 - INFO - step 1325, loss: 4.728490, best loss: 3.643105 2025-01-16 01:18:04,731 - INFO - step 1326, loss: 4.903671, best loss: 3.643105 2025-01-16 01:18:04,881 - INFO - step 1327, loss: 4.648975, best loss: 3.643105 2025-01-16 01:18:05,031 - INFO - step 1328, loss: 4.910897, best loss: 3.643105 2025-01-16 01:18:05,181 - INFO - step 1329, loss: 4.664119, best loss: 3.643105 2025-01-16 01:18:05,331 - INFO - step 1330, loss: 4.550632, best loss: 3.643105 2025-01-16 01:18:05,480 - INFO - step 1331, loss: 5.291758, best loss: 3.643105 2025-01-16 01:18:05,630 - INFO - step 1332, loss: 4.489563, best loss: 3.643105 2025-01-16 01:18:05,780 - INFO - step 1333, loss: 4.918080, best loss: 3.643105 2025-01-16 01:18:05,930 - INFO - step 1334, loss: 4.797229, best 
loss: 3.643105 2025-01-16 01:18:06,080 - INFO - step 1335, loss: 4.836025, best loss: 3.643105 2025-01-16 01:18:06,230 - INFO - step 1336, loss: 4.776563, best loss: 3.643105 2025-01-16 01:18:06,380 - INFO - step 1337, loss: 4.447199, best loss: 3.643105 2025-01-16 01:18:06,530 - INFO - step 1338, loss: 4.420025, best loss: 3.643105 2025-01-16 01:18:06,680 - INFO - step 1339, loss: 4.597166, best loss: 3.643105 2025-01-16 01:18:06,830 - INFO - step 1340, loss: 4.036246, best loss: 3.643105 2025-01-16 01:18:06,979 - INFO - step 1341, loss: 4.855193, best loss: 3.643105 2025-01-16 01:18:07,129 - INFO - step 1342, loss: 4.045655, best loss: 3.643105 2025-01-16 01:18:07,279 - INFO - step 1343, loss: 3.959997, best loss: 3.643105 2025-01-16 01:18:07,430 - INFO - step 1344, loss: 4.476233, best loss: 3.643105 2025-01-16 01:18:07,580 - INFO - step 1345, loss: 4.477133, best loss: 3.643105 2025-01-16 01:18:07,730 - INFO - step 1346, loss: 4.450119, best loss: 3.643105 2025-01-16 01:18:07,880 - INFO - step 1347, loss: 4.143569, best loss: 3.643105 2025-01-16 01:18:08,030 - INFO - step 1348, loss: 4.413898, best loss: 3.643105 2025-01-16 01:18:08,180 - INFO - step 1349, loss: 4.455333, best loss: 3.643105 2025-01-16 01:18:08,330 - INFO - step 1350, loss: 4.464359, best loss: 3.643105 2025-01-16 01:18:08,480 - INFO - step 1351, loss: 4.300102, best loss: 3.643105 2025-01-16 01:18:08,630 - INFO - step 1352, loss: 4.716461, best loss: 3.643105 2025-01-16 01:18:08,780 - INFO - step 1353, loss: 4.722072, best loss: 3.643105 2025-01-16 01:18:08,930 - INFO - step 1354, loss: 4.690756, best loss: 3.643105 2025-01-16 01:18:09,080 - INFO - step 1355, loss: 4.179422, best loss: 3.643105 2025-01-16 01:18:09,229 - INFO - step 1356, loss: 4.400605, best loss: 3.643105 2025-01-16 01:18:09,380 - INFO - step 1357, loss: 4.750452, best loss: 3.643105 2025-01-16 01:18:09,529 - INFO - step 1358, loss: 4.229976, best loss: 3.643105 2025-01-16 01:18:09,679 - INFO - step 1359, loss: 4.903996, best 
loss: 3.643105 2025-01-16 01:18:09,830 - INFO - step 1360, loss: 4.871208, best loss: 3.643105 2025-01-16 01:18:09,979 - INFO - step 1361, loss: 4.786589, best loss: 3.643105 2025-01-16 01:18:10,129 - INFO - step 1362, loss: 4.708612, best loss: 3.643105 2025-01-16 01:18:10,279 - INFO - step 1363, loss: 4.766451, best loss: 3.643105 2025-01-16 01:18:10,429 - INFO - step 1364, loss: 4.670454, best loss: 3.643105 2025-01-16 01:18:10,579 - INFO - step 1365, loss: 4.557959, best loss: 3.643105 2025-01-16 01:18:10,729 - INFO - step 1366, loss: 5.258934, best loss: 3.643105 2025-01-16 01:18:10,879 - INFO - step 1367, loss: 4.778114, best loss: 3.643105 2025-01-16 01:18:11,029 - INFO - step 1368, loss: 5.103074, best loss: 3.643105 2025-01-16 01:18:11,179 - INFO - step 1369, loss: 4.205481, best loss: 3.643105 2025-01-16 01:18:11,329 - INFO - step 1370, loss: 4.204242, best loss: 3.643105 2025-01-16 01:18:11,479 - INFO - step 1371, loss: 4.685203, best loss: 3.643105 2025-01-16 01:18:11,630 - INFO - step 1372, loss: 4.759745, best loss: 3.643105 2025-01-16 01:18:11,780 - INFO - step 1373, loss: 4.446958, best loss: 3.643105 2025-01-16 01:18:11,930 - INFO - step 1374, loss: 4.585100, best loss: 3.643105 2025-01-16 01:18:12,080 - INFO - step 1375, loss: 4.459153, best loss: 3.643105 2025-01-16 01:18:12,230 - INFO - step 1376, loss: 4.509706, best loss: 3.643105 2025-01-16 01:18:12,380 - INFO - step 1377, loss: 4.828421, best loss: 3.643105 2025-01-16 01:18:12,530 - INFO - step 1378, loss: 4.351666, best loss: 3.643105 2025-01-16 01:18:12,680 - INFO - step 1379, loss: 4.316513, best loss: 3.643105 2025-01-16 01:18:12,829 - INFO - step 1380, loss: 4.470515, best loss: 3.643105 2025-01-16 01:18:12,979 - INFO - step 1381, loss: 4.585949, best loss: 3.643105 2025-01-16 01:18:13,129 - INFO - step 1382, loss: 4.552459, best loss: 3.643105 2025-01-16 01:18:13,279 - INFO - step 1383, loss: 4.697975, best loss: 3.643105 2025-01-16 01:18:13,429 - INFO - step 1384, loss: 4.471239, best 
loss: 3.643105 2025-01-16 01:18:13,579 - INFO - step 1385, loss: 4.236254, best loss: 3.643105 2025-01-16 01:18:13,729 - INFO - step 1386, loss: 4.587869, best loss: 3.643105 2025-01-16 01:18:13,878 - INFO - step 1387, loss: 3.882976, best loss: 3.643105 2025-01-16 01:18:14,028 - INFO - step 1388, loss: 4.249365, best loss: 3.643105 2025-01-16 01:18:14,178 - INFO - step 1389, loss: 4.455472, best loss: 3.643105 2025-01-16 01:18:14,328 - INFO - step 1390, loss: 4.338915, best loss: 3.643105 2025-01-16 01:18:14,478 - INFO - step 1391, loss: 4.229668, best loss: 3.643105 2025-01-16 01:18:14,628 - INFO - step 1392, loss: 4.540906, best loss: 3.643105 2025-01-16 01:18:14,778 - INFO - step 1393, loss: 4.786582, best loss: 3.643105 2025-01-16 01:18:14,928 - INFO - step 1394, loss: 4.534609, best loss: 3.643105 2025-01-16 01:18:15,078 - INFO - step 1395, loss: 4.840997, best loss: 3.643105 2025-01-16 01:18:15,228 - INFO - step 1396, loss: 4.413767, best loss: 3.643105 2025-01-16 01:18:15,378 - INFO - step 1397, loss: 4.544184, best loss: 3.643105 2025-01-16 01:18:15,528 - INFO - step 1398, loss: 4.412750, best loss: 3.643105 2025-01-16 01:18:15,678 - INFO - step 1399, loss: 4.133868, best loss: 3.643105 2025-01-16 01:18:15,828 - INFO - step 1400, loss: 4.991579, best loss: 3.643105 2025-01-16 01:18:15,978 - INFO - step 1401, loss: 4.864550, best loss: 3.643105 2025-01-16 01:18:16,128 - INFO - step 1402, loss: 4.506287, best loss: 3.643105 2025-01-16 01:18:16,278 - INFO - step 1403, loss: 4.390550, best loss: 3.643105 2025-01-16 01:18:16,427 - INFO - step 1404, loss: 4.038967, best loss: 3.643105 2025-01-16 01:18:16,577 - INFO - step 1405, loss: 4.267878, best loss: 3.643105 2025-01-16 01:18:16,727 - INFO - step 1406, loss: 4.227545, best loss: 3.643105 2025-01-16 01:18:16,877 - INFO - step 1407, loss: 4.164159, best loss: 3.643105 2025-01-16 01:18:17,026 - INFO - step 1408, loss: 4.921145, best loss: 3.643105 2025-01-16 01:18:17,176 - INFO - step 1409, loss: 4.832112, best 
loss: 3.643105 2025-01-16 01:18:17,326 - INFO - step 1410, loss: 4.624103, best loss: 3.643105 2025-01-16 01:18:17,475 - INFO - step 1411, loss: 4.911406, best loss: 3.643105 2025-01-16 01:18:17,625 - INFO - step 1412, loss: 4.555109, best loss: 3.643105 2025-01-16 01:18:17,775 - INFO - step 1413, loss: 4.876947, best loss: 3.643105 2025-01-16 01:18:17,925 - INFO - step 1414, loss: 5.058352, best loss: 3.643105 2025-01-16 01:18:18,075 - INFO - step 1415, loss: 4.992896, best loss: 3.643105 2025-01-16 01:18:18,224 - INFO - step 1416, loss: 5.091951, best loss: 3.643105 2025-01-16 01:18:18,374 - INFO - step 1417, loss: 4.762087, best loss: 3.643105 2025-01-16 01:18:18,524 - INFO - step 1418, loss: 5.049131, best loss: 3.643105 2025-01-16 01:18:18,674 - INFO - step 1419, loss: 4.800747, best loss: 3.643105 2025-01-16 01:18:18,824 - INFO - step 1420, loss: 4.631884, best loss: 3.643105 2025-01-16 01:18:18,973 - INFO - step 1421, loss: 5.045737, best loss: 3.643105 2025-01-16 01:18:19,124 - INFO - step 1422, loss: 5.056322, best loss: 3.643105 2025-01-16 01:18:19,273 - INFO - step 1423, loss: 4.824269, best loss: 3.643105 2025-01-16 01:18:19,423 - INFO - step 1424, loss: 4.764308, best loss: 3.643105 2025-01-16 01:18:19,574 - INFO - step 1425, loss: 5.042625, best loss: 3.643105 2025-01-16 01:18:19,724 - INFO - step 1426, loss: 4.723125, best loss: 3.643105 2025-01-16 01:18:19,874 - INFO - step 1427, loss: 4.601805, best loss: 3.643105 2025-01-16 01:18:20,024 - INFO - step 1428, loss: 4.478897, best loss: 3.643105 2025-01-16 01:18:20,174 - INFO - step 1429, loss: 4.851868, best loss: 3.643105 2025-01-16 01:18:20,324 - INFO - step 1430, loss: 4.884688, best loss: 3.643105 2025-01-16 01:18:20,473 - INFO - step 1431, loss: 5.076900, best loss: 3.643105 2025-01-16 01:18:20,623 - INFO - step 1432, loss: 4.960288, best loss: 3.643105 2025-01-16 01:18:20,773 - INFO - step 1433, loss: 4.564798, best loss: 3.643105 2025-01-16 01:18:20,923 - INFO - step 1434, loss: 4.975750, best 
loss: 3.643105 2025-01-16 01:18:21,072 - INFO - step 1435, loss: 4.562202, best loss: 3.643105 2025-01-16 01:18:21,223 - INFO - step 1436, loss: 5.031573, best loss: 3.643105 2025-01-16 01:18:21,373 - INFO - step 1437, loss: 4.538243, best loss: 3.643105 2025-01-16 01:18:21,523 - INFO - step 1438, loss: 4.675879, best loss: 3.643105 2025-01-16 01:18:21,672 - INFO - step 1439, loss: 4.620345, best loss: 3.643105 2025-01-16 01:18:21,822 - INFO - step 1440, loss: 4.671127, best loss: 3.643105 2025-01-16 01:18:21,972 - INFO - step 1441, loss: 4.442658, best loss: 3.643105 2025-01-16 01:18:22,122 - INFO - step 1442, loss: 4.719795, best loss: 3.643105 2025-01-16 01:18:22,272 - INFO - step 1443, loss: 4.306382, best loss: 3.643105 2025-01-16 01:18:22,422 - INFO - step 1444, loss: 3.957252, best loss: 3.643105 2025-01-16 01:18:22,572 - INFO - step 1445, loss: 4.133301, best loss: 3.643105 2025-01-16 01:18:22,722 - INFO - step 1446, loss: 4.381915, best loss: 3.643105 2025-01-16 01:18:22,872 - INFO - step 1447, loss: 5.120074, best loss: 3.643105 2025-01-16 01:18:23,022 - INFO - step 1448, loss: 4.670312, best loss: 3.643105 2025-01-16 01:18:23,172 - INFO - step 1449, loss: 4.920527, best loss: 3.643105 2025-01-16 01:18:23,321 - INFO - step 1450, loss: 5.076456, best loss: 3.643105 2025-01-16 01:18:23,471 - INFO - step 1451, loss: 4.550630, best loss: 3.643105 2025-01-16 01:18:23,621 - INFO - step 1452, loss: 5.156519, best loss: 3.643105 2025-01-16 01:18:23,771 - INFO - step 1453, loss: 4.644118, best loss: 3.643105 2025-01-16 01:18:23,921 - INFO - step 1454, loss: 4.813843, best loss: 3.643105 2025-01-16 01:18:24,071 - INFO - step 1455, loss: 4.651981, best loss: 3.643105 2025-01-16 01:18:24,221 - INFO - step 1456, loss: 5.327451, best loss: 3.643105 2025-01-16 01:18:24,371 - INFO - step 1457, loss: 4.612746, best loss: 3.643105 2025-01-16 01:18:24,521 - INFO - step 1458, loss: 4.380403, best loss: 3.643105 2025-01-16 01:18:24,671 - INFO - step 1459, loss: 4.828497, best 
loss: 3.643105 2025-01-16 01:18:24,821 - INFO - step 1460, loss: 4.491069, best loss: 3.643105 2025-01-16 01:18:24,972 - INFO - step 1461, loss: 4.289473, best loss: 3.643105 2025-01-16 01:18:25,121 - INFO - step 1462, loss: 5.162121, best loss: 3.643105 2025-01-16 01:18:25,272 - INFO - step 1463, loss: 4.711225, best loss: 3.643105 2025-01-16 01:18:25,422 - INFO - step 1464, loss: 4.192340, best loss: 3.643105 2025-01-16 01:18:25,572 - INFO - step 1465, loss: 4.457788, best loss: 3.643105 2025-01-16 01:18:25,721 - INFO - step 1466, loss: 4.704496, best loss: 3.643105 2025-01-16 01:18:25,872 - INFO - step 1467, loss: 4.593518, best loss: 3.643105 2025-01-16 01:18:26,022 - INFO - step 1468, loss: 4.620266, best loss: 3.643105 2025-01-16 01:18:26,172 - INFO - step 1469, loss: 4.410289, best loss: 3.643105 2025-01-16 01:18:26,322 - INFO - step 1470, loss: 5.124258, best loss: 3.643105 2025-01-16 01:18:26,471 - INFO - step 1471, loss: 4.792521, best loss: 3.643105 2025-01-16 01:18:26,622 - INFO - step 1472, loss: 4.724285, best loss: 3.643105 2025-01-16 01:18:26,771 - INFO - step 1473, loss: 4.416248, best loss: 3.643105 2025-01-16 01:18:26,921 - INFO - step 1474, loss: 4.723979, best loss: 3.643105 2025-01-16 01:18:27,071 - INFO - step 1475, loss: 4.562686, best loss: 3.643105 2025-01-16 01:18:27,221 - INFO - step 1476, loss: 4.190563, best loss: 3.643105 2025-01-16 01:18:27,371 - INFO - step 1477, loss: 4.635269, best loss: 3.643105 2025-01-16 01:18:27,521 - INFO - step 1478, loss: 4.223786, best loss: 3.643105 2025-01-16 01:18:27,670 - INFO - step 1479, loss: 4.673402, best loss: 3.643105 2025-01-16 01:18:27,820 - INFO - step 1480, loss: 4.455328, best loss: 3.643105 2025-01-16 01:18:27,970 - INFO - step 1481, loss: 4.575551, best loss: 3.643105 2025-01-16 01:18:28,120 - INFO - step 1482, loss: 4.373409, best loss: 3.643105 2025-01-16 01:18:28,270 - INFO - step 1483, loss: 4.822809, best loss: 3.643105 2025-01-16 01:18:28,420 - INFO - step 1484, loss: 5.043060, best 
loss: 3.643105 2025-01-16 01:18:28,570 - INFO - step 1485, loss: 4.813494, best loss: 3.643105 2025-01-16 01:18:28,720 - INFO - step 1486, loss: 4.901963, best loss: 3.643105 2025-01-16 01:18:28,870 - INFO - step 1487, loss: 4.464810, best loss: 3.643105 2025-01-16 01:18:29,020 - INFO - step 1488, loss: 4.703460, best loss: 3.643105 2025-01-16 01:18:29,170 - INFO - step 1489, loss: 4.732255, best loss: 3.643105 2025-01-16 01:18:29,319 - INFO - step 1490, loss: 4.604649, best loss: 3.643105 2025-01-16 01:18:29,469 - INFO - step 1491, loss: 4.420231, best loss: 3.643105 2025-01-16 01:18:29,620 - INFO - step 1492, loss: 4.506390, best loss: 3.643105 2025-01-16 01:18:29,770 - INFO - step 1493, loss: 4.476338, best loss: 3.643105 2025-01-16 01:18:29,920 - INFO - step 1494, loss: 4.602036, best loss: 3.643105 2025-01-16 01:18:30,069 - INFO - step 1495, loss: 4.919237, best loss: 3.643105 2025-01-16 01:18:30,219 - INFO - step 1496, loss: 5.038985, best loss: 3.643105 2025-01-16 01:18:30,369 - INFO - step 1497, loss: 4.753460, best loss: 3.643105 2025-01-16 01:18:30,519 - INFO - step 1498, loss: 4.864933, best loss: 3.643105 2025-01-16 01:18:30,669 - INFO - step 1499, loss: 4.755122, best loss: 3.643105 2025-01-16 01:18:30,819 - INFO - step 1500, loss: 4.686550, best loss: 3.643105 2025-01-16 01:18:30,969 - INFO - step 1501, loss: 4.159677, best loss: 3.643105 2025-01-16 01:18:31,119 - INFO - step 1502, loss: 4.728007, best loss: 3.643105 2025-01-16 01:18:31,269 - INFO - step 1503, loss: 5.039515, best loss: 3.643105 2025-01-16 01:18:31,419 - INFO - step 1504, loss: 4.691142, best loss: 3.643105 2025-01-16 01:18:31,569 - INFO - step 1505, loss: 4.784678, best loss: 3.643105 2025-01-16 01:18:31,719 - INFO - step 1506, loss: 4.720138, best loss: 3.643105 2025-01-16 01:18:31,869 - INFO - step 1507, loss: 4.256588, best loss: 3.643105 2025-01-16 01:18:32,019 - INFO - step 1508, loss: 3.916909, best loss: 3.643105 2025-01-16 01:18:32,169 - INFO - step 1509, loss: 4.613449, best 
loss: 3.643105 2025-01-16 01:18:32,319 - INFO - step 1510, loss: 4.865936, best loss: 3.643105 2025-01-16 01:18:32,468 - INFO - step 1511, loss: 4.892869, best loss: 3.643105 2025-01-16 01:18:32,618 - INFO - step 1512, loss: 4.725914, best loss: 3.643105 2025-01-16 01:18:32,768 - INFO - step 1513, loss: 4.360494, best loss: 3.643105 2025-01-16 01:18:32,918 - INFO - step 1514, loss: 4.610586, best loss: 3.643105 2025-01-16 01:18:33,068 - INFO - step 1515, loss: 4.645887, best loss: 3.643105 2025-01-16 01:18:33,218 - INFO - step 1516, loss: 4.371863, best loss: 3.643105 2025-01-16 01:18:33,368 - INFO - step 1517, loss: 4.575857, best loss: 3.643105 2025-01-16 01:18:33,518 - INFO - step 1518, loss: 4.476451, best loss: 3.643105 2025-01-16 01:18:33,668 - INFO - step 1519, loss: 4.322357, best loss: 3.643105 2025-01-16 01:18:33,818 - INFO - step 1520, loss: 4.763562, best loss: 3.643105 2025-01-16 01:18:33,968 - INFO - step 1521, loss: 4.193756, best loss: 3.643105 2025-01-16 01:18:34,118 - INFO - step 1522, loss: 4.536095, best loss: 3.643105 2025-01-16 01:18:34,268 - INFO - step 1523, loss: 4.864474, best loss: 3.643105 2025-01-16 01:18:34,419 - INFO - step 1524, loss: 4.477575, best loss: 3.643105 2025-01-16 01:18:34,569 - INFO - step 1525, loss: 3.974166, best loss: 3.643105 2025-01-16 01:18:34,719 - INFO - step 1526, loss: 4.599289, best loss: 3.643105 2025-01-16 01:18:34,869 - INFO - step 1527, loss: 4.865993, best loss: 3.643105 2025-01-16 01:18:35,018 - INFO - step 1528, loss: 5.119968, best loss: 3.643105 2025-01-16 01:18:35,168 - INFO - step 1529, loss: 4.920225, best loss: 3.643105 2025-01-16 01:18:35,318 - INFO - step 1530, loss: 4.998863, best loss: 3.643105 2025-01-16 01:18:35,468 - INFO - step 1531, loss: 4.861130, best loss: 3.643105 2025-01-16 01:18:35,618 - INFO - step 1532, loss: 5.114634, best loss: 3.643105 2025-01-16 01:18:35,768 - INFO - step 1533, loss: 4.569397, best loss: 3.643105 2025-01-16 01:18:35,918 - INFO - step 1534, loss: 4.666114, best 
loss: 3.643105 2025-01-16 01:18:36,068 - INFO - step 1535, loss: 4.812839, best loss: 3.643105 2025-01-16 01:18:36,218 - INFO - step 1536, loss: 4.760207, best loss: 3.643105 2025-01-16 01:18:36,368 - INFO - step 1537, loss: 4.731116, best loss: 3.643105 2025-01-16 01:18:36,518 - INFO - step 1538, loss: 4.487600, best loss: 3.643105 2025-01-16 01:18:36,668 - INFO - step 1539, loss: 4.505418, best loss: 3.643105 2025-01-16 01:18:36,817 - INFO - step 1540, loss: 4.721874, best loss: 3.643105 2025-01-16 01:18:36,967 - INFO - step 1541, loss: 4.892230, best loss: 3.643105 2025-01-16 01:18:37,117 - INFO - step 1542, loss: 4.582920, best loss: 3.643105 2025-01-16 01:18:37,267 - INFO - step 1543, loss: 5.002473, best loss: 3.643105 2025-01-16 01:18:37,417 - INFO - step 1544, loss: 5.037145, best loss: 3.643105 2025-01-16 01:18:37,567 - INFO - step 1545, loss: 5.222927, best loss: 3.643105 2025-01-16 01:18:37,716 - INFO - step 1546, loss: 5.226508, best loss: 3.643105 2025-01-16 01:18:37,866 - INFO - step 1547, loss: 5.273725, best loss: 3.643105 2025-01-16 01:18:38,016 - INFO - step 1548, loss: 4.711895, best loss: 3.643105 2025-01-16 01:18:38,166 - INFO - step 1549, loss: 5.029178, best loss: 3.643105 2025-01-16 01:18:38,316 - INFO - step 1550, loss: 4.981399, best loss: 3.643105 2025-01-16 01:18:38,466 - INFO - step 1551, loss: 5.164338, best loss: 3.643105 2025-01-16 01:18:38,616 - INFO - step 1552, loss: 4.635078, best loss: 3.643105 2025-01-16 01:18:38,766 - INFO - step 1553, loss: 4.757158, best loss: 3.643105 2025-01-16 01:18:38,915 - INFO - step 1554, loss: 4.731201, best loss: 3.643105 2025-01-16 01:18:39,065 - INFO - step 1555, loss: 4.630553, best loss: 3.643105 2025-01-16 01:18:39,215 - INFO - step 1556, loss: 4.931474, best loss: 3.643105 2025-01-16 01:18:39,365 - INFO - step 1557, loss: 4.406610, best loss: 3.643105 2025-01-16 01:18:39,515 - INFO - step 1558, loss: 4.429394, best loss: 3.643105 2025-01-16 01:18:39,665 - INFO - step 1559, loss: 4.918204, best 
loss: 3.643105 2025-01-16 01:18:39,814 - INFO - step 1560, loss: 4.670196, best loss: 3.643105 2025-01-16 01:18:39,964 - INFO - step 1561, loss: 4.827296, best loss: 3.643105 2025-01-16 01:18:40,114 - INFO - step 1562, loss: 4.571490, best loss: 3.643105 2025-01-16 01:18:40,265 - INFO - step 1563, loss: 5.248524, best loss: 3.643105 2025-01-16 01:18:40,415 - INFO - step 1564, loss: 4.977457, best loss: 3.643105 2025-01-16 01:18:40,565 - INFO - step 1565, loss: 4.564385, best loss: 3.643105 2025-01-16 01:18:40,714 - INFO - step 1566, loss: 4.495085, best loss: 3.643105 2025-01-16 01:18:40,864 - INFO - step 1567, loss: 4.974737, best loss: 3.643105 2025-01-16 01:18:41,014 - INFO - step 1568, loss: 4.639308, best loss: 3.643105 2025-01-16 01:18:41,164 - INFO - step 1569, loss: 4.693868, best loss: 3.643105 2025-01-16 01:18:41,314 - INFO - step 1570, loss: 4.907969, best loss: 3.643105 2025-01-16 01:18:41,464 - INFO - step 1571, loss: 4.844059, best loss: 3.643105 2025-01-16 01:18:41,614 - INFO - step 1572, loss: 4.782668, best loss: 3.643105 2025-01-16 01:18:41,764 - INFO - step 1573, loss: 4.466346, best loss: 3.643105 2025-01-16 01:18:41,914 - INFO - step 1574, loss: 4.295766, best loss: 3.643105 2025-01-16 01:18:42,064 - INFO - step 1575, loss: 4.369328, best loss: 3.643105 2025-01-16 01:18:42,214 - INFO - step 1576, loss: 4.251836, best loss: 3.643105 2025-01-16 01:18:42,364 - INFO - step 1577, loss: 4.264352, best loss: 3.643105 2025-01-16 01:18:42,515 - INFO - step 1578, loss: 4.756798, best loss: 3.643105 2025-01-16 01:18:42,665 - INFO - step 1579, loss: 4.516586, best loss: 3.643105 2025-01-16 01:18:42,815 - INFO - step 1580, loss: 4.363246, best loss: 3.643105 2025-01-16 01:18:42,965 - INFO - step 1581, loss: 4.560830, best loss: 3.643105 2025-01-16 01:18:43,115 - INFO - step 1582, loss: 4.564066, best loss: 3.643105 2025-01-16 01:18:43,265 - INFO - step 1583, loss: 4.586270, best loss: 3.643105 2025-01-16 01:18:43,415 - INFO - step 1584, loss: 4.663486, best 
loss: 3.643105 2025-01-16 01:18:43,565 - INFO - step 1585, loss: 4.913854, best loss: 3.643105 2025-01-16 01:18:43,715 - INFO - step 1586, loss: 4.362004, best loss: 3.643105 2025-01-16 01:18:43,865 - INFO - step 1587, loss: 4.337944, best loss: 3.643105 2025-01-16 01:18:44,015 - INFO - step 1588, loss: 4.623985, best loss: 3.643105 2025-01-16 01:18:44,164 - INFO - step 1589, loss: 4.764257, best loss: 3.643105 2025-01-16 01:18:44,315 - INFO - step 1590, loss: 4.284033, best loss: 3.643105 2025-01-16 01:18:44,464 - INFO - step 1591, loss: 4.185061, best loss: 3.643105 2025-01-16 01:18:44,614 - INFO - step 1592, loss: 4.454151, best loss: 3.643105 2025-01-16 01:18:44,764 - INFO - step 1593, loss: 4.620372, best loss: 3.643105 2025-01-16 01:18:44,914 - INFO - step 1594, loss: 4.141444, best loss: 3.643105 2025-01-16 01:18:45,064 - INFO - step 1595, loss: 4.543132, best loss: 3.643105 2025-01-16 01:18:45,213 - INFO - step 1596, loss: 4.366852, best loss: 3.643105 2025-01-16 01:18:45,363 - INFO - step 1597, loss: 4.183995, best loss: 3.643105 2025-01-16 01:18:45,513 - INFO - step 1598, loss: 4.140357, best loss: 3.643105 2025-01-16 01:18:45,663 - INFO - step 1599, loss: 4.413545, best loss: 3.643105 2025-01-16 01:18:45,813 - INFO - step 1600, loss: 3.984053, best loss: 3.643105 2025-01-16 01:18:45,963 - INFO - step 1601, loss: 4.289340, best loss: 3.643105 2025-01-16 01:18:46,113 - INFO - step 1602, loss: 3.945838, best loss: 3.643105 2025-01-16 01:18:46,263 - INFO - step 1603, loss: 4.592099, best loss: 3.643105 2025-01-16 01:18:46,413 - INFO - step 1604, loss: 4.916007, best loss: 3.643105 2025-01-16 01:18:46,563 - INFO - step 1605, loss: 5.353401, best loss: 3.643105 2025-01-16 01:18:46,713 - INFO - step 1606, loss: 4.660941, best loss: 3.643105 2025-01-16 01:18:46,862 - INFO - step 1607, loss: 4.915994, best loss: 3.643105 2025-01-16 01:18:47,012 - INFO - step 1608, loss: 4.794295, best loss: 3.643105 2025-01-16 01:18:47,162 - INFO - step 1609, loss: 4.585675, best 
loss: 3.643105 2025-01-16 01:18:47,312 - INFO - step 1610, loss: 4.446462, best loss: 3.643105 2025-01-16 01:18:47,462 - INFO - step 1611, loss: 4.830736, best loss: 3.643105 2025-01-16 01:18:47,612 - INFO - step 1612, loss: 4.407476, best loss: 3.643105 2025-01-16 01:18:47,762 - INFO - step 1613, loss: 4.140017, best loss: 3.643105 2025-01-16 01:18:47,912 - INFO - step 1614, loss: 4.411823, best loss: 3.643105 2025-01-16 01:18:48,062 - INFO - step 1615, loss: 4.333847, best loss: 3.643105 2025-01-16 01:18:48,212 - INFO - step 1616, loss: 4.553721, best loss: 3.643105 2025-01-16 01:18:48,362 - INFO - step 1617, loss: 3.667831, best loss: 3.643105 2025-01-16 01:18:48,512 - INFO - step 1618, loss: 4.332246, best loss: 3.643105 2025-01-16 01:18:48,662 - INFO - step 1619, loss: 4.578033, best loss: 3.643105 2025-01-16 01:18:48,812 - INFO - step 1620, loss: 4.655682, best loss: 3.643105 2025-01-16 01:18:48,962 - INFO - step 1621, loss: 4.380732, best loss: 3.643105 2025-01-16 01:18:49,112 - INFO - step 1622, loss: 4.488712, best loss: 3.643105 2025-01-16 01:18:49,262 - INFO - step 1623, loss: 4.449607, best loss: 3.643105 2025-01-16 01:18:49,413 - INFO - step 1624, loss: 4.089229, best loss: 3.643105 2025-01-16 01:18:49,563 - INFO - step 1625, loss: 4.623144, best loss: 3.643105 2025-01-16 01:18:49,713 - INFO - step 1626, loss: 4.478683, best loss: 3.643105 2025-01-16 01:18:49,863 - INFO - step 1627, loss: 4.440497, best loss: 3.643105 2025-01-16 01:18:50,013 - INFO - step 1628, loss: 4.053710, best loss: 3.643105 2025-01-16 01:18:50,163 - INFO - step 1629, loss: 4.104360, best loss: 3.643105 2025-01-16 01:18:50,313 - INFO - step 1630, loss: 4.262159, best loss: 3.643105 2025-01-16 01:18:50,463 - INFO - step 1631, loss: 4.158314, best loss: 3.643105 2025-01-16 01:18:50,613 - INFO - step 1632, loss: 4.120023, best loss: 3.643105 2025-01-16 01:18:50,763 - INFO - step 1633, loss: 4.156435, best loss: 3.643105 2025-01-16 01:18:50,913 - INFO - step 1634, loss: 4.098279, best 
loss: 3.643105 2025-01-16 01:18:51,063 - INFO - step 1635, loss: 3.933525, best loss: 3.643105 2025-01-16 01:18:51,214 - INFO - step 1636, loss: 3.757139, best loss: 3.643105 2025-01-16 01:18:54,740 - INFO - step 1637, loss: 3.444499, best loss: 3.444499 2025-01-16 01:18:54,897 - INFO - step 1638, loss: 4.399190, best loss: 3.444499 2025-01-16 01:18:55,048 - INFO - step 1639, loss: 4.887989, best loss: 3.444499 2025-01-16 01:18:55,199 - INFO - step 1640, loss: 4.991242, best loss: 3.444499 2025-01-16 01:18:55,349 - INFO - step 1641, loss: 5.166602, best loss: 3.444499 2025-01-16 01:18:55,499 - INFO - step 1642, loss: 5.183780, best loss: 3.444499 2025-01-16 01:18:55,649 - INFO - step 1643, loss: 4.740144, best loss: 3.444499 2025-01-16 01:18:55,799 - INFO - step 1644, loss: 4.886008, best loss: 3.444499 2025-01-16 01:18:55,949 - INFO - step 1645, loss: 4.976393, best loss: 3.444499 2025-01-16 01:18:56,099 - INFO - step 1646, loss: 4.490770, best loss: 3.444499 2025-01-16 01:18:56,249 - INFO - step 1647, loss: 4.264399, best loss: 3.444499 2025-01-16 01:18:56,399 - INFO - step 1648, loss: 4.516766, best loss: 3.444499 2025-01-16 01:18:56,549 - INFO - step 1649, loss: 4.387111, best loss: 3.444499 2025-01-16 01:18:56,699 - INFO - step 1650, loss: 4.945206, best loss: 3.444499 2025-01-16 01:18:56,849 - INFO - step 1651, loss: 4.744806, best loss: 3.444499 2025-01-16 01:18:57,000 - INFO - step 1652, loss: 4.971529, best loss: 3.444499 2025-01-16 01:18:57,150 - INFO - step 1653, loss: 4.670042, best loss: 3.444499 2025-01-16 01:18:57,300 - INFO - step 1654, loss: 5.048854, best loss: 3.444499 2025-01-16 01:18:57,450 - INFO - step 1655, loss: 4.565998, best loss: 3.444499 2025-01-16 01:18:57,600 - INFO - step 1656, loss: 4.759706, best loss: 3.444499 2025-01-16 01:18:57,750 - INFO - step 1657, loss: 4.495094, best loss: 3.444499 2025-01-16 01:18:57,900 - INFO - step 1658, loss: 4.803284, best loss: 3.444499 2025-01-16 01:18:58,050 - INFO - step 1659, loss: 4.563843, best 
loss: 3.444499 2025-01-16 01:18:58,200 - INFO - step 1660, loss: 4.444194, best loss: 3.444499 2025-01-16 01:18:58,350 - INFO - step 1661, loss: 5.214147, best loss: 3.444499 2025-01-16 01:18:58,500 - INFO - step 1662, loss: 4.421207, best loss: 3.444499 2025-01-16 01:18:58,650 - INFO - step 1663, loss: 4.769724, best loss: 3.444499 2025-01-16 01:18:58,800 - INFO - step 1664, loss: 4.676218, best loss: 3.444499 2025-01-16 01:18:58,950 - INFO - step 1665, loss: 4.752935, best loss: 3.444499 2025-01-16 01:18:59,100 - INFO - step 1666, loss: 4.721569, best loss: 3.444499 2025-01-16 01:18:59,250 - INFO - step 1667, loss: 4.400458, best loss: 3.444499 2025-01-16 01:18:59,400 - INFO - step 1668, loss: 4.338164, best loss: 3.444499 2025-01-16 01:18:59,550 - INFO - step 1669, loss: 4.475978, best loss: 3.444499 2025-01-16 01:18:59,700 - INFO - step 1670, loss: 3.941602, best loss: 3.444499 2025-01-16 01:18:59,850 - INFO - step 1671, loss: 4.762794, best loss: 3.444499 2025-01-16 01:19:00,000 - INFO - step 1672, loss: 3.973442, best loss: 3.444499 2025-01-16 01:19:00,150 - INFO - step 1673, loss: 3.848841, best loss: 3.444499 2025-01-16 01:19:00,300 - INFO - step 1674, loss: 4.374429, best loss: 3.444499 2025-01-16 01:19:00,450 - INFO - step 1675, loss: 4.376676, best loss: 3.444499 2025-01-16 01:19:00,600 - INFO - step 1676, loss: 4.354896, best loss: 3.444499 2025-01-16 01:19:00,750 - INFO - step 1677, loss: 4.026906, best loss: 3.444499 2025-01-16 01:19:00,900 - INFO - step 1678, loss: 4.302742, best loss: 3.444499 2025-01-16 01:19:01,051 - INFO - step 1679, loss: 4.361783, best loss: 3.444499 2025-01-16 01:19:01,201 - INFO - step 1680, loss: 4.336766, best loss: 3.444499 2025-01-16 01:19:01,351 - INFO - step 1681, loss: 4.115113, best loss: 3.444499 2025-01-16 01:19:01,501 - INFO - step 1682, loss: 4.587647, best loss: 3.444499 2025-01-16 01:19:01,650 - INFO - step 1683, loss: 4.565971, best loss: 3.444499 2025-01-16 01:19:01,800 - INFO - step 1684, loss: 4.568164, best 
loss: 3.444499 2025-01-16 01:19:01,950 - INFO - step 1685, loss: 4.077219, best loss: 3.444499 2025-01-16 01:19:02,101 - INFO - step 1686, loss: 4.245542, best loss: 3.444499 2025-01-16 01:19:02,251 - INFO - step 1687, loss: 4.621176, best loss: 3.444499 2025-01-16 01:19:02,401 - INFO - step 1688, loss: 4.088223, best loss: 3.444499 2025-01-16 01:19:02,551 - INFO - step 1689, loss: 4.723764, best loss: 3.444499 2025-01-16 01:19:02,701 - INFO - step 1690, loss: 4.696809, best loss: 3.444499 2025-01-16 01:19:02,851 - INFO - step 1691, loss: 4.661265, best loss: 3.444499 2025-01-16 01:19:03,001 - INFO - step 1692, loss: 4.583079, best loss: 3.444499 2025-01-16 01:19:03,151 - INFO - step 1693, loss: 4.574359, best loss: 3.444499 2025-01-16 01:19:03,301 - INFO - step 1694, loss: 4.462191, best loss: 3.444499 2025-01-16 01:19:03,452 - INFO - step 1695, loss: 4.351431, best loss: 3.444499 2025-01-16 01:19:03,602 - INFO - step 1696, loss: 5.094024, best loss: 3.444499 2025-01-16 01:19:03,752 - INFO - step 1697, loss: 4.594484, best loss: 3.444499 2025-01-16 01:19:03,902 - INFO - step 1698, loss: 4.966462, best loss: 3.444499 2025-01-16 01:19:04,052 - INFO - step 1699, loss: 4.097600, best loss: 3.444499 2025-01-16 01:19:04,202 - INFO - step 1700, loss: 4.012588, best loss: 3.444499 2025-01-16 01:19:04,352 - INFO - step 1701, loss: 4.476934, best loss: 3.444499 2025-01-16 01:19:04,502 - INFO - step 1702, loss: 4.550650, best loss: 3.444499 2025-01-16 01:19:04,652 - INFO - step 1703, loss: 4.265011, best loss: 3.444499 2025-01-16 01:19:04,802 - INFO - step 1704, loss: 4.454503, best loss: 3.444499 2025-01-16 01:19:04,952 - INFO - step 1705, loss: 4.254937, best loss: 3.444499 2025-01-16 01:19:05,102 - INFO - step 1706, loss: 4.344417, best loss: 3.444499 2025-01-16 01:19:05,252 - INFO - step 1707, loss: 4.648076, best loss: 3.444499 2025-01-16 01:19:05,402 - INFO - step 1708, loss: 4.190379, best loss: 3.444499 2025-01-16 01:19:05,552 - INFO - step 1709, loss: 4.203669, best 
loss: 3.444499 2025-01-16 01:19:05,702 - INFO - step 1710, loss: 4.360679, best loss: 3.444499 2025-01-16 01:19:05,853 - INFO - step 1711, loss: 4.508383, best loss: 3.444499 2025-01-16 01:19:06,003 - INFO - step 1712, loss: 4.370976, best loss: 3.444499 2025-01-16 01:19:06,153 - INFO - step 1713, loss: 4.599944, best loss: 3.444499 2025-01-16 01:19:06,303 - INFO - step 1714, loss: 4.356117, best loss: 3.444499 2025-01-16 01:19:06,453 - INFO - step 1715, loss: 4.044911, best loss: 3.444499 2025-01-16 01:19:06,603 - INFO - step 1716, loss: 4.459991, best loss: 3.444499 2025-01-16 01:19:06,753 - INFO - step 1717, loss: 3.753129, best loss: 3.444499 2025-01-16 01:19:06,903 - INFO - step 1718, loss: 4.126762, best loss: 3.444499 2025-01-16 01:19:07,053 - INFO - step 1719, loss: 4.358607, best loss: 3.444499 2025-01-16 01:19:07,203 - INFO - step 1720, loss: 4.233316, best loss: 3.444499 2025-01-16 01:19:07,353 - INFO - step 1721, loss: 4.123222, best loss: 3.444499 2025-01-16 01:19:07,503 - INFO - step 1722, loss: 4.410870, best loss: 3.444499 2025-01-16 01:19:07,653 - INFO - step 1723, loss: 4.651639, best loss: 3.444499 2025-01-16 01:19:07,804 - INFO - step 1724, loss: 4.445105, best loss: 3.444499 2025-01-16 01:19:07,954 - INFO - step 1725, loss: 4.751774, best loss: 3.444499 2025-01-16 01:19:08,103 - INFO - step 1726, loss: 4.281215, best loss: 3.444499 2025-01-16 01:19:08,253 - INFO - step 1727, loss: 4.422626, best loss: 3.444499 2025-01-16 01:19:08,404 - INFO - step 1728, loss: 4.331913, best loss: 3.444499 2025-01-16 01:19:08,554 - INFO - step 1729, loss: 4.014161, best loss: 3.444499 2025-01-16 01:19:08,704 - INFO - step 1730, loss: 4.825400, best loss: 3.444499 2025-01-16 01:19:08,854 - INFO - step 1731, loss: 4.713915, best loss: 3.444499 2025-01-16 01:19:09,003 - INFO - step 1732, loss: 4.423101, best loss: 3.444499 2025-01-16 01:19:09,154 - INFO - step 1733, loss: 4.290539, best loss: 3.444499 2025-01-16 01:19:09,304 - INFO - step 1734, loss: 3.932987, best 
loss: 3.444499 2025-01-16 01:19:09,454 - INFO - step 1735, loss: 4.183866, best loss: 3.444499 2025-01-16 01:19:09,605 - INFO - step 1736, loss: 4.134150, best loss: 3.444499 2025-01-16 01:19:09,755 - INFO - step 1737, loss: 4.042234, best loss: 3.444499 2025-01-16 01:19:09,905 - INFO - step 1738, loss: 4.780396, best loss: 3.444499 2025-01-16 01:19:10,055 - INFO - step 1739, loss: 4.739939, best loss: 3.444499 2025-01-16 01:19:10,205 - INFO - step 1740, loss: 4.533217, best loss: 3.444499 2025-01-16 01:19:10,355 - INFO - step 1741, loss: 4.784536, best loss: 3.444499 2025-01-16 01:19:10,505 - INFO - step 1742, loss: 4.445871, best loss: 3.444499 2025-01-16 01:19:10,655 - INFO - step 1743, loss: 4.804518, best loss: 3.444499 2025-01-16 01:19:10,805 - INFO - step 1744, loss: 4.946750, best loss: 3.444499 2025-01-16 01:19:10,955 - INFO - step 1745, loss: 4.865306, best loss: 3.444499 2025-01-16 01:19:11,105 - INFO - step 1746, loss: 4.947816, best loss: 3.444499 2025-01-16 01:19:11,255 - INFO - step 1747, loss: 4.641342, best loss: 3.444499 2025-01-16 01:19:11,405 - INFO - step 1748, loss: 4.929804, best loss: 3.444499 2025-01-16 01:19:11,555 - INFO - step 1749, loss: 4.643216, best loss: 3.444499 2025-01-16 01:19:11,705 - INFO - step 1750, loss: 4.447805, best loss: 3.444499 2025-01-16 01:19:11,855 - INFO - step 1751, loss: 4.901993, best loss: 3.444499 2025-01-16 01:19:12,005 - INFO - step 1752, loss: 4.860420, best loss: 3.444499 2025-01-16 01:19:12,155 - INFO - step 1753, loss: 4.715701, best loss: 3.444499 2025-01-16 01:19:12,305 - INFO - step 1754, loss: 4.574183, best loss: 3.444499 2025-01-16 01:19:12,456 - INFO - step 1755, loss: 4.899611, best loss: 3.444499 2025-01-16 01:19:12,606 - INFO - step 1756, loss: 4.593523, best loss: 3.444499 2025-01-16 01:19:12,755 - INFO - step 1757, loss: 4.422170, best loss: 3.444499 2025-01-16 01:19:12,906 - INFO - step 1758, loss: 4.287203, best loss: 3.444499 2025-01-16 01:19:13,056 - INFO - step 1759, loss: 4.713040, best 
loss: 3.444499 2025-01-16 01:19:13,205 - INFO - step 1760, loss: 4.765454, best loss: 3.444499 2025-01-16 01:19:13,356 - INFO - step 1761, loss: 4.976592, best loss: 3.444499 2025-01-16 01:19:13,506 - INFO - step 1762, loss: 4.867031, best loss: 3.444499 2025-01-16 01:19:13,655 - INFO - step 1763, loss: 4.447619, best loss: 3.444499 2025-01-16 01:19:13,805 - INFO - step 1764, loss: 4.814239, best loss: 3.444499 2025-01-16 01:19:13,955 - INFO - step 1765, loss: 4.357604, best loss: 3.444499 2025-01-16 01:19:14,105 - INFO - step 1766, loss: 4.848289, best loss: 3.444499 2025-01-16 01:19:14,256 - INFO - step 1767, loss: 4.431715, best loss: 3.444499 2025-01-16 01:19:14,406 - INFO - step 1768, loss: 4.546031, best loss: 3.444499 2025-01-16 01:19:14,556 - INFO - step 1769, loss: 4.543353, best loss: 3.444499 2025-01-16 01:19:14,706 - INFO - step 1770, loss: 4.551207, best loss: 3.444499 2025-01-16 01:19:14,856 - INFO - step 1771, loss: 4.362311, best loss: 3.444499 2025-01-16 01:19:15,006 - INFO - step 1772, loss: 4.602979, best loss: 3.444499 2025-01-16 01:19:15,156 - INFO - step 1773, loss: 4.139163, best loss: 3.444499 2025-01-16 01:19:15,305 - INFO - step 1774, loss: 3.808177, best loss: 3.444499 2025-01-16 01:19:15,455 - INFO - step 1775, loss: 4.000355, best loss: 3.444499 2025-01-16 01:19:15,605 - INFO - step 1776, loss: 4.289847, best loss: 3.444499 2025-01-16 01:19:15,755 - INFO - step 1777, loss: 5.020082, best loss: 3.444499 2025-01-16 01:19:15,905 - INFO - step 1778, loss: 4.526668, best loss: 3.444499 2025-01-16 01:19:16,055 - INFO - step 1779, loss: 4.714726, best loss: 3.444499 2025-01-16 01:19:16,205 - INFO - step 1780, loss: 4.895053, best loss: 3.444499 2025-01-16 01:19:16,355 - INFO - step 1781, loss: 4.436773, best loss: 3.444499 2025-01-16 01:19:16,505 - INFO - step 1782, loss: 5.059353, best loss: 3.444499 2025-01-16 01:19:16,655 - INFO - step 1783, loss: 4.488819, best loss: 3.444499 2025-01-16 01:19:16,805 - INFO - step 1784, loss: 4.727921, best 
loss: 3.444499 2025-01-16 01:19:16,955 - INFO - step 1785, loss: 4.465539, best loss: 3.444499 2025-01-16 01:19:17,105 - INFO - step 1786, loss: 5.123830, best loss: 3.444499 2025-01-16 01:19:17,255 - INFO - step 1787, loss: 4.483301, best loss: 3.444499 2025-01-16 01:19:17,405 - INFO - step 1788, loss: 4.273337, best loss: 3.444499 2025-01-16 01:19:17,555 - INFO - step 1789, loss: 4.710136, best loss: 3.444499 2025-01-16 01:19:17,705 - INFO - step 1790, loss: 4.368049, best loss: 3.444499 2025-01-16 01:19:17,854 - INFO - step 1791, loss: 4.173735, best loss: 3.444499 2025-01-16 01:19:18,004 - INFO - step 1792, loss: 5.083363, best loss: 3.444499 2025-01-16 01:19:18,154 - INFO - step 1793, loss: 4.571454, best loss: 3.444499 2025-01-16 01:19:18,304 - INFO - step 1794, loss: 4.020126, best loss: 3.444499 2025-01-16 01:19:18,454 - INFO - step 1795, loss: 4.329404, best loss: 3.444499 2025-01-16 01:19:18,604 - INFO - step 1796, loss: 4.627894, best loss: 3.444499 2025-01-16 01:19:18,754 - INFO - step 1797, loss: 4.532244, best loss: 3.444499 2025-01-16 01:19:18,903 - INFO - step 1798, loss: 4.511202, best loss: 3.444499 2025-01-16 01:19:19,053 - INFO - step 1799, loss: 4.258860, best loss: 3.444499 2025-01-16 01:19:19,204 - INFO - step 1800, loss: 4.924484, best loss: 3.444499 2025-01-16 01:19:19,353 - INFO - step 1801, loss: 4.679166, best loss: 3.444499 2025-01-16 01:19:19,503 - INFO - step 1802, loss: 4.617450, best loss: 3.444499 2025-01-16 01:19:19,653 - INFO - step 1803, loss: 4.291155, best loss: 3.444499 2025-01-16 01:19:19,803 - INFO - step 1804, loss: 4.605870, best loss: 3.444499 2025-01-16 01:19:19,953 - INFO - step 1805, loss: 4.440898, best loss: 3.444499 2025-01-16 01:19:20,103 - INFO - step 1806, loss: 4.041924, best loss: 3.444499 2025-01-16 01:19:20,253 - INFO - step 1807, loss: 4.544341, best loss: 3.444499 2025-01-16 01:19:20,403 - INFO - step 1808, loss: 4.118219, best loss: 3.444499 2025-01-16 01:19:20,553 - INFO - step 1809, loss: 4.580351, best 
loss: 3.444499 2025-01-16 01:19:20,702 - INFO - step 1810, loss: 4.312058, best loss: 3.444499 2025-01-16 01:19:20,852 - INFO - step 1811, loss: 4.459320, best loss: 3.444499 2025-01-16 01:19:21,002 - INFO - step 1812, loss: 4.286514, best loss: 3.444499 2025-01-16 01:19:21,153 - INFO - step 1813, loss: 4.679321, best loss: 3.444499 2025-01-16 01:19:21,303 - INFO - step 1814, loss: 4.910047, best loss: 3.444499 2025-01-16 01:19:21,453 - INFO - step 1815, loss: 4.619986, best loss: 3.444499 2025-01-16 01:19:21,602 - INFO - step 1816, loss: 4.761996, best loss: 3.444499 2025-01-16 01:19:21,752 - INFO - step 1817, loss: 4.266718, best loss: 3.444499 2025-01-16 01:19:21,902 - INFO - step 1818, loss: 4.600722, best loss: 3.444499 2025-01-16 01:19:22,052 - INFO - step 1819, loss: 4.672803, best loss: 3.444499 2025-01-16 01:19:22,202 - INFO - step 1820, loss: 4.503979, best loss: 3.444499 2025-01-16 01:19:22,352 - INFO - step 1821, loss: 4.308916, best loss: 3.444499 2025-01-16 01:19:22,502 - INFO - step 1822, loss: 4.272830, best loss: 3.444499 2025-01-16 01:19:22,652 - INFO - step 1823, loss: 4.281113, best loss: 3.444499 2025-01-16 01:19:22,802 - INFO - step 1824, loss: 4.484575, best loss: 3.444499 2025-01-16 01:19:22,952 - INFO - step 1825, loss: 4.910404, best loss: 3.444499 2025-01-16 01:19:23,102 - INFO - step 1826, loss: 4.955716, best loss: 3.444499 2025-01-16 01:19:23,252 - INFO - step 1827, loss: 4.678556, best loss: 3.444499 2025-01-16 01:19:23,402 - INFO - step 1828, loss: 4.771763, best loss: 3.444499 2025-01-16 01:19:23,552 - INFO - step 1829, loss: 4.617335, best loss: 3.444499 2025-01-16 01:19:23,701 - INFO - step 1830, loss: 4.585498, best loss: 3.444499 2025-01-16 01:19:23,852 - INFO - step 1831, loss: 4.039464, best loss: 3.444499 2025-01-16 01:19:24,001 - INFO - step 1832, loss: 4.639744, best loss: 3.444499 2025-01-16 01:19:24,152 - INFO - step 1833, loss: 4.954259, best loss: 3.444499 2025-01-16 01:19:24,302 - INFO - step 1834, loss: 4.537942, best 
loss: 3.444499 2025-01-16 01:19:24,452 - INFO - step 1835, loss: 4.610563, best loss: 3.444499 2025-01-16 01:19:24,602 - INFO - step 1836, loss: 4.613280, best loss: 3.444499 2025-01-16 01:19:24,752 - INFO - step 1837, loss: 4.163211, best loss: 3.444499 2025-01-16 01:19:24,902 - INFO - step 1838, loss: 3.679003, best loss: 3.444499 2025-01-16 01:19:25,052 - INFO - step 1839, loss: 4.459673, best loss: 3.444499 2025-01-16 01:19:25,201 - INFO - step 1840, loss: 4.695011, best loss: 3.444499 2025-01-16 01:19:25,351 - INFO - step 1841, loss: 4.730509, best loss: 3.444499 2025-01-16 01:19:25,501 - INFO - step 1842, loss: 4.595298, best loss: 3.444499 2025-01-16 01:19:25,651 - INFO - step 1843, loss: 4.242024, best loss: 3.444499 2025-01-16 01:19:25,801 - INFO - step 1844, loss: 4.457270, best loss: 3.444499 2025-01-16 01:19:25,951 - INFO - step 1845, loss: 4.502748, best loss: 3.444499 2025-01-16 01:19:26,101 - INFO - step 1846, loss: 4.246491, best loss: 3.444499 2025-01-16 01:19:26,252 - INFO - step 1847, loss: 4.449115, best loss: 3.444499 2025-01-16 01:19:26,402 - INFO - step 1848, loss: 4.345785, best loss: 3.444499 2025-01-16 01:19:26,551 - INFO - step 1849, loss: 4.165601, best loss: 3.444499 2025-01-16 01:19:26,702 - INFO - step 1850, loss: 4.622309, best loss: 3.444499 2025-01-16 01:19:26,852 - INFO - step 1851, loss: 4.099069, best loss: 3.444499 2025-01-16 01:19:27,001 - INFO - step 1852, loss: 4.442211, best loss: 3.444499 2025-01-16 01:19:27,151 - INFO - step 1853, loss: 4.740893, best loss: 3.444499 2025-01-16 01:19:27,301 - INFO - step 1854, loss: 4.341063, best loss: 3.444499 2025-01-16 01:19:27,451 - INFO - step 1855, loss: 3.846962, best loss: 3.444499 2025-01-16 01:19:27,601 - INFO - step 1856, loss: 4.500912, best loss: 3.444499 2025-01-16 01:19:27,751 - INFO - step 1857, loss: 4.717655, best loss: 3.444499 2025-01-16 01:19:27,901 - INFO - step 1858, loss: 4.940100, best loss: 3.444499 2025-01-16 01:19:28,051 - INFO - step 1859, loss: 4.704324, best 
loss: 3.444499 2025-01-16 01:19:28,201 - INFO - step 1860, loss: 4.850095, best loss: 3.444499 2025-01-16 01:19:28,351 - INFO - step 1861, loss: 4.676288, best loss: 3.444499 2025-01-16 01:19:28,501 - INFO - step 1862, loss: 4.960200, best loss: 3.444499 2025-01-16 01:19:28,650 - INFO - step 1863, loss: 4.388335, best loss: 3.444499 2025-01-16 01:19:28,800 - INFO - step 1864, loss: 4.496319, best loss: 3.444499 2025-01-16 01:19:28,950 - INFO - step 1865, loss: 4.616724, best loss: 3.444499 2025-01-16 01:19:29,100 - INFO - step 1866, loss: 4.617025, best loss: 3.444499 2025-01-16 01:19:29,250 - INFO - step 1867, loss: 4.601110, best loss: 3.444499 2025-01-16 01:19:29,400 - INFO - step 1868, loss: 4.358637, best loss: 3.444499 2025-01-16 01:19:29,550 - INFO - step 1869, loss: 4.387615, best loss: 3.444499 2025-01-16 01:19:29,700 - INFO - step 1870, loss: 4.600622, best loss: 3.444499 2025-01-16 01:19:29,850 - INFO - step 1871, loss: 4.798409, best loss: 3.444499 2025-01-16 01:19:30,000 - INFO - step 1872, loss: 4.457735, best loss: 3.444499 2025-01-16 01:19:30,150 - INFO - step 1873, loss: 4.893494, best loss: 3.444499 2025-01-16 01:19:30,300 - INFO - step 1874, loss: 4.922431, best loss: 3.444499 2025-01-16 01:19:30,450 - INFO - step 1875, loss: 5.088746, best loss: 3.444499 2025-01-16 01:19:30,600 - INFO - step 1876, loss: 5.146836, best loss: 3.444499 2025-01-16 01:19:30,750 - INFO - step 1877, loss: 5.116781, best loss: 3.444499 2025-01-16 01:19:30,900 - INFO - step 1878, loss: 4.531296, best loss: 3.444499 2025-01-16 01:19:31,050 - INFO - step 1879, loss: 4.919022, best loss: 3.444499 2025-01-16 01:19:31,199 - INFO - step 1880, loss: 4.845959, best loss: 3.444499 2025-01-16 01:19:31,349 - INFO - step 1881, loss: 5.040581, best loss: 3.444499 2025-01-16 01:19:31,499 - INFO - step 1882, loss: 4.444290, best loss: 3.444499 2025-01-16 01:19:31,649 - INFO - step 1883, loss: 4.635463, best loss: 3.444499 2025-01-16 01:19:31,799 - INFO - step 1884, loss: 4.620937, best 
loss: 3.444499 2025-01-16 01:19:31,949 - INFO - step 1885, loss: 4.512968, best loss: 3.444499 2025-01-16 01:19:32,099 - INFO - step 1886, loss: 4.806843, best loss: 3.444499 2025-01-16 01:19:32,249 - INFO - step 1887, loss: 4.268165, best loss: 3.444499 2025-01-16 01:19:32,399 - INFO - step 1888, loss: 4.321347, best loss: 3.444499 2025-01-16 01:19:32,549 - INFO - step 1889, loss: 4.796877, best loss: 3.444499 2025-01-16 01:19:32,699 - INFO - step 1890, loss: 4.534344, best loss: 3.444499 2025-01-16 01:19:32,848 - INFO - step 1891, loss: 4.742866, best loss: 3.444499 2025-01-16 01:19:32,999 - INFO - step 1892, loss: 4.525409, best loss: 3.444499 2025-01-16 01:19:33,149 - INFO - step 1893, loss: 5.178124, best loss: 3.444499 2025-01-16 01:19:33,298 - INFO - step 1894, loss: 4.879620, best loss: 3.444499 2025-01-16 01:19:33,448 - INFO - step 1895, loss: 4.414724, best loss: 3.444499 2025-01-16 01:19:33,598 - INFO - step 1896, loss: 4.366560, best loss: 3.444499 2025-01-16 01:19:33,748 - INFO - step 1897, loss: 4.880536, best loss: 3.444499 2025-01-16 01:19:33,898 - INFO - step 1898, loss: 4.533463, best loss: 3.444499 2025-01-16 01:19:34,048 - INFO - step 1899, loss: 4.489189, best loss: 3.444499 2025-01-16 01:19:34,198 - INFO - step 1900, loss: 4.828660, best loss: 3.444499 2025-01-16 01:19:34,348 - INFO - step 1901, loss: 4.719355, best loss: 3.444499 2025-01-16 01:19:34,498 - INFO - step 1902, loss: 4.667289, best loss: 3.444499 2025-01-16 01:19:34,648 - INFO - step 1903, loss: 4.330294, best loss: 3.444499 2025-01-16 01:19:34,798 - INFO - step 1904, loss: 4.129284, best loss: 3.444499 2025-01-16 01:19:34,947 - INFO - step 1905, loss: 4.168903, best loss: 3.444499 2025-01-16 01:19:35,098 - INFO - step 1906, loss: 4.136962, best loss: 3.444499 2025-01-16 01:19:35,248 - INFO - step 1907, loss: 4.097947, best loss: 3.444499 2025-01-16 01:19:35,397 - INFO - step 1908, loss: 4.592343, best loss: 3.444499 2025-01-16 01:19:35,548 - INFO - step 1909, loss: 4.344801, best 
loss: 3.444499 2025-01-16 01:19:35,698 - INFO - step 1910, loss: 4.218170, best loss: 3.444499 2025-01-16 01:19:35,848 - INFO - step 1911, loss: 4.419947, best loss: 3.444499 2025-01-16 01:19:35,998 - INFO - step 1912, loss: 4.419975, best loss: 3.444499 2025-01-16 01:19:36,147 - INFO - step 1913, loss: 4.414432, best loss: 3.444499 2025-01-16 01:19:36,297 - INFO - step 1914, loss: 4.530540, best loss: 3.444499 2025-01-16 01:19:36,447 - INFO - step 1915, loss: 4.790284, best loss: 3.444499 2025-01-16 01:19:36,597 - INFO - step 1916, loss: 4.174508, best loss: 3.444499 2025-01-16 01:19:36,747 - INFO - step 1917, loss: 4.186488, best loss: 3.444499 2025-01-16 01:19:36,897 - INFO - step 1918, loss: 4.495297, best loss: 3.444499 2025-01-16 01:19:37,046 - INFO - step 1919, loss: 4.601474, best loss: 3.444499 2025-01-16 01:19:37,196 - INFO - step 1920, loss: 4.175935, best loss: 3.444499 2025-01-16 01:19:37,346 - INFO - step 1921, loss: 4.070350, best loss: 3.444499 2025-01-16 01:19:37,496 - INFO - step 1922, loss: 4.324419, best loss: 3.444499 2025-01-16 01:19:37,646 - INFO - step 1923, loss: 4.461739, best loss: 3.444499 2025-01-16 01:19:37,796 - INFO - step 1924, loss: 4.067255, best loss: 3.444499 2025-01-16 01:19:37,947 - INFO - step 1925, loss: 4.479429, best loss: 3.444499 2025-01-16 01:19:38,096 - INFO - step 1926, loss: 4.324888, best loss: 3.444499 2025-01-16 01:19:38,246 - INFO - step 1927, loss: 4.083663, best loss: 3.444499 2025-01-16 01:19:38,396 - INFO - step 1928, loss: 4.033595, best loss: 3.444499 2025-01-16 01:19:38,546 - INFO - step 1929, loss: 4.308529, best loss: 3.444499 2025-01-16 01:19:38,696 - INFO - step 1930, loss: 3.840469, best loss: 3.444499 2025-01-16 01:19:38,846 - INFO - step 1931, loss: 4.168866, best loss: 3.444499 2025-01-16 01:19:38,996 - INFO - step 1932, loss: 3.851458, best loss: 3.444499 2025-01-16 01:19:39,146 - INFO - step 1933, loss: 4.503688, best loss: 3.444499 2025-01-16 01:19:39,296 - INFO - step 1934, loss: 4.816286, best 
loss: 3.444499 2025-01-16 01:19:39,446 - INFO - step 1935, loss: 5.233146, best loss: 3.444499 2025-01-16 01:19:39,596 - INFO - step 1936, loss: 4.571247, best loss: 3.444499 2025-01-16 01:19:39,746 - INFO - step 1937, loss: 4.800480, best loss: 3.444499 2025-01-16 01:19:39,896 - INFO - step 1938, loss: 4.644928, best loss: 3.444499 2025-01-16 01:19:40,046 - INFO - step 1939, loss: 4.456595, best loss: 3.444499 2025-01-16 01:19:40,196 - INFO - step 1940, loss: 4.329626, best loss: 3.444499 2025-01-16 01:19:40,345 - INFO - step 1941, loss: 4.726069, best loss: 3.444499 2025-01-16 01:19:40,495 - INFO - step 1942, loss: 4.292455, best loss: 3.444499 2025-01-16 01:19:40,645 - INFO - step 1943, loss: 3.912360, best loss: 3.444499 2025-01-16 01:19:40,795 - INFO - step 1944, loss: 4.229107, best loss: 3.444499 2025-01-16 01:19:40,945 - INFO - step 1945, loss: 4.213336, best loss: 3.444499 2025-01-16 01:19:41,095 - INFO - step 1946, loss: 4.433714, best loss: 3.444499 2025-01-16 01:19:41,245 - INFO - step 1947, loss: 3.538800, best loss: 3.444499 2025-01-16 01:19:41,395 - INFO - step 1948, loss: 4.207430, best loss: 3.444499 2025-01-16 01:19:41,545 - INFO - step 1949, loss: 4.457369, best loss: 3.444499 2025-01-16 01:19:41,695 - INFO - step 1950, loss: 4.553742, best loss: 3.444499 2025-01-16 01:19:41,845 - INFO - step 1951, loss: 4.292586, best loss: 3.444499 2025-01-16 01:19:41,994 - INFO - step 1952, loss: 4.351169, best loss: 3.444499 2025-01-16 01:19:42,144 - INFO - step 1953, loss: 4.310711, best loss: 3.444499 2025-01-16 01:19:42,294 - INFO - step 1954, loss: 3.953327, best loss: 3.444499 2025-01-16 01:19:42,444 - INFO - step 1955, loss: 4.499620, best loss: 3.444499 2025-01-16 01:19:42,594 - INFO - step 1956, loss: 4.350173, best loss: 3.444499 2025-01-16 01:19:42,744 - INFO - step 1957, loss: 4.341684, best loss: 3.444499 2025-01-16 01:19:42,894 - INFO - step 1958, loss: 3.945674, best loss: 3.444499 2025-01-16 01:19:43,044 - INFO - step 1959, loss: 3.999178, best 
loss: 3.444499 2025-01-16 01:19:43,194 - INFO - step 1960, loss: 4.132164, best loss: 3.444499 2025-01-16 01:19:43,344 - INFO - step 1961, loss: 4.054665, best loss: 3.444499 2025-01-16 01:19:43,494 - INFO - step 1962, loss: 4.038637, best loss: 3.444499 2025-01-16 01:19:43,644 - INFO - step 1963, loss: 4.074815, best loss: 3.444499 2025-01-16 01:19:43,794 - INFO - step 1964, loss: 3.964961, best loss: 3.444499 2025-01-16 01:19:43,944 - INFO - step 1965, loss: 3.764654, best loss: 3.444499 2025-01-16 01:19:44,094 - INFO - step 1966, loss: 3.647218, best loss: 3.444499 2025-01-16 01:19:47,599 - INFO - step 1967, loss: 3.365786, best loss: 3.365786 2025-01-16 01:19:47,761 - INFO - step 1968, loss: 4.305971, best loss: 3.365786 2025-01-16 01:19:47,918 - INFO - step 1969, loss: 4.751580, best loss: 3.365786 2025-01-16 01:19:48,068 - INFO - step 1970, loss: 4.832130, best loss: 3.365786 2025-01-16 01:19:48,218 - INFO - step 1971, loss: 5.007476, best loss: 3.365786 2025-01-16 01:19:48,368 - INFO - step 1972, loss: 5.045435, best loss: 3.365786 2025-01-16 01:19:48,518 - INFO - step 1973, loss: 4.552763, best loss: 3.365786 2025-01-16 01:19:48,668 - INFO - step 1974, loss: 4.747596, best loss: 3.365786 2025-01-16 01:19:48,818 - INFO - step 1975, loss: 4.887081, best loss: 3.365786 2025-01-16 01:19:48,968 - INFO - step 1976, loss: 4.317879, best loss: 3.365786 2025-01-16 01:19:49,118 - INFO - step 1977, loss: 4.150571, best loss: 3.365786 2025-01-16 01:19:49,269 - INFO - step 1978, loss: 4.403630, best loss: 3.365786 2025-01-16 01:19:49,419 - INFO - step 1979, loss: 4.240231, best loss: 3.365786 2025-01-16 01:19:49,569 - INFO - step 1980, loss: 4.826247, best loss: 3.365786 2025-01-16 01:19:49,720 - INFO - step 1981, loss: 4.625219, best loss: 3.365786 2025-01-16 01:19:49,869 - INFO - step 1982, loss: 4.838290, best loss: 3.365786 2025-01-16 01:19:50,019 - INFO - step 1983, loss: 4.535506, best loss: 3.365786 2025-01-16 01:19:50,169 - INFO - step 1984, loss: 4.830039, best 
loss: 3.365786 2025-01-16 01:19:50,319 - INFO - step 1985, loss: 4.326384, best loss: 3.365786 2025-01-16 01:19:50,469 - INFO - step 1986, loss: 4.700591, best loss: 3.365786 2025-01-16 01:19:50,619 - INFO - step 1987, loss: 4.405792, best loss: 3.365786 2025-01-16 01:19:50,769 - INFO - step 1988, loss: 4.673093, best loss: 3.365786 2025-01-16 01:19:50,919 - INFO - step 1989, loss: 4.421672, best loss: 3.365786 2025-01-16 01:19:51,069 - INFO - step 1990, loss: 4.278938, best loss: 3.365786 2025-01-16 01:19:51,219 - INFO - step 1991, loss: 5.053328, best loss: 3.365786 2025-01-16 01:19:51,369 - INFO - step 1992, loss: 4.220458, best loss: 3.365786 2025-01-16 01:19:51,519 - INFO - step 1993, loss: 4.658002, best loss: 3.365786 2025-01-16 01:19:51,670 - INFO - step 1994, loss: 4.585019, best loss: 3.365786 2025-01-16 01:19:51,820 - INFO - step 1995, loss: 4.611610, best loss: 3.365786 2025-01-16 01:19:51,970 - INFO - step 1996, loss: 4.557939, best loss: 3.365786 2025-01-16 01:19:52,119 - INFO - step 1997, loss: 4.260723, best loss: 3.365786 2025-01-16 01:19:52,269 - INFO - step 1998, loss: 4.196182, best loss: 3.365786 2025-01-16 01:19:52,419 - INFO - step 1999, loss: 4.332960, best loss: 3.365786 2025-01-16 01:19:52,569 - INFO - step 2000, loss: 3.787109, best loss: 3.365786 2025-01-16 01:19:52,719 - INFO - step 2001, loss: 4.645494, best loss: 3.365786 2025-01-16 01:19:52,869 - INFO - step 2002, loss: 3.862845, best loss: 3.365786 2025-01-16 01:19:53,019 - INFO - step 2003, loss: 3.709783, best loss: 3.365786 2025-01-16 01:19:53,169 - INFO - step 2004, loss: 4.264852, best loss: 3.365786 2025-01-16 01:19:53,319 - INFO - step 2005, loss: 4.244766, best loss: 3.365786 2025-01-16 01:19:53,470 - INFO - step 2006, loss: 4.222635, best loss: 3.365786 2025-01-16 01:19:53,620 - INFO - step 2007, loss: 3.893689, best loss: 3.365786 2025-01-16 01:19:53,769 - INFO - step 2008, loss: 4.170681, best loss: 3.365786 2025-01-16 01:19:53,919 - INFO - step 2009, loss: 4.213867, best 
loss: 3.365786 2025-01-16 01:19:54,069 - INFO - step 2010, loss: 4.189872, best loss: 3.365786 2025-01-16 01:19:54,220 - INFO - step 2011, loss: 4.003039, best loss: 3.365786 2025-01-16 01:19:54,370 - INFO - step 2012, loss: 4.466764, best loss: 3.365786 2025-01-16 01:19:54,520 - INFO - step 2013, loss: 4.410840, best loss: 3.365786 2025-01-16 01:19:54,670 - INFO - step 2014, loss: 4.430594, best loss: 3.365786 2025-01-16 01:19:54,821 - INFO - step 2015, loss: 3.943619, best loss: 3.365786 2025-01-16 01:19:54,970 - INFO - step 2016, loss: 4.162528, best loss: 3.365786 2025-01-16 01:19:55,120 - INFO - step 2017, loss: 4.562455, best loss: 3.365786 2025-01-16 01:19:55,270 - INFO - step 2018, loss: 4.005974, best loss: 3.365786 2025-01-16 01:19:55,420 - INFO - step 2019, loss: 4.580411, best loss: 3.365786 2025-01-16 01:19:55,570 - INFO - step 2020, loss: 4.564038, best loss: 3.365786 2025-01-16 01:19:55,720 - INFO - step 2021, loss: 4.508029, best loss: 3.365786 2025-01-16 01:19:55,870 - INFO - step 2022, loss: 4.475643, best loss: 3.365786 2025-01-16 01:19:56,020 - INFO - step 2023, loss: 4.461284, best loss: 3.365786 2025-01-16 01:19:56,170 - INFO - step 2024, loss: 4.348899, best loss: 3.365786 2025-01-16 01:19:56,320 - INFO - step 2025, loss: 4.158384, best loss: 3.365786 2025-01-16 01:19:56,470 - INFO - step 2026, loss: 4.998446, best loss: 3.365786 2025-01-16 01:19:56,620 - INFO - step 2027, loss: 4.456056, best loss: 3.365786 2025-01-16 01:19:56,770 - INFO - step 2028, loss: 4.785886, best loss: 3.365786 2025-01-16 01:19:56,921 - INFO - step 2029, loss: 3.834531, best loss: 3.365786 2025-01-16 01:19:57,070 - INFO - step 2030, loss: 3.859263, best loss: 3.365786 2025-01-16 01:19:57,220 - INFO - step 2031, loss: 4.412868, best loss: 3.365786 2025-01-16 01:19:57,370 - INFO - step 2032, loss: 4.487470, best loss: 3.365786 2025-01-16 01:19:57,520 - INFO - step 2033, loss: 4.167033, best loss: 3.365786 2025-01-16 01:19:57,670 - INFO - step 2034, loss: 4.306668, best 
loss: 3.365786 2025-01-16 01:19:57,820 - INFO - step 2035, loss: 4.118422, best loss: 3.365786 2025-01-16 01:19:57,970 - INFO - step 2036, loss: 4.250541, best loss: 3.365786 2025-01-16 01:19:58,120 - INFO - step 2037, loss: 4.566323, best loss: 3.365786 2025-01-16 01:19:58,270 - INFO - step 2038, loss: 4.115898, best loss: 3.365786 2025-01-16 01:19:58,420 - INFO - step 2039, loss: 4.101805, best loss: 3.365786 2025-01-16 01:19:58,570 - INFO - step 2040, loss: 4.234598, best loss: 3.365786 2025-01-16 01:19:58,720 - INFO - step 2041, loss: 4.406959, best loss: 3.365786 2025-01-16 01:19:58,870 - INFO - step 2042, loss: 4.278063, best loss: 3.365786 2025-01-16 01:19:59,020 - INFO - step 2043, loss: 4.532535, best loss: 3.365786 2025-01-16 01:19:59,170 - INFO - step 2044, loss: 4.254408, best loss: 3.365786 2025-01-16 01:19:59,320 - INFO - step 2045, loss: 3.975920, best loss: 3.365786 2025-01-16 01:19:59,470 - INFO - step 2046, loss: 4.352916, best loss: 3.365786 2025-01-16 01:19:59,620 - INFO - step 2047, loss: 3.607635, best loss: 3.365786 2025-01-16 01:19:59,771 - INFO - step 2048, loss: 4.030655, best loss: 3.365786 2025-01-16 01:19:59,921 - INFO - step 2049, loss: 4.240450, best loss: 3.365786 2025-01-16 01:20:00,071 - INFO - step 2050, loss: 4.172235, best loss: 3.365786 2025-01-16 01:20:00,221 - INFO - step 2051, loss: 4.066116, best loss: 3.365786 2025-01-16 01:20:00,371 - INFO - step 2052, loss: 4.332716, best loss: 3.365786 2025-01-16 01:20:00,521 - INFO - step 2053, loss: 4.560111, best loss: 3.365786 2025-01-16 01:20:00,671 - INFO - step 2054, loss: 4.366543, best loss: 3.365786 2025-01-16 01:20:00,821 - INFO - step 2055, loss: 4.634885, best loss: 3.365786 2025-01-16 01:20:00,971 - INFO - step 2056, loss: 4.214121, best loss: 3.365786 2025-01-16 01:20:01,121 - INFO - step 2057, loss: 4.324586, best loss: 3.365786 2025-01-16 01:20:01,271 - INFO - step 2058, loss: 4.122445, best loss: 3.365786 2025-01-16 01:20:01,421 - INFO - step 2059, loss: 3.842197, best 
loss: 3.365786 2025-01-16 01:20:01,571 - INFO - step 2060, loss: 4.699141, best loss: 3.365786 2025-01-16 01:20:01,721 - INFO - step 2061, loss: 4.638427, best loss: 3.365786 2025-01-16 01:20:01,871 - INFO - step 2062, loss: 4.267258, best loss: 3.365786 2025-01-16 01:20:02,021 - INFO - step 2063, loss: 4.135425, best loss: 3.365786 2025-01-16 01:20:02,171 - INFO - step 2064, loss: 3.778512, best loss: 3.365786 2025-01-16 01:20:02,321 - INFO - step 2065, loss: 4.024215, best loss: 3.365786 2025-01-16 01:20:02,470 - INFO - step 2066, loss: 4.009557, best loss: 3.365786 2025-01-16 01:20:02,621 - INFO - step 2067, loss: 3.965559, best loss: 3.365786 2025-01-16 01:20:02,770 - INFO - step 2068, loss: 4.701775, best loss: 3.365786 2025-01-16 01:20:02,920 - INFO - step 2069, loss: 4.624771, best loss: 3.365786 2025-01-16 01:20:03,070 - INFO - step 2070, loss: 4.388451, best loss: 3.365786 2025-01-16 01:20:03,220 - INFO - step 2071, loss: 4.680478, best loss: 3.365786 2025-01-16 01:20:03,370 - INFO - step 2072, loss: 4.297402, best loss: 3.365786 2025-01-16 01:20:03,520 - INFO - step 2073, loss: 4.719445, best loss: 3.365786 2025-01-16 01:20:03,671 - INFO - step 2074, loss: 4.844286, best loss: 3.365786 2025-01-16 01:20:03,821 - INFO - step 2075, loss: 4.764591, best loss: 3.365786 2025-01-16 01:20:03,971 - INFO - step 2076, loss: 4.823060, best loss: 3.365786 2025-01-16 01:20:04,121 - INFO - step 2077, loss: 4.503260, best loss: 3.365786 2025-01-16 01:20:04,271 - INFO - step 2078, loss: 4.813829, best loss: 3.365786 2025-01-16 01:20:04,421 - INFO - step 2079, loss: 4.511494, best loss: 3.365786 2025-01-16 01:20:04,571 - INFO - step 2080, loss: 4.290452, best loss: 3.365786 2025-01-16 01:20:04,721 - INFO - step 2081, loss: 4.736899, best loss: 3.365786 2025-01-16 01:20:04,871 - INFO - step 2082, loss: 4.747751, best loss: 3.365786 2025-01-16 01:20:05,021 - INFO - step 2083, loss: 4.595129, best loss: 3.365786 2025-01-16 01:20:05,171 - INFO - step 2084, loss: 4.445728, best 
loss: 3.365786 2025-01-16 01:20:05,321 - INFO - step 2085, loss: 4.794281, best loss: 3.365786 2025-01-16 01:20:05,470 - INFO - step 2086, loss: 4.469361, best loss: 3.365786 2025-01-16 01:20:05,621 - INFO - step 2087, loss: 4.272638, best loss: 3.365786 2025-01-16 01:20:05,771 - INFO - step 2088, loss: 4.184638, best loss: 3.365786 2025-01-16 01:20:05,921 - INFO - step 2089, loss: 4.612098, best loss: 3.365786 2025-01-16 01:20:06,071 - INFO - step 2090, loss: 4.654095, best loss: 3.365786 2025-01-16 01:20:06,220 - INFO - step 2091, loss: 4.874041, best loss: 3.365786 2025-01-16 01:20:06,370 - INFO - step 2092, loss: 4.734110, best loss: 3.365786 2025-01-16 01:20:06,520 - INFO - step 2093, loss: 4.304569, best loss: 3.365786 2025-01-16 01:20:06,670 - INFO - step 2094, loss: 4.729260, best loss: 3.365786 2025-01-16 01:20:06,820 - INFO - step 2095, loss: 4.258372, best loss: 3.365786 2025-01-16 01:20:06,970 - INFO - step 2096, loss: 4.710384, best loss: 3.365786 2025-01-16 01:20:07,120 - INFO - step 2097, loss: 4.267449, best loss: 3.365786 2025-01-16 01:20:07,270 - INFO - step 2098, loss: 4.452733, best loss: 3.365786 2025-01-16 01:20:07,420 - INFO - step 2099, loss: 4.381345, best loss: 3.365786 2025-01-16 01:20:07,570 - INFO - step 2100, loss: 4.439862, best loss: 3.365786 2025-01-16 01:20:07,720 - INFO - step 2101, loss: 4.227244, best loss: 3.365786 2025-01-16 01:20:07,871 - INFO - step 2102, loss: 4.473489, best loss: 3.365786 2025-01-16 01:20:08,021 - INFO - step 2103, loss: 4.004318, best loss: 3.365786 2025-01-16 01:20:08,170 - INFO - step 2104, loss: 3.688174, best loss: 3.365786 2025-01-16 01:20:08,320 - INFO - step 2105, loss: 3.897554, best loss: 3.365786 2025-01-16 01:20:08,470 - INFO - step 2106, loss: 4.150882, best loss: 3.365786 2025-01-16 01:20:08,620 - INFO - step 2107, loss: 4.880678, best loss: 3.365786 2025-01-16 01:20:08,770 - INFO - step 2108, loss: 4.395987, best loss: 3.365786 2025-01-16 01:20:08,920 - INFO - step 2109, loss: 4.449521, best 
loss: 3.365786 2025-01-16 01:20:09,070 - INFO - step 2110, loss: 4.745208, best loss: 3.365786 2025-01-16 01:20:09,220 - INFO - step 2111, loss: 4.311430, best loss: 3.365786 2025-01-16 01:20:09,370 - INFO - step 2112, loss: 4.915691, best loss: 3.365786 2025-01-16 01:20:09,520 - INFO - step 2113, loss: 4.262475, best loss: 3.365786 2025-01-16 01:20:09,670 - INFO - step 2114, loss: 4.553813, best loss: 3.365786 2025-01-16 01:20:09,820 - INFO - step 2115, loss: 4.268216, best loss: 3.365786 2025-01-16 01:20:09,970 - INFO - step 2116, loss: 4.967653, best loss: 3.365786 2025-01-16 01:20:10,120 - INFO - step 2117, loss: 4.376955, best loss: 3.365786 2025-01-16 01:20:10,270 - INFO - step 2118, loss: 4.107978, best loss: 3.365786 2025-01-16 01:20:10,420 - INFO - step 2119, loss: 4.586154, best loss: 3.365786 2025-01-16 01:20:10,570 - INFO - step 2120, loss: 4.240215, best loss: 3.365786 2025-01-16 01:20:10,720 - INFO - step 2121, loss: 4.036891, best loss: 3.365786 2025-01-16 01:20:10,870 - INFO - step 2122, loss: 4.970226, best loss: 3.365786 2025-01-16 01:20:11,020 - INFO - step 2123, loss: 4.464821, best loss: 3.365786 2025-01-16 01:20:11,170 - INFO - step 2124, loss: 3.888701, best loss: 3.365786 2025-01-16 01:20:11,321 - INFO - step 2125, loss: 4.207209, best loss: 3.365786 2025-01-16 01:20:11,471 - INFO - step 2126, loss: 4.508816, best loss: 3.365786 2025-01-16 01:20:11,621 - INFO - step 2127, loss: 4.376008, best loss: 3.365786 2025-01-16 01:20:11,771 - INFO - step 2128, loss: 4.362422, best loss: 3.365786 2025-01-16 01:20:11,921 - INFO - step 2129, loss: 4.148305, best loss: 3.365786 2025-01-16 01:20:12,071 - INFO - step 2130, loss: 4.771512, best loss: 3.365786 2025-01-16 01:20:12,221 - INFO - step 2131, loss: 4.583955, best loss: 3.365786 2025-01-16 01:20:12,371 - INFO - step 2132, loss: 4.466474, best loss: 3.365786 2025-01-16 01:20:12,521 - INFO - step 2133, loss: 4.146929, best loss: 3.365786 2025-01-16 01:20:12,671 - INFO - step 2134, loss: 4.469459, best 
loss: 3.365786 2025-01-16 01:20:12,821 - INFO - step 2135, loss: 4.280499, best loss: 3.365786 2025-01-16 01:20:12,970 - INFO - step 2136, loss: 3.895651, best loss: 3.365786 2025-01-16 01:20:13,120 - INFO - step 2137, loss: 4.382687, best loss: 3.365786 2025-01-16 01:20:13,270 - INFO - step 2138, loss: 3.979737, best loss: 3.365786 2025-01-16 01:20:13,420 - INFO - step 2139, loss: 4.437334, best loss: 3.365786 2025-01-16 01:20:13,570 - INFO - step 2140, loss: 4.179725, best loss: 3.365786 2025-01-16 01:20:13,720 - INFO - step 2141, loss: 4.319169, best loss: 3.365786 2025-01-16 01:20:13,870 - INFO - step 2142, loss: 4.138318, best loss: 3.365786 2025-01-16 01:20:14,020 - INFO - step 2143, loss: 4.494524, best loss: 3.365786 2025-01-16 01:20:14,170 - INFO - step 2144, loss: 4.782467, best loss: 3.365786 2025-01-16 01:20:14,320 - INFO - step 2145, loss: 4.553587, best loss: 3.365786 2025-01-16 01:20:14,470 - INFO - step 2146, loss: 4.674507, best loss: 3.365786 2025-01-16 01:20:14,620 - INFO - step 2147, loss: 4.125454, best loss: 3.365786 2025-01-16 01:20:14,770 - INFO - step 2148, loss: 4.450150, best loss: 3.365786 2025-01-16 01:20:14,920 - INFO - step 2149, loss: 4.525848, best loss: 3.365786 2025-01-16 01:20:15,069 - INFO - step 2150, loss: 4.323960, best loss: 3.365786 2025-01-16 01:20:15,220 - INFO - step 2151, loss: 4.192309, best loss: 3.365786 2025-01-16 01:20:15,370 - INFO - step 2152, loss: 4.211489, best loss: 3.365786 2025-01-16 01:20:15,520 - INFO - step 2153, loss: 4.259064, best loss: 3.365786 2025-01-16 01:20:15,670 - INFO - step 2154, loss: 4.412490, best loss: 3.365786 2025-01-16 01:20:15,820 - INFO - step 2155, loss: 4.758875, best loss: 3.365786 2025-01-16 01:20:15,970 - INFO - step 2156, loss: 4.851939, best loss: 3.365786 2025-01-16 01:20:16,120 - INFO - step 2157, loss: 4.578955, best loss: 3.365786 2025-01-16 01:20:16,270 - INFO - step 2158, loss: 4.676840, best loss: 3.365786 2025-01-16 01:20:16,420 - INFO - step 2159, loss: 4.537306, best 
loss: 3.365786 2025-01-16 01:20:16,570 - INFO - step 2160, loss: 4.502434, best loss: 3.365786 2025-01-16 01:20:16,720 - INFO - step 2161, loss: 3.902988, best loss: 3.365786 2025-01-16 01:20:16,871 - INFO - step 2162, loss: 4.548420, best loss: 3.365786 2025-01-16 01:20:17,021 - INFO - step 2163, loss: 4.860206, best loss: 3.365786 2025-01-16 01:20:17,171 - INFO - step 2164, loss: 4.442837, best loss: 3.365786 2025-01-16 01:20:17,321 - INFO - step 2165, loss: 4.514628, best loss: 3.365786 2025-01-16 01:20:17,471 - INFO - step 2166, loss: 4.510094, best loss: 3.365786 2025-01-16 01:20:17,621 - INFO - step 2167, loss: 4.048302, best loss: 3.365786 2025-01-16 01:20:17,771 - INFO - step 2168, loss: 3.467144, best loss: 3.365786 2025-01-16 01:20:17,921 - INFO - step 2169, loss: 4.337586, best loss: 3.365786 2025-01-16 01:20:18,071 - INFO - step 2170, loss: 4.543154, best loss: 3.365786 2025-01-16 01:20:18,221 - INFO - step 2171, loss: 4.584580, best loss: 3.365786 2025-01-16 01:20:18,371 - INFO - step 2172, loss: 4.468502, best loss: 3.365786 2025-01-16 01:20:18,521 - INFO - step 2173, loss: 4.148065, best loss: 3.365786 2025-01-16 01:20:18,671 - INFO - step 2174, loss: 4.348150, best loss: 3.365786 2025-01-16 01:20:18,821 - INFO - step 2175, loss: 4.447546, best loss: 3.365786 2025-01-16 01:20:18,971 - INFO - step 2176, loss: 4.137503, best loss: 3.365786 2025-01-16 01:20:19,121 - INFO - step 2177, loss: 4.343268, best loss: 3.365786 2025-01-16 01:20:19,271 - INFO - step 2178, loss: 4.239983, best loss: 3.365786 2025-01-16 01:20:19,421 - INFO - step 2179, loss: 4.040792, best loss: 3.365786 2025-01-16 01:20:19,571 - INFO - step 2180, loss: 4.531077, best loss: 3.365786 2025-01-16 01:20:19,721 - INFO - step 2181, loss: 3.983596, best loss: 3.365786 2025-01-16 01:20:19,871 - INFO - step 2182, loss: 4.342096, best loss: 3.365786 2025-01-16 01:20:20,021 - INFO - step 2183, loss: 4.636961, best loss: 3.365786 2025-01-16 01:20:20,171 - INFO - step 2184, loss: 4.239333, best 
loss: 3.365786 2025-01-16 01:20:20,321 - INFO - step 2185, loss: 3.748000, best loss: 3.365786 2025-01-16 01:20:20,471 - INFO - step 2186, loss: 4.371206, best loss: 3.365786 2025-01-16 01:20:20,621 - INFO - step 2187, loss: 4.640535, best loss: 3.365786 2025-01-16 01:20:20,771 - INFO - step 2188, loss: 4.879212, best loss: 3.365786 2025-01-16 01:20:20,921 - INFO - step 2189, loss: 4.623385, best loss: 3.365786 2025-01-16 01:20:21,071 - INFO - step 2190, loss: 4.759142, best loss: 3.365786 2025-01-16 01:20:21,221 - INFO - step 2191, loss: 4.591348, best loss: 3.365786 2025-01-16 01:20:21,371 - INFO - step 2192, loss: 4.834535, best loss: 3.365786 2025-01-16 01:20:21,521 - INFO - step 2193, loss: 4.283295, best loss: 3.365786 2025-01-16 01:20:21,671 - INFO - step 2194, loss: 4.409310, best loss: 3.365786 2025-01-16 01:20:21,821 - INFO - step 2195, loss: 4.603210, best loss: 3.365786 2025-01-16 01:20:21,971 - INFO - step 2196, loss: 4.571886, best loss: 3.365786 2025-01-16 01:20:22,121 - INFO - step 2197, loss: 4.519885, best loss: 3.365786 2025-01-16 01:20:22,271 - INFO - step 2198, loss: 4.250565, best loss: 3.365786 2025-01-16 01:20:22,421 - INFO - step 2199, loss: 4.281387, best loss: 3.365786 2025-01-16 01:20:22,571 - INFO - step 2200, loss: 4.507528, best loss: 3.365786 2025-01-16 01:20:22,720 - INFO - step 2201, loss: 4.702680, best loss: 3.365786 2025-01-16 01:20:22,870 - INFO - step 2202, loss: 4.410825, best loss: 3.365786 2025-01-16 01:20:23,020 - INFO - step 2203, loss: 4.817811, best loss: 3.365786 2025-01-16 01:20:23,170 - INFO - step 2204, loss: 4.814015, best loss: 3.365786 2025-01-16 01:20:23,321 - INFO - step 2205, loss: 4.934943, best loss: 3.365786 2025-01-16 01:20:23,471 - INFO - step 2206, loss: 5.041353, best loss: 3.365786 2025-01-16 01:20:23,620 - INFO - step 2207, loss: 4.991591, best loss: 3.365786 2025-01-16 01:20:23,770 - INFO - step 2208, loss: 4.345539, best loss: 3.365786 2025-01-16 01:20:23,920 - INFO - step 2209, loss: 4.822869, best 
loss: 3.365786 2025-01-16 01:20:24,070 - INFO - step 2210, loss: 4.761040, best loss: 3.365786 2025-01-16 01:20:24,220 - INFO - step 2211, loss: 4.920417, best loss: 3.365786 2025-01-16 01:20:24,370 - INFO - step 2212, loss: 4.315870, best loss: 3.365786 2025-01-16 01:20:24,520 - INFO - step 2213, loss: 4.526197, best loss: 3.365786 2025-01-16 01:20:24,670 - INFO - step 2214, loss: 4.519526, best loss: 3.365786 2025-01-16 01:20:24,820 - INFO - step 2215, loss: 4.434987, best loss: 3.365786 2025-01-16 01:20:24,970 - INFO - step 2216, loss: 4.719637, best loss: 3.365786 2025-01-16 01:20:25,120 - INFO - step 2217, loss: 4.197643, best loss: 3.365786 2025-01-16 01:20:25,270 - INFO - step 2218, loss: 4.187955, best loss: 3.365786 2025-01-16 01:20:25,420 - INFO - step 2219, loss: 4.703060, best loss: 3.365786 2025-01-16 01:20:25,570 - INFO - step 2220, loss: 4.439255, best loss: 3.365786 2025-01-16 01:20:25,720 - INFO - step 2221, loss: 4.624660, best loss: 3.365786 2025-01-16 01:20:25,870 - INFO - step 2222, loss: 4.397116, best loss: 3.365786 2025-01-16 01:20:26,020 - INFO - step 2223, loss: 5.033679, best loss: 3.365786 2025-01-16 01:20:26,170 - INFO - step 2224, loss: 4.743614, best loss: 3.365786 2025-01-16 01:20:26,320 - INFO - step 2225, loss: 4.269752, best loss: 3.365786 2025-01-16 01:20:26,470 - INFO - step 2226, loss: 4.271940, best loss: 3.365786 2025-01-16 01:20:26,620 - INFO - step 2227, loss: 4.802166, best loss: 3.365786 2025-01-16 01:20:26,770 - INFO - step 2228, loss: 4.326671, best loss: 3.365786 2025-01-16 01:20:26,920 - INFO - step 2229, loss: 4.272419, best loss: 3.365786 2025-01-16 01:20:27,070 - INFO - step 2230, loss: 4.669510, best loss: 3.365786 2025-01-16 01:20:27,220 - INFO - step 2231, loss: 4.591818, best loss: 3.365786 2025-01-16 01:20:27,370 - INFO - step 2232, loss: 4.553791, best loss: 3.365786 2025-01-16 01:20:27,520 - INFO - step 2233, loss: 4.148795, best loss: 3.365786 2025-01-16 01:20:27,670 - INFO - step 2234, loss: 3.902905, best 
loss: 3.365786 2025-01-16 01:20:27,820 - INFO - step 2235, loss: 3.993744, best loss: 3.365786 2025-01-16 01:20:27,970 - INFO - step 2236, loss: 3.962801, best loss: 3.365786 2025-01-16 01:20:28,120 - INFO - step 2237, loss: 3.946796, best loss: 3.365786 2025-01-16 01:20:28,270 - INFO - step 2238, loss: 4.440948, best loss: 3.365786 2025-01-16 01:20:28,420 - INFO - step 2239, loss: 4.269834, best loss: 3.365786 2025-01-16 01:20:28,570 - INFO - step 2240, loss: 4.129732, best loss: 3.365786 2025-01-16 01:20:28,720 - INFO - step 2241, loss: 4.339722, best loss: 3.365786 2025-01-16 01:20:28,870 - INFO - step 2242, loss: 4.276417, best loss: 3.365786 2025-01-16 01:20:29,020 - INFO - step 2243, loss: 4.263684, best loss: 3.365786 2025-01-16 01:20:29,170 - INFO - step 2244, loss: 4.379226, best loss: 3.365786 2025-01-16 01:20:29,320 - INFO - step 2245, loss: 4.672460, best loss: 3.365786 2025-01-16 01:20:29,470 - INFO - step 2246, loss: 4.067685, best loss: 3.365786 2025-01-16 01:20:29,620 - INFO - step 2247, loss: 4.066444, best loss: 3.365786 2025-01-16 01:20:29,770 - INFO - step 2248, loss: 4.339128, best loss: 3.365786 2025-01-16 01:20:29,920 - INFO - step 2249, loss: 4.501626, best loss: 3.365786 2025-01-16 01:20:30,070 - INFO - step 2250, loss: 4.067429, best loss: 3.365786 2025-01-16 01:20:30,220 - INFO - step 2251, loss: 3.941510, best loss: 3.365786 2025-01-16 01:20:30,370 - INFO - step 2252, loss: 4.224838, best loss: 3.365786 2025-01-16 01:20:30,520 - INFO - step 2253, loss: 4.334887, best loss: 3.365786 2025-01-16 01:20:30,671 - INFO - step 2254, loss: 3.930769, best loss: 3.365786 2025-01-16 01:20:30,820 - INFO - step 2255, loss: 4.323936, best loss: 3.365786 2025-01-16 01:20:30,970 - INFO - step 2256, loss: 4.160687, best loss: 3.365786 2025-01-16 01:20:31,121 - INFO - step 2257, loss: 3.975210, best loss: 3.365786 2025-01-16 01:20:31,271 - INFO - step 2258, loss: 3.925728, best loss: 3.365786 2025-01-16 01:20:31,420 - INFO - step 2259, loss: 4.193463, best 
loss: 3.365786 2025-01-16 01:20:31,571 - INFO - step 2260, loss: 3.750206, best loss: 3.365786 2025-01-16 01:20:31,720 - INFO - step 2261, loss: 4.044039, best loss: 3.365786 2025-01-16 01:20:31,870 - INFO - step 2262, loss: 3.737053, best loss: 3.365786 2025-01-16 01:20:32,020 - INFO - step 2263, loss: 4.395921, best loss: 3.365786 2025-01-16 01:20:32,170 - INFO - step 2264, loss: 4.705295, best loss: 3.365786 2025-01-16 01:20:32,320 - INFO - step 2265, loss: 5.098071, best loss: 3.365786 2025-01-16 01:20:32,470 - INFO - step 2266, loss: 4.449038, best loss: 3.365786 2025-01-16 01:20:32,620 - INFO - step 2267, loss: 4.689698, best loss: 3.365786 2025-01-16 01:20:32,770 - INFO - step 2268, loss: 4.537678, best loss: 3.365786 2025-01-16 01:20:32,920 - INFO - step 2269, loss: 4.341499, best loss: 3.365786 2025-01-16 01:20:33,070 - INFO - step 2270, loss: 4.206146, best loss: 3.365786 2025-01-16 01:20:33,220 - INFO - step 2271, loss: 4.575915, best loss: 3.365786 2025-01-16 01:20:33,370 - INFO - step 2272, loss: 4.174856, best loss: 3.365786 2025-01-16 01:20:33,520 - INFO - step 2273, loss: 3.846081, best loss: 3.365786 2025-01-16 01:20:33,670 - INFO - step 2274, loss: 4.150860, best loss: 3.365786 2025-01-16 01:20:33,820 - INFO - step 2275, loss: 4.101245, best loss: 3.365786 2025-01-16 01:20:33,970 - INFO - step 2276, loss: 4.343288, best loss: 3.365786 2025-01-16 01:20:34,120 - INFO - step 2277, loss: 3.404500, best loss: 3.365786 2025-01-16 01:20:34,270 - INFO - step 2278, loss: 4.119939, best loss: 3.365786 2025-01-16 01:20:34,421 - INFO - step 2279, loss: 4.317080, best loss: 3.365786 2025-01-16 01:20:34,571 - INFO - step 2280, loss: 4.387280, best loss: 3.365786 2025-01-16 01:20:34,721 - INFO - step 2281, loss: 4.156083, best loss: 3.365786 2025-01-16 01:20:34,871 - INFO - step 2282, loss: 4.237295, best loss: 3.365786 2025-01-16 01:20:35,021 - INFO - step 2283, loss: 4.235803, best loss: 3.365786 2025-01-16 01:20:35,171 - INFO - step 2284, loss: 3.845414, best 
loss: 3.365786 2025-01-16 01:20:35,321 - INFO - step 2285, loss: 4.372179, best loss: 3.365786 2025-01-16 01:20:35,471 - INFO - step 2286, loss: 4.250345, best loss: 3.365786 2025-01-16 01:20:35,621 - INFO - step 2287, loss: 4.261064, best loss: 3.365786 2025-01-16 01:20:35,771 - INFO - step 2288, loss: 3.873139, best loss: 3.365786 2025-01-16 01:20:35,921 - INFO - step 2289, loss: 3.898951, best loss: 3.365786 2025-01-16 01:20:36,071 - INFO - step 2290, loss: 3.949822, best loss: 3.365786 2025-01-16 01:20:36,221 - INFO - step 2291, loss: 3.917259, best loss: 3.365786 2025-01-16 01:20:36,371 - INFO - step 2292, loss: 3.918312, best loss: 3.365786 2025-01-16 01:20:36,521 - INFO - step 2293, loss: 3.982516, best loss: 3.365786 2025-01-16 01:20:36,671 - INFO - step 2294, loss: 3.834057, best loss: 3.365786 2025-01-16 01:20:36,821 - INFO - step 2295, loss: 3.668252, best loss: 3.365786 2025-01-16 01:20:36,971 - INFO - step 2296, loss: 3.556601, best loss: 3.365786 2025-01-16 01:20:40,409 - INFO - step 2297, loss: 3.266408, best loss: 3.266408 2025-01-16 01:20:40,567 - INFO - step 2298, loss: 4.157605, best loss: 3.266408 2025-01-16 01:20:40,717 - INFO - step 2299, loss: 4.624305, best loss: 3.266408 2025-01-16 01:20:40,867 - INFO - step 2300, loss: 4.724160, best loss: 3.266408 2025-01-16 01:20:41,017 - INFO - step 2301, loss: 4.915836, best loss: 3.266408 2025-01-16 01:20:41,167 - INFO - step 2302, loss: 4.915646, best loss: 3.266408 2025-01-16 01:20:41,317 - INFO - step 2303, loss: 4.384615, best loss: 3.266408 2025-01-16 01:20:41,467 - INFO - step 2304, loss: 4.591999, best loss: 3.266408 2025-01-16 01:20:41,617 - INFO - step 2305, loss: 4.754535, best loss: 3.266408 2025-01-16 01:20:41,767 - INFO - step 2306, loss: 4.222084, best loss: 3.266408 2025-01-16 01:20:41,917 - INFO - step 2307, loss: 3.924160, best loss: 3.266408 2025-01-16 01:20:42,066 - INFO - step 2308, loss: 4.299448, best loss: 3.266408 2025-01-16 01:20:42,216 - INFO - step 2309, loss: 4.082626, best 
loss: 3.266408 2025-01-16 01:20:42,366 - INFO - step 2310, loss: 4.693675, best loss: 3.266408 2025-01-16 01:20:42,516 - INFO - step 2311, loss: 4.470674, best loss: 3.266408 2025-01-16 01:20:42,666 - INFO - step 2312, loss: 4.700318, best loss: 3.266408 2025-01-16 01:20:42,816 - INFO - step 2313, loss: 4.465418, best loss: 3.266408 2025-01-16 01:20:42,966 - INFO - step 2314, loss: 4.732350, best loss: 3.266408 2025-01-16 01:20:43,116 - INFO - step 2315, loss: 4.183267, best loss: 3.266408 2025-01-16 01:20:43,267 - INFO - step 2316, loss: 4.544014, best loss: 3.266408 2025-01-16 01:20:43,416 - INFO - step 2317, loss: 4.281565, best loss: 3.266408 2025-01-16 01:20:43,566 - INFO - step 2318, loss: 4.559983, best loss: 3.266408 2025-01-16 01:20:43,716 - INFO - step 2319, loss: 4.352766, best loss: 3.266408 2025-01-16 01:20:43,866 - INFO - step 2320, loss: 4.163868, best loss: 3.266408 2025-01-16 01:20:44,016 - INFO - step 2321, loss: 4.906682, best loss: 3.266408 2025-01-16 01:20:44,166 - INFO - step 2322, loss: 4.056436, best loss: 3.266408 2025-01-16 01:20:44,316 - INFO - step 2323, loss: 4.502529, best loss: 3.266408 2025-01-16 01:20:44,466 - INFO - step 2324, loss: 4.435474, best loss: 3.266408 2025-01-16 01:20:44,616 - INFO - step 2325, loss: 4.492128, best loss: 3.266408 2025-01-16 01:20:44,766 - INFO - step 2326, loss: 4.409434, best loss: 3.266408 2025-01-16 01:20:44,916 - INFO - step 2327, loss: 4.154474, best loss: 3.266408 2025-01-16 01:20:45,066 - INFO - step 2328, loss: 4.069038, best loss: 3.266408 2025-01-16 01:20:45,216 - INFO - step 2329, loss: 4.228340, best loss: 3.266408 2025-01-16 01:20:45,366 - INFO - step 2330, loss: 3.668149, best loss: 3.266408 2025-01-16 01:20:45,516 - INFO - step 2331, loss: 4.535064, best loss: 3.266408 2025-01-16 01:20:45,666 - INFO - step 2332, loss: 3.726462, best loss: 3.266408 2025-01-16 01:20:45,816 - INFO - step 2333, loss: 3.603377, best loss: 3.266408 2025-01-16 01:20:45,966 - INFO - step 2334, loss: 4.172576, best 
loss: 3.266408 2025-01-16 01:20:46,116 - INFO - step 2335, loss: 4.115295, best loss: 3.266408 2025-01-16 01:20:46,266 - INFO - step 2336, loss: 4.111091, best loss: 3.266408 2025-01-16 01:20:46,416 - INFO - step 2337, loss: 3.787727, best loss: 3.266408 2025-01-16 01:20:46,566 - INFO - step 2338, loss: 4.057366, best loss: 3.266408 2025-01-16 01:20:46,716 - INFO - step 2339, loss: 4.120954, best loss: 3.266408 2025-01-16 01:20:46,867 - INFO - step 2340, loss: 4.059935, best loss: 3.266408 2025-01-16 01:20:47,017 - INFO - step 2341, loss: 3.887753, best loss: 3.266408 2025-01-16 01:20:47,167 - INFO - step 2342, loss: 4.343323, best loss: 3.266408 2025-01-16 01:20:47,317 - INFO - step 2343, loss: 4.267031, best loss: 3.266408 2025-01-16 01:20:47,467 - INFO - step 2344, loss: 4.263153, best loss: 3.266408 2025-01-16 01:20:47,617 - INFO - step 2345, loss: 3.844485, best loss: 3.266408 2025-01-16 01:20:47,767 - INFO - step 2346, loss: 4.080109, best loss: 3.266408 2025-01-16 01:20:47,917 - INFO - step 2347, loss: 4.421774, best loss: 3.266408 2025-01-16 01:20:48,067 - INFO - step 2348, loss: 3.916671, best loss: 3.266408 2025-01-16 01:20:48,218 - INFO - step 2349, loss: 4.493367, best loss: 3.266408 2025-01-16 01:20:48,368 - INFO - step 2350, loss: 4.542777, best loss: 3.266408 2025-01-16 01:20:48,518 - INFO - step 2351, loss: 4.472827, best loss: 3.266408 2025-01-16 01:20:48,668 - INFO - step 2352, loss: 4.407744, best loss: 3.266408 2025-01-16 01:20:48,818 - INFO - step 2353, loss: 4.386802, best loss: 3.266408 2025-01-16 01:20:48,968 - INFO - step 2354, loss: 4.264514, best loss: 3.266408 2025-01-16 01:20:49,118 - INFO - step 2355, loss: 4.032127, best loss: 3.266408 2025-01-16 01:20:49,268 - INFO - step 2356, loss: 4.904023, best loss: 3.266408 2025-01-16 01:20:49,420 - INFO - step 2357, loss: 4.346612, best loss: 3.266408 2025-01-16 01:20:49,570 - INFO - step 2358, loss: 4.718446, best loss: 3.266408 2025-01-16 01:20:49,720 - INFO - step 2359, loss: 3.762935, best 
loss: 3.266408 2025-01-16 01:20:49,870 - INFO - step 2360, loss: 3.711178, best loss: 3.266408 2025-01-16 01:20:50,020 - INFO - step 2361, loss: 4.300038, best loss: 3.266408 2025-01-16 01:20:50,169 - INFO - step 2362, loss: 4.357825, best loss: 3.266408 2025-01-16 01:20:50,319 - INFO - step 2363, loss: 4.109884, best loss: 3.266408 2025-01-16 01:20:50,469 - INFO - step 2364, loss: 4.252260, best loss: 3.266408 2025-01-16 01:20:50,618 - INFO - step 2365, loss: 4.038924, best loss: 3.266408 2025-01-16 01:20:50,768 - INFO - step 2366, loss: 4.159401, best loss: 3.266408 2025-01-16 01:20:50,917 - INFO - step 2367, loss: 4.407596, best loss: 3.266408 2025-01-16 01:20:51,067 - INFO - step 2368, loss: 3.959492, best loss: 3.266408 2025-01-16 01:20:51,217 - INFO - step 2369, loss: 4.024813, best loss: 3.266408 2025-01-16 01:20:51,367 - INFO - step 2370, loss: 4.122392, best loss: 3.266408 2025-01-16 01:20:51,517 - INFO - step 2371, loss: 4.342781, best loss: 3.266408 2025-01-16 01:20:51,667 - INFO - step 2372, loss: 4.202404, best loss: 3.266408 2025-01-16 01:20:51,817 - INFO - step 2373, loss: 4.412216, best loss: 3.266408 2025-01-16 01:20:51,967 - INFO - step 2374, loss: 4.159679, best loss: 3.266408 2025-01-16 01:20:52,117 - INFO - step 2375, loss: 3.850692, best loss: 3.266408 2025-01-16 01:20:52,267 - INFO - step 2376, loss: 4.311236, best loss: 3.266408 2025-01-16 01:20:52,418 - INFO - step 2377, loss: 3.580350, best loss: 3.266408 2025-01-16 01:20:52,568 - INFO - step 2378, loss: 3.960049, best loss: 3.266408 2025-01-16 01:20:52,717 - INFO - step 2379, loss: 4.141058, best loss: 3.266408 2025-01-16 01:20:52,867 - INFO - step 2380, loss: 4.033066, best loss: 3.266408 2025-01-16 01:20:53,017 - INFO - step 2381, loss: 3.946978, best loss: 3.266408 2025-01-16 01:20:53,167 - INFO - step 2382, loss: 4.277855, best loss: 3.266408 2025-01-16 01:20:53,317 - INFO - step 2383, loss: 4.511716, best loss: 3.266408 2025-01-16 01:20:53,468 - INFO - step 2384, loss: 4.268087, best 
loss: 3.266408 2025-01-16 01:20:53,618 - INFO - step 2385, loss: 4.530976, best loss: 3.266408 2025-01-16 01:20:53,767 - INFO - step 2386, loss: 4.078174, best loss: 3.266408 2025-01-16 01:20:53,918 - INFO - step 2387, loss: 4.185123, best loss: 3.266408 2025-01-16 01:20:54,068 - INFO - step 2388, loss: 4.037674, best loss: 3.266408 2025-01-16 01:20:54,218 - INFO - step 2389, loss: 3.756581, best loss: 3.266408 2025-01-16 01:20:54,368 - INFO - step 2390, loss: 4.579285, best loss: 3.266408 2025-01-16 01:20:54,518 - INFO - step 2391, loss: 4.524762, best loss: 3.266408 2025-01-16 01:20:54,668 - INFO - step 2392, loss: 4.195286, best loss: 3.266408 2025-01-16 01:20:54,818 - INFO - step 2393, loss: 4.045249, best loss: 3.266408 2025-01-16 01:20:54,968 - INFO - step 2394, loss: 3.721565, best loss: 3.266408 2025-01-16 01:20:55,118 - INFO - step 2395, loss: 3.946686, best loss: 3.266408 2025-01-16 01:20:55,268 - INFO - step 2396, loss: 3.898959, best loss: 3.266408 2025-01-16 01:20:55,417 - INFO - step 2397, loss: 3.810281, best loss: 3.266408 2025-01-16 01:20:55,568 - INFO - step 2398, loss: 4.542048, best loss: 3.266408 2025-01-16 01:20:55,718 - INFO - step 2399, loss: 4.489064, best loss: 3.266408 2025-01-16 01:20:55,868 - INFO - step 2400, loss: 4.273386, best loss: 3.266408 2025-01-16 01:20:56,018 - INFO - step 2401, loss: 4.576273, best loss: 3.266408 2025-01-16 01:20:56,168 - INFO - step 2402, loss: 4.212069, best loss: 3.266408 2025-01-16 01:20:56,318 - INFO - step 2403, loss: 4.575039, best loss: 3.266408 2025-01-16 01:20:56,468 - INFO - step 2404, loss: 4.742218, best loss: 3.266408 2025-01-16 01:20:56,618 - INFO - step 2405, loss: 4.718855, best loss: 3.266408 2025-01-16 01:20:56,768 - INFO - step 2406, loss: 4.729608, best loss: 3.266408 2025-01-16 01:20:56,918 - INFO - step 2407, loss: 4.460126, best loss: 3.266408 2025-01-16 01:20:57,068 - INFO - step 2408, loss: 4.734580, best loss: 3.266408 2025-01-16 01:20:57,218 - INFO - step 2409, loss: 4.455766, best 
loss: 3.266408 2025-01-16 01:20:57,368 - INFO - step 2410, loss: 4.217339, best loss: 3.266408 2025-01-16 01:20:57,519 - INFO - step 2411, loss: 4.702334, best loss: 3.266408 2025-01-16 01:20:57,669 - INFO - step 2412, loss: 4.692269, best loss: 3.266408 2025-01-16 01:20:57,819 - INFO - step 2413, loss: 4.537218, best loss: 3.266408 2025-01-16 01:20:57,969 - INFO - step 2414, loss: 4.365360, best loss: 3.266408 2025-01-16 01:20:58,119 - INFO - step 2415, loss: 4.676099, best loss: 3.266408 2025-01-16 01:20:58,268 - INFO - step 2416, loss: 4.391534, best loss: 3.266408 2025-01-16 01:20:58,418 - INFO - step 2417, loss: 4.161654, best loss: 3.266408 2025-01-16 01:20:58,569 - INFO - step 2418, loss: 4.104646, best loss: 3.266408 2025-01-16 01:20:58,719 - INFO - step 2419, loss: 4.532625, best loss: 3.266408 2025-01-16 01:20:58,868 - INFO - step 2420, loss: 4.543839, best loss: 3.266408 2025-01-16 01:20:59,019 - INFO - step 2421, loss: 4.766882, best loss: 3.266408 2025-01-16 01:20:59,169 - INFO - step 2422, loss: 4.602414, best loss: 3.266408 2025-01-16 01:20:59,319 - INFO - step 2423, loss: 4.199040, best loss: 3.266408 2025-01-16 01:20:59,469 - INFO - step 2424, loss: 4.629328, best loss: 3.266408 2025-01-16 01:20:59,619 - INFO - step 2425, loss: 4.195237, best loss: 3.266408 2025-01-16 01:20:59,769 - INFO - step 2426, loss: 4.656229, best loss: 3.266408 2025-01-16 01:20:59,919 - INFO - step 2427, loss: 4.218840, best loss: 3.266408 2025-01-16 01:21:00,069 - INFO - step 2428, loss: 4.361217, best loss: 3.266408 2025-01-16 01:21:00,219 - INFO - step 2429, loss: 4.304898, best loss: 3.266408 2025-01-16 01:21:00,369 - INFO - step 2430, loss: 4.364471, best loss: 3.266408 2025-01-16 01:21:00,519 - INFO - step 2431, loss: 4.180048, best loss: 3.266408 2025-01-16 01:21:00,669 - INFO - step 2432, loss: 4.395767, best loss: 3.266408 2025-01-16 01:21:00,819 - INFO - step 2433, loss: 3.987320, best loss: 3.266408 2025-01-16 01:21:00,969 - INFO - step 2434, loss: 3.605495, best 
loss: 3.266408 2025-01-16 01:21:01,119 - INFO - step 2435, loss: 3.822858, best loss: 3.266408 2025-01-16 01:21:01,269 - INFO - step 2436, loss: 4.086831, best loss: 3.266408 2025-01-16 01:21:01,420 - INFO - step 2437, loss: 4.786025, best loss: 3.266408 2025-01-16 01:21:01,570 - INFO - step 2438, loss: 4.305030, best loss: 3.266408 2025-01-16 01:21:01,720 - INFO - step 2439, loss: 4.319425, best loss: 3.266408 2025-01-16 01:21:01,870 - INFO - step 2440, loss: 4.607520, best loss: 3.266408 2025-01-16 01:21:02,020 - INFO - step 2441, loss: 4.169061, best loss: 3.266408 2025-01-16 01:21:02,170 - INFO - step 2442, loss: 4.857291, best loss: 3.266408 2025-01-16 01:21:02,320 - INFO - step 2443, loss: 4.174371, best loss: 3.266408 2025-01-16 01:21:02,470 - INFO - step 2444, loss: 4.487708, best loss: 3.266408 2025-01-16 01:21:02,620 - INFO - step 2445, loss: 4.175857, best loss: 3.266408 2025-01-16 01:21:02,770 - INFO - step 2446, loss: 4.857378, best loss: 3.266408 2025-01-16 01:21:02,920 - INFO - step 2447, loss: 4.296950, best loss: 3.266408 2025-01-16 01:21:03,070 - INFO - step 2448, loss: 4.009399, best loss: 3.266408 2025-01-16 01:21:03,220 - INFO - step 2449, loss: 4.508677, best loss: 3.266408 2025-01-16 01:21:03,370 - INFO - step 2450, loss: 4.161132, best loss: 3.266408 2025-01-16 01:21:03,520 - INFO - step 2451, loss: 3.939823, best loss: 3.266408 2025-01-16 01:21:03,670 - INFO - step 2452, loss: 4.852427, best loss: 3.266408 2025-01-16 01:21:03,819 - INFO - step 2453, loss: 4.377810, best loss: 3.266408 2025-01-16 01:21:03,970 - INFO - step 2454, loss: 3.791517, best loss: 3.266408 2025-01-16 01:21:04,120 - INFO - step 2455, loss: 4.093227, best loss: 3.266408 2025-01-16 01:21:04,269 - INFO - step 2456, loss: 4.411355, best loss: 3.266408 2025-01-16 01:21:04,419 - INFO - step 2457, loss: 4.283479, best loss: 3.266408 2025-01-16 01:21:04,569 - INFO - step 2458, loss: 4.262252, best loss: 3.266408 2025-01-16 01:21:04,719 - INFO - step 2459, loss: 4.064022, best 
loss: 3.266408 2025-01-16 01:21:04,869 - INFO - step 2460, loss: 4.695533, best loss: 3.266408 2025-01-16 01:21:05,019 - INFO - step 2461, loss: 4.463230, best loss: 3.266408 2025-01-16 01:21:05,169 - INFO - step 2462, loss: 4.349517, best loss: 3.266408 2025-01-16 01:21:05,319 - INFO - step 2463, loss: 4.035236, best loss: 3.266408 2025-01-16 01:21:05,469 - INFO - step 2464, loss: 4.364019, best loss: 3.266408 2025-01-16 01:21:05,619 - INFO - step 2465, loss: 4.189874, best loss: 3.266408 2025-01-16 01:21:05,769 - INFO - step 2466, loss: 3.801838, best loss: 3.266408 2025-01-16 01:21:05,919 - INFO - step 2467, loss: 4.341357, best loss: 3.266408 2025-01-16 01:21:06,069 - INFO - step 2468, loss: 3.929942, best loss: 3.266408 2025-01-16 01:21:06,219 - INFO - step 2469, loss: 4.354929, best loss: 3.266408 2025-01-16 01:21:06,369 - INFO - step 2470, loss: 4.064621, best loss: 3.266408 2025-01-16 01:21:06,519 - INFO - step 2471, loss: 4.204679, best loss: 3.266408 2025-01-16 01:21:06,669 - INFO - step 2472, loss: 4.092371, best loss: 3.266408 2025-01-16 01:21:06,818 - INFO - step 2473, loss: 4.435764, best loss: 3.266408 2025-01-16 01:21:06,969 - INFO - step 2474, loss: 4.736518, best loss: 3.266408 2025-01-16 01:21:07,119 - INFO - step 2475, loss: 4.436011, best loss: 3.266408 2025-01-16 01:21:07,269 - INFO - step 2476, loss: 4.572707, best loss: 3.266408 2025-01-16 01:21:07,419 - INFO - step 2477, loss: 4.038981, best loss: 3.266408 2025-01-16 01:21:07,569 - INFO - step 2478, loss: 4.397570, best loss: 3.266408 2025-01-16 01:21:07,719 - INFO - step 2479, loss: 4.477445, best loss: 3.266408 2025-01-16 01:21:07,869 - INFO - step 2480, loss: 4.245440, best loss: 3.266408 2025-01-16 01:21:08,019 - INFO - step 2481, loss: 4.069747, best loss: 3.266408 2025-01-16 01:21:08,169 - INFO - step 2482, loss: 4.084984, best loss: 3.266408 2025-01-16 01:21:08,319 - INFO - step 2483, loss: 4.153677, best loss: 3.266408 2025-01-16 01:21:08,469 - INFO - step 2484, loss: 4.283092, best 
loss: 3.266408 2025-01-16 01:21:08,619 - INFO - step 2485, loss: 4.661273, best loss: 3.266408 2025-01-16 01:21:08,768 - INFO - step 2486, loss: 4.725348, best loss: 3.266408 2025-01-16 01:21:08,918 - INFO - step 2487, loss: 4.514959, best loss: 3.266408 2025-01-16 01:21:09,068 - INFO - step 2488, loss: 4.626453, best loss: 3.266408 2025-01-16 01:21:09,219 - INFO - step 2489, loss: 4.479562, best loss: 3.266408 2025-01-16 01:21:09,369 - INFO - step 2490, loss: 4.423825, best loss: 3.266408 2025-01-16 01:21:09,519 - INFO - step 2491, loss: 3.808102, best loss: 3.266408 2025-01-16 01:21:09,669 - INFO - step 2492, loss: 4.478771, best loss: 3.266408 2025-01-16 01:21:09,819 - INFO - step 2493, loss: 4.790782, best loss: 3.266408 2025-01-16 01:21:09,969 - INFO - step 2494, loss: 4.362581, best loss: 3.266408 2025-01-16 01:21:10,119 - INFO - step 2495, loss: 4.456277, best loss: 3.266408 2025-01-16 01:21:10,269 - INFO - step 2496, loss: 4.443572, best loss: 3.266408 2025-01-16 01:21:10,420 - INFO - step 2497, loss: 3.925860, best loss: 3.266408 2025-01-16 01:21:10,569 - INFO - step 2498, loss: 3.272314, best loss: 3.266408 2025-01-16 01:21:10,719 - INFO - step 2499, loss: 4.197223, best loss: 3.266408 2025-01-16 01:21:10,870 - INFO - step 2500, loss: 4.453618, best loss: 3.266408 2025-01-16 01:21:11,019 - INFO - step 2501, loss: 4.485469, best loss: 3.266408 2025-01-16 01:21:11,170 - INFO - step 2502, loss: 4.387918, best loss: 3.266408 2025-01-16 01:21:11,320 - INFO - step 2503, loss: 4.053741, best loss: 3.266408 2025-01-16 01:21:11,470 - INFO - step 2504, loss: 4.256951, best loss: 3.266408 2025-01-16 01:21:11,620 - INFO - step 2505, loss: 4.361398, best loss: 3.266408 2025-01-16 01:21:11,770 - INFO - step 2506, loss: 4.050495, best loss: 3.266408 2025-01-16 01:21:11,920 - INFO - step 2507, loss: 4.261132, best loss: 3.266408 2025-01-16 01:21:12,070 - INFO - step 2508, loss: 4.144002, best loss: 3.266408 2025-01-16 01:21:12,220 - INFO - step 2509, loss: 3.958461, best 
loss: 3.266408 2025-01-16 01:21:12,370 - INFO - step 2510, loss: 4.408634, best loss: 3.266408 2025-01-16 01:21:12,520 - INFO - step 2511, loss: 3.859579, best loss: 3.266408 2025-01-16 01:21:12,670 - INFO - step 2512, loss: 4.236633, best loss: 3.266408 2025-01-16 01:21:12,820 - INFO - step 2513, loss: 4.544575, best loss: 3.266408 2025-01-16 01:21:12,970 - INFO - step 2514, loss: 4.145456, best loss: 3.266408 2025-01-16 01:21:13,120 - INFO - step 2515, loss: 3.678384, best loss: 3.266408 2025-01-16 01:21:13,270 - INFO - step 2516, loss: 4.298752, best loss: 3.266408 2025-01-16 01:21:13,421 - INFO - step 2517, loss: 4.518021, best loss: 3.266408 2025-01-16 01:21:13,571 - INFO - step 2518, loss: 4.725917, best loss: 3.266408 2025-01-16 01:21:13,721 - INFO - step 2519, loss: 4.479445, best loss: 3.266408 2025-01-16 01:21:13,871 - INFO - step 2520, loss: 4.618215, best loss: 3.266408 2025-01-16 01:21:14,021 - INFO - step 2521, loss: 4.462130, best loss: 3.266408 2025-01-16 01:21:14,171 - INFO - step 2522, loss: 4.733547, best loss: 3.266408 2025-01-16 01:21:14,321 - INFO - step 2523, loss: 4.155368, best loss: 3.266408 2025-01-16 01:21:14,471 - INFO - step 2524, loss: 4.254314, best loss: 3.266408 2025-01-16 01:21:14,621 - INFO - step 2525, loss: 4.399068, best loss: 3.266408 2025-01-16 01:21:14,771 - INFO - step 2526, loss: 4.440727, best loss: 3.266408 2025-01-16 01:21:14,921 - INFO - step 2527, loss: 4.438764, best loss: 3.266408 2025-01-16 01:21:15,071 - INFO - step 2528, loss: 4.214522, best loss: 3.266408 2025-01-16 01:21:15,221 - INFO - step 2529, loss: 4.213889, best loss: 3.266408 2025-01-16 01:21:15,371 - INFO - step 2530, loss: 4.427793, best loss: 3.266408 2025-01-16 01:21:15,521 - INFO - step 2531, loss: 4.594188, best loss: 3.266408 2025-01-16 01:21:15,671 - INFO - step 2532, loss: 4.289289, best loss: 3.266408 2025-01-16 01:21:15,821 - INFO - step 2533, loss: 4.691721, best loss: 3.266408 2025-01-16 01:21:15,971 - INFO - step 2534, loss: 4.717841, best 
loss: 3.266408 2025-01-16 01:21:16,121 - INFO - step 2535, loss: 4.841056, best loss: 3.266408 2025-01-16 01:21:16,271 - INFO - step 2536, loss: 4.976285, best loss: 3.266408 2025-01-16 01:21:16,421 - INFO - step 2537, loss: 4.937793, best loss: 3.266408 2025-01-16 01:21:16,571 - INFO - step 2538, loss: 4.208027, best loss: 3.266408 2025-01-16 01:21:16,721 - INFO - step 2539, loss: 4.672365, best loss: 3.266408 2025-01-16 01:21:16,871 - INFO - step 2540, loss: 4.624802, best loss: 3.266408 2025-01-16 01:21:17,021 - INFO - step 2541, loss: 4.783800, best loss: 3.266408 2025-01-16 01:21:17,171 - INFO - step 2542, loss: 4.196152, best loss: 3.266408 2025-01-16 01:21:17,321 - INFO - step 2543, loss: 4.448402, best loss: 3.266408 2025-01-16 01:21:17,471 - INFO - step 2544, loss: 4.484107, best loss: 3.266408 2025-01-16 01:21:17,621 - INFO - step 2545, loss: 4.368480, best loss: 3.266408 2025-01-16 01:21:17,771 - INFO - step 2546, loss: 4.640105, best loss: 3.266408 2025-01-16 01:21:17,921 - INFO - step 2547, loss: 4.080968, best loss: 3.266408 2025-01-16 01:21:18,071 - INFO - step 2548, loss: 4.112761, best loss: 3.266408 2025-01-16 01:21:18,221 - INFO - step 2549, loss: 4.601885, best loss: 3.266408 2025-01-16 01:21:18,371 - INFO - step 2550, loss: 4.439837, best loss: 3.266408 2025-01-16 01:21:18,521 - INFO - step 2551, loss: 4.655319, best loss: 3.266408 2025-01-16 01:21:18,671 - INFO - step 2552, loss: 4.373237, best loss: 3.266408 2025-01-16 01:21:18,821 - INFO - step 2553, loss: 4.912389, best loss: 3.266408 2025-01-16 01:21:18,972 - INFO - step 2554, loss: 4.684001, best loss: 3.266408 2025-01-16 01:21:19,122 - INFO - step 2555, loss: 4.216092, best loss: 3.266408 2025-01-16 01:21:19,272 - INFO - step 2556, loss: 4.169446, best loss: 3.266408 2025-01-16 01:21:19,423 - INFO - step 2557, loss: 4.676332, best loss: 3.266408 2025-01-16 01:21:19,573 - INFO - step 2558, loss: 4.273833, best loss: 3.266408 2025-01-16 01:21:19,723 - INFO - step 2559, loss: 4.193944, best 
loss: 3.266408 2025-01-16 01:21:19,874 - INFO - step 2560, loss: 4.559364, best loss: 3.266408 2025-01-16 01:21:20,024 - INFO - step 2561, loss: 4.502153, best loss: 3.266408 2025-01-16 01:21:20,174 - INFO - step 2562, loss: 4.473485, best loss: 3.266408 2025-01-16 01:21:20,324 - INFO - step 2563, loss: 4.104455, best loss: 3.266408 2025-01-16 01:21:20,473 - INFO - step 2564, loss: 3.863728, best loss: 3.266408 2025-01-16 01:21:20,624 - INFO - step 2565, loss: 3.923200, best loss: 3.266408 2025-01-16 01:21:20,774 - INFO - step 2566, loss: 3.864806, best loss: 3.266408 2025-01-16 01:21:20,924 - INFO - step 2567, loss: 3.876156, best loss: 3.266408 2025-01-16 01:21:21,073 - INFO - step 2568, loss: 4.345301, best loss: 3.266408 2025-01-16 01:21:21,223 - INFO - step 2569, loss: 4.089262, best loss: 3.266408 2025-01-16 01:21:21,374 - INFO - step 2570, loss: 4.043327, best loss: 3.266408 2025-01-16 01:21:21,524 - INFO - step 2571, loss: 4.256227, best loss: 3.266408 2025-01-16 01:21:21,674 - INFO - step 2572, loss: 4.208452, best loss: 3.266408 2025-01-16 01:21:21,824 - INFO - step 2573, loss: 4.182690, best loss: 3.266408 2025-01-16 01:21:21,974 - INFO - step 2574, loss: 4.301821, best loss: 3.266408 2025-01-16 01:21:22,124 - INFO - step 2575, loss: 4.554548, best loss: 3.266408 2025-01-16 01:21:22,274 - INFO - step 2576, loss: 3.974390, best loss: 3.266408 2025-01-16 01:21:22,424 - INFO - step 2577, loss: 3.952582, best loss: 3.266408 2025-01-16 01:21:22,575 - INFO - step 2578, loss: 4.267575, best loss: 3.266408 2025-01-16 01:21:22,724 - INFO - step 2579, loss: 4.399294, best loss: 3.266408 2025-01-16 01:21:22,875 - INFO - step 2580, loss: 3.909039, best loss: 3.266408 2025-01-16 01:21:23,025 - INFO - step 2581, loss: 3.802920, best loss: 3.266408 2025-01-16 01:21:23,175 - INFO - step 2582, loss: 4.149318, best loss: 3.266408 2025-01-16 01:21:23,325 - INFO - step 2583, loss: 4.190588, best loss: 3.266408 2025-01-16 01:21:23,475 - INFO - step 2584, loss: 3.811419, best 
loss: 3.266408 2025-01-16 01:21:23,625 - INFO - step 2585, loss: 4.255421, best loss: 3.266408 2025-01-16 01:21:23,775 - INFO - step 2586, loss: 4.033162, best loss: 3.266408 2025-01-16 01:21:23,925 - INFO - step 2587, loss: 3.897357, best loss: 3.266408 2025-01-16 01:21:24,075 - INFO - step 2588, loss: 3.813999, best loss: 3.266408 2025-01-16 01:21:24,225 - INFO - step 2589, loss: 4.100793, best loss: 3.266408 2025-01-16 01:21:24,375 - INFO - step 2590, loss: 3.654561, best loss: 3.266408 2025-01-16 01:21:24,525 - INFO - step 2591, loss: 3.960396, best loss: 3.266408 2025-01-16 01:21:24,675 - INFO - step 2592, loss: 3.655372, best loss: 3.266408 2025-01-16 01:21:24,826 - INFO - step 2593, loss: 4.291556, best loss: 3.266408 2025-01-16 01:21:24,976 - INFO - step 2594, loss: 4.563923, best loss: 3.266408 2025-01-16 01:21:25,126 - INFO - step 2595, loss: 5.001992, best loss: 3.266408 2025-01-16 01:21:25,276 - INFO - step 2596, loss: 4.340372, best loss: 3.266408 2025-01-16 01:21:25,426 - INFO - step 2597, loss: 4.537808, best loss: 3.266408 2025-01-16 01:21:25,576 - INFO - step 2598, loss: 4.415127, best loss: 3.266408 2025-01-16 01:21:25,726 - INFO - step 2599, loss: 4.270083, best loss: 3.266408 2025-01-16 01:21:25,876 - INFO - step 2600, loss: 4.123645, best loss: 3.266408 2025-01-16 01:21:26,026 - INFO - step 2601, loss: 4.435830, best loss: 3.266408 2025-01-16 01:21:26,176 - INFO - step 2602, loss: 4.069849, best loss: 3.266408 2025-01-16 01:21:26,326 - INFO - step 2603, loss: 3.773145, best loss: 3.266408 2025-01-16 01:21:26,476 - INFO - step 2604, loss: 4.087649, best loss: 3.266408 2025-01-16 01:21:26,626 - INFO - step 2605, loss: 3.985808, best loss: 3.266408 2025-01-16 01:21:26,776 - INFO - step 2606, loss: 4.223623, best loss: 3.266408 2025-01-16 01:21:26,926 - INFO - step 2607, loss: 3.316100, best loss: 3.266408 2025-01-16 01:21:27,076 - INFO - step 2608, loss: 4.088923, best loss: 3.266408 2025-01-16 01:21:27,226 - INFO - step 2609, loss: 4.264326, best 
loss: 3.266408 2025-01-16 01:21:27,376 - INFO - step 2610, loss: 4.333745, best loss: 3.266408 2025-01-16 01:21:27,526 - INFO - step 2611, loss: 4.056658, best loss: 3.266408 2025-01-16 01:21:27,676 - INFO - step 2612, loss: 4.120752, best loss: 3.266408 2025-01-16 01:21:27,826 - INFO - step 2613, loss: 4.157048, best loss: 3.266408 2025-01-16 01:21:27,977 - INFO - step 2614, loss: 3.813728, best loss: 3.266408 2025-01-16 01:21:28,127 - INFO - step 2615, loss: 4.288506, best loss: 3.266408 2025-01-16 01:21:28,277 - INFO - step 2616, loss: 4.172612, best loss: 3.266408 2025-01-16 01:21:28,427 - INFO - step 2617, loss: 4.143210, best loss: 3.266408 2025-01-16 01:21:28,577 - INFO - step 2618, loss: 3.744174, best loss: 3.266408 2025-01-16 01:21:28,727 - INFO - step 2619, loss: 3.827562, best loss: 3.266408 2025-01-16 01:21:28,877 - INFO - step 2620, loss: 3.894173, best loss: 3.266408 2025-01-16 01:21:29,027 - INFO - step 2621, loss: 3.816056, best loss: 3.266408 2025-01-16 01:21:29,178 - INFO - step 2622, loss: 3.862714, best loss: 3.266408 2025-01-16 01:21:29,328 - INFO - step 2623, loss: 3.891937, best loss: 3.266408 2025-01-16 01:21:29,478 - INFO - step 2624, loss: 3.780507, best loss: 3.266408 2025-01-16 01:21:29,629 - INFO - step 2625, loss: 3.566047, best loss: 3.266408 2025-01-16 01:21:29,779 - INFO - step 2626, loss: 3.447699, best loss: 3.266408 2025-01-16 01:21:33,216 - INFO - step 2627, loss: 3.169164, best loss: 3.169164 2025-01-16 01:21:33,375 - INFO - step 2628, loss: 4.077272, best loss: 3.169164 2025-01-16 01:21:33,527 - INFO - step 2629, loss: 4.523200, best loss: 3.169164 2025-01-16 01:21:33,677 - INFO - step 2630, loss: 4.626702, best loss: 3.169164 2025-01-16 01:21:33,827 - INFO - step 2631, loss: 4.764953, best loss: 3.169164 2025-01-16 01:21:33,977 - INFO - step 2632, loss: 4.850917, best loss: 3.169164 2025-01-16 01:21:34,127 - INFO - step 2633, loss: 4.298967, best loss: 3.169164 2025-01-16 01:21:34,277 - INFO - step 2634, loss: 4.505284, best 
loss: 3.169164 2025-01-16 01:21:34,428 - INFO - step 2635, loss: 4.605688, best loss: 3.169164 2025-01-16 01:21:34,578 - INFO - step 2636, loss: 4.134974, best loss: 3.169164 2025-01-16 01:21:34,728 - INFO - step 2637, loss: 3.687586, best loss: 3.169164 2025-01-16 01:21:34,878 - INFO - step 2638, loss: 4.125874, best loss: 3.169164 2025-01-16 01:21:35,029 - INFO - step 2639, loss: 3.947093, best loss: 3.169164 2025-01-16 01:21:35,179 - INFO - step 2640, loss: 4.578958, best loss: 3.169164 2025-01-16 01:21:35,329 - INFO - step 2641, loss: 4.396006, best loss: 3.169164 2025-01-16 01:21:35,479 - INFO - step 2642, loss: 4.572675, best loss: 3.169164 2025-01-16 01:21:35,629 - INFO - step 2643, loss: 4.282512, best loss: 3.169164 2025-01-16 01:21:35,779 - INFO - step 2644, loss: 4.574563, best loss: 3.169164 2025-01-16 01:21:35,930 - INFO - step 2645, loss: 4.107800, best loss: 3.169164 2025-01-16 01:21:36,080 - INFO - step 2646, loss: 4.484842, best loss: 3.169164 2025-01-16 01:21:36,230 - INFO - step 2647, loss: 4.192881, best loss: 3.169164 2025-01-16 01:21:36,381 - INFO - step 2648, loss: 4.426904, best loss: 3.169164 2025-01-16 01:21:36,531 - INFO - step 2649, loss: 4.265919, best loss: 3.169164 2025-01-16 01:21:36,681 - INFO - step 2650, loss: 4.094894, best loss: 3.169164 2025-01-16 01:21:36,831 - INFO - step 2651, loss: 4.788928, best loss: 3.169164 2025-01-16 01:21:36,981 - INFO - step 2652, loss: 4.009277, best loss: 3.169164 2025-01-16 01:21:37,132 - INFO - step 2653, loss: 4.423936, best loss: 3.169164 2025-01-16 01:21:37,282 - INFO - step 2654, loss: 4.362654, best loss: 3.169164 2025-01-16 01:21:37,432 - INFO - step 2655, loss: 4.396146, best loss: 3.169164 2025-01-16 01:21:37,582 - INFO - step 2656, loss: 4.292604, best loss: 3.169164 2025-01-16 01:21:37,732 - INFO - step 2657, loss: 3.973027, best loss: 3.169164 2025-01-16 01:21:37,882 - INFO - step 2658, loss: 3.977349, best loss: 3.169164 2025-01-16 01:21:38,032 - INFO - step 2659, loss: 4.138554, best 
loss: 3.169164 2025-01-16 01:21:38,182 - INFO - step 2660, loss: 3.607742, best loss: 3.169164 2025-01-16 01:21:38,333 - INFO - step 2661, loss: 4.449225, best loss: 3.169164 2025-01-16 01:21:38,483 - INFO - step 2662, loss: 3.673233, best loss: 3.169164 2025-01-16 01:21:38,633 - INFO - step 2663, loss: 3.513101, best loss: 3.169164 2025-01-16 01:21:38,783 - INFO - step 2664, loss: 4.073444, best loss: 3.169164 2025-01-16 01:21:38,933 - INFO - step 2665, loss: 4.059529, best loss: 3.169164 2025-01-16 01:21:39,083 - INFO - step 2666, loss: 4.038798, best loss: 3.169164 2025-01-16 01:21:39,233 - INFO - step 2667, loss: 3.669594, best loss: 3.169164 2025-01-16 01:21:39,383 - INFO - step 2668, loss: 3.985269, best loss: 3.169164 2025-01-16 01:21:39,533 - INFO - step 2669, loss: 4.045423, best loss: 3.169164 2025-01-16 01:21:39,684 - INFO - step 2670, loss: 3.918946, best loss: 3.169164 2025-01-16 01:21:39,834 - INFO - step 2671, loss: 3.770319, best loss: 3.169164 2025-01-16 01:21:39,984 - INFO - step 2672, loss: 4.217131, best loss: 3.169164 2025-01-16 01:21:40,134 - INFO - step 2673, loss: 4.120239, best loss: 3.169164 2025-01-16 01:21:40,284 - INFO - step 2674, loss: 4.139420, best loss: 3.169164 2025-01-16 01:21:40,434 - INFO - step 2675, loss: 3.723016, best loss: 3.169164 2025-01-16 01:21:40,584 - INFO - step 2676, loss: 3.971524, best loss: 3.169164 2025-01-16 01:21:40,734 - INFO - step 2677, loss: 4.298065, best loss: 3.169164 2025-01-16 01:21:40,884 - INFO - step 2678, loss: 3.829519, best loss: 3.169164 2025-01-16 01:21:41,034 - INFO - step 2679, loss: 4.337348, best loss: 3.169164 2025-01-16 01:21:41,185 - INFO - step 2680, loss: 4.397766, best loss: 3.169164 2025-01-16 01:21:41,335 - INFO - step 2681, loss: 4.309866, best loss: 3.169164 2025-01-16 01:21:41,485 - INFO - step 2682, loss: 4.276903, best loss: 3.169164 2025-01-16 01:21:41,635 - INFO - step 2683, loss: 4.253493, best loss: 3.169164 2025-01-16 01:21:41,785 - INFO - step 2684, loss: 4.157745, best 
loss: 3.169164 2025-01-16 01:21:41,935 - INFO - step 2685, loss: 3.917579, best loss: 3.169164 2025-01-16 01:21:42,086 - INFO - step 2686, loss: 4.795050, best loss: 3.169164 2025-01-16 01:21:42,236 - INFO - step 2687, loss: 4.181700, best loss: 3.169164 2025-01-16 01:21:42,386 - INFO - step 2688, loss: 4.568418, best loss: 3.169164 2025-01-16 01:21:42,536 - INFO - step 2689, loss: 3.559148, best loss: 3.169164 2025-01-16 01:21:42,686 - INFO - step 2690, loss: 3.560574, best loss: 3.169164 2025-01-16 01:21:42,836 - INFO - step 2691, loss: 4.155050, best loss: 3.169164 2025-01-16 01:21:42,986 - INFO - step 2692, loss: 4.156716, best loss: 3.169164 2025-01-16 01:21:43,136 - INFO - step 2693, loss: 3.969157, best loss: 3.169164 2025-01-16 01:21:43,287 - INFO - step 2694, loss: 4.090976, best loss: 3.169164 2025-01-16 01:21:43,437 - INFO - step 2695, loss: 3.896957, best loss: 3.169164 2025-01-16 01:21:43,587 - INFO - step 2696, loss: 4.037840, best loss: 3.169164 2025-01-16 01:21:43,737 - INFO - step 2697, loss: 4.282967, best loss: 3.169164 2025-01-16 01:21:43,888 - INFO - step 2698, loss: 3.796077, best loss: 3.169164 2025-01-16 01:21:44,038 - INFO - step 2699, loss: 3.894114, best loss: 3.169164 2025-01-16 01:21:44,188 - INFO - step 2700, loss: 3.999828, best loss: 3.169164 2025-01-16 01:21:44,338 - INFO - step 2701, loss: 4.215703, best loss: 3.169164 2025-01-16 01:21:44,488 - INFO - step 2702, loss: 4.068679, best loss: 3.169164 2025-01-16 01:21:44,639 - INFO - step 2703, loss: 4.298558, best loss: 3.169164 2025-01-16 01:21:44,789 - INFO - step 2704, loss: 4.089968, best loss: 3.169164 2025-01-16 01:21:44,939 - INFO - step 2705, loss: 3.715303, best loss: 3.169164 2025-01-16 01:21:45,089 - INFO - step 2706, loss: 4.131111, best loss: 3.169164 2025-01-16 01:21:45,239 - INFO - step 2707, loss: 3.434660, best loss: 3.169164 2025-01-16 01:21:45,389 - INFO - step 2708, loss: 3.865018, best loss: 3.169164 2025-01-16 01:21:45,539 - INFO - step 2709, loss: 4.067457, best 
loss: 3.169164 2025-01-16 01:21:45,690 - INFO - step 2710, loss: 3.968171, best loss: 3.169164 2025-01-16 01:21:45,840 - INFO - step 2711, loss: 3.849212, best loss: 3.169164 2025-01-16 01:21:45,990 - INFO - step 2712, loss: 4.148001, best loss: 3.169164 2025-01-16 01:21:46,140 - INFO - step 2713, loss: 4.424646, best loss: 3.169164 2025-01-16 01:21:46,291 - INFO - step 2714, loss: 4.211203, best loss: 3.169164 2025-01-16 01:21:46,441 - INFO - step 2715, loss: 4.431274, best loss: 3.169164 2025-01-16 01:21:46,591 - INFO - step 2716, loss: 3.985402, best loss: 3.169164 2025-01-16 01:21:46,741 - INFO - step 2717, loss: 4.093046, best loss: 3.169164 2025-01-16 01:21:46,891 - INFO - step 2718, loss: 3.902739, best loss: 3.169164 2025-01-16 01:21:47,041 - INFO - step 2719, loss: 3.616662, best loss: 3.169164 2025-01-16 01:21:47,191 - INFO - step 2720, loss: 4.429454, best loss: 3.169164 2025-01-16 01:21:47,341 - INFO - step 2721, loss: 4.370273, best loss: 3.169164 2025-01-16 01:21:47,491 - INFO - step 2722, loss: 4.028874, best loss: 3.169164 2025-01-16 01:21:47,642 - INFO - step 2723, loss: 3.930635, best loss: 3.169164 2025-01-16 01:21:47,792 - INFO - step 2724, loss: 3.633518, best loss: 3.169164 2025-01-16 01:21:47,942 - INFO - step 2725, loss: 3.851778, best loss: 3.169164 2025-01-16 01:21:48,092 - INFO - step 2726, loss: 3.817730, best loss: 3.169164 2025-01-16 01:21:48,243 - INFO - step 2727, loss: 3.736901, best loss: 3.169164 2025-01-16 01:21:48,393 - INFO - step 2728, loss: 4.431311, best loss: 3.169164 2025-01-16 01:21:48,543 - INFO - step 2729, loss: 4.371430, best loss: 3.169164 2025-01-16 01:21:48,693 - INFO - step 2730, loss: 4.155304, best loss: 3.169164 2025-01-16 01:21:48,843 - INFO - step 2731, loss: 4.461221, best loss: 3.169164 2025-01-16 01:21:48,993 - INFO - step 2732, loss: 4.112524, best loss: 3.169164 2025-01-16 01:21:49,143 - INFO - step 2733, loss: 4.436219, best loss: 3.169164 2025-01-16 01:21:49,293 - INFO - step 2734, loss: 4.606326, best 
loss: 3.169164 2025-01-16 01:21:49,443 - INFO - step 2735, loss: 4.579418, best loss: 3.169164 2025-01-16 01:21:49,594 - INFO - step 2736, loss: 4.633140, best loss: 3.169164 2025-01-16 01:21:49,744 - INFO - step 2737, loss: 4.296656, best loss: 3.169164 2025-01-16 01:21:49,894 - INFO - step 2738, loss: 4.562685, best loss: 3.169164 2025-01-16 01:21:50,044 - INFO - step 2739, loss: 4.311941, best loss: 3.169164 2025-01-16 01:21:50,194 - INFO - step 2740, loss: 4.092839, best loss: 3.169164 2025-01-16 01:21:50,345 - INFO - step 2741, loss: 4.578676, best loss: 3.169164 2025-01-16 01:21:50,495 - INFO - step 2742, loss: 4.564883, best loss: 3.169164 2025-01-16 01:21:50,645 - INFO - step 2743, loss: 4.427987, best loss: 3.169164 2025-01-16 01:21:50,795 - INFO - step 2744, loss: 4.210533, best loss: 3.169164 2025-01-16 01:21:50,945 - INFO - step 2745, loss: 4.546589, best loss: 3.169164 2025-01-16 01:21:51,095 - INFO - step 2746, loss: 4.301291, best loss: 3.169164 2025-01-16 01:21:51,246 - INFO - step 2747, loss: 4.061400, best loss: 3.169164 2025-01-16 01:21:51,396 - INFO - step 2748, loss: 3.997882, best loss: 3.169164 2025-01-16 01:21:51,546 - INFO - step 2749, loss: 4.419624, best loss: 3.169164 2025-01-16 01:21:51,697 - INFO - step 2750, loss: 4.431808, best loss: 3.169164 2025-01-16 01:21:51,847 - INFO - step 2751, loss: 4.642076, best loss: 3.169164 2025-01-16 01:21:51,997 - INFO - step 2752, loss: 4.526585, best loss: 3.169164 2025-01-16 01:21:52,147 - INFO - step 2753, loss: 4.118266, best loss: 3.169164 2025-01-16 01:21:52,297 - INFO - step 2754, loss: 4.574816, best loss: 3.169164 2025-01-16 01:21:52,447 - INFO - step 2755, loss: 4.131603, best loss: 3.169164 2025-01-16 01:21:52,597 - INFO - step 2756, loss: 4.553592, best loss: 3.169164 2025-01-16 01:21:52,747 - INFO - step 2757, loss: 4.119154, best loss: 3.169164 2025-01-16 01:21:52,897 - INFO - step 2758, loss: 4.253746, best loss: 3.169164 2025-01-16 01:21:53,047 - INFO - step 2759, loss: 4.237190, best 
loss: 3.169164 2025-01-16 01:21:53,197 - INFO - step 2760, loss: 4.257871, best loss: 3.169164 2025-01-16 01:21:53,347 - INFO - step 2761, loss: 4.089380, best loss: 3.169164 2025-01-16 01:21:53,497 - INFO - step 2762, loss: 4.281093, best loss: 3.169164 2025-01-16 01:21:53,647 - INFO - step 2763, loss: 3.883813, best loss: 3.169164 2025-01-16 01:21:53,797 - INFO - step 2764, loss: 3.550241, best loss: 3.169164 2025-01-16 01:21:53,947 - INFO - step 2765, loss: 3.781856, best loss: 3.169164 2025-01-16 01:21:54,097 - INFO - step 2766, loss: 4.066912, best loss: 3.169164 2025-01-16 01:21:54,247 - INFO - step 2767, loss: 4.766016, best loss: 3.169164 2025-01-16 01:21:54,397 - INFO - step 2768, loss: 4.222015, best loss: 3.169164 2025-01-16 01:21:54,547 - INFO - step 2769, loss: 4.123701, best loss: 3.169164 2025-01-16 01:21:54,697 - INFO - step 2770, loss: 4.532583, best loss: 3.169164 2025-01-16 01:21:54,847 - INFO - step 2771, loss: 4.131895, best loss: 3.169164 2025-01-16 01:21:54,997 - INFO - step 2772, loss: 4.799494, best loss: 3.169164 2025-01-16 01:21:55,148 - INFO - step 2773, loss: 4.089931, best loss: 3.169164 2025-01-16 01:21:55,298 - INFO - step 2774, loss: 4.445897, best loss: 3.169164 2025-01-16 01:21:55,448 - INFO - step 2775, loss: 4.080871, best loss: 3.169164 2025-01-16 01:21:55,598 - INFO - step 2776, loss: 4.794124, best loss: 3.169164 2025-01-16 01:21:55,748 - INFO - step 2777, loss: 4.233117, best loss: 3.169164 2025-01-16 01:21:55,898 - INFO - step 2778, loss: 3.943922, best loss: 3.169164 2025-01-16 01:21:56,048 - INFO - step 2779, loss: 4.458071, best loss: 3.169164 2025-01-16 01:21:56,197 - INFO - step 2780, loss: 4.079530, best loss: 3.169164 2025-01-16 01:21:56,347 - INFO - step 2781, loss: 3.887980, best loss: 3.169164 2025-01-16 01:21:56,497 - INFO - step 2782, loss: 4.761570, best loss: 3.169164 2025-01-16 01:21:56,647 - INFO - step 2783, loss: 4.279564, best loss: 3.169164 2025-01-16 01:21:56,797 - INFO - step 2784, loss: 3.743381, best 
loss: 3.169164 2025-01-16 01:21:56,948 - INFO - step 2785, loss: 4.028795, best loss: 3.169164 2025-01-16 01:21:57,098 - INFO - step 2786, loss: 4.344543, best loss: 3.169164 2025-01-16 01:21:57,248 - INFO - step 2787, loss: 4.223204, best loss: 3.169164 2025-01-16 01:21:57,398 - INFO - step 2788, loss: 4.148855, best loss: 3.169164 2025-01-16 01:21:57,548 - INFO - step 2789, loss: 3.987720, best loss: 3.169164 2025-01-16 01:21:57,698 - INFO - step 2790, loss: 4.626778, best loss: 3.169164 2025-01-16 01:21:57,848 - INFO - step 2791, loss: 4.421206, best loss: 3.169164 2025-01-16 01:21:57,998 - INFO - step 2792, loss: 4.290365, best loss: 3.169164 2025-01-16 01:21:58,148 - INFO - step 2793, loss: 3.973336, best loss: 3.169164 2025-01-16 01:21:58,298 - INFO - step 2794, loss: 4.334469, best loss: 3.169164 2025-01-16 01:21:58,448 - INFO - step 2795, loss: 4.173987, best loss: 3.169164 2025-01-16 01:21:58,598 - INFO - step 2796, loss: 3.710086, best loss: 3.169164 2025-01-16 01:21:58,748 - INFO - step 2797, loss: 4.258105, best loss: 3.169164 2025-01-16 01:21:58,898 - INFO - step 2798, loss: 3.822584, best loss: 3.169164 2025-01-16 01:21:59,049 - INFO - step 2799, loss: 4.242935, best loss: 3.169164 2025-01-16 01:21:59,199 - INFO - step 2800, loss: 4.024447, best loss: 3.169164 2025-01-16 01:21:59,349 - INFO - step 2801, loss: 4.161732, best loss: 3.169164 2025-01-16 01:21:59,499 - INFO - step 2802, loss: 4.025956, best loss: 3.169164 2025-01-16 01:21:59,649 - INFO - step 2803, loss: 4.320054, best loss: 3.169164 2025-01-16 01:21:59,799 - INFO - step 2804, loss: 4.608586, best loss: 3.169164 2025-01-16 01:21:59,949 - INFO - step 2805, loss: 4.377857, best loss: 3.169164 2025-01-16 01:22:00,099 - INFO - step 2806, loss: 4.525183, best loss: 3.169164 2025-01-16 01:22:00,249 - INFO - step 2807, loss: 3.955507, best loss: 3.169164 2025-01-16 01:22:00,398 - INFO - step 2808, loss: 4.299424, best loss: 3.169164 2025-01-16 01:22:00,548 - INFO - step 2809, loss: 4.358335, best 
loss: 3.169164 2025-01-16 01:22:00,699 - INFO - step 2810, loss: 4.081829, best loss: 3.169164 2025-01-16 01:22:00,849 - INFO - step 2811, loss: 3.941059, best loss: 3.169164 2025-01-16 01:22:00,999 - INFO - step 2812, loss: 3.996108, best loss: 3.169164 2025-01-16 01:22:01,149 - INFO - step 2813, loss: 4.049016, best loss: 3.169164 2025-01-16 01:22:01,299 - INFO - step 2814, loss: 4.196612, best loss: 3.169164 2025-01-16 01:22:01,449 - INFO - step 2815, loss: 4.520537, best loss: 3.169164 2025-01-16 01:22:01,599 - INFO - step 2816, loss: 4.675850, best loss: 3.169164 2025-01-16 01:22:01,749 - INFO - step 2817, loss: 4.399892, best loss: 3.169164 2025-01-16 01:22:01,899 - INFO - step 2818, loss: 4.529214, best loss: 3.169164 2025-01-16 01:22:02,049 - INFO - step 2819, loss: 4.373075, best loss: 3.169164 2025-01-16 01:22:02,200 - INFO - step 2820, loss: 4.331654, best loss: 3.169164 2025-01-16 01:22:02,350 - INFO - step 2821, loss: 3.745087, best loss: 3.169164 2025-01-16 01:22:02,500 - INFO - step 2822, loss: 4.385192, best loss: 3.169164 2025-01-16 01:22:02,650 - INFO - step 2823, loss: 4.684049, best loss: 3.169164 2025-01-16 01:22:02,800 - INFO - step 2824, loss: 4.248240, best loss: 3.169164 2025-01-16 01:22:02,949 - INFO - step 2825, loss: 4.319641, best loss: 3.169164 2025-01-16 01:22:03,099 - INFO - step 2826, loss: 4.328333, best loss: 3.169164 2025-01-16 01:22:03,250 - INFO - step 2827, loss: 3.883717, best loss: 3.169164 2025-01-16 01:22:06,690 - INFO - step 2828, loss: 3.143701, best loss: 3.143701 2025-01-16 01:22:06,840 - INFO - step 2829, loss: 4.148838, best loss: 3.143701 2025-01-16 01:22:06,990 - INFO - step 2830, loss: 4.379711, best loss: 3.143701 2025-01-16 01:22:07,140 - INFO - step 2831, loss: 4.398755, best loss: 3.143701 2025-01-16 01:22:07,290 - INFO - step 2832, loss: 4.291952, best loss: 3.143701 2025-01-16 01:22:07,441 - INFO - step 2833, loss: 3.928257, best loss: 3.143701 2025-01-16 01:22:07,591 - INFO - step 2834, loss: 4.151434, best 
loss: 3.143701 2025-01-16 01:22:07,741 - INFO - step 2835, loss: 4.244147, best loss: 3.143701 2025-01-16 01:22:07,891 - INFO - step 2836, loss: 3.924169, best loss: 3.143701 2025-01-16 01:22:08,041 - INFO - step 2837, loss: 4.186257, best loss: 3.143701 2025-01-16 01:22:08,191 - INFO - step 2838, loss: 4.085235, best loss: 3.143701 2025-01-16 01:22:08,341 - INFO - step 2839, loss: 3.898976, best loss: 3.143701 2025-01-16 01:22:08,491 - INFO - step 2840, loss: 4.354964, best loss: 3.143701 2025-01-16 01:22:08,641 - INFO - step 2841, loss: 3.807961, best loss: 3.143701 2025-01-16 01:22:08,790 - INFO - step 2842, loss: 4.170305, best loss: 3.143701 2025-01-16 01:22:08,940 - INFO - step 2843, loss: 4.473217, best loss: 3.143701 2025-01-16 01:22:09,091 - INFO - step 2844, loss: 4.070871, best loss: 3.143701 2025-01-16 01:22:09,241 - INFO - step 2845, loss: 3.598747, best loss: 3.143701 2025-01-16 01:22:09,391 - INFO - step 2846, loss: 4.221144, best loss: 3.143701 2025-01-16 01:22:09,541 - INFO - step 2847, loss: 4.406838, best loss: 3.143701 2025-01-16 01:22:09,691 - INFO - step 2848, loss: 4.575805, best loss: 3.143701 2025-01-16 01:22:09,841 - INFO - step 2849, loss: 4.389477, best loss: 3.143701 2025-01-16 01:22:09,991 - INFO - step 2850, loss: 4.471956, best loss: 3.143701 2025-01-16 01:22:10,142 - INFO - step 2851, loss: 4.308764, best loss: 3.143701 2025-01-16 01:22:10,292 - INFO - step 2852, loss: 4.609304, best loss: 3.143701 2025-01-16 01:22:10,441 - INFO - step 2853, loss: 4.075412, best loss: 3.143701 2025-01-16 01:22:10,592 - INFO - step 2854, loss: 4.147034, best loss: 3.143701 2025-01-16 01:22:10,742 - INFO - step 2855, loss: 4.274425, best loss: 3.143701 2025-01-16 01:22:10,892 - INFO - step 2856, loss: 4.319327, best loss: 3.143701 2025-01-16 01:22:11,042 - INFO - step 2857, loss: 4.288742, best loss: 3.143701 2025-01-16 01:22:11,192 - INFO - step 2858, loss: 4.059742, best loss: 3.143701 2025-01-16 01:22:11,342 - INFO - step 2859, loss: 4.107843, best 
loss: 3.143701 2025-01-16 01:22:11,492 - INFO - step 2860, loss: 4.339161, best loss: 3.143701 2025-01-16 01:22:11,642 - INFO - step 2861, loss: 4.533073, best loss: 3.143701 2025-01-16 01:22:11,792 - INFO - step 2862, loss: 4.218285, best loss: 3.143701 2025-01-16 01:22:11,942 - INFO - step 2863, loss: 4.611056, best loss: 3.143701 2025-01-16 01:22:12,093 - INFO - step 2864, loss: 4.591624, best loss: 3.143701 2025-01-16 01:22:12,243 - INFO - step 2865, loss: 4.659547, best loss: 3.143701 2025-01-16 01:22:12,393 - INFO - step 2866, loss: 4.839210, best loss: 3.143701 2025-01-16 01:22:12,543 - INFO - step 2867, loss: 4.774990, best loss: 3.143701 2025-01-16 01:22:12,693 - INFO - step 2868, loss: 4.103759, best loss: 3.143701 2025-01-16 01:22:12,843 - INFO - step 2869, loss: 4.669455, best loss: 3.143701 2025-01-16 01:22:12,992 - INFO - step 2870, loss: 4.564801, best loss: 3.143701 2025-01-16 01:22:13,143 - INFO - step 2871, loss: 4.700142, best loss: 3.143701 2025-01-16 01:22:13,293 - INFO - step 2872, loss: 4.045986, best loss: 3.143701 2025-01-16 01:22:13,443 - INFO - step 2873, loss: 4.408737, best loss: 3.143701 2025-01-16 01:22:13,593 - INFO - step 2874, loss: 4.414691, best loss: 3.143701 2025-01-16 01:22:13,743 - INFO - step 2875, loss: 4.323098, best loss: 3.143701 2025-01-16 01:22:13,894 - INFO - step 2876, loss: 4.612025, best loss: 3.143701 2025-01-16 01:22:14,044 - INFO - step 2877, loss: 4.069940, best loss: 3.143701 2025-01-16 01:22:14,194 - INFO - step 2878, loss: 4.022915, best loss: 3.143701 2025-01-16 01:22:14,344 - INFO - step 2879, loss: 4.545019, best loss: 3.143701 2025-01-16 01:22:14,494 - INFO - step 2880, loss: 4.343956, best loss: 3.143701 2025-01-16 01:22:14,644 - INFO - step 2881, loss: 4.516982, best loss: 3.143701 2025-01-16 01:22:14,794 - INFO - step 2882, loss: 4.290866, best loss: 3.143701 2025-01-16 01:22:14,944 - INFO - step 2883, loss: 4.905012, best loss: 3.143701 2025-01-16 01:22:15,094 - INFO - step 2884, loss: 4.713759, best 
loss: 3.143701 2025-01-16 01:22:15,244 - INFO - step 2885, loss: 4.208556, best loss: 3.143701 2025-01-16 01:22:15,394 - INFO - step 2886, loss: 4.127708, best loss: 3.143701 2025-01-16 01:22:15,544 - INFO - step 2887, loss: 4.608361, best loss: 3.143701 2025-01-16 01:22:15,695 - INFO - step 2888, loss: 4.176170, best loss: 3.143701 2025-01-16 01:22:15,845 - INFO - step 2889, loss: 4.098923, best loss: 3.143701 2025-01-16 01:22:15,995 - INFO - step 2890, loss: 4.473744, best loss: 3.143701 2025-01-16 01:22:16,145 - INFO - step 2891, loss: 4.466852, best loss: 3.143701 2025-01-16 01:22:16,295 - INFO - step 2892, loss: 4.372177, best loss: 3.143701 2025-01-16 01:22:16,445 - INFO - step 2893, loss: 3.948567, best loss: 3.143701 2025-01-16 01:22:16,595 - INFO - step 2894, loss: 3.792614, best loss: 3.143701 2025-01-16 01:22:16,745 - INFO - step 2895, loss: 3.859420, best loss: 3.143701 2025-01-16 01:22:16,895 - INFO - step 2896, loss: 3.822126, best loss: 3.143701 2025-01-16 01:22:17,045 - INFO - step 2897, loss: 3.894292, best loss: 3.143701 2025-01-16 01:22:17,195 - INFO - step 2898, loss: 4.307159, best loss: 3.143701 2025-01-16 01:22:17,345 - INFO - step 2899, loss: 4.077662, best loss: 3.143701 2025-01-16 01:22:17,496 - INFO - step 2900, loss: 3.938539, best loss: 3.143701 2025-01-16 01:22:17,646 - INFO - step 2901, loss: 4.160530, best loss: 3.143701 2025-01-16 01:22:17,795 - INFO - step 2902, loss: 4.128179, best loss: 3.143701 2025-01-16 01:22:17,945 - INFO - step 2903, loss: 4.094565, best loss: 3.143701 2025-01-16 01:22:18,095 - INFO - step 2904, loss: 4.228058, best loss: 3.143701 2025-01-16 01:22:18,245 - INFO - step 2905, loss: 4.509299, best loss: 3.143701 2025-01-16 01:22:18,395 - INFO - step 2906, loss: 3.935961, best loss: 3.143701 2025-01-16 01:22:18,546 - INFO - step 2907, loss: 3.860392, best loss: 3.143701 2025-01-16 01:22:18,696 - INFO - step 2908, loss: 4.215426, best loss: 3.143701 2025-01-16 01:22:18,846 - INFO - step 2909, loss: 4.311070, best 
loss: 3.143701 2025-01-16 01:22:18,996 - INFO - step 2910, loss: 3.867220, best loss: 3.143701 2025-01-16 01:22:19,146 - INFO - step 2911, loss: 3.763206, best loss: 3.143701 2025-01-16 01:22:19,295 - INFO - step 2912, loss: 4.057285, best loss: 3.143701 2025-01-16 01:22:19,446 - INFO - step 2913, loss: 4.081022, best loss: 3.143701 2025-01-16 01:22:19,596 - INFO - step 2914, loss: 3.746382, best loss: 3.143701 2025-01-16 01:22:19,746 - INFO - step 2915, loss: 4.205338, best loss: 3.143701 2025-01-16 01:22:19,896 - INFO - step 2916, loss: 3.948726, best loss: 3.143701 2025-01-16 01:22:20,046 - INFO - step 2917, loss: 3.844742, best loss: 3.143701 2025-01-16 01:22:20,196 - INFO - step 2918, loss: 3.753490, best loss: 3.143701 2025-01-16 01:22:20,346 - INFO - step 2919, loss: 4.032658, best loss: 3.143701 2025-01-16 01:22:20,496 - INFO - step 2920, loss: 3.573339, best loss: 3.143701 2025-01-16 01:22:20,646 - INFO - step 2921, loss: 3.906662, best loss: 3.143701 2025-01-16 01:22:20,797 - INFO - step 2922, loss: 3.662131, best loss: 3.143701 2025-01-16 01:22:20,947 - INFO - step 2923, loss: 4.214391, best loss: 3.143701 2025-01-16 01:22:21,097 - INFO - step 2924, loss: 4.523153, best loss: 3.143701 2025-01-16 01:22:21,247 - INFO - step 2925, loss: 4.886729, best loss: 3.143701 2025-01-16 01:22:21,397 - INFO - step 2926, loss: 4.228867, best loss: 3.143701 2025-01-16 01:22:21,547 - INFO - step 2927, loss: 4.432405, best loss: 3.143701 2025-01-16 01:22:21,698 - INFO - step 2928, loss: 4.253086, best loss: 3.143701 2025-01-16 01:22:21,848 - INFO - step 2929, loss: 4.190255, best loss: 3.143701 2025-01-16 01:22:21,998 - INFO - step 2930, loss: 4.065504, best loss: 3.143701 2025-01-16 01:22:22,148 - INFO - step 2931, loss: 4.362310, best loss: 3.143701 2025-01-16 01:22:22,299 - INFO - step 2932, loss: 3.996323, best loss: 3.143701 2025-01-16 01:22:22,449 - INFO - step 2933, loss: 3.660304, best loss: 3.143701 2025-01-16 01:22:22,600 - INFO - step 2934, loss: 3.975971, best 
loss: 3.143701 2025-01-16 01:22:22,750 - INFO - step 2935, loss: 3.891340, best loss: 3.143701 2025-01-16 01:22:22,900 - INFO - step 2936, loss: 4.108759, best loss: 3.143701 2025-01-16 01:22:23,050 - INFO - step 2937, loss: 3.180187, best loss: 3.143701 2025-01-16 01:22:23,201 - INFO - step 2938, loss: 3.936999, best loss: 3.143701 2025-01-16 01:22:23,351 - INFO - step 2939, loss: 4.171760, best loss: 3.143701 2025-01-16 01:22:23,501 - INFO - step 2940, loss: 4.223144, best loss: 3.143701 2025-01-16 01:22:23,651 - INFO - step 2941, loss: 4.011322, best loss: 3.143701 2025-01-16 01:22:23,800 - INFO - step 2942, loss: 4.025612, best loss: 3.143701 2025-01-16 01:22:23,950 - INFO - step 2943, loss: 4.098238, best loss: 3.143701 2025-01-16 01:22:24,100 - INFO - step 2944, loss: 3.739938, best loss: 3.143701 2025-01-16 01:22:24,251 - INFO - step 2945, loss: 4.139771, best loss: 3.143701 2025-01-16 01:22:24,401 - INFO - step 2946, loss: 4.034443, best loss: 3.143701 2025-01-16 01:22:24,551 - INFO - step 2947, loss: 4.047333, best loss: 3.143701 2025-01-16 01:22:24,701 - INFO - step 2948, loss: 3.663967, best loss: 3.143701 2025-01-16 01:22:24,851 - INFO - step 2949, loss: 3.714195, best loss: 3.143701 2025-01-16 01:22:25,001 - INFO - step 2950, loss: 3.782046, best loss: 3.143701 2025-01-16 01:22:25,151 - INFO - step 2951, loss: 3.761142, best loss: 3.143701 2025-01-16 01:22:25,301 - INFO - step 2952, loss: 3.804753, best loss: 3.143701 2025-01-16 01:22:25,451 - INFO - step 2953, loss: 3.794085, best loss: 3.143701 2025-01-16 01:22:25,601 - INFO - step 2954, loss: 3.722190, best loss: 3.143701 2025-01-16 01:22:25,751 - INFO - step 2955, loss: 3.493579, best loss: 3.143701 2025-01-16 01:22:25,902 - INFO - step 2956, loss: 3.380485, best loss: 3.143701 2025-01-16 01:22:29,340 - INFO - step 2957, loss: 3.116312, best loss: 3.116312 2025-01-16 01:22:29,503 - INFO - step 2958, loss: 4.020926, best loss: 3.116312 2025-01-16 01:22:29,656 - INFO - step 2959, loss: 4.416344, best 
loss: 3.116312 2025-01-16 01:22:29,806 - INFO - step 2960, loss: 4.514421, best loss: 3.116312 2025-01-16 01:22:29,956 - INFO - step 2961, loss: 4.621398, best loss: 3.116312 2025-01-16 01:22:30,106 - INFO - step 2962, loss: 4.696087, best loss: 3.116312 2025-01-16 01:22:30,256 - INFO - step 2963, loss: 4.201841, best loss: 3.116312 2025-01-16 01:22:30,406 - INFO - step 2964, loss: 4.398482, best loss: 3.116312 2025-01-16 01:22:30,556 - INFO - step 2965, loss: 4.488908, best loss: 3.116312 2025-01-16 01:22:30,706 - INFO - step 2966, loss: 4.012529, best loss: 3.116312 2025-01-16 01:22:30,856 - INFO - step 2967, loss: 3.593152, best loss: 3.116312 2025-01-16 01:22:31,006 - INFO - step 2968, loss: 4.014527, best loss: 3.116312 2025-01-16 01:22:31,156 - INFO - step 2969, loss: 3.835064, best loss: 3.116312 2025-01-16 01:22:31,306 - INFO - step 2970, loss: 4.433640, best loss: 3.116312 2025-01-16 01:22:31,456 - INFO - step 2971, loss: 4.308933, best loss: 3.116312 2025-01-16 01:22:31,606 - INFO - step 2972, loss: 4.398682, best loss: 3.116312 2025-01-16 01:22:31,756 - INFO - step 2973, loss: 4.127976, best loss: 3.116312 2025-01-16 01:22:31,906 - INFO - step 2974, loss: 4.487261, best loss: 3.116312 2025-01-16 01:22:32,057 - INFO - step 2975, loss: 3.968374, best loss: 3.116312 2025-01-16 01:22:32,207 - INFO - step 2976, loss: 4.354932, best loss: 3.116312 2025-01-16 01:22:32,357 - INFO - step 2977, loss: 4.068432, best loss: 3.116312 2025-01-16 01:22:32,507 - INFO - step 2978, loss: 4.298133, best loss: 3.116312 2025-01-16 01:22:32,657 - INFO - step 2979, loss: 4.137476, best loss: 3.116312 2025-01-16 01:22:32,807 - INFO - step 2980, loss: 4.031023, best loss: 3.116312 2025-01-16 01:22:32,957 - INFO - step 2981, loss: 4.700033, best loss: 3.116312 2025-01-16 01:22:33,107 - INFO - step 2982, loss: 3.865004, best loss: 3.116312 2025-01-16 01:22:33,257 - INFO - step 2983, loss: 4.289367, best loss: 3.116312 2025-01-16 01:22:33,408 - INFO - step 2984, loss: 4.256965, best 
loss: 3.116312 2025-01-16 01:22:33,558 - INFO - step 2985, loss: 4.295588, best loss: 3.116312 2025-01-16 01:22:33,709 - INFO - step 2986, loss: 4.185329, best loss: 3.116312 2025-01-16 01:22:33,859 - INFO - step 2987, loss: 3.871856, best loss: 3.116312 2025-01-16 01:22:34,009 - INFO - step 2988, loss: 3.919712, best loss: 3.116312 2025-01-16 01:22:34,159 - INFO - step 2989, loss: 4.061873, best loss: 3.116312 2025-01-16 01:22:34,309 - INFO - step 2990, loss: 3.543791, best loss: 3.116312 2025-01-16 01:22:34,459 - INFO - step 2991, loss: 4.352708, best loss: 3.116312 2025-01-16 01:22:34,610 - INFO - step 2992, loss: 3.544658, best loss: 3.116312 2025-01-16 01:22:34,759 - INFO - step 2993, loss: 3.447516, best loss: 3.116312 2025-01-16 01:22:34,909 - INFO - step 2994, loss: 4.020699, best loss: 3.116312 2025-01-16 01:22:35,059 - INFO - step 2995, loss: 3.971159, best loss: 3.116312 2025-01-16 01:22:35,209 - INFO - step 2996, loss: 3.964343, best loss: 3.116312 2025-01-16 01:22:35,360 - INFO - step 2997, loss: 3.627479, best loss: 3.116312 2025-01-16 01:22:35,510 - INFO - step 2998, loss: 3.873642, best loss: 3.116312 2025-01-16 01:22:35,660 - INFO - step 2999, loss: 3.945884, best loss: 3.116312 2025-01-16 01:22:35,810 - INFO - step 3000, loss: 3.840458, best loss: 3.116312 2025-01-16 01:22:35,961 - INFO - step 3001, loss: 3.649186, best loss: 3.116312 2025-01-16 01:22:36,111 - INFO - step 3002, loss: 4.123910, best loss: 3.116312 2025-01-16 01:22:36,261 - INFO - step 3003, loss: 4.046471, best loss: 3.116312 2025-01-16 01:22:36,411 - INFO - step 3004, loss: 4.014663, best loss: 3.116312 2025-01-16 01:22:36,561 - INFO - step 3005, loss: 3.639627, best loss: 3.116312 2025-01-16 01:22:36,711 - INFO - step 3006, loss: 3.879836, best loss: 3.116312 2025-01-16 01:22:36,861 - INFO - step 3007, loss: 4.175313, best loss: 3.116312 2025-01-16 01:22:37,011 - INFO - step 3008, loss: 3.731415, best loss: 3.116312 2025-01-16 01:22:37,161 - INFO - step 3009, loss: 4.221789, best 
loss: 3.116312 2025-01-16 01:22:37,311 - INFO - step 3010, loss: 4.314767, best loss: 3.116312 2025-01-16 01:22:37,461 - INFO - step 3011, loss: 4.223339, best loss: 3.116312 2025-01-16 01:22:37,611 - INFO - step 3012, loss: 4.194271, best loss: 3.116312 2025-01-16 01:22:37,761 - INFO - step 3013, loss: 4.118866, best loss: 3.116312 2025-01-16 01:22:37,911 - INFO - step 3014, loss: 4.054801, best loss: 3.116312 2025-01-16 01:22:38,061 - INFO - step 3015, loss: 3.803346, best loss: 3.116312 2025-01-16 01:22:38,212 - INFO - step 3016, loss: 4.647673, best loss: 3.116312 2025-01-16 01:22:38,361 - INFO - step 3017, loss: 4.094465, best loss: 3.116312 2025-01-16 01:22:38,511 - INFO - step 3018, loss: 4.425151, best loss: 3.116312 2025-01-16 01:22:38,661 - INFO - step 3019, loss: 3.489989, best loss: 3.116312 2025-01-16 01:22:38,811 - INFO - step 3020, loss: 3.513269, best loss: 3.116312 2025-01-16 01:22:38,961 - INFO - step 3021, loss: 4.045298, best loss: 3.116312 2025-01-16 01:22:39,111 - INFO - step 3022, loss: 4.023735, best loss: 3.116312 2025-01-16 01:22:39,262 - INFO - step 3023, loss: 3.856232, best loss: 3.116312 2025-01-16 01:22:39,412 - INFO - step 3024, loss: 4.009213, best loss: 3.116312 2025-01-16 01:22:39,562 - INFO - step 3025, loss: 3.810401, best loss: 3.116312 2025-01-16 01:22:39,712 - INFO - step 3026, loss: 3.941525, best loss: 3.116312 2025-01-16 01:22:39,862 - INFO - step 3027, loss: 4.154411, best loss: 3.116312 2025-01-16 01:22:40,012 - INFO - step 3028, loss: 3.706875, best loss: 3.116312 2025-01-16 01:22:40,162 - INFO - step 3029, loss: 3.797015, best loss: 3.116312 2025-01-16 01:22:40,312 - INFO - step 3030, loss: 3.938986, best loss: 3.116312 2025-01-16 01:22:40,462 - INFO - step 3031, loss: 4.115389, best loss: 3.116312 2025-01-16 01:22:40,612 - INFO - step 3032, loss: 3.929017, best loss: 3.116312 2025-01-16 01:22:40,763 - INFO - step 3033, loss: 4.163872, best loss: 3.116312 2025-01-16 01:22:40,913 - INFO - step 3034, loss: 4.006507, best 
loss: 3.116312 2025-01-16 01:22:41,062 - INFO - step 3035, loss: 3.599267, best loss: 3.116312 2025-01-16 01:22:41,213 - INFO - step 3036, loss: 4.074486, best loss: 3.116312 2025-01-16 01:22:41,363 - INFO - step 3037, loss: 3.394723, best loss: 3.116312 2025-01-16 01:22:41,513 - INFO - step 3038, loss: 3.742120, best loss: 3.116312 2025-01-16 01:22:41,663 - INFO - step 3039, loss: 3.933204, best loss: 3.116312 2025-01-16 01:22:41,813 - INFO - step 3040, loss: 3.813293, best loss: 3.116312 2025-01-16 01:22:41,963 - INFO - step 3041, loss: 3.792370, best loss: 3.116312 2025-01-16 01:22:42,113 - INFO - step 3042, loss: 4.109671, best loss: 3.116312 2025-01-16 01:22:42,263 - INFO - step 3043, loss: 4.318623, best loss: 3.116312 2025-01-16 01:22:42,413 - INFO - step 3044, loss: 4.112427, best loss: 3.116312 2025-01-16 01:22:42,563 - INFO - step 3045, loss: 4.348285, best loss: 3.116312 2025-01-16 01:22:42,713 - INFO - step 3046, loss: 3.916706, best loss: 3.116312 2025-01-16 01:22:42,863 - INFO - step 3047, loss: 3.992551, best loss: 3.116312 2025-01-16 01:22:43,013 - INFO - step 3048, loss: 3.919234, best loss: 3.116312 2025-01-16 01:22:43,163 - INFO - step 3049, loss: 3.595330, best loss: 3.116312 2025-01-16 01:22:43,313 - INFO - step 3050, loss: 4.416422, best loss: 3.116312 2025-01-16 01:22:43,463 - INFO - step 3051, loss: 4.334155, best loss: 3.116312 2025-01-16 01:22:43,613 - INFO - step 3052, loss: 3.985575, best loss: 3.116312 2025-01-16 01:22:43,763 - INFO - step 3053, loss: 3.860008, best loss: 3.116312 2025-01-16 01:22:43,913 - INFO - step 3054, loss: 3.601134, best loss: 3.116312 2025-01-16 01:22:44,064 - INFO - step 3055, loss: 3.783239, best loss: 3.116312 2025-01-16 01:22:44,214 - INFO - step 3056, loss: 3.762363, best loss: 3.116312 2025-01-16 01:22:44,363 - INFO - step 3057, loss: 3.678654, best loss: 3.116312 2025-01-16 01:22:44,514 - INFO - step 3058, loss: 4.307206, best loss: 3.116312 2025-01-16 01:22:44,664 - INFO - step 3059, loss: 4.285886, best 
loss: 3.116312 2025-01-16 01:22:44,814 - INFO - step 3060, loss: 4.097656, best loss: 3.116312 2025-01-16 01:22:44,964 - INFO - step 3061, loss: 4.403617, best loss: 3.116312 2025-01-16 01:22:45,115 - INFO - step 3062, loss: 4.020994, best loss: 3.116312 2025-01-16 01:22:45,265 - INFO - step 3063, loss: 4.328946, best loss: 3.116312 2025-01-16 01:22:45,415 - INFO - step 3064, loss: 4.435358, best loss: 3.116312 2025-01-16 01:22:45,565 - INFO - step 3065, loss: 4.438906, best loss: 3.116312 2025-01-16 01:22:45,716 - INFO - step 3066, loss: 4.524824, best loss: 3.116312 2025-01-16 01:22:45,866 - INFO - step 3067, loss: 4.184145, best loss: 3.116312 2025-01-16 01:22:46,016 - INFO - step 3068, loss: 4.470247, best loss: 3.116312 2025-01-16 01:22:46,166 - INFO - step 3069, loss: 4.210205, best loss: 3.116312 2025-01-16 01:22:46,316 - INFO - step 3070, loss: 3.998966, best loss: 3.116312 2025-01-16 01:22:46,466 - INFO - step 3071, loss: 4.453240, best loss: 3.116312 2025-01-16 01:22:46,616 - INFO - step 3072, loss: 4.451391, best loss: 3.116312 2025-01-16 01:22:46,766 - INFO - step 3073, loss: 4.314057, best loss: 3.116312 2025-01-16 01:22:46,916 - INFO - step 3074, loss: 4.144973, best loss: 3.116312 2025-01-16 01:22:47,066 - INFO - step 3075, loss: 4.418244, best loss: 3.116312 2025-01-16 01:22:47,216 - INFO - step 3076, loss: 4.167986, best loss: 3.116312 2025-01-16 01:22:47,366 - INFO - step 3077, loss: 3.930482, best loss: 3.116312 2025-01-16 01:22:47,516 - INFO - step 3078, loss: 3.951915, best loss: 3.116312 2025-01-16 01:22:47,666 - INFO - step 3079, loss: 4.322987, best loss: 3.116312 2025-01-16 01:22:47,816 - INFO - step 3080, loss: 4.354350, best loss: 3.116312 2025-01-16 01:22:47,966 - INFO - step 3081, loss: 4.594454, best loss: 3.116312 2025-01-16 01:22:48,116 - INFO - step 3082, loss: 4.484299, best loss: 3.116312 2025-01-16 01:22:48,266 - INFO - step 3083, loss: 4.037545, best loss: 3.116312 2025-01-16 01:22:48,417 - INFO - step 3084, loss: 4.478149, best 
loss: 3.116312 2025-01-16 01:22:48,567 - INFO - step 3085, loss: 3.989580, best loss: 3.116312 2025-01-16 01:22:48,717 - INFO - step 3086, loss: 4.428098, best loss: 3.116312 2025-01-16 01:22:48,867 - INFO - step 3087, loss: 3.994046, best loss: 3.116312 2025-01-16 01:22:49,017 - INFO - step 3088, loss: 4.150364, best loss: 3.116312 2025-01-16 01:22:49,167 - INFO - step 3089, loss: 4.197021, best loss: 3.116312 2025-01-16 01:22:49,317 - INFO - step 3090, loss: 4.269868, best loss: 3.116312 2025-01-16 01:22:49,468 - INFO - step 3091, loss: 4.029844, best loss: 3.116312 2025-01-16 01:22:49,618 - INFO - step 3092, loss: 4.202672, best loss: 3.116312 2025-01-16 01:22:49,768 - INFO - step 3093, loss: 3.790192, best loss: 3.116312 2025-01-16 01:22:49,918 - INFO - step 3094, loss: 3.427894, best loss: 3.116312 2025-01-16 01:22:50,068 - INFO - step 3095, loss: 3.658713, best loss: 3.116312 2025-01-16 01:22:50,218 - INFO - step 3096, loss: 3.931651, best loss: 3.116312 2025-01-16 01:22:50,369 - INFO - step 3097, loss: 4.643894, best loss: 3.116312 2025-01-16 01:22:50,519 - INFO - step 3098, loss: 4.160352, best loss: 3.116312 2025-01-16 01:22:50,669 - INFO - step 3099, loss: 3.975319, best loss: 3.116312 2025-01-16 01:22:50,820 - INFO - step 3100, loss: 4.443131, best loss: 3.116312 2025-01-16 01:22:50,970 - INFO - step 3101, loss: 4.014565, best loss: 3.116312 2025-01-16 01:22:51,120 - INFO - step 3102, loss: 4.609741, best loss: 3.116312 2025-01-16 01:22:51,270 - INFO - step 3103, loss: 3.974504, best loss: 3.116312 2025-01-16 01:22:51,419 - INFO - step 3104, loss: 4.300471, best loss: 3.116312 2025-01-16 01:22:51,570 - INFO - step 3105, loss: 4.051742, best loss: 3.116312 2025-01-16 01:22:51,720 - INFO - step 3106, loss: 4.658160, best loss: 3.116312 2025-01-16 01:22:51,870 - INFO - step 3107, loss: 4.152386, best loss: 3.116312 2025-01-16 01:22:52,020 - INFO - step 3108, loss: 3.865429, best loss: 3.116312 2025-01-16 01:22:52,170 - INFO - step 3109, loss: 4.358914, best 
loss: 3.116312 2025-01-16 01:22:52,320 - INFO - step 3110, loss: 4.080777, best loss: 3.116312 2025-01-16 01:22:52,470 - INFO - step 3111, loss: 3.833112, best loss: 3.116312 2025-01-16 01:22:52,620 - INFO - step 3112, loss: 4.670580, best loss: 3.116312 2025-01-16 01:22:52,771 - INFO - step 3113, loss: 4.123451, best loss: 3.116312 2025-01-16 01:22:52,921 - INFO - step 3114, loss: 3.616692, best loss: 3.116312 2025-01-16 01:22:53,071 - INFO - step 3115, loss: 3.932023, best loss: 3.116312 2025-01-16 01:22:53,221 - INFO - step 3116, loss: 4.277129, best loss: 3.116312 2025-01-16 01:22:53,371 - INFO - step 3117, loss: 4.159033, best loss: 3.116312 2025-01-16 01:22:53,521 - INFO - step 3118, loss: 4.048106, best loss: 3.116312 2025-01-16 01:22:53,671 - INFO - step 3119, loss: 3.921598, best loss: 3.116312 2025-01-16 01:22:53,821 - INFO - step 3120, loss: 4.541512, best loss: 3.116312 2025-01-16 01:22:53,971 - INFO - step 3121, loss: 4.309319, best loss: 3.116312 2025-01-16 01:22:54,121 - INFO - step 3122, loss: 4.220939, best loss: 3.116312 2025-01-16 01:22:54,271 - INFO - step 3123, loss: 3.840054, best loss: 3.116312 2025-01-16 01:22:54,422 - INFO - step 3124, loss: 4.214710, best loss: 3.116312 2025-01-16 01:22:54,572 - INFO - step 3125, loss: 4.038100, best loss: 3.116312 2025-01-16 01:22:54,722 - INFO - step 3126, loss: 3.625880, best loss: 3.116312 2025-01-16 01:22:54,872 - INFO - step 3127, loss: 4.193869, best loss: 3.116312 2025-01-16 01:22:55,022 - INFO - step 3128, loss: 3.794507, best loss: 3.116312 2025-01-16 01:22:55,172 - INFO - step 3129, loss: 4.171086, best loss: 3.116312 2025-01-16 01:22:55,323 - INFO - step 3130, loss: 3.932531, best loss: 3.116312 2025-01-16 01:22:55,473 - INFO - step 3131, loss: 4.035428, best loss: 3.116312 2025-01-16 01:22:55,623 - INFO - step 3132, loss: 3.889660, best loss: 3.116312 2025-01-16 01:22:55,773 - INFO - step 3133, loss: 4.162637, best loss: 3.116312 2025-01-16 01:22:55,923 - INFO - step 3134, loss: 4.477314, best 
loss: 3.116312 2025-01-16 01:22:56,074 - INFO - step 3135, loss: 4.238515, best loss: 3.116312 2025-01-16 01:22:56,224 - INFO - step 3136, loss: 4.407053, best loss: 3.116312 2025-01-16 01:22:56,374 - INFO - step 3137, loss: 3.891735, best loss: 3.116312 2025-01-16 01:22:56,524 - INFO - step 3138, loss: 4.226667, best loss: 3.116312 2025-01-16 01:22:56,674 - INFO - step 3139, loss: 4.275548, best loss: 3.116312 2025-01-16 01:22:56,824 - INFO - step 3140, loss: 3.970326, best loss: 3.116312 2025-01-16 01:22:56,975 - INFO - step 3141, loss: 3.818427, best loss: 3.116312 2025-01-16 01:22:57,125 - INFO - step 3142, loss: 3.776206, best loss: 3.116312 2025-01-16 01:22:57,275 - INFO - step 3143, loss: 3.882714, best loss: 3.116312 2025-01-16 01:22:57,425 - INFO - step 3144, loss: 4.057224, best loss: 3.116312 2025-01-16 01:22:57,575 - INFO - step 3145, loss: 4.372856, best loss: 3.116312 2025-01-16 01:22:57,725 - INFO - step 3146, loss: 4.562910, best loss: 3.116312 2025-01-16 01:22:57,875 - INFO - step 3147, loss: 4.297167, best loss: 3.116312 2025-01-16 01:22:58,025 - INFO - step 3148, loss: 4.442658, best loss: 3.116312 2025-01-16 01:22:58,175 - INFO - step 3149, loss: 4.313455, best loss: 3.116312 2025-01-16 01:22:58,326 - INFO - step 3150, loss: 4.230998, best loss: 3.116312 2025-01-16 01:22:58,476 - INFO - step 3151, loss: 3.646293, best loss: 3.116312 2025-01-16 01:22:58,626 - INFO - step 3152, loss: 4.257503, best loss: 3.116312 2025-01-16 01:22:58,776 - INFO - step 3153, loss: 4.574184, best loss: 3.116312 2025-01-16 01:22:58,926 - INFO - step 3154, loss: 4.187691, best loss: 3.116312 2025-01-16 01:22:59,076 - INFO - step 3155, loss: 4.288855, best loss: 3.116312 2025-01-16 01:22:59,227 - INFO - step 3156, loss: 4.262624, best loss: 3.116312 2025-01-16 01:22:59,376 - INFO - step 3157, loss: 3.796472, best loss: 3.116312 2025-01-16 01:23:02,829 - INFO - step 3158, loss: 3.032990, best loss: 3.032990 2025-01-16 01:23:02,979 - INFO - step 3159, loss: 4.005055, best 
loss: 3.032990 2025-01-16 01:23:03,128 - INFO - step 3160, loss: 4.252920, best loss: 3.032990 2025-01-16 01:23:03,278 - INFO - step 3161, loss: 4.341980, best loss: 3.032990 2025-01-16 01:23:03,429 - INFO - step 3162, loss: 4.292308, best loss: 3.032990 2025-01-16 01:23:03,578 - INFO - step 3163, loss: 3.915586, best loss: 3.032990 2025-01-16 01:23:03,728 - INFO - step 3164, loss: 4.121335, best loss: 3.032990 2025-01-16 01:23:03,878 - INFO - step 3165, loss: 4.155735, best loss: 3.032990 2025-01-16 01:23:04,028 - INFO - step 3166, loss: 3.800094, best loss: 3.032990 2025-01-16 01:23:04,178 - INFO - step 3167, loss: 4.051804, best loss: 3.032990 2025-01-16 01:23:04,328 - INFO - step 3168, loss: 3.939047, best loss: 3.032990 2025-01-16 01:23:04,479 - INFO - step 3169, loss: 3.823065, best loss: 3.032990 2025-01-16 01:23:04,629 - INFO - step 3170, loss: 4.282872, best loss: 3.032990 2025-01-16 01:23:04,779 - INFO - step 3171, loss: 3.753438, best loss: 3.032990 2025-01-16 01:23:04,929 - INFO - step 3172, loss: 4.057003, best loss: 3.032990 2025-01-16 01:23:05,079 - INFO - step 3173, loss: 4.343899, best loss: 3.032990 2025-01-16 01:23:05,229 - INFO - step 3174, loss: 3.967087, best loss: 3.032990 2025-01-16 01:23:05,379 - INFO - step 3175, loss: 3.507532, best loss: 3.032990 2025-01-16 01:23:05,530 - INFO - step 3176, loss: 4.141518, best loss: 3.032990 2025-01-16 01:23:05,680 - INFO - step 3177, loss: 4.294113, best loss: 3.032990 2025-01-16 01:23:05,830 - INFO - step 3178, loss: 4.433913, best loss: 3.032990 2025-01-16 01:23:05,980 - INFO - step 3179, loss: 4.222852, best loss: 3.032990 2025-01-16 01:23:06,131 - INFO - step 3180, loss: 4.325747, best loss: 3.032990 2025-01-16 01:23:06,281 - INFO - step 3181, loss: 4.170657, best loss: 3.032990 2025-01-16 01:23:06,431 - INFO - step 3182, loss: 4.488952, best loss: 3.032990 2025-01-16 01:23:06,582 - INFO - step 3183, loss: 3.947314, best loss: 3.032990 2025-01-16 01:23:06,732 - INFO - step 3184, loss: 4.067688, best 
loss: 3.032990 2025-01-16 01:23:06,882 - INFO - step 3185, loss: 4.222829, best loss: 3.032990 2025-01-16 01:23:07,032 - INFO - step 3186, loss: 4.260117, best loss: 3.032990 2025-01-16 01:23:07,183 - INFO - step 3187, loss: 4.203094, best loss: 3.032990 2025-01-16 01:23:07,333 - INFO - step 3188, loss: 3.973304, best loss: 3.032990 2025-01-16 01:23:07,483 - INFO - step 3189, loss: 3.994435, best loss: 3.032990 2025-01-16 01:23:07,634 - INFO - step 3190, loss: 4.226516, best loss: 3.032990 2025-01-16 01:23:07,784 - INFO - step 3191, loss: 4.418303, best loss: 3.032990 2025-01-16 01:23:07,934 - INFO - step 3192, loss: 4.145984, best loss: 3.032990 2025-01-16 01:23:08,085 - INFO - step 3193, loss: 4.505550, best loss: 3.032990 2025-01-16 01:23:08,235 - INFO - step 3194, loss: 4.543242, best loss: 3.032990 2025-01-16 01:23:08,385 - INFO - step 3195, loss: 4.556030, best loss: 3.032990 2025-01-16 01:23:08,535 - INFO - step 3196, loss: 4.741663, best loss: 3.032990 2025-01-16 01:23:08,686 - INFO - step 3197, loss: 4.599358, best loss: 3.032990 2025-01-16 01:23:08,836 - INFO - step 3198, loss: 3.971691, best loss: 3.032990 2025-01-16 01:23:08,986 - INFO - step 3199, loss: 4.536891, best loss: 3.032990 2025-01-16 01:23:09,136 - INFO - step 3200, loss: 4.488710, best loss: 3.032990 2025-01-16 01:23:09,287 - INFO - step 3201, loss: 4.577770, best loss: 3.032990 2025-01-16 01:23:09,437 - INFO - step 3202, loss: 3.985087, best loss: 3.032990 2025-01-16 01:23:09,587 - INFO - step 3203, loss: 4.300428, best loss: 3.032990 2025-01-16 01:23:09,737 - INFO - step 3204, loss: 4.241285, best loss: 3.032990 2025-01-16 01:23:09,888 - INFO - step 3205, loss: 4.170108, best loss: 3.032990 2025-01-16 01:23:10,038 - INFO - step 3206, loss: 4.401706, best loss: 3.032990 2025-01-16 01:23:10,188 - INFO - step 3207, loss: 3.968989, best loss: 3.032990 2025-01-16 01:23:10,338 - INFO - step 3208, loss: 3.993782, best loss: 3.032990 2025-01-16 01:23:10,488 - INFO - step 3209, loss: 4.549263, best 
loss: 3.032990 2025-01-16 01:23:10,638 - INFO - step 3210, loss: 4.387430, best loss: 3.032990 2025-01-16 01:23:10,789 - INFO - step 3211, loss: 4.461892, best loss: 3.032990 2025-01-16 01:23:10,939 - INFO - step 3212, loss: 4.252446, best loss: 3.032990 2025-01-16 01:23:11,089 - INFO - step 3213, loss: 4.728073, best loss: 3.032990 2025-01-16 01:23:11,239 - INFO - step 3214, loss: 4.527409, best loss: 3.032990 2025-01-16 01:23:11,389 - INFO - step 3215, loss: 4.107182, best loss: 3.032990 2025-01-16 01:23:11,539 - INFO - step 3216, loss: 4.101039, best loss: 3.032990 2025-01-16 01:23:11,689 - INFO - step 3217, loss: 4.581911, best loss: 3.032990 2025-01-16 01:23:11,839 - INFO - step 3218, loss: 4.102655, best loss: 3.032990 2025-01-16 01:23:11,990 - INFO - step 3219, loss: 4.065925, best loss: 3.032990 2025-01-16 01:23:12,140 - INFO - step 3220, loss: 4.377510, best loss: 3.032990 2025-01-16 01:23:12,290 - INFO - step 3221, loss: 4.420265, best loss: 3.032990 2025-01-16 01:23:12,441 - INFO - step 3222, loss: 4.309538, best loss: 3.032990 2025-01-16 01:23:12,591 - INFO - step 3223, loss: 3.924556, best loss: 3.032990 2025-01-16 01:23:12,741 - INFO - step 3224, loss: 3.704567, best loss: 3.032990 2025-01-16 01:23:12,891 - INFO - step 3225, loss: 3.782714, best loss: 3.032990 2025-01-16 01:23:13,042 - INFO - step 3226, loss: 3.793390, best loss: 3.032990 2025-01-16 01:23:13,191 - INFO - step 3227, loss: 3.824037, best loss: 3.032990 2025-01-16 01:23:13,342 - INFO - step 3228, loss: 4.247582, best loss: 3.032990 2025-01-16 01:23:13,492 - INFO - step 3229, loss: 4.075500, best loss: 3.032990 2025-01-16 01:23:13,643 - INFO - step 3230, loss: 3.925547, best loss: 3.032990 2025-01-16 01:23:13,793 - INFO - step 3231, loss: 4.118956, best loss: 3.032990 2025-01-16 01:23:13,943 - INFO - step 3232, loss: 4.092425, best loss: 3.032990 2025-01-16 01:23:14,093 - INFO - step 3233, loss: 4.048551, best loss: 3.032990 2025-01-16 01:23:14,243 - INFO - step 3234, loss: 4.124149, best 
loss: 3.032990 2025-01-16 01:23:14,393 - INFO - step 3235, loss: 4.374495, best loss: 3.032990 2025-01-16 01:23:14,543 - INFO - step 3236, loss: 3.781086, best loss: 3.032990 2025-01-16 01:23:14,693 - INFO - step 3237, loss: 3.755286, best loss: 3.032990 2025-01-16 01:23:14,844 - INFO - step 3238, loss: 4.108246, best loss: 3.032990 2025-01-16 01:23:14,994 - INFO - step 3239, loss: 4.261688, best loss: 3.032990 2025-01-16 01:23:15,144 - INFO - step 3240, loss: 3.792383, best loss: 3.032990 2025-01-16 01:23:15,294 - INFO - step 3241, loss: 3.735245, best loss: 3.032990 2025-01-16 01:23:15,444 - INFO - step 3242, loss: 4.022185, best loss: 3.032990 2025-01-16 01:23:15,594 - INFO - step 3243, loss: 3.993513, best loss: 3.032990 2025-01-16 01:23:15,744 - INFO - step 3244, loss: 3.676503, best loss: 3.032990 2025-01-16 01:23:15,894 - INFO - step 3245, loss: 4.125330, best loss: 3.032990 2025-01-16 01:23:16,045 - INFO - step 3246, loss: 3.843191, best loss: 3.032990 2025-01-16 01:23:16,195 - INFO - step 3247, loss: 3.728904, best loss: 3.032990 2025-01-16 01:23:16,345 - INFO - step 3248, loss: 3.692912, best loss: 3.032990 2025-01-16 01:23:16,495 - INFO - step 3249, loss: 3.941427, best loss: 3.032990 2025-01-16 01:23:16,645 - INFO - step 3250, loss: 3.548203, best loss: 3.032990 2025-01-16 01:23:16,795 - INFO - step 3251, loss: 3.809163, best loss: 3.032990 2025-01-16 01:23:16,945 - INFO - step 3252, loss: 3.563151, best loss: 3.032990 2025-01-16 01:23:17,095 - INFO - step 3253, loss: 4.095707, best loss: 3.032990 2025-01-16 01:23:17,246 - INFO - step 3254, loss: 4.402151, best loss: 3.032990 2025-01-16 01:23:17,396 - INFO - step 3255, loss: 4.794961, best loss: 3.032990 2025-01-16 01:23:17,546 - INFO - step 3256, loss: 4.218788, best loss: 3.032990 2025-01-16 01:23:17,696 - INFO - step 3257, loss: 4.392452, best loss: 3.032990 2025-01-16 01:23:17,846 - INFO - step 3258, loss: 4.225094, best loss: 3.032990 2025-01-16 01:23:17,996 - INFO - step 3259, loss: 4.127989, best 
loss: 3.032990 2025-01-16 01:23:18,147 - INFO - step 3260, loss: 3.919231, best loss: 3.032990 2025-01-16 01:23:18,297 - INFO - step 3261, loss: 4.252003, best loss: 3.032990 2025-01-16 01:23:18,447 - INFO - step 3262, loss: 3.906394, best loss: 3.032990 2025-01-16 01:23:18,597 - INFO - step 3263, loss: 3.602199, best loss: 3.032990 2025-01-16 01:23:18,747 - INFO - step 3264, loss: 3.988419, best loss: 3.032990 2025-01-16 01:23:18,897 - INFO - step 3265, loss: 3.902469, best loss: 3.032990 2025-01-16 01:23:19,048 - INFO - step 3266, loss: 4.104125, best loss: 3.032990 2025-01-16 01:23:19,198 - INFO - step 3267, loss: 3.113061, best loss: 3.032990 2025-01-16 01:23:19,348 - INFO - step 3268, loss: 3.857027, best loss: 3.032990 2025-01-16 01:23:19,498 - INFO - step 3269, loss: 4.042940, best loss: 3.032990 2025-01-16 01:23:19,649 - INFO - step 3270, loss: 4.120718, best loss: 3.032990 2025-01-16 01:23:19,799 - INFO - step 3271, loss: 3.892148, best loss: 3.032990 2025-01-16 01:23:19,950 - INFO - step 3272, loss: 3.898039, best loss: 3.032990 2025-01-16 01:23:20,100 - INFO - step 3273, loss: 4.001248, best loss: 3.032990 2025-01-16 01:23:20,250 - INFO - step 3274, loss: 3.679629, best loss: 3.032990 2025-01-16 01:23:20,400 - INFO - step 3275, loss: 4.096514, best loss: 3.032990 2025-01-16 01:23:20,550 - INFO - step 3276, loss: 3.958922, best loss: 3.032990 2025-01-16 01:23:20,700 - INFO - step 3277, loss: 4.017973, best loss: 3.032990 2025-01-16 01:23:20,850 - INFO - step 3278, loss: 3.652013, best loss: 3.032990 2025-01-16 01:23:21,000 - INFO - step 3279, loss: 3.694582, best loss: 3.032990 2025-01-16 01:23:21,151 - INFO - step 3280, loss: 3.736484, best loss: 3.032990 2025-01-16 01:23:21,301 - INFO - step 3281, loss: 3.671647, best loss: 3.032990 2025-01-16 01:23:21,451 - INFO - step 3282, loss: 3.705178, best loss: 3.032990 2025-01-16 01:23:21,601 - INFO - step 3283, loss: 3.696593, best loss: 3.032990 2025-01-16 01:23:21,751 - INFO - step 3284, loss: 3.629798, best 
loss: 3.032990 2025-01-16 01:23:21,901 - INFO - step 3285, loss: 3.413370, best loss: 3.032990 2025-01-16 01:23:22,051 - INFO - step 3286, loss: 3.349458, best loss: 3.032990 2025-01-16 01:23:22,201 - INFO - step 3287, loss: 3.066962, best loss: 3.032990 2025-01-16 01:23:22,351 - INFO - step 3288, loss: 3.924691, best loss: 3.032990 2025-01-16 01:23:22,501 - INFO - step 3289, loss: 4.275050, best loss: 3.032990 2025-01-16 01:23:22,651 - INFO - step 3290, loss: 4.373735, best loss: 3.032990 2025-01-16 01:23:22,801 - INFO - step 3291, loss: 4.489044, best loss: 3.032990 2025-01-16 01:23:22,951 - INFO - step 3292, loss: 4.546803, best loss: 3.032990 2025-01-16 01:23:23,101 - INFO - step 3293, loss: 4.027902, best loss: 3.032990 2025-01-16 01:23:23,251 - INFO - step 3294, loss: 4.249963, best loss: 3.032990 2025-01-16 01:23:23,401 - INFO - step 3295, loss: 4.360065, best loss: 3.032990 2025-01-16 01:23:23,551 - INFO - step 3296, loss: 3.972949, best loss: 3.032990 2025-01-16 01:23:23,702 - INFO - step 3297, loss: 3.442331, best loss: 3.032990 2025-01-16 01:23:23,851 - INFO - step 3298, loss: 3.874717, best loss: 3.032990 2025-01-16 01:23:24,002 - INFO - step 3299, loss: 3.697071, best loss: 3.032990 2025-01-16 01:23:24,152 - INFO - step 3300, loss: 4.288491, best loss: 3.032990 2025-01-16 01:23:24,302 - INFO - step 3301, loss: 4.137943, best loss: 3.032990 2025-01-16 01:23:24,452 - INFO - step 3302, loss: 4.255213, best loss: 3.032990 2025-01-16 01:23:24,602 - INFO - step 3303, loss: 3.986466, best loss: 3.032990 2025-01-16 01:23:24,752 - INFO - step 3304, loss: 4.343746, best loss: 3.032990 2025-01-16 01:23:24,902 - INFO - step 3305, loss: 3.819178, best loss: 3.032990 2025-01-16 01:23:25,052 - INFO - step 3306, loss: 4.267361, best loss: 3.032990 2025-01-16 01:23:25,203 - INFO - step 3307, loss: 3.963764, best loss: 3.032990 2025-01-16 01:23:25,353 - INFO - step 3308, loss: 4.201184, best loss: 3.032990 2025-01-16 01:23:25,503 - INFO - step 3309, loss: 3.969689, best 
loss: 3.032990 2025-01-16 01:23:25,653 - INFO - step 3310, loss: 3.872808, best loss: 3.032990 2025-01-16 01:23:25,803 - INFO - step 3311, loss: 4.552954, best loss: 3.032990 2025-01-16 01:23:25,954 - INFO - step 3312, loss: 3.756157, best loss: 3.032990 2025-01-16 01:23:26,104 - INFO - step 3313, loss: 4.218941, best loss: 3.032990 2025-01-16 01:23:26,254 - INFO - step 3314, loss: 4.161405, best loss: 3.032990 2025-01-16 01:23:26,405 - INFO - step 3315, loss: 4.200799, best loss: 3.032990 2025-01-16 01:23:26,555 - INFO - step 3316, loss: 4.109653, best loss: 3.032990 2025-01-16 01:23:26,706 - INFO - step 3317, loss: 3.810586, best loss: 3.032990 2025-01-16 01:23:26,856 - INFO - step 3318, loss: 3.802768, best loss: 3.032990 2025-01-16 01:23:27,006 - INFO - step 3319, loss: 3.955019, best loss: 3.032990 2025-01-16 01:23:27,156 - INFO - step 3320, loss: 3.433573, best loss: 3.032990 2025-01-16 01:23:27,306 - INFO - step 3321, loss: 4.263709, best loss: 3.032990 2025-01-16 01:23:27,456 - INFO - step 3322, loss: 3.482862, best loss: 3.032990 2025-01-16 01:23:27,607 - INFO - step 3323, loss: 3.362590, best loss: 3.032990 2025-01-16 01:23:27,757 - INFO - step 3324, loss: 3.938577, best loss: 3.032990 2025-01-16 01:23:27,907 - INFO - step 3325, loss: 3.860414, best loss: 3.032990 2025-01-16 01:23:28,057 - INFO - step 3326, loss: 3.850409, best loss: 3.032990 2025-01-16 01:23:28,207 - INFO - step 3327, loss: 3.584881, best loss: 3.032990 2025-01-16 01:23:28,357 - INFO - step 3328, loss: 3.801656, best loss: 3.032990 2025-01-16 01:23:28,507 - INFO - step 3329, loss: 3.887040, best loss: 3.032990 2025-01-16 01:23:28,657 - INFO - step 3330, loss: 3.712665, best loss: 3.032990 2025-01-16 01:23:28,807 - INFO - step 3331, loss: 3.604837, best loss: 3.032990 2025-01-16 01:23:28,957 - INFO - step 3332, loss: 4.040088, best loss: 3.032990 2025-01-16 01:23:29,107 - INFO - step 3333, loss: 3.970340, best loss: 3.032990 2025-01-16 01:23:29,257 - INFO - step 3334, loss: 3.917416, best 
loss: 3.032990 2025-01-16 01:23:29,408 - INFO - step 3335, loss: 3.585504, best loss: 3.032990 2025-01-16 01:23:29,558 - INFO - step 3336, loss: 3.836026, best loss: 3.032990 2025-01-16 01:23:29,708 - INFO - step 3337, loss: 4.158009, best loss: 3.032990 2025-01-16 01:23:29,858 - INFO - step 3338, loss: 3.682891, best loss: 3.032990 2025-01-16 01:23:30,008 - INFO - step 3339, loss: 4.156803, best loss: 3.032990 2025-01-16 01:23:30,158 - INFO - step 3340, loss: 4.211882, best loss: 3.032990 2025-01-16 01:23:30,308 - INFO - step 3341, loss: 4.094454, best loss: 3.032990 2025-01-16 01:23:30,459 - INFO - step 3342, loss: 4.101450, best loss: 3.032990 2025-01-16 01:23:30,609 - INFO - step 3343, loss: 4.086987, best loss: 3.032990 2025-01-16 01:23:30,759 - INFO - step 3344, loss: 3.999254, best loss: 3.032990 2025-01-16 01:23:30,909 - INFO - step 3345, loss: 3.754822, best loss: 3.032990 2025-01-16 01:23:31,059 - INFO - step 3346, loss: 4.569148, best loss: 3.032990 2025-01-16 01:23:31,209 - INFO - step 3347, loss: 4.013231, best loss: 3.032990 2025-01-16 01:23:31,359 - INFO - step 3348, loss: 4.326182, best loss: 3.032990 2025-01-16 01:23:31,509 - INFO - step 3349, loss: 3.436412, best loss: 3.032990 2025-01-16 01:23:31,659 - INFO - step 3350, loss: 3.471414, best loss: 3.032990 2025-01-16 01:23:31,809 - INFO - step 3351, loss: 3.982650, best loss: 3.032990 2025-01-16 01:23:31,959 - INFO - step 3352, loss: 4.019873, best loss: 3.032990 2025-01-16 01:23:32,109 - INFO - step 3353, loss: 3.812449, best loss: 3.032990 2025-01-16 01:23:32,259 - INFO - step 3354, loss: 3.922161, best loss: 3.032990 2025-01-16 01:23:32,410 - INFO - step 3355, loss: 3.736054, best loss: 3.032990 2025-01-16 01:23:32,560 - INFO - step 3356, loss: 3.876873, best loss: 3.032990 2025-01-16 01:23:32,710 - INFO - step 3357, loss: 4.097229, best loss: 3.032990 2025-01-16 01:23:32,860 - INFO - step 3358, loss: 3.631974, best loss: 3.032990 2025-01-16 01:23:33,010 - INFO - step 3359, loss: 3.754151, best 
loss: 3.032990 2025-01-16 01:23:33,160 - INFO - step 3360, loss: 3.917462, best loss: 3.032990 2025-01-16 01:23:33,310 - INFO - step 3361, loss: 4.128898, best loss: 3.032990 2025-01-16 01:23:33,460 - INFO - step 3362, loss: 3.965872, best loss: 3.032990 2025-01-16 01:23:33,610 - INFO - step 3363, loss: 4.158767, best loss: 3.032990 2025-01-16 01:23:33,760 - INFO - step 3364, loss: 3.985852, best loss: 3.032990 2025-01-16 01:23:33,910 - INFO - step 3365, loss: 3.514523, best loss: 3.032990 2025-01-16 01:23:34,060 - INFO - step 3366, loss: 3.966143, best loss: 3.032990 2025-01-16 01:23:34,210 - INFO - step 3367, loss: 3.301582, best loss: 3.032990 2025-01-16 01:23:34,361 - INFO - step 3368, loss: 3.691691, best loss: 3.032990 2025-01-16 01:23:34,511 - INFO - step 3369, loss: 3.872281, best loss: 3.032990 2025-01-16 01:23:34,661 - INFO - step 3370, loss: 3.761220, best loss: 3.032990 2025-01-16 01:23:34,811 - INFO - step 3371, loss: 3.700638, best loss: 3.032990 2025-01-16 01:23:34,961 - INFO - step 3372, loss: 4.026441, best loss: 3.032990 2025-01-16 01:23:35,111 - INFO - step 3373, loss: 4.206919, best loss: 3.032990 2025-01-16 01:23:35,261 - INFO - step 3374, loss: 4.020310, best loss: 3.032990 2025-01-16 01:23:35,411 - INFO - step 3375, loss: 4.285516, best loss: 3.032990 2025-01-16 01:23:35,561 - INFO - step 3376, loss: 3.921047, best loss: 3.032990 2025-01-16 01:23:35,711 - INFO - step 3377, loss: 3.965133, best loss: 3.032990 2025-01-16 01:23:35,862 - INFO - step 3378, loss: 3.721079, best loss: 3.032990 2025-01-16 01:23:36,012 - INFO - step 3379, loss: 3.403732, best loss: 3.032990 2025-01-16 01:23:36,162 - INFO - step 3380, loss: 4.295722, best loss: 3.032990 2025-01-16 01:23:36,312 - INFO - step 3381, loss: 4.180847, best loss: 3.032990 2025-01-16 01:23:36,462 - INFO - step 3382, loss: 3.810323, best loss: 3.032990 2025-01-16 01:23:36,612 - INFO - step 3383, loss: 3.734004, best loss: 3.032990 2025-01-16 01:23:36,762 - INFO - step 3384, loss: 3.518665, best 
loss: 3.032990 2025-01-16 01:23:36,912 - INFO - step 3385, loss: 3.715517, best loss: 3.032990 2025-01-16 01:23:37,062 - INFO - step 3386, loss: 3.679636, best loss: 3.032990 2025-01-16 01:23:37,212 - INFO - step 3387, loss: 3.622524, best loss: 3.032990 2025-01-16 01:23:37,362 - INFO - step 3388, loss: 4.260224, best loss: 3.032990 2025-01-16 01:23:37,512 - INFO - step 3389, loss: 4.168526, best loss: 3.032990 2025-01-16 01:23:37,662 - INFO - step 3390, loss: 3.925157, best loss: 3.032990 2025-01-16 01:23:37,812 - INFO - step 3391, loss: 4.287516, best loss: 3.032990 2025-01-16 01:23:37,962 - INFO - step 3392, loss: 3.934618, best loss: 3.032990 2025-01-16 01:23:38,112 - INFO - step 3393, loss: 4.245953, best loss: 3.032990 2025-01-16 01:23:38,262 - INFO - step 3394, loss: 4.389229, best loss: 3.032990 2025-01-16 01:23:38,412 - INFO - step 3395, loss: 4.393506, best loss: 3.032990 2025-01-16 01:23:38,563 - INFO - step 3396, loss: 4.486455, best loss: 3.032990 2025-01-16 01:23:38,713 - INFO - step 3397, loss: 4.108725, best loss: 3.032990 2025-01-16 01:23:38,863 - INFO - step 3398, loss: 4.373447, best loss: 3.032990 2025-01-16 01:23:39,013 - INFO - step 3399, loss: 4.109004, best loss: 3.032990 2025-01-16 01:23:39,163 - INFO - step 3400, loss: 3.897531, best loss: 3.032990 2025-01-16 01:23:39,313 - INFO - step 3401, loss: 4.389658, best loss: 3.032990 2025-01-16 01:23:39,463 - INFO - step 3402, loss: 4.373769, best loss: 3.032990 2025-01-16 01:23:39,614 - INFO - step 3403, loss: 4.277227, best loss: 3.032990 2025-01-16 01:23:39,764 - INFO - step 3404, loss: 4.014180, best loss: 3.032990 2025-01-16 01:23:39,914 - INFO - step 3405, loss: 4.262276, best loss: 3.032990 2025-01-16 01:23:40,064 - INFO - step 3406, loss: 4.046994, best loss: 3.032990 2025-01-16 01:23:40,214 - INFO - step 3407, loss: 3.784466, best loss: 3.032990 2025-01-16 01:23:40,364 - INFO - step 3408, loss: 3.821136, best loss: 3.032990 2025-01-16 01:23:40,514 - INFO - step 3409, loss: 4.238941, best 
loss: 3.032990 2025-01-16 01:23:40,664 - INFO - step 3410, loss: 4.213163, best loss: 3.032990 2025-01-16 01:23:40,814 - INFO - step 3411, loss: 4.444112, best loss: 3.032990 2025-01-16 01:23:40,964 - INFO - step 3412, loss: 4.366406, best loss: 3.032990 2025-01-16 01:23:41,114 - INFO - step 3413, loss: 3.944360, best loss: 3.032990 2025-01-16 01:23:41,264 - INFO - step 3414, loss: 4.397336, best loss: 3.032990 2025-01-16 01:23:41,414 - INFO - step 3415, loss: 3.951515, best loss: 3.032990 2025-01-16 01:23:41,565 - INFO - step 3416, loss: 4.314756, best loss: 3.032990 2025-01-16 01:23:41,715 - INFO - step 3417, loss: 3.923228, best loss: 3.032990 2025-01-16 01:23:41,865 - INFO - step 3418, loss: 4.031953, best loss: 3.032990 2025-01-16 01:23:42,015 - INFO - step 3419, loss: 4.032898, best loss: 3.032990 2025-01-16 01:23:42,165 - INFO - step 3420, loss: 4.074542, best loss: 3.032990 2025-01-16 01:23:42,315 - INFO - step 3421, loss: 3.935102, best loss: 3.032990 2025-01-16 01:23:42,465 - INFO - step 3422, loss: 4.106686, best loss: 3.032990 2025-01-16 01:23:42,616 - INFO - step 3423, loss: 3.748803, best loss: 3.032990 2025-01-16 01:23:42,766 - INFO - step 3424, loss: 3.433330, best loss: 3.032990 2025-01-16 01:23:42,916 - INFO - step 3425, loss: 3.574983, best loss: 3.032990 2025-01-16 01:23:43,066 - INFO - step 3426, loss: 3.849843, best loss: 3.032990 2025-01-16 01:23:43,216 - INFO - step 3427, loss: 4.529129, best loss: 3.032990 2025-01-16 01:23:43,366 - INFO - step 3428, loss: 3.992183, best loss: 3.032990 2025-01-16 01:23:43,516 - INFO - step 3429, loss: 3.692487, best loss: 3.032990 2025-01-16 01:23:43,666 - INFO - step 3430, loss: 4.280567, best loss: 3.032990 2025-01-16 01:23:43,816 - INFO - step 3431, loss: 3.954625, best loss: 3.032990 2025-01-16 01:23:43,967 - INFO - step 3432, loss: 4.547367, best loss: 3.032990 2025-01-16 01:23:44,117 - INFO - step 3433, loss: 3.861763, best loss: 3.032990 2025-01-16 01:23:44,267 - INFO - step 3434, loss: 4.207866, best 
loss: 3.032990 2025-01-16 01:23:44,417 - INFO - step 3435, loss: 3.889360, best loss: 3.032990 2025-01-16 01:23:44,567 - INFO - step 3436, loss: 4.479663, best loss: 3.032990 2025-01-16 01:23:44,717 - INFO - step 3437, loss: 4.025543, best loss: 3.032990 2025-01-16 01:23:44,867 - INFO - step 3438, loss: 3.751172, best loss: 3.032990 2025-01-16 01:23:45,017 - INFO - step 3439, loss: 4.204502, best loss: 3.032990 2025-01-16 01:23:45,167 - INFO - step 3440, loss: 3.925639, best loss: 3.032990 2025-01-16 01:23:45,317 - INFO - step 3441, loss: 3.709970, best loss: 3.032990 2025-01-16 01:23:45,467 - INFO - step 3442, loss: 4.584757, best loss: 3.032990 2025-01-16 01:23:45,617 - INFO - step 3443, loss: 4.047235, best loss: 3.032990 2025-01-16 01:23:45,767 - INFO - step 3444, loss: 3.511861, best loss: 3.032990 2025-01-16 01:23:45,917 - INFO - step 3445, loss: 3.853812, best loss: 3.032990 2025-01-16 01:23:46,068 - INFO - step 3446, loss: 4.133864, best loss: 3.032990 2025-01-16 01:23:46,217 - INFO - step 3447, loss: 4.029939, best loss: 3.032990 2025-01-16 01:23:46,368 - INFO - step 3448, loss: 3.983393, best loss: 3.032990 2025-01-16 01:23:46,518 - INFO - step 3449, loss: 3.834946, best loss: 3.032990 2025-01-16 01:23:46,668 - INFO - step 3450, loss: 4.397630, best loss: 3.032990 2025-01-16 01:23:46,818 - INFO - step 3451, loss: 4.174963, best loss: 3.032990 2025-01-16 01:23:46,968 - INFO - step 3452, loss: 4.130631, best loss: 3.032990 2025-01-16 01:23:47,117 - INFO - step 3453, loss: 3.756940, best loss: 3.032990 2025-01-16 01:23:47,267 - INFO - step 3454, loss: 4.122260, best loss: 3.032990 2025-01-16 01:23:47,418 - INFO - step 3455, loss: 3.968500, best loss: 3.032990 2025-01-16 01:23:47,568 - INFO - step 3456, loss: 3.505757, best loss: 3.032990 2025-01-16 01:23:47,718 - INFO - step 3457, loss: 4.045893, best loss: 3.032990 2025-01-16 01:23:47,868 - INFO - step 3458, loss: 3.670455, best loss: 3.032990 2025-01-16 01:23:48,018 - INFO - step 3459, loss: 4.030027, best 
loss: 3.032990 2025-01-16 01:23:48,168 - INFO - step 3460, loss: 3.821303, best loss: 3.032990 2025-01-16 01:23:48,318 - INFO - step 3461, loss: 3.958694, best loss: 3.032990 2025-01-16 01:23:48,468 - INFO - step 3462, loss: 3.828406, best loss: 3.032990 2025-01-16 01:23:48,618 - INFO - step 3463, loss: 4.061922, best loss: 3.032990 2025-01-16 01:23:48,769 - INFO - step 3464, loss: 4.332158, best loss: 3.032990 2025-01-16 01:23:48,919 - INFO - step 3465, loss: 4.130800, best loss: 3.032990 2025-01-16 01:23:49,069 - INFO - step 3466, loss: 4.252311, best loss: 3.032990 2025-01-16 01:23:49,219 - INFO - step 3467, loss: 3.740616, best loss: 3.032990 2025-01-16 01:23:49,369 - INFO - step 3468, loss: 4.097504, best loss: 3.032990 2025-01-16 01:23:49,519 - INFO - step 3469, loss: 4.166971, best loss: 3.032990 2025-01-16 01:23:49,669 - INFO - step 3470, loss: 3.830734, best loss: 3.032990 2025-01-16 01:23:49,820 - INFO - step 3471, loss: 3.647730, best loss: 3.032990 2025-01-16 01:23:49,970 - INFO - step 3472, loss: 3.657609, best loss: 3.032990 2025-01-16 01:23:50,120 - INFO - step 3473, loss: 3.825354, best loss: 3.032990 2025-01-16 01:23:50,270 - INFO - step 3474, loss: 3.948323, best loss: 3.032990 2025-01-16 01:23:50,420 - INFO - step 3475, loss: 4.248864, best loss: 3.032990 2025-01-16 01:23:50,570 - INFO - step 3476, loss: 4.443284, best loss: 3.032990 2025-01-16 01:23:50,721 - INFO - step 3477, loss: 4.204480, best loss: 3.032990 2025-01-16 01:23:50,871 - INFO - step 3478, loss: 4.332364, best loss: 3.032990 2025-01-16 01:23:51,021 - INFO - step 3479, loss: 4.224205, best loss: 3.032990 2025-01-16 01:23:51,171 - INFO - step 3480, loss: 4.186951, best loss: 3.032990 2025-01-16 01:23:51,321 - INFO - step 3481, loss: 3.626064, best loss: 3.032990 2025-01-16 01:23:51,471 - INFO - step 3482, loss: 4.233764, best loss: 3.032990 2025-01-16 01:23:51,622 - INFO - step 3483, loss: 4.474945, best loss: 3.032990 2025-01-16 01:23:51,772 - INFO - step 3484, loss: 4.044428, best 
loss: 3.032990 2025-01-16 01:23:51,922 - INFO - step 3485, loss: 4.127686, best loss: 3.032990 2025-01-16 01:23:52,072 - INFO - step 3486, loss: 4.119731, best loss: 3.032990 2025-01-16 01:23:52,223 - INFO - step 3487, loss: 3.768391, best loss: 3.032990 2025-01-16 01:23:55,730 - INFO - step 3488, loss: 2.947698, best loss: 2.947698 2025-01-16 01:23:55,893 - INFO - step 3489, loss: 3.968420, best loss: 2.947698 2025-01-16 01:23:56,051 - INFO - step 3490, loss: 4.220069, best loss: 2.947698 2025-01-16 01:23:56,201 - INFO - step 3491, loss: 4.209546, best loss: 2.947698 2025-01-16 01:23:56,352 - INFO - step 3492, loss: 4.168770, best loss: 2.947698 2025-01-16 01:23:56,502 - INFO - step 3493, loss: 3.793020, best loss: 2.947698 2025-01-16 01:23:56,652 - INFO - step 3494, loss: 4.053180, best loss: 2.947698 2025-01-16 01:23:56,802 - INFO - step 3495, loss: 4.100843, best loss: 2.947698 2025-01-16 01:23:56,953 - INFO - step 3496, loss: 3.803852, best loss: 2.947698 2025-01-16 01:23:57,103 - INFO - step 3497, loss: 4.042656, best loss: 2.947698 2025-01-16 01:23:57,253 - INFO - step 3498, loss: 3.904171, best loss: 2.947698 2025-01-16 01:23:57,403 - INFO - step 3499, loss: 3.725825, best loss: 2.947698 2025-01-16 01:23:57,553 - INFO - step 3500, loss: 4.157477, best loss: 2.947698 2025-01-16 01:23:57,703 - INFO - step 3501, loss: 3.679832, best loss: 2.947698 2025-01-16 01:23:57,854 - INFO - step 3502, loss: 4.012032, best loss: 2.947698 2025-01-16 01:23:58,005 - INFO - step 3503, loss: 4.305735, best loss: 2.947698 2025-01-16 01:23:58,155 - INFO - step 3504, loss: 3.910416, best loss: 2.947698 2025-01-16 01:23:58,305 - INFO - step 3505, loss: 3.482448, best loss: 2.947698 2025-01-16 01:23:58,455 - INFO - step 3506, loss: 4.050472, best loss: 2.947698 2025-01-16 01:23:58,605 - INFO - step 3507, loss: 4.264566, best loss: 2.947698 2025-01-16 01:23:58,756 - INFO - step 3508, loss: 4.386326, best loss: 2.947698 2025-01-16 01:23:58,906 - INFO - step 3509, loss: 4.150109, best 
loss: 2.947698 2025-01-16 01:23:59,056 - INFO - step 3510, loss: 4.250046, best loss: 2.947698 2025-01-16 01:23:59,206 - INFO - step 3511, loss: 4.033556, best loss: 2.947698 2025-01-16 01:23:59,356 - INFO - step 3512, loss: 4.377664, best loss: 2.947698 2025-01-16 01:23:59,507 - INFO - step 3513, loss: 3.867661, best loss: 2.947698 2025-01-16 01:23:59,657 - INFO - step 3514, loss: 3.971378, best loss: 2.947698 2025-01-16 01:23:59,807 - INFO - step 3515, loss: 4.145796, best loss: 2.947698 2025-01-16 01:23:59,957 - INFO - step 3516, loss: 4.183806, best loss: 2.947698 2025-01-16 01:24:00,107 - INFO - step 3517, loss: 4.089601, best loss: 2.947698 2025-01-16 01:24:00,258 - INFO - step 3518, loss: 3.825933, best loss: 2.947698 2025-01-16 01:24:00,408 - INFO - step 3519, loss: 3.867658, best loss: 2.947698 2025-01-16 01:24:00,558 - INFO - step 3520, loss: 4.100204, best loss: 2.947698 2025-01-16 01:24:00,708 - INFO - step 3521, loss: 4.338399, best loss: 2.947698 2025-01-16 01:24:00,859 - INFO - step 3522, loss: 4.077178, best loss: 2.947698 2025-01-16 01:24:01,009 - INFO - step 3523, loss: 4.429143, best loss: 2.947698 2025-01-16 01:24:01,159 - INFO - step 3524, loss: 4.484186, best loss: 2.947698 2025-01-16 01:24:01,309 - INFO - step 3525, loss: 4.404580, best loss: 2.947698 2025-01-16 01:24:01,459 - INFO - step 3526, loss: 4.606185, best loss: 2.947698 2025-01-16 01:24:01,609 - INFO - step 3527, loss: 4.456061, best loss: 2.947698 2025-01-16 01:24:01,760 - INFO - step 3528, loss: 3.818591, best loss: 2.947698 2025-01-16 01:24:01,910 - INFO - step 3529, loss: 4.407452, best loss: 2.947698 2025-01-16 01:24:02,060 - INFO - step 3530, loss: 4.303061, best loss: 2.947698 2025-01-16 01:24:02,210 - INFO - step 3531, loss: 4.425319, best loss: 2.947698 2025-01-16 01:24:02,360 - INFO - step 3532, loss: 3.833764, best loss: 2.947698 2025-01-16 01:24:02,511 - INFO - step 3533, loss: 4.167942, best loss: 2.947698 2025-01-16 01:24:02,661 - INFO - step 3534, loss: 4.111135, best 
loss: 2.947698 2025-01-16 01:24:02,811 - INFO - step 3535, loss: 4.025977, best loss: 2.947698 2025-01-16 01:24:02,961 - INFO - step 3536, loss: 4.319889, best loss: 2.947698 2025-01-16 01:24:03,111 - INFO - step 3537, loss: 3.845034, best loss: 2.947698 2025-01-16 01:24:03,261 - INFO - step 3538, loss: 3.832825, best loss: 2.947698 2025-01-16 01:24:03,412 - INFO - step 3539, loss: 4.360984, best loss: 2.947698 2025-01-16 01:24:03,562 - INFO - step 3540, loss: 4.239736, best loss: 2.947698 2025-01-16 01:24:03,712 - INFO - step 3541, loss: 4.367884, best loss: 2.947698 2025-01-16 01:24:03,862 - INFO - step 3542, loss: 4.132208, best loss: 2.947698 2025-01-16 01:24:04,012 - INFO - step 3543, loss: 4.608447, best loss: 2.947698 2025-01-16 01:24:04,163 - INFO - step 3544, loss: 4.370927, best loss: 2.947698 2025-01-16 01:24:04,313 - INFO - step 3545, loss: 3.947270, best loss: 2.947698 2025-01-16 01:24:04,464 - INFO - step 3546, loss: 3.926391, best loss: 2.947698 2025-01-16 01:24:04,614 - INFO - step 3547, loss: 4.378272, best loss: 2.947698 2025-01-16 01:24:04,764 - INFO - step 3548, loss: 4.033756, best loss: 2.947698 2025-01-16 01:24:04,914 - INFO - step 3549, loss: 3.985259, best loss: 2.947698 2025-01-16 01:24:05,064 - INFO - step 3550, loss: 4.313403, best loss: 2.947698 2025-01-16 01:24:05,214 - INFO - step 3551, loss: 4.264491, best loss: 2.947698 2025-01-16 01:24:05,365 - INFO - step 3552, loss: 4.197161, best loss: 2.947698 2025-01-16 01:24:05,515 - INFO - step 3553, loss: 3.814412, best loss: 2.947698 2025-01-16 01:24:05,665 - INFO - step 3554, loss: 3.533997, best loss: 2.947698 2025-01-16 01:24:05,815 - INFO - step 3555, loss: 3.596254, best loss: 2.947698 2025-01-16 01:24:05,965 - INFO - step 3556, loss: 3.617623, best loss: 2.947698 2025-01-16 01:24:06,115 - INFO - step 3557, loss: 3.733792, best loss: 2.947698 2025-01-16 01:24:06,266 - INFO - step 3558, loss: 4.115147, best loss: 2.947698 2025-01-16 01:24:06,416 - INFO - step 3559, loss: 3.986310, best 
loss: 2.947698 2025-01-16 01:24:06,566 - INFO - step 3560, loss: 3.876972, best loss: 2.947698 2025-01-16 01:24:06,716 - INFO - step 3561, loss: 4.062320, best loss: 2.947698 2025-01-16 01:24:06,867 - INFO - step 3562, loss: 4.000645, best loss: 2.947698 2025-01-16 01:24:07,017 - INFO - step 3563, loss: 3.946775, best loss: 2.947698 2025-01-16 01:24:07,167 - INFO - step 3564, loss: 4.031703, best loss: 2.947698 2025-01-16 01:24:07,317 - INFO - step 3565, loss: 4.215952, best loss: 2.947698 2025-01-16 01:24:07,467 - INFO - step 3566, loss: 3.682231, best loss: 2.947698 2025-01-16 01:24:07,617 - INFO - step 3567, loss: 3.622506, best loss: 2.947698 2025-01-16 01:24:07,768 - INFO - step 3568, loss: 3.974883, best loss: 2.947698 2025-01-16 01:24:07,918 - INFO - step 3569, loss: 4.093096, best loss: 2.947698 2025-01-16 01:24:08,068 - INFO - step 3570, loss: 3.608951, best loss: 2.947698 2025-01-16 01:24:08,218 - INFO - step 3571, loss: 3.597836, best loss: 2.947698 2025-01-16 01:24:08,368 - INFO - step 3572, loss: 3.969797, best loss: 2.947698 2025-01-16 01:24:08,519 - INFO - step 3573, loss: 3.930456, best loss: 2.947698 2025-01-16 01:24:08,669 - INFO - step 3574, loss: 3.615350, best loss: 2.947698 2025-01-16 01:24:08,819 - INFO - step 3575, loss: 4.006296, best loss: 2.947698 2025-01-16 01:24:08,969 - INFO - step 3576, loss: 3.762796, best loss: 2.947698 2025-01-16 01:24:09,119 - INFO - step 3577, loss: 3.634935, best loss: 2.947698 2025-01-16 01:24:09,270 - INFO - step 3578, loss: 3.574179, best loss: 2.947698 2025-01-16 01:24:09,421 - INFO - step 3579, loss: 3.856407, best loss: 2.947698 2025-01-16 01:24:09,571 - INFO - step 3580, loss: 3.431597, best loss: 2.947698 2025-01-16 01:24:09,722 - INFO - step 3581, loss: 3.693842, best loss: 2.947698 2025-01-16 01:24:09,872 - INFO - step 3582, loss: 3.465998, best loss: 2.947698 2025-01-16 01:24:10,023 - INFO - step 3583, loss: 4.010042, best loss: 2.947698 2025-01-16 01:24:10,173 - INFO - step 3584, loss: 4.288721, best 
loss: 2.947698 2025-01-16 01:24:10,323 - INFO - step 3585, loss: 4.604991, best loss: 2.947698 2025-01-16 01:24:10,473 - INFO - step 3586, loss: 4.065432, best loss: 2.947698 2025-01-16 01:24:10,624 - INFO - step 3587, loss: 4.236596, best loss: 2.947698 2025-01-16 01:24:10,774 - INFO - step 3588, loss: 4.072531, best loss: 2.947698 2025-01-16 01:24:10,924 - INFO - step 3589, loss: 3.964726, best loss: 2.947698 2025-01-16 01:24:11,074 - INFO - step 3590, loss: 3.838192, best loss: 2.947698 2025-01-16 01:24:11,225 - INFO - step 3591, loss: 4.158020, best loss: 2.947698 2025-01-16 01:24:11,375 - INFO - step 3592, loss: 3.781246, best loss: 2.947698 2025-01-16 01:24:11,525 - INFO - step 3593, loss: 3.474874, best loss: 2.947698 2025-01-16 01:24:11,675 - INFO - step 3594, loss: 3.802394, best loss: 2.947698 2025-01-16 01:24:11,826 - INFO - step 3595, loss: 3.762087, best loss: 2.947698 2025-01-16 01:24:11,976 - INFO - step 3596, loss: 3.952297, best loss: 2.947698 2025-01-16 01:24:12,125 - INFO - step 3597, loss: 3.056185, best loss: 2.947698 2025-01-16 01:24:12,275 - INFO - step 3598, loss: 3.761377, best loss: 2.947698 2025-01-16 01:24:12,426 - INFO - step 3599, loss: 4.008686, best loss: 2.947698 2025-01-16 01:24:12,576 - INFO - step 3600, loss: 4.033391, best loss: 2.947698 2025-01-16 01:24:12,726 - INFO - step 3601, loss: 3.796759, best loss: 2.947698 2025-01-16 01:24:12,876 - INFO - step 3602, loss: 3.765258, best loss: 2.947698 2025-01-16 01:24:13,026 - INFO - step 3603, loss: 3.815337, best loss: 2.947698 2025-01-16 01:24:13,176 - INFO - step 3604, loss: 3.485785, best loss: 2.947698 2025-01-16 01:24:13,326 - INFO - step 3605, loss: 3.946011, best loss: 2.947698 2025-01-16 01:24:13,477 - INFO - step 3606, loss: 3.825161, best loss: 2.947698 2025-01-16 01:24:13,627 - INFO - step 3607, loss: 3.890555, best loss: 2.947698 2025-01-16 01:24:13,776 - INFO - step 3608, loss: 3.552333, best loss: 2.947698 2025-01-16 01:24:13,927 - INFO - step 3609, loss: 3.569695, best 
loss: 2.947698 2025-01-16 01:24:14,077 - INFO - step 3610, loss: 3.616378, best loss: 2.947698 2025-01-16 01:24:14,227 - INFO - step 3611, loss: 3.533747, best loss: 2.947698 2025-01-16 01:24:14,377 - INFO - step 3612, loss: 3.614899, best loss: 2.947698 2025-01-16 01:24:14,527 - INFO - step 3613, loss: 3.625174, best loss: 2.947698 2025-01-16 01:24:14,678 - INFO - step 3614, loss: 3.515146, best loss: 2.947698 2025-01-16 01:24:14,828 - INFO - step 3615, loss: 3.275401, best loss: 2.947698 2025-01-16 01:24:14,978 - INFO - step 3616, loss: 3.227636, best loss: 2.947698 2025-01-16 01:24:15,128 - INFO - step 3617, loss: 2.976954, best loss: 2.947698 2025-01-16 01:24:15,278 - INFO - step 3618, loss: 3.824914, best loss: 2.947698 2025-01-16 01:24:15,428 - INFO - step 3619, loss: 4.162131, best loss: 2.947698 2025-01-16 01:24:15,578 - INFO - step 3620, loss: 4.237042, best loss: 2.947698 2025-01-16 01:24:15,728 - INFO - step 3621, loss: 4.340729, best loss: 2.947698 2025-01-16 01:24:15,878 - INFO - step 3622, loss: 4.413512, best loss: 2.947698 2025-01-16 01:24:16,029 - INFO - step 3623, loss: 3.955682, best loss: 2.947698 2025-01-16 01:24:16,179 - INFO - step 3624, loss: 4.156771, best loss: 2.947698 2025-01-16 01:24:16,329 - INFO - step 3625, loss: 4.288327, best loss: 2.947698 2025-01-16 01:24:16,479 - INFO - step 3626, loss: 3.834558, best loss: 2.947698 2025-01-16 01:24:16,629 - INFO - step 3627, loss: 3.361422, best loss: 2.947698 2025-01-16 01:24:16,779 - INFO - step 3628, loss: 3.757439, best loss: 2.947698 2025-01-16 01:24:16,929 - INFO - step 3629, loss: 3.606851, best loss: 2.947698 2025-01-16 01:24:17,079 - INFO - step 3630, loss: 4.264467, best loss: 2.947698 2025-01-16 01:24:17,230 - INFO - step 3631, loss: 4.072685, best loss: 2.947698 2025-01-16 01:24:17,380 - INFO - step 3632, loss: 4.112666, best loss: 2.947698 2025-01-16 01:24:17,530 - INFO - step 3633, loss: 3.892811, best loss: 2.947698 2025-01-16 01:24:17,680 - INFO - step 3634, loss: 4.210742, best 
loss: 2.947698 2025-01-16 01:24:17,830 - INFO - step 3635, loss: 3.723072, best loss: 2.947698 2025-01-16 01:24:17,980 - INFO - step 3636, loss: 4.163930, best loss: 2.947698 2025-01-16 01:24:18,130 - INFO - step 3637, loss: 3.876914, best loss: 2.947698 2025-01-16 01:24:18,280 - INFO - step 3638, loss: 4.137752, best loss: 2.947698 2025-01-16 01:24:18,430 - INFO - step 3639, loss: 3.877169, best loss: 2.947698 2025-01-16 01:24:18,580 - INFO - step 3640, loss: 3.777941, best loss: 2.947698 2025-01-16 01:24:18,730 - INFO - step 3641, loss: 4.450007, best loss: 2.947698 2025-01-16 01:24:18,881 - INFO - step 3642, loss: 3.694408, best loss: 2.947698 2025-01-16 01:24:19,030 - INFO - step 3643, loss: 4.130129, best loss: 2.947698 2025-01-16 01:24:19,180 - INFO - step 3644, loss: 4.066130, best loss: 2.947698 2025-01-16 01:24:19,331 - INFO - step 3645, loss: 4.096249, best loss: 2.947698 2025-01-16 01:24:19,481 - INFO - step 3646, loss: 3.999646, best loss: 2.947698 2025-01-16 01:24:19,631 - INFO - step 3647, loss: 3.766980, best loss: 2.947698 2025-01-16 01:24:19,782 - INFO - step 3648, loss: 3.698332, best loss: 2.947698 2025-01-16 01:24:19,932 - INFO - step 3649, loss: 3.830639, best loss: 2.947698 2025-01-16 01:24:20,082 - INFO - step 3650, loss: 3.363824, best loss: 2.947698 2025-01-16 01:24:20,232 - INFO - step 3651, loss: 4.190815, best loss: 2.947698 2025-01-16 01:24:20,382 - INFO - step 3652, loss: 3.384311, best loss: 2.947698 2025-01-16 01:24:20,533 - INFO - step 3653, loss: 3.269495, best loss: 2.947698 2025-01-16 01:24:20,683 - INFO - step 3654, loss: 3.824272, best loss: 2.947698 2025-01-16 01:24:20,833 - INFO - step 3655, loss: 3.764939, best loss: 2.947698 2025-01-16 01:24:20,983 - INFO - step 3656, loss: 3.759455, best loss: 2.947698 2025-01-16 01:24:21,133 - INFO - step 3657, loss: 3.483178, best loss: 2.947698 2025-01-16 01:24:21,283 - INFO - step 3658, loss: 3.743241, best loss: 2.947698 2025-01-16 01:24:21,433 - INFO - step 3659, loss: 3.821221, best 
loss: 2.947698 2025-01-16 01:24:21,583 - INFO - step 3660, loss: 3.641813, best loss: 2.947698 2025-01-16 01:24:21,734 - INFO - step 3661, loss: 3.525318, best loss: 2.947698 2025-01-16 01:24:21,884 - INFO - step 3662, loss: 3.913915, best loss: 2.947698 2025-01-16 01:24:22,034 - INFO - step 3663, loss: 3.853338, best loss: 2.947698 2025-01-16 01:24:22,184 - INFO - step 3664, loss: 3.813550, best loss: 2.947698 2025-01-16 01:24:22,334 - INFO - step 3665, loss: 3.421324, best loss: 2.947698 2025-01-16 01:24:22,484 - INFO - step 3666, loss: 3.702125, best loss: 2.947698 2025-01-16 01:24:22,634 - INFO - step 3667, loss: 4.019238, best loss: 2.947698 2025-01-16 01:24:22,784 - INFO - step 3668, loss: 3.606860, best loss: 2.947698 2025-01-16 01:24:22,934 - INFO - step 3669, loss: 4.039696, best loss: 2.947698 2025-01-16 01:24:23,084 - INFO - step 3670, loss: 4.135503, best loss: 2.947698 2025-01-16 01:24:23,234 - INFO - step 3671, loss: 4.048470, best loss: 2.947698 2025-01-16 01:24:23,384 - INFO - step 3672, loss: 3.982802, best loss: 2.947698 2025-01-16 01:24:23,534 - INFO - step 3673, loss: 3.950770, best loss: 2.947698 2025-01-16 01:24:23,684 - INFO - step 3674, loss: 3.849736, best loss: 2.947698 2025-01-16 01:24:23,834 - INFO - step 3675, loss: 3.637872, best loss: 2.947698 2025-01-16 01:24:23,985 - INFO - step 3676, loss: 4.532047, best loss: 2.947698 2025-01-16 01:24:24,135 - INFO - step 3677, loss: 4.013798, best loss: 2.947698 2025-01-16 01:24:24,285 - INFO - step 3678, loss: 4.338875, best loss: 2.947698 2025-01-16 01:24:24,435 - INFO - step 3679, loss: 3.368396, best loss: 2.947698 2025-01-16 01:24:24,585 - INFO - step 3680, loss: 3.409135, best loss: 2.947698 2025-01-16 01:24:24,735 - INFO - step 3681, loss: 3.844902, best loss: 2.947698 2025-01-16 01:24:24,885 - INFO - step 3682, loss: 3.829017, best loss: 2.947698 2025-01-16 01:24:25,035 - INFO - step 3683, loss: 3.722204, best loss: 2.947698 2025-01-16 01:24:25,185 - INFO - step 3684, loss: 3.858030, best 
loss: 2.947698 2025-01-16 01:24:25,335 - INFO - step 3685, loss: 3.681008, best loss: 2.947698 2025-01-16 01:24:25,485 - INFO - step 3686, loss: 3.814608, best loss: 2.947698 2025-01-16 01:24:25,635 - INFO - step 3687, loss: 3.992924, best loss: 2.947698 2025-01-16 01:24:25,786 - INFO - step 3688, loss: 3.468903, best loss: 2.947698 2025-01-16 01:24:25,936 - INFO - step 3689, loss: 3.649072, best loss: 2.947698 2025-01-16 01:24:26,086 - INFO - step 3690, loss: 3.778645, best loss: 2.947698 2025-01-16 01:24:26,236 - INFO - step 3691, loss: 3.983792, best loss: 2.947698 2025-01-16 01:24:26,386 - INFO - step 3692, loss: 3.825849, best loss: 2.947698 2025-01-16 01:24:26,537 - INFO - step 3693, loss: 4.035639, best loss: 2.947698 2025-01-16 01:24:26,687 - INFO - step 3694, loss: 3.863699, best loss: 2.947698 2025-01-16 01:24:26,837 - INFO - step 3695, loss: 3.413689, best loss: 2.947698 2025-01-16 01:24:26,987 - INFO - step 3696, loss: 3.869694, best loss: 2.947698 2025-01-16 01:24:27,136 - INFO - step 3697, loss: 3.189808, best loss: 2.947698 2025-01-16 01:24:27,286 - INFO - step 3698, loss: 3.575990, best loss: 2.947698 2025-01-16 01:24:27,436 - INFO - step 3699, loss: 3.691443, best loss: 2.947698 2025-01-16 01:24:27,587 - INFO - step 3700, loss: 3.609389, best loss: 2.947698 2025-01-16 01:24:27,737 - INFO - step 3701, loss: 3.585994, best loss: 2.947698 2025-01-16 01:24:27,888 - INFO - step 3702, loss: 3.874209, best loss: 2.947698 2025-01-16 01:24:28,038 - INFO - step 3703, loss: 4.084913, best loss: 2.947698 2025-01-16 01:24:28,188 - INFO - step 3704, loss: 3.917672, best loss: 2.947698 2025-01-16 01:24:28,338 - INFO - step 3705, loss: 4.190057, best loss: 2.947698 2025-01-16 01:24:28,488 - INFO - step 3706, loss: 3.775151, best loss: 2.947698 2025-01-16 01:24:28,638 - INFO - step 3707, loss: 3.886026, best loss: 2.947698 2025-01-16 01:24:28,789 - INFO - step 3708, loss: 3.655191, best loss: 2.947698 2025-01-16 01:24:28,939 - INFO - step 3709, loss: 3.314407, best 
loss: 2.947698 2025-01-16 01:24:29,089 - INFO - step 3710, loss: 4.169142, best loss: 2.947698 2025-01-16 01:24:29,239 - INFO - step 3711, loss: 4.062656, best loss: 2.947698 2025-01-16 01:24:29,389 - INFO - step 3712, loss: 3.708129, best loss: 2.947698 2025-01-16 01:24:29,539 - INFO - step 3713, loss: 3.599591, best loss: 2.947698 2025-01-16 01:24:29,689 - INFO - step 3714, loss: 3.379622, best loss: 2.947698 2025-01-16 01:24:29,840 - INFO - step 3715, loss: 3.588333, best loss: 2.947698 2025-01-16 01:24:29,990 - INFO - step 3716, loss: 3.545320, best loss: 2.947698 2025-01-16 01:24:30,140 - INFO - step 3717, loss: 3.495472, best loss: 2.947698 2025-01-16 01:24:30,290 - INFO - step 3718, loss: 4.085364, best loss: 2.947698 2025-01-16 01:24:30,440 - INFO - step 3719, loss: 4.056590, best loss: 2.947698 2025-01-16 01:24:30,590 - INFO - step 3720, loss: 3.868677, best loss: 2.947698 2025-01-16 01:24:30,740 - INFO - step 3721, loss: 4.161688, best loss: 2.947698 2025-01-16 01:24:30,890 - INFO - step 3722, loss: 3.793206, best loss: 2.947698 2025-01-16 01:24:31,040 - INFO - step 3723, loss: 4.118195, best loss: 2.947698 2025-01-16 01:24:31,190 - INFO - step 3724, loss: 4.228546, best loss: 2.947698 2025-01-16 01:24:31,341 - INFO - step 3725, loss: 4.269008, best loss: 2.947698 2025-01-16 01:24:31,491 - INFO - step 3726, loss: 4.347136, best loss: 2.947698 2025-01-16 01:24:31,641 - INFO - step 3727, loss: 3.985318, best loss: 2.947698 2025-01-16 01:24:31,791 - INFO - step 3728, loss: 4.237364, best loss: 2.947698 2025-01-16 01:24:31,941 - INFO - step 3729, loss: 4.058208, best loss: 2.947698 2025-01-16 01:24:32,091 - INFO - step 3730, loss: 3.813615, best loss: 2.947698 2025-01-16 01:24:32,241 - INFO - step 3731, loss: 4.278013, best loss: 2.947698 2025-01-16 01:24:32,391 - INFO - step 3732, loss: 4.195570, best loss: 2.947698 2025-01-16 01:24:32,541 - INFO - step 3733, loss: 4.141093, best loss: 2.947698 2025-01-16 01:24:32,691 - INFO - step 3734, loss: 3.974894, best 
loss: 2.947698 2025-01-16 01:24:32,841 - INFO - step 3735, loss: 4.211939, best loss: 2.947698 2025-01-16 01:24:32,992 - INFO - step 3736, loss: 3.976809, best loss: 2.947698 2025-01-16 01:24:33,142 - INFO - step 3737, loss: 3.734197, best loss: 2.947698 2025-01-16 01:24:33,292 - INFO - step 3738, loss: 3.720016, best loss: 2.947698 2025-01-16 01:24:33,442 - INFO - step 3739, loss: 4.114801, best loss: 2.947698 2025-01-16 01:24:33,592 - INFO - step 3740, loss: 4.046737, best loss: 2.947698 2025-01-16 01:24:33,742 - INFO - step 3741, loss: 4.282284, best loss: 2.947698 2025-01-16 01:24:33,893 - INFO - step 3742, loss: 4.223146, best loss: 2.947698 2025-01-16 01:24:34,043 - INFO - step 3743, loss: 3.818010, best loss: 2.947698 2025-01-16 01:24:34,193 - INFO - step 3744, loss: 4.254355, best loss: 2.947698 2025-01-16 01:24:34,343 - INFO - step 3745, loss: 3.826915, best loss: 2.947698 2025-01-16 01:24:34,493 - INFO - step 3746, loss: 4.204926, best loss: 2.947698 2025-01-16 01:24:34,644 - INFO - step 3747, loss: 3.823200, best loss: 2.947698 2025-01-16 01:24:34,794 - INFO - step 3748, loss: 3.918926, best loss: 2.947698 2025-01-16 01:24:34,944 - INFO - step 3749, loss: 3.922204, best loss: 2.947698 2025-01-16 01:24:35,094 - INFO - step 3750, loss: 3.904200, best loss: 2.947698 2025-01-16 01:24:35,244 - INFO - step 3751, loss: 3.788534, best loss: 2.947698 2025-01-16 01:24:35,394 - INFO - step 3752, loss: 3.961417, best loss: 2.947698 2025-01-16 01:24:35,544 - INFO - step 3753, loss: 3.576726, best loss: 2.947698 2025-01-16 01:24:35,694 - INFO - step 3754, loss: 3.301471, best loss: 2.947698 2025-01-16 01:24:35,844 - INFO - step 3755, loss: 3.501863, best loss: 2.947698 2025-01-16 01:24:35,994 - INFO - step 3756, loss: 3.734057, best loss: 2.947698 2025-01-16 01:24:36,144 - INFO - step 3757, loss: 4.407058, best loss: 2.947698 2025-01-16 01:24:36,294 - INFO - step 3758, loss: 3.859008, best loss: 2.947698 2025-01-16 01:24:36,445 - INFO - step 3759, loss: 3.551201, best 
loss: 2.947698 2025-01-16 01:24:36,595 - INFO - step 3760, loss: 4.120705, best loss: 2.947698 2025-01-16 01:24:36,745 - INFO - step 3761, loss: 3.798837, best loss: 2.947698 2025-01-16 01:24:36,895 - INFO - step 3762, loss: 4.390978, best loss: 2.947698 2025-01-16 01:24:37,045 - INFO - step 3763, loss: 3.667530, best loss: 2.947698 2025-01-16 01:24:37,195 - INFO - step 3764, loss: 4.032388, best loss: 2.947698 2025-01-16 01:24:37,345 - INFO - step 3765, loss: 3.823746, best loss: 2.947698 2025-01-16 01:24:37,495 - INFO - step 3766, loss: 4.348568, best loss: 2.947698 2025-01-16 01:24:37,645 - INFO - step 3767, loss: 3.973110, best loss: 2.947698 2025-01-16 01:24:37,795 - INFO - step 3768, loss: 3.679356, best loss: 2.947698 2025-01-16 01:24:37,945 - INFO - step 3769, loss: 4.120759, best loss: 2.947698 2025-01-16 01:24:38,095 - INFO - step 3770, loss: 3.831556, best loss: 2.947698 2025-01-16 01:24:38,245 - INFO - step 3771, loss: 3.631988, best loss: 2.947698 2025-01-16 01:24:38,395 - INFO - step 3772, loss: 4.403734, best loss: 2.947698 2025-01-16 01:24:38,545 - INFO - step 3773, loss: 3.910824, best loss: 2.947698 2025-01-16 01:24:38,695 - INFO - step 3774, loss: 3.452652, best loss: 2.947698 2025-01-16 01:24:38,845 - INFO - step 3775, loss: 3.731200, best loss: 2.947698 2025-01-16 01:24:38,995 - INFO - step 3776, loss: 4.032889, best loss: 2.947698 2025-01-16 01:24:39,145 - INFO - step 3777, loss: 3.898661, best loss: 2.947698 2025-01-16 01:24:39,295 - INFO - step 3778, loss: 3.811236, best loss: 2.947698 2025-01-16 01:24:39,446 - INFO - step 3779, loss: 3.674406, best loss: 2.947698 2025-01-16 01:24:39,597 - INFO - step 3780, loss: 4.287160, best loss: 2.947698 2025-01-16 01:24:39,747 - INFO - step 3781, loss: 4.117008, best loss: 2.947698 2025-01-16 01:24:39,897 - INFO - step 3782, loss: 4.014122, best loss: 2.947698 2025-01-16 01:24:40,047 - INFO - step 3783, loss: 3.660758, best loss: 2.947698 2025-01-16 01:24:40,197 - INFO - step 3784, loss: 3.939502, best 
loss: 2.947698 2025-01-16 01:24:40,347 - INFO - step 3785, loss: 3.884572, best loss: 2.947698 2025-01-16 01:24:40,497 - INFO - step 3786, loss: 3.466031, best loss: 2.947698 2025-01-16 01:24:40,648 - INFO - step 3787, loss: 3.988115, best loss: 2.947698 2025-01-16 01:24:40,798 - INFO - step 3788, loss: 3.636434, best loss: 2.947698 2025-01-16 01:24:40,948 - INFO - step 3789, loss: 3.938540, best loss: 2.947698 2025-01-16 01:24:41,098 - INFO - step 3790, loss: 3.721897, best loss: 2.947698 2025-01-16 01:24:41,248 - INFO - step 3791, loss: 3.811526, best loss: 2.947698 2025-01-16 01:24:41,397 - INFO - step 3792, loss: 3.653351, best loss: 2.947698 2025-01-16 01:24:41,548 - INFO - step 3793, loss: 3.854807, best loss: 2.947698 2025-01-16 01:24:41,698 - INFO - step 3794, loss: 4.211572, best loss: 2.947698 2025-01-16 01:24:41,848 - INFO - step 3795, loss: 4.049437, best loss: 2.947698 2025-01-16 01:24:41,998 - INFO - step 3796, loss: 4.196865, best loss: 2.947698 2025-01-16 01:24:42,148 - INFO - step 3797, loss: 3.703216, best loss: 2.947698 2025-01-16 01:24:42,298 - INFO - step 3798, loss: 3.985195, best loss: 2.947698 2025-01-16 01:24:42,448 - INFO - step 3799, loss: 3.992273, best loss: 2.947698 2025-01-16 01:24:42,598 - INFO - step 3800, loss: 3.633135, best loss: 2.947698 2025-01-16 01:24:42,748 - INFO - step 3801, loss: 3.533285, best loss: 2.947698 2025-01-16 01:24:42,898 - INFO - step 3802, loss: 3.553786, best loss: 2.947698 2025-01-16 01:24:43,048 - INFO - step 3803, loss: 3.709839, best loss: 2.947698 2025-01-16 01:24:43,198 - INFO - step 3804, loss: 3.904934, best loss: 2.947698 2025-01-16 01:24:43,348 - INFO - step 3805, loss: 4.160413, best loss: 2.947698 2025-01-16 01:24:43,498 - INFO - step 3806, loss: 4.365855, best loss: 2.947698 2025-01-16 01:24:43,648 - INFO - step 3807, loss: 4.128124, best loss: 2.947698 2025-01-16 01:24:43,798 - INFO - step 3808, loss: 4.225393, best loss: 2.947698 2025-01-16 01:24:43,948 - INFO - step 3809, loss: 4.036213, best 
loss: 2.947698 2025-01-16 01:24:44,098 - INFO - step 3810, loss: 4.015563, best loss: 2.947698 2025-01-16 01:24:44,248 - INFO - step 3811, loss: 3.463658, best loss: 2.947698 2025-01-16 01:24:44,398 - INFO - step 3812, loss: 4.088349, best loss: 2.947698 2025-01-16 01:24:44,548 - INFO - step 3813, loss: 4.363723, best loss: 2.947698 2025-01-16 01:24:44,698 - INFO - step 3814, loss: 4.006876, best loss: 2.947698 2025-01-16 01:24:44,848 - INFO - step 3815, loss: 4.049538, best loss: 2.947698 2025-01-16 01:24:44,998 - INFO - step 3816, loss: 4.066018, best loss: 2.947698 2025-01-16 01:24:45,148 - INFO - step 3817, loss: 3.627277, best loss: 2.947698 2025-01-16 01:24:48,570 - INFO - step 3818, loss: 2.862297, best loss: 2.862297 2025-01-16 01:24:48,734 - INFO - step 3819, loss: 3.811579, best loss: 2.862297 2025-01-16 01:24:48,890 - INFO - step 3820, loss: 4.000829, best loss: 2.862297 2025-01-16 01:24:49,041 - INFO - step 3821, loss: 4.031079, best loss: 2.862297 2025-01-16 01:24:49,191 - INFO - step 3822, loss: 4.054297, best loss: 2.862297 2025-01-16 01:24:49,342 - INFO - step 3823, loss: 3.718392, best loss: 2.862297 2025-01-16 01:24:49,492 - INFO - step 3824, loss: 3.939955, best loss: 2.862297 2025-01-16 01:24:49,642 - INFO - step 3825, loss: 3.960021, best loss: 2.862297 2025-01-16 01:24:49,793 - INFO - step 3826, loss: 3.700551, best loss: 2.862297 2025-01-16 01:24:49,943 - INFO - step 3827, loss: 3.915506, best loss: 2.862297 2025-01-16 01:24:50,093 - INFO - step 3828, loss: 3.765263, best loss: 2.862297 2025-01-16 01:24:50,243 - INFO - step 3829, loss: 3.605289, best loss: 2.862297 2025-01-16 01:24:50,393 - INFO - step 3830, loss: 3.972132, best loss: 2.862297 2025-01-16 01:24:50,543 - INFO - step 3831, loss: 3.563951, best loss: 2.862297 2025-01-16 01:24:50,693 - INFO - step 3832, loss: 3.899652, best loss: 2.862297 2025-01-16 01:24:50,843 - INFO - step 3833, loss: 4.186865, best loss: 2.862297 2025-01-16 01:24:50,994 - INFO - step 3834, loss: 3.783490, best 
loss: 2.862297 2025-01-16 01:24:51,144 - INFO - step 3835, loss: 3.334876, best loss: 2.862297 2025-01-16 01:24:51,294 - INFO - step 3836, loss: 3.911667, best loss: 2.862297 2025-01-16 01:24:51,445 - INFO - step 3837, loss: 4.092468, best loss: 2.862297 2025-01-16 01:24:51,595 - INFO - step 3838, loss: 4.209922, best loss: 2.862297 2025-01-16 01:24:51,745 - INFO - step 3839, loss: 4.046938, best loss: 2.862297 2025-01-16 01:24:51,895 - INFO - step 3840, loss: 4.138184, best loss: 2.862297 2025-01-16 01:24:52,045 - INFO - step 3841, loss: 3.968220, best loss: 2.862297 2025-01-16 01:24:52,195 - INFO - step 3842, loss: 4.274323, best loss: 2.862297 2025-01-16 01:24:52,346 - INFO - step 3843, loss: 3.771345, best loss: 2.862297 2025-01-16 01:24:52,496 - INFO - step 3844, loss: 3.852623, best loss: 2.862297 2025-01-16 01:24:52,646 - INFO - step 3845, loss: 4.007262, best loss: 2.862297 2025-01-16 01:24:52,796 - INFO - step 3846, loss: 4.025417, best loss: 2.862297 2025-01-16 01:24:52,946 - INFO - step 3847, loss: 3.958184, best loss: 2.862297 2025-01-16 01:24:53,097 - INFO - step 3848, loss: 3.757015, best loss: 2.862297 2025-01-16 01:24:53,247 - INFO - step 3849, loss: 3.785891, best loss: 2.862297 2025-01-16 01:24:53,397 - INFO - step 3850, loss: 4.008401, best loss: 2.862297 2025-01-16 01:24:53,547 - INFO - step 3851, loss: 4.274368, best loss: 2.862297 2025-01-16 01:24:53,697 - INFO - step 3852, loss: 3.962058, best loss: 2.862297 2025-01-16 01:24:53,847 - INFO - step 3853, loss: 4.247274, best loss: 2.862297 2025-01-16 01:24:53,997 - INFO - step 3854, loss: 4.301385, best loss: 2.862297 2025-01-16 01:24:54,147 - INFO - step 3855, loss: 4.209593, best loss: 2.862297 2025-01-16 01:24:54,298 - INFO - step 3856, loss: 4.436292, best loss: 2.862297 2025-01-16 01:24:54,448 - INFO - step 3857, loss: 4.326220, best loss: 2.862297 2025-01-16 01:24:54,598 - INFO - step 3858, loss: 3.749420, best loss: 2.862297 2025-01-16 01:24:54,748 - INFO - step 3859, loss: 4.228268, best 
loss: 2.862297 2025-01-16 01:24:54,898 - INFO - step 3860, loss: 4.152245, best loss: 2.862297 2025-01-16 01:24:55,048 - INFO - step 3861, loss: 4.236738, best loss: 2.862297 2025-01-16 01:24:55,199 - INFO - step 3862, loss: 3.701937, best loss: 2.862297 2025-01-16 01:24:55,349 - INFO - step 3863, loss: 4.067257, best loss: 2.862297 2025-01-16 01:24:55,499 - INFO - step 3864, loss: 4.011086, best loss: 2.862297 2025-01-16 01:24:55,649 - INFO - step 3865, loss: 3.877909, best loss: 2.862297 2025-01-16 01:24:55,820 - INFO - step 3866, loss: 4.162027, best loss: 2.862297 2025-01-16 01:24:55,970 - INFO - step 3867, loss: 3.681622, best loss: 2.862297 2025-01-16 01:24:56,120 - INFO - step 3868, loss: 3.731231, best loss: 2.862297 2025-01-16 01:24:56,270 - INFO - step 3869, loss: 4.235391, best loss: 2.862297 2025-01-16 01:24:56,420 - INFO - step 3870, loss: 4.093062, best loss: 2.862297 2025-01-16 01:24:56,570 - INFO - step 3871, loss: 4.210660, best loss: 2.862297 2025-01-16 01:24:56,721 - INFO - step 3872, loss: 4.024991, best loss: 2.862297 2025-01-16 01:24:56,871 - INFO - step 3873, loss: 4.391644, best loss: 2.862297 2025-01-16 01:24:57,021 - INFO - step 3874, loss: 4.264544, best loss: 2.862297 2025-01-16 01:24:57,171 - INFO - step 3875, loss: 3.851300, best loss: 2.862297 2025-01-16 01:24:57,321 - INFO - step 3876, loss: 3.810200, best loss: 2.862297 2025-01-16 01:24:57,471 - INFO - step 3877, loss: 4.209718, best loss: 2.862297 2025-01-16 01:24:57,621 - INFO - step 3878, loss: 3.830763, best loss: 2.862297 2025-01-16 01:24:57,771 - INFO - step 3879, loss: 3.676741, best loss: 2.862297 2025-01-16 01:24:57,921 - INFO - step 3880, loss: 4.106415, best loss: 2.862297 2025-01-16 01:24:58,072 - INFO - step 3881, loss: 4.119777, best loss: 2.862297 2025-01-16 01:24:58,222 - INFO - step 3882, loss: 4.074101, best loss: 2.862297 2025-01-16 01:24:58,372 - INFO - step 3883, loss: 3.718267, best loss: 2.862297 2025-01-16 01:24:58,522 - INFO - step 3884, loss: 3.434718, best 
loss: 2.862297 2025-01-16 01:24:58,672 - INFO - step 3885, loss: 3.462926, best loss: 2.862297 2025-01-16 01:24:58,823 - INFO - step 3886, loss: 3.455864, best loss: 2.862297 2025-01-16 01:24:58,974 - INFO - step 3887, loss: 3.589790, best loss: 2.862297 2025-01-16 01:24:59,124 - INFO - step 3888, loss: 3.983812, best loss: 2.862297 2025-01-16 01:24:59,274 - INFO - step 3889, loss: 3.769390, best loss: 2.862297 2025-01-16 01:24:59,425 - INFO - step 3890, loss: 3.699566, best loss: 2.862297 2025-01-16 01:24:59,576 - INFO - step 3891, loss: 3.919184, best loss: 2.862297 2025-01-16 01:24:59,726 - INFO - step 3892, loss: 3.846298, best loss: 2.862297 2025-01-16 01:24:59,876 - INFO - step 3893, loss: 3.837106, best loss: 2.862297 2025-01-16 01:25:00,026 - INFO - step 3894, loss: 3.973562, best loss: 2.862297 2025-01-16 01:25:00,176 - INFO - step 3895, loss: 4.127296, best loss: 2.862297 2025-01-16 01:25:00,326 - INFO - step 3896, loss: 3.650044, best loss: 2.862297 2025-01-16 01:25:00,476 - INFO - step 3897, loss: 3.526114, best loss: 2.862297 2025-01-16 01:25:00,626 - INFO - step 3898, loss: 3.822014, best loss: 2.862297 2025-01-16 01:25:00,776 - INFO - step 3899, loss: 3.958873, best loss: 2.862297 2025-01-16 01:25:00,926 - INFO - step 3900, loss: 3.525235, best loss: 2.862297 2025-01-16 01:25:01,076 - INFO - step 3901, loss: 3.522563, best loss: 2.862297 2025-01-16 01:25:01,226 - INFO - step 3902, loss: 3.841271, best loss: 2.862297 2025-01-16 01:25:01,376 - INFO - step 3903, loss: 3.771253, best loss: 2.862297 2025-01-16 01:25:01,527 - INFO - step 3904, loss: 3.457503, best loss: 2.862297 2025-01-16 01:25:01,677 - INFO - step 3905, loss: 3.897023, best loss: 2.862297 2025-01-16 01:25:01,827 - INFO - step 3906, loss: 3.621536, best loss: 2.862297 2025-01-16 01:25:01,977 - INFO - step 3907, loss: 3.538185, best loss: 2.862297 2025-01-16 01:25:02,127 - INFO - step 3908, loss: 3.508503, best loss: 2.862297 2025-01-16 01:25:02,277 - INFO - step 3909, loss: 3.799652, best 
loss: 2.862297 2025-01-16 01:25:02,428 - INFO - step 3910, loss: 3.387300, best loss: 2.862297 2025-01-16 01:25:02,578 - INFO - step 3911, loss: 3.592125, best loss: 2.862297 2025-01-16 01:25:02,727 - INFO - step 3912, loss: 3.385308, best loss: 2.862297 2025-01-16 01:25:02,878 - INFO - step 3913, loss: 3.925856, best loss: 2.862297 2025-01-16 01:25:03,028 - INFO - step 3914, loss: 4.179010, best loss: 2.862297 2025-01-16 01:25:03,178 - INFO - step 3915, loss: 4.480985, best loss: 2.862297 2025-01-16 01:25:03,328 - INFO - step 3916, loss: 3.955474, best loss: 2.862297 2025-01-16 01:25:03,479 - INFO - step 3917, loss: 4.118694, best loss: 2.862297 2025-01-16 01:25:03,629 - INFO - step 3918, loss: 3.909337, best loss: 2.862297 2025-01-16 01:25:03,779 - INFO - step 3919, loss: 3.843558, best loss: 2.862297 2025-01-16 01:25:03,929 - INFO - step 3920, loss: 3.696546, best loss: 2.862297 2025-01-16 01:25:04,079 - INFO - step 3921, loss: 4.023712, best loss: 2.862297 2025-01-16 01:25:04,230 - INFO - step 3922, loss: 3.666006, best loss: 2.862297 2025-01-16 01:25:04,380 - INFO - step 3923, loss: 3.409386, best loss: 2.862297 2025-01-16 01:25:04,530 - INFO - step 3924, loss: 3.728184, best loss: 2.862297 2025-01-16 01:25:04,680 - INFO - step 3925, loss: 3.632190, best loss: 2.862297 2025-01-16 01:25:04,831 - INFO - step 3926, loss: 3.868195, best loss: 2.862297 2025-01-16 01:25:04,981 - INFO - step 3927, loss: 2.940809, best loss: 2.862297 2025-01-16 01:25:05,131 - INFO - step 3928, loss: 3.664984, best loss: 2.862297 2025-01-16 01:25:05,281 - INFO - step 3929, loss: 3.842934, best loss: 2.862297 2025-01-16 01:25:05,431 - INFO - step 3930, loss: 3.887467, best loss: 2.862297 2025-01-16 01:25:05,582 - INFO - step 3931, loss: 3.652126, best loss: 2.862297 2025-01-16 01:25:05,732 - INFO - step 3932, loss: 3.669146, best loss: 2.862297 2025-01-16 01:25:05,882 - INFO - step 3933, loss: 3.715626, best loss: 2.862297 2025-01-16 01:25:06,032 - INFO - step 3934, loss: 3.407999, best 
loss: 2.862297 2025-01-16 01:25:06,182 - INFO - step 3935, loss: 3.781427, best loss: 2.862297 2025-01-16 01:25:06,332 - INFO - step 3936, loss: 3.668165, best loss: 2.862297 2025-01-16 01:25:06,483 - INFO - step 3937, loss: 3.705271, best loss: 2.862297 2025-01-16 01:25:06,633 - INFO - step 3938, loss: 3.368849, best loss: 2.862297 2025-01-16 01:25:06,783 - INFO - step 3939, loss: 3.443061, best loss: 2.862297 2025-01-16 01:25:06,933 - INFO - step 3940, loss: 3.474095, best loss: 2.862297 2025-01-16 01:25:07,083 - INFO - step 3941, loss: 3.366399, best loss: 2.862297 2025-01-16 01:25:07,233 - INFO - step 3942, loss: 3.514192, best loss: 2.862297 2025-01-16 01:25:07,384 - INFO - step 3943, loss: 3.513492, best loss: 2.862297 2025-01-16 01:25:07,534 - INFO - step 3944, loss: 3.433162, best loss: 2.862297 2025-01-16 01:25:07,684 - INFO - step 3945, loss: 3.181126, best loss: 2.862297 2025-01-16 01:25:07,834 - INFO - step 3946, loss: 3.122223, best loss: 2.862297 2025-01-16 01:25:07,984 - INFO - step 3947, loss: 2.892930, best loss: 2.862297 2025-01-16 01:25:08,134 - INFO - step 3948, loss: 3.651223, best loss: 2.862297 2025-01-16 01:25:08,284 - INFO - step 3949, loss: 3.988884, best loss: 2.862297 2025-01-16 01:25:08,434 - INFO - step 3950, loss: 4.062376, best loss: 2.862297 2025-01-16 01:25:08,584 - INFO - step 3951, loss: 4.187868, best loss: 2.862297 2025-01-16 01:25:08,734 - INFO - step 3952, loss: 4.260240, best loss: 2.862297 2025-01-16 01:25:08,884 - INFO - step 3953, loss: 3.852780, best loss: 2.862297 2025-01-16 01:25:09,035 - INFO - step 3954, loss: 4.037853, best loss: 2.862297 2025-01-16 01:25:09,185 - INFO - step 3955, loss: 4.183427, best loss: 2.862297 2025-01-16 01:25:09,334 - INFO - step 3956, loss: 3.798806, best loss: 2.862297 2025-01-16 01:25:09,485 - INFO - step 3957, loss: 3.298870, best loss: 2.862297 2025-01-16 01:25:09,635 - INFO - step 3958, loss: 3.680521, best loss: 2.862297 2025-01-16 01:25:09,785 - INFO - step 3959, loss: 3.478696, best 
loss: 2.862297 2025-01-16 01:25:09,936 - INFO - step 3960, loss: 4.115978, best loss: 2.862297 2025-01-16 01:25:10,086 - INFO - step 3961, loss: 3.882814, best loss: 2.862297 2025-01-16 01:25:10,236 - INFO - step 3962, loss: 3.981191, best loss: 2.862297 2025-01-16 01:25:10,386 - INFO - step 3963, loss: 3.811690, best loss: 2.862297 2025-01-16 01:25:10,536 - INFO - step 3964, loss: 4.081166, best loss: 2.862297 2025-01-16 01:25:10,687 - INFO - step 3965, loss: 3.610871, best loss: 2.862297 2025-01-16 01:25:10,837 - INFO - step 3966, loss: 4.066877, best loss: 2.862297 2025-01-16 01:25:10,987 - INFO - step 3967, loss: 3.785300, best loss: 2.862297 2025-01-16 01:25:11,137 - INFO - step 3968, loss: 4.017299, best loss: 2.862297 2025-01-16 01:25:11,287 - INFO - step 3969, loss: 3.799132, best loss: 2.862297 2025-01-16 01:25:11,437 - INFO - step 3970, loss: 3.656042, best loss: 2.862297 2025-01-16 01:25:11,587 - INFO - step 3971, loss: 4.259176, best loss: 2.862297 2025-01-16 01:25:11,737 - INFO - step 3972, loss: 3.561766, best loss: 2.862297 2025-01-16 01:25:11,888 - INFO - step 3973, loss: 3.989578, best loss: 2.862297 2025-01-16 01:25:12,038 - INFO - step 3974, loss: 4.004282, best loss: 2.862297 2025-01-16 01:25:12,188 - INFO - step 3975, loss: 4.006729, best loss: 2.862297 2025-01-16 01:25:12,338 - INFO - step 3976, loss: 3.892529, best loss: 2.862297 2025-01-16 01:25:12,489 - INFO - step 3977, loss: 3.664225, best loss: 2.862297 2025-01-16 01:25:12,639 - INFO - step 3978, loss: 3.623712, best loss: 2.862297 2025-01-16 01:25:12,790 - INFO - step 3979, loss: 3.742815, best loss: 2.862297 2025-01-16 01:25:12,940 - INFO - step 3980, loss: 3.268506, best loss: 2.862297 2025-01-16 01:25:13,090 - INFO - step 3981, loss: 4.112820, best loss: 2.862297 2025-01-16 01:25:13,240 - INFO - step 3982, loss: 3.264845, best loss: 2.862297 2025-01-16 01:25:13,390 - INFO - step 3983, loss: 3.204821, best loss: 2.862297 2025-01-16 01:25:13,540 - INFO - step 3984, loss: 3.747121, best 
loss: 2.862297 2025-01-16 01:25:13,690 - INFO - step 3985, loss: 3.705537, best loss: 2.862297 2025-01-16 01:25:13,840 - INFO - step 3986, loss: 3.666230, best loss: 2.862297 2025-01-16 01:25:13,990 - INFO - step 3987, loss: 3.354251, best loss: 2.862297 2025-01-16 01:25:14,140 - INFO - step 3988, loss: 3.593961, best loss: 2.862297 2025-01-16 01:25:14,290 - INFO - step 3989, loss: 3.675976, best loss: 2.862297 2025-01-16 01:25:14,440 - INFO - step 3990, loss: 3.548607, best loss: 2.862297 2025-01-16 01:25:14,590 - INFO - step 3991, loss: 3.415659, best loss: 2.862297 2025-01-16 01:25:14,740 - INFO - step 3992, loss: 3.806419, best loss: 2.862297 2025-01-16 01:25:14,891 - INFO - step 3993, loss: 3.710891, best loss: 2.862297 2025-01-16 01:25:15,041 - INFO - step 3994, loss: 3.682370, best loss: 2.862297 2025-01-16 01:25:15,190 - INFO - step 3995, loss: 3.340423, best loss: 2.862297 2025-01-16 01:25:15,341 - INFO - step 3996, loss: 3.577335, best loss: 2.862297 2025-01-16 01:25:15,491 - INFO - step 3997, loss: 3.915129, best loss: 2.862297 2025-01-16 01:25:15,641 - INFO - step 3998, loss: 3.461843, best loss: 2.862297 2025-01-16 01:25:15,792 - INFO - step 3999, loss: 3.874213, best loss: 2.862297 2025-01-16 01:25:15,942 - INFO - step 4000, loss: 4.021764, best loss: 2.862297 2025-01-16 01:25:16,092 - INFO - step 4001, loss: 3.957188, best loss: 2.862297 2025-01-16 01:25:16,243 - INFO - step 4002, loss: 3.910459, best loss: 2.862297 2025-01-16 01:25:16,393 - INFO - step 4003, loss: 3.844237, best loss: 2.862297 2025-01-16 01:25:16,543 - INFO - step 4004, loss: 3.704853, best loss: 2.862297 2025-01-16 01:25:16,693 - INFO - step 4005, loss: 3.553427, best loss: 2.862297 2025-01-16 01:25:16,843 - INFO - step 4006, loss: 4.302111, best loss: 2.862297 2025-01-16 01:25:16,993 - INFO - step 4007, loss: 3.811669, best loss: 2.862297 2025-01-16 01:25:17,143 - INFO - step 4008, loss: 4.163445, best loss: 2.862297 2025-01-16 01:25:17,293 - INFO - step 4009, loss: 3.256803, best 
loss: 2.862297 2025-01-16 01:25:17,444 - INFO - step 4010, loss: 3.389153, best loss: 2.862297 2025-01-16 01:25:17,594 - INFO - step 4011, loss: 3.857291, best loss: 2.862297 2025-01-16 01:25:17,743 - INFO - step 4012, loss: 3.820500, best loss: 2.862297 2025-01-16 01:25:17,894 - INFO - step 4013, loss: 3.647430, best loss: 2.862297 2025-01-16 01:25:18,044 - INFO - step 4014, loss: 3.742373, best loss: 2.862297 2025-01-16 01:25:18,194 - INFO - step 4015, loss: 3.512926, best loss: 2.862297 2025-01-16 01:25:18,344 - INFO - step 4016, loss: 3.687745, best loss: 2.862297 2025-01-16 01:25:18,494 - INFO - step 4017, loss: 3.873646, best loss: 2.862297 2025-01-16 01:25:18,645 - INFO - step 4018, loss: 3.443012, best loss: 2.862297 2025-01-16 01:25:18,795 - INFO - step 4019, loss: 3.654567, best loss: 2.862297 2025-01-16 01:25:18,945 - INFO - step 4020, loss: 3.748448, best loss: 2.862297 2025-01-16 01:25:19,095 - INFO - step 4021, loss: 3.894990, best loss: 2.862297 2025-01-16 01:25:19,245 - INFO - step 4022, loss: 3.750355, best loss: 2.862297 2025-01-16 01:25:19,395 - INFO - step 4023, loss: 3.922922, best loss: 2.862297 2025-01-16 01:25:19,546 - INFO - step 4024, loss: 3.732296, best loss: 2.862297 2025-01-16 01:25:19,696 - INFO - step 4025, loss: 3.350560, best loss: 2.862297 2025-01-16 01:25:19,846 - INFO - step 4026, loss: 3.736990, best loss: 2.862297 2025-01-16 01:25:19,996 - INFO - step 4027, loss: 3.123057, best loss: 2.862297 2025-01-16 01:25:20,146 - INFO - step 4028, loss: 3.516204, best loss: 2.862297 2025-01-16 01:25:20,296 - INFO - step 4029, loss: 3.664418, best loss: 2.862297 2025-01-16 01:25:20,446 - INFO - step 4030, loss: 3.645846, best loss: 2.862297 2025-01-16 01:25:20,596 - INFO - step 4031, loss: 3.569567, best loss: 2.862297 2025-01-16 01:25:20,747 - INFO - step 4032, loss: 3.793924, best loss: 2.862297 2025-01-16 01:25:20,896 - INFO - step 4033, loss: 4.036189, best loss: 2.862297 2025-01-16 01:25:21,047 - INFO - step 4034, loss: 3.758485, best 
loss: 2.862297 2025-01-16 01:25:21,197 - INFO - step 4035, loss: 4.002125, best loss: 2.862297 2025-01-16 01:25:21,347 - INFO - step 4036, loss: 3.679672, best loss: 2.862297 2025-01-16 01:25:21,497 - INFO - step 4037, loss: 3.754143, best loss: 2.862297 2025-01-16 01:25:21,647 - INFO - step 4038, loss: 3.581070, best loss: 2.862297 2025-01-16 01:25:21,797 - INFO - step 4039, loss: 3.272987, best loss: 2.862297 2025-01-16 01:25:21,947 - INFO - step 4040, loss: 4.137602, best loss: 2.862297 2025-01-16 01:25:22,097 - INFO - step 4041, loss: 4.003031, best loss: 2.862297 2025-01-16 01:25:22,247 - INFO - step 4042, loss: 3.610150, best loss: 2.862297 2025-01-16 01:25:22,398 - INFO - step 4043, loss: 3.506586, best loss: 2.862297 2025-01-16 01:25:22,548 - INFO - step 4044, loss: 3.302558, best loss: 2.862297 2025-01-16 01:25:22,698 - INFO - step 4045, loss: 3.546632, best loss: 2.862297 2025-01-16 01:25:22,848 - INFO - step 4046, loss: 3.457350, best loss: 2.862297 2025-01-16 01:25:22,999 - INFO - step 4047, loss: 3.399163, best loss: 2.862297 2025-01-16 01:25:23,149 - INFO - step 4048, loss: 4.043939, best loss: 2.862297 2025-01-16 01:25:23,299 - INFO - step 4049, loss: 3.945828, best loss: 2.862297 2025-01-16 01:25:23,449 - INFO - step 4050, loss: 3.716061, best loss: 2.862297 2025-01-16 01:25:23,600 - INFO - step 4051, loss: 4.048644, best loss: 2.862297 2025-01-16 01:25:23,750 - INFO - step 4052, loss: 3.665253, best loss: 2.862297 2025-01-16 01:25:23,900 - INFO - step 4053, loss: 3.982387, best loss: 2.862297 2025-01-16 01:25:24,050 - INFO - step 4054, loss: 4.102748, best loss: 2.862297 2025-01-16 01:25:24,200 - INFO - step 4055, loss: 4.125135, best loss: 2.862297 2025-01-16 01:25:24,350 - INFO - step 4056, loss: 4.205357, best loss: 2.862297 2025-01-16 01:25:24,500 - INFO - step 4057, loss: 3.820370, best loss: 2.862297 2025-01-16 01:25:24,650 - INFO - step 4058, loss: 4.080403, best loss: 2.862297 2025-01-16 01:25:24,800 - INFO - step 4059, loss: 3.886637, best 
loss: 2.862297 2025-01-16 01:25:24,951 - INFO - step 4060, loss: 3.683070, best loss: 2.862297 2025-01-16 01:25:25,101 - INFO - step 4061, loss: 4.167438, best loss: 2.862297 2025-01-16 01:25:25,250 - INFO - step 4062, loss: 4.117999, best loss: 2.862297 2025-01-16 01:25:25,401 - INFO - step 4063, loss: 4.029932, best loss: 2.862297 2025-01-16 01:25:25,551 - INFO - step 4064, loss: 3.849014, best loss: 2.862297 2025-01-16 01:25:25,701 - INFO - step 4065, loss: 4.027817, best loss: 2.862297 2025-01-16 01:25:25,851 - INFO - step 4066, loss: 3.832740, best loss: 2.862297 2025-01-16 01:25:26,002 - INFO - step 4067, loss: 3.605005, best loss: 2.862297 2025-01-16 01:25:26,152 - INFO - step 4068, loss: 3.619222, best loss: 2.862297 2025-01-16 01:25:26,302 - INFO - step 4069, loss: 3.997670, best loss: 2.862297 2025-01-16 01:25:26,453 - INFO - step 4070, loss: 4.003250, best loss: 2.862297 2025-01-16 01:25:26,603 - INFO - step 4071, loss: 4.162371, best loss: 2.862297 2025-01-16 01:25:26,753 - INFO - step 4072, loss: 4.072200, best loss: 2.862297 2025-01-16 01:25:26,903 - INFO - step 4073, loss: 3.681572, best loss: 2.862297 2025-01-16 01:25:27,053 - INFO - step 4074, loss: 4.130746, best loss: 2.862297 2025-01-16 01:25:27,203 - INFO - step 4075, loss: 3.703823, best loss: 2.862297 2025-01-16 01:25:27,353 - INFO - step 4076, loss: 4.058766, best loss: 2.862297 2025-01-16 01:25:27,503 - INFO - step 4077, loss: 3.693918, best loss: 2.862297 2025-01-16 01:25:27,653 - INFO - step 4078, loss: 3.816720, best loss: 2.862297 2025-01-16 01:25:27,803 - INFO - step 4079, loss: 3.832206, best loss: 2.862297 2025-01-16 01:25:27,953 - INFO - step 4080, loss: 3.846812, best loss: 2.862297 2025-01-16 01:25:28,103 - INFO - step 4081, loss: 3.675766, best loss: 2.862297 2025-01-16 01:25:28,254 - INFO - step 4082, loss: 3.874609, best loss: 2.862297 2025-01-16 01:25:28,404 - INFO - step 4083, loss: 3.419458, best loss: 2.862297 2025-01-16 01:25:28,554 - INFO - step 4084, loss: 3.105223, best 
loss: 2.862297 2025-01-16 01:25:28,704 - INFO - step 4085, loss: 3.374844, best loss: 2.862297 2025-01-16 01:25:28,854 - INFO - step 4086, loss: 3.593726, best loss: 2.862297 2025-01-16 01:25:29,004 - INFO - step 4087, loss: 4.228742, best loss: 2.862297 2025-01-16 01:25:29,154 - INFO - step 4088, loss: 3.789120, best loss: 2.862297 2025-01-16 01:25:29,304 - INFO - step 4089, loss: 3.538476, best loss: 2.862297 2025-01-16 01:25:29,455 - INFO - step 4090, loss: 4.020114, best loss: 2.862297 2025-01-16 01:25:29,605 - INFO - step 4091, loss: 3.724466, best loss: 2.862297 2025-01-16 01:25:29,755 - INFO - step 4092, loss: 4.331324, best loss: 2.862297 2025-01-16 01:25:29,905 - INFO - step 4093, loss: 3.573590, best loss: 2.862297 2025-01-16 01:25:30,056 - INFO - step 4094, loss: 3.894168, best loss: 2.862297 2025-01-16 01:25:30,206 - INFO - step 4095, loss: 3.654279, best loss: 2.862297 2025-01-16 01:25:30,357 - INFO - step 4096, loss: 4.171013, best loss: 2.862297 2025-01-16 01:25:30,507 - INFO - step 4097, loss: 3.821797, best loss: 2.862297 2025-01-16 01:25:30,657 - INFO - step 4098, loss: 3.631576, best loss: 2.862297 2025-01-16 01:25:30,807 - INFO - step 4099, loss: 4.060742, best loss: 2.862297 2025-01-16 01:25:30,957 - INFO - step 4100, loss: 3.783252, best loss: 2.862297 2025-01-16 01:25:31,107 - INFO - step 4101, loss: 3.578845, best loss: 2.862297 2025-01-16 01:25:31,257 - INFO - step 4102, loss: 4.302065, best loss: 2.862297 2025-01-16 01:25:31,407 - INFO - step 4103, loss: 3.821378, best loss: 2.862297 2025-01-16 01:25:31,558 - INFO - step 4104, loss: 3.368078, best loss: 2.862297 2025-01-16 01:25:31,707 - INFO - step 4105, loss: 3.678156, best loss: 2.862297 2025-01-16 01:25:31,857 - INFO - step 4106, loss: 3.986781, best loss: 2.862297 2025-01-16 01:25:32,008 - INFO - step 4107, loss: 3.886044, best loss: 2.862297 2025-01-16 01:25:32,158 - INFO - step 4108, loss: 3.719613, best loss: 2.862297 2025-01-16 01:25:32,308 - INFO - step 4109, loss: 3.625547, best 
loss: 2.862297 2025-01-16 01:25:32,458 - INFO - step 4110, loss: 4.199812, best loss: 2.862297 2025-01-16 01:25:32,608 - INFO - step 4111, loss: 3.977102, best loss: 2.862297 2025-01-16 01:25:32,758 - INFO - step 4112, loss: 3.897712, best loss: 2.862297 2025-01-16 01:25:32,908 - INFO - step 4113, loss: 3.542690, best loss: 2.862297 2025-01-16 01:25:33,059 - INFO - step 4114, loss: 3.851887, best loss: 2.862297 2025-01-16 01:25:33,209 - INFO - step 4115, loss: 3.747808, best loss: 2.862297 2025-01-16 01:25:33,359 - INFO - step 4116, loss: 3.336480, best loss: 2.862297 2025-01-16 01:25:33,509 - INFO - step 4117, loss: 3.813926, best loss: 2.862297 2025-01-16 01:25:33,659 - INFO - step 4118, loss: 3.470795, best loss: 2.862297 2025-01-16 01:25:33,810 - INFO - step 4119, loss: 3.854395, best loss: 2.862297 2025-01-16 01:25:33,959 - INFO - step 4120, loss: 3.594429, best loss: 2.862297 2025-01-16 01:25:34,110 - INFO - step 4121, loss: 3.752791, best loss: 2.862297 2025-01-16 01:25:34,260 - INFO - step 4122, loss: 3.592988, best loss: 2.862297 2025-01-16 01:25:34,410 - INFO - step 4123, loss: 3.763356, best loss: 2.862297 2025-01-16 01:25:34,560 - INFO - step 4124, loss: 4.090600, best loss: 2.862297 2025-01-16 01:25:34,710 - INFO - step 4125, loss: 3.873066, best loss: 2.862297 2025-01-16 01:25:34,860 - INFO - step 4126, loss: 3.999577, best loss: 2.862297 2025-01-16 01:25:35,010 - INFO - step 4127, loss: 3.527202, best loss: 2.862297 2025-01-16 01:25:35,160 - INFO - step 4128, loss: 3.917963, best loss: 2.862297 2025-01-16 01:25:35,310 - INFO - step 4129, loss: 3.908357, best loss: 2.862297 2025-01-16 01:25:35,460 - INFO - step 4130, loss: 3.586198, best loss: 2.862297 2025-01-16 01:25:35,611 - INFO - step 4131, loss: 3.379079, best loss: 2.862297 2025-01-16 01:25:35,761 - INFO - step 4132, loss: 3.416521, best loss: 2.862297 2025-01-16 01:25:35,911 - INFO - step 4133, loss: 3.557086, best loss: 2.862297 2025-01-16 01:25:36,061 - INFO - step 4134, loss: 3.712661, best 
loss: 2.862297 2025-01-16 01:25:36,212 - INFO - step 4135, loss: 4.015768, best loss: 2.862297 2025-01-16 01:25:36,362 - INFO - step 4136, loss: 4.206882, best loss: 2.862297 2025-01-16 01:25:36,512 - INFO - step 4137, loss: 4.000655, best loss: 2.862297 2025-01-16 01:25:36,662 - INFO - step 4138, loss: 4.119845, best loss: 2.862297 2025-01-16 01:25:36,812 - INFO - step 4139, loss: 3.964381, best loss: 2.862297 2025-01-16 01:25:36,962 - INFO - step 4140, loss: 3.970246, best loss: 2.862297 2025-01-16 01:25:37,113 - INFO - step 4141, loss: 3.357446, best loss: 2.862297 2025-01-16 01:25:37,263 - INFO - step 4142, loss: 3.965347, best loss: 2.862297 2025-01-16 01:25:37,413 - INFO - step 4143, loss: 4.177313, best loss: 2.862297 2025-01-16 01:25:37,563 - INFO - step 4144, loss: 3.838452, best loss: 2.862297 2025-01-16 01:25:37,714 - INFO - step 4145, loss: 3.914247, best loss: 2.862297 2025-01-16 01:25:37,864 - INFO - step 4146, loss: 3.955124, best loss: 2.862297 2025-01-16 01:25:38,014 - INFO - step 4147, loss: 3.542886, best loss: 2.862297 2025-01-16 01:25:41,489 - INFO - step 4148, loss: 2.749489, best loss: 2.749489 2025-01-16 01:25:41,651 - INFO - step 4149, loss: 3.786740, best loss: 2.749489 2025-01-16 01:25:41,802 - INFO - step 4150, loss: 3.961241, best loss: 2.749489 2025-01-16 01:25:41,953 - INFO - step 4151, loss: 3.942385, best loss: 2.749489 2025-01-16 01:25:42,103 - INFO - step 4152, loss: 3.943730, best loss: 2.749489 2025-01-16 01:25:42,253 - INFO - step 4153, loss: 3.555988, best loss: 2.749489 2025-01-16 01:25:42,402 - INFO - step 4154, loss: 3.789003, best loss: 2.749489 2025-01-16 01:25:42,553 - INFO - step 4155, loss: 3.847007, best loss: 2.749489 2025-01-16 01:25:42,703 - INFO - step 4156, loss: 3.612565, best loss: 2.749489 2025-01-16 01:25:42,853 - INFO - step 4157, loss: 3.809402, best loss: 2.749489 2025-01-16 01:25:43,003 - INFO - step 4158, loss: 3.665127, best loss: 2.749489 2025-01-16 01:25:43,153 - INFO - step 4159, loss: 3.522966, best 
loss: 2.749489 2025-01-16 01:25:43,303 - INFO - step 4160, loss: 3.887512, best loss: 2.749489 2025-01-16 01:25:43,453 - INFO - step 4161, loss: 3.477662, best loss: 2.749489 2025-01-16 01:25:43,603 - INFO - step 4162, loss: 3.754333, best loss: 2.749489 2025-01-16 01:25:43,754 - INFO - step 4163, loss: 4.075269, best loss: 2.749489 2025-01-16 01:25:43,904 - INFO - step 4164, loss: 3.694911, best loss: 2.749489 2025-01-16 01:25:44,054 - INFO - step 4165, loss: 3.265526, best loss: 2.749489 2025-01-16 01:25:44,204 - INFO - step 4166, loss: 3.832593, best loss: 2.749489 2025-01-16 01:25:44,355 - INFO - step 4167, loss: 3.965368, best loss: 2.749489 2025-01-16 01:25:44,505 - INFO - step 4168, loss: 4.061496, best loss: 2.749489 2025-01-16 01:25:44,655 - INFO - step 4169, loss: 3.888242, best loss: 2.749489 2025-01-16 01:25:44,805 - INFO - step 4170, loss: 3.969795, best loss: 2.749489 2025-01-16 01:25:44,956 - INFO - step 4171, loss: 3.796111, best loss: 2.749489 2025-01-16 01:25:45,106 - INFO - step 4172, loss: 4.100374, best loss: 2.749489 2025-01-16 01:25:45,256 - INFO - step 4173, loss: 3.647425, best loss: 2.749489 2025-01-16 01:25:45,406 - INFO - step 4174, loss: 3.760511, best loss: 2.749489 2025-01-16 01:25:45,556 - INFO - step 4175, loss: 3.934244, best loss: 2.749489 2025-01-16 01:25:45,706 - INFO - step 4176, loss: 3.983355, best loss: 2.749489 2025-01-16 01:25:45,856 - INFO - step 4177, loss: 3.907456, best loss: 2.749489 2025-01-16 01:25:46,007 - INFO - step 4178, loss: 3.672558, best loss: 2.749489 2025-01-16 01:25:46,157 - INFO - step 4179, loss: 3.719522, best loss: 2.749489 2025-01-16 01:25:46,307 - INFO - step 4180, loss: 3.920203, best loss: 2.749489 2025-01-16 01:25:46,457 - INFO - step 4181, loss: 4.173411, best loss: 2.749489 2025-01-16 01:25:46,608 - INFO - step 4182, loss: 3.895333, best loss: 2.749489 2025-01-16 01:25:46,758 - INFO - step 4183, loss: 4.185467, best loss: 2.749489 2025-01-16 01:25:46,908 - INFO - step 4184, loss: 4.201129, best 
loss: 2.749489 2025-01-16 01:25:47,059 - INFO - step 4185, loss: 4.112042, best loss: 2.749489 2025-01-16 01:25:47,209 - INFO - step 4186, loss: 4.349805, best loss: 2.749489 2025-01-16 01:25:47,359 - INFO - step 4187, loss: 4.201950, best loss: 2.749489 2025-01-16 01:25:47,509 - INFO - step 4188, loss: 3.656975, best loss: 2.749489 2025-01-16 01:25:47,659 - INFO - step 4189, loss: 4.129270, best loss: 2.749489 2025-01-16 01:25:47,809 - INFO - step 4190, loss: 4.078054, best loss: 2.749489 2025-01-16 01:25:47,959 - INFO - step 4191, loss: 4.154912, best loss: 2.749489 2025-01-16 01:25:48,109 - INFO - step 4192, loss: 3.704550, best loss: 2.749489 2025-01-16 01:25:48,260 - INFO - step 4193, loss: 3.936642, best loss: 2.749489 2025-01-16 01:25:48,410 - INFO - step 4194, loss: 3.918778, best loss: 2.749489 2025-01-16 01:25:48,560 - INFO - step 4195, loss: 3.807360, best loss: 2.749489 2025-01-16 01:25:48,710 - INFO - step 4196, loss: 4.093111, best loss: 2.749489 2025-01-16 01:25:48,860 - INFO - step 4197, loss: 3.590298, best loss: 2.749489 2025-01-16 01:25:49,011 - INFO - step 4198, loss: 3.628089, best loss: 2.749489 2025-01-16 01:25:49,161 - INFO - step 4199, loss: 4.117082, best loss: 2.749489 2025-01-16 01:25:49,311 - INFO - step 4200, loss: 3.965201, best loss: 2.749489 2025-01-16 01:25:49,461 - INFO - step 4201, loss: 4.025457, best loss: 2.749489 2025-01-16 01:25:49,612 - INFO - step 4202, loss: 3.900649, best loss: 2.749489 2025-01-16 01:25:49,763 - INFO - step 4203, loss: 4.280877, best loss: 2.749489 2025-01-16 01:25:49,913 - INFO - step 4204, loss: 4.135184, best loss: 2.749489 2025-01-16 01:25:50,063 - INFO - step 4205, loss: 3.716476, best loss: 2.749489 2025-01-16 01:25:50,213 - INFO - step 4206, loss: 3.667692, best loss: 2.749489 2025-01-16 01:25:50,363 - INFO - step 4207, loss: 4.099979, best loss: 2.749489 2025-01-16 01:25:50,513 - INFO - step 4208, loss: 3.704119, best loss: 2.749489 2025-01-16 01:25:50,663 - INFO - step 4209, loss: 3.532760, best 
loss: 2.749489 2025-01-16 01:25:50,814 - INFO - step 4210, loss: 3.985318, best loss: 2.749489 2025-01-16 01:25:50,964 - INFO - step 4211, loss: 3.955568, best loss: 2.749489 2025-01-16 01:25:51,114 - INFO - step 4212, loss: 3.907073, best loss: 2.749489 2025-01-16 01:25:51,264 - INFO - step 4213, loss: 3.490451, best loss: 2.749489 2025-01-16 01:25:51,414 - INFO - step 4214, loss: 3.279541, best loss: 2.749489 2025-01-16 01:25:51,564 - INFO - step 4215, loss: 3.292260, best loss: 2.749489 2025-01-16 01:25:51,714 - INFO - step 4216, loss: 3.386210, best loss: 2.749489 2025-01-16 01:25:51,864 - INFO - step 4217, loss: 3.516036, best loss: 2.749489 2025-01-16 01:25:52,014 - INFO - step 4218, loss: 3.855265, best loss: 2.749489 2025-01-16 01:25:52,165 - INFO - step 4219, loss: 3.628444, best loss: 2.749489 2025-01-16 01:25:52,315 - INFO - step 4220, loss: 3.595647, best loss: 2.749489 2025-01-16 01:25:52,465 - INFO - step 4221, loss: 3.796838, best loss: 2.749489 2025-01-16 01:25:52,615 - INFO - step 4222, loss: 3.699390, best loss: 2.749489 2025-01-16 01:25:52,765 - INFO - step 4223, loss: 3.686511, best loss: 2.749489 2025-01-16 01:25:52,915 - INFO - step 4224, loss: 3.867475, best loss: 2.749489 2025-01-16 01:25:53,065 - INFO - step 4225, loss: 3.986942, best loss: 2.749489 2025-01-16 01:25:53,215 - INFO - step 4226, loss: 3.457990, best loss: 2.749489 2025-01-16 01:25:53,365 - INFO - step 4227, loss: 3.434497, best loss: 2.749489 2025-01-16 01:25:53,515 - INFO - step 4228, loss: 3.741126, best loss: 2.749489 2025-01-16 01:25:53,666 - INFO - step 4229, loss: 3.883524, best loss: 2.749489 2025-01-16 01:25:53,816 - INFO - step 4230, loss: 3.367823, best loss: 2.749489 2025-01-16 01:25:53,966 - INFO - step 4231, loss: 3.383302, best loss: 2.749489 2025-01-16 01:25:54,116 - INFO - step 4232, loss: 3.674592, best loss: 2.749489 2025-01-16 01:25:54,266 - INFO - step 4233, loss: 3.667725, best loss: 2.749489 2025-01-16 01:25:54,416 - INFO - step 4234, loss: 3.383089, best 
loss: 2.749489 2025-01-16 01:25:54,567 - INFO - step 4235, loss: 3.852125, best loss: 2.749489 2025-01-16 01:25:54,717 - INFO - step 4236, loss: 3.534549, best loss: 2.749489 2025-01-16 01:25:54,867 - INFO - step 4237, loss: 3.463133, best loss: 2.749489 2025-01-16 01:25:55,017 - INFO - step 4238, loss: 3.420337, best loss: 2.749489 2025-01-16 01:25:55,167 - INFO - step 4239, loss: 3.678061, best loss: 2.749489 2025-01-16 01:25:55,317 - INFO - step 4240, loss: 3.278752, best loss: 2.749489 2025-01-16 01:25:55,467 - INFO - step 4241, loss: 3.498667, best loss: 2.749489 2025-01-16 01:25:55,618 - INFO - step 4242, loss: 3.321330, best loss: 2.749489 2025-01-16 01:25:55,768 - INFO - step 4243, loss: 3.790518, best loss: 2.749489 2025-01-16 01:25:55,918 - INFO - step 4244, loss: 4.127449, best loss: 2.749489 2025-01-16 01:25:56,068 - INFO - step 4245, loss: 4.368065, best loss: 2.749489 2025-01-16 01:25:56,219 - INFO - step 4246, loss: 3.862068, best loss: 2.749489 2025-01-16 01:25:56,369 - INFO - step 4247, loss: 4.045083, best loss: 2.749489 2025-01-16 01:25:56,519 - INFO - step 4248, loss: 3.862574, best loss: 2.749489 2025-01-16 01:25:56,670 - INFO - step 4249, loss: 3.824003, best loss: 2.749489 2025-01-16 01:25:56,820 - INFO - step 4250, loss: 3.630243, best loss: 2.749489 2025-01-16 01:25:56,970 - INFO - step 4251, loss: 3.917173, best loss: 2.749489 2025-01-16 01:25:57,120 - INFO - step 4252, loss: 3.549527, best loss: 2.749489 2025-01-16 01:25:57,270 - INFO - step 4253, loss: 3.304691, best loss: 2.749489 2025-01-16 01:25:57,420 - INFO - step 4254, loss: 3.636805, best loss: 2.749489 2025-01-16 01:25:57,571 - INFO - step 4255, loss: 3.585687, best loss: 2.749489 2025-01-16 01:25:57,721 - INFO - step 4256, loss: 3.798795, best loss: 2.749489 2025-01-16 01:25:57,871 - INFO - step 4257, loss: 2.891657, best loss: 2.749489 2025-01-16 01:25:58,021 - INFO - step 4258, loss: 3.545652, best loss: 2.749489 2025-01-16 01:25:58,171 - INFO - step 4259, loss: 3.791545, best 
loss: 2.749489 2025-01-16 01:25:58,322 - INFO - step 4260, loss: 3.812650, best loss: 2.749489 2025-01-16 01:25:58,472 - INFO - step 4261, loss: 3.563323, best loss: 2.749489 2025-01-16 01:25:58,622 - INFO - step 4262, loss: 3.575264, best loss: 2.749489 2025-01-16 01:25:58,772 - INFO - step 4263, loss: 3.636823, best loss: 2.749489 2025-01-16 01:25:58,922 - INFO - step 4264, loss: 3.353485, best loss: 2.749489 2025-01-16 01:25:59,072 - INFO - step 4265, loss: 3.704967, best loss: 2.749489 2025-01-16 01:25:59,222 - INFO - step 4266, loss: 3.569767, best loss: 2.749489 2025-01-16 01:25:59,372 - INFO - step 4267, loss: 3.686168, best loss: 2.749489 2025-01-16 01:25:59,522 - INFO - step 4268, loss: 3.314159, best loss: 2.749489 2025-01-16 01:25:59,673 - INFO - step 4269, loss: 3.429406, best loss: 2.749489 2025-01-16 01:25:59,823 - INFO - step 4270, loss: 3.370323, best loss: 2.749489 2025-01-16 01:25:59,972 - INFO - step 4271, loss: 3.293917, best loss: 2.749489 2025-01-16 01:26:00,123 - INFO - step 4272, loss: 3.379754, best loss: 2.749489 2025-01-16 01:26:00,273 - INFO - step 4273, loss: 3.375053, best loss: 2.749489 2025-01-16 01:26:00,423 - INFO - step 4274, loss: 3.328655, best loss: 2.749489 2025-01-16 01:26:00,573 - INFO - step 4275, loss: 3.096836, best loss: 2.749489 2025-01-16 01:26:00,723 - INFO - step 4276, loss: 3.038821, best loss: 2.749489 2025-01-16 01:26:00,873 - INFO - step 4277, loss: 2.842447, best loss: 2.749489 2025-01-16 01:26:01,023 - INFO - step 4278, loss: 3.592859, best loss: 2.749489 2025-01-16 01:26:01,173 - INFO - step 4279, loss: 3.875648, best loss: 2.749489 2025-01-16 01:26:01,323 - INFO - step 4280, loss: 3.815034, best loss: 2.749489 2025-01-16 01:26:01,474 - INFO - step 4281, loss: 4.008559, best loss: 2.749489 2025-01-16 01:26:01,624 - INFO - step 4282, loss: 4.104756, best loss: 2.749489 2025-01-16 01:26:01,774 - INFO - step 4283, loss: 3.695743, best loss: 2.749489 2025-01-16 01:26:01,924 - INFO - step 4284, loss: 3.860230, best 
loss: 2.749489 2025-01-16 01:26:02,074 - INFO - step 4285, loss: 4.013488, best loss: 2.749489 2025-01-16 01:26:02,225 - INFO - step 4286, loss: 3.647027, best loss: 2.749489 2025-01-16 01:26:02,375 - INFO - step 4287, loss: 3.146519, best loss: 2.749489 2025-01-16 01:26:02,525 - INFO - step 4288, loss: 3.569203, best loss: 2.749489 2025-01-16 01:26:02,675 - INFO - step 4289, loss: 3.369127, best loss: 2.749489 2025-01-16 01:26:02,825 - INFO - step 4290, loss: 3.992298, best loss: 2.749489 2025-01-16 01:26:02,976 - INFO - step 4291, loss: 3.780792, best loss: 2.749489 2025-01-16 01:26:03,126 - INFO - step 4292, loss: 3.853098, best loss: 2.749489 2025-01-16 01:26:03,275 - INFO - step 4293, loss: 3.696195, best loss: 2.749489 2025-01-16 01:26:03,426 - INFO - step 4294, loss: 3.954488, best loss: 2.749489 2025-01-16 01:26:03,576 - INFO - step 4295, loss: 3.424424, best loss: 2.749489 2025-01-16 01:26:03,726 - INFO - step 4296, loss: 3.966463, best loss: 2.749489 2025-01-16 01:26:03,876 - INFO - step 4297, loss: 3.738121, best loss: 2.749489 2025-01-16 01:26:04,026 - INFO - step 4298, loss: 3.975042, best loss: 2.749489 2025-01-16 01:26:04,177 - INFO - step 4299, loss: 3.768500, best loss: 2.749489 2025-01-16 01:26:04,327 - INFO - step 4300, loss: 3.669958, best loss: 2.749489 2025-01-16 01:26:04,477 - INFO - step 4301, loss: 4.158088, best loss: 2.749489 2025-01-16 01:26:04,628 - INFO - step 4302, loss: 3.472375, best loss: 2.749489 2025-01-16 01:26:04,778 - INFO - step 4303, loss: 3.818438, best loss: 2.749489 2025-01-16 01:26:04,928 - INFO - step 4304, loss: 3.879171, best loss: 2.749489 2025-01-16 01:26:05,078 - INFO - step 4305, loss: 3.902975, best loss: 2.749489 2025-01-16 01:26:05,228 - INFO - step 4306, loss: 3.838550, best loss: 2.749489 2025-01-16 01:26:05,379 - INFO - step 4307, loss: 3.609859, best loss: 2.749489 2025-01-16 01:26:05,529 - INFO - step 4308, loss: 3.613549, best loss: 2.749489 2025-01-16 01:26:05,679 - INFO - step 4309, loss: 3.724940, best 
loss: 2.749489 2025-01-16 01:26:05,829 - INFO - step 4310, loss: 3.207138, best loss: 2.749489 2025-01-16 01:26:05,979 - INFO - step 4311, loss: 4.059733, best loss: 2.749489 2025-01-16 01:26:06,130 - INFO - step 4312, loss: 3.191065, best loss: 2.749489 2025-01-16 01:26:06,280 - INFO - step 4313, loss: 3.179739, best loss: 2.749489 2025-01-16 01:26:06,430 - INFO - step 4314, loss: 3.679142, best loss: 2.749489 2025-01-16 01:26:06,580 - INFO - step 4315, loss: 3.668246, best loss: 2.749489 2025-01-16 01:26:06,731 - INFO - step 4316, loss: 3.637190, best loss: 2.749489 2025-01-16 01:26:06,881 - INFO - step 4317, loss: 3.416675, best loss: 2.749489 2025-01-16 01:26:07,031 - INFO - step 4318, loss: 3.571974, best loss: 2.749489 2025-01-16 01:26:07,181 - INFO - step 4319, loss: 3.622626, best loss: 2.749489 2025-01-16 01:26:07,331 - INFO - step 4320, loss: 3.455667, best loss: 2.749489 2025-01-16 01:26:07,481 - INFO - step 4321, loss: 3.338004, best loss: 2.749489 2025-01-16 01:26:07,631 - INFO - step 4322, loss: 3.743540, best loss: 2.749489 2025-01-16 01:26:07,781 - INFO - step 4323, loss: 3.613981, best loss: 2.749489 2025-01-16 01:26:07,932 - INFO - step 4324, loss: 3.599527, best loss: 2.749489 2025-01-16 01:26:08,082 - INFO - step 4325, loss: 3.323225, best loss: 2.749489 2025-01-16 01:26:08,232 - INFO - step 4326, loss: 3.575858, best loss: 2.749489 2025-01-16 01:26:08,382 - INFO - step 4327, loss: 3.947843, best loss: 2.749489 2025-01-16 01:26:08,532 - INFO - step 4328, loss: 3.498835, best loss: 2.749489 2025-01-16 01:26:08,682 - INFO - step 4329, loss: 3.848345, best loss: 2.749489 2025-01-16 01:26:08,833 - INFO - step 4330, loss: 3.882322, best loss: 2.749489 2025-01-16 01:26:08,982 - INFO - step 4331, loss: 3.809613, best loss: 2.749489 2025-01-16 01:26:09,132 - INFO - step 4332, loss: 3.841565, best loss: 2.749489 2025-01-16 01:26:09,282 - INFO - step 4333, loss: 3.745720, best loss: 2.749489 2025-01-16 01:26:09,432 - INFO - step 4334, loss: 3.737211, best 
loss: 2.749489 2025-01-16 01:26:09,582 - INFO - step 4335, loss: 3.566223, best loss: 2.749489 2025-01-16 01:26:09,732 - INFO - step 4336, loss: 4.312076, best loss: 2.749489 2025-01-16 01:26:09,883 - INFO - step 4337, loss: 3.813152, best loss: 2.749489 2025-01-16 01:26:10,033 - INFO - step 4338, loss: 4.102228, best loss: 2.749489 2025-01-16 01:26:10,183 - INFO - step 4339, loss: 3.171322, best loss: 2.749489 2025-01-16 01:26:10,333 - INFO - step 4340, loss: 3.251020, best loss: 2.749489 2025-01-16 01:26:10,484 - INFO - step 4341, loss: 3.709604, best loss: 2.749489 2025-01-16 01:26:10,634 - INFO - step 4342, loss: 3.688148, best loss: 2.749489 2025-01-16 01:26:10,784 - INFO - step 4343, loss: 3.580763, best loss: 2.749489 2025-01-16 01:26:10,935 - INFO - step 4344, loss: 3.668545, best loss: 2.749489 2025-01-16 01:26:11,085 - INFO - step 4345, loss: 3.503407, best loss: 2.749489 2025-01-16 01:26:11,235 - INFO - step 4346, loss: 3.682888, best loss: 2.749489 2025-01-16 01:26:11,385 - INFO - step 4347, loss: 3.766077, best loss: 2.749489 2025-01-16 01:26:11,535 - INFO - step 4348, loss: 3.307246, best loss: 2.749489 2025-01-16 01:26:11,685 - INFO - step 4349, loss: 3.489405, best loss: 2.749489 2025-01-16 01:26:11,835 - INFO - step 4350, loss: 3.582911, best loss: 2.749489 2025-01-16 01:26:11,985 - INFO - step 4351, loss: 3.800504, best loss: 2.749489 2025-01-16 01:26:12,136 - INFO - step 4352, loss: 3.642300, best loss: 2.749489 2025-01-16 01:26:12,286 - INFO - step 4353, loss: 3.881133, best loss: 2.749489 2025-01-16 01:26:12,436 - INFO - step 4354, loss: 3.681928, best loss: 2.749489 2025-01-16 01:26:12,586 - INFO - step 4355, loss: 3.265786, best loss: 2.749489 2025-01-16 01:26:12,736 - INFO - step 4356, loss: 3.740016, best loss: 2.749489 2025-01-16 01:26:12,886 - INFO - step 4357, loss: 3.079773, best loss: 2.749489 2025-01-16 01:26:13,036 - INFO - step 4358, loss: 3.380123, best loss: 2.749489 2025-01-16 01:26:13,187 - INFO - step 4359, loss: 3.567730, best 
loss: 2.749489 2025-01-16 01:26:13,337 - INFO - step 4360, loss: 3.463728, best loss: 2.749489 2025-01-16 01:26:13,487 - INFO - step 4361, loss: 3.436764, best loss: 2.749489 2025-01-16 01:26:13,637 - INFO - step 4362, loss: 3.689117, best loss: 2.749489 2025-01-16 01:26:13,787 - INFO - step 4363, loss: 3.943672, best loss: 2.749489 2025-01-16 01:26:13,937 - INFO - step 4364, loss: 3.721967, best loss: 2.749489 2025-01-16 01:26:14,087 - INFO - step 4365, loss: 3.933642, best loss: 2.749489 2025-01-16 01:26:14,238 - INFO - step 4366, loss: 3.617088, best loss: 2.749489 2025-01-16 01:26:14,388 - INFO - step 4367, loss: 3.702513, best loss: 2.749489 2025-01-16 01:26:14,538 - INFO - step 4368, loss: 3.509355, best loss: 2.749489 2025-01-16 01:26:14,688 - INFO - step 4369, loss: 3.175445, best loss: 2.749489 2025-01-16 01:26:14,838 - INFO - step 4370, loss: 3.964172, best loss: 2.749489 2025-01-16 01:26:14,988 - INFO - step 4371, loss: 3.825532, best loss: 2.749489 2025-01-16 01:26:15,138 - INFO - step 4372, loss: 3.497128, best loss: 2.749489 2025-01-16 01:26:15,288 - INFO - step 4373, loss: 3.421438, best loss: 2.749489 2025-01-16 01:26:15,439 - INFO - step 4374, loss: 3.216916, best loss: 2.749489 2025-01-16 01:26:15,589 - INFO - step 4375, loss: 3.488487, best loss: 2.749489 2025-01-16 01:26:15,739 - INFO - step 4376, loss: 3.379749, best loss: 2.749489 2025-01-16 01:26:15,890 - INFO - step 4377, loss: 3.334430, best loss: 2.749489 2025-01-16 01:26:16,040 - INFO - step 4378, loss: 3.901541, best loss: 2.749489 2025-01-16 01:26:16,190 - INFO - step 4379, loss: 3.897768, best loss: 2.749489 2025-01-16 01:26:16,340 - INFO - step 4380, loss: 3.590056, best loss: 2.749489 2025-01-16 01:26:16,490 - INFO - step 4381, loss: 3.895539, best loss: 2.749489 2025-01-16 01:26:16,641 - INFO - step 4382, loss: 3.579434, best loss: 2.749489 2025-01-16 01:26:16,791 - INFO - step 4383, loss: 3.938481, best loss: 2.749489 2025-01-16 01:26:16,941 - INFO - step 4384, loss: 3.995199, best 
loss: 2.749489 2025-01-16 01:26:17,091 - INFO - step 4385, loss: 4.013355, best loss: 2.749489 2025-01-16 01:26:17,241 - INFO - step 4386, loss: 4.110570, best loss: 2.749489 2025-01-16 01:26:17,391 - INFO - step 4387, loss: 3.760801, best loss: 2.749489 2025-01-16 01:26:17,542 - INFO - step 4388, loss: 3.969149, best loss: 2.749489 2025-01-16 01:26:17,692 - INFO - step 4389, loss: 3.802370, best loss: 2.749489 2025-01-16 01:26:17,842 - INFO - step 4390, loss: 3.560684, best loss: 2.749489 2025-01-16 01:26:17,993 - INFO - step 4391, loss: 4.044292, best loss: 2.749489 2025-01-16 01:26:18,143 - INFO - step 4392, loss: 3.966109, best loss: 2.749489 2025-01-16 01:26:18,293 - INFO - step 4393, loss: 3.889083, best loss: 2.749489 2025-01-16 01:26:18,443 - INFO - step 4394, loss: 3.691126, best loss: 2.749489 2025-01-16 01:26:18,593 - INFO - step 4395, loss: 3.910986, best loss: 2.749489 2025-01-16 01:26:18,743 - INFO - step 4396, loss: 3.737444, best loss: 2.749489 2025-01-16 01:26:18,893 - INFO - step 4397, loss: 3.525311, best loss: 2.749489 2025-01-16 01:26:19,043 - INFO - step 4398, loss: 3.514553, best loss: 2.749489 2025-01-16 01:26:19,193 - INFO - step 4399, loss: 3.914175, best loss: 2.749489 2025-01-16 01:26:19,343 - INFO - step 4400, loss: 3.837328, best loss: 2.749489 2025-01-16 01:26:19,493 - INFO - step 4401, loss: 4.045259, best loss: 2.749489 2025-01-16 01:26:19,643 - INFO - step 4402, loss: 3.941163, best loss: 2.749489 2025-01-16 01:26:19,794 - INFO - step 4403, loss: 3.621747, best loss: 2.749489 2025-01-16 01:26:19,944 - INFO - step 4404, loss: 4.001903, best loss: 2.749489 2025-01-16 01:26:20,094 - INFO - step 4405, loss: 3.568754, best loss: 2.749489 2025-01-16 01:26:20,244 - INFO - step 4406, loss: 3.954588, best loss: 2.749489 2025-01-16 01:26:20,394 - INFO - step 4407, loss: 3.575950, best loss: 2.749489 2025-01-16 01:26:20,544 - INFO - step 4408, loss: 3.652402, best loss: 2.749489 2025-01-16 01:26:20,694 - INFO - step 4409, loss: 3.693777, best 
loss: 2.749489 2025-01-16 01:26:20,844 - INFO - step 4410, loss: 3.719649, best loss: 2.749489 2025-01-16 01:26:20,995 - INFO - step 4411, loss: 3.602880, best loss: 2.749489 2025-01-16 01:26:21,145 - INFO - step 4412, loss: 3.803727, best loss: 2.749489 2025-01-16 01:26:21,295 - INFO - step 4413, loss: 3.415326, best loss: 2.749489 2025-01-16 01:26:21,445 - INFO - step 4414, loss: 3.071178, best loss: 2.749489 2025-01-16 01:26:21,595 - INFO - step 4415, loss: 3.256139, best loss: 2.749489 2025-01-16 01:26:21,745 - INFO - step 4416, loss: 3.506654, best loss: 2.749489 2025-01-16 01:26:21,895 - INFO - step 4417, loss: 4.080431, best loss: 2.749489 2025-01-16 01:26:22,045 - INFO - step 4418, loss: 3.643491, best loss: 2.749489 2025-01-16 01:26:22,196 - INFO - step 4419, loss: 3.292343, best loss: 2.749489 2025-01-16 01:26:22,346 - INFO - step 4420, loss: 3.884473, best loss: 2.749489 2025-01-16 01:26:22,496 - INFO - step 4421, loss: 3.592412, best loss: 2.749489 2025-01-16 01:26:22,646 - INFO - step 4422, loss: 4.203413, best loss: 2.749489 2025-01-16 01:26:22,796 - INFO - step 4423, loss: 3.459311, best loss: 2.749489 2025-01-16 01:26:22,946 - INFO - step 4424, loss: 3.802180, best loss: 2.749489 2025-01-16 01:26:23,096 - INFO - step 4425, loss: 3.555802, best loss: 2.749489 2025-01-16 01:26:23,246 - INFO - step 4426, loss: 4.071519, best loss: 2.749489 2025-01-16 01:26:23,396 - INFO - step 4427, loss: 3.678280, best loss: 2.749489 2025-01-16 01:26:23,547 - INFO - step 4428, loss: 3.444231, best loss: 2.749489 2025-01-16 01:26:23,697 - INFO - step 4429, loss: 3.871047, best loss: 2.749489 2025-01-16 01:26:23,847 - INFO - step 4430, loss: 3.632384, best loss: 2.749489 2025-01-16 01:26:23,997 - INFO - step 4431, loss: 3.488479, best loss: 2.749489 2025-01-16 01:26:24,147 - INFO - step 4432, loss: 4.160422, best loss: 2.749489 2025-01-16 01:26:24,297 - INFO - step 4433, loss: 3.614909, best loss: 2.749489 2025-01-16 01:26:24,447 - INFO - step 4434, loss: 3.219561, best 
loss: 2.749489 2025-01-16 01:26:24,597 - INFO - step 4435, loss: 3.623308, best loss: 2.749489 2025-01-16 01:26:24,747 - INFO - step 4436, loss: 3.875806, best loss: 2.749489 2025-01-16 01:26:24,898 - INFO - step 4437, loss: 3.786442, best loss: 2.749489 2025-01-16 01:26:25,048 - INFO - step 4438, loss: 3.639229, best loss: 2.749489 2025-01-16 01:26:25,198 - INFO - step 4439, loss: 3.562469, best loss: 2.749489 2025-01-16 01:26:25,349 - INFO - step 4440, loss: 4.014044, best loss: 2.749489 2025-01-16 01:26:25,499 - INFO - step 4441, loss: 3.858418, best loss: 2.749489 2025-01-16 01:26:25,649 - INFO - step 4442, loss: 3.762335, best loss: 2.749489 2025-01-16 01:26:25,799 - INFO - step 4443, loss: 3.542531, best loss: 2.749489 2025-01-16 01:26:25,949 - INFO - step 4444, loss: 3.737303, best loss: 2.749489 2025-01-16 01:26:26,099 - INFO - step 4445, loss: 3.713962, best loss: 2.749489 2025-01-16 01:26:26,249 - INFO - step 4446, loss: 3.299903, best loss: 2.749489 2025-01-16 01:26:26,400 - INFO - step 4447, loss: 3.702522, best loss: 2.749489 2025-01-16 01:26:26,550 - INFO - step 4448, loss: 3.395906, best loss: 2.749489 2025-01-16 01:26:26,700 - INFO - step 4449, loss: 3.687240, best loss: 2.749489 2025-01-16 01:26:26,850 - INFO - step 4450, loss: 3.454686, best loss: 2.749489 2025-01-16 01:26:27,000 - INFO - step 4451, loss: 3.605434, best loss: 2.749489 2025-01-16 01:26:27,150 - INFO - step 4452, loss: 3.468631, best loss: 2.749489 2025-01-16 01:26:27,300 - INFO - step 4453, loss: 3.647118, best loss: 2.749489 2025-01-16 01:26:27,450 - INFO - step 4454, loss: 3.940659, best loss: 2.749489 2025-01-16 01:26:27,600 - INFO - step 4455, loss: 3.803170, best loss: 2.749489 2025-01-16 01:26:27,751 - INFO - step 4456, loss: 3.953537, best loss: 2.749489 2025-01-16 01:26:27,901 - INFO - step 4457, loss: 3.441366, best loss: 2.749489 2025-01-16 01:26:28,051 - INFO - step 4458, loss: 3.738211, best loss: 2.749489 2025-01-16 01:26:28,202 - INFO - step 4459, loss: 3.723377, best 
loss: 2.749489 2025-01-16 01:26:28,352 - INFO - step 4460, loss: 3.469144, best loss: 2.749489 2025-01-16 01:26:28,502 - INFO - step 4461, loss: 3.289442, best loss: 2.749489 2025-01-16 01:26:28,652 - INFO - step 4462, loss: 3.364825, best loss: 2.749489 2025-01-16 01:26:28,802 - INFO - step 4463, loss: 3.539646, best loss: 2.749489 2025-01-16 01:26:28,953 - INFO - step 4464, loss: 3.601220, best loss: 2.749489 2025-01-16 01:26:29,103 - INFO - step 4465, loss: 3.848895, best loss: 2.749489 2025-01-16 01:26:29,253 - INFO - step 4466, loss: 4.070404, best loss: 2.749489 2025-01-16 01:26:29,403 - INFO - step 4467, loss: 3.846827, best loss: 2.749489 2025-01-16 01:26:29,554 - INFO - step 4468, loss: 3.961501, best loss: 2.749489 2025-01-16 01:26:29,704 - INFO - step 4469, loss: 3.810619, best loss: 2.749489 2025-01-16 01:26:29,854 - INFO - step 4470, loss: 3.830709, best loss: 2.749489 2025-01-16 01:26:30,004 - INFO - step 4471, loss: 3.303113, best loss: 2.749489 2025-01-16 01:26:30,155 - INFO - step 4472, loss: 3.875207, best loss: 2.749489 2025-01-16 01:26:30,305 - INFO - step 4473, loss: 4.171849, best loss: 2.749489 2025-01-16 01:26:30,455 - INFO - step 4474, loss: 3.823026, best loss: 2.749489 2025-01-16 01:26:30,605 - INFO - step 4475, loss: 3.806205, best loss: 2.749489 2025-01-16 01:26:30,755 - INFO - step 4476, loss: 3.865597, best loss: 2.749489 2025-01-16 01:26:30,905 - INFO - step 4477, loss: 3.450439, best loss: 2.749489 2025-01-16 01:26:34,474 - INFO - step 4478, loss: 2.607254, best loss: 2.607254 2025-01-16 01:26:34,637 - INFO - step 4479, loss: 3.643355, best loss: 2.607254 2025-01-16 01:26:34,789 - INFO - step 4480, loss: 3.842466, best loss: 2.607254 2025-01-16 01:26:34,940 - INFO - step 4481, loss: 3.839468, best loss: 2.607254 2025-01-16 01:26:35,090 - INFO - step 4482, loss: 3.877610, best loss: 2.607254 2025-01-16 01:26:35,240 - INFO - step 4483, loss: 3.568617, best loss: 2.607254 2025-01-16 01:26:35,390 - INFO - step 4484, loss: 3.733232, best 
loss: 2.607254 2025-01-16 01:26:35,540 - INFO - step 4485, loss: 3.734609, best loss: 2.607254 2025-01-16 01:26:35,691 - INFO - step 4486, loss: 3.529074, best loss: 2.607254 2025-01-16 01:26:35,841 - INFO - step 4487, loss: 3.634988, best loss: 2.607254 2025-01-16 01:26:35,991 - INFO - step 4488, loss: 3.550723, best loss: 2.607254 2025-01-16 01:26:36,142 - INFO - step 4489, loss: 3.405854, best loss: 2.607254 2025-01-16 01:26:36,292 - INFO - step 4490, loss: 3.736431, best loss: 2.607254 2025-01-16 01:26:36,442 - INFO - step 4491, loss: 3.381394, best loss: 2.607254 2025-01-16 01:26:36,592 - INFO - step 4492, loss: 3.699431, best loss: 2.607254 2025-01-16 01:26:36,743 - INFO - step 4493, loss: 3.959653, best loss: 2.607254 2025-01-16 01:26:36,893 - INFO - step 4494, loss: 3.533122, best loss: 2.607254 2025-01-16 01:26:37,043 - INFO - step 4495, loss: 3.165866, best loss: 2.607254 2025-01-16 01:26:37,193 - INFO - step 4496, loss: 3.726714, best loss: 2.607254 2025-01-16 01:26:37,343 - INFO - step 4497, loss: 3.850243, best loss: 2.607254 2025-01-16 01:26:37,493 - INFO - step 4498, loss: 3.885660, best loss: 2.607254 2025-01-16 01:26:37,644 - INFO - step 4499, loss: 3.804074, best loss: 2.607254 2025-01-16 01:26:37,794 - INFO - step 4500, loss: 3.869536, best loss: 2.607254 2025-01-16 01:26:37,944 - INFO - step 4501, loss: 3.724131, best loss: 2.607254 2025-01-16 01:26:38,095 - INFO - step 4502, loss: 4.012302, best loss: 2.607254 2025-01-16 01:26:38,245 - INFO - step 4503, loss: 3.508979, best loss: 2.607254 2025-01-16 01:26:38,395 - INFO - step 4504, loss: 3.669342, best loss: 2.607254 2025-01-16 01:26:38,545 - INFO - step 4505, loss: 3.814385, best loss: 2.607254 2025-01-16 01:26:38,696 - INFO - step 4506, loss: 3.894978, best loss: 2.607254 2025-01-16 01:26:38,846 - INFO - step 4507, loss: 3.838172, best loss: 2.607254 2025-01-16 01:26:38,997 - INFO - step 4508, loss: 3.609010, best loss: 2.607254 2025-01-16 01:26:39,147 - INFO - step 4509, loss: 3.645720, best 
loss: 2.607254 2025-01-16 01:26:39,298 - INFO - step 4510, loss: 3.863825, best loss: 2.607254 2025-01-16 01:26:39,448 - INFO - step 4511, loss: 4.081076, best loss: 2.607254 2025-01-16 01:26:39,599 - INFO - step 4512, loss: 3.814835, best loss: 2.607254 2025-01-16 01:26:39,750 - INFO - step 4513, loss: 4.083130, best loss: 2.607254 2025-01-16 01:26:39,900 - INFO - step 4514, loss: 4.127272, best loss: 2.607254 2025-01-16 01:26:40,050 - INFO - step 4515, loss: 3.995859, best loss: 2.607254 2025-01-16 01:26:40,201 - INFO - step 4516, loss: 4.220652, best loss: 2.607254 2025-01-16 01:26:40,351 - INFO - step 4517, loss: 4.068160, best loss: 2.607254 2025-01-16 01:26:40,501 - INFO - step 4518, loss: 3.590574, best loss: 2.607254 2025-01-16 01:26:40,652 - INFO - step 4519, loss: 4.049139, best loss: 2.607254 2025-01-16 01:26:40,802 - INFO - step 4520, loss: 3.976647, best loss: 2.607254 2025-01-16 01:26:40,952 - INFO - step 4521, loss: 4.087951, best loss: 2.607254 2025-01-16 01:26:41,103 - INFO - step 4522, loss: 3.668799, best loss: 2.607254 2025-01-16 01:26:41,253 - INFO - step 4523, loss: 3.881894, best loss: 2.607254 2025-01-16 01:26:41,403 - INFO - step 4524, loss: 3.824967, best loss: 2.607254 2025-01-16 01:26:41,553 - INFO - step 4525, loss: 3.729307, best loss: 2.607254 2025-01-16 01:26:41,703 - INFO - step 4526, loss: 3.988793, best loss: 2.607254 2025-01-16 01:26:41,854 - INFO - step 4527, loss: 3.535287, best loss: 2.607254 2025-01-16 01:26:42,004 - INFO - step 4528, loss: 3.519217, best loss: 2.607254 2025-01-16 01:26:42,154 - INFO - step 4529, loss: 3.968604, best loss: 2.607254 2025-01-16 01:26:42,304 - INFO - step 4530, loss: 3.899589, best loss: 2.607254 2025-01-16 01:26:42,455 - INFO - step 4531, loss: 3.980834, best loss: 2.607254 2025-01-16 01:26:42,605 - INFO - step 4532, loss: 3.830215, best loss: 2.607254 2025-01-16 01:26:42,755 - INFO - step 4533, loss: 4.175278, best loss: 2.607254 2025-01-16 01:26:42,906 - INFO - step 4534, loss: 4.030378, best 
loss: 2.607254 2025-01-16 01:26:43,056 - INFO - step 4535, loss: 3.643234, best loss: 2.607254 2025-01-16 01:26:43,206 - INFO - step 4536, loss: 3.630590, best loss: 2.607254 2025-01-16 01:26:43,356 - INFO - step 4537, loss: 4.058003, best loss: 2.607254 2025-01-16 01:26:43,507 - INFO - step 4538, loss: 3.660313, best loss: 2.607254 2025-01-16 01:26:43,657 - INFO - step 4539, loss: 3.361791, best loss: 2.607254 2025-01-16 01:26:43,807 - INFO - step 4540, loss: 3.883500, best loss: 2.607254 2025-01-16 01:26:43,957 - INFO - step 4541, loss: 3.892350, best loss: 2.607254 2025-01-16 01:26:44,107 - INFO - step 4542, loss: 3.785514, best loss: 2.607254 2025-01-16 01:26:44,258 - INFO - step 4543, loss: 3.433666, best loss: 2.607254 2025-01-16 01:26:44,408 - INFO - step 4544, loss: 3.244828, best loss: 2.607254 2025-01-16 01:26:44,558 - INFO - step 4545, loss: 3.221159, best loss: 2.607254 2025-01-16 01:26:44,708 - INFO - step 4546, loss: 3.351902, best loss: 2.607254 2025-01-16 01:26:44,858 - INFO - step 4547, loss: 3.452171, best loss: 2.607254 2025-01-16 01:26:45,009 - INFO - step 4548, loss: 3.717948, best loss: 2.607254 2025-01-16 01:26:45,159 - INFO - step 4549, loss: 3.515196, best loss: 2.607254 2025-01-16 01:26:45,309 - INFO - step 4550, loss: 3.473497, best loss: 2.607254 2025-01-16 01:26:45,460 - INFO - step 4551, loss: 3.654423, best loss: 2.607254 2025-01-16 01:26:45,610 - INFO - step 4552, loss: 3.576324, best loss: 2.607254 2025-01-16 01:26:45,760 - INFO - step 4553, loss: 3.586782, best loss: 2.607254 2025-01-16 01:26:45,911 - INFO - step 4554, loss: 3.758908, best loss: 2.607254 2025-01-16 01:26:46,061 - INFO - step 4555, loss: 3.865012, best loss: 2.607254 2025-01-16 01:26:46,211 - INFO - step 4556, loss: 3.367843, best loss: 2.607254 2025-01-16 01:26:46,361 - INFO - step 4557, loss: 3.368721, best loss: 2.607254 2025-01-16 01:26:46,511 - INFO - step 4558, loss: 3.646538, best loss: 2.607254 2025-01-16 01:26:46,661 - INFO - step 4559, loss: 3.793711, best 
loss: 2.607254 2025-01-16 01:26:46,811 - INFO - step 4560, loss: 3.304252, best loss: 2.607254 2025-01-16 01:26:46,962 - INFO - step 4561, loss: 3.213067, best loss: 2.607254 2025-01-16 01:26:47,112 - INFO - step 4562, loss: 3.581962, best loss: 2.607254 2025-01-16 01:26:47,262 - INFO - step 4563, loss: 3.510802, best loss: 2.607254 2025-01-16 01:26:47,412 - INFO - step 4564, loss: 3.239146, best loss: 2.607254 2025-01-16 01:26:47,562 - INFO - step 4565, loss: 3.724102, best loss: 2.607254 2025-01-16 01:26:47,712 - INFO - step 4566, loss: 3.425097, best loss: 2.607254 2025-01-16 01:26:47,863 - INFO - step 4567, loss: 3.408866, best loss: 2.607254 2025-01-16 01:26:48,013 - INFO - step 4568, loss: 3.399039, best loss: 2.607254 2025-01-16 01:26:48,164 - INFO - step 4569, loss: 3.632107, best loss: 2.607254 2025-01-16 01:26:48,314 - INFO - step 4570, loss: 3.228928, best loss: 2.607254 2025-01-16 01:26:48,464 - INFO - step 4571, loss: 3.369972, best loss: 2.607254 2025-01-16 01:26:48,615 - INFO - step 4572, loss: 3.182854, best loss: 2.607254 2025-01-16 01:26:48,765 - INFO - step 4573, loss: 3.648751, best loss: 2.607254 2025-01-16 01:26:48,915 - INFO - step 4574, loss: 3.975854, best loss: 2.607254 2025-01-16 01:26:49,066 - INFO - step 4575, loss: 4.254766, best loss: 2.607254 2025-01-16 01:26:49,216 - INFO - step 4576, loss: 3.746787, best loss: 2.607254 2025-01-16 01:26:49,366 - INFO - step 4577, loss: 3.974148, best loss: 2.607254 2025-01-16 01:26:49,516 - INFO - step 4578, loss: 3.793813, best loss: 2.607254 2025-01-16 01:26:49,666 - INFO - step 4579, loss: 3.744411, best loss: 2.607254 2025-01-16 01:26:49,817 - INFO - step 4580, loss: 3.542656, best loss: 2.607254 2025-01-16 01:26:49,967 - INFO - step 4581, loss: 3.902776, best loss: 2.607254 2025-01-16 01:26:50,117 - INFO - step 4582, loss: 3.541759, best loss: 2.607254 2025-01-16 01:26:50,268 - INFO - step 4583, loss: 3.225139, best loss: 2.607254 2025-01-16 01:26:50,418 - INFO - step 4584, loss: 3.512142, best 
loss: 2.607254 2025-01-16 01:26:50,568 - INFO - step 4585, loss: 3.469298, best loss: 2.607254 2025-01-16 01:26:50,718 - INFO - step 4586, loss: 3.707677, best loss: 2.607254 2025-01-16 01:26:50,868 - INFO - step 4587, loss: 2.890484, best loss: 2.607254 2025-01-16 01:26:51,019 - INFO - step 4588, loss: 3.584011, best loss: 2.607254 2025-01-16 01:26:51,169 - INFO - step 4589, loss: 3.785636, best loss: 2.607254 2025-01-16 01:26:51,319 - INFO - step 4590, loss: 3.779998, best loss: 2.607254 2025-01-16 01:26:51,469 - INFO - step 4591, loss: 3.553669, best loss: 2.607254 2025-01-16 01:26:51,619 - INFO - step 4592, loss: 3.548325, best loss: 2.607254 2025-01-16 01:26:51,770 - INFO - step 4593, loss: 3.588276, best loss: 2.607254 2025-01-16 01:26:51,920 - INFO - step 4594, loss: 3.327584, best loss: 2.607254 2025-01-16 01:26:52,070 - INFO - step 4595, loss: 3.674843, best loss: 2.607254 2025-01-16 01:26:52,220 - INFO - step 4596, loss: 3.468775, best loss: 2.607254 2025-01-16 01:26:52,370 - INFO - step 4597, loss: 3.597579, best loss: 2.607254 2025-01-16 01:26:52,521 - INFO - step 4598, loss: 3.269422, best loss: 2.607254 2025-01-16 01:26:52,671 - INFO - step 4599, loss: 3.330859, best loss: 2.607254 2025-01-16 01:26:52,821 - INFO - step 4600, loss: 3.286366, best loss: 2.607254 2025-01-16 01:26:52,972 - INFO - step 4601, loss: 3.265790, best loss: 2.607254 2025-01-16 01:26:53,122 - INFO - step 4602, loss: 3.429255, best loss: 2.607254 2025-01-16 01:26:53,272 - INFO - step 4603, loss: 3.407403, best loss: 2.607254 2025-01-16 01:26:53,423 - INFO - step 4604, loss: 3.253979, best loss: 2.607254 2025-01-16 01:26:53,573 - INFO - step 4605, loss: 2.997764, best loss: 2.607254 2025-01-16 01:26:53,723 - INFO - step 4606, loss: 2.961141, best loss: 2.607254 2025-01-16 01:26:53,873 - INFO - step 4607, loss: 2.786710, best loss: 2.607254 2025-01-16 01:26:54,023 - INFO - step 4608, loss: 3.451241, best loss: 2.607254 2025-01-16 01:26:54,174 - INFO - step 4609, loss: 3.847816, best 
loss: 2.607254 2025-01-16 01:26:54,324 - INFO - step 4610, loss: 3.775246, best loss: 2.607254 2025-01-16 01:26:54,474 - INFO - step 4611, loss: 3.964992, best loss: 2.607254 2025-01-16 01:26:54,624 - INFO - step 4612, loss: 3.996335, best loss: 2.607254 2025-01-16 01:26:54,774 - INFO - step 4613, loss: 3.592031, best loss: 2.607254 2025-01-16 01:26:54,924 - INFO - step 4614, loss: 3.727810, best loss: 2.607254 2025-01-16 01:26:55,074 - INFO - step 4615, loss: 3.917096, best loss: 2.607254 2025-01-16 01:26:55,224 - INFO - step 4616, loss: 3.535915, best loss: 2.607254 2025-01-16 01:26:55,374 - INFO - step 4617, loss: 3.043717, best loss: 2.607254 2025-01-16 01:26:55,524 - INFO - step 4618, loss: 3.435709, best loss: 2.607254 2025-01-16 01:26:55,675 - INFO - step 4619, loss: 3.259272, best loss: 2.607254 2025-01-16 01:26:55,825 - INFO - step 4620, loss: 3.831096, best loss: 2.607254 2025-01-16 01:26:55,975 - INFO - step 4621, loss: 3.755247, best loss: 2.607254 2025-01-16 01:26:56,125 - INFO - step 4622, loss: 3.812075, best loss: 2.607254 2025-01-16 01:26:56,275 - INFO - step 4623, loss: 3.710893, best loss: 2.607254 2025-01-16 01:26:56,425 - INFO - step 4624, loss: 3.893322, best loss: 2.607254 2025-01-16 01:26:56,576 - INFO - step 4625, loss: 3.352247, best loss: 2.607254 2025-01-16 01:26:56,726 - INFO - step 4626, loss: 3.828807, best loss: 2.607254 2025-01-16 01:26:56,876 - INFO - step 4627, loss: 3.610958, best loss: 2.607254 2025-01-16 01:26:57,026 - INFO - step 4628, loss: 3.847794, best loss: 2.607254 2025-01-16 01:26:57,177 - INFO - step 4629, loss: 3.643606, best loss: 2.607254 2025-01-16 01:26:57,327 - INFO - step 4630, loss: 3.510124, best loss: 2.607254 2025-01-16 01:26:57,477 - INFO - step 4631, loss: 4.073817, best loss: 2.607254 2025-01-16 01:26:57,627 - INFO - step 4632, loss: 3.440927, best loss: 2.607254 2025-01-16 01:26:57,777 - INFO - step 4633, loss: 3.767637, best loss: 2.607254 2025-01-16 01:26:57,927 - INFO - step 4634, loss: 3.806919, best 
loss: 2.607254 2025-01-16 01:26:58,077 - INFO - step 4635, loss: 3.858195, best loss: 2.607254 2025-01-16 01:26:58,227 - INFO - step 4636, loss: 3.696093, best loss: 2.607254 2025-01-16 01:26:58,377 - INFO - step 4637, loss: 3.477914, best loss: 2.607254 2025-01-16 01:26:58,528 - INFO - step 4638, loss: 3.452090, best loss: 2.607254 2025-01-16 01:26:58,678 - INFO - step 4639, loss: 3.540730, best loss: 2.607254 2025-01-16 01:26:58,828 - INFO - step 4640, loss: 3.140367, best loss: 2.607254 2025-01-16 01:26:58,979 - INFO - step 4641, loss: 3.976189, best loss: 2.607254 2025-01-16 01:26:59,129 - INFO - step 4642, loss: 3.071532, best loss: 2.607254 2025-01-16 01:26:59,279 - INFO - step 4643, loss: 3.080140, best loss: 2.607254 2025-01-16 01:26:59,429 - INFO - step 4644, loss: 3.579783, best loss: 2.607254 2025-01-16 01:26:59,579 - INFO - step 4645, loss: 3.528784, best loss: 2.607254 2025-01-16 01:26:59,730 - INFO - step 4646, loss: 3.485065, best loss: 2.607254 2025-01-16 01:26:59,880 - INFO - step 4647, loss: 3.228618, best loss: 2.607254 2025-01-16 01:27:00,030 - INFO - step 4648, loss: 3.489883, best loss: 2.607254 2025-01-16 01:27:00,180 - INFO - step 4649, loss: 3.516190, best loss: 2.607254 2025-01-16 01:27:00,330 - INFO - step 4650, loss: 3.377364, best loss: 2.607254 2025-01-16 01:27:00,480 - INFO - step 4651, loss: 3.260314, best loss: 2.607254 2025-01-16 01:27:00,630 - INFO - step 4652, loss: 3.665868, best loss: 2.607254 2025-01-16 01:27:00,780 - INFO - step 4653, loss: 3.568074, best loss: 2.607254 2025-01-16 01:27:00,930 - INFO - step 4654, loss: 3.505111, best loss: 2.607254 2025-01-16 01:27:01,081 - INFO - step 4655, loss: 3.110267, best loss: 2.607254 2025-01-16 01:27:01,231 - INFO - step 4656, loss: 3.522384, best loss: 2.607254 2025-01-16 01:27:01,381 - INFO - step 4657, loss: 3.764879, best loss: 2.607254 2025-01-16 01:27:01,531 - INFO - step 4658, loss: 3.332654, best loss: 2.607254 2025-01-16 01:27:01,682 - INFO - step 4659, loss: 3.700324, best 
loss: 2.607254 2025-01-16 01:27:01,831 - INFO - step 4660, loss: 3.850690, best loss: 2.607254 2025-01-16 01:27:01,982 - INFO - step 4661, loss: 3.814456, best loss: 2.607254 2025-01-16 01:27:02,132 - INFO - step 4662, loss: 3.816661, best loss: 2.607254 2025-01-16 01:27:02,282 - INFO - step 4663, loss: 3.709079, best loss: 2.607254 2025-01-16 01:27:02,432 - INFO - step 4664, loss: 3.631323, best loss: 2.607254 2025-01-16 01:27:02,582 - INFO - step 4665, loss: 3.411244, best loss: 2.607254 2025-01-16 01:27:02,733 - INFO - step 4666, loss: 4.096194, best loss: 2.607254 2025-01-16 01:27:02,883 - INFO - step 4667, loss: 3.695024, best loss: 2.607254 2025-01-16 01:27:03,033 - INFO - step 4668, loss: 3.989360, best loss: 2.607254 2025-01-16 01:27:03,183 - INFO - step 4669, loss: 3.097475, best loss: 2.607254 2025-01-16 01:27:03,333 - INFO - step 4670, loss: 3.210904, best loss: 2.607254 2025-01-16 01:27:03,483 - INFO - step 4671, loss: 3.692585, best loss: 2.607254 2025-01-16 01:27:03,634 - INFO - step 4672, loss: 3.660140, best loss: 2.607254 2025-01-16 01:27:03,784 - INFO - step 4673, loss: 3.534737, best loss: 2.607254 2025-01-16 01:27:03,934 - INFO - step 4674, loss: 3.614007, best loss: 2.607254 2025-01-16 01:27:04,084 - INFO - step 4675, loss: 3.377636, best loss: 2.607254 2025-01-16 01:27:04,235 - INFO - step 4676, loss: 3.544165, best loss: 2.607254 2025-01-16 01:27:04,385 - INFO - step 4677, loss: 3.653978, best loss: 2.607254 2025-01-16 01:27:04,536 - INFO - step 4678, loss: 3.213715, best loss: 2.607254 2025-01-16 01:27:04,686 - INFO - step 4679, loss: 3.450754, best loss: 2.607254 2025-01-16 01:27:04,836 - INFO - step 4680, loss: 3.556676, best loss: 2.607254 2025-01-16 01:27:04,987 - INFO - step 4681, loss: 3.689161, best loss: 2.607254 2025-01-16 01:27:05,136 - INFO - step 4682, loss: 3.555194, best loss: 2.607254 2025-01-16 01:27:05,287 - INFO - step 4683, loss: 3.710859, best loss: 2.607254 2025-01-16 01:27:05,437 - INFO - step 4684, loss: 3.582178, best 
loss: 2.607254 2025-01-16 01:27:05,587 - INFO - step 4685, loss: 3.231755, best loss: 2.607254 2025-01-16 01:27:05,737 - INFO - step 4686, loss: 3.572522, best loss: 2.607254 2025-01-16 01:27:05,887 - INFO - step 4687, loss: 2.957302, best loss: 2.607254 2025-01-16 01:27:06,037 - INFO - step 4688, loss: 3.274246, best loss: 2.607254 2025-01-16 01:27:06,187 - INFO - step 4689, loss: 3.458523, best loss: 2.607254 2025-01-16 01:27:06,337 - INFO - step 4690, loss: 3.379011, best loss: 2.607254 2025-01-16 01:27:06,488 - INFO - step 4691, loss: 3.310933, best loss: 2.607254 2025-01-16 01:27:06,638 - INFO - step 4692, loss: 3.563147, best loss: 2.607254 2025-01-16 01:27:06,788 - INFO - step 4693, loss: 3.789952, best loss: 2.607254 2025-01-16 01:27:06,938 - INFO - step 4694, loss: 3.576451, best loss: 2.607254 2025-01-16 01:27:07,088 - INFO - step 4695, loss: 3.814660, best loss: 2.607254 2025-01-16 01:27:07,238 - INFO - step 4696, loss: 3.431432, best loss: 2.607254 2025-01-16 01:27:07,389 - INFO - step 4697, loss: 3.514711, best loss: 2.607254 2025-01-16 01:27:07,539 - INFO - step 4698, loss: 3.369429, best loss: 2.607254 2025-01-16 01:27:07,688 - INFO - step 4699, loss: 3.065156, best loss: 2.607254 2025-01-16 01:27:07,839 - INFO - step 4700, loss: 3.765954, best loss: 2.607254 2025-01-16 01:27:07,989 - INFO - step 4701, loss: 3.643248, best loss: 2.607254 2025-01-16 01:27:08,139 - INFO - step 4702, loss: 3.374688, best loss: 2.607254 2025-01-16 01:27:08,289 - INFO - step 4703, loss: 3.286808, best loss: 2.607254 2025-01-16 01:27:08,440 - INFO - step 4704, loss: 3.097148, best loss: 2.607254 2025-01-16 01:27:08,590 - INFO - step 4705, loss: 3.332553, best loss: 2.607254 2025-01-16 01:27:08,740 - INFO - step 4706, loss: 3.234145, best loss: 2.607254 2025-01-16 01:27:08,890 - INFO - step 4707, loss: 3.204859, best loss: 2.607254 2025-01-16 01:27:09,040 - INFO - step 4708, loss: 3.756519, best loss: 2.607254 2025-01-16 01:27:09,190 - INFO - step 4709, loss: 3.761750, best 
loss: 2.607254 2025-01-16 01:27:09,341 - INFO - step 4710, loss: 3.454959, best loss: 2.607254 2025-01-16 01:27:09,491 - INFO - step 4711, loss: 3.801192, best loss: 2.607254 2025-01-16 01:27:09,642 - INFO - step 4712, loss: 3.475601, best loss: 2.607254 2025-01-16 01:27:09,792 - INFO - step 4713, loss: 3.811919, best loss: 2.607254 2025-01-16 01:27:09,942 - INFO - step 4714, loss: 3.900211, best loss: 2.607254 2025-01-16 01:27:10,092 - INFO - step 4715, loss: 3.913948, best loss: 2.607254 2025-01-16 01:27:10,242 - INFO - step 4716, loss: 3.957300, best loss: 2.607254 2025-01-16 01:27:10,392 - INFO - step 4717, loss: 3.649297, best loss: 2.607254 2025-01-16 01:27:10,542 - INFO - step 4718, loss: 3.868593, best loss: 2.607254 2025-01-16 01:27:10,693 - INFO - step 4719, loss: 3.715587, best loss: 2.607254 2025-01-16 01:27:10,843 - INFO - step 4720, loss: 3.508145, best loss: 2.607254 2025-01-16 01:27:10,993 - INFO - step 4721, loss: 3.973334, best loss: 2.607254 2025-01-16 01:27:11,143 - INFO - step 4722, loss: 3.898805, best loss: 2.607254 2025-01-16 01:27:11,293 - INFO - step 4723, loss: 3.885923, best loss: 2.607254 2025-01-16 01:27:11,443 - INFO - step 4724, loss: 3.594909, best loss: 2.607254 2025-01-16 01:27:11,593 - INFO - step 4725, loss: 3.797952, best loss: 2.607254 2025-01-16 01:27:11,744 - INFO - step 4726, loss: 3.650629, best loss: 2.607254 2025-01-16 01:27:11,894 - INFO - step 4727, loss: 3.389929, best loss: 2.607254 2025-01-16 01:27:12,044 - INFO - step 4728, loss: 3.419341, best loss: 2.607254 2025-01-16 01:27:12,194 - INFO - step 4729, loss: 3.863026, best loss: 2.607254 2025-01-16 01:27:12,344 - INFO - step 4730, loss: 3.706408, best loss: 2.607254 2025-01-16 01:27:12,494 - INFO - step 4731, loss: 3.934146, best loss: 2.607254 2025-01-16 01:27:12,644 - INFO - step 4732, loss: 3.826410, best loss: 2.607254 2025-01-16 01:27:12,795 - INFO - step 4733, loss: 3.452664, best loss: 2.607254 2025-01-16 01:27:12,945 - INFO - step 4734, loss: 3.864088, best 
loss: 2.607254 2025-01-16 01:27:13,095 - INFO - step 4735, loss: 3.498768, best loss: 2.607254 2025-01-16 01:27:13,245 - INFO - step 4736, loss: 3.839010, best loss: 2.607254 2025-01-16 01:27:13,395 - INFO - step 4737, loss: 3.458849, best loss: 2.607254 2025-01-16 01:27:13,545 - INFO - step 4738, loss: 3.530769, best loss: 2.607254 2025-01-16 01:27:13,695 - INFO - step 4739, loss: 3.602351, best loss: 2.607254 2025-01-16 01:27:13,845 - INFO - step 4740, loss: 3.598573, best loss: 2.607254 2025-01-16 01:27:13,996 - INFO - step 4741, loss: 3.502382, best loss: 2.607254 2025-01-16 01:27:14,146 - INFO - step 4742, loss: 3.710053, best loss: 2.607254 2025-01-16 01:27:14,296 - INFO - step 4743, loss: 3.281571, best loss: 2.607254 2025-01-16 01:27:14,446 - INFO - step 4744, loss: 3.001259, best loss: 2.607254 2025-01-16 01:27:14,596 - INFO - step 4745, loss: 3.184036, best loss: 2.607254 2025-01-16 01:27:14,746 - INFO - step 4746, loss: 3.427139, best loss: 2.607254 2025-01-16 01:27:14,897 - INFO - step 4747, loss: 4.020403, best loss: 2.607254 2025-01-16 01:27:15,047 - INFO - step 4748, loss: 3.512223, best loss: 2.607254 2025-01-16 01:27:15,196 - INFO - step 4749, loss: 3.141579, best loss: 2.607254 2025-01-16 01:27:15,347 - INFO - step 4750, loss: 3.713719, best loss: 2.607254 2025-01-16 01:27:15,497 - INFO - step 4751, loss: 3.444856, best loss: 2.607254 2025-01-16 01:27:15,647 - INFO - step 4752, loss: 4.037930, best loss: 2.607254 2025-01-16 01:27:15,797 - INFO - step 4753, loss: 3.387723, best loss: 2.607254 2025-01-16 01:27:15,948 - INFO - step 4754, loss: 3.712116, best loss: 2.607254 2025-01-16 01:27:16,098 - INFO - step 4755, loss: 3.518841, best loss: 2.607254 2025-01-16 01:27:16,248 - INFO - step 4756, loss: 3.929978, best loss: 2.607254 2025-01-16 01:27:16,398 - INFO - step 4757, loss: 3.600135, best loss: 2.607254 2025-01-16 01:27:16,548 - INFO - step 4758, loss: 3.376399, best loss: 2.607254 2025-01-16 01:27:16,699 - INFO - step 4759, loss: 3.781309, best 
loss: 2.607254 2025-01-16 01:27:16,849 - INFO - step 4760, loss: 3.548536, best loss: 2.607254 2025-01-16 01:27:16,999 - INFO - step 4761, loss: 3.399098, best loss: 2.607254 2025-01-16 01:27:17,149 - INFO - step 4762, loss: 4.056649, best loss: 2.607254 2025-01-16 01:27:17,299 - INFO - step 4763, loss: 3.550902, best loss: 2.607254 2025-01-16 01:27:17,449 - INFO - step 4764, loss: 3.120778, best loss: 2.607254 2025-01-16 01:27:17,599 - INFO - step 4765, loss: 3.458033, best loss: 2.607254 2025-01-16 01:27:17,749 - INFO - step 4766, loss: 3.777586, best loss: 2.607254 2025-01-16 01:27:17,899 - INFO - step 4767, loss: 3.684412, best loss: 2.607254 2025-01-16 01:27:18,050 - INFO - step 4768, loss: 3.534497, best loss: 2.607254 2025-01-16 01:27:18,200 - INFO - step 4769, loss: 3.461027, best loss: 2.607254 2025-01-16 01:27:18,350 - INFO - step 4770, loss: 3.932531, best loss: 2.607254 2025-01-16 01:27:18,500 - INFO - step 4771, loss: 3.768665, best loss: 2.607254 2025-01-16 01:27:18,650 - INFO - step 4772, loss: 3.684974, best loss: 2.607254 2025-01-16 01:27:18,800 - INFO - step 4773, loss: 3.336157, best loss: 2.607254 2025-01-16 01:27:18,950 - INFO - step 4774, loss: 3.616086, best loss: 2.607254 2025-01-16 01:27:19,100 - INFO - step 4775, loss: 3.551897, best loss: 2.607254 2025-01-16 01:27:19,251 - INFO - step 4776, loss: 3.151022, best loss: 2.607254 2025-01-16 01:27:19,401 - INFO - step 4777, loss: 3.621487, best loss: 2.607254 2025-01-16 01:27:19,551 - INFO - step 4778, loss: 3.316121, best loss: 2.607254 2025-01-16 01:27:19,702 - INFO - step 4779, loss: 3.642183, best loss: 2.607254 2025-01-16 01:27:19,852 - INFO - step 4780, loss: 3.337448, best loss: 2.607254 2025-01-16 01:27:20,002 - INFO - step 4781, loss: 3.555068, best loss: 2.607254 2025-01-16 01:27:20,152 - INFO - step 4782, loss: 3.353589, best loss: 2.607254 2025-01-16 01:27:20,303 - INFO - step 4783, loss: 3.476390, best loss: 2.607254 2025-01-16 01:27:20,453 - INFO - step 4784, loss: 3.786755, best 
loss: 2.607254 2025-01-16 01:27:20,603 - INFO - step 4785, loss: 3.642498, best loss: 2.607254 2025-01-16 01:27:20,753 - INFO - step 4786, loss: 3.840371, best loss: 2.607254 2025-01-16 01:27:20,903 - INFO - step 4787, loss: 3.377345, best loss: 2.607254 2025-01-16 01:27:21,052 - INFO - step 4788, loss: 3.685740, best loss: 2.607254 2025-01-16 01:27:21,203 - INFO - step 4789, loss: 3.694432, best loss: 2.607254 2025-01-16 01:27:21,353 - INFO - step 4790, loss: 3.457683, best loss: 2.607254 2025-01-16 01:27:21,503 - INFO - step 4791, loss: 3.208281, best loss: 2.607254 2025-01-16 01:27:21,653 - INFO - step 4792, loss: 3.282398, best loss: 2.607254 2025-01-16 01:27:21,803 - INFO - step 4793, loss: 3.455014, best loss: 2.607254 2025-01-16 01:27:21,954 - INFO - step 4794, loss: 3.502218, best loss: 2.607254 2025-01-16 01:27:22,104 - INFO - step 4795, loss: 3.784049, best loss: 2.607254 2025-01-16 01:27:22,254 - INFO - step 4796, loss: 4.009789, best loss: 2.607254 2025-01-16 01:27:22,404 - INFO - step 4797, loss: 3.780609, best loss: 2.607254 2025-01-16 01:27:22,554 - INFO - step 4798, loss: 3.867862, best loss: 2.607254 2025-01-16 01:27:22,705 - INFO - step 4799, loss: 3.706756, best loss: 2.607254 2025-01-16 01:27:22,855 - INFO - step 4800, loss: 3.732919, best loss: 2.607254 2025-01-16 01:27:23,005 - INFO - step 4801, loss: 3.256083, best loss: 2.607254 2025-01-16 01:27:23,155 - INFO - step 4802, loss: 3.780019, best loss: 2.607254 2025-01-16 01:27:23,305 - INFO - step 4803, loss: 4.040516, best loss: 2.607254 2025-01-16 01:27:23,456 - INFO - step 4804, loss: 3.742993, best loss: 2.607254 2025-01-16 01:27:23,606 - INFO - step 4805, loss: 3.727707, best loss: 2.607254 2025-01-16 01:27:23,756 - INFO - step 4806, loss: 3.739423, best loss: 2.607254 2025-01-16 01:27:23,906 - INFO - step 4807, loss: 3.421943, best loss: 2.607254 2025-01-16 01:27:27,352 - INFO - step 4808, loss: 2.566456, best loss: 2.566456 2025-01-16 01:27:27,513 - INFO - step 4809, loss: 3.583000, best 
loss: 2.566456 2025-01-16 01:27:27,664 - INFO - step 4810, loss: 3.723270, best loss: 2.566456 2025-01-16 01:27:27,814 - INFO - step 4811, loss: 3.693374, best loss: 2.566456 2025-01-16 01:27:27,964 - INFO - step 4812, loss: 3.776435, best loss: 2.566456 2025-01-16 01:27:28,114 - INFO - step 4813, loss: 3.446968, best loss: 2.566456 2025-01-16 01:27:28,264 - INFO - step 4814, loss: 3.606920, best loss: 2.566456 2025-01-16 01:27:28,415 - INFO - step 4815, loss: 3.658126, best loss: 2.566456 2025-01-16 01:27:28,565 - INFO - step 4816, loss: 3.457980, best loss: 2.566456 2025-01-16 01:27:28,715 - INFO - step 4817, loss: 3.621075, best loss: 2.566456 2025-01-16 01:27:28,865 - INFO - step 4818, loss: 3.499870, best loss: 2.566456 2025-01-16 01:27:29,015 - INFO - step 4819, loss: 3.339827, best loss: 2.566456 2025-01-16 01:27:29,165 - INFO - step 4820, loss: 3.658405, best loss: 2.566456 2025-01-16 01:27:29,315 - INFO - step 4821, loss: 3.315964, best loss: 2.566456 2025-01-16 01:27:29,465 - INFO - step 4822, loss: 3.580754, best loss: 2.566456 2025-01-16 01:27:29,616 - INFO - step 4823, loss: 3.835385, best loss: 2.566456 2025-01-16 01:27:29,766 - INFO - step 4824, loss: 3.447775, best loss: 2.566456 2025-01-16 01:27:29,917 - INFO - step 4825, loss: 3.110466, best loss: 2.566456 2025-01-16 01:27:30,067 - INFO - step 4826, loss: 3.625848, best loss: 2.566456 2025-01-16 01:27:30,218 - INFO - step 4827, loss: 3.747388, best loss: 2.566456 2025-01-16 01:27:30,368 - INFO - step 4828, loss: 3.780313, best loss: 2.566456 2025-01-16 01:27:30,518 - INFO - step 4829, loss: 3.625359, best loss: 2.566456 2025-01-16 01:27:30,668 - INFO - step 4830, loss: 3.761157, best loss: 2.566456 2025-01-16 01:27:30,818 - INFO - step 4831, loss: 3.604548, best loss: 2.566456 2025-01-16 01:27:30,969 - INFO - step 4832, loss: 3.819820, best loss: 2.566456 2025-01-16 01:27:31,119 - INFO - step 4833, loss: 3.402682, best loss: 2.566456 2025-01-16 01:27:31,269 - INFO - step 4834, loss: 3.531303, best 
loss: 2.566456 2025-01-16 01:27:31,419 - INFO - step 4835, loss: 3.721988, best loss: 2.566456 2025-01-16 01:27:31,570 - INFO - step 4836, loss: 3.782511, best loss: 2.566456 2025-01-16 01:27:31,720 - INFO - step 4837, loss: 3.667964, best loss: 2.566456 2025-01-16 01:27:31,870 - INFO - step 4838, loss: 3.537023, best loss: 2.566456 2025-01-16 01:27:32,021 - INFO - step 4839, loss: 3.508037, best loss: 2.566456 2025-01-16 01:27:32,171 - INFO - step 4840, loss: 3.721242, best loss: 2.566456 2025-01-16 01:27:32,321 - INFO - step 4841, loss: 3.919596, best loss: 2.566456 2025-01-16 01:27:32,471 - INFO - step 4842, loss: 3.672888, best loss: 2.566456 2025-01-16 01:27:32,621 - INFO - step 4843, loss: 3.957879, best loss: 2.566456 2025-01-16 01:27:32,771 - INFO - step 4844, loss: 4.024541, best loss: 2.566456 2025-01-16 01:27:32,921 - INFO - step 4845, loss: 3.861600, best loss: 2.566456 2025-01-16 01:27:33,072 - INFO - step 4846, loss: 4.092669, best loss: 2.566456 2025-01-16 01:27:33,222 - INFO - step 4847, loss: 3.890319, best loss: 2.566456 2025-01-16 01:27:33,372 - INFO - step 4848, loss: 3.367318, best loss: 2.566456 2025-01-16 01:27:33,523 - INFO - step 4849, loss: 3.863207, best loss: 2.566456 2025-01-16 01:27:33,673 - INFO - step 4850, loss: 3.849393, best loss: 2.566456 2025-01-16 01:27:33,823 - INFO - step 4851, loss: 3.927605, best loss: 2.566456 2025-01-16 01:27:33,973 - INFO - step 4852, loss: 3.550229, best loss: 2.566456 2025-01-16 01:27:34,124 - INFO - step 4853, loss: 3.745925, best loss: 2.566456 2025-01-16 01:27:34,273 - INFO - step 4854, loss: 3.738494, best loss: 2.566456 2025-01-16 01:27:34,423 - INFO - step 4855, loss: 3.593760, best loss: 2.566456 2025-01-16 01:27:34,574 - INFO - step 4856, loss: 3.860897, best loss: 2.566456 2025-01-16 01:27:34,724 - INFO - step 4857, loss: 3.486225, best loss: 2.566456 2025-01-16 01:27:34,874 - INFO - step 4858, loss: 3.417301, best loss: 2.566456 2025-01-16 01:27:35,025 - INFO - step 4859, loss: 3.914567, best 
loss: 2.566456 2025-01-16 01:27:35,175 - INFO - step 4860, loss: 3.810817, best loss: 2.566456 2025-01-16 01:27:35,325 - INFO - step 4861, loss: 3.886409, best loss: 2.566456 2025-01-16 01:27:35,475 - INFO - step 4862, loss: 3.689500, best loss: 2.566456 2025-01-16 01:27:35,625 - INFO - step 4863, loss: 3.989863, best loss: 2.566456 2025-01-16 01:27:35,776 - INFO - step 4864, loss: 3.841005, best loss: 2.566456 2025-01-16 01:27:35,926 - INFO - step 4865, loss: 3.527438, best loss: 2.566456 2025-01-16 01:27:36,076 - INFO - step 4866, loss: 3.566204, best loss: 2.566456 2025-01-16 01:27:36,227 - INFO - step 4867, loss: 3.919813, best loss: 2.566456 2025-01-16 01:27:36,377 - INFO - step 4868, loss: 3.559216, best loss: 2.566456 2025-01-16 01:27:36,527 - INFO - step 4869, loss: 3.254135, best loss: 2.566456 2025-01-16 01:27:36,677 - INFO - step 4870, loss: 3.732824, best loss: 2.566456 2025-01-16 01:27:36,828 - INFO - step 4871, loss: 3.751525, best loss: 2.566456 2025-01-16 01:27:36,978 - INFO - step 4872, loss: 3.683876, best loss: 2.566456 2025-01-16 01:27:37,128 - INFO - step 4873, loss: 3.268899, best loss: 2.566456 2025-01-16 01:27:37,278 - INFO - step 4874, loss: 3.143058, best loss: 2.566456 2025-01-16 01:27:37,428 - INFO - step 4875, loss: 3.084443, best loss: 2.566456 2025-01-16 01:27:37,578 - INFO - step 4876, loss: 3.230651, best loss: 2.566456 2025-01-16 01:27:37,728 - INFO - step 4877, loss: 3.337897, best loss: 2.566456 2025-01-16 01:27:37,878 - INFO - step 4878, loss: 3.688650, best loss: 2.566456 2025-01-16 01:27:38,028 - INFO - step 4879, loss: 3.403553, best loss: 2.566456 2025-01-16 01:27:38,178 - INFO - step 4880, loss: 3.403187, best loss: 2.566456 2025-01-16 01:27:38,329 - INFO - step 4881, loss: 3.609668, best loss: 2.566456 2025-01-16 01:27:38,479 - INFO - step 4882, loss: 3.529743, best loss: 2.566456 2025-01-16 01:27:38,629 - INFO - step 4883, loss: 3.417005, best loss: 2.566456 2025-01-16 01:27:38,779 - INFO - step 4884, loss: 3.619478, best 
loss: 2.566456 2025-01-16 01:27:38,929 - INFO - step 4885, loss: 3.721632, best loss: 2.566456 2025-01-16 01:27:39,079 - INFO - step 4886, loss: 3.253287, best loss: 2.566456 2025-01-16 01:27:39,230 - INFO - step 4887, loss: 3.201309, best loss: 2.566456 2025-01-16 01:27:39,380 - INFO - step 4888, loss: 3.525411, best loss: 2.566456 2025-01-16 01:27:39,531 - INFO - step 4889, loss: 3.758976, best loss: 2.566456 2025-01-16 01:27:39,681 - INFO - step 4890, loss: 3.302589, best loss: 2.566456 2025-01-16 01:27:39,831 - INFO - step 4891, loss: 3.214858, best loss: 2.566456 2025-01-16 01:27:39,981 - INFO - step 4892, loss: 3.553572, best loss: 2.566456 2025-01-16 01:27:40,131 - INFO - step 4893, loss: 3.417294, best loss: 2.566456 2025-01-16 01:27:40,281 - INFO - step 4894, loss: 3.206543, best loss: 2.566456 2025-01-16 01:27:40,431 - INFO - step 4895, loss: 3.563508, best loss: 2.566456 2025-01-16 01:27:40,581 - INFO - step 4896, loss: 3.289298, best loss: 2.566456 2025-01-16 01:27:40,731 - INFO - step 4897, loss: 3.238701, best loss: 2.566456 2025-01-16 01:27:40,882 - INFO - step 4898, loss: 3.255740, best loss: 2.566456 2025-01-16 01:27:41,032 - INFO - step 4899, loss: 3.505126, best loss: 2.566456 2025-01-16 01:27:41,182 - INFO - step 4900, loss: 3.185815, best loss: 2.566456 2025-01-16 01:27:41,332 - INFO - step 4901, loss: 3.312241, best loss: 2.566456 2025-01-16 01:27:41,482 - INFO - step 4902, loss: 3.178651, best loss: 2.566456 2025-01-16 01:27:41,632 - INFO - step 4903, loss: 3.642935, best loss: 2.566456 2025-01-16 01:27:41,783 - INFO - step 4904, loss: 3.868025, best loss: 2.566456 2025-01-16 01:27:41,933 - INFO - step 4905, loss: 4.049457, best loss: 2.566456 2025-01-16 01:27:42,083 - INFO - step 4906, loss: 3.603121, best loss: 2.566456 2025-01-16 01:27:42,233 - INFO - step 4907, loss: 3.826755, best loss: 2.566456 2025-01-16 01:27:42,383 - INFO - step 4908, loss: 3.665980, best loss: 2.566456 2025-01-16 01:27:42,533 - INFO - step 4909, loss: 3.622342, best 
loss: 2.566456 2025-01-16 01:27:42,684 - INFO - step 4910, loss: 3.459934, best loss: 2.566456 2025-01-16 01:27:42,834 - INFO - step 4911, loss: 3.760676, best loss: 2.566456 2025-01-16 01:27:42,984 - INFO - step 4912, loss: 3.443341, best loss: 2.566456 2025-01-16 01:27:43,134 - INFO - step 4913, loss: 3.151866, best loss: 2.566456 2025-01-16 01:27:43,285 - INFO - step 4914, loss: 3.404789, best loss: 2.566456 2025-01-16 01:27:43,435 - INFO - step 4915, loss: 3.438006, best loss: 2.566456 2025-01-16 01:27:43,585 - INFO - step 4916, loss: 3.682049, best loss: 2.566456 2025-01-16 01:27:43,735 - INFO - step 4917, loss: 2.785781, best loss: 2.566456 2025-01-16 01:27:43,885 - INFO - step 4918, loss: 3.406498, best loss: 2.566456 2025-01-16 01:27:44,035 - INFO - step 4919, loss: 3.619228, best loss: 2.566456 2025-01-16 01:27:44,185 - INFO - step 4920, loss: 3.591042, best loss: 2.566456 2025-01-16 01:27:44,335 - INFO - step 4921, loss: 3.417621, best loss: 2.566456 2025-01-16 01:27:44,485 - INFO - step 4922, loss: 3.498481, best loss: 2.566456 2025-01-16 01:27:44,635 - INFO - step 4923, loss: 3.532783, best loss: 2.566456 2025-01-16 01:27:44,786 - INFO - step 4924, loss: 3.275114, best loss: 2.566456 2025-01-16 01:27:44,936 - INFO - step 4925, loss: 3.610656, best loss: 2.566456 2025-01-16 01:27:45,086 - INFO - step 4926, loss: 3.430554, best loss: 2.566456 2025-01-16 01:27:45,236 - INFO - step 4927, loss: 3.535838, best loss: 2.566456 2025-01-16 01:27:45,387 - INFO - step 4928, loss: 3.201178, best loss: 2.566456 2025-01-16 01:27:45,537 - INFO - step 4929, loss: 3.231637, best loss: 2.566456 2025-01-16 01:27:45,687 - INFO - step 4930, loss: 3.232287, best loss: 2.566456 2025-01-16 01:27:45,837 - INFO - step 4931, loss: 3.139464, best loss: 2.566456 2025-01-16 01:27:45,987 - INFO - step 4932, loss: 3.277081, best loss: 2.566456 2025-01-16 01:27:46,138 - INFO - step 4933, loss: 3.270867, best loss: 2.566456 2025-01-16 01:27:46,288 - INFO - step 4934, loss: 3.179157, best 
loss: 2.566456 2025-01-16 01:27:46,438 - INFO - step 4935, loss: 2.836306, best loss: 2.566456 2025-01-16 01:27:46,588 - INFO - step 4936, loss: 2.899478, best loss: 2.566456 2025-01-16 01:27:46,738 - INFO - step 4937, loss: 2.721210, best loss: 2.566456 2025-01-16 01:27:46,889 - INFO - step 4938, loss: 3.451904, best loss: 2.566456 2025-01-16 01:27:47,039 - INFO - step 4939, loss: 3.722089, best loss: 2.566456 2025-01-16 01:27:47,189 - INFO - step 4940, loss: 3.668743, best loss: 2.566456 2025-01-16 01:27:47,339 - INFO - step 4941, loss: 3.826578, best loss: 2.566456 2025-01-16 01:27:47,489 - INFO - step 4942, loss: 3.850232, best loss: 2.566456 2025-01-16 01:27:47,639 - INFO - step 4943, loss: 3.493419, best loss: 2.566456 2025-01-16 01:27:47,789 - INFO - step 4944, loss: 3.615448, best loss: 2.566456 2025-01-16 01:27:47,940 - INFO - step 4945, loss: 3.798545, best loss: 2.566456 2025-01-16 01:27:48,090 - INFO - step 4946, loss: 3.451340, best loss: 2.566456 2025-01-16 01:27:48,240 - INFO - step 4947, loss: 3.041740, best loss: 2.566456 2025-01-16 01:27:48,390 - INFO - step 4948, loss: 3.390777, best loss: 2.566456 2025-01-16 01:27:48,540 - INFO - step 4949, loss: 3.229949, best loss: 2.566456 2025-01-16 01:27:48,690 - INFO - step 4950, loss: 3.804832, best loss: 2.566456 2025-01-16 01:27:48,840 - INFO - step 4951, loss: 3.656497, best loss: 2.566456 2025-01-16 01:27:48,990 - INFO - step 4952, loss: 3.664434, best loss: 2.566456 2025-01-16 01:27:49,141 - INFO - step 4953, loss: 3.598009, best loss: 2.566456 2025-01-16 01:27:49,291 - INFO - step 4954, loss: 3.740455, best loss: 2.566456 2025-01-16 01:27:49,441 - INFO - step 4955, loss: 3.280079, best loss: 2.566456 2025-01-16 01:27:49,591 - INFO - step 4956, loss: 3.766175, best loss: 2.566456 2025-01-16 01:27:49,741 - INFO - step 4957, loss: 3.565284, best loss: 2.566456 2025-01-16 01:27:49,891 - INFO - step 4958, loss: 3.753620, best loss: 2.566456 2025-01-16 01:27:50,041 - INFO - step 4959, loss: 3.544686, best 
loss: 2.566456 2025-01-16 01:27:50,192 - INFO - step 4960, loss: 3.402761, best loss: 2.566456 2025-01-16 01:27:50,342 - INFO - step 4961, loss: 3.877805, best loss: 2.566456 2025-01-16 01:27:50,492 - INFO - step 4962, loss: 3.246302, best loss: 2.566456 2025-01-16 01:27:50,642 - INFO - step 4963, loss: 3.588683, best loss: 2.566456 2025-01-16 01:27:50,792 - INFO - step 4964, loss: 3.690051, best loss: 2.566456 2025-01-16 01:27:50,943 - INFO - step 4965, loss: 3.703357, best loss: 2.566456 2025-01-16 01:27:51,093 - INFO - step 4966, loss: 3.612201, best loss: 2.566456 2025-01-16 01:27:51,243 - INFO - step 4967, loss: 3.411763, best loss: 2.566456 2025-01-16 01:27:51,394 - INFO - step 4968, loss: 3.357816, best loss: 2.566456 2025-01-16 01:27:51,544 - INFO - step 4969, loss: 3.488857, best loss: 2.566456 2025-01-16 01:27:51,694 - INFO - step 4970, loss: 3.051760, best loss: 2.566456 2025-01-16 01:27:51,844 - INFO - step 4971, loss: 3.825251, best loss: 2.566456 2025-01-16 01:27:51,994 - INFO - step 4972, loss: 2.914100, best loss: 2.566456 2025-01-16 01:27:52,144 - INFO - step 4973, loss: 2.967607, best loss: 2.566456 2025-01-16 01:27:52,294 - INFO - step 4974, loss: 3.417832, best loss: 2.566456 2025-01-16 01:27:52,444 - INFO - step 4975, loss: 3.427079, best loss: 2.566456 2025-01-16 01:27:52,594 - INFO - step 4976, loss: 3.391223, best loss: 2.566456 2025-01-16 01:27:52,744 - INFO - step 4977, loss: 3.132837, best loss: 2.566456 2025-01-16 01:27:52,894 - INFO - step 4978, loss: 3.375941, best loss: 2.566456 2025-01-16 01:27:53,045 - INFO - step 4979, loss: 3.397157, best loss: 2.566456 2025-01-16 01:27:53,195 - INFO - step 4980, loss: 3.244840, best loss: 2.566456 2025-01-16 01:27:53,345 - INFO - step 4981, loss: 3.125160, best loss: 2.566456 2025-01-16 01:27:53,495 - INFO - step 4982, loss: 3.480353, best loss: 2.566456 2025-01-16 01:27:53,645 - INFO - step 4983, loss: 3.393072, best loss: 2.566456 2025-01-16 01:27:53,795 - INFO - step 4984, loss: 3.368954, best 
loss: 2.566456 2025-01-16 01:27:53,944 - INFO - step 4985, loss: 3.060529, best loss: 2.566456 2025-01-16 01:27:54,094 - INFO - step 4986, loss: 3.336149, best loss: 2.566456 2025-01-16 01:27:54,244 - INFO - step 4987, loss: 3.672102, best loss: 2.566456 2025-01-16 01:27:54,395 - INFO - step 4988, loss: 3.225335, best loss: 2.566456 2025-01-16 01:27:54,545 - INFO - step 4989, loss: 3.529368, best loss: 2.566456 2025-01-16 01:27:54,695 - INFO - step 4990, loss: 3.665485, best loss: 2.566456 2025-01-16 01:27:54,845 - INFO - step 4991, loss: 3.638631, best loss: 2.566456 2025-01-16 01:27:54,995 - INFO - step 4992, loss: 3.624660, best loss: 2.566456 2025-01-16 01:27:55,145 - INFO - step 4993, loss: 3.516220, best loss: 2.566456 2025-01-16 01:27:55,295 - INFO - step 4994, loss: 3.514585, best loss: 2.566456 2025-01-16 01:27:55,445 - INFO - step 4995, loss: 3.284890, best loss: 2.566456 2025-01-16 01:27:55,595 - INFO - step 4996, loss: 3.990369, best loss: 2.566456 2025-01-16 01:27:55,745 - INFO - step 4997, loss: 3.581740, best loss: 2.566456 2025-01-16 01:27:55,896 - INFO - step 4998, loss: 3.847762, best loss: 2.566456 2025-01-16 01:27:56,046 - INFO - step 4999, loss: 2.972021, best loss: 2.566456 2025-01-16 01:27:56,196 - INFO - step 5000, loss: 3.157104, best loss: 2.566456 2025-01-16 01:27:56,346 - INFO - step 5001, loss: 3.520429, best loss: 2.566456 2025-01-16 01:27:56,496 - INFO - step 5002, loss: 3.516818, best loss: 2.566456 2025-01-16 01:27:56,646 - INFO - step 5003, loss: 3.406629, best loss: 2.566456 2025-01-16 01:27:56,797 - INFO - step 5004, loss: 3.475240, best loss: 2.566456 2025-01-16 01:27:56,947 - INFO - step 5005, loss: 3.323434, best loss: 2.566456 2025-01-16 01:27:57,097 - INFO - step 5006, loss: 3.407680, best loss: 2.566456 2025-01-16 01:27:57,247 - INFO - step 5007, loss: 3.494908, best loss: 2.566456 2025-01-16 01:27:57,397 - INFO - step 5008, loss: 3.061296, best loss: 2.566456 2025-01-16 01:27:57,547 - INFO - step 5009, loss: 3.312133, best 
loss: 2.566456 2025-01-16 01:27:57,697 - INFO - step 5010, loss: 3.448854, best loss: 2.566456 2025-01-16 01:27:57,847 - INFO - step 5011, loss: 3.572848, best loss: 2.566456 2025-01-16 01:27:57,997 - INFO - step 5012, loss: 3.449834, best loss: 2.566456 2025-01-16 01:27:58,147 - INFO - step 5013, loss: 3.654511, best loss: 2.566456 2025-01-16 01:27:58,297 - INFO - step 5014, loss: 3.480624, best loss: 2.566456 2025-01-16 01:27:58,447 - INFO - step 5015, loss: 3.071670, best loss: 2.566456 2025-01-16 01:27:58,597 - INFO - step 5016, loss: 3.480896, best loss: 2.566456 2025-01-16 01:27:58,747 - INFO - step 5017, loss: 2.903485, best loss: 2.566456 2025-01-16 01:27:58,897 - INFO - step 5018, loss: 3.189316, best loss: 2.566456 2025-01-16 01:27:59,047 - INFO - step 5019, loss: 3.328232, best loss: 2.566456 2025-01-16 01:27:59,197 - INFO - step 5020, loss: 3.323221, best loss: 2.566456 2025-01-16 01:27:59,347 - INFO - step 5021, loss: 3.245149, best loss: 2.566456 2025-01-16 01:27:59,497 - INFO - step 5022, loss: 3.477036, best loss: 2.566456 2025-01-16 01:27:59,648 - INFO - step 5023, loss: 3.674203, best loss: 2.566456 2025-01-16 01:27:59,798 - INFO - step 5024, loss: 3.487640, best loss: 2.566456 2025-01-16 01:27:59,948 - INFO - step 5025, loss: 3.676474, best loss: 2.566456 2025-01-16 01:28:00,098 - INFO - step 5026, loss: 3.327402, best loss: 2.566456 2025-01-16 01:28:00,248 - INFO - step 5027, loss: 3.471218, best loss: 2.566456 2025-01-16 01:28:00,398 - INFO - step 5028, loss: 3.357414, best loss: 2.566456 2025-01-16 01:28:00,549 - INFO - step 5029, loss: 2.984619, best loss: 2.566456 2025-01-16 01:28:00,699 - INFO - step 5030, loss: 3.673954, best loss: 2.566456 2025-01-16 01:28:00,849 - INFO - step 5031, loss: 3.556692, best loss: 2.566456 2025-01-16 01:28:00,999 - INFO - step 5032, loss: 3.291231, best loss: 2.566456 2025-01-16 01:28:01,149 - INFO - step 5033, loss: 3.185557, best loss: 2.566456 2025-01-16 01:28:01,300 - INFO - step 5034, loss: 3.044384, best 
loss: 2.566456 2025-01-16 01:28:01,450 - INFO - step 5035, loss: 3.262761, best loss: 2.566456 2025-01-16 01:28:01,600 - INFO - step 5036, loss: 3.144608, best loss: 2.566456 2025-01-16 01:28:01,750 - INFO - step 5037, loss: 3.086901, best loss: 2.566456 2025-01-16 01:28:01,901 - INFO - step 5038, loss: 3.654988, best loss: 2.566456 2025-01-16 01:28:02,050 - INFO - step 5039, loss: 3.655716, best loss: 2.566456 2025-01-16 01:28:02,201 - INFO - step 5040, loss: 3.341952, best loss: 2.566456 2025-01-16 01:28:02,351 - INFO - step 5041, loss: 3.667490, best loss: 2.566456 2025-01-16 01:28:02,501 - INFO - step 5042, loss: 3.318579, best loss: 2.566456 2025-01-16 01:28:02,651 - INFO - step 5043, loss: 3.664371, best loss: 2.566456 2025-01-16 01:28:02,801 - INFO - step 5044, loss: 3.722802, best loss: 2.566456 2025-01-16 01:28:02,951 - INFO - step 5045, loss: 3.786236, best loss: 2.566456 2025-01-16 01:28:03,101 - INFO - step 5046, loss: 3.864201, best loss: 2.566456 2025-01-16 01:28:03,251 - INFO - step 5047, loss: 3.566719, best loss: 2.566456 2025-01-16 01:28:03,401 - INFO - step 5048, loss: 3.771338, best loss: 2.566456 2025-01-16 01:28:03,551 - INFO - step 5049, loss: 3.573254, best loss: 2.566456 2025-01-16 01:28:03,702 - INFO - step 5050, loss: 3.355387, best loss: 2.566456 2025-01-16 01:28:03,852 - INFO - step 5051, loss: 3.835485, best loss: 2.566456 2025-01-16 01:28:04,002 - INFO - step 5052, loss: 3.743083, best loss: 2.566456 2025-01-16 01:28:04,152 - INFO - step 5053, loss: 3.756222, best loss: 2.566456 2025-01-16 01:28:04,302 - INFO - step 5054, loss: 3.483458, best loss: 2.566456 2025-01-16 01:28:04,452 - INFO - step 5055, loss: 3.662590, best loss: 2.566456 2025-01-16 01:28:04,602 - INFO - step 5056, loss: 3.533283, best loss: 2.566456 2025-01-16 01:28:04,752 - INFO - step 5057, loss: 3.348400, best loss: 2.566456 2025-01-16 01:28:04,903 - INFO - step 5058, loss: 3.311568, best loss: 2.566456 2025-01-16 01:28:05,052 - INFO - step 5059, loss: 3.781823, best 
loss: 2.566456 2025-01-16 01:28:05,202 - INFO - step 5060, loss: 3.617594, best loss: 2.566456 2025-01-16 01:28:05,351 - INFO - step 5061, loss: 3.844075, best loss: 2.566456 2025-01-16 01:28:05,501 - INFO - step 5062, loss: 3.717084, best loss: 2.566456 2025-01-16 01:28:05,651 - INFO - step 5063, loss: 3.378835, best loss: 2.566456 2025-01-16 01:28:05,800 - INFO - step 5064, loss: 3.719872, best loss: 2.566456 2025-01-16 01:28:05,950 - INFO - step 5065, loss: 3.383497, best loss: 2.566456 2025-01-16 01:28:06,100 - INFO - step 5066, loss: 3.666770, best loss: 2.566456 2025-01-16 01:28:06,250 - INFO - step 5067, loss: 3.340529, best loss: 2.566456 2025-01-16 01:28:06,400 - INFO - step 5068, loss: 3.419564, best loss: 2.566456 2025-01-16 01:28:06,550 - INFO - step 5069, loss: 3.491773, best loss: 2.566456 2025-01-16 01:28:06,700 - INFO - step 5070, loss: 3.491297, best loss: 2.566456 2025-01-16 01:28:06,850 - INFO - step 5071, loss: 3.391141, best loss: 2.566456 2025-01-16 01:28:07,000 - INFO - step 5072, loss: 3.566476, best loss: 2.566456 2025-01-16 01:28:07,151 - INFO - step 5073, loss: 3.188963, best loss: 2.566456 2025-01-16 01:28:07,302 - INFO - step 5074, loss: 2.895734, best loss: 2.566456 2025-01-16 01:28:07,452 - INFO - step 5075, loss: 3.051850, best loss: 2.566456 2025-01-16 01:28:07,603 - INFO - step 5076, loss: 3.302711, best loss: 2.566456 2025-01-16 01:28:07,753 - INFO - step 5077, loss: 3.810855, best loss: 2.566456 2025-01-16 01:28:07,904 - INFO - step 5078, loss: 3.441712, best loss: 2.566456 2025-01-16 01:28:08,053 - INFO - step 5079, loss: 3.051173, best loss: 2.566456 2025-01-16 01:28:08,204 - INFO - step 5080, loss: 3.575136, best loss: 2.566456 2025-01-16 01:28:08,354 - INFO - step 5081, loss: 3.305875, best loss: 2.566456 2025-01-16 01:28:08,504 - INFO - step 5082, loss: 3.896291, best loss: 2.566456 2025-01-16 01:28:08,654 - INFO - step 5083, loss: 3.243344, best loss: 2.566456 2025-01-16 01:28:08,804 - INFO - step 5084, loss: 3.571694, best 
loss: 2.566456 2025-01-16 01:28:08,954 - INFO - step 5085, loss: 3.356823, best loss: 2.566456 2025-01-16 01:28:09,104 - INFO - step 5086, loss: 3.759090, best loss: 2.566456 2025-01-16 01:28:09,254 - INFO - step 5087, loss: 3.479013, best loss: 2.566456 2025-01-16 01:28:09,404 - INFO - step 5088, loss: 3.303147, best loss: 2.566456 2025-01-16 01:28:09,554 - INFO - step 5089, loss: 3.694974, best loss: 2.566456 2025-01-16 01:28:09,704 - INFO - step 5090, loss: 3.416507, best loss: 2.566456 2025-01-16 01:28:09,855 - INFO - step 5091, loss: 3.279708, best loss: 2.566456 2025-01-16 01:28:10,005 - INFO - step 5092, loss: 3.868636, best loss: 2.566456 2025-01-16 01:28:10,155 - INFO - step 5093, loss: 3.407322, best loss: 2.566456 2025-01-16 01:28:10,305 - INFO - step 5094, loss: 3.028684, best loss: 2.566456 2025-01-16 01:28:10,455 - INFO - step 5095, loss: 3.429035, best loss: 2.566456 2025-01-16 01:28:10,605 - INFO - step 5096, loss: 3.667222, best loss: 2.566456 2025-01-16 01:28:10,756 - INFO - step 5097, loss: 3.569049, best loss: 2.566456 2025-01-16 01:28:10,906 - INFO - step 5098, loss: 3.381258, best loss: 2.566456 2025-01-16 01:28:11,056 - INFO - step 5099, loss: 3.276777, best loss: 2.566456 2025-01-16 01:28:11,206 - INFO - step 5100, loss: 3.828140, best loss: 2.566456 2025-01-16 01:28:11,356 - INFO - step 5101, loss: 3.672579, best loss: 2.566456 2025-01-16 01:28:11,506 - INFO - step 5102, loss: 3.617188, best loss: 2.566456 2025-01-16 01:28:11,657 - INFO - step 5103, loss: 3.279202, best loss: 2.566456 2025-01-16 01:28:11,807 - INFO - step 5104, loss: 3.588642, best loss: 2.566456 2025-01-16 01:28:11,957 - INFO - step 5105, loss: 3.448558, best loss: 2.566456 2025-01-16 01:28:12,107 - INFO - step 5106, loss: 3.084469, best loss: 2.566456 2025-01-16 01:28:12,258 - INFO - step 5107, loss: 3.505657, best loss: 2.566456 2025-01-16 01:28:12,408 - INFO - step 5108, loss: 3.240993, best loss: 2.566456 2025-01-16 01:28:12,558 - INFO - step 5109, loss: 3.491367, best 
loss: 2.566456 2025-01-16 01:28:12,709 - INFO - step 5110, loss: 3.233792, best loss: 2.566456 2025-01-16 01:28:12,859 - INFO - step 5111, loss: 3.411888, best loss: 2.566456 2025-01-16 01:28:13,009 - INFO - step 5112, loss: 3.280891, best loss: 2.566456 2025-01-16 01:28:13,159 - INFO - step 5113, loss: 3.350719, best loss: 2.566456 2025-01-16 01:28:13,309 - INFO - step 5114, loss: 3.667498, best loss: 2.566456 2025-01-16 01:28:13,460 - INFO - step 5115, loss: 3.547449, best loss: 2.566456 2025-01-16 01:28:13,610 - INFO - step 5116, loss: 3.689135, best loss: 2.566456 2025-01-16 01:28:13,761 - INFO - step 5117, loss: 3.226166, best loss: 2.566456 2025-01-16 01:28:13,911 - INFO - step 5118, loss: 3.540220, best loss: 2.566456 2025-01-16 01:28:14,061 - INFO - step 5119, loss: 3.546127, best loss: 2.566456 2025-01-16 01:28:14,211 - INFO - step 5120, loss: 3.222858, best loss: 2.566456 2025-01-16 01:28:14,361 - INFO - step 5121, loss: 3.079957, best loss: 2.566456 2025-01-16 01:28:14,511 - INFO - step 5122, loss: 3.153466, best loss: 2.566456 2025-01-16 01:28:14,662 - INFO - step 5123, loss: 3.333860, best loss: 2.566456 2025-01-16 01:28:14,812 - INFO - step 5124, loss: 3.396390, best loss: 2.566456 2025-01-16 01:28:14,962 - INFO - step 5125, loss: 3.602847, best loss: 2.566456 2025-01-16 01:28:15,112 - INFO - step 5126, loss: 3.845856, best loss: 2.566456 2025-01-16 01:28:15,263 - INFO - step 5127, loss: 3.682137, best loss: 2.566456 2025-01-16 01:28:15,413 - INFO - step 5128, loss: 3.661221, best loss: 2.566456 2025-01-16 01:28:15,563 - INFO - step 5129, loss: 3.615411, best loss: 2.566456 2025-01-16 01:28:15,713 - INFO - step 5130, loss: 3.647068, best loss: 2.566456 2025-01-16 01:28:15,863 - INFO - step 5131, loss: 3.158679, best loss: 2.566456 2025-01-16 01:28:16,013 - INFO - step 5132, loss: 3.619209, best loss: 2.566456 2025-01-16 01:28:16,163 - INFO - step 5133, loss: 3.840662, best loss: 2.566456 2025-01-16 01:28:16,314 - INFO - step 5134, loss: 3.606169, best 
loss: 2.566456 2025-01-16 01:28:16,464 - INFO - step 5135, loss: 3.574733, best loss: 2.566456 2025-01-16 01:28:16,614 - INFO - step 5136, loss: 3.619838, best loss: 2.566456 2025-01-16 01:28:16,764 - INFO - step 5137, loss: 3.253755, best loss: 2.566456 2025-01-16 01:28:20,263 - INFO - step 5138, loss: 2.452848, best loss: 2.452848 2025-01-16 01:28:20,425 - INFO - step 5139, loss: 3.470212, best loss: 2.452848 2025-01-16 01:28:20,581 - INFO - step 5140, loss: 3.569230, best loss: 2.452848 2025-01-16 01:28:20,731 - INFO - step 5141, loss: 3.550681, best loss: 2.452848 2025-01-16 01:28:20,881 - INFO - step 5142, loss: 3.612350, best loss: 2.452848 2025-01-16 01:28:21,031 - INFO - step 5143, loss: 3.267132, best loss: 2.452848 2025-01-16 01:28:21,182 - INFO - step 5144, loss: 3.466210, best loss: 2.452848 2025-01-16 01:28:21,332 - INFO - step 5145, loss: 3.525701, best loss: 2.452848 2025-01-16 01:28:21,482 - INFO - step 5146, loss: 3.333575, best loss: 2.452848 2025-01-16 01:28:21,632 - INFO - step 5147, loss: 3.448494, best loss: 2.452848 2025-01-16 01:28:21,782 - INFO - step 5148, loss: 3.297699, best loss: 2.452848 2025-01-16 01:28:21,933 - INFO - step 5149, loss: 3.225855, best loss: 2.452848 2025-01-16 01:28:22,083 - INFO - step 5150, loss: 3.530223, best loss: 2.452848 2025-01-16 01:28:22,233 - INFO - step 5151, loss: 3.225225, best loss: 2.452848 2025-01-16 01:28:22,383 - INFO - step 5152, loss: 3.486215, best loss: 2.452848 2025-01-16 01:28:22,533 - INFO - step 5153, loss: 3.712677, best loss: 2.452848 2025-01-16 01:28:22,683 - INFO - step 5154, loss: 3.343210, best loss: 2.452848 2025-01-16 01:28:22,833 - INFO - step 5155, loss: 3.040646, best loss: 2.452848 2025-01-16 01:28:22,983 - INFO - step 5156, loss: 3.493909, best loss: 2.452848 2025-01-16 01:28:23,133 - INFO - step 5157, loss: 3.620119, best loss: 2.452848 2025-01-16 01:28:23,284 - INFO - step 5158, loss: 3.671489, best loss: 2.452848 2025-01-16 01:28:23,434 - INFO - step 5159, loss: 3.529735, best 
loss: 2.452848 2025-01-16 01:28:23,584 - INFO - step 5160, loss: 3.648457, best loss: 2.452848 2025-01-16 01:28:23,735 - INFO - step 5161, loss: 3.500656, best loss: 2.452848 2025-01-16 01:28:23,885 - INFO - step 5162, loss: 3.758168, best loss: 2.452848 2025-01-16 01:28:24,035 - INFO - step 5163, loss: 3.312082, best loss: 2.452848 2025-01-16 01:28:24,185 - INFO - step 5164, loss: 3.453286, best loss: 2.452848 2025-01-16 01:28:24,335 - INFO - step 5165, loss: 3.591757, best loss: 2.452848 2025-01-16 01:28:24,486 - INFO - step 5166, loss: 3.650111, best loss: 2.452848 2025-01-16 01:28:24,636 - INFO - step 5167, loss: 3.529320, best loss: 2.452848 2025-01-16 01:28:24,786 - INFO - step 5168, loss: 3.322725, best loss: 2.452848 2025-01-16 01:28:24,937 - INFO - step 5169, loss: 3.374938, best loss: 2.452848 2025-01-16 01:28:25,087 - INFO - step 5170, loss: 3.626514, best loss: 2.452848 2025-01-16 01:28:25,237 - INFO - step 5171, loss: 3.864795, best loss: 2.452848 2025-01-16 01:28:25,388 - INFO - step 5172, loss: 3.564004, best loss: 2.452848 2025-01-16 01:28:25,538 - INFO - step 5173, loss: 3.787478, best loss: 2.452848 2025-01-16 01:28:25,688 - INFO - step 5174, loss: 3.861140, best loss: 2.452848 2025-01-16 01:28:25,838 - INFO - step 5175, loss: 3.720180, best loss: 2.452848 2025-01-16 01:28:25,989 - INFO - step 5176, loss: 3.925578, best loss: 2.452848 2025-01-16 01:28:26,139 - INFO - step 5177, loss: 3.704222, best loss: 2.452848 2025-01-16 01:28:26,289 - INFO - step 5178, loss: 3.219442, best loss: 2.452848 2025-01-16 01:28:26,439 - INFO - step 5179, loss: 3.709661, best loss: 2.452848 2025-01-16 01:28:26,589 - INFO - step 5180, loss: 3.659219, best loss: 2.452848 2025-01-16 01:28:26,739 - INFO - step 5181, loss: 3.793583, best loss: 2.452848 2025-01-16 01:28:26,890 - INFO - step 5182, loss: 3.333956, best loss: 2.452848 2025-01-16 01:28:27,040 - INFO - step 5183, loss: 3.555697, best loss: 2.452848 2025-01-16 01:28:27,190 - INFO - step 5184, loss: 3.535916, best 
loss: 2.452848 2025-01-16 01:28:27,340 - INFO - step 5185, loss: 3.426119, best loss: 2.452848 2025-01-16 01:28:27,490 - INFO - step 5186, loss: 3.637030, best loss: 2.452848 2025-01-16 01:28:27,640 - INFO - step 5187, loss: 3.300454, best loss: 2.452848 2025-01-16 01:28:27,790 - INFO - step 5188, loss: 3.271489, best loss: 2.452848 2025-01-16 01:28:27,940 - INFO - step 5189, loss: 3.749176, best loss: 2.452848 2025-01-16 01:28:28,091 - INFO - step 5190, loss: 3.615719, best loss: 2.452848 2025-01-16 01:28:28,241 - INFO - step 5191, loss: 3.745974, best loss: 2.452848 2025-01-16 01:28:28,391 - INFO - step 5192, loss: 3.577029, best loss: 2.452848 2025-01-16 01:28:28,541 - INFO - step 5193, loss: 3.875916, best loss: 2.452848 2025-01-16 01:28:28,691 - INFO - step 5194, loss: 3.735461, best loss: 2.452848 2025-01-16 01:28:28,842 - INFO - step 5195, loss: 3.405066, best loss: 2.452848 2025-01-16 01:28:28,991 - INFO - step 5196, loss: 3.429747, best loss: 2.452848 2025-01-16 01:28:29,142 - INFO - step 5197, loss: 3.786178, best loss: 2.452848 2025-01-16 01:28:29,292 - INFO - step 5198, loss: 3.405446, best loss: 2.452848 2025-01-16 01:28:29,442 - INFO - step 5199, loss: 3.185072, best loss: 2.452848 2025-01-16 01:28:29,593 - INFO - step 5200, loss: 3.665268, best loss: 2.452848 2025-01-16 01:28:29,743 - INFO - step 5201, loss: 3.602825, best loss: 2.452848 2025-01-16 01:28:29,893 - INFO - step 5202, loss: 3.547834, best loss: 2.452848 2025-01-16 01:28:30,043 - INFO - step 5203, loss: 3.184071, best loss: 2.452848 2025-01-16 01:28:30,194 - INFO - step 5204, loss: 3.057830, best loss: 2.452848 2025-01-16 01:28:30,344 - INFO - step 5205, loss: 2.970858, best loss: 2.452848 2025-01-16 01:28:30,494 - INFO - step 5206, loss: 3.121907, best loss: 2.452848 2025-01-16 01:28:30,644 - INFO - step 5207, loss: 3.249121, best loss: 2.452848 2025-01-16 01:28:30,794 - INFO - step 5208, loss: 3.516524, best loss: 2.452848 2025-01-16 01:28:30,944 - INFO - step 5209, loss: 3.266865, best 
loss: 2.452848 2025-01-16 01:28:31,094 - INFO - step 5210, loss: 3.299036, best loss: 2.452848 2025-01-16 01:28:31,244 - INFO - step 5211, loss: 3.444606, best loss: 2.452848 2025-01-16 01:28:31,395 - INFO - step 5212, loss: 3.475991, best loss: 2.452848 2025-01-16 01:28:31,545 - INFO - step 5213, loss: 3.372180, best loss: 2.452848 2025-01-16 01:28:31,695 - INFO - step 5214, loss: 3.538831, best loss: 2.452848 2025-01-16 01:28:31,846 - INFO - step 5215, loss: 3.691613, best loss: 2.452848 2025-01-16 01:28:31,996 - INFO - step 5216, loss: 3.183352, best loss: 2.452848 2025-01-16 01:28:32,146 - INFO - step 5217, loss: 3.121505, best loss: 2.452848 2025-01-16 01:28:32,296 - INFO - step 5218, loss: 3.358432, best loss: 2.452848 2025-01-16 01:28:32,447 - INFO - step 5219, loss: 3.509821, best loss: 2.452848 2025-01-16 01:28:32,597 - INFO - step 5220, loss: 3.125167, best loss: 2.452848 2025-01-16 01:28:32,747 - INFO - step 5221, loss: 3.144304, best loss: 2.452848 2025-01-16 01:28:32,897 - INFO - step 5222, loss: 3.411191, best loss: 2.452848 2025-01-16 01:28:33,048 - INFO - step 5223, loss: 3.369254, best loss: 2.452848 2025-01-16 01:28:33,198 - INFO - step 5224, loss: 3.141597, best loss: 2.452848 2025-01-16 01:28:33,348 - INFO - step 5225, loss: 3.485494, best loss: 2.452848 2025-01-16 01:28:33,498 - INFO - step 5226, loss: 3.225716, best loss: 2.452848 2025-01-16 01:28:33,649 - INFO - step 5227, loss: 3.146984, best loss: 2.452848 2025-01-16 01:28:33,799 - INFO - step 5228, loss: 3.104263, best loss: 2.452848 2025-01-16 01:28:33,949 - INFO - step 5229, loss: 3.357046, best loss: 2.452848 2025-01-16 01:28:34,099 - INFO - step 5230, loss: 3.011864, best loss: 2.452848 2025-01-16 01:28:34,250 - INFO - step 5231, loss: 3.195969, best loss: 2.452848 2025-01-16 01:28:34,400 - INFO - step 5232, loss: 3.025136, best loss: 2.452848 2025-01-16 01:28:34,550 - INFO - step 5233, loss: 3.495244, best loss: 2.452848 2025-01-16 01:28:34,700 - INFO - step 5234, loss: 3.758842, best 
loss: 2.452848 2025-01-16 01:28:34,850 - INFO - step 5235, loss: 3.902649, best loss: 2.452848 2025-01-16 01:28:35,000 - INFO - step 5236, loss: 3.485417, best loss: 2.452848 2025-01-16 01:28:35,151 - INFO - step 5237, loss: 3.673267, best loss: 2.452848 2025-01-16 01:28:35,301 - INFO - step 5238, loss: 3.497793, best loss: 2.452848 2025-01-16 01:28:35,451 - INFO - step 5239, loss: 3.387381, best loss: 2.452848 2025-01-16 01:28:35,601 - INFO - step 5240, loss: 3.258116, best loss: 2.452848 2025-01-16 01:28:35,751 - INFO - step 5241, loss: 3.572742, best loss: 2.452848 2025-01-16 01:28:35,901 - INFO - step 5242, loss: 3.296070, best loss: 2.452848 2025-01-16 01:28:36,051 - INFO - step 5243, loss: 3.054264, best loss: 2.452848 2025-01-16 01:28:36,201 - INFO - step 5244, loss: 3.370730, best loss: 2.452848 2025-01-16 01:28:36,351 - INFO - step 5245, loss: 3.313011, best loss: 2.452848 2025-01-16 01:28:36,502 - INFO - step 5246, loss: 3.528442, best loss: 2.452848 2025-01-16 01:28:36,652 - INFO - step 5247, loss: 2.654805, best loss: 2.452848 2025-01-16 01:28:36,802 - INFO - step 5248, loss: 3.272692, best loss: 2.452848 2025-01-16 01:28:36,952 - INFO - step 5249, loss: 3.497189, best loss: 2.452848 2025-01-16 01:28:37,102 - INFO - step 5250, loss: 3.417922, best loss: 2.452848 2025-01-16 01:28:37,252 - INFO - step 5251, loss: 3.331996, best loss: 2.452848 2025-01-16 01:28:37,402 - INFO - step 5252, loss: 3.292616, best loss: 2.452848 2025-01-16 01:28:37,553 - INFO - step 5253, loss: 3.375994, best loss: 2.452848 2025-01-16 01:28:37,703 - INFO - step 5254, loss: 3.126009, best loss: 2.452848 2025-01-16 01:28:37,853 - INFO - step 5255, loss: 3.401362, best loss: 2.452848 2025-01-16 01:28:38,003 - INFO - step 5256, loss: 3.375029, best loss: 2.452848 2025-01-16 01:28:38,153 - INFO - step 5257, loss: 3.488477, best loss: 2.452848 2025-01-16 01:28:38,303 - INFO - step 5258, loss: 3.093227, best loss: 2.452848 2025-01-16 01:28:38,454 - INFO - step 5259, loss: 3.185306, best 
loss: 2.452848 2025-01-16 01:28:38,604 - INFO - step 5260, loss: 3.119452, best loss: 2.452848 2025-01-16 01:28:38,754 - INFO - step 5261, loss: 3.071327, best loss: 2.452848 2025-01-16 01:28:38,904 - INFO - step 5262, loss: 3.169322, best loss: 2.452848 2025-01-16 01:28:39,054 - INFO - step 5263, loss: 3.154528, best loss: 2.452848 2025-01-16 01:28:39,205 - INFO - step 5264, loss: 3.103030, best loss: 2.452848 2025-01-16 01:28:39,355 - INFO - step 5265, loss: 2.820557, best loss: 2.452848 2025-01-16 01:28:39,506 - INFO - step 5266, loss: 2.779764, best loss: 2.452848 2025-01-16 01:28:39,656 - INFO - step 5267, loss: 2.653667, best loss: 2.452848 2025-01-16 01:28:39,806 - INFO - step 5268, loss: 3.342255, best loss: 2.452848 2025-01-16 01:28:39,956 - INFO - step 5269, loss: 3.526480, best loss: 2.452848 2025-01-16 01:28:40,106 - INFO - step 5270, loss: 3.556479, best loss: 2.452848 2025-01-16 01:28:40,256 - INFO - step 5271, loss: 3.636606, best loss: 2.452848 2025-01-16 01:28:40,407 - INFO - step 5272, loss: 3.623298, best loss: 2.452848 2025-01-16 01:28:40,557 - INFO - step 5273, loss: 3.370352, best loss: 2.452848 2025-01-16 01:28:40,707 - INFO - step 5274, loss: 3.465372, best loss: 2.452848 2025-01-16 01:28:40,858 - INFO - step 5275, loss: 3.720452, best loss: 2.452848 2025-01-16 01:28:41,008 - INFO - step 5276, loss: 3.360745, best loss: 2.452848 2025-01-16 01:28:41,158 - INFO - step 5277, loss: 2.854922, best loss: 2.452848 2025-01-16 01:28:41,309 - INFO - step 5278, loss: 3.242965, best loss: 2.452848 2025-01-16 01:28:41,459 - INFO - step 5279, loss: 3.057472, best loss: 2.452848 2025-01-16 01:28:41,609 - INFO - step 5280, loss: 3.597272, best loss: 2.452848 2025-01-16 01:28:41,759 - INFO - step 5281, loss: 3.440551, best loss: 2.452848 2025-01-16 01:28:41,909 - INFO - step 5282, loss: 3.551344, best loss: 2.452848 2025-01-16 01:28:42,059 - INFO - step 5283, loss: 3.403815, best loss: 2.452848 2025-01-16 01:28:42,210 - INFO - step 5284, loss: 3.539798, best 
loss: 2.452848 2025-01-16 01:28:42,360 - INFO - step 5285, loss: 3.126928, best loss: 2.452848 2025-01-16 01:28:42,510 - INFO - step 5286, loss: 3.592497, best loss: 2.452848 2025-01-16 01:28:42,660 - INFO - step 5287, loss: 3.402873, best loss: 2.452848 2025-01-16 01:28:42,810 - INFO - step 5288, loss: 3.648571, best loss: 2.452848 2025-01-16 01:28:42,961 - INFO - step 5289, loss: 3.467440, best loss: 2.452848 2025-01-16 01:28:43,111 - INFO - step 5290, loss: 3.352338, best loss: 2.452848 2025-01-16 01:28:43,261 - INFO - step 5291, loss: 3.776366, best loss: 2.452848 2025-01-16 01:28:43,412 - INFO - step 5292, loss: 3.185426, best loss: 2.452848 2025-01-16 01:28:43,562 - INFO - step 5293, loss: 3.504179, best loss: 2.452848 2025-01-16 01:28:43,712 - INFO - step 5294, loss: 3.544025, best loss: 2.452848 2025-01-16 01:28:43,862 - INFO - step 5295, loss: 3.508423, best loss: 2.452848 2025-01-16 01:28:44,012 - INFO - step 5296, loss: 3.414947, best loss: 2.452848 2025-01-16 01:28:44,163 - INFO - step 5297, loss: 3.256415, best loss: 2.452848 2025-01-16 01:28:44,313 - INFO - step 5298, loss: 3.259851, best loss: 2.452848 2025-01-16 01:28:44,464 - INFO - step 5299, loss: 3.347663, best loss: 2.452848 2025-01-16 01:28:44,614 - INFO - step 5300, loss: 2.984367, best loss: 2.452848 2025-01-16 01:28:44,764 - INFO - step 5301, loss: 3.720889, best loss: 2.452848 2025-01-16 01:28:44,914 - INFO - step 5302, loss: 2.848598, best loss: 2.452848 2025-01-16 01:28:45,065 - INFO - step 5303, loss: 2.918159, best loss: 2.452848 2025-01-16 01:28:45,215 - INFO - step 5304, loss: 3.351274, best loss: 2.452848 2025-01-16 01:28:45,365 - INFO - step 5305, loss: 3.358355, best loss: 2.452848 2025-01-16 01:28:45,515 - INFO - step 5306, loss: 3.222726, best loss: 2.452848 2025-01-16 01:28:45,666 - INFO - step 5307, loss: 2.967048, best loss: 2.452848 2025-01-16 01:28:45,816 - INFO - step 5308, loss: 3.240116, best loss: 2.452848 2025-01-16 01:28:45,966 - INFO - step 5309, loss: 3.307688, best 
loss: 2.452848 2025-01-16 01:28:46,116 - INFO - step 5310, loss: 3.119893, best loss: 2.452848 2025-01-16 01:28:46,266 - INFO - step 5311, loss: 3.003650, best loss: 2.452848 2025-01-16 01:28:46,416 - INFO - step 5312, loss: 3.345570, best loss: 2.452848 2025-01-16 01:28:46,566 - INFO - step 5313, loss: 3.234353, best loss: 2.452848 2025-01-16 01:28:46,717 - INFO - step 5314, loss: 3.216116, best loss: 2.452848 2025-01-16 01:28:46,867 - INFO - step 5315, loss: 2.967983, best loss: 2.452848 2025-01-16 01:28:47,017 - INFO - step 5316, loss: 3.198135, best loss: 2.452848 2025-01-16 01:28:47,167 - INFO - step 5317, loss: 3.506109, best loss: 2.452848 2025-01-16 01:28:47,317 - INFO - step 5318, loss: 3.125142, best loss: 2.452848 2025-01-16 01:28:47,468 - INFO - step 5319, loss: 3.393137, best loss: 2.452848 2025-01-16 01:28:47,618 - INFO - step 5320, loss: 3.559620, best loss: 2.452848 2025-01-16 01:28:47,768 - INFO - step 5321, loss: 3.465010, best loss: 2.452848 2025-01-16 01:28:47,918 - INFO - step 5322, loss: 3.503702, best loss: 2.452848 2025-01-16 01:28:48,068 - INFO - step 5323, loss: 3.395069, best loss: 2.452848 2025-01-16 01:28:48,218 - INFO - step 5324, loss: 3.328456, best loss: 2.452848 2025-01-16 01:28:48,368 - INFO - step 5325, loss: 3.134883, best loss: 2.452848 2025-01-16 01:28:48,518 - INFO - step 5326, loss: 3.832713, best loss: 2.452848 2025-01-16 01:28:48,669 - INFO - step 5327, loss: 3.401427, best loss: 2.452848 2025-01-16 01:28:48,819 - INFO - step 5328, loss: 3.692299, best loss: 2.452848 2025-01-16 01:28:48,969 - INFO - step 5329, loss: 2.864003, best loss: 2.452848 2025-01-16 01:28:49,119 - INFO - step 5330, loss: 3.066059, best loss: 2.452848 2025-01-16 01:28:49,269 - INFO - step 5331, loss: 3.477542, best loss: 2.452848 2025-01-16 01:28:49,420 - INFO - step 5332, loss: 3.518133, best loss: 2.452848 2025-01-16 01:28:49,571 - INFO - step 5333, loss: 3.304318, best loss: 2.452848 2025-01-16 01:28:49,721 - INFO - step 5334, loss: 3.379209, best 
loss: 2.452848 2025-01-16 01:28:49,872 - INFO - step 5335, loss: 3.199992, best loss: 2.452848 2025-01-16 01:28:50,022 - INFO - step 5336, loss: 3.312865, best loss: 2.452848 2025-01-16 01:28:50,172 - INFO - step 5337, loss: 3.410294, best loss: 2.452848 2025-01-16 01:28:50,322 - INFO - step 5338, loss: 2.975375, best loss: 2.452848 2025-01-16 01:28:50,472 - INFO - step 5339, loss: 3.259463, best loss: 2.452848 2025-01-16 01:28:50,623 - INFO - step 5340, loss: 3.382735, best loss: 2.452848 2025-01-16 01:28:50,773 - INFO - step 5341, loss: 3.488334, best loss: 2.452848 2025-01-16 01:28:50,923 - INFO - step 5342, loss: 3.390039, best loss: 2.452848 2025-01-16 01:28:51,073 - INFO - step 5343, loss: 3.571535, best loss: 2.452848 2025-01-16 01:28:51,224 - INFO - step 5344, loss: 3.320574, best loss: 2.452848 2025-01-16 01:28:51,374 - INFO - step 5345, loss: 3.015369, best loss: 2.452848 2025-01-16 01:28:51,524 - INFO - step 5346, loss: 3.388178, best loss: 2.452848 2025-01-16 01:28:51,674 - INFO - step 5347, loss: 2.860004, best loss: 2.452848 2025-01-16 01:28:51,824 - INFO - step 5348, loss: 3.147103, best loss: 2.452848 2025-01-16 01:28:51,974 - INFO - step 5349, loss: 3.250587, best loss: 2.452848 2025-01-16 01:28:52,125 - INFO - step 5350, loss: 3.214815, best loss: 2.452848 2025-01-16 01:28:52,275 - INFO - step 5351, loss: 3.165412, best loss: 2.452848 2025-01-16 01:28:52,425 - INFO - step 5352, loss: 3.366822, best loss: 2.452848 2025-01-16 01:28:52,575 - INFO - step 5353, loss: 3.592474, best loss: 2.452848 2025-01-16 01:28:52,726 - INFO - step 5354, loss: 3.358414, best loss: 2.452848 2025-01-16 01:28:52,876 - INFO - step 5355, loss: 3.581487, best loss: 2.452848 2025-01-16 01:28:53,026 - INFO - step 5356, loss: 3.268124, best loss: 2.452848 2025-01-16 01:28:53,176 - INFO - step 5357, loss: 3.352406, best loss: 2.452848 2025-01-16 01:28:53,326 - INFO - step 5358, loss: 3.243063, best loss: 2.452848 2025-01-16 01:28:53,476 - INFO - step 5359, loss: 2.891199, best 
loss: 2.452848 2025-01-16 01:28:53,626 - INFO - step 5360, loss: 3.611067, best loss: 2.452848 2025-01-16 01:28:53,776 - INFO - step 5361, loss: 3.435263, best loss: 2.452848 2025-01-16 01:28:53,926 - INFO - step 5362, loss: 3.203737, best loss: 2.452848 2025-01-16 01:28:54,076 - INFO - step 5363, loss: 3.129827, best loss: 2.452848 2025-01-16 01:28:54,227 - INFO - step 5364, loss: 2.990896, best loss: 2.452848 2025-01-16 01:28:54,377 - INFO - step 5365, loss: 3.131651, best loss: 2.452848 2025-01-16 01:28:54,528 - INFO - step 5366, loss: 3.016463, best loss: 2.452848 2025-01-16 01:28:54,678 - INFO - step 5367, loss: 2.972513, best loss: 2.452848 2025-01-16 01:28:54,828 - INFO - step 5368, loss: 3.467155, best loss: 2.452848 2025-01-16 01:28:54,978 - INFO - step 5369, loss: 3.436371, best loss: 2.452848 2025-01-16 01:28:55,128 - INFO - step 5370, loss: 3.205884, best loss: 2.452848 2025-01-16 01:28:55,278 - INFO - step 5371, loss: 3.478788, best loss: 2.452848 2025-01-16 01:28:55,429 - INFO - step 5372, loss: 3.205531, best loss: 2.452848 2025-01-16 01:28:55,579 - INFO - step 5373, loss: 3.526164, best loss: 2.452848 2025-01-16 01:28:55,729 - INFO - step 5374, loss: 3.599167, best loss: 2.452848 2025-01-16 01:28:55,879 - INFO - step 5375, loss: 3.622770, best loss: 2.452848 2025-01-16 01:28:56,029 - INFO - step 5376, loss: 3.713311, best loss: 2.452848 2025-01-16 01:28:56,179 - INFO - step 5377, loss: 3.385101, best loss: 2.452848 2025-01-16 01:28:56,330 - INFO - step 5378, loss: 3.566384, best loss: 2.452848 2025-01-16 01:28:56,480 - INFO - step 5379, loss: 3.450693, best loss: 2.452848 2025-01-16 01:28:56,630 - INFO - step 5380, loss: 3.272961, best loss: 2.452848 2025-01-16 01:28:56,780 - INFO - step 5381, loss: 3.658669, best loss: 2.452848 2025-01-16 01:28:56,930 - INFO - step 5382, loss: 3.587867, best loss: 2.452848 2025-01-16 01:28:57,081 - INFO - step 5383, loss: 3.587062, best loss: 2.452848 2025-01-16 01:28:57,231 - INFO - step 5384, loss: 3.308870, best 
loss: 2.452848 2025-01-16 01:28:57,381 - INFO - step 5385, loss: 3.483402, best loss: 2.452848 2025-01-16 01:28:57,531 - INFO - step 5386, loss: 3.382740, best loss: 2.452848 2025-01-16 01:28:57,681 - INFO - step 5387, loss: 3.217102, best loss: 2.452848 2025-01-16 01:28:57,831 - INFO - step 5388, loss: 3.173758, best loss: 2.452848 2025-01-16 01:28:57,981 - INFO - step 5389, loss: 3.650435, best loss: 2.452848 2025-01-16 01:28:58,131 - INFO - step 5390, loss: 3.439229, best loss: 2.452848 2025-01-16 01:28:58,281 - INFO - step 5391, loss: 3.684215, best loss: 2.452848 2025-01-16 01:28:58,432 - INFO - step 5392, loss: 3.509652, best loss: 2.452848 2025-01-16 01:28:58,582 - INFO - step 5393, loss: 3.266788, best loss: 2.452848 2025-01-16 01:28:58,732 - INFO - step 5394, loss: 3.616071, best loss: 2.452848 2025-01-16 01:28:58,882 - INFO - step 5395, loss: 3.273022, best loss: 2.452848 2025-01-16 01:28:59,032 - INFO - step 5396, loss: 3.489995, best loss: 2.452848 2025-01-16 01:28:59,182 - INFO - step 5397, loss: 3.236380, best loss: 2.452848 2025-01-16 01:28:59,332 - INFO - step 5398, loss: 3.282873, best loss: 2.452848 2025-01-16 01:28:59,482 - INFO - step 5399, loss: 3.352570, best loss: 2.452848 2025-01-16 01:28:59,633 - INFO - step 5400, loss: 3.407199, best loss: 2.452848 2025-01-16 01:28:59,783 - INFO - step 5401, loss: 3.268601, best loss: 2.452848 2025-01-16 01:28:59,933 - INFO - step 5402, loss: 3.448667, best loss: 2.452848 2025-01-16 01:29:00,084 - INFO - step 5403, loss: 3.099367, best loss: 2.452848 2025-01-16 01:29:00,234 - INFO - step 5404, loss: 2.836076, best loss: 2.452848 2025-01-16 01:29:00,385 - INFO - step 5405, loss: 3.006779, best loss: 2.452848 2025-01-16 01:29:00,535 - INFO - step 5406, loss: 3.182571, best loss: 2.452848 2025-01-16 01:29:00,685 - INFO - step 5407, loss: 3.721381, best loss: 2.452848 2025-01-16 01:29:00,835 - INFO - step 5408, loss: 3.358604, best loss: 2.452848 2025-01-16 01:29:00,985 - INFO - step 5409, loss: 2.914616, best 
loss: 2.452848 2025-01-16 01:29:01,135 - INFO - step 5410, loss: 3.483858, best loss: 2.452848 2025-01-16 01:29:01,286 - INFO - step 5411, loss: 3.252283, best loss: 2.452848 2025-01-16 01:29:01,436 - INFO - step 5412, loss: 3.805118, best loss: 2.452848 2025-01-16 01:29:01,586 - INFO - step 5413, loss: 3.131639, best loss: 2.452848 2025-01-16 01:29:01,736 - INFO - step 5414, loss: 3.429759, best loss: 2.452848 2025-01-16 01:29:01,886 - INFO - step 5415, loss: 3.241816, best loss: 2.452848 2025-01-16 01:29:02,036 - INFO - step 5416, loss: 3.545439, best loss: 2.452848 2025-01-16 01:29:02,186 - INFO - step 5417, loss: 3.345564, best loss: 2.452848 2025-01-16 01:29:02,336 - INFO - step 5418, loss: 3.175949, best loss: 2.452848 2025-01-16 01:29:02,486 - INFO - step 5419, loss: 3.509409, best loss: 2.452848 2025-01-16 01:29:02,636 - INFO - step 5420, loss: 3.286415, best loss: 2.452848 2025-01-16 01:29:02,786 - INFO - step 5421, loss: 3.170426, best loss: 2.452848 2025-01-16 01:29:02,937 - INFO - step 5422, loss: 3.795442, best loss: 2.452848 2025-01-16 01:29:03,087 - INFO - step 5423, loss: 3.308242, best loss: 2.452848 2025-01-16 01:29:03,237 - INFO - step 5424, loss: 2.914036, best loss: 2.452848 2025-01-16 01:29:03,387 - INFO - step 5425, loss: 3.274115, best loss: 2.452848 2025-01-16 01:29:03,537 - INFO - step 5426, loss: 3.512550, best loss: 2.452848 2025-01-16 01:29:03,687 - INFO - step 5427, loss: 3.419523, best loss: 2.452848 2025-01-16 01:29:03,838 - INFO - step 5428, loss: 3.213196, best loss: 2.452848 2025-01-16 01:29:03,988 - INFO - step 5429, loss: 3.179292, best loss: 2.452848 2025-01-16 01:29:04,138 - INFO - step 5430, loss: 3.727081, best loss: 2.452848 2025-01-16 01:29:04,288 - INFO - step 5431, loss: 3.593752, best loss: 2.452848 2025-01-16 01:29:04,438 - INFO - step 5432, loss: 3.487503, best loss: 2.452848 2025-01-16 01:29:04,588 - INFO - step 5433, loss: 3.136924, best loss: 2.452848 2025-01-16 01:29:04,738 - INFO - step 5434, loss: 3.426180, best 
loss: 2.452848 2025-01-16 01:29:04,888 - INFO - step 5435, loss: 3.348169, best loss: 2.452848 2025-01-16 01:29:05,039 - INFO - step 5436, loss: 2.991818, best loss: 2.452848 2025-01-16 01:29:05,189 - INFO - step 5437, loss: 3.376901, best loss: 2.452848 2025-01-16 01:29:05,339 - INFO - step 5438, loss: 3.155487, best loss: 2.452848 2025-01-16 01:29:05,489 - INFO - step 5439, loss: 3.476182, best loss: 2.452848 2025-01-16 01:29:05,640 - INFO - step 5440, loss: 3.199417, best loss: 2.452848 2025-01-16 01:29:05,790 - INFO - step 5441, loss: 3.319924, best loss: 2.452848 2025-01-16 01:29:05,940 - INFO - step 5442, loss: 3.232643, best loss: 2.452848 2025-01-16 01:29:06,090 - INFO - step 5443, loss: 3.235862, best loss: 2.452848 2025-01-16 01:29:06,241 - INFO - step 5444, loss: 3.596802, best loss: 2.452848 2025-01-16 01:29:06,391 - INFO - step 5445, loss: 3.367659, best loss: 2.452848 2025-01-16 01:29:06,541 - INFO - step 5446, loss: 3.538835, best loss: 2.452848 2025-01-16 01:29:06,691 - INFO - step 5447, loss: 3.124452, best loss: 2.452848 2025-01-16 01:29:06,841 - INFO - step 5448, loss: 3.416299, best loss: 2.452848 2025-01-16 01:29:06,991 - INFO - step 5449, loss: 3.460839, best loss: 2.452848 2025-01-16 01:29:07,141 - INFO - step 5450, loss: 3.114134, best loss: 2.452848 2025-01-16 01:29:07,292 - INFO - step 5451, loss: 2.960937, best loss: 2.452848 2025-01-16 01:29:07,442 - INFO - step 5452, loss: 3.022260, best loss: 2.452848 2025-01-16 01:29:07,592 - INFO - step 5453, loss: 3.210151, best loss: 2.452848 2025-01-16 01:29:07,742 - INFO - step 5454, loss: 3.285295, best loss: 2.452848 2025-01-16 01:29:07,892 - INFO - step 5455, loss: 3.468730, best loss: 2.452848 2025-01-16 01:29:08,042 - INFO - step 5456, loss: 3.698128, best loss: 2.452848 2025-01-16 01:29:08,192 - INFO - step 5457, loss: 3.536069, best loss: 2.452848 2025-01-16 01:29:08,342 - INFO - step 5458, loss: 3.522010, best loss: 2.452848 2025-01-16 01:29:08,492 - INFO - step 5459, loss: 3.451701, best 
loss: 2.452848 2025-01-16 01:29:08,642 - INFO - step 5460, loss: 3.484472, best loss: 2.452848 2025-01-16 01:29:08,793 - INFO - step 5461, loss: 3.058676, best loss: 2.452848 2025-01-16 01:29:08,943 - INFO - step 5462, loss: 3.544856, best loss: 2.452848 2025-01-16 01:29:09,093 - INFO - step 5463, loss: 3.719188, best loss: 2.452848 2025-01-16 01:29:09,243 - INFO - step 5464, loss: 3.470142, best loss: 2.452848 2025-01-16 01:29:09,394 - INFO - step 5465, loss: 3.463532, best loss: 2.452848 2025-01-16 01:29:09,544 - INFO - step 5466, loss: 3.497404, best loss: 2.452848 2025-01-16 01:29:09,694 - INFO - step 5467, loss: 3.165009, best loss: 2.452848 2025-01-16 01:29:13,180 - INFO - step 5468, loss: 2.351092, best loss: 2.351092 2025-01-16 01:29:13,341 - INFO - step 5469, loss: 3.347276, best loss: 2.351092 2025-01-16 01:29:13,492 - INFO - step 5470, loss: 3.429907, best loss: 2.351092 2025-01-16 01:29:13,643 - INFO - step 5471, loss: 3.443261, best loss: 2.351092 2025-01-16 01:29:13,793 - INFO - step 5472, loss: 3.499338, best loss: 2.351092 2025-01-16 01:29:13,943 - INFO - step 5473, loss: 3.168087, best loss: 2.351092 2025-01-16 01:29:14,093 - INFO - step 5474, loss: 3.409523, best loss: 2.351092 2025-01-16 01:29:14,243 - INFO - step 5475, loss: 3.424014, best loss: 2.351092 2025-01-16 01:29:14,394 - INFO - step 5476, loss: 3.235695, best loss: 2.351092 2025-01-16 01:29:14,544 - INFO - step 5477, loss: 3.304349, best loss: 2.351092 2025-01-16 01:29:14,694 - INFO - step 5478, loss: 3.170587, best loss: 2.351092 2025-01-16 01:29:14,844 - INFO - step 5479, loss: 3.096604, best loss: 2.351092 2025-01-16 01:29:14,994 - INFO - step 5480, loss: 3.347465, best loss: 2.351092 2025-01-16 01:29:15,144 - INFO - step 5481, loss: 3.150080, best loss: 2.351092 2025-01-16 01:29:15,295 - INFO - step 5482, loss: 3.377888, best loss: 2.351092 2025-01-16 01:29:15,445 - INFO - step 5483, loss: 3.554784, best loss: 2.351092 2025-01-16 01:29:15,595 - INFO - step 5484, loss: 3.199711, best 
loss: 2.351092 2025-01-16 01:29:15,745 - INFO - step 5485, loss: 2.903567, best loss: 2.351092 2025-01-16 01:29:15,896 - INFO - step 5486, loss: 3.360939, best loss: 2.351092 2025-01-16 01:29:16,046 - INFO - step 5487, loss: 3.441043, best loss: 2.351092 2025-01-16 01:29:16,196 - INFO - step 5488, loss: 3.478961, best loss: 2.351092 2025-01-16 01:29:16,346 - INFO - step 5489, loss: 3.468086, best loss: 2.351092 2025-01-16 01:29:16,496 - INFO - step 5490, loss: 3.515321, best loss: 2.351092 2025-01-16 01:29:16,646 - INFO - step 5491, loss: 3.323709, best loss: 2.351092 2025-01-16 01:29:16,796 - INFO - step 5492, loss: 3.637566, best loss: 2.351092 2025-01-16 01:29:16,946 - INFO - step 5493, loss: 3.212346, best loss: 2.351092 2025-01-16 01:29:17,096 - INFO - step 5494, loss: 3.287468, best loss: 2.351092 2025-01-16 01:29:17,247 - INFO - step 5495, loss: 3.464407, best loss: 2.351092 2025-01-16 01:29:17,397 - INFO - step 5496, loss: 3.529217, best loss: 2.351092 2025-01-16 01:29:17,547 - INFO - step 5497, loss: 3.446983, best loss: 2.351092 2025-01-16 01:29:17,698 - INFO - step 5498, loss: 3.278776, best loss: 2.351092 2025-01-16 01:29:17,848 - INFO - step 5499, loss: 3.259624, best loss: 2.351092 2025-01-16 01:29:17,998 - INFO - step 5500, loss: 3.420973, best loss: 2.351092 2025-01-16 01:29:18,148 - INFO - step 5501, loss: 3.645595, best loss: 2.351092 2025-01-16 01:29:18,298 - INFO - step 5502, loss: 3.428337, best loss: 2.351092 2025-01-16 01:29:18,449 - INFO - step 5503, loss: 3.668878, best loss: 2.351092 2025-01-16 01:29:18,599 - INFO - step 5504, loss: 3.687769, best loss: 2.351092 2025-01-16 01:29:18,749 - INFO - step 5505, loss: 3.574579, best loss: 2.351092 2025-01-16 01:29:18,900 - INFO - step 5506, loss: 3.824770, best loss: 2.351092 2025-01-16 01:29:19,050 - INFO - step 5507, loss: 3.625829, best loss: 2.351092 2025-01-16 01:29:19,200 - INFO - step 5508, loss: 3.143651, best loss: 2.351092 2025-01-16 01:29:19,350 - INFO - step 5509, loss: 3.622664, best 
loss: 2.351092 2025-01-16 01:29:19,500 - INFO - step 5510, loss: 3.563907, best loss: 2.351092 2025-01-16 01:29:19,650 - INFO - step 5511, loss: 3.628175, best loss: 2.351092 2025-01-16 01:29:19,801 - INFO - step 5512, loss: 3.216258, best loss: 2.351092 2025-01-16 01:29:19,951 - INFO - step 5513, loss: 3.421256, best loss: 2.351092 2025-01-16 01:29:20,101 - INFO - step 5514, loss: 3.403653, best loss: 2.351092 2025-01-16 01:29:20,251 - INFO - step 5515, loss: 3.335016, best loss: 2.351092 2025-01-16 01:29:20,401 - INFO - step 5516, loss: 3.534785, best loss: 2.351092 2025-01-16 01:29:20,552 - INFO - step 5517, loss: 3.163360, best loss: 2.351092 2025-01-16 01:29:20,702 - INFO - step 5518, loss: 3.181612, best loss: 2.351092 2025-01-16 01:29:20,852 - INFO - step 5519, loss: 3.659057, best loss: 2.351092 2025-01-16 01:29:21,003 - INFO - step 5520, loss: 3.556678, best loss: 2.351092 2025-01-16 01:29:21,153 - INFO - step 5521, loss: 3.656083, best loss: 2.351092 2025-01-16 01:29:21,303 - INFO - step 5522, loss: 3.438484, best loss: 2.351092 2025-01-16 01:29:21,453 - INFO - step 5523, loss: 3.689922, best loss: 2.351092 2025-01-16 01:29:21,603 - INFO - step 5524, loss: 3.597947, best loss: 2.351092 2025-01-16 01:29:21,753 - INFO - step 5525, loss: 3.244416, best loss: 2.351092 2025-01-16 01:29:21,904 - INFO - step 5526, loss: 3.256963, best loss: 2.351092 2025-01-16 01:29:22,054 - INFO - step 5527, loss: 3.652539, best loss: 2.351092 2025-01-16 01:29:22,205 - INFO - step 5528, loss: 3.266223, best loss: 2.351092 2025-01-16 01:29:22,355 - INFO - step 5529, loss: 3.038914, best loss: 2.351092 2025-01-16 01:29:22,505 - INFO - step 5530, loss: 3.500650, best loss: 2.351092 2025-01-16 01:29:22,655 - INFO - step 5531, loss: 3.560556, best loss: 2.351092 2025-01-16 01:29:22,806 - INFO - step 5532, loss: 3.443190, best loss: 2.351092 2025-01-16 01:29:22,956 - INFO - step 5533, loss: 3.090187, best loss: 2.351092 2025-01-16 01:29:23,106 - INFO - step 5534, loss: 2.891612, best 
loss: 2.351092 2025-01-16 01:29:23,256 - INFO - step 5535, loss: 2.899980, best loss: 2.351092 2025-01-16 01:29:23,407 - INFO - step 5536, loss: 2.995714, best loss: 2.351092 2025-01-16 01:29:23,557 - INFO - step 5537, loss: 3.150975, best loss: 2.351092 2025-01-16 01:29:23,707 - INFO - step 5538, loss: 3.367661, best loss: 2.351092 2025-01-16 01:29:23,857 - INFO - step 5539, loss: 3.279802, best loss: 2.351092 2025-01-16 01:29:24,007 - INFO - step 5540, loss: 3.246238, best loss: 2.351092 2025-01-16 01:29:24,158 - INFO - step 5541, loss: 3.423679, best loss: 2.351092 2025-01-16 01:29:24,308 - INFO - step 5542, loss: 3.308950, best loss: 2.351092 2025-01-16 01:29:24,458 - INFO - step 5543, loss: 3.197192, best loss: 2.351092 2025-01-16 01:29:24,608 - INFO - step 5544, loss: 3.417955, best loss: 2.351092 2025-01-16 01:29:24,759 - INFO - step 5545, loss: 3.489847, best loss: 2.351092 2025-01-16 01:29:24,909 - INFO - step 5546, loss: 3.015915, best loss: 2.351092 2025-01-16 01:29:25,059 - INFO - step 5547, loss: 3.042687, best loss: 2.351092 2025-01-16 01:29:25,209 - INFO - step 5548, loss: 3.327630, best loss: 2.351092 2025-01-16 01:29:25,359 - INFO - step 5549, loss: 3.415879, best loss: 2.351092 2025-01-16 01:29:25,510 - INFO - step 5550, loss: 3.021400, best loss: 2.351092 2025-01-16 01:29:25,660 - INFO - step 5551, loss: 3.039993, best loss: 2.351092 2025-01-16 01:29:25,810 - INFO - step 5552, loss: 3.311395, best loss: 2.351092 2025-01-16 01:29:25,960 - INFO - step 5553, loss: 3.125424, best loss: 2.351092 2025-01-16 01:29:26,110 - INFO - step 5554, loss: 2.999521, best loss: 2.351092 2025-01-16 01:29:26,260 - INFO - step 5555, loss: 3.360379, best loss: 2.351092 2025-01-16 01:29:26,410 - INFO - step 5556, loss: 3.121217, best loss: 2.351092 2025-01-16 01:29:26,561 - INFO - step 5557, loss: 3.078260, best loss: 2.351092 2025-01-16 01:29:26,711 - INFO - step 5558, loss: 3.061821, best loss: 2.351092 2025-01-16 01:29:26,861 - INFO - step 5559, loss: 3.271181, best 
loss: 2.351092 2025-01-16 01:29:27,011 - INFO - step 5560, loss: 3.001021, best loss: 2.351092 2025-01-16 01:29:27,161 - INFO - step 5561, loss: 3.052577, best loss: 2.351092 2025-01-16 01:29:27,312 - INFO - step 5562, loss: 2.893364, best loss: 2.351092 2025-01-16 01:29:27,462 - INFO - step 5563, loss: 3.275488, best loss: 2.351092 2025-01-16 01:29:27,612 - INFO - step 5564, loss: 3.600548, best loss: 2.351092 2025-01-16 01:29:27,763 - INFO - step 5565, loss: 3.800101, best loss: 2.351092 2025-01-16 01:29:27,913 - INFO - step 5566, loss: 3.398564, best loss: 2.351092 2025-01-16 01:29:28,063 - INFO - step 5567, loss: 3.492810, best loss: 2.351092 2025-01-16 01:29:28,213 - INFO - step 5568, loss: 3.364580, best loss: 2.351092 2025-01-16 01:29:28,364 - INFO - step 5569, loss: 3.295390, best loss: 2.351092 2025-01-16 01:29:28,514 - INFO - step 5570, loss: 3.146602, best loss: 2.351092 2025-01-16 01:29:28,664 - INFO - step 5571, loss: 3.469912, best loss: 2.351092 2025-01-16 01:29:28,814 - INFO - step 5572, loss: 3.180775, best loss: 2.351092 2025-01-16 01:29:28,965 - INFO - step 5573, loss: 2.948355, best loss: 2.351092 2025-01-16 01:29:29,115 - INFO - step 5574, loss: 3.242840, best loss: 2.351092 2025-01-16 01:29:29,265 - INFO - step 5575, loss: 3.212530, best loss: 2.351092 2025-01-16 01:29:29,415 - INFO - step 5576, loss: 3.398598, best loss: 2.351092 2025-01-16 01:29:29,566 - INFO - step 5577, loss: 2.579689, best loss: 2.351092 2025-01-16 01:29:29,716 - INFO - step 5578, loss: 3.150388, best loss: 2.351092 2025-01-16 01:29:29,866 - INFO - step 5579, loss: 3.382452, best loss: 2.351092 2025-01-16 01:29:30,016 - INFO - step 5580, loss: 3.325097, best loss: 2.351092 2025-01-16 01:29:30,166 - INFO - step 5581, loss: 3.182955, best loss: 2.351092 2025-01-16 01:29:30,317 - INFO - step 5582, loss: 3.170136, best loss: 2.351092 2025-01-16 01:29:30,467 - INFO - step 5583, loss: 3.249703, best loss: 2.351092 2025-01-16 01:29:30,617 - INFO - step 5584, loss: 3.004989, best 
loss: 2.351092 2025-01-16 01:29:30,767 - INFO - step 5585, loss: 3.268147, best loss: 2.351092 2025-01-16 01:29:30,918 - INFO - step 5586, loss: 3.207484, best loss: 2.351092 2025-01-16 01:29:31,068 - INFO - step 5587, loss: 3.335938, best loss: 2.351092 2025-01-16 01:29:31,218 - INFO - step 5588, loss: 3.017253, best loss: 2.351092 2025-01-16 01:29:31,368 - INFO - step 5589, loss: 3.129523, best loss: 2.351092 2025-01-16 01:29:31,519 - INFO - step 5590, loss: 3.014709, best loss: 2.351092 2025-01-16 01:29:31,669 - INFO - step 5591, loss: 2.991694, best loss: 2.351092 2025-01-16 01:29:31,819 - INFO - step 5592, loss: 3.117132, best loss: 2.351092 2025-01-16 01:29:31,970 - INFO - step 5593, loss: 3.091823, best loss: 2.351092 2025-01-16 01:29:32,120 - INFO - step 5594, loss: 3.015810, best loss: 2.351092 2025-01-16 01:29:32,270 - INFO - step 5595, loss: 2.734251, best loss: 2.351092 2025-01-16 01:29:32,420 - INFO - step 5596, loss: 2.684782, best loss: 2.351092 2025-01-16 01:29:32,570 - INFO - step 5597, loss: 2.536901, best loss: 2.351092 2025-01-16 01:29:32,720 - INFO - step 5598, loss: 3.233366, best loss: 2.351092 2025-01-16 01:29:32,871 - INFO - step 5599, loss: 3.431352, best loss: 2.351092 2025-01-16 01:29:33,021 - INFO - step 5600, loss: 3.414868, best loss: 2.351092 2025-01-16 01:29:33,171 - INFO - step 5601, loss: 3.557146, best loss: 2.351092 2025-01-16 01:29:33,321 - INFO - step 5602, loss: 3.538886, best loss: 2.351092 2025-01-16 01:29:33,472 - INFO - step 5603, loss: 3.178516, best loss: 2.351092 2025-01-16 01:29:33,622 - INFO - step 5604, loss: 3.273116, best loss: 2.351092 2025-01-16 01:29:33,772 - INFO - step 5605, loss: 3.492987, best loss: 2.351092 2025-01-16 01:29:33,922 - INFO - step 5606, loss: 3.212853, best loss: 2.351092 2025-01-16 01:29:34,072 - INFO - step 5607, loss: 2.728943, best loss: 2.351092 2025-01-16 01:29:34,223 - INFO - step 5608, loss: 3.175205, best loss: 2.351092 2025-01-16 01:29:34,373 - INFO - step 5609, loss: 2.952007, best 
loss: 2.351092 2025-01-16 01:29:34,523 - INFO - step 5610, loss: 3.478431, best loss: 2.351092 2025-01-16 01:29:34,674 - INFO - step 5611, loss: 3.361655, best loss: 2.351092 2025-01-16 01:29:34,824 - INFO - step 5612, loss: 3.436566, best loss: 2.351092 2025-01-16 01:29:34,974 - INFO - step 5613, loss: 3.377960, best loss: 2.351092 2025-01-16 01:29:35,124 - INFO - step 5614, loss: 3.384472, best loss: 2.351092 2025-01-16 01:29:35,274 - INFO - step 5615, loss: 2.913676, best loss: 2.351092 2025-01-16 01:29:35,424 - INFO - step 5616, loss: 3.484078, best loss: 2.351092 2025-01-16 01:29:35,575 - INFO - step 5617, loss: 3.271673, best loss: 2.351092 2025-01-16 01:29:35,725 - INFO - step 5618, loss: 3.527311, best loss: 2.351092 2025-01-16 01:29:35,875 - INFO - step 5619, loss: 3.331048, best loss: 2.351092 2025-01-16 01:29:36,026 - INFO - step 5620, loss: 3.201471, best loss: 2.351092 2025-01-16 01:29:36,176 - INFO - step 5621, loss: 3.624362, best loss: 2.351092 2025-01-16 01:29:36,326 - INFO - step 5622, loss: 3.124637, best loss: 2.351092 2025-01-16 01:29:36,477 - INFO - step 5623, loss: 3.344779, best loss: 2.351092 2025-01-16 01:29:36,627 - INFO - step 5624, loss: 3.445292, best loss: 2.351092 2025-01-16 01:29:36,777 - INFO - step 5625, loss: 3.491574, best loss: 2.351092 2025-01-16 01:29:36,927 - INFO - step 5626, loss: 3.364745, best loss: 2.351092 2025-01-16 01:29:37,078 - INFO - step 5627, loss: 3.205201, best loss: 2.351092 2025-01-16 01:29:37,228 - INFO - step 5628, loss: 3.121707, best loss: 2.351092 2025-01-16 01:29:37,378 - INFO - step 5629, loss: 3.198427, best loss: 2.351092 2025-01-16 01:29:37,528 - INFO - step 5630, loss: 2.839523, best loss: 2.351092 2025-01-16 01:29:37,679 - INFO - step 5631, loss: 3.542536, best loss: 2.351092 2025-01-16 01:29:37,829 - INFO - step 5632, loss: 2.741671, best loss: 2.351092 2025-01-16 01:29:37,979 - INFO - step 5633, loss: 2.840652, best loss: 2.351092 2025-01-16 01:29:38,129 - INFO - step 5634, loss: 3.252028, best 
loss: 2.351092 2025-01-16 01:29:38,280 - INFO - step 5635, loss: 3.293326, best loss: 2.351092 2025-01-16 01:29:38,430 - INFO - step 5636, loss: 3.207374, best loss: 2.351092 2025-01-16 01:29:38,580 - INFO - step 5637, loss: 2.926913, best loss: 2.351092 2025-01-16 01:29:38,730 - INFO - step 5638, loss: 3.099425, best loss: 2.351092 2025-01-16 01:29:38,881 - INFO - step 5639, loss: 3.241897, best loss: 2.351092 2025-01-16 01:29:39,031 - INFO - step 5640, loss: 2.990644, best loss: 2.351092 2025-01-16 01:29:39,181 - INFO - step 5641, loss: 2.884707, best loss: 2.351092 2025-01-16 01:29:39,331 - INFO - step 5642, loss: 3.209527, best loss: 2.351092 2025-01-16 01:29:39,482 - INFO - step 5643, loss: 3.198492, best loss: 2.351092 2025-01-16 01:29:39,632 - INFO - step 5644, loss: 3.101486, best loss: 2.351092 2025-01-16 01:29:39,782 - INFO - step 5645, loss: 2.844499, best loss: 2.351092 2025-01-16 01:29:39,932 - INFO - step 5646, loss: 3.178397, best loss: 2.351092 2025-01-16 01:29:40,083 - INFO - step 5647, loss: 3.435062, best loss: 2.351092 2025-01-16 01:29:40,233 - INFO - step 5648, loss: 3.082705, best loss: 2.351092 2025-01-16 01:29:40,383 - INFO - step 5649, loss: 3.280672, best loss: 2.351092 2025-01-16 01:29:40,533 - INFO - step 5650, loss: 3.449862, best loss: 2.351092 2025-01-16 01:29:40,683 - INFO - step 5651, loss: 3.395646, best loss: 2.351092 2025-01-16 01:29:40,834 - INFO - step 5652, loss: 3.359720, best loss: 2.351092 2025-01-16 01:29:40,984 - INFO - step 5653, loss: 3.270106, best loss: 2.351092 2025-01-16 01:29:41,136 - INFO - step 5654, loss: 3.263759, best loss: 2.351092 2025-01-16 01:29:41,286 - INFO - step 5655, loss: 3.124291, best loss: 2.351092 2025-01-16 01:29:41,436 - INFO - step 5656, loss: 3.705623, best loss: 2.351092 2025-01-16 01:29:41,587 - INFO - step 5657, loss: 3.321357, best loss: 2.351092 2025-01-16 01:29:41,737 - INFO - step 5658, loss: 3.541465, best loss: 2.351092 2025-01-16 01:29:41,887 - INFO - step 5659, loss: 2.729998, best 
loss: 2.351092 2025-01-16 01:29:42,037 - INFO - step 5660, loss: 2.893244, best loss: 2.351092 2025-01-16 01:29:42,187 - INFO - step 5661, loss: 3.302657, best loss: 2.351092 2025-01-16 01:29:42,338 - INFO - step 5662, loss: 3.307796, best loss: 2.351092 2025-01-16 01:29:42,488 - INFO - step 5663, loss: 3.247199, best loss: 2.351092 2025-01-16 01:29:42,638 - INFO - step 5664, loss: 3.354889, best loss: 2.351092 2025-01-16 01:29:42,788 - INFO - step 5665, loss: 3.139150, best loss: 2.351092 2025-01-16 01:29:42,938 - INFO - step 5666, loss: 3.207986, best loss: 2.351092 2025-01-16 01:29:43,088 - INFO - step 5667, loss: 3.326872, best loss: 2.351092 2025-01-16 01:29:43,238 - INFO - step 5668, loss: 2.826940, best loss: 2.351092 2025-01-16 01:29:43,389 - INFO - step 5669, loss: 3.132497, best loss: 2.351092 2025-01-16 01:29:43,539 - INFO - step 5670, loss: 3.275094, best loss: 2.351092 2025-01-16 01:29:43,689 - INFO - step 5671, loss: 3.361659, best loss: 2.351092 2025-01-16 01:29:43,840 - INFO - step 5672, loss: 3.277987, best loss: 2.351092 2025-01-16 01:29:43,990 - INFO - step 5673, loss: 3.473724, best loss: 2.351092 2025-01-16 01:29:44,140 - INFO - step 5674, loss: 3.296004, best loss: 2.351092 2025-01-16 01:29:44,290 - INFO - step 5675, loss: 2.962694, best loss: 2.351092 2025-01-16 01:29:44,440 - INFO - step 5676, loss: 3.385476, best loss: 2.351092 2025-01-16 01:29:44,590 - INFO - step 5677, loss: 2.805747, best loss: 2.351092 2025-01-16 01:29:44,740 - INFO - step 5678, loss: 3.069239, best loss: 2.351092 2025-01-16 01:29:44,890 - INFO - step 5679, loss: 3.117549, best loss: 2.351092 2025-01-16 01:29:45,040 - INFO - step 5680, loss: 3.111240, best loss: 2.351092 2025-01-16 01:29:45,191 - INFO - step 5681, loss: 3.061665, best loss: 2.351092 2025-01-16 01:29:45,341 - INFO - step 5682, loss: 3.253103, best loss: 2.351092 2025-01-16 01:29:45,491 - INFO - step 5683, loss: 3.467635, best loss: 2.351092 2025-01-16 01:29:45,641 - INFO - step 5684, loss: 3.316990, best 
loss: 2.351092 2025-01-16 01:29:45,791 - INFO - step 5685, loss: 3.521659, best loss: 2.351092 2025-01-16 01:29:45,941 - INFO - step 5686, loss: 3.175261, best loss: 2.351092 2025-01-16 01:29:46,091 - INFO - step 5687, loss: 3.325182, best loss: 2.351092 2025-01-16 01:29:46,241 - INFO - step 5688, loss: 3.186177, best loss: 2.351092 2025-01-16 01:29:46,392 - INFO - step 5689, loss: 2.739521, best loss: 2.351092 2025-01-16 01:29:46,542 - INFO - step 5690, loss: 3.459457, best loss: 2.351092 2025-01-16 01:29:46,692 - INFO - step 5691, loss: 3.310596, best loss: 2.351092 2025-01-16 01:29:46,842 - INFO - step 5692, loss: 3.133027, best loss: 2.351092 2025-01-16 01:29:46,992 - INFO - step 5693, loss: 3.035291, best loss: 2.351092 2025-01-16 01:29:47,142 - INFO - step 5694, loss: 2.927867, best loss: 2.351092 2025-01-16 01:29:47,292 - INFO - step 5695, loss: 3.079307, best loss: 2.351092 2025-01-16 01:29:47,442 - INFO - step 5696, loss: 2.957567, best loss: 2.351092 2025-01-16 01:29:47,592 - INFO - step 5697, loss: 2.944559, best loss: 2.351092 2025-01-16 01:29:47,743 - INFO - step 5698, loss: 3.444142, best loss: 2.351092 2025-01-16 01:29:47,893 - INFO - step 5699, loss: 3.434760, best loss: 2.351092 2025-01-16 01:29:48,043 - INFO - step 5700, loss: 3.133609, best loss: 2.351092 2025-01-16 01:29:48,193 - INFO - step 5701, loss: 3.379823, best loss: 2.351092 2025-01-16 01:29:48,343 - INFO - step 5702, loss: 3.113176, best loss: 2.351092 2025-01-16 01:29:48,494 - INFO - step 5703, loss: 3.428682, best loss: 2.351092 2025-01-16 01:29:48,644 - INFO - step 5704, loss: 3.514966, best loss: 2.351092 2025-01-16 01:29:48,794 - INFO - step 5705, loss: 3.549654, best loss: 2.351092 2025-01-16 01:29:48,944 - INFO - step 5706, loss: 3.638837, best loss: 2.351092 2025-01-16 01:29:49,094 - INFO - step 5707, loss: 3.299569, best loss: 2.351092 2025-01-16 01:29:49,244 - INFO - step 5708, loss: 3.482168, best loss: 2.351092 2025-01-16 01:29:49,394 - INFO - step 5709, loss: 3.321247, best 
loss: 2.351092 2025-01-16 01:29:49,545 - INFO - step 5710, loss: 3.185123, best loss: 2.351092 2025-01-16 01:29:49,695 - INFO - step 5711, loss: 3.532018, best loss: 2.351092 2025-01-16 01:29:49,845 - INFO - step 5712, loss: 3.453160, best loss: 2.351092 2025-01-16 01:29:49,995 - INFO - step 5713, loss: 3.499566, best loss: 2.351092 2025-01-16 01:29:50,145 - INFO - step 5714, loss: 3.174524, best loss: 2.351092 2025-01-16 01:29:50,296 - INFO - step 5715, loss: 3.376297, best loss: 2.351092 2025-01-16 01:29:50,446 - INFO - step 5716, loss: 3.333346, best loss: 2.351092 2025-01-16 01:29:50,596 - INFO - step 5717, loss: 3.120638, best loss: 2.351092 2025-01-16 01:29:50,746 - INFO - step 5718, loss: 3.049562, best loss: 2.351092 2025-01-16 01:29:50,896 - INFO - step 5719, loss: 3.474714, best loss: 2.351092 2025-01-16 01:29:51,046 - INFO - step 5720, loss: 3.356921, best loss: 2.351092 2025-01-16 01:29:51,196 - INFO - step 5721, loss: 3.596826, best loss: 2.351092 2025-01-16 01:29:51,346 - INFO - step 5722, loss: 3.461171, best loss: 2.351092 2025-01-16 01:29:51,496 - INFO - step 5723, loss: 3.222197, best loss: 2.351092 2025-01-16 01:29:51,646 - INFO - step 5724, loss: 3.552730, best loss: 2.351092 2025-01-16 01:29:51,796 - INFO - step 5725, loss: 3.185334, best loss: 2.351092 2025-01-16 01:29:51,946 - INFO - step 5726, loss: 3.421471, best loss: 2.351092 2025-01-16 01:29:52,096 - INFO - step 5727, loss: 3.116044, best loss: 2.351092 2025-01-16 01:29:52,246 - INFO - step 5728, loss: 3.184118, best loss: 2.351092 2025-01-16 01:29:52,397 - INFO - step 5729, loss: 3.249270, best loss: 2.351092 2025-01-16 01:29:52,547 - INFO - step 5730, loss: 3.282000, best loss: 2.351092 2025-01-16 01:29:52,697 - INFO - step 5731, loss: 3.175296, best loss: 2.351092 2025-01-16 01:29:52,848 - INFO - step 5732, loss: 3.322926, best loss: 2.351092 2025-01-16 01:29:52,997 - INFO - step 5733, loss: 2.965328, best loss: 2.351092 2025-01-16 01:29:53,147 - INFO - step 5734, loss: 2.725068, best 
loss: 2.351092 2025-01-16 01:29:53,298 - INFO - step 5735, loss: 2.915658, best loss: 2.351092 2025-01-16 01:29:53,448 - INFO - step 5736, loss: 3.144011, best loss: 2.351092 2025-01-16 01:29:53,598 - INFO - step 5737, loss: 3.622890, best loss: 2.351092 2025-01-16 01:29:53,748 - INFO - step 5738, loss: 3.243435, best loss: 2.351092 2025-01-16 01:29:53,898 - INFO - step 5739, loss: 2.789666, best loss: 2.351092 2025-01-16 01:29:54,048 - INFO - step 5740, loss: 3.370995, best loss: 2.351092 2025-01-16 01:29:54,198 - INFO - step 5741, loss: 3.125669, best loss: 2.351092 2025-01-16 01:29:54,348 - INFO - step 5742, loss: 3.700435, best loss: 2.351092 2025-01-16 01:29:54,498 - INFO - step 5743, loss: 2.969458, best loss: 2.351092 2025-01-16 01:29:54,648 - INFO - step 5744, loss: 3.308650, best loss: 2.351092 2025-01-16 01:29:54,799 - INFO - step 5745, loss: 3.137798, best loss: 2.351092 2025-01-16 01:29:54,949 - INFO - step 5746, loss: 3.427313, best loss: 2.351092 2025-01-16 01:29:55,099 - INFO - step 5747, loss: 3.269033, best loss: 2.351092 2025-01-16 01:29:55,249 - INFO - step 5748, loss: 3.108880, best loss: 2.351092 2025-01-16 01:29:55,399 - INFO - step 5749, loss: 3.403307, best loss: 2.351092 2025-01-16 01:29:55,549 - INFO - step 5750, loss: 3.244614, best loss: 2.351092 2025-01-16 01:29:55,700 - INFO - step 5751, loss: 3.076195, best loss: 2.351092 2025-01-16 01:29:55,850 - INFO - step 5752, loss: 3.625031, best loss: 2.351092 2025-01-16 01:29:56,000 - INFO - step 5753, loss: 3.134338, best loss: 2.351092 2025-01-16 01:29:56,150 - INFO - step 5754, loss: 2.752794, best loss: 2.351092 2025-01-16 01:29:56,300 - INFO - step 5755, loss: 3.197565, best loss: 2.351092 2025-01-16 01:29:56,450 - INFO - step 5756, loss: 3.436896, best loss: 2.351092 2025-01-16 01:29:56,600 - INFO - step 5757, loss: 3.288953, best loss: 2.351092 2025-01-16 01:29:56,750 - INFO - step 5758, loss: 3.059796, best loss: 2.351092 2025-01-16 01:29:56,901 - INFO - step 5759, loss: 3.068886, best 
loss: 2.351092 2025-01-16 01:29:57,051 - INFO - step 5760, loss: 3.540125, best loss: 2.351092 2025-01-16 01:29:57,201 - INFO - step 5761, loss: 3.517115, best loss: 2.351092 2025-01-16 01:29:57,351 - INFO - step 5762, loss: 3.412584, best loss: 2.351092 2025-01-16 01:29:57,501 - INFO - step 5763, loss: 3.110314, best loss: 2.351092 2025-01-16 01:29:57,651 - INFO - step 5764, loss: 3.454862, best loss: 2.351092 2025-01-16 01:29:57,801 - INFO - step 5765, loss: 3.317842, best loss: 2.351092 2025-01-16 01:29:57,951 - INFO - step 5766, loss: 2.994367, best loss: 2.351092 2025-01-16 01:29:58,101 - INFO - step 5767, loss: 3.301605, best loss: 2.351092 2025-01-16 01:29:58,252 - INFO - step 5768, loss: 3.046611, best loss: 2.351092 2025-01-16 01:29:58,402 - INFO - step 5769, loss: 3.288706, best loss: 2.351092 2025-01-16 01:29:58,552 - INFO - step 5770, loss: 3.021744, best loss: 2.351092 2025-01-16 01:29:58,702 - INFO - step 5771, loss: 3.171498, best loss: 2.351092 2025-01-16 01:29:58,853 - INFO - step 5772, loss: 3.051075, best loss: 2.351092 2025-01-16 01:29:59,003 - INFO - step 5773, loss: 3.123983, best loss: 2.351092 2025-01-16 01:29:59,153 - INFO - step 5774, loss: 3.435214, best loss: 2.351092 2025-01-16 01:29:59,303 - INFO - step 5775, loss: 3.354804, best loss: 2.351092 2025-01-16 01:29:59,454 - INFO - step 5776, loss: 3.525025, best loss: 2.351092 2025-01-16 01:29:59,604 - INFO - step 5777, loss: 3.164852, best loss: 2.351092 2025-01-16 01:29:59,754 - INFO - step 5778, loss: 3.399924, best loss: 2.351092 2025-01-16 01:29:59,904 - INFO - step 5779, loss: 3.362642, best loss: 2.351092 2025-01-16 01:30:00,055 - INFO - step 5780, loss: 2.982150, best loss: 2.351092 2025-01-16 01:30:00,205 - INFO - step 5781, loss: 2.861982, best loss: 2.351092 2025-01-16 01:30:00,355 - INFO - step 5782, loss: 2.976107, best loss: 2.351092 2025-01-16 01:30:00,505 - INFO - step 5783, loss: 3.112368, best loss: 2.351092 2025-01-16 01:30:00,655 - INFO - step 5784, loss: 3.195008, best 
loss: 2.351092 2025-01-16 01:30:00,805 - INFO - step 5785, loss: 3.393828, best loss: 2.351092 2025-01-16 01:30:00,955 - INFO - step 5786, loss: 3.582043, best loss: 2.351092 2025-01-16 01:30:01,105 - INFO - step 5787, loss: 3.440628, best loss: 2.351092 2025-01-16 01:30:01,255 - INFO - step 5788, loss: 3.403890, best loss: 2.351092 2025-01-16 01:30:01,405 - INFO - step 5789, loss: 3.400652, best loss: 2.351092 2025-01-16 01:30:01,555 - INFO - step 5790, loss: 3.388764, best loss: 2.351092 2025-01-16 01:30:01,705 - INFO - step 5791, loss: 2.943453, best loss: 2.351092 2025-01-16 01:30:01,856 - INFO - step 5792, loss: 3.427355, best loss: 2.351092 2025-01-16 01:30:02,006 - INFO - step 5793, loss: 3.613439, best loss: 2.351092 2025-01-16 01:30:02,156 - INFO - step 5794, loss: 3.373183, best loss: 2.351092 2025-01-16 01:30:02,306 - INFO - step 5795, loss: 3.350059, best loss: 2.351092 2025-01-16 01:30:02,456 - INFO - step 5796, loss: 3.410704, best loss: 2.351092 2025-01-16 01:30:02,606 - INFO - step 5797, loss: 3.051106, best loss: 2.351092 2025-01-16 01:30:06,048 - INFO - step 5798, loss: 2.306922, best loss: 2.306922 2025-01-16 01:30:06,207 - INFO - step 5799, loss: 3.232410, best loss: 2.306922 2025-01-16 01:30:06,358 - INFO - step 5800, loss: 3.303527, best loss: 2.306922 2025-01-16 01:30:06,508 - INFO - step 5801, loss: 3.374063, best loss: 2.306922 2025-01-16 01:30:06,658 - INFO - step 5802, loss: 3.380068, best loss: 2.306922 2025-01-16 01:30:06,808 - INFO - step 5803, loss: 3.088477, best loss: 2.306922 2025-01-16 01:30:06,958 - INFO - step 5804, loss: 3.302307, best loss: 2.306922 2025-01-16 01:30:07,108 - INFO - step 5805, loss: 3.335850, best loss: 2.306922 2025-01-16 01:30:07,258 - INFO - step 5806, loss: 3.110304, best loss: 2.306922 2025-01-16 01:30:07,409 - INFO - step 5807, loss: 3.159434, best loss: 2.306922 2025-01-16 01:30:07,559 - INFO - step 5808, loss: 3.064869, best loss: 2.306922 2025-01-16 01:30:07,709 - INFO - step 5809, loss: 3.029978, best 
loss: 2.306922 2025-01-16 01:30:07,859 - INFO - step 5810, loss: 3.226610, best loss: 2.306922 2025-01-16 01:30:08,009 - INFO - step 5811, loss: 3.027792, best loss: 2.306922 2025-01-16 01:30:08,159 - INFO - step 5812, loss: 3.208602, best loss: 2.306922 2025-01-16 01:30:08,309 - INFO - step 5813, loss: 3.408837, best loss: 2.306922 2025-01-16 01:30:08,459 - INFO - step 5814, loss: 3.070366, best loss: 2.306922 2025-01-16 01:30:08,609 - INFO - step 5815, loss: 2.859056, best loss: 2.306922 2025-01-16 01:30:08,759 - INFO - step 5816, loss: 3.285445, best loss: 2.306922 2025-01-16 01:30:08,910 - INFO - step 5817, loss: 3.290676, best loss: 2.306922 2025-01-16 01:30:09,060 - INFO - step 5818, loss: 3.372032, best loss: 2.306922 2025-01-16 01:30:09,210 - INFO - step 5819, loss: 3.283887, best loss: 2.306922 2025-01-16 01:30:09,360 - INFO - step 5820, loss: 3.368288, best loss: 2.306922 2025-01-16 01:30:09,511 - INFO - step 5821, loss: 3.182237, best loss: 2.306922 2025-01-16 01:30:09,661 - INFO - step 5822, loss: 3.428674, best loss: 2.306922 2025-01-16 01:30:09,811 - INFO - step 5823, loss: 3.127734, best loss: 2.306922 2025-01-16 01:30:09,961 - INFO - step 5824, loss: 3.223743, best loss: 2.306922 2025-01-16 01:30:10,111 - INFO - step 5825, loss: 3.358716, best loss: 2.306922 2025-01-16 01:30:10,261 - INFO - step 5826, loss: 3.394808, best loss: 2.306922 2025-01-16 01:30:10,411 - INFO - step 5827, loss: 3.316212, best loss: 2.306922 2025-01-16 01:30:10,561 - INFO - step 5828, loss: 3.101691, best loss: 2.306922 2025-01-16 01:30:10,712 - INFO - step 5829, loss: 3.128484, best loss: 2.306922 2025-01-16 01:30:10,862 - INFO - step 5830, loss: 3.336385, best loss: 2.306922 2025-01-16 01:30:11,012 - INFO - step 5831, loss: 3.585419, best loss: 2.306922 2025-01-16 01:30:11,162 - INFO - step 5832, loss: 3.380377, best loss: 2.306922 2025-01-16 01:30:11,312 - INFO - step 5833, loss: 3.621895, best loss: 2.306922 2025-01-16 01:30:11,463 - INFO - step 5834, loss: 3.598280, best 
loss: 2.306922 2025-01-16 01:30:11,613 - INFO - step 5835, loss: 3.432361, best loss: 2.306922 2025-01-16 01:30:11,763 - INFO - step 5836, loss: 3.672085, best loss: 2.306922 2025-01-16 01:30:11,913 - INFO - step 5837, loss: 3.430468, best loss: 2.306922 2025-01-16 01:30:12,063 - INFO - step 5838, loss: 2.989463, best loss: 2.306922 2025-01-16 01:30:12,213 - INFO - step 5839, loss: 3.502819, best loss: 2.306922 2025-01-16 01:30:12,364 - INFO - step 5840, loss: 3.419319, best loss: 2.306922 2025-01-16 01:30:12,514 - INFO - step 5841, loss: 3.559744, best loss: 2.306922 2025-01-16 01:30:12,664 - INFO - step 5842, loss: 3.109028, best loss: 2.306922 2025-01-16 01:30:12,814 - INFO - step 5843, loss: 3.326579, best loss: 2.306922 2025-01-16 01:30:12,964 - INFO - step 5844, loss: 3.357738, best loss: 2.306922 2025-01-16 01:30:13,114 - INFO - step 5845, loss: 3.208817, best loss: 2.306922 2025-01-16 01:30:13,265 - INFO - step 5846, loss: 3.410675, best loss: 2.306922 2025-01-16 01:30:13,415 - INFO - step 5847, loss: 3.048843, best loss: 2.306922 2025-01-16 01:30:13,565 - INFO - step 5848, loss: 2.953462, best loss: 2.306922 2025-01-16 01:30:13,715 - INFO - step 5849, loss: 3.457302, best loss: 2.306922 2025-01-16 01:30:13,865 - INFO - step 5850, loss: 3.443587, best loss: 2.306922 2025-01-16 01:30:14,016 - INFO - step 5851, loss: 3.468288, best loss: 2.306922 2025-01-16 01:30:14,166 - INFO - step 5852, loss: 3.300518, best loss: 2.306922 2025-01-16 01:30:14,316 - INFO - step 5853, loss: 3.543316, best loss: 2.306922 2025-01-16 01:30:14,466 - INFO - step 5854, loss: 3.514774, best loss: 2.306922 2025-01-16 01:30:14,616 - INFO - step 5855, loss: 3.152094, best loss: 2.306922 2025-01-16 01:30:14,766 - INFO - step 5856, loss: 3.142536, best loss: 2.306922 2025-01-16 01:30:14,916 - INFO - step 5857, loss: 3.487867, best loss: 2.306922 2025-01-16 01:30:15,067 - INFO - step 5858, loss: 3.073419, best loss: 2.306922 2025-01-16 01:30:15,217 - INFO - step 5859, loss: 2.873948, best 
loss: 2.306922 2025-01-16 01:30:15,367 - INFO - step 5860, loss: 3.302829, best loss: 2.306922 2025-01-16 01:30:15,518 - INFO - step 5861, loss: 3.342275, best loss: 2.306922 2025-01-16 01:30:15,668 - INFO - step 5862, loss: 3.349808, best loss: 2.306922 2025-01-16 01:30:15,818 - INFO - step 5863, loss: 2.933040, best loss: 2.306922 2025-01-16 01:30:15,969 - INFO - step 5864, loss: 2.787588, best loss: 2.306922 2025-01-16 01:30:16,119 - INFO - step 5865, loss: 2.802023, best loss: 2.306922 2025-01-16 01:30:16,269 - INFO - step 5866, loss: 2.869606, best loss: 2.306922 2025-01-16 01:30:16,420 - INFO - step 5867, loss: 3.043123, best loss: 2.306922 2025-01-16 01:30:16,570 - INFO - step 5868, loss: 3.318100, best loss: 2.306922 2025-01-16 01:30:16,720 - INFO - step 5869, loss: 3.203579, best loss: 2.306922 2025-01-16 01:30:16,870 - INFO - step 5870, loss: 3.055254, best loss: 2.306922 2025-01-16 01:30:17,020 - INFO - step 5871, loss: 3.209079, best loss: 2.306922 2025-01-16 01:30:17,170 - INFO - step 5872, loss: 3.176052, best loss: 2.306922 2025-01-16 01:30:17,320 - INFO - step 5873, loss: 3.066686, best loss: 2.306922 2025-01-16 01:30:17,470 - INFO - step 5874, loss: 3.274978, best loss: 2.306922 2025-01-16 01:30:17,621 - INFO - step 5875, loss: 3.408566, best loss: 2.306922 2025-01-16 01:30:17,771 - INFO - step 5876, loss: 2.945047, best loss: 2.306922 2025-01-16 01:30:17,921 - INFO - step 5877, loss: 2.928852, best loss: 2.306922 2025-01-16 01:30:18,071 - INFO - step 5878, loss: 3.154279, best loss: 2.306922 2025-01-16 01:30:18,221 - INFO - step 5879, loss: 3.269962, best loss: 2.306922 2025-01-16 01:30:18,372 - INFO - step 5880, loss: 2.890340, best loss: 2.306922 2025-01-16 01:30:18,522 - INFO - step 5881, loss: 2.905249, best loss: 2.306922 2025-01-16 01:30:18,672 - INFO - step 5882, loss: 3.177830, best loss: 2.306922 2025-01-16 01:30:18,822 - INFO - step 5883, loss: 3.025091, best loss: 2.306922 2025-01-16 01:30:18,972 - INFO - step 5884, loss: 2.900814, best 
loss: 2.306922 2025-01-16 01:30:19,122 - INFO - step 5885, loss: 3.285578, best loss: 2.306922 2025-01-16 01:30:19,273 - INFO - step 5886, loss: 3.020141, best loss: 2.306922 2025-01-16 01:30:19,423 - INFO - step 5887, loss: 2.960147, best loss: 2.306922 2025-01-16 01:30:19,575 - INFO - step 5888, loss: 2.946017, best loss: 2.306922 2025-01-16 01:30:19,725 - INFO - step 5889, loss: 3.170894, best loss: 2.306922 2025-01-16 01:30:19,875 - INFO - step 5890, loss: 2.910749, best loss: 2.306922 2025-01-16 01:30:20,025 - INFO - step 5891, loss: 2.992079, best loss: 2.306922 2025-01-16 01:30:20,175 - INFO - step 5892, loss: 2.817909, best loss: 2.306922 2025-01-16 01:30:20,326 - INFO - step 5893, loss: 3.186235, best loss: 2.306922 2025-01-16 01:30:20,476 - INFO - step 5894, loss: 3.586122, best loss: 2.306922 2025-01-16 01:30:20,626 - INFO - step 5895, loss: 3.641160, best loss: 2.306922 2025-01-16 01:30:20,776 - INFO - step 5896, loss: 3.244041, best loss: 2.306922 2025-01-16 01:30:20,927 - INFO - step 5897, loss: 3.315970, best loss: 2.306922 2025-01-16 01:30:21,077 - INFO - step 5898, loss: 3.229058, best loss: 2.306922 2025-01-16 01:30:21,227 - INFO - step 5899, loss: 3.233867, best loss: 2.306922 2025-01-16 01:30:21,377 - INFO - step 5900, loss: 3.005827, best loss: 2.306922 2025-01-16 01:30:21,528 - INFO - step 5901, loss: 3.304789, best loss: 2.306922 2025-01-16 01:30:21,677 - INFO - step 5902, loss: 3.014592, best loss: 2.306922 2025-01-16 01:30:21,827 - INFO - step 5903, loss: 2.813024, best loss: 2.306922 2025-01-16 01:30:21,978 - INFO - step 5904, loss: 3.083617, best loss: 2.306922 2025-01-16 01:30:22,127 - INFO - step 5905, loss: 3.040543, best loss: 2.306922 2025-01-16 01:30:22,277 - INFO - step 5906, loss: 3.292849, best loss: 2.306922 2025-01-16 01:30:22,428 - INFO - step 5907, loss: 2.513895, best loss: 2.306922 2025-01-16 01:30:22,578 - INFO - step 5908, loss: 3.071302, best loss: 2.306922 2025-01-16 01:30:22,728 - INFO - step 5909, loss: 3.266681, best 
loss: 2.306922 2025-01-16 01:30:22,878 - INFO - step 5910, loss: 3.175574, best loss: 2.306922 2025-01-16 01:30:23,028 - INFO - step 5911, loss: 3.058661, best loss: 2.306922 2025-01-16 01:30:23,178 - INFO - step 5912, loss: 3.095952, best loss: 2.306922 2025-01-16 01:30:23,328 - INFO - step 5913, loss: 3.114594, best loss: 2.306922 2025-01-16 01:30:23,478 - INFO - step 5914, loss: 2.890411, best loss: 2.306922 2025-01-16 01:30:23,629 - INFO - step 5915, loss: 3.100356, best loss: 2.306922 2025-01-16 01:30:23,779 - INFO - step 5916, loss: 3.045787, best loss: 2.306922 2025-01-16 01:30:23,929 - INFO - step 5917, loss: 3.250070, best loss: 2.306922 2025-01-16 01:30:24,080 - INFO - step 5918, loss: 2.946192, best loss: 2.306922 2025-01-16 01:30:24,230 - INFO - step 5919, loss: 2.987069, best loss: 2.306922 2025-01-16 01:30:24,380 - INFO - step 5920, loss: 2.925180, best loss: 2.306922 2025-01-16 01:30:24,530 - INFO - step 5921, loss: 2.840188, best loss: 2.306922 2025-01-16 01:30:24,680 - INFO - step 5922, loss: 2.986493, best loss: 2.306922 2025-01-16 01:30:24,831 - INFO - step 5923, loss: 2.995901, best loss: 2.306922 2025-01-16 01:30:24,981 - INFO - step 5924, loss: 2.928002, best loss: 2.306922 2025-01-16 01:30:25,131 - INFO - step 5925, loss: 2.650021, best loss: 2.306922 2025-01-16 01:30:25,281 - INFO - step 5926, loss: 2.603271, best loss: 2.306922 2025-01-16 01:30:25,431 - INFO - step 5927, loss: 2.460894, best loss: 2.306922 2025-01-16 01:30:25,581 - INFO - step 5928, loss: 3.097270, best loss: 2.306922 2025-01-16 01:30:25,732 - INFO - step 5929, loss: 3.288301, best loss: 2.306922 2025-01-16 01:30:25,882 - INFO - step 5930, loss: 3.301532, best loss: 2.306922 2025-01-16 01:30:26,032 - INFO - step 5931, loss: 3.435623, best loss: 2.306922 2025-01-16 01:30:26,182 - INFO - step 5932, loss: 3.451889, best loss: 2.306922 2025-01-16 01:30:26,332 - INFO - step 5933, loss: 3.045959, best loss: 2.306922 2025-01-16 01:30:26,482 - INFO - step 5934, loss: 3.209038, best 
loss: 2.306922 2025-01-16 01:30:26,633 - INFO - step 5935, loss: 3.377831, best loss: 2.306922 2025-01-16 01:30:26,783 - INFO - step 5936, loss: 3.081700, best loss: 2.306922 2025-01-16 01:30:26,933 - INFO - step 5937, loss: 2.617275, best loss: 2.306922 2025-01-16 01:30:27,083 - INFO - step 5938, loss: 2.971704, best loss: 2.306922 2025-01-16 01:30:27,233 - INFO - step 5939, loss: 2.821244, best loss: 2.306922 2025-01-16 01:30:27,383 - INFO - step 5940, loss: 3.358436, best loss: 2.306922 2025-01-16 01:30:27,533 - INFO - step 5941, loss: 3.267382, best loss: 2.306922 2025-01-16 01:30:27,684 - INFO - step 5942, loss: 3.346269, best loss: 2.306922 2025-01-16 01:30:27,834 - INFO - step 5943, loss: 3.313418, best loss: 2.306922 2025-01-16 01:30:27,984 - INFO - step 5944, loss: 3.348027, best loss: 2.306922 2025-01-16 01:30:28,134 - INFO - step 5945, loss: 2.932754, best loss: 2.306922 2025-01-16 01:30:28,284 - INFO - step 5946, loss: 3.350142, best loss: 2.306922 2025-01-16 01:30:28,434 - INFO - step 5947, loss: 3.210901, best loss: 2.306922 2025-01-16 01:30:28,584 - INFO - step 5948, loss: 3.369943, best loss: 2.306922 2025-01-16 01:30:28,734 - INFO - step 5949, loss: 3.244141, best loss: 2.306922 2025-01-16 01:30:28,885 - INFO - step 5950, loss: 3.125137, best loss: 2.306922 2025-01-16 01:30:29,035 - INFO - step 5951, loss: 3.457458, best loss: 2.306922 2025-01-16 01:30:29,185 - INFO - step 5952, loss: 3.027733, best loss: 2.306922 2025-01-16 01:30:29,335 - INFO - step 5953, loss: 3.311993, best loss: 2.306922 2025-01-16 01:30:29,485 - INFO - step 5954, loss: 3.334688, best loss: 2.306922 2025-01-16 01:30:29,636 - INFO - step 5955, loss: 3.350538, best loss: 2.306922 2025-01-16 01:30:29,786 - INFO - step 5956, loss: 3.245429, best loss: 2.306922 2025-01-16 01:30:29,936 - INFO - step 5957, loss: 3.052475, best loss: 2.306922 2025-01-16 01:30:30,086 - INFO - step 5958, loss: 3.060544, best loss: 2.306922 2025-01-16 01:30:30,236 - INFO - step 5959, loss: 3.092690, best 
loss: 2.306922 2025-01-16 01:30:30,386 - INFO - step 5960, loss: 2.733381, best loss: 2.306922 2025-01-16 01:30:30,537 - INFO - step 5961, loss: 3.443415, best loss: 2.306922 2025-01-16 01:30:30,687 - INFO - step 5962, loss: 2.646810, best loss: 2.306922 2025-01-16 01:30:30,837 - INFO - step 5963, loss: 2.725310, best loss: 2.306922 2025-01-16 01:30:30,988 - INFO - step 5964, loss: 3.134251, best loss: 2.306922 2025-01-16 01:30:31,138 - INFO - step 5965, loss: 3.154002, best loss: 2.306922 2025-01-16 01:30:31,288 - INFO - step 5966, loss: 3.071826, best loss: 2.306922 2025-01-16 01:30:31,438 - INFO - step 5967, loss: 2.832639, best loss: 2.306922 2025-01-16 01:30:31,588 - INFO - step 5968, loss: 3.055557, best loss: 2.306922 2025-01-16 01:30:31,738 - INFO - step 5969, loss: 3.180229, best loss: 2.306922 2025-01-16 01:30:31,888 - INFO - step 5970, loss: 2.895765, best loss: 2.306922 2025-01-16 01:30:32,039 - INFO - step 5971, loss: 2.823471, best loss: 2.306922 2025-01-16 01:30:32,189 - INFO - step 5972, loss: 3.191976, best loss: 2.306922 2025-01-16 01:30:32,339 - INFO - step 5973, loss: 3.058805, best loss: 2.306922 2025-01-16 01:30:32,489 - INFO - step 5974, loss: 2.971428, best loss: 2.306922 2025-01-16 01:30:32,640 - INFO - step 5975, loss: 2.769361, best loss: 2.306922 2025-01-16 01:30:32,790 - INFO - step 5976, loss: 3.124376, best loss: 2.306922 2025-01-16 01:30:32,940 - INFO - step 5977, loss: 3.341939, best loss: 2.306922 2025-01-16 01:30:33,090 - INFO - step 5978, loss: 2.980555, best loss: 2.306922 2025-01-16 01:30:33,240 - INFO - step 5979, loss: 3.198454, best loss: 2.306922 2025-01-16 01:30:33,391 - INFO - step 5980, loss: 3.383526, best loss: 2.306922 2025-01-16 01:30:33,541 - INFO - step 5981, loss: 3.296580, best loss: 2.306922 2025-01-16 01:30:33,691 - INFO - step 5982, loss: 3.306822, best loss: 2.306922 2025-01-16 01:30:33,841 - INFO - step 5983, loss: 3.112459, best loss: 2.306922 2025-01-16 01:30:33,991 - INFO - step 5984, loss: 3.139650, best 
loss: 2.306922 2025-01-16 01:30:34,141 - INFO - step 5985, loss: 2.988884, best loss: 2.306922 2025-01-16 01:30:34,291 - INFO - step 5986, loss: 3.625200, best loss: 2.306922 2025-01-16 01:30:34,442 - INFO - step 5987, loss: 3.121902, best loss: 2.306922 2025-01-16 01:30:34,592 - INFO - step 5988, loss: 3.456256, best loss: 2.306922 2025-01-16 01:30:34,742 - INFO - step 5989, loss: 2.745345, best loss: 2.306922 2025-01-16 01:30:34,892 - INFO - step 5990, loss: 2.830272, best loss: 2.306922 2025-01-16 01:30:35,042 - INFO - step 5991, loss: 3.183768, best loss: 2.306922 2025-01-16 01:30:35,192 - INFO - step 5992, loss: 3.109704, best loss: 2.306922 2025-01-16 01:30:35,343 - INFO - step 5993, loss: 3.069342, best loss: 2.306922 2025-01-16 01:30:35,493 - INFO - step 5994, loss: 3.103982, best loss: 2.306922 2025-01-16 01:30:35,643 - INFO - step 5995, loss: 2.956607, best loss: 2.306922 2025-01-16 01:30:35,793 - INFO - step 5996, loss: 3.101398, best loss: 2.306922 2025-01-16 01:30:35,944 - INFO - step 5997, loss: 3.211982, best loss: 2.306922 2025-01-16 01:30:36,094 - INFO - step 5998, loss: 2.761522, best loss: 2.306922 2025-01-16 01:30:36,244 - INFO - step 5999, loss: 3.091534, best loss: 2.306922 2025-01-16 01:30:36,394 - INFO - step 6000, loss: 3.226873, best loss: 2.306922 2025-01-16 01:30:36,544 - INFO - step 6001, loss: 3.301357, best loss: 2.306922 2025-01-16 01:30:36,694 - INFO - step 6002, loss: 3.110310, best loss: 2.306922 2025-01-16 01:30:36,844 - INFO - step 6003, loss: 3.292420, best loss: 2.306922 2025-01-16 01:30:36,995 - INFO - step 6004, loss: 3.159779, best loss: 2.306922 2025-01-16 01:30:37,145 - INFO - step 6005, loss: 2.821839, best loss: 2.306922 2025-01-16 01:30:37,295 - INFO - step 6006, loss: 3.194755, best loss: 2.306922 2025-01-16 01:30:37,445 - INFO - step 6007, loss: 2.690229, best loss: 2.306922 2025-01-16 01:30:37,595 - INFO - step 6008, loss: 2.954490, best loss: 2.306922 2025-01-16 01:30:37,745 - INFO - step 6009, loss: 3.079680, best 
loss: 2.306922 2025-01-16 01:30:37,895 - INFO - step 6010, loss: 3.031124, best loss: 2.306922 2025-01-16 01:30:38,046 - INFO - step 6011, loss: 2.966941, best loss: 2.306922 2025-01-16 01:30:38,196 - INFO - step 6012, loss: 3.133250, best loss: 2.306922 2025-01-16 01:30:38,346 - INFO - step 6013, loss: 3.285688, best loss: 2.306922 2025-01-16 01:30:38,496 - INFO - step 6014, loss: 3.147337, best loss: 2.306922 2025-01-16 01:30:38,646 - INFO - step 6015, loss: 3.365614, best loss: 2.306922 2025-01-16 01:30:38,796 - INFO - step 6016, loss: 3.022330, best loss: 2.306922 2025-01-16 01:30:38,946 - INFO - step 6017, loss: 3.171901, best loss: 2.306922 2025-01-16 01:30:39,096 - INFO - step 6018, loss: 3.071057, best loss: 2.306922 2025-01-16 01:30:39,247 - INFO - step 6019, loss: 2.731734, best loss: 2.306922 2025-01-16 01:30:39,397 - INFO - step 6020, loss: 3.370416, best loss: 2.306922 2025-01-16 01:30:39,548 - INFO - step 6021, loss: 3.220238, best loss: 2.306922 2025-01-16 01:30:39,698 - INFO - step 6022, loss: 3.026262, best loss: 2.306922 2025-01-16 01:30:39,848 - INFO - step 6023, loss: 2.944635, best loss: 2.306922 2025-01-16 01:30:39,998 - INFO - step 6024, loss: 2.750129, best loss: 2.306922 2025-01-16 01:30:40,148 - INFO - step 6025, loss: 2.927701, best loss: 2.306922 2025-01-16 01:30:40,298 - INFO - step 6026, loss: 2.864922, best loss: 2.306922 2025-01-16 01:30:40,449 - INFO - step 6027, loss: 2.797572, best loss: 2.306922 2025-01-16 01:30:40,599 - INFO - step 6028, loss: 3.313810, best loss: 2.306922 2025-01-16 01:30:40,749 - INFO - step 6029, loss: 3.353008, best loss: 2.306922 2025-01-16 01:30:40,900 - INFO - step 6030, loss: 3.071777, best loss: 2.306922 2025-01-16 01:30:41,050 - INFO - step 6031, loss: 3.330125, best loss: 2.306922 2025-01-16 01:30:41,200 - INFO - step 6032, loss: 3.050646, best loss: 2.306922 2025-01-16 01:30:41,350 - INFO - step 6033, loss: 3.414434, best loss: 2.306922 2025-01-16 01:30:41,500 - INFO - step 6034, loss: 3.475272, best 
loss: 2.306922 2025-01-16 01:30:41,650 - INFO - step 6035, loss: 3.491540, best loss: 2.306922 2025-01-16 01:30:41,800 - INFO - step 6036, loss: 3.555604, best loss: 2.306922 2025-01-16 01:30:41,951 - INFO - step 6037, loss: 3.188876, best loss: 2.306922 2025-01-16 01:30:42,101 - INFO - step 6038, loss: 3.355858, best loss: 2.306922 2025-01-16 01:30:42,251 - INFO - step 6039, loss: 3.283351, best loss: 2.306922 2025-01-16 01:30:42,402 - INFO - step 6040, loss: 3.153793, best loss: 2.306922 2025-01-16 01:30:42,552 - INFO - step 6041, loss: 3.466751, best loss: 2.306922 2025-01-16 01:30:42,702 - INFO - step 6042, loss: 3.438596, best loss: 2.306922 2025-01-16 01:30:42,852 - INFO - step 6043, loss: 3.414631, best loss: 2.306922 2025-01-16 01:30:43,002 - INFO - step 6044, loss: 3.133476, best loss: 2.306922 2025-01-16 01:30:43,152 - INFO - step 6045, loss: 3.239676, best loss: 2.306922 2025-01-16 01:30:43,302 - INFO - step 6046, loss: 3.189051, best loss: 2.306922 2025-01-16 01:30:43,453 - INFO - step 6047, loss: 3.045084, best loss: 2.306922 2025-01-16 01:30:43,603 - INFO - step 6048, loss: 2.943388, best loss: 2.306922 2025-01-16 01:30:43,753 - INFO - step 6049, loss: 3.376415, best loss: 2.306922 2025-01-16 01:30:43,903 - INFO - step 6050, loss: 3.261309, best loss: 2.306922 2025-01-16 01:30:44,053 - INFO - step 6051, loss: 3.503983, best loss: 2.306922 2025-01-16 01:30:44,204 - INFO - step 6052, loss: 3.306164, best loss: 2.306922 2025-01-16 01:30:44,354 - INFO - step 6053, loss: 3.069278, best loss: 2.306922 2025-01-16 01:30:44,505 - INFO - step 6054, loss: 3.444306, best loss: 2.306922 2025-01-16 01:30:44,655 - INFO - step 6055, loss: 3.086816, best loss: 2.306922 2025-01-16 01:30:44,805 - INFO - step 6056, loss: 3.340925, best loss: 2.306922 2025-01-16 01:30:44,955 - INFO - step 6057, loss: 3.038962, best loss: 2.306922 2025-01-16 01:30:45,105 - INFO - step 6058, loss: 3.164499, best loss: 2.306922 2025-01-16 01:30:45,255 - INFO - step 6059, loss: 3.130853, best 
loss: 2.306922 2025-01-16 01:30:45,406 - INFO - step 6060, loss: 3.273699, best loss: 2.306922 2025-01-16 01:30:45,556 - INFO - step 6061, loss: 3.168770, best loss: 2.306922 2025-01-16 01:30:45,706 - INFO - step 6062, loss: 3.219556, best loss: 2.306922 2025-01-16 01:30:45,856 - INFO - step 6063, loss: 2.857704, best loss: 2.306922 2025-01-16 01:30:46,006 - INFO - step 6064, loss: 2.621265, best loss: 2.306922 2025-01-16 01:30:46,157 - INFO - step 6065, loss: 2.783629, best loss: 2.306922 2025-01-16 01:30:46,307 - INFO - step 6066, loss: 3.026592, best loss: 2.306922 2025-01-16 01:30:46,457 - INFO - step 6067, loss: 3.460999, best loss: 2.306922 2025-01-16 01:30:46,607 - INFO - step 6068, loss: 3.123584, best loss: 2.306922 2025-01-16 01:30:46,757 - INFO - step 6069, loss: 2.713565, best loss: 2.306922 2025-01-16 01:30:46,907 - INFO - step 6070, loss: 3.269413, best loss: 2.306922 2025-01-16 01:30:47,058 - INFO - step 6071, loss: 3.020804, best loss: 2.306922 2025-01-16 01:30:47,208 - INFO - step 6072, loss: 3.574258, best loss: 2.306922 2025-01-16 01:30:47,358 - INFO - step 6073, loss: 2.897418, best loss: 2.306922 2025-01-16 01:30:47,509 - INFO - step 6074, loss: 3.164810, best loss: 2.306922 2025-01-16 01:30:47,659 - INFO - step 6075, loss: 2.997966, best loss: 2.306922 2025-01-16 01:30:47,809 - INFO - step 6076, loss: 3.226509, best loss: 2.306922 2025-01-16 01:30:47,959 - INFO - step 6077, loss: 3.142973, best loss: 2.306922 2025-01-16 01:30:48,109 - INFO - step 6078, loss: 2.953014, best loss: 2.306922 2025-01-16 01:30:48,260 - INFO - step 6079, loss: 3.252307, best loss: 2.306922 2025-01-16 01:30:48,410 - INFO - step 6080, loss: 3.101841, best loss: 2.306922 2025-01-16 01:30:48,560 - INFO - step 6081, loss: 2.958538, best loss: 2.306922 2025-01-16 01:30:48,710 - INFO - step 6082, loss: 3.515266, best loss: 2.306922 2025-01-16 01:30:48,860 - INFO - step 6083, loss: 3.077358, best loss: 2.306922 2025-01-16 01:30:49,010 - INFO - step 6084, loss: 2.679705, best 
loss: 2.306922 2025-01-16 01:30:49,160 - INFO - step 6085, loss: 3.109997, best loss: 2.306922 2025-01-16 01:30:49,311 - INFO - step 6086, loss: 3.319501, best loss: 2.306922 2025-01-16 01:30:49,461 - INFO - step 6087, loss: 3.193490, best loss: 2.306922 2025-01-16 01:30:49,612 - INFO - step 6088, loss: 2.982494, best loss: 2.306922 2025-01-16 01:30:49,762 - INFO - step 6089, loss: 2.979482, best loss: 2.306922 2025-01-16 01:30:49,912 - INFO - step 6090, loss: 3.438780, best loss: 2.306922 2025-01-16 01:30:50,063 - INFO - step 6091, loss: 3.334310, best loss: 2.306922 2025-01-16 01:30:50,213 - INFO - step 6092, loss: 3.291302, best loss: 2.306922 2025-01-16 01:30:50,363 - INFO - step 6093, loss: 2.938351, best loss: 2.306922 2025-01-16 01:30:50,513 - INFO - step 6094, loss: 3.231928, best loss: 2.306922 2025-01-16 01:30:50,663 - INFO - step 6095, loss: 3.153788, best loss: 2.306922 2025-01-16 01:30:50,813 - INFO - step 6096, loss: 2.869294, best loss: 2.306922 2025-01-16 01:30:50,964 - INFO - step 6097, loss: 3.177659, best loss: 2.306922 2025-01-16 01:30:51,114 - INFO - step 6098, loss: 2.963767, best loss: 2.306922 2025-01-16 01:30:51,264 - INFO - step 6099, loss: 3.182240, best loss: 2.306922 2025-01-16 01:30:51,414 - INFO - step 6100, loss: 2.928669, best loss: 2.306922 2025-01-16 01:30:51,564 - INFO - step 6101, loss: 3.167407, best loss: 2.306922 2025-01-16 01:30:51,714 - INFO - step 6102, loss: 2.999702, best loss: 2.306922 2025-01-16 01:30:51,864 - INFO - step 6103, loss: 2.947953, best loss: 2.306922 2025-01-16 01:30:52,014 - INFO - step 6104, loss: 3.278409, best loss: 2.306922 2025-01-16 01:30:52,164 - INFO - step 6105, loss: 3.124427, best loss: 2.306922 2025-01-16 01:30:52,314 - INFO - step 6106, loss: 3.377766, best loss: 2.306922 2025-01-16 01:30:52,465 - INFO - step 6107, loss: 3.005054, best loss: 2.306922 2025-01-16 01:30:52,615 - INFO - step 6108, loss: 3.203682, best loss: 2.306922 2025-01-16 01:30:52,765 - INFO - step 6109, loss: 3.330989, best 
loss: 2.306922 2025-01-16 01:30:52,915 - INFO - step 6110, loss: 2.981881, best loss: 2.306922 2025-01-16 01:30:53,065 - INFO - step 6111, loss: 2.881991, best loss: 2.306922 2025-01-16 01:30:53,215 - INFO - step 6112, loss: 2.908419, best loss: 2.306922 2025-01-16 01:30:53,365 - INFO - step 6113, loss: 3.115196, best loss: 2.306922 2025-01-16 01:30:53,515 - INFO - step 6114, loss: 3.134130, best loss: 2.306922 2025-01-16 01:30:53,665 - INFO - step 6115, loss: 3.243369, best loss: 2.306922 2025-01-16 01:30:53,815 - INFO - step 6116, loss: 3.449605, best loss: 2.306922 2025-01-16 01:30:53,965 - INFO - step 6117, loss: 3.296612, best loss: 2.306922 2025-01-16 01:30:54,116 - INFO - step 6118, loss: 3.311688, best loss: 2.306922 2025-01-16 01:30:54,266 - INFO - step 6119, loss: 3.328928, best loss: 2.306922 2025-01-16 01:30:54,416 - INFO - step 6120, loss: 3.323168, best loss: 2.306922 2025-01-16 01:30:54,566 - INFO - step 6121, loss: 2.940230, best loss: 2.306922 2025-01-16 01:30:54,716 - INFO - step 6122, loss: 3.404131, best loss: 2.306922 2025-01-16 01:30:54,866 - INFO - step 6123, loss: 3.505745, best loss: 2.306922 2025-01-16 01:30:55,017 - INFO - step 6124, loss: 3.338736, best loss: 2.306922 2025-01-16 01:30:55,167 - INFO - step 6125, loss: 3.263453, best loss: 2.306922 2025-01-16 01:30:55,317 - INFO - step 6126, loss: 3.339088, best loss: 2.306922 2025-01-16 01:30:55,467 - INFO - step 6127, loss: 3.034648, best loss: 2.306922 2025-01-16 01:30:58,945 - INFO - step 6128, loss: 2.248890, best loss: 2.248890 2025-01-16 01:30:59,107 - INFO - step 6129, loss: 3.199384, best loss: 2.248890 2025-01-16 01:30:59,263 - INFO - step 6130, loss: 3.229793, best loss: 2.248890 2025-01-16 01:30:59,414 - INFO - step 6131, loss: 3.305601, best loss: 2.248890 2025-01-16 01:30:59,564 - INFO - step 6132, loss: 3.342542, best loss: 2.248890 2025-01-16 01:30:59,714 - INFO - step 6133, loss: 3.001309, best loss: 2.248890 2025-01-16 01:30:59,864 - INFO - step 6134, loss: 3.225477, best 
loss: 2.248890 2025-01-16 01:31:00,014 - INFO - step 6135, loss: 3.279359, best loss: 2.248890 2025-01-16 01:31:00,164 - INFO - step 6136, loss: 3.111690, best loss: 2.248890 2025-01-16 01:31:00,314 - INFO - step 6137, loss: 3.096890, best loss: 2.248890 2025-01-16 01:31:00,464 - INFO - step 6138, loss: 3.024785, best loss: 2.248890 2025-01-16 01:31:00,614 - INFO - step 6139, loss: 2.895387, best loss: 2.248890 2025-01-16 01:31:00,764 - INFO - step 6140, loss: 3.143259, best loss: 2.248890 2025-01-16 01:31:00,914 - INFO - step 6141, loss: 2.930918, best loss: 2.248890 2025-01-16 01:31:01,064 - INFO - step 6142, loss: 3.114649, best loss: 2.248890 2025-01-16 01:31:01,214 - INFO - step 6143, loss: 3.374575, best loss: 2.248890 2025-01-16 01:31:01,364 - INFO - step 6144, loss: 3.045310, best loss: 2.248890 2025-01-16 01:31:01,514 - INFO - step 6145, loss: 2.730306, best loss: 2.248890 2025-01-16 01:31:01,665 - INFO - step 6146, loss: 3.135914, best loss: 2.248890 2025-01-16 01:31:01,815 - INFO - step 6147, loss: 3.238369, best loss: 2.248890 2025-01-16 01:31:01,965 - INFO - step 6148, loss: 3.332962, best loss: 2.248890 2025-01-16 01:31:02,116 - INFO - step 6149, loss: 3.255105, best loss: 2.248890 2025-01-16 01:31:02,266 - INFO - step 6150, loss: 3.247564, best loss: 2.248890 2025-01-16 01:31:02,416 - INFO - step 6151, loss: 3.133047, best loss: 2.248890 2025-01-16 01:31:02,567 - INFO - step 6152, loss: 3.382207, best loss: 2.248890 2025-01-16 01:31:02,717 - INFO - step 6153, loss: 2.999559, best loss: 2.248890 2025-01-16 01:31:02,867 - INFO - step 6154, loss: 3.087002, best loss: 2.248890 2025-01-16 01:31:03,017 - INFO - step 6155, loss: 3.260498, best loss: 2.248890 2025-01-16 01:31:03,167 - INFO - step 6156, loss: 3.339751, best loss: 2.248890 2025-01-16 01:31:03,317 - INFO - step 6157, loss: 3.271347, best loss: 2.248890 2025-01-16 01:31:03,467 - INFO - step 6158, loss: 3.010610, best loss: 2.248890 2025-01-16 01:31:03,617 - INFO - step 6159, loss: 3.087929, best 
loss: 2.248890 2025-01-16 01:31:03,767 - INFO - step 6160, loss: 3.246462, best loss: 2.248890 2025-01-16 01:31:03,917 - INFO - step 6161, loss: 3.467804, best loss: 2.248890 2025-01-16 01:31:04,068 - INFO - step 6162, loss: 3.246452, best loss: 2.248890 2025-01-16 01:31:04,218 - INFO - step 6163, loss: 3.491177, best loss: 2.248890 2025-01-16 01:31:04,368 - INFO - step 6164, loss: 3.486682, best loss: 2.248890 2025-01-16 01:31:04,518 - INFO - step 6165, loss: 3.373518, best loss: 2.248890 2025-01-16 01:31:04,668 - INFO - step 6166, loss: 3.602573, best loss: 2.248890 2025-01-16 01:31:04,818 - INFO - step 6167, loss: 3.352408, best loss: 2.248890 2025-01-16 01:31:04,968 - INFO - step 6168, loss: 2.955461, best loss: 2.248890 2025-01-16 01:31:05,118 - INFO - step 6169, loss: 3.401042, best loss: 2.248890 2025-01-16 01:31:05,269 - INFO - step 6170, loss: 3.317640, best loss: 2.248890 2025-01-16 01:31:05,419 - INFO - step 6171, loss: 3.362695, best loss: 2.248890 2025-01-16 01:31:05,569 - INFO - step 6172, loss: 3.071767, best loss: 2.248890 2025-01-16 01:31:05,719 - INFO - step 6173, loss: 3.219787, best loss: 2.248890 2025-01-16 01:31:05,869 - INFO - step 6174, loss: 3.218164, best loss: 2.248890 2025-01-16 01:31:06,019 - INFO - step 6175, loss: 3.119533, best loss: 2.248890 2025-01-16 01:31:06,170 - INFO - step 6176, loss: 3.343682, best loss: 2.248890 2025-01-16 01:31:06,320 - INFO - step 6177, loss: 2.990178, best loss: 2.248890 2025-01-16 01:31:06,470 - INFO - step 6178, loss: 2.926671, best loss: 2.248890 2025-01-16 01:31:06,620 - INFO - step 6179, loss: 3.431054, best loss: 2.248890 2025-01-16 01:31:06,771 - INFO - step 6180, loss: 3.356340, best loss: 2.248890 2025-01-16 01:31:06,921 - INFO - step 6181, loss: 3.404232, best loss: 2.248890 2025-01-16 01:31:07,071 - INFO - step 6182, loss: 3.206101, best loss: 2.248890 2025-01-16 01:31:07,221 - INFO - step 6183, loss: 3.398444, best loss: 2.248890 2025-01-16 01:31:07,371 - INFO - step 6184, loss: 3.379216, best 
loss: 2.248890 2025-01-16 01:31:07,521 - INFO - step 6185, loss: 3.050464, best loss: 2.248890 2025-01-16 01:31:07,672 - INFO - step 6186, loss: 3.058664, best loss: 2.248890 2025-01-16 01:31:07,822 - INFO - step 6187, loss: 3.346552, best loss: 2.248890 2025-01-16 01:31:07,972 - INFO - step 6188, loss: 2.985601, best loss: 2.248890 2025-01-16 01:31:08,122 - INFO - step 6189, loss: 2.782027, best loss: 2.248890 2025-01-16 01:31:08,272 - INFO - step 6190, loss: 3.192411, best loss: 2.248890 2025-01-16 01:31:08,422 - INFO - step 6191, loss: 3.166231, best loss: 2.248890 2025-01-16 01:31:08,572 - INFO - step 6192, loss: 3.177676, best loss: 2.248890 2025-01-16 01:31:08,723 - INFO - step 6193, loss: 2.825030, best loss: 2.248890 2025-01-16 01:31:08,873 - INFO - step 6194, loss: 2.613223, best loss: 2.248890 2025-01-16 01:31:09,023 - INFO - step 6195, loss: 2.603108, best loss: 2.248890 2025-01-16 01:31:09,173 - INFO - step 6196, loss: 2.763702, best loss: 2.248890 2025-01-16 01:31:09,324 - INFO - step 6197, loss: 2.979688, best loss: 2.248890 2025-01-16 01:31:09,474 - INFO - step 6198, loss: 3.233426, best loss: 2.248890 2025-01-16 01:31:09,624 - INFO - step 6199, loss: 3.076514, best loss: 2.248890 2025-01-16 01:31:09,774 - INFO - step 6200, loss: 2.997381, best loss: 2.248890 2025-01-16 01:31:09,924 - INFO - step 6201, loss: 3.066628, best loss: 2.248890 2025-01-16 01:31:10,074 - INFO - step 6202, loss: 3.057637, best loss: 2.248890 2025-01-16 01:31:10,225 - INFO - step 6203, loss: 2.911511, best loss: 2.248890 2025-01-16 01:31:10,375 - INFO - step 6204, loss: 3.182580, best loss: 2.248890 2025-01-16 01:31:10,525 - INFO - step 6205, loss: 3.214460, best loss: 2.248890 2025-01-16 01:31:10,675 - INFO - step 6206, loss: 2.772630, best loss: 2.248890 2025-01-16 01:31:10,825 - INFO - step 6207, loss: 2.809319, best loss: 2.248890 2025-01-16 01:31:10,976 - INFO - step 6208, loss: 3.054154, best loss: 2.248890 2025-01-16 01:31:11,126 - INFO - step 6209, loss: 3.207796, best 
loss: 2.248890 2025-01-16 01:31:11,276 - INFO - step 6210, loss: 2.825675, best loss: 2.248890 2025-01-16 01:31:11,426 - INFO - step 6211, loss: 2.845442, best loss: 2.248890 2025-01-16 01:31:11,576 - INFO - step 6212, loss: 3.145942, best loss: 2.248890 2025-01-16 01:31:11,727 - INFO - step 6213, loss: 2.906142, best loss: 2.248890 2025-01-16 01:31:11,877 - INFO - step 6214, loss: 2.745228, best loss: 2.248890 2025-01-16 01:31:12,027 - INFO - step 6215, loss: 3.137694, best loss: 2.248890 2025-01-16 01:31:12,177 - INFO - step 6216, loss: 2.892488, best loss: 2.248890 2025-01-16 01:31:12,328 - INFO - step 6217, loss: 2.898992, best loss: 2.248890 2025-01-16 01:31:12,478 - INFO - step 6218, loss: 2.827921, best loss: 2.248890 2025-01-16 01:31:12,628 - INFO - step 6219, loss: 3.106405, best loss: 2.248890 2025-01-16 01:31:12,779 - INFO - step 6220, loss: 2.745395, best loss: 2.248890 2025-01-16 01:31:12,929 - INFO - step 6221, loss: 2.822960, best loss: 2.248890 2025-01-16 01:31:13,079 - INFO - step 6222, loss: 2.740090, best loss: 2.248890 2025-01-16 01:31:13,229 - INFO - step 6223, loss: 3.078969, best loss: 2.248890 2025-01-16 01:31:13,379 - INFO - step 6224, loss: 3.405576, best loss: 2.248890 2025-01-16 01:31:13,530 - INFO - step 6225, loss: 3.465320, best loss: 2.248890 2025-01-16 01:31:13,680 - INFO - step 6226, loss: 3.174522, best loss: 2.248890 2025-01-16 01:31:13,830 - INFO - step 6227, loss: 3.299736, best loss: 2.248890 2025-01-16 01:31:13,980 - INFO - step 6228, loss: 3.228391, best loss: 2.248890 2025-01-16 01:31:14,130 - INFO - step 6229, loss: 3.248744, best loss: 2.248890 2025-01-16 01:31:14,281 - INFO - step 6230, loss: 2.943815, best loss: 2.248890 2025-01-16 01:31:14,431 - INFO - step 6231, loss: 3.195791, best loss: 2.248890 2025-01-16 01:31:14,581 - INFO - step 6232, loss: 2.889719, best loss: 2.248890 2025-01-16 01:31:14,732 - INFO - step 6233, loss: 2.700836, best loss: 2.248890 2025-01-16 01:31:14,882 - INFO - step 6234, loss: 2.964546, best 
loss: 2.248890 2025-01-16 01:31:15,032 - INFO - step 6235, loss: 2.928252, best loss: 2.248890 2025-01-16 01:31:15,182 - INFO - step 6236, loss: 3.197802, best loss: 2.248890 2025-01-16 01:31:15,333 - INFO - step 6237, loss: 2.406370, best loss: 2.248890 2025-01-16 01:31:15,483 - INFO - step 6238, loss: 3.025145, best loss: 2.248890 2025-01-16 01:31:15,633 - INFO - step 6239, loss: 3.162514, best loss: 2.248890 2025-01-16 01:31:15,783 - INFO - step 6240, loss: 3.098558, best loss: 2.248890 2025-01-16 01:31:15,934 - INFO - step 6241, loss: 2.986922, best loss: 2.248890 2025-01-16 01:31:16,084 - INFO - step 6242, loss: 2.915401, best loss: 2.248890 2025-01-16 01:31:16,234 - INFO - step 6243, loss: 3.019525, best loss: 2.248890 2025-01-16 01:31:16,384 - INFO - step 6244, loss: 2.763220, best loss: 2.248890 2025-01-16 01:31:16,534 - INFO - step 6245, loss: 3.025105, best loss: 2.248890 2025-01-16 01:31:16,684 - INFO - step 6246, loss: 2.863610, best loss: 2.248890 2025-01-16 01:31:16,835 - INFO - step 6247, loss: 3.098087, best loss: 2.248890 2025-01-16 01:31:16,985 - INFO - step 6248, loss: 2.759265, best loss: 2.248890 2025-01-16 01:31:17,135 - INFO - step 6249, loss: 2.894583, best loss: 2.248890 2025-01-16 01:31:17,285 - INFO - step 6250, loss: 2.783479, best loss: 2.248890 2025-01-16 01:31:17,435 - INFO - step 6251, loss: 2.726700, best loss: 2.248890 2025-01-16 01:31:17,586 - INFO - step 6252, loss: 2.936656, best loss: 2.248890 2025-01-16 01:31:17,736 - INFO - step 6253, loss: 2.853016, best loss: 2.248890 2025-01-16 01:31:17,886 - INFO - step 6254, loss: 2.846208, best loss: 2.248890 2025-01-16 01:31:18,036 - INFO - step 6255, loss: 2.601966, best loss: 2.248890 2025-01-16 01:31:18,186 - INFO - step 6256, loss: 2.487099, best loss: 2.248890 2025-01-16 01:31:18,337 - INFO - step 6257, loss: 2.368144, best loss: 2.248890 2025-01-16 01:31:18,487 - INFO - step 6258, loss: 2.991983, best loss: 2.248890 2025-01-16 01:31:18,637 - INFO - step 6259, loss: 3.209843, best 
loss: 2.248890 2025-01-16 01:31:18,787 - INFO - step 6260, loss: 3.239335, best loss: 2.248890 2025-01-16 01:31:18,937 - INFO - step 6261, loss: 3.427647, best loss: 2.248890 2025-01-16 01:31:19,088 - INFO - step 6262, loss: 3.400554, best loss: 2.248890 2025-01-16 01:31:19,238 - INFO - step 6263, loss: 3.036698, best loss: 2.248890 2025-01-16 01:31:19,388 - INFO - step 6264, loss: 3.064885, best loss: 2.248890 2025-01-16 01:31:19,538 - INFO - step 6265, loss: 3.297734, best loss: 2.248890 2025-01-16 01:31:19,689 - INFO - step 6266, loss: 2.992889, best loss: 2.248890 2025-01-16 01:31:19,839 - INFO - step 6267, loss: 2.572411, best loss: 2.248890 2025-01-16 01:31:19,989 - INFO - step 6268, loss: 2.883910, best loss: 2.248890 2025-01-16 01:31:20,139 - INFO - step 6269, loss: 2.745574, best loss: 2.248890 2025-01-16 01:31:20,289 - INFO - step 6270, loss: 3.220654, best loss: 2.248890 2025-01-16 01:31:20,439 - INFO - step 6271, loss: 3.144555, best loss: 2.248890 2025-01-16 01:31:20,589 - INFO - step 6272, loss: 3.243311, best loss: 2.248890 2025-01-16 01:31:20,739 - INFO - step 6273, loss: 3.110092, best loss: 2.248890 2025-01-16 01:31:20,889 - INFO - step 6274, loss: 3.189014, best loss: 2.248890 2025-01-16 01:31:21,040 - INFO - step 6275, loss: 2.824378, best loss: 2.248890 2025-01-16 01:31:21,190 - INFO - step 6276, loss: 3.220750, best loss: 2.248890 2025-01-16 01:31:21,340 - INFO - step 6277, loss: 3.089321, best loss: 2.248890 2025-01-16 01:31:21,490 - INFO - step 6278, loss: 3.272600, best loss: 2.248890 2025-01-16 01:31:21,640 - INFO - step 6279, loss: 3.172834, best loss: 2.248890 2025-01-16 01:31:21,791 - INFO - step 6280, loss: 3.063179, best loss: 2.248890 2025-01-16 01:31:21,941 - INFO - step 6281, loss: 3.354115, best loss: 2.248890 2025-01-16 01:31:22,090 - INFO - step 6282, loss: 2.840957, best loss: 2.248890 2025-01-16 01:31:22,241 - INFO - step 6283, loss: 3.166223, best loss: 2.248890 2025-01-16 01:31:22,391 - INFO - step 6284, loss: 3.248038, best 
loss: 2.248890 2025-01-16 01:31:22,541 - INFO - step 6285, loss: 3.187111, best loss: 2.248890 2025-01-16 01:31:22,692 - INFO - step 6286, loss: 3.111704, best loss: 2.248890 2025-01-16 01:31:22,842 - INFO - step 6287, loss: 2.933809, best loss: 2.248890 2025-01-16 01:31:22,992 - INFO - step 6288, loss: 2.926102, best loss: 2.248890 2025-01-16 01:31:23,142 - INFO - step 6289, loss: 2.988093, best loss: 2.248890 2025-01-16 01:31:23,293 - INFO - step 6290, loss: 2.672268, best loss: 2.248890 2025-01-16 01:31:23,443 - INFO - step 6291, loss: 3.354702, best loss: 2.248890 2025-01-16 01:31:23,593 - INFO - step 6292, loss: 2.541866, best loss: 2.248890 2025-01-16 01:31:23,743 - INFO - step 6293, loss: 2.679463, best loss: 2.248890 2025-01-16 01:31:23,893 - INFO - step 6294, loss: 2.966171, best loss: 2.248890 2025-01-16 01:31:24,043 - INFO - step 6295, loss: 3.053817, best loss: 2.248890 2025-01-16 01:31:24,194 - INFO - step 6296, loss: 2.947632, best loss: 2.248890 2025-01-16 01:31:24,344 - INFO - step 6297, loss: 2.774256, best loss: 2.248890 2025-01-16 01:31:24,494 - INFO - step 6298, loss: 2.955880, best loss: 2.248890 2025-01-16 01:31:24,644 - INFO - step 6299, loss: 3.094546, best loss: 2.248890 2025-01-16 01:31:24,795 - INFO - step 6300, loss: 2.842789, best loss: 2.248890 2025-01-16 01:31:24,945 - INFO - step 6301, loss: 2.748898, best loss: 2.248890 2025-01-16 01:31:25,095 - INFO - step 6302, loss: 3.077139, best loss: 2.248890 2025-01-16 01:31:25,245 - INFO - step 6303, loss: 3.034533, best loss: 2.248890 2025-01-16 01:31:25,395 - INFO - step 6304, loss: 2.893264, best loss: 2.248890 2025-01-16 01:31:25,545 - INFO - step 6305, loss: 2.659910, best loss: 2.248890 2025-01-16 01:31:25,695 - INFO - step 6306, loss: 2.963969, best loss: 2.248890 2025-01-16 01:31:25,845 - INFO - step 6307, loss: 3.264345, best loss: 2.248890 2025-01-16 01:31:25,995 - INFO - step 6308, loss: 2.914996, best loss: 2.248890 2025-01-16 01:31:26,145 - INFO - step 6309, loss: 3.024463, best 
loss: 2.248890 2025-01-16 01:31:26,295 - INFO - step 6310, loss: 3.272140, best loss: 2.248890 2025-01-16 01:31:26,445 - INFO - step 6311, loss: 3.172918, best loss: 2.248890 2025-01-16 01:31:26,596 - INFO - step 6312, loss: 3.152294, best loss: 2.248890 2025-01-16 01:31:26,746 - INFO - step 6313, loss: 3.002244, best loss: 2.248890 2025-01-16 01:31:26,897 - INFO - step 6314, loss: 3.018522, best loss: 2.248890 2025-01-16 01:31:27,047 - INFO - step 6315, loss: 2.847690, best loss: 2.248890 2025-01-16 01:31:27,197 - INFO - step 6316, loss: 3.412579, best loss: 2.248890 2025-01-16 01:31:27,347 - INFO - step 6317, loss: 3.093052, best loss: 2.248890 2025-01-16 01:31:27,498 - INFO - step 6318, loss: 3.334032, best loss: 2.248890 2025-01-16 01:31:27,648 - INFO - step 6319, loss: 2.704397, best loss: 2.248890 2025-01-16 01:31:27,798 - INFO - step 6320, loss: 2.763073, best loss: 2.248890 2025-01-16 01:31:27,948 - INFO - step 6321, loss: 3.148687, best loss: 2.248890 2025-01-16 01:31:28,098 - INFO - step 6322, loss: 3.101537, best loss: 2.248890 2025-01-16 01:31:28,248 - INFO - step 6323, loss: 3.002105, best loss: 2.248890 2025-01-16 01:31:28,398 - INFO - step 6324, loss: 3.038953, best loss: 2.248890 2025-01-16 01:31:28,548 - INFO - step 6325, loss: 2.878504, best loss: 2.248890 2025-01-16 01:31:28,698 - INFO - step 6326, loss: 2.961659, best loss: 2.248890 2025-01-16 01:31:28,849 - INFO - step 6327, loss: 3.044505, best loss: 2.248890 2025-01-16 01:31:28,999 - INFO - step 6328, loss: 2.613364, best loss: 2.248890 2025-01-16 01:31:29,149 - INFO - step 6329, loss: 2.946222, best loss: 2.248890 2025-01-16 01:31:29,299 - INFO - step 6330, loss: 3.100621, best loss: 2.248890 2025-01-16 01:31:29,450 - INFO - step 6331, loss: 3.251714, best loss: 2.248890 2025-01-16 01:31:29,600 - INFO - step 6332, loss: 3.104988, best loss: 2.248890 2025-01-16 01:31:29,750 - INFO - step 6333, loss: 3.271533, best loss: 2.248890 2025-01-16 01:31:29,900 - INFO - step 6334, loss: 3.151925, best 
loss: 2.248890 2025-01-16 01:31:30,050 - INFO - step 6335, loss: 2.740679, best loss: 2.248890 2025-01-16 01:31:30,200 - INFO - step 6336, loss: 3.158866, best loss: 2.248890 2025-01-16 01:31:30,350 - INFO - step 6337, loss: 2.680492, best loss: 2.248890 2025-01-16 01:31:30,501 - INFO - step 6338, loss: 2.846910, best loss: 2.248890 2025-01-16 01:31:30,651 - INFO - step 6339, loss: 2.966809, best loss: 2.248890 2025-01-16 01:31:30,801 - INFO - step 6340, loss: 2.933629, best loss: 2.248890 2025-01-16 01:31:30,951 - INFO - step 6341, loss: 2.841127, best loss: 2.248890 2025-01-16 01:31:31,101 - INFO - step 6342, loss: 3.087663, best loss: 2.248890 2025-01-16 01:31:31,252 - INFO - step 6343, loss: 3.206666, best loss: 2.248890 2025-01-16 01:31:31,402 - INFO - step 6344, loss: 3.132693, best loss: 2.248890 2025-01-16 01:31:31,552 - INFO - step 6345, loss: 3.275074, best loss: 2.248890 2025-01-16 01:31:31,702 - INFO - step 6346, loss: 2.978881, best loss: 2.248890 2025-01-16 01:31:31,852 - INFO - step 6347, loss: 3.087817, best loss: 2.248890 2025-01-16 01:31:32,002 - INFO - step 6348, loss: 2.972940, best loss: 2.248890 2025-01-16 01:31:32,152 - INFO - step 6349, loss: 2.554443, best loss: 2.248890 2025-01-16 01:31:32,302 - INFO - step 6350, loss: 3.266185, best loss: 2.248890 2025-01-16 01:31:32,453 - INFO - step 6351, loss: 3.052231, best loss: 2.248890 2025-01-16 01:31:32,603 - INFO - step 6352, loss: 2.881810, best loss: 2.248890 2025-01-16 01:31:32,753 - INFO - step 6353, loss: 2.869900, best loss: 2.248890 2025-01-16 01:31:32,903 - INFO - step 6354, loss: 2.737667, best loss: 2.248890 2025-01-16 01:31:33,053 - INFO - step 6355, loss: 2.885656, best loss: 2.248890 2025-01-16 01:31:33,204 - INFO - step 6356, loss: 2.768671, best loss: 2.248890 2025-01-16 01:31:33,353 - INFO - step 6357, loss: 2.706491, best loss: 2.248890 2025-01-16 01:31:33,504 - INFO - step 6358, loss: 3.179944, best loss: 2.248890 2025-01-16 01:31:33,654 - INFO - step 6359, loss: 3.123716, best 
loss: 2.248890 2025-01-16 01:31:33,804 - INFO - step 6360, loss: 2.960331, best loss: 2.248890 2025-01-16 01:31:33,954 - INFO - step 6361, loss: 3.229141, best loss: 2.248890 2025-01-16 01:31:34,104 - INFO - step 6362, loss: 2.958837, best loss: 2.248890 2025-01-16 01:31:34,254 - INFO - step 6363, loss: 3.248201, best loss: 2.248890 2025-01-16 01:31:34,405 - INFO - step 6364, loss: 3.274687, best loss: 2.248890 2025-01-16 01:31:34,555 - INFO - step 6365, loss: 3.348125, best loss: 2.248890 2025-01-16 01:31:34,705 - INFO - step 6366, loss: 3.515464, best loss: 2.248890 2025-01-16 01:31:34,855 - INFO - step 6367, loss: 3.096328, best loss: 2.248890 2025-01-16 01:31:35,005 - INFO - step 6368, loss: 3.351885, best loss: 2.248890 2025-01-16 01:31:35,156 - INFO - step 6369, loss: 3.163304, best loss: 2.248890 2025-01-16 01:31:35,306 - INFO - step 6370, loss: 3.069221, best loss: 2.248890 2025-01-16 01:31:35,456 - INFO - step 6371, loss: 3.345090, best loss: 2.248890 2025-01-16 01:31:35,607 - INFO - step 6372, loss: 3.287537, best loss: 2.248890 2025-01-16 01:31:35,757 - INFO - step 6373, loss: 3.269750, best loss: 2.248890 2025-01-16 01:31:35,907 - INFO - step 6374, loss: 2.976298, best loss: 2.248890 2025-01-16 01:31:36,058 - INFO - step 6375, loss: 3.190790, best loss: 2.248890 2025-01-16 01:31:36,208 - INFO - step 6376, loss: 3.139757, best loss: 2.248890 2025-01-16 01:31:36,358 - INFO - step 6377, loss: 3.030127, best loss: 2.248890 2025-01-16 01:31:36,508 - INFO - step 6378, loss: 2.950655, best loss: 2.248890 2025-01-16 01:31:36,658 - INFO - step 6379, loss: 3.331806, best loss: 2.248890 2025-01-16 01:31:36,808 - INFO - step 6380, loss: 3.223286, best loss: 2.248890 2025-01-16 01:31:36,959 - INFO - step 6381, loss: 3.398988, best loss: 2.248890 2025-01-16 01:31:37,109 - INFO - step 6382, loss: 3.176248, best loss: 2.248890 2025-01-16 01:31:37,259 - INFO - step 6383, loss: 2.971287, best loss: 2.248890 2025-01-16 01:31:37,409 - INFO - step 6384, loss: 3.354142, best 
loss: 2.248890 2025-01-16 01:31:37,559 - INFO - step 6385, loss: 3.039785, best loss: 2.248890 2025-01-16 01:31:37,709 - INFO - step 6386, loss: 3.223046, best loss: 2.248890 2025-01-16 01:31:37,859 - INFO - step 6387, loss: 2.983113, best loss: 2.248890 2025-01-16 01:31:38,009 - INFO - step 6388, loss: 3.050960, best loss: 2.248890 2025-01-16 01:31:38,159 - INFO - step 6389, loss: 3.150671, best loss: 2.248890 2025-01-16 01:31:38,310 - INFO - step 6390, loss: 3.167202, best loss: 2.248890 2025-01-16 01:31:38,460 - INFO - step 6391, loss: 3.086699, best loss: 2.248890 2025-01-16 01:31:38,610 - INFO - step 6392, loss: 3.125966, best loss: 2.248890 2025-01-16 01:31:38,760 - INFO - step 6393, loss: 2.831092, best loss: 2.248890 2025-01-16 01:31:38,910 - INFO - step 6394, loss: 2.632770, best loss: 2.248890 2025-01-16 01:31:39,060 - INFO - step 6395, loss: 2.753039, best loss: 2.248890 2025-01-16 01:31:39,211 - INFO - step 6396, loss: 2.953253, best loss: 2.248890 2025-01-16 01:31:39,361 - INFO - step 6397, loss: 3.417450, best loss: 2.248890 2025-01-16 01:31:39,511 - INFO - step 6398, loss: 3.011827, best loss: 2.248890 2025-01-16 01:31:39,661 - INFO - step 6399, loss: 2.591313, best loss: 2.248890 2025-01-16 01:31:39,811 - INFO - step 6400, loss: 3.170673, best loss: 2.248890 2025-01-16 01:31:39,961 - INFO - step 6401, loss: 2.905683, best loss: 2.248890 2025-01-16 01:31:40,111 - INFO - step 6402, loss: 3.482691, best loss: 2.248890 2025-01-16 01:31:40,261 - INFO - step 6403, loss: 2.789228, best loss: 2.248890 2025-01-16 01:31:40,411 - INFO - step 6404, loss: 3.149107, best loss: 2.248890 2025-01-16 01:31:40,562 - INFO - step 6405, loss: 2.990911, best loss: 2.248890 2025-01-16 01:31:40,712 - INFO - step 6406, loss: 3.187408, best loss: 2.248890 2025-01-16 01:31:40,862 - INFO - step 6407, loss: 3.081520, best loss: 2.248890 2025-01-16 01:31:41,012 - INFO - step 6408, loss: 2.845884, best loss: 2.248890 2025-01-16 01:31:41,162 - INFO - step 6409, loss: 3.123577, best 
loss: 2.248890 2025-01-16 01:31:41,312 - INFO - step 6410, loss: 3.020021, best loss: 2.248890 2025-01-16 01:31:41,462 - INFO - step 6411, loss: 2.869761, best loss: 2.248890 2025-01-16 01:31:41,612 - INFO - step 6412, loss: 3.436925, best loss: 2.248890 2025-01-16 01:31:41,762 - INFO - step 6413, loss: 2.969362, best loss: 2.248890 2025-01-16 01:31:41,912 - INFO - step 6414, loss: 2.605892, best loss: 2.248890 2025-01-16 01:31:42,063 - INFO - step 6415, loss: 2.968611, best loss: 2.248890 2025-01-16 01:31:42,213 - INFO - step 6416, loss: 3.265864, best loss: 2.248890 2025-01-16 01:31:42,363 - INFO - step 6417, loss: 3.103857, best loss: 2.248890 2025-01-16 01:31:42,514 - INFO - step 6418, loss: 2.968630, best loss: 2.248890 2025-01-16 01:31:42,664 - INFO - step 6419, loss: 2.942664, best loss: 2.248890 2025-01-16 01:31:42,815 - INFO - step 6420, loss: 3.351020, best loss: 2.248890 2025-01-16 01:31:42,965 - INFO - step 6421, loss: 3.317984, best loss: 2.248890 2025-01-16 01:31:43,115 - INFO - step 6422, loss: 3.164834, best loss: 2.248890 2025-01-16 01:31:43,265 - INFO - step 6423, loss: 2.878091, best loss: 2.248890 2025-01-16 01:31:43,415 - INFO - step 6424, loss: 3.128350, best loss: 2.248890 2025-01-16 01:31:43,565 - INFO - step 6425, loss: 3.018186, best loss: 2.248890 2025-01-16 01:31:43,715 - INFO - step 6426, loss: 2.754014, best loss: 2.248890 2025-01-16 01:31:43,865 - INFO - step 6427, loss: 3.068302, best loss: 2.248890 2025-01-16 01:31:44,015 - INFO - step 6428, loss: 2.864732, best loss: 2.248890 2025-01-16 01:31:44,166 - INFO - step 6429, loss: 3.093442, best loss: 2.248890 2025-01-16 01:31:44,316 - INFO - step 6430, loss: 2.914243, best loss: 2.248890 2025-01-16 01:31:44,466 - INFO - step 6431, loss: 3.040809, best loss: 2.248890 2025-01-16 01:31:44,617 - INFO - step 6432, loss: 2.860304, best loss: 2.248890 2025-01-16 01:31:44,767 - INFO - step 6433, loss: 2.861512, best loss: 2.248890 2025-01-16 01:31:44,917 - INFO - step 6434, loss: 3.211003, best 
loss: 2.248890 2025-01-16 01:31:45,067 - INFO - step 6435, loss: 3.106056, best loss: 2.248890 2025-01-16 01:31:45,217 - INFO - step 6436, loss: 3.283182, best loss: 2.248890 2025-01-16 01:31:45,367 - INFO - step 6437, loss: 2.890064, best loss: 2.248890 2025-01-16 01:31:45,518 - INFO - step 6438, loss: 3.083109, best loss: 2.248890 2025-01-16 01:31:45,668 - INFO - step 6439, loss: 3.096312, best loss: 2.248890 2025-01-16 01:31:45,818 - INFO - step 6440, loss: 2.760017, best loss: 2.248890 2025-01-16 01:31:45,968 - INFO - step 6441, loss: 2.748118, best loss: 2.248890 2025-01-16 01:31:46,119 - INFO - step 6442, loss: 2.777552, best loss: 2.248890 2025-01-16 01:31:46,269 - INFO - step 6443, loss: 3.013132, best loss: 2.248890 2025-01-16 01:31:46,419 - INFO - step 6444, loss: 3.098530, best loss: 2.248890 2025-01-16 01:31:46,569 - INFO - step 6445, loss: 3.248377, best loss: 2.248890 2025-01-16 01:31:46,719 - INFO - step 6446, loss: 3.508483, best loss: 2.248890 2025-01-16 01:31:46,870 - INFO - step 6447, loss: 3.312597, best loss: 2.248890 2025-01-16 01:31:47,020 - INFO - step 6448, loss: 3.247875, best loss: 2.248890 2025-01-16 01:31:47,170 - INFO - step 6449, loss: 3.212814, best loss: 2.248890 2025-01-16 01:31:47,320 - INFO - step 6450, loss: 3.226163, best loss: 2.248890 2025-01-16 01:31:47,471 - INFO - step 6451, loss: 2.794497, best loss: 2.248890 2025-01-16 01:31:47,621 - INFO - step 6452, loss: 3.217601, best loss: 2.248890 2025-01-16 01:31:47,771 - INFO - step 6453, loss: 3.390343, best loss: 2.248890 2025-01-16 01:31:47,921 - INFO - step 6454, loss: 3.223758, best loss: 2.248890 2025-01-16 01:31:48,071 - INFO - step 6455, loss: 3.241470, best loss: 2.248890 2025-01-16 01:31:48,222 - INFO - step 6456, loss: 3.360307, best loss: 2.248890 2025-01-16 01:31:48,372 - INFO - step 6457, loss: 2.982851, best loss: 2.248890 2025-01-16 01:31:51,919 - INFO - step 6458, loss: 2.205240, best loss: 2.205240 2025-01-16 01:31:52,078 - INFO - step 6459, loss: 3.161215, best 
loss: 2.205240 2025-01-16 01:31:52,230 - INFO - step 6460, loss: 3.175133, best loss: 2.205240 2025-01-16 01:31:52,381 - INFO - step 6461, loss: 3.194898, best loss: 2.205240 2025-01-16 01:31:52,531 - INFO - step 6462, loss: 3.268278, best loss: 2.205240 2025-01-16 01:31:52,681 - INFO - step 6463, loss: 2.904756, best loss: 2.205240 2025-01-16 01:31:52,831 - INFO - step 6464, loss: 3.153273, best loss: 2.205240 2025-01-16 01:31:52,981 - INFO - step 6465, loss: 3.136059, best loss: 2.205240 2025-01-16 01:31:53,131 - INFO - step 6466, loss: 2.999737, best loss: 2.205240 2025-01-16 01:31:53,281 - INFO - step 6467, loss: 3.032571, best loss: 2.205240 2025-01-16 01:31:53,431 - INFO - step 6468, loss: 2.987198, best loss: 2.205240 2025-01-16 01:31:53,581 - INFO - step 6469, loss: 2.813978, best loss: 2.205240 2025-01-16 01:31:53,731 - INFO - step 6470, loss: 3.113006, best loss: 2.205240 2025-01-16 01:31:53,881 - INFO - step 6471, loss: 2.934799, best loss: 2.205240 2025-01-16 01:31:54,032 - INFO - step 6472, loss: 3.138509, best loss: 2.205240 2025-01-16 01:31:54,182 - INFO - step 6473, loss: 3.279429, best loss: 2.205240 2025-01-16 01:31:54,332 - INFO - step 6474, loss: 2.987221, best loss: 2.205240 2025-01-16 01:31:54,483 - INFO - step 6475, loss: 2.707585, best loss: 2.205240 2025-01-16 01:31:54,633 - INFO - step 6476, loss: 3.040371, best loss: 2.205240 2025-01-16 01:31:54,783 - INFO - step 6477, loss: 3.095889, best loss: 2.205240 2025-01-16 01:31:54,933 - INFO - step 6478, loss: 3.228896, best loss: 2.205240 2025-01-16 01:31:55,084 - INFO - step 6479, loss: 3.120626, best loss: 2.205240 2025-01-16 01:31:55,234 - INFO - step 6480, loss: 3.121354, best loss: 2.205240 2025-01-16 01:31:55,384 - INFO - step 6481, loss: 3.031425, best loss: 2.205240 2025-01-16 01:31:55,535 - INFO - step 6482, loss: 3.287657, best loss: 2.205240 2025-01-16 01:31:55,685 - INFO - step 6483, loss: 2.925375, best loss: 2.205240 2025-01-16 01:31:55,835 - INFO - step 6484, loss: 3.008849, best 
loss: 2.205240 2025-01-16 01:31:55,986 - INFO - step 6485, loss: 3.207268, best loss: 2.205240 2025-01-16 01:31:56,136 - INFO - step 6486, loss: 3.261724, best loss: 2.205240 2025-01-16 01:31:56,286 - INFO - step 6487, loss: 3.164422, best loss: 2.205240 2025-01-16 01:31:56,436 - INFO - step 6488, loss: 3.026103, best loss: 2.205240 2025-01-16 01:31:56,586 - INFO - step 6489, loss: 2.974212, best loss: 2.205240 2025-01-16 01:31:56,736 - INFO - step 6490, loss: 3.153758, best loss: 2.205240 2025-01-16 01:31:56,887 - INFO - step 6491, loss: 3.358784, best loss: 2.205240 2025-01-16 01:31:57,037 - INFO - step 6492, loss: 3.156965, best loss: 2.205240 2025-01-16 01:31:57,187 - INFO - step 6493, loss: 3.383733, best loss: 2.205240 2025-01-16 01:31:57,337 - INFO - step 6494, loss: 3.385591, best loss: 2.205240 2025-01-16 01:31:57,487 - INFO - step 6495, loss: 3.200640, best loss: 2.205240 2025-01-16 01:31:57,637 - INFO - step 6496, loss: 3.418555, best loss: 2.205240 2025-01-16 01:31:57,788 - INFO - step 6497, loss: 3.227663, best loss: 2.205240 2025-01-16 01:31:57,938 - INFO - step 6498, loss: 2.866946, best loss: 2.205240 2025-01-16 01:31:58,088 - INFO - step 6499, loss: 3.314591, best loss: 2.205240 2025-01-16 01:31:58,238 - INFO - step 6500, loss: 3.212142, best loss: 2.205240 2025-01-16 01:31:58,389 - INFO - step 6501, loss: 3.256509, best loss: 2.205240 2025-01-16 01:31:58,539 - INFO - step 6502, loss: 2.948755, best loss: 2.205240 2025-01-16 01:31:58,690 - INFO - step 6503, loss: 3.161423, best loss: 2.205240 2025-01-16 01:31:58,840 - INFO - step 6504, loss: 3.124922, best loss: 2.205240 2025-01-16 01:31:58,990 - INFO - step 6505, loss: 3.017489, best loss: 2.205240 2025-01-16 01:31:59,140 - INFO - step 6506, loss: 3.199944, best loss: 2.205240 2025-01-16 01:31:59,290 - INFO - step 6507, loss: 2.846131, best loss: 2.205240 2025-01-16 01:31:59,440 - INFO - step 6508, loss: 2.799575, best loss: 2.205240 2025-01-16 01:31:59,591 - INFO - step 6509, loss: 3.275158, best 
loss: 2.205240 2025-01-16 01:31:59,741 - INFO - step 6510, loss: 3.313667, best loss: 2.205240 2025-01-16 01:31:59,891 - INFO - step 6511, loss: 3.329801, best loss: 2.205240 2025-01-16 01:32:00,041 - INFO - step 6512, loss: 3.184817, best loss: 2.205240 2025-01-16 01:32:00,191 - INFO - step 6513, loss: 3.329897, best loss: 2.205240 2025-01-16 01:32:00,341 - INFO - step 6514, loss: 3.332281, best loss: 2.205240 2025-01-16 01:32:00,491 - INFO - step 6515, loss: 2.963937, best loss: 2.205240 2025-01-16 01:32:00,642 - INFO - step 6516, loss: 2.929649, best loss: 2.205240 2025-01-16 01:32:00,792 - INFO - step 6517, loss: 3.266219, best loss: 2.205240 2025-01-16 01:32:00,942 - INFO - step 6518, loss: 2.875156, best loss: 2.205240 2025-01-16 01:32:01,092 - INFO - step 6519, loss: 2.657692, best loss: 2.205240 2025-01-16 01:32:01,242 - INFO - step 6520, loss: 3.027148, best loss: 2.205240 2025-01-16 01:32:01,392 - INFO - step 6521, loss: 3.131714, best loss: 2.205240 2025-01-16 01:32:01,542 - INFO - step 6522, loss: 3.105217, best loss: 2.205240 2025-01-16 01:32:01,692 - INFO - step 6523, loss: 2.716825, best loss: 2.205240 2025-01-16 01:32:01,842 - INFO - step 6524, loss: 2.529164, best loss: 2.205240 2025-01-16 01:32:01,993 - INFO - step 6525, loss: 2.498478, best loss: 2.205240 2025-01-16 01:32:02,143 - INFO - step 6526, loss: 2.693572, best loss: 2.205240 2025-01-16 01:32:02,293 - INFO - step 6527, loss: 2.866093, best loss: 2.205240 2025-01-16 01:32:02,443 - INFO - step 6528, loss: 3.096276, best loss: 2.205240 2025-01-16 01:32:02,593 - INFO - step 6529, loss: 2.989130, best loss: 2.205240 2025-01-16 01:32:02,743 - INFO - step 6530, loss: 2.857444, best loss: 2.205240 2025-01-16 01:32:02,894 - INFO - step 6531, loss: 2.977672, best loss: 2.205240 2025-01-16 01:32:03,044 - INFO - step 6532, loss: 2.942727, best loss: 2.205240 2025-01-16 01:32:03,194 - INFO - step 6533, loss: 2.832541, best loss: 2.205240 2025-01-16 01:32:03,344 - INFO - step 6534, loss: 3.041795, best 
loss: 2.205240 2025-01-16 01:32:03,495 - INFO - step 6535, loss: 3.067111, best loss: 2.205240 2025-01-16 01:32:03,645 - INFO - step 6536, loss: 2.774179, best loss: 2.205240 2025-01-16 01:32:03,795 - INFO - step 6537, loss: 2.706527, best loss: 2.205240 2025-01-16 01:32:03,945 - INFO - step 6538, loss: 2.963073, best loss: 2.205240 2025-01-16 01:32:04,096 - INFO - step 6539, loss: 3.044253, best loss: 2.205240 2025-01-16 01:32:04,246 - INFO - step 6540, loss: 2.758211, best loss: 2.205240 2025-01-16 01:32:04,396 - INFO - step 6541, loss: 2.742162, best loss: 2.205240 2025-01-16 01:32:04,547 - INFO - step 6542, loss: 3.013386, best loss: 2.205240 2025-01-16 01:32:04,697 - INFO - step 6543, loss: 2.801462, best loss: 2.205240 2025-01-16 01:32:04,847 - INFO - step 6544, loss: 2.720343, best loss: 2.205240 2025-01-16 01:32:04,997 - INFO - step 6545, loss: 3.118802, best loss: 2.205240 2025-01-16 01:32:05,147 - INFO - step 6546, loss: 2.815643, best loss: 2.205240 2025-01-16 01:32:05,297 - INFO - step 6547, loss: 2.790556, best loss: 2.205240 2025-01-16 01:32:05,447 - INFO - step 6548, loss: 2.742133, best loss: 2.205240 2025-01-16 01:32:05,598 - INFO - step 6549, loss: 2.992774, best loss: 2.205240 2025-01-16 01:32:05,748 - INFO - step 6550, loss: 2.583862, best loss: 2.205240 2025-01-16 01:32:05,898 - INFO - step 6551, loss: 2.728736, best loss: 2.205240 2025-01-16 01:32:06,048 - INFO - step 6552, loss: 2.650187, best loss: 2.205240 2025-01-16 01:32:06,199 - INFO - step 6553, loss: 2.947071, best loss: 2.205240 2025-01-16 01:32:06,349 - INFO - step 6554, loss: 3.295569, best loss: 2.205240 2025-01-16 01:32:06,499 - INFO - step 6555, loss: 3.368269, best loss: 2.205240 2025-01-16 01:32:06,649 - INFO - step 6556, loss: 3.109379, best loss: 2.205240 2025-01-16 01:32:06,799 - INFO - step 6557, loss: 3.106678, best loss: 2.205240 2025-01-16 01:32:06,949 - INFO - step 6558, loss: 3.050390, best loss: 2.205240 2025-01-16 01:32:07,100 - INFO - step 6559, loss: 3.010933, best 
loss: 2.205240 2025-01-16 01:32:07,250 - INFO - step 6560, loss: 2.845839, best loss: 2.205240 2025-01-16 01:32:07,400 - INFO - step 6561, loss: 3.063165, best loss: 2.205240 2025-01-16 01:32:07,550 - INFO - step 6562, loss: 2.856523, best loss: 2.205240 2025-01-16 01:32:07,700 - INFO - step 6563, loss: 2.701694, best loss: 2.205240 2025-01-16 01:32:07,850 - INFO - step 6564, loss: 2.961491, best loss: 2.205240 2025-01-16 01:32:08,000 - INFO - step 6565, loss: 2.894433, best loss: 2.205240 2025-01-16 01:32:08,151 - INFO - step 6566, loss: 3.049327, best loss: 2.205240 2025-01-16 01:32:08,301 - INFO - step 6567, loss: 2.322453, best loss: 2.205240 2025-01-16 01:32:08,451 - INFO - step 6568, loss: 2.901578, best loss: 2.205240 2025-01-16 01:32:08,601 - INFO - step 6569, loss: 3.019864, best loss: 2.205240 2025-01-16 01:32:08,752 - INFO - step 6570, loss: 2.942780, best loss: 2.205240 2025-01-16 01:32:08,902 - INFO - step 6571, loss: 2.928058, best loss: 2.205240 2025-01-16 01:32:09,052 - INFO - step 6572, loss: 2.834896, best loss: 2.205240 2025-01-16 01:32:09,202 - INFO - step 6573, loss: 2.932510, best loss: 2.205240 2025-01-16 01:32:09,353 - INFO - step 6574, loss: 2.772504, best loss: 2.205240 2025-01-16 01:32:09,503 - INFO - step 6575, loss: 2.974680, best loss: 2.205240 2025-01-16 01:32:09,653 - INFO - step 6576, loss: 2.882641, best loss: 2.205240 2025-01-16 01:32:09,804 - INFO - step 6577, loss: 2.984874, best loss: 2.205240 2025-01-16 01:32:09,954 - INFO - step 6578, loss: 2.687634, best loss: 2.205240 2025-01-16 01:32:10,104 - INFO - step 6579, loss: 2.766669, best loss: 2.205240 2025-01-16 01:32:10,254 - INFO - step 6580, loss: 2.576212, best loss: 2.205240 2025-01-16 01:32:10,405 - INFO - step 6581, loss: 2.615723, best loss: 2.205240 2025-01-16 01:32:10,555 - INFO - step 6582, loss: 2.795063, best loss: 2.205240 2025-01-16 01:32:10,705 - INFO - step 6583, loss: 2.731159, best loss: 2.205240 2025-01-16 01:32:10,855 - INFO - step 6584, loss: 2.738520, best 
loss: 2.205240 2025-01-16 01:32:11,006 - INFO - step 6585, loss: 2.505444, best loss: 2.205240 2025-01-16 01:32:11,156 - INFO - step 6586, loss: 2.482447, best loss: 2.205240 2025-01-16 01:32:11,306 - INFO - step 6587, loss: 2.320640, best loss: 2.205240 2025-01-16 01:32:11,456 - INFO - step 6588, loss: 2.951380, best loss: 2.205240 2025-01-16 01:32:11,606 - INFO - step 6589, loss: 3.090430, best loss: 2.205240 2025-01-16 01:32:11,757 - INFO - step 6590, loss: 3.170625, best loss: 2.205240 2025-01-16 01:32:11,907 - INFO - step 6591, loss: 3.283740, best loss: 2.205240 2025-01-16 01:32:12,057 - INFO - step 6592, loss: 3.248930, best loss: 2.205240 2025-01-16 01:32:12,207 - INFO - step 6593, loss: 2.892670, best loss: 2.205240 2025-01-16 01:32:12,357 - INFO - step 6594, loss: 2.918085, best loss: 2.205240 2025-01-16 01:32:12,508 - INFO - step 6595, loss: 3.172819, best loss: 2.205240 2025-01-16 01:32:12,658 - INFO - step 6596, loss: 2.936655, best loss: 2.205240 2025-01-16 01:32:12,808 - INFO - step 6597, loss: 2.496998, best loss: 2.205240 2025-01-16 01:32:12,959 - INFO - step 6598, loss: 2.794437, best loss: 2.205240 2025-01-16 01:32:13,109 - INFO - step 6599, loss: 2.733874, best loss: 2.205240 2025-01-16 01:32:13,259 - INFO - step 6600, loss: 3.166702, best loss: 2.205240 2025-01-16 01:32:13,409 - INFO - step 6601, loss: 3.075743, best loss: 2.205240 2025-01-16 01:32:13,560 - INFO - step 6602, loss: 3.155864, best loss: 2.205240 2025-01-16 01:32:13,710 - INFO - step 6603, loss: 3.017444, best loss: 2.205240 2025-01-16 01:32:13,860 - INFO - step 6604, loss: 3.042235, best loss: 2.205240 2025-01-16 01:32:14,010 - INFO - step 6605, loss: 2.679907, best loss: 2.205240 2025-01-16 01:32:14,161 - INFO - step 6606, loss: 3.163786, best loss: 2.205240 2025-01-16 01:32:14,311 - INFO - step 6607, loss: 2.940161, best loss: 2.205240 2025-01-16 01:32:14,461 - INFO - step 6608, loss: 3.114575, best loss: 2.205240 2025-01-16 01:32:14,611 - INFO - step 6609, loss: 3.038147, best 
loss: 2.205240 2025-01-16 01:32:14,761 - INFO - step 6610, loss: 2.912571, best loss: 2.205240 2025-01-16 01:32:14,911 - INFO - step 6611, loss: 3.163019, best loss: 2.205240 2025-01-16 01:32:15,062 - INFO - step 6612, loss: 2.727734, best loss: 2.205240 2025-01-16 01:32:15,212 - INFO - step 6613, loss: 3.042037, best loss: 2.205240 2025-01-16 01:32:15,362 - INFO - step 6614, loss: 3.118940, best loss: 2.205240 2025-01-16 01:32:15,512 - INFO - step 6615, loss: 3.063288, best loss: 2.205240 2025-01-16 01:32:15,663 - INFO - step 6616, loss: 3.032479, best loss: 2.205240 2025-01-16 01:32:15,813 - INFO - step 6617, loss: 2.808064, best loss: 2.205240 2025-01-16 01:32:15,963 - INFO - step 6618, loss: 2.851342, best loss: 2.205240 2025-01-16 01:32:16,113 - INFO - step 6619, loss: 2.883460, best loss: 2.205240 2025-01-16 01:32:16,264 - INFO - step 6620, loss: 2.551197, best loss: 2.205240 2025-01-16 01:32:16,414 - INFO - step 6621, loss: 3.222604, best loss: 2.205240 2025-01-16 01:32:16,565 - INFO - step 6622, loss: 2.486270, best loss: 2.205240 2025-01-16 01:32:16,715 - INFO - step 6623, loss: 2.557763, best loss: 2.205240 2025-01-16 01:32:16,865 - INFO - step 6624, loss: 2.928278, best loss: 2.205240 2025-01-16 01:32:17,015 - INFO - step 6625, loss: 2.965897, best loss: 2.205240 2025-01-16 01:32:17,165 - INFO - step 6626, loss: 2.834805, best loss: 2.205240 2025-01-16 01:32:17,316 - INFO - step 6627, loss: 2.666832, best loss: 2.205240 2025-01-16 01:32:17,466 - INFO - step 6628, loss: 2.862849, best loss: 2.205240 2025-01-16 01:32:17,616 - INFO - step 6629, loss: 2.942149, best loss: 2.205240 2025-01-16 01:32:17,766 - INFO - step 6630, loss: 2.694012, best loss: 2.205240 2025-01-16 01:32:17,916 - INFO - step 6631, loss: 2.614927, best loss: 2.205240 2025-01-16 01:32:18,067 - INFO - step 6632, loss: 2.911653, best loss: 2.205240 2025-01-16 01:32:18,217 - INFO - step 6633, loss: 2.896492, best loss: 2.205240 2025-01-16 01:32:18,367 - INFO - step 6634, loss: 2.784376, best 
loss: 2.205240 2025-01-16 01:32:18,518 - INFO - step 6635, loss: 2.597790, best loss: 2.205240 2025-01-16 01:32:18,668 - INFO - step 6636, loss: 2.871223, best loss: 2.205240 2025-01-16 01:32:18,818 - INFO - step 6637, loss: 3.130523, best loss: 2.205240 2025-01-16 01:32:18,968 - INFO - step 6638, loss: 2.853338, best loss: 2.205240 2025-01-16 01:32:19,119 - INFO - step 6639, loss: 2.999641, best loss: 2.205240 2025-01-16 01:32:19,269 - INFO - step 6640, loss: 3.156467, best loss: 2.205240 2025-01-16 01:32:19,420 - INFO - step 6641, loss: 3.079001, best loss: 2.205240 2025-01-16 01:32:19,570 - INFO - step 6642, loss: 3.108513, best loss: 2.205240 2025-01-16 01:32:19,721 - INFO - step 6643, loss: 2.888663, best loss: 2.205240 2025-01-16 01:32:19,871 - INFO - step 6644, loss: 2.893245, best loss: 2.205240 2025-01-16 01:32:20,021 - INFO - step 6645, loss: 2.740952, best loss: 2.205240 2025-01-16 01:32:20,171 - INFO - step 6646, loss: 3.288522, best loss: 2.205240 2025-01-16 01:32:20,322 - INFO - step 6647, loss: 2.948888, best loss: 2.205240 2025-01-16 01:32:20,472 - INFO - step 6648, loss: 3.214087, best loss: 2.205240 2025-01-16 01:32:20,622 - INFO - step 6649, loss: 2.542421, best loss: 2.205240 2025-01-16 01:32:20,773 - INFO - step 6650, loss: 2.688358, best loss: 2.205240 2025-01-16 01:32:20,923 - INFO - step 6651, loss: 2.999799, best loss: 2.205240 2025-01-16 01:32:21,074 - INFO - step 6652, loss: 3.000383, best loss: 2.205240 2025-01-16 01:32:21,224 - INFO - step 6653, loss: 2.900695, best loss: 2.205240 2025-01-16 01:32:21,374 - INFO - step 6654, loss: 2.987334, best loss: 2.205240 2025-01-16 01:32:21,525 - INFO - step 6655, loss: 2.802943, best loss: 2.205240 2025-01-16 01:32:21,675 - INFO - step 6656, loss: 2.924860, best loss: 2.205240 2025-01-16 01:32:21,825 - INFO - step 6657, loss: 3.049746, best loss: 2.205240 2025-01-16 01:32:21,975 - INFO - step 6658, loss: 2.654589, best loss: 2.205240 2025-01-16 01:32:22,125 - INFO - step 6659, loss: 2.905776, best 
loss: 2.205240 2025-01-16 01:32:22,276 - INFO - step 6660, loss: 3.000831, best loss: 2.205240 2025-01-16 01:32:22,426 - INFO - step 6661, loss: 3.138039, best loss: 2.205240 2025-01-16 01:32:22,576 - INFO - step 6662, loss: 2.966904, best loss: 2.205240 2025-01-16 01:32:22,727 - INFO - step 6663, loss: 3.122897, best loss: 2.205240 2025-01-16 01:32:22,877 - INFO - step 6664, loss: 3.012140, best loss: 2.205240 2025-01-16 01:32:23,027 - INFO - step 6665, loss: 2.676777, best loss: 2.205240 2025-01-16 01:32:23,178 - INFO - step 6666, loss: 3.002955, best loss: 2.205240 2025-01-16 01:32:23,328 - INFO - step 6667, loss: 2.533807, best loss: 2.205240 2025-01-16 01:32:23,478 - INFO - step 6668, loss: 2.783580, best loss: 2.205240 2025-01-16 01:32:23,628 - INFO - step 6669, loss: 2.882010, best loss: 2.205240 2025-01-16 01:32:23,778 - INFO - step 6670, loss: 2.872702, best loss: 2.205240 2025-01-16 01:32:23,929 - INFO - step 6671, loss: 2.839947, best loss: 2.205240 2025-01-16 01:32:24,079 - INFO - step 6672, loss: 2.943324, best loss: 2.205240 2025-01-16 01:32:24,229 - INFO - step 6673, loss: 3.108261, best loss: 2.205240 2025-01-16 01:32:24,379 - INFO - step 6674, loss: 2.991455, best loss: 2.205240 2025-01-16 01:32:24,530 - INFO - step 6675, loss: 3.137223, best loss: 2.205240 2025-01-16 01:32:24,680 - INFO - step 6676, loss: 2.925558, best loss: 2.205240 2025-01-16 01:32:24,830 - INFO - step 6677, loss: 2.991820, best loss: 2.205240 2025-01-16 01:32:24,980 - INFO - step 6678, loss: 2.906685, best loss: 2.205240 2025-01-16 01:32:25,130 - INFO - step 6679, loss: 2.532381, best loss: 2.205240 2025-01-16 01:32:25,280 - INFO - step 6680, loss: 3.208876, best loss: 2.205240 2025-01-16 01:32:25,431 - INFO - step 6681, loss: 2.944554, best loss: 2.205240 2025-01-16 01:32:25,581 - INFO - step 6682, loss: 2.798927, best loss: 2.205240 2025-01-16 01:32:25,731 - INFO - step 6683, loss: 2.754247, best loss: 2.205240 2025-01-16 01:32:25,881 - INFO - step 6684, loss: 2.582756, best 
loss: 2.205240 2025-01-16 01:32:26,031 - INFO - step 6685, loss: 2.810364, best loss: 2.205240 2025-01-16 01:32:26,182 - INFO - step 6686, loss: 2.668561, best loss: 2.205240 2025-01-16 01:32:26,332 - INFO - step 6687, loss: 2.639963, best loss: 2.205240 2025-01-16 01:32:26,482 - INFO - step 6688, loss: 3.051412, best loss: 2.205240 2025-01-16 01:32:26,632 - INFO - step 6689, loss: 3.046217, best loss: 2.205240 2025-01-16 01:32:26,783 - INFO - step 6690, loss: 2.810952, best loss: 2.205240 2025-01-16 01:32:26,933 - INFO - step 6691, loss: 3.064941, best loss: 2.205240 2025-01-16 01:32:27,083 - INFO - step 6692, loss: 2.827175, best loss: 2.205240 2025-01-16 01:32:27,233 - INFO - step 6693, loss: 3.146063, best loss: 2.205240 2025-01-16 01:32:27,383 - INFO - step 6694, loss: 3.184152, best loss: 2.205240 2025-01-16 01:32:27,534 - INFO - step 6695, loss: 3.323127, best loss: 2.205240 2025-01-16 01:32:27,684 - INFO - step 6696, loss: 3.285836, best loss: 2.205240 2025-01-16 01:32:27,834 - INFO - step 6697, loss: 3.048580, best loss: 2.205240 2025-01-16 01:32:27,984 - INFO - step 6698, loss: 3.232186, best loss: 2.205240 2025-01-16 01:32:28,134 - INFO - step 6699, loss: 3.190711, best loss: 2.205240 2025-01-16 01:32:28,284 - INFO - step 6700, loss: 2.973162, best loss: 2.205240 2025-01-16 01:32:28,434 - INFO - step 6701, loss: 3.373179, best loss: 2.205240 2025-01-16 01:32:28,584 - INFO - step 6702, loss: 3.281085, best loss: 2.205240 2025-01-16 01:32:28,734 - INFO - step 6703, loss: 3.249803, best loss: 2.205240 2025-01-16 01:32:28,885 - INFO - step 6704, loss: 3.003128, best loss: 2.205240 2025-01-16 01:32:29,035 - INFO - step 6705, loss: 3.106326, best loss: 2.205240 2025-01-16 01:32:29,185 - INFO - step 6706, loss: 3.016950, best loss: 2.205240 2025-01-16 01:32:29,336 - INFO - step 6707, loss: 2.852972, best loss: 2.205240 2025-01-16 01:32:29,486 - INFO - step 6708, loss: 2.851797, best loss: 2.205240 2025-01-16 01:32:29,637 - INFO - step 6709, loss: 3.226440, best 
loss: 2.205240 2025-01-16 01:32:29,787 - INFO - step 6710, loss: 3.111807, best loss: 2.205240 2025-01-16 01:32:29,937 - INFO - step 6711, loss: 3.353328, best loss: 2.205240 2025-01-16 01:32:30,087 - INFO - step 6712, loss: 3.181563, best loss: 2.205240 2025-01-16 01:32:30,237 - INFO - step 6713, loss: 2.968189, best loss: 2.205240 2025-01-16 01:32:30,387 - INFO - step 6714, loss: 3.325346, best loss: 2.205240 2025-01-16 01:32:30,537 - INFO - step 6715, loss: 2.980703, best loss: 2.205240 2025-01-16 01:32:30,687 - INFO - step 6716, loss: 3.218390, best loss: 2.205240 2025-01-16 01:32:30,837 - INFO - step 6717, loss: 2.936090, best loss: 2.205240 2025-01-16 01:32:30,987 - INFO - step 6718, loss: 2.973308, best loss: 2.205240 2025-01-16 01:32:31,137 - INFO - step 6719, loss: 3.061696, best loss: 2.205240 2025-01-16 01:32:31,287 - INFO - step 6720, loss: 3.052452, best loss: 2.205240 2025-01-16 01:32:31,437 - INFO - step 6721, loss: 2.994936, best loss: 2.205240 2025-01-16 01:32:31,587 - INFO - step 6722, loss: 3.077588, best loss: 2.205240 2025-01-16 01:32:31,738 - INFO - step 6723, loss: 2.754010, best loss: 2.205240 2025-01-16 01:32:31,888 - INFO - step 6724, loss: 2.517624, best loss: 2.205240 2025-01-16 01:32:32,038 - INFO - step 6725, loss: 2.691532, best loss: 2.205240 2025-01-16 01:32:32,188 - INFO - step 6726, loss: 2.925174, best loss: 2.205240 2025-01-16 01:32:32,338 - INFO - step 6727, loss: 3.384992, best loss: 2.205240 2025-01-16 01:32:32,488 - INFO - step 6728, loss: 2.955815, best loss: 2.205240 2025-01-16 01:32:32,638 - INFO - step 6729, loss: 2.507640, best loss: 2.205240 2025-01-16 01:32:32,789 - INFO - step 6730, loss: 3.102559, best loss: 2.205240 2025-01-16 01:32:32,939 - INFO - step 6731, loss: 2.886497, best loss: 2.205240 2025-01-16 01:32:33,089 - INFO - step 6732, loss: 3.367820, best loss: 2.205240 2025-01-16 01:32:33,239 - INFO - step 6733, loss: 2.710191, best loss: 2.205240 2025-01-16 01:32:33,389 - INFO - step 6734, loss: 3.031570, best 
loss: 2.205240 2025-01-16 01:32:33,539 - INFO - step 6735, loss: 2.844186, best loss: 2.205240 2025-01-16 01:32:33,690 - INFO - step 6736, loss: 3.090152, best loss: 2.205240 2025-01-16 01:32:33,840 - INFO - step 6737, loss: 2.986527, best loss: 2.205240 2025-01-16 01:32:33,990 - INFO - step 6738, loss: 2.831148, best loss: 2.205240 2025-01-16 01:32:34,140 - INFO - step 6739, loss: 3.089766, best loss: 2.205240 2025-01-16 01:32:34,290 - INFO - step 6740, loss: 2.944577, best loss: 2.205240 2025-01-16 01:32:34,440 - INFO - step 6741, loss: 2.814199, best loss: 2.205240 2025-01-16 01:32:34,590 - INFO - step 6742, loss: 3.351178, best loss: 2.205240 2025-01-16 01:32:34,740 - INFO - step 6743, loss: 2.855987, best loss: 2.205240 2025-01-16 01:32:34,890 - INFO - step 6744, loss: 2.468318, best loss: 2.205240 2025-01-16 01:32:35,041 - INFO - step 6745, loss: 2.876525, best loss: 2.205240 2025-01-16 01:32:35,191 - INFO - step 6746, loss: 3.111236, best loss: 2.205240 2025-01-16 01:32:35,341 - INFO - step 6747, loss: 3.014496, best loss: 2.205240 2025-01-16 01:32:35,491 - INFO - step 6748, loss: 2.789345, best loss: 2.205240 2025-01-16 01:32:35,641 - INFO - step 6749, loss: 2.813757, best loss: 2.205240 2025-01-16 01:32:35,791 - INFO - step 6750, loss: 3.218106, best loss: 2.205240 2025-01-16 01:32:35,941 - INFO - step 6751, loss: 3.168321, best loss: 2.205240 2025-01-16 01:32:36,091 - INFO - step 6752, loss: 3.056824, best loss: 2.205240 2025-01-16 01:32:36,241 - INFO - step 6753, loss: 2.793078, best loss: 2.205240 2025-01-16 01:32:36,391 - INFO - step 6754, loss: 3.063972, best loss: 2.205240 2025-01-16 01:32:36,541 - INFO - step 6755, loss: 2.941173, best loss: 2.205240 2025-01-16 01:32:36,692 - INFO - step 6756, loss: 2.698136, best loss: 2.205240 2025-01-16 01:32:36,841 - INFO - step 6757, loss: 2.991201, best loss: 2.205240 2025-01-16 01:32:36,991 - INFO - step 6758, loss: 2.743114, best loss: 2.205240 2025-01-16 01:32:37,142 - INFO - step 6759, loss: 3.052960, best 
loss: 2.205240 2025-01-16 01:32:37,292 - INFO - step 6760, loss: 2.737608, best loss: 2.205240 2025-01-16 01:32:37,442 - INFO - step 6761, loss: 2.864442, best loss: 2.205240 2025-01-16 01:32:37,592 - INFO - step 6762, loss: 2.714874, best loss: 2.205240 2025-01-16 01:32:37,742 - INFO - step 6763, loss: 2.715959, best loss: 2.205240 2025-01-16 01:32:37,892 - INFO - step 6764, loss: 3.060389, best loss: 2.205240 2025-01-16 01:32:38,043 - INFO - step 6765, loss: 2.940114, best loss: 2.205240 2025-01-16 01:32:38,193 - INFO - step 6766, loss: 3.165020, best loss: 2.205240 2025-01-16 01:32:38,343 - INFO - step 6767, loss: 2.826374, best loss: 2.205240 2025-01-16 01:32:38,493 - INFO - step 6768, loss: 2.982371, best loss: 2.205240 2025-01-16 01:32:38,643 - INFO - step 6769, loss: 3.039015, best loss: 2.205240 2025-01-16 01:32:38,793 - INFO - step 6770, loss: 2.702533, best loss: 2.205240 2025-01-16 01:32:38,943 - INFO - step 6771, loss: 2.611326, best loss: 2.205240 2025-01-16 01:32:39,094 - INFO - step 6772, loss: 2.680613, best loss: 2.205240 2025-01-16 01:32:39,244 - INFO - step 6773, loss: 2.911021, best loss: 2.205240 2025-01-16 01:32:39,394 - INFO - step 6774, loss: 2.938171, best loss: 2.205240 2025-01-16 01:32:39,544 - INFO - step 6775, loss: 3.007629, best loss: 2.205240 2025-01-16 01:32:39,694 - INFO - step 6776, loss: 3.280729, best loss: 2.205240 2025-01-16 01:32:39,845 - INFO - step 6777, loss: 3.181959, best loss: 2.205240 2025-01-16 01:32:39,995 - INFO - step 6778, loss: 3.187368, best loss: 2.205240 2025-01-16 01:32:40,145 - INFO - step 6779, loss: 3.180614, best loss: 2.205240 2025-01-16 01:32:40,294 - INFO - step 6780, loss: 3.160146, best loss: 2.205240 2025-01-16 01:32:40,444 - INFO - step 6781, loss: 2.701434, best loss: 2.205240 2025-01-16 01:32:40,594 - INFO - step 6782, loss: 3.098281, best loss: 2.205240 2025-01-16 01:32:40,745 - INFO - step 6783, loss: 3.244469, best loss: 2.205240 2025-01-16 01:32:40,895 - INFO - step 6784, loss: 3.031892, best 
loss: 2.205240 2025-01-16 01:32:41,045 - INFO - step 6785, loss: 3.031430, best loss: 2.205240 2025-01-16 01:32:41,195 - INFO - step 6786, loss: 3.131296, best loss: 2.205240 2025-01-16 01:32:41,346 - INFO - step 6787, loss: 2.798841, best loss: 2.205240 2025-01-16 01:32:44,818 - INFO - step 6788, loss: 2.052101, best loss: 2.052101 2025-01-16 01:32:44,980 - INFO - step 6789, loss: 3.012624, best loss: 2.052101 2025-01-16 01:32:45,134 - INFO - step 6790, loss: 3.025234, best loss: 2.052101 2025-01-16 01:32:45,284 - INFO - step 6791, loss: 3.095355, best loss: 2.052101 2025-01-16 01:32:45,435 - INFO - step 6792, loss: 3.151643, best loss: 2.052101 2025-01-16 01:32:45,585 - INFO - step 6793, loss: 2.902678, best loss: 2.052101 2025-01-16 01:32:45,735 - INFO - step 6794, loss: 3.049060, best loss: 2.052101 2025-01-16 01:32:45,885 - INFO - step 6795, loss: 3.024719, best loss: 2.052101 2025-01-16 01:32:46,035 - INFO - step 6796, loss: 2.908390, best loss: 2.052101 2025-01-16 01:32:46,186 - INFO - step 6797, loss: 2.949693, best loss: 2.052101 2025-01-16 01:32:46,336 - INFO - step 6798, loss: 2.872202, best loss: 2.052101 2025-01-16 01:32:46,486 - INFO - step 6799, loss: 2.751563, best loss: 2.052101 2025-01-16 01:32:46,636 - INFO - step 6800, loss: 3.004226, best loss: 2.052101 2025-01-16 01:32:46,787 - INFO - step 6801, loss: 2.813469, best loss: 2.052101 2025-01-16 01:32:46,937 - INFO - step 6802, loss: 3.052665, best loss: 2.052101 2025-01-16 01:32:47,087 - INFO - step 6803, loss: 3.185306, best loss: 2.052101 2025-01-16 01:32:47,237 - INFO - step 6804, loss: 2.883555, best loss: 2.052101 2025-01-16 01:32:47,387 - INFO - step 6805, loss: 2.638365, best loss: 2.052101 2025-01-16 01:32:47,538 - INFO - step 6806, loss: 2.981344, best loss: 2.052101 2025-01-16 01:32:47,688 - INFO - step 6807, loss: 3.040169, best loss: 2.052101 2025-01-16 01:32:47,838 - INFO - step 6808, loss: 3.076918, best loss: 2.052101 2025-01-16 01:32:47,988 - INFO - step 6809, loss: 3.057020, best 
loss: 2.052101 2025-01-16 01:32:48,138 - INFO - step 6810, loss: 3.050020, best loss: 2.052101 2025-01-16 01:32:48,288 - INFO - step 6811, loss: 2.864343, best loss: 2.052101 2025-01-16 01:32:48,439 - INFO - step 6812, loss: 3.173598, best loss: 2.052101 2025-01-16 01:32:48,589 - INFO - step 6813, loss: 2.848503, best loss: 2.052101 2025-01-16 01:32:48,739 - INFO - step 6814, loss: 2.939194, best loss: 2.052101 2025-01-16 01:32:48,889 - INFO - step 6815, loss: 3.124937, best loss: 2.052101 2025-01-16 01:32:49,039 - INFO - step 6816, loss: 3.207623, best loss: 2.052101 2025-01-16 01:32:49,189 - INFO - step 6817, loss: 3.065683, best loss: 2.052101 2025-01-16 01:32:49,339 - INFO - step 6818, loss: 2.886032, best loss: 2.052101 2025-01-16 01:32:49,490 - INFO - step 6819, loss: 2.905387, best loss: 2.052101 2025-01-16 01:32:49,641 - INFO - step 6820, loss: 3.081453, best loss: 2.052101 2025-01-16 01:32:49,791 - INFO - step 6821, loss: 3.247164, best loss: 2.052101 2025-01-16 01:32:49,941 - INFO - step 6822, loss: 3.102934, best loss: 2.052101 2025-01-16 01:32:50,092 - INFO - step 6823, loss: 3.340590, best loss: 2.052101 2025-01-16 01:32:50,242 - INFO - step 6824, loss: 3.300192, best loss: 2.052101 2025-01-16 01:32:50,392 - INFO - step 6825, loss: 3.089921, best loss: 2.052101 2025-01-16 01:32:50,542 - INFO - step 6826, loss: 3.385566, best loss: 2.052101 2025-01-16 01:32:50,692 - INFO - step 6827, loss: 3.103641, best loss: 2.052101 2025-01-16 01:32:50,842 - INFO - step 6828, loss: 2.710932, best loss: 2.052101 2025-01-16 01:32:50,993 - INFO - step 6829, loss: 3.199899, best loss: 2.052101 2025-01-16 01:32:51,143 - INFO - step 6830, loss: 3.142229, best loss: 2.052101 2025-01-16 01:32:51,293 - INFO - step 6831, loss: 3.106181, best loss: 2.052101 2025-01-16 01:32:51,443 - INFO - step 6832, loss: 2.841619, best loss: 2.052101 2025-01-16 01:32:51,594 - INFO - step 6833, loss: 3.055032, best loss: 2.052101 2025-01-16 01:32:51,744 - INFO - step 6834, loss: 3.105650, best 
loss: 2.052101 2025-01-16 01:32:51,894 - INFO - step 6835, loss: 2.962587, best loss: 2.052101 2025-01-16 01:32:52,044 - INFO - step 6836, loss: 3.133134, best loss: 2.052101 2025-01-16 01:32:52,194 - INFO - step 6837, loss: 2.784910, best loss: 2.052101 2025-01-16 01:32:52,345 - INFO - step 6838, loss: 2.677120, best loss: 2.052101 2025-01-16 01:32:52,495 - INFO - step 6839, loss: 3.163414, best loss: 2.052101 2025-01-16 01:32:52,645 - INFO - step 6840, loss: 3.180274, best loss: 2.052101 2025-01-16 01:32:52,795 - INFO - step 6841, loss: 3.192368, best loss: 2.052101 2025-01-16 01:32:52,946 - INFO - step 6842, loss: 3.063331, best loss: 2.052101 2025-01-16 01:32:53,096 - INFO - step 6843, loss: 3.179916, best loss: 2.052101 2025-01-16 01:32:53,246 - INFO - step 6844, loss: 3.220096, best loss: 2.052101 2025-01-16 01:32:53,396 - INFO - step 6845, loss: 2.890078, best loss: 2.052101 2025-01-16 01:32:53,546 - INFO - step 6846, loss: 2.916157, best loss: 2.052101 2025-01-16 01:32:53,696 - INFO - step 6847, loss: 3.205647, best loss: 2.052101 2025-01-16 01:32:53,847 - INFO - step 6848, loss: 2.819164, best loss: 2.052101 2025-01-16 01:32:53,997 - INFO - step 6849, loss: 2.610211, best loss: 2.052101 2025-01-16 01:32:54,147 - INFO - step 6850, loss: 2.989933, best loss: 2.052101 2025-01-16 01:32:54,297 - INFO - step 6851, loss: 3.006516, best loss: 2.052101 2025-01-16 01:32:54,447 - INFO - step 6852, loss: 2.992334, best loss: 2.052101 2025-01-16 01:32:54,597 - INFO - step 6853, loss: 2.557070, best loss: 2.052101 2025-01-16 01:32:54,748 - INFO - step 6854, loss: 2.376716, best loss: 2.052101 2025-01-16 01:32:54,898 - INFO - step 6855, loss: 2.437876, best loss: 2.052101 2025-01-16 01:32:55,048 - INFO - step 6856, loss: 2.627028, best loss: 2.052101 2025-01-16 01:32:55,198 - INFO - step 6857, loss: 2.858264, best loss: 2.052101 2025-01-16 01:32:55,348 - INFO - step 6858, loss: 3.112136, best loss: 2.052101 2025-01-16 01:32:55,499 - INFO - step 6859, loss: 3.014489, best 
loss: 2.052101 2025-01-16 01:32:55,649 - INFO - step 6860, loss: 2.859433, best loss: 2.052101 2025-01-16 01:32:55,799 - INFO - step 6861, loss: 2.973655, best loss: 2.052101 2025-01-16 01:32:55,949 - INFO - step 6862, loss: 2.910349, best loss: 2.052101 2025-01-16 01:32:56,099 - INFO - step 6863, loss: 2.771057, best loss: 2.052101 2025-01-16 01:32:56,250 - INFO - step 6864, loss: 2.949166, best loss: 2.052101 2025-01-16 01:32:56,400 - INFO - step 6865, loss: 2.982183, best loss: 2.052101 2025-01-16 01:32:56,550 - INFO - step 6866, loss: 2.657115, best loss: 2.052101 2025-01-16 01:32:56,700 - INFO - step 6867, loss: 2.666125, best loss: 2.052101 2025-01-16 01:32:56,850 - INFO - step 6868, loss: 2.851327, best loss: 2.052101 2025-01-16 01:32:57,000 - INFO - step 6869, loss: 2.998091, best loss: 2.052101 2025-01-16 01:32:57,150 - INFO - step 6870, loss: 2.660072, best loss: 2.052101 2025-01-16 01:32:57,300 - INFO - step 6871, loss: 2.687077, best loss: 2.052101 2025-01-16 01:32:57,451 - INFO - step 6872, loss: 2.926603, best loss: 2.052101 2025-01-16 01:32:57,601 - INFO - step 6873, loss: 2.663129, best loss: 2.052101 2025-01-16 01:32:57,751 - INFO - step 6874, loss: 2.643239, best loss: 2.052101 2025-01-16 01:32:57,901 - INFO - step 6875, loss: 3.009118, best loss: 2.052101 2025-01-16 01:32:58,051 - INFO - step 6876, loss: 2.755116, best loss: 2.052101 2025-01-16 01:32:58,201 - INFO - step 6877, loss: 2.708889, best loss: 2.052101 2025-01-16 01:32:58,352 - INFO - step 6878, loss: 2.677949, best loss: 2.052101 2025-01-16 01:32:58,502 - INFO - step 6879, loss: 2.955669, best loss: 2.052101 2025-01-16 01:32:58,652 - INFO - step 6880, loss: 2.606021, best loss: 2.052101 2025-01-16 01:32:58,802 - INFO - step 6881, loss: 2.693957, best loss: 2.052101 2025-01-16 01:32:58,952 - INFO - step 6882, loss: 2.627619, best loss: 2.052101 2025-01-16 01:32:59,102 - INFO - step 6883, loss: 2.889357, best loss: 2.052101 2025-01-16 01:32:59,252 - INFO - step 6884, loss: 3.215812, best 
loss: 2.052101 2025-01-16 01:32:59,403 - INFO - step 6885, loss: 3.284768, best loss: 2.052101 2025-01-16 01:32:59,553 - INFO - step 6886, loss: 3.000349, best loss: 2.052101 2025-01-16 01:32:59,703 - INFO - step 6887, loss: 3.053312, best loss: 2.052101 2025-01-16 01:32:59,853 - INFO - step 6888, loss: 2.973953, best loss: 2.052101 2025-01-16 01:33:00,003 - INFO - step 6889, loss: 2.919538, best loss: 2.052101 2025-01-16 01:33:00,153 - INFO - step 6890, loss: 2.751803, best loss: 2.052101 2025-01-16 01:33:00,303 - INFO - step 6891, loss: 2.966287, best loss: 2.052101 2025-01-16 01:33:00,454 - INFO - step 6892, loss: 2.735962, best loss: 2.052101 2025-01-16 01:33:00,604 - INFO - step 6893, loss: 2.648008, best loss: 2.052101 2025-01-16 01:33:00,754 - INFO - step 6894, loss: 2.807982, best loss: 2.052101 2025-01-16 01:33:00,904 - INFO - step 6895, loss: 2.797840, best loss: 2.052101 2025-01-16 01:33:01,054 - INFO - step 6896, loss: 2.929357, best loss: 2.052101 2025-01-16 01:33:01,204 - INFO - step 6897, loss: 2.295713, best loss: 2.052101 2025-01-16 01:33:01,354 - INFO - step 6898, loss: 2.842228, best loss: 2.052101 2025-01-16 01:33:01,504 - INFO - step 6899, loss: 2.969907, best loss: 2.052101 2025-01-16 01:33:01,655 - INFO - step 6900, loss: 2.857138, best loss: 2.052101 2025-01-16 01:33:01,805 - INFO - step 6901, loss: 2.799299, best loss: 2.052101 2025-01-16 01:33:01,956 - INFO - step 6902, loss: 2.728848, best loss: 2.052101 2025-01-16 01:33:02,106 - INFO - step 6903, loss: 2.761499, best loss: 2.052101 2025-01-16 01:33:02,256 - INFO - step 6904, loss: 2.662052, best loss: 2.052101 2025-01-16 01:33:02,406 - INFO - step 6905, loss: 2.815019, best loss: 2.052101 2025-01-16 01:33:02,556 - INFO - step 6906, loss: 2.771511, best loss: 2.052101 2025-01-16 01:33:02,706 - INFO - step 6907, loss: 2.881484, best loss: 2.052101 2025-01-16 01:33:02,856 - INFO - step 6908, loss: 2.709316, best loss: 2.052101 2025-01-16 01:33:03,006 - INFO - step 6909, loss: 2.756704, best 
loss: 2.052101 2025-01-16 01:33:03,156 - INFO - step 6910, loss: 2.541289, best loss: 2.052101 2025-01-16 01:33:03,306 - INFO - step 6911, loss: 2.584910, best loss: 2.052101 2025-01-16 01:33:03,456 - INFO - step 6912, loss: 2.759225, best loss: 2.052101 2025-01-16 01:33:03,607 - INFO - step 6913, loss: 2.620549, best loss: 2.052101 2025-01-16 01:33:03,756 - INFO - step 6914, loss: 2.674351, best loss: 2.052101 2025-01-16 01:33:03,907 - INFO - step 6915, loss: 2.406401, best loss: 2.052101 2025-01-16 01:33:04,057 - INFO - step 6916, loss: 2.340974, best loss: 2.052101 2025-01-16 01:33:04,207 - INFO - step 6917, loss: 2.230207, best loss: 2.052101 2025-01-16 01:33:04,357 - INFO - step 6918, loss: 2.846661, best loss: 2.052101 2025-01-16 01:33:04,507 - INFO - step 6919, loss: 3.011263, best loss: 2.052101 2025-01-16 01:33:04,657 - INFO - step 6920, loss: 3.028683, best loss: 2.052101 2025-01-16 01:33:04,807 - INFO - step 6921, loss: 3.203475, best loss: 2.052101 2025-01-16 01:33:04,957 - INFO - step 6922, loss: 3.148466, best loss: 2.052101 2025-01-16 01:33:05,107 - INFO - step 6923, loss: 2.808941, best loss: 2.052101 2025-01-16 01:33:05,258 - INFO - step 6924, loss: 2.884237, best loss: 2.052101 2025-01-16 01:33:05,408 - INFO - step 6925, loss: 3.145716, best loss: 2.052101 2025-01-16 01:33:05,558 - INFO - step 6926, loss: 2.804937, best loss: 2.052101 2025-01-16 01:33:05,708 - INFO - step 6927, loss: 2.393059, best loss: 2.052101 2025-01-16 01:33:05,859 - INFO - step 6928, loss: 2.667691, best loss: 2.052101 2025-01-16 01:33:06,009 - INFO - step 6929, loss: 2.561450, best loss: 2.052101 2025-01-16 01:33:06,159 - INFO - step 6930, loss: 3.060035, best loss: 2.052101 2025-01-16 01:33:06,309 - INFO - step 6931, loss: 2.937819, best loss: 2.052101 2025-01-16 01:33:06,459 - INFO - step 6932, loss: 2.953162, best loss: 2.052101 2025-01-16 01:33:06,609 - INFO - step 6933, loss: 2.970712, best loss: 2.052101 2025-01-16 01:33:06,760 - INFO - step 6934, loss: 2.961108, best 
loss: 2.052101 2025-01-16 01:33:06,910 - INFO - step 6935, loss: 2.637666, best loss: 2.052101 2025-01-16 01:33:07,060 - INFO - step 6936, loss: 3.094105, best loss: 2.052101 2025-01-16 01:33:07,210 - INFO - step 6937, loss: 2.940068, best loss: 2.052101 2025-01-16 01:33:07,360 - INFO - step 6938, loss: 3.092062, best loss: 2.052101 2025-01-16 01:33:07,510 - INFO - step 6939, loss: 3.024558, best loss: 2.052101 2025-01-16 01:33:07,661 - INFO - step 6940, loss: 2.830776, best loss: 2.052101 2025-01-16 01:33:07,811 - INFO - step 6941, loss: 3.052289, best loss: 2.052101 2025-01-16 01:33:07,961 - INFO - step 6942, loss: 2.654562, best loss: 2.052101 2025-01-16 01:33:08,111 - INFO - step 6943, loss: 2.964556, best loss: 2.052101 2025-01-16 01:33:08,261 - INFO - step 6944, loss: 3.088275, best loss: 2.052101 2025-01-16 01:33:08,411 - INFO - step 6945, loss: 3.017461, best loss: 2.052101 2025-01-16 01:33:08,561 - INFO - step 6946, loss: 2.969103, best loss: 2.052101 2025-01-16 01:33:08,711 - INFO - step 6947, loss: 2.845062, best loss: 2.052101 2025-01-16 01:33:08,861 - INFO - step 6948, loss: 2.795167, best loss: 2.052101 2025-01-16 01:33:09,011 - INFO - step 6949, loss: 2.802910, best loss: 2.052101 2025-01-16 01:33:09,161 - INFO - step 6950, loss: 2.448121, best loss: 2.052101 2025-01-16 01:33:09,312 - INFO - step 6951, loss: 3.118030, best loss: 2.052101 2025-01-16 01:33:09,462 - INFO - step 6952, loss: 2.406029, best loss: 2.052101 2025-01-16 01:33:09,613 - INFO - step 6953, loss: 2.507898, best loss: 2.052101 2025-01-16 01:33:09,763 - INFO - step 6954, loss: 2.795434, best loss: 2.052101 2025-01-16 01:33:09,913 - INFO - step 6955, loss: 2.917742, best loss: 2.052101 2025-01-16 01:33:10,063 - INFO - step 6956, loss: 2.748000, best loss: 2.052101 2025-01-16 01:33:10,213 - INFO - step 6957, loss: 2.554642, best loss: 2.052101 2025-01-16 01:33:10,363 - INFO - step 6958, loss: 2.754939, best loss: 2.052101 2025-01-16 01:33:10,513 - INFO - step 6959, loss: 2.897527, best 
loss: 2.052101 2025-01-16 01:33:10,663 - INFO - step 6960, loss: 2.560093, best loss: 2.052101 2025-01-16 01:33:10,814 - INFO - step 6961, loss: 2.488131, best loss: 2.052101 2025-01-16 01:33:10,964 - INFO - step 6962, loss: 2.823386, best loss: 2.052101 2025-01-16 01:33:11,114 - INFO - step 6963, loss: 2.768724, best loss: 2.052101 2025-01-16 01:33:11,264 - INFO - step 6964, loss: 2.680747, best loss: 2.052101 2025-01-16 01:33:11,415 - INFO - step 6965, loss: 2.500194, best loss: 2.052101 2025-01-16 01:33:11,565 - INFO - step 6966, loss: 2.745737, best loss: 2.052101 2025-01-16 01:33:11,715 - INFO - step 6967, loss: 3.043853, best loss: 2.052101 2025-01-16 01:33:11,865 - INFO - step 6968, loss: 2.766170, best loss: 2.052101 2025-01-16 01:33:12,015 - INFO - step 6969, loss: 2.812841, best loss: 2.052101 2025-01-16 01:33:12,165 - INFO - step 6970, loss: 3.033754, best loss: 2.052101 2025-01-16 01:33:12,314 - INFO - step 6971, loss: 2.963965, best loss: 2.052101 2025-01-16 01:33:12,464 - INFO - step 6972, loss: 2.929574, best loss: 2.052101 2025-01-16 01:33:12,615 - INFO - step 6973, loss: 2.850199, best loss: 2.052101 2025-01-16 01:33:12,765 - INFO - step 6974, loss: 2.809232, best loss: 2.052101 2025-01-16 01:33:12,915 - INFO - step 6975, loss: 2.652514, best loss: 2.052101 2025-01-16 01:33:13,065 - INFO - step 6976, loss: 3.173653, best loss: 2.052101 2025-01-16 01:33:13,216 - INFO - step 6977, loss: 2.811614, best loss: 2.052101 2025-01-16 01:33:13,366 - INFO - step 6978, loss: 3.055461, best loss: 2.052101 2025-01-16 01:33:13,516 - INFO - step 6979, loss: 2.467453, best loss: 2.052101 2025-01-16 01:33:13,666 - INFO - step 6980, loss: 2.575465, best loss: 2.052101 2025-01-16 01:33:13,816 - INFO - step 6981, loss: 2.916059, best loss: 2.052101 2025-01-16 01:33:13,966 - INFO - step 6982, loss: 2.855898, best loss: 2.052101 2025-01-16 01:33:14,116 - INFO - step 6983, loss: 2.811337, best loss: 2.052101 2025-01-16 01:33:14,267 - INFO - step 6984, loss: 2.912078, best 
loss: 2.052101 2025-01-16 01:33:14,417 - INFO - step 6985, loss: 2.703455, best loss: 2.052101 2025-01-16 01:33:14,567 - INFO - step 6986, loss: 2.795896, best loss: 2.052101 2025-01-16 01:33:14,718 - INFO - step 6987, loss: 2.919229, best loss: 2.052101 2025-01-16 01:33:14,868 - INFO - step 6988, loss: 2.491716, best loss: 2.052101 2025-01-16 01:33:15,018 - INFO - step 6989, loss: 2.773101, best loss: 2.052101 2025-01-16 01:33:15,168 - INFO - step 6990, loss: 2.884710, best loss: 2.052101 2025-01-16 01:33:15,318 - INFO - step 6991, loss: 2.997487, best loss: 2.052101 2025-01-16 01:33:15,468 - INFO - step 6992, loss: 2.853075, best loss: 2.052101 2025-01-16 01:33:15,619 - INFO - step 6993, loss: 2.996310, best loss: 2.052101 2025-01-16 01:33:15,769 - INFO - step 6994, loss: 2.885244, best loss: 2.052101 2025-01-16 01:33:15,919 - INFO - step 6995, loss: 2.532861, best loss: 2.052101 2025-01-16 01:33:16,069 - INFO - step 6996, loss: 3.023727, best loss: 2.052101 2025-01-16 01:33:16,219 - INFO - step 6997, loss: 2.468698, best loss: 2.052101 2025-01-16 01:33:16,369 - INFO - step 6998, loss: 2.672115, best loss: 2.052101 2025-01-16 01:33:16,519 - INFO - step 6999, loss: 2.786547, best loss: 2.052101 2025-01-16 01:33:16,669 - INFO - step 7000, loss: 2.679437, best loss: 2.052101 2025-01-16 01:33:16,820 - INFO - step 7001, loss: 2.683080, best loss: 2.052101 2025-01-16 01:33:16,970 - INFO - step 7002, loss: 2.866860, best loss: 2.052101 2025-01-16 01:33:17,120 - INFO - step 7003, loss: 3.007997, best loss: 2.052101 2025-01-16 01:33:17,270 - INFO - step 7004, loss: 2.901036, best loss: 2.052101 2025-01-16 01:33:17,420 - INFO - step 7005, loss: 3.063915, best loss: 2.052101 2025-01-16 01:33:17,571 - INFO - step 7006, loss: 2.812859, best loss: 2.052101 2025-01-16 01:33:17,721 - INFO - step 7007, loss: 2.907697, best loss: 2.052101 2025-01-16 01:33:17,871 - INFO - step 7008, loss: 2.814229, best loss: 2.052101 2025-01-16 01:33:18,021 - INFO - step 7009, loss: 2.412803, best 
loss: 2.052101 2025-01-16 01:33:18,171 - INFO - step 7010, loss: 3.023375, best loss: 2.052101 2025-01-16 01:33:18,320 - INFO - step 7011, loss: 2.859163, best loss: 2.052101 2025-01-16 01:33:18,471 - INFO - step 7012, loss: 2.723654, best loss: 2.052101 2025-01-16 01:33:18,621 - INFO - step 7013, loss: 2.663294, best loss: 2.052101 2025-01-16 01:33:18,771 - INFO - step 7014, loss: 2.518704, best loss: 2.052101 2025-01-16 01:33:18,921 - INFO - step 7015, loss: 2.671811, best loss: 2.052101 2025-01-16 01:33:19,071 - INFO - step 7016, loss: 2.570959, best loss: 2.052101 2025-01-16 01:33:19,222 - INFO - step 7017, loss: 2.565787, best loss: 2.052101 2025-01-16 01:33:19,372 - INFO - step 7018, loss: 2.979747, best loss: 2.052101 2025-01-16 01:33:19,522 - INFO - step 7019, loss: 2.934286, best loss: 2.052101 2025-01-16 01:33:19,672 - INFO - step 7020, loss: 2.706066, best loss: 2.052101 2025-01-16 01:33:19,823 - INFO - step 7021, loss: 2.968464, best loss: 2.052101 2025-01-16 01:33:19,973 - INFO - step 7022, loss: 2.780773, best loss: 2.052101 2025-01-16 01:33:20,123 - INFO - step 7023, loss: 2.996086, best loss: 2.052101 2025-01-16 01:33:20,273 - INFO - step 7024, loss: 3.099832, best loss: 2.052101 2025-01-16 01:33:20,423 - INFO - step 7025, loss: 3.151318, best loss: 2.052101 2025-01-16 01:33:20,573 - INFO - step 7026, loss: 3.147961, best loss: 2.052101 2025-01-16 01:33:20,723 - INFO - step 7027, loss: 2.883787, best loss: 2.052101 2025-01-16 01:33:20,873 - INFO - step 7028, loss: 3.029504, best loss: 2.052101 2025-01-16 01:33:21,024 - INFO - step 7029, loss: 3.005371, best loss: 2.052101 2025-01-16 01:33:21,174 - INFO - step 7030, loss: 2.908123, best loss: 2.052101 2025-01-16 01:33:21,324 - INFO - step 7031, loss: 3.249235, best loss: 2.052101 2025-01-16 01:33:21,475 - INFO - step 7032, loss: 3.141985, best loss: 2.052101 2025-01-16 01:33:21,624 - INFO - step 7033, loss: 3.215083, best loss: 2.052101 2025-01-16 01:33:21,775 - INFO - step 7034, loss: 2.940121, best 
loss: 2.052101 2025-01-16 01:33:21,925 - INFO - step 7035, loss: 2.953097, best loss: 2.052101 2025-01-16 01:33:22,074 - INFO - step 7036, loss: 2.998961, best loss: 2.052101 2025-01-16 01:33:22,225 - INFO - step 7037, loss: 2.759063, best loss: 2.052101 2025-01-16 01:33:22,375 - INFO - step 7038, loss: 2.756884, best loss: 2.052101 2025-01-16 01:33:22,525 - INFO - step 7039, loss: 2.986227, best loss: 2.052101 2025-01-16 01:33:22,675 - INFO - step 7040, loss: 2.971867, best loss: 2.052101 2025-01-16 01:33:22,825 - INFO - step 7041, loss: 3.142765, best loss: 2.052101 2025-01-16 01:33:22,975 - INFO - step 7042, loss: 3.001672, best loss: 2.052101 2025-01-16 01:33:23,125 - INFO - step 7043, loss: 2.844262, best loss: 2.052101 2025-01-16 01:33:23,275 - INFO - step 7044, loss: 3.204060, best loss: 2.052101 2025-01-16 01:33:23,425 - INFO - step 7045, loss: 2.905992, best loss: 2.052101 2025-01-16 01:33:23,576 - INFO - step 7046, loss: 3.007101, best loss: 2.052101 2025-01-16 01:33:23,726 - INFO - step 7047, loss: 2.740371, best loss: 2.052101 2025-01-16 01:33:23,877 - INFO - step 7048, loss: 2.902467, best loss: 2.052101 2025-01-16 01:33:24,027 - INFO - step 7049, loss: 2.926232, best loss: 2.052101 2025-01-16 01:33:24,177 - INFO - step 7050, loss: 2.993234, best loss: 2.052101 2025-01-16 01:33:24,327 - INFO - step 7051, loss: 2.902844, best loss: 2.052101 2025-01-16 01:33:24,478 - INFO - step 7052, loss: 2.972641, best loss: 2.052101 2025-01-16 01:33:24,628 - INFO - step 7053, loss: 2.652072, best loss: 2.052101 2025-01-16 01:33:24,778 - INFO - step 7054, loss: 2.412517, best loss: 2.052101 2025-01-16 01:33:24,928 - INFO - step 7055, loss: 2.620303, best loss: 2.052101 2025-01-16 01:33:25,078 - INFO - step 7056, loss: 2.795763, best loss: 2.052101 2025-01-16 01:33:25,228 - INFO - step 7057, loss: 3.232686, best loss: 2.052101 2025-01-16 01:33:25,379 - INFO - step 7058, loss: 2.819821, best loss: 2.052101 2025-01-16 01:33:25,529 - INFO - step 7059, loss: 2.390925, best 
loss: 2.052101 2025-01-16 01:33:25,679 - INFO - step 7060, loss: 2.978002, best loss: 2.052101 2025-01-16 01:33:25,829 - INFO - step 7061, loss: 2.768423, best loss: 2.052101 2025-01-16 01:33:25,979 - INFO - step 7062, loss: 3.268663, best loss: 2.052101 2025-01-16 01:33:26,129 - INFO - step 7063, loss: 2.665722, best loss: 2.052101 2025-01-16 01:33:26,280 - INFO - step 7064, loss: 2.959175, best loss: 2.052101 2025-01-16 01:33:26,430 - INFO - step 7065, loss: 2.725344, best loss: 2.052101 2025-01-16 01:33:26,580 - INFO - step 7066, loss: 2.933837, best loss: 2.052101 2025-01-16 01:33:26,730 - INFO - step 7067, loss: 2.832978, best loss: 2.052101 2025-01-16 01:33:26,880 - INFO - step 7068, loss: 2.735244, best loss: 2.052101 2025-01-16 01:33:27,030 - INFO - step 7069, loss: 2.925350, best loss: 2.052101 2025-01-16 01:33:27,180 - INFO - step 7070, loss: 2.894215, best loss: 2.052101 2025-01-16 01:33:27,330 - INFO - step 7071, loss: 2.703187, best loss: 2.052101 2025-01-16 01:33:27,480 - INFO - step 7072, loss: 3.195853, best loss: 2.052101 2025-01-16 01:33:27,630 - INFO - step 7073, loss: 2.775744, best loss: 2.052101 2025-01-16 01:33:27,780 - INFO - step 7074, loss: 2.412192, best loss: 2.052101 2025-01-16 01:33:27,931 - INFO - step 7075, loss: 2.794557, best loss: 2.052101 2025-01-16 01:33:28,081 - INFO - step 7076, loss: 2.994089, best loss: 2.052101 2025-01-16 01:33:28,231 - INFO - step 7077, loss: 2.870935, best loss: 2.052101 2025-01-16 01:33:28,381 - INFO - step 7078, loss: 2.665680, best loss: 2.052101 2025-01-16 01:33:28,531 - INFO - step 7079, loss: 2.696973, best loss: 2.052101 2025-01-16 01:33:28,681 - INFO - step 7080, loss: 3.117604, best loss: 2.052101 2025-01-16 01:33:28,832 - INFO - step 7081, loss: 3.087276, best loss: 2.052101 2025-01-16 01:33:28,981 - INFO - step 7082, loss: 2.977493, best loss: 2.052101 2025-01-16 01:33:29,131 - INFO - step 7083, loss: 2.648298, best loss: 2.052101 2025-01-16 01:33:29,282 - INFO - step 7084, loss: 2.923519, best 
loss: 2.052101 2025-01-16 01:33:29,432 - INFO - step 7085, loss: 2.884708, best loss: 2.052101 2025-01-16 01:33:29,583 - INFO - step 7086, loss: 2.595374, best loss: 2.052101 2025-01-16 01:33:29,733 - INFO - step 7087, loss: 2.865116, best loss: 2.052101 2025-01-16 01:33:29,883 - INFO - step 7088, loss: 2.696731, best loss: 2.052101 2025-01-16 01:33:30,033 - INFO - step 7089, loss: 2.917830, best loss: 2.052101 2025-01-16 01:33:30,183 - INFO - step 7090, loss: 2.664852, best loss: 2.052101 2025-01-16 01:33:30,333 - INFO - step 7091, loss: 2.773350, best loss: 2.052101 2025-01-16 01:33:30,484 - INFO - step 7092, loss: 2.574919, best loss: 2.052101 2025-01-16 01:33:30,634 - INFO - step 7093, loss: 2.528922, best loss: 2.052101 2025-01-16 01:33:30,784 - INFO - step 7094, loss: 2.956845, best loss: 2.052101 2025-01-16 01:33:30,934 - INFO - step 7095, loss: 2.776415, best loss: 2.052101 2025-01-16 01:33:31,084 - INFO - step 7096, loss: 3.059580, best loss: 2.052101 2025-01-16 01:33:31,234 - INFO - step 7097, loss: 2.728337, best loss: 2.052101 2025-01-16 01:33:31,384 - INFO - step 7098, loss: 2.881626, best loss: 2.052101 2025-01-16 01:33:31,534 - INFO - step 7099, loss: 2.939569, best loss: 2.052101 2025-01-16 01:33:31,684 - INFO - step 7100, loss: 2.637196, best loss: 2.052101 2025-01-16 01:33:31,834 - INFO - step 7101, loss: 2.530240, best loss: 2.052101 2025-01-16 01:33:31,985 - INFO - step 7102, loss: 2.584670, best loss: 2.052101 2025-01-16 01:33:32,135 - INFO - step 7103, loss: 2.770308, best loss: 2.052101 2025-01-16 01:33:32,285 - INFO - step 7104, loss: 2.826821, best loss: 2.052101 2025-01-16 01:33:32,435 - INFO - step 7105, loss: 2.925539, best loss: 2.052101 2025-01-16 01:33:32,586 - INFO - step 7106, loss: 3.149812, best loss: 2.052101 2025-01-16 01:33:32,736 - INFO - step 7107, loss: 3.024060, best loss: 2.052101 2025-01-16 01:33:32,886 - INFO - step 7108, loss: 3.010767, best loss: 2.052101 2025-01-16 01:33:33,036 - INFO - step 7109, loss: 3.048898, best 
loss: 2.052101 2025-01-16 01:33:33,186 - INFO - step 7110, loss: 3.058146, best loss: 2.052101 2025-01-16 01:33:33,336 - INFO - step 7111, loss: 2.602490, best loss: 2.052101 2025-01-16 01:33:33,486 - INFO - step 7112, loss: 3.071935, best loss: 2.052101 2025-01-16 01:33:33,636 - INFO - step 7113, loss: 3.159494, best loss: 2.052101 2025-01-16 01:33:33,787 - INFO - step 7114, loss: 2.924166, best loss: 2.052101 2025-01-16 01:33:33,937 - INFO - step 7115, loss: 2.931494, best loss: 2.052101 2025-01-16 01:33:34,087 - INFO - step 7116, loss: 3.030674, best loss: 2.052101 2025-01-16 01:33:34,237 - INFO - step 7117, loss: 2.736183, best loss: 2.052101 2025-01-16 01:33:37,698 - INFO - step 7118, loss: 2.033727, best loss: 2.033727 2025-01-16 01:33:37,860 - INFO - step 7119, loss: 2.877712, best loss: 2.033727 2025-01-16 01:33:38,012 - INFO - step 7120, loss: 2.965395, best loss: 2.033727 2025-01-16 01:33:38,162 - INFO - step 7121, loss: 2.954599, best loss: 2.033727 2025-01-16 01:33:38,312 - INFO - step 7122, loss: 3.012980, best loss: 2.033727 2025-01-16 01:33:38,462 - INFO - step 7123, loss: 2.789485, best loss: 2.033727 2025-01-16 01:33:38,612 - INFO - step 7124, loss: 2.945005, best loss: 2.033727 2025-01-16 01:33:38,763 - INFO - step 7125, loss: 2.931111, best loss: 2.033727 2025-01-16 01:33:38,913 - INFO - step 7126, loss: 2.843248, best loss: 2.033727 2025-01-16 01:33:39,063 - INFO - step 7127, loss: 2.856169, best loss: 2.033727 2025-01-16 01:33:39,213 - INFO - step 7128, loss: 2.774247, best loss: 2.033727 2025-01-16 01:33:39,363 - INFO - step 7129, loss: 2.615159, best loss: 2.033727 2025-01-16 01:33:39,513 - INFO - step 7130, loss: 2.934955, best loss: 2.033727 2025-01-16 01:33:39,663 - INFO - step 7131, loss: 2.682598, best loss: 2.033727 2025-01-16 01:33:39,813 - INFO - step 7132, loss: 2.821711, best loss: 2.033727 2025-01-16 01:33:39,964 - INFO - step 7133, loss: 3.050093, best loss: 2.033727 2025-01-16 01:33:40,114 - INFO - step 7134, loss: 2.732351, best 
loss: 2.033727 2025-01-16 01:33:40,264 - INFO - step 7135, loss: 2.517536, best loss: 2.033727 2025-01-16 01:33:40,414 - INFO - step 7136, loss: 2.816946, best loss: 2.033727 2025-01-16 01:33:40,565 - INFO - step 7137, loss: 2.860869, best loss: 2.033727 2025-01-16 01:33:40,715 - INFO - step 7138, loss: 2.974057, best loss: 2.033727 2025-01-16 01:33:40,864 - INFO - step 7139, loss: 2.946631, best loss: 2.033727 2025-01-16 01:33:41,014 - INFO - step 7140, loss: 2.911594, best loss: 2.033727 2025-01-16 01:33:41,164 - INFO - step 7141, loss: 2.770286, best loss: 2.033727 2025-01-16 01:33:41,315 - INFO - step 7142, loss: 3.019103, best loss: 2.033727 2025-01-16 01:33:41,465 - INFO - step 7143, loss: 2.717546, best loss: 2.033727 2025-01-16 01:33:41,615 - INFO - step 7144, loss: 2.825839, best loss: 2.033727 2025-01-16 01:33:41,765 - INFO - step 7145, loss: 3.014457, best loss: 2.033727 2025-01-16 01:33:41,915 - INFO - step 7146, loss: 3.071080, best loss: 2.033727 2025-01-16 01:33:42,065 - INFO - step 7147, loss: 2.981043, best loss: 2.033727 2025-01-16 01:33:42,216 - INFO - step 7148, loss: 2.772753, best loss: 2.033727 2025-01-16 01:33:42,366 - INFO - step 7149, loss: 2.819993, best loss: 2.033727 2025-01-16 01:33:42,516 - INFO - step 7150, loss: 2.970906, best loss: 2.033727 2025-01-16 01:33:42,666 - INFO - step 7151, loss: 3.094900, best loss: 2.033727 2025-01-16 01:33:42,816 - INFO - step 7152, loss: 2.984382, best loss: 2.033727 2025-01-16 01:33:42,966 - INFO - step 7153, loss: 3.183505, best loss: 2.033727 2025-01-16 01:33:43,116 - INFO - step 7154, loss: 3.171067, best loss: 2.033727 2025-01-16 01:33:43,266 - INFO - step 7155, loss: 2.951722, best loss: 2.033727 2025-01-16 01:33:43,416 - INFO - step 7156, loss: 3.229661, best loss: 2.033727 2025-01-16 01:33:43,566 - INFO - step 7157, loss: 2.956247, best loss: 2.033727 2025-01-16 01:33:43,717 - INFO - step 7158, loss: 2.541045, best loss: 2.033727 2025-01-16 01:33:43,867 - INFO - step 7159, loss: 3.087717, best 
loss: 2.033727 2025-01-16 01:33:44,017 - INFO - step 7160, loss: 2.980050, best loss: 2.033727 2025-01-16 01:33:44,167 - INFO - step 7161, loss: 2.941142, best loss: 2.033727 2025-01-16 01:33:44,317 - INFO - step 7162, loss: 2.730940, best loss: 2.033727 2025-01-16 01:33:44,467 - INFO - step 7163, loss: 2.882119, best loss: 2.033727 2025-01-16 01:33:44,617 - INFO - step 7164, loss: 2.965546, best loss: 2.033727 2025-01-16 01:33:44,767 - INFO - step 7165, loss: 2.794733, best loss: 2.033727 2025-01-16 01:33:44,917 - INFO - step 7166, loss: 2.993455, best loss: 2.033727 2025-01-16 01:33:45,067 - INFO - step 7167, loss: 2.674433, best loss: 2.033727 2025-01-16 01:33:45,217 - INFO - step 7168, loss: 2.573567, best loss: 2.033727 2025-01-16 01:33:45,367 - INFO - step 7169, loss: 3.009993, best loss: 2.033727 2025-01-16 01:33:45,517 - INFO - step 7170, loss: 3.021609, best loss: 2.033727 2025-01-16 01:33:45,668 - INFO - step 7171, loss: 3.089679, best loss: 2.033727 2025-01-16 01:33:45,818 - INFO - step 7172, loss: 2.912097, best loss: 2.033727 2025-01-16 01:33:45,968 - INFO - step 7173, loss: 3.052942, best loss: 2.033727 2025-01-16 01:33:46,118 - INFO - step 7174, loss: 3.091669, best loss: 2.033727 2025-01-16 01:33:46,268 - INFO - step 7175, loss: 2.752919, best loss: 2.033727 2025-01-16 01:33:46,418 - INFO - step 7176, loss: 2.746968, best loss: 2.033727 2025-01-16 01:33:46,568 - INFO - step 7177, loss: 3.025912, best loss: 2.033727 2025-01-16 01:33:46,719 - INFO - step 7178, loss: 2.656700, best loss: 2.033727 2025-01-16 01:33:46,869 - INFO - step 7179, loss: 2.515213, best loss: 2.033727 2025-01-16 01:33:47,019 - INFO - step 7180, loss: 2.983779, best loss: 2.033727 2025-01-16 01:33:47,169 - INFO - step 7181, loss: 2.979199, best loss: 2.033727 2025-01-16 01:33:47,319 - INFO - step 7182, loss: 2.943484, best loss: 2.033727 2025-01-16 01:33:47,469 - INFO - step 7183, loss: 2.448903, best loss: 2.033727 2025-01-16 01:33:47,619 - INFO - step 7184, loss: 2.370102, best 
loss: 2.033727 2025-01-16 01:33:47,769 - INFO - step 7185, loss: 2.386353, best loss: 2.033727 2025-01-16 01:33:47,920 - INFO - step 7186, loss: 2.529530, best loss: 2.033727 2025-01-16 01:33:48,070 - INFO - step 7187, loss: 2.731116, best loss: 2.033727 2025-01-16 01:33:48,220 - INFO - step 7188, loss: 2.952004, best loss: 2.033727 2025-01-16 01:33:48,370 - INFO - step 7189, loss: 2.814260, best loss: 2.033727 2025-01-16 01:33:48,520 - INFO - step 7190, loss: 2.738236, best loss: 2.033727 2025-01-16 01:33:48,670 - INFO - step 7191, loss: 2.846911, best loss: 2.033727 2025-01-16 01:33:48,820 - INFO - step 7192, loss: 2.841403, best loss: 2.033727 2025-01-16 01:33:48,971 - INFO - step 7193, loss: 2.693654, best loss: 2.033727 2025-01-16 01:33:49,121 - INFO - step 7194, loss: 2.920250, best loss: 2.033727 2025-01-16 01:33:49,271 - INFO - step 7195, loss: 2.933176, best loss: 2.033727 2025-01-16 01:33:49,421 - INFO - step 7196, loss: 2.614598, best loss: 2.033727 2025-01-16 01:33:49,572 - INFO - step 7197, loss: 2.553761, best loss: 2.033727 2025-01-16 01:33:49,722 - INFO - step 7198, loss: 2.687995, best loss: 2.033727 2025-01-16 01:33:49,872 - INFO - step 7199, loss: 2.853900, best loss: 2.033727 2025-01-16 01:33:50,022 - INFO - step 7200, loss: 2.535361, best loss: 2.033727 2025-01-16 01:33:50,173 - INFO - step 7201, loss: 2.553023, best loss: 2.033727 2025-01-16 01:33:50,323 - INFO - step 7202, loss: 2.813756, best loss: 2.033727 2025-01-16 01:33:50,473 - INFO - step 7203, loss: 2.527863, best loss: 2.033727 2025-01-16 01:33:50,623 - INFO - step 7204, loss: 2.606261, best loss: 2.033727 2025-01-16 01:33:50,773 - INFO - step 7205, loss: 3.002511, best loss: 2.033727 2025-01-16 01:33:50,923 - INFO - step 7206, loss: 2.708503, best loss: 2.033727 2025-01-16 01:33:51,073 - INFO - step 7207, loss: 2.738652, best loss: 2.033727 2025-01-16 01:33:51,223 - INFO - step 7208, loss: 2.572176, best loss: 2.033727 2025-01-16 01:33:51,373 - INFO - step 7209, loss: 2.883245, best 
loss: 2.033727 2025-01-16 01:33:51,524 - INFO - step 7210, loss: 2.528983, best loss: 2.033727 2025-01-16 01:33:51,674 - INFO - step 7211, loss: 2.597700, best loss: 2.033727 2025-01-16 01:33:51,824 - INFO - step 7212, loss: 2.501391, best loss: 2.033727 2025-01-16 01:33:51,975 - INFO - step 7213, loss: 2.776108, best loss: 2.033727 2025-01-16 01:33:52,125 - INFO - step 7214, loss: 3.097431, best loss: 2.033727 2025-01-16 01:33:52,275 - INFO - step 7215, loss: 3.089139, best loss: 2.033727 2025-01-16 01:33:52,425 - INFO - step 7216, loss: 2.915382, best loss: 2.033727 2025-01-16 01:33:52,575 - INFO - step 7217, loss: 2.949706, best loss: 2.033727 2025-01-16 01:33:52,725 - INFO - step 7218, loss: 2.870972, best loss: 2.033727 2025-01-16 01:33:52,875 - INFO - step 7219, loss: 2.866740, best loss: 2.033727 2025-01-16 01:33:53,025 - INFO - step 7220, loss: 2.647288, best loss: 2.033727 2025-01-16 01:33:53,175 - INFO - step 7221, loss: 2.891357, best loss: 2.033727 2025-01-16 01:33:53,325 - INFO - step 7222, loss: 2.632622, best loss: 2.033727 2025-01-16 01:33:53,475 - INFO - step 7223, loss: 2.485600, best loss: 2.033727 2025-01-16 01:33:53,626 - INFO - step 7224, loss: 2.732825, best loss: 2.033727 2025-01-16 01:33:53,775 - INFO - step 7225, loss: 2.711062, best loss: 2.033727 2025-01-16 01:33:53,925 - INFO - step 7226, loss: 2.848870, best loss: 2.033727 2025-01-16 01:33:54,075 - INFO - step 7227, loss: 2.175658, best loss: 2.033727 2025-01-16 01:33:54,226 - INFO - step 7228, loss: 2.703465, best loss: 2.033727 2025-01-16 01:33:54,376 - INFO - step 7229, loss: 2.826480, best loss: 2.033727 2025-01-16 01:33:54,526 - INFO - step 7230, loss: 2.684926, best loss: 2.033727 2025-01-16 01:33:54,676 - INFO - step 7231, loss: 2.721462, best loss: 2.033727 2025-01-16 01:33:54,826 - INFO - step 7232, loss: 2.640527, best loss: 2.033727 2025-01-16 01:33:54,976 - INFO - step 7233, loss: 2.685178, best loss: 2.033727 2025-01-16 01:33:55,126 - INFO - step 7234, loss: 2.615658, best 
loss: 2.033727 2025-01-16 01:33:55,277 - INFO - step 7235, loss: 2.719293, best loss: 2.033727 2025-01-16 01:33:55,427 - INFO - step 7236, loss: 2.629967, best loss: 2.033727 2025-01-16 01:33:55,577 - INFO - step 7237, loss: 2.753665, best loss: 2.033727 2025-01-16 01:33:55,727 - INFO - step 7238, loss: 2.586343, best loss: 2.033727 2025-01-16 01:33:55,878 - INFO - step 7239, loss: 2.611578, best loss: 2.033727 2025-01-16 01:33:56,028 - INFO - step 7240, loss: 2.490184, best loss: 2.033727 2025-01-16 01:33:56,178 - INFO - step 7241, loss: 2.474433, best loss: 2.033727 2025-01-16 01:33:56,328 - INFO - step 7242, loss: 2.641667, best loss: 2.033727 2025-01-16 01:33:56,478 - INFO - step 7243, loss: 2.554459, best loss: 2.033727 2025-01-16 01:33:56,629 - INFO - step 7244, loss: 2.578489, best loss: 2.033727 2025-01-16 01:33:56,779 - INFO - step 7245, loss: 2.301298, best loss: 2.033727 2025-01-16 01:33:56,929 - INFO - step 7246, loss: 2.314475, best loss: 2.033727 2025-01-16 01:33:57,079 - INFO - step 7247, loss: 2.154056, best loss: 2.033727 2025-01-16 01:33:57,229 - INFO - step 7248, loss: 2.718773, best loss: 2.033727 2025-01-16 01:33:57,380 - INFO - step 7249, loss: 2.830343, best loss: 2.033727 2025-01-16 01:33:57,530 - INFO - step 7250, loss: 2.912963, best loss: 2.033727 2025-01-16 01:33:57,680 - INFO - step 7251, loss: 3.068781, best loss: 2.033727 2025-01-16 01:33:57,830 - INFO - step 7252, loss: 2.928747, best loss: 2.033727 2025-01-16 01:33:57,980 - INFO - step 7253, loss: 2.667058, best loss: 2.033727 2025-01-16 01:33:58,131 - INFO - step 7254, loss: 2.674073, best loss: 2.033727 2025-01-16 01:33:58,281 - INFO - step 7255, loss: 2.992385, best loss: 2.033727 2025-01-16 01:33:58,431 - INFO - step 7256, loss: 2.677425, best loss: 2.033727 2025-01-16 01:33:58,581 - INFO - step 7257, loss: 2.210524, best loss: 2.033727 2025-01-16 01:33:58,731 - INFO - step 7258, loss: 2.532357, best loss: 2.033727 2025-01-16 01:33:58,881 - INFO - step 7259, loss: 2.461866, best 
loss: 2.033727 2025-01-16 01:33:59,031 - INFO - step 7260, loss: 2.886945, best loss: 2.033727 2025-01-16 01:33:59,181 - INFO - step 7261, loss: 2.802933, best loss: 2.033727 2025-01-16 01:33:59,332 - INFO - step 7262, loss: 2.816433, best loss: 2.033727 2025-01-16 01:33:59,482 - INFO - step 7263, loss: 2.821860, best loss: 2.033727 2025-01-16 01:33:59,632 - INFO - step 7264, loss: 2.837140, best loss: 2.033727 2025-01-16 01:33:59,783 - INFO - step 7265, loss: 2.532315, best loss: 2.033727 2025-01-16 01:33:59,933 - INFO - step 7266, loss: 2.928335, best loss: 2.033727 2025-01-16 01:34:00,084 - INFO - step 7267, loss: 2.810245, best loss: 2.033727 2025-01-16 01:34:00,234 - INFO - step 7268, loss: 3.005352, best loss: 2.033727 2025-01-16 01:34:00,384 - INFO - step 7269, loss: 2.924093, best loss: 2.033727 2025-01-16 01:34:00,534 - INFO - step 7270, loss: 2.685658, best loss: 2.033727 2025-01-16 01:34:00,684 - INFO - step 7271, loss: 2.918994, best loss: 2.033727 2025-01-16 01:34:00,834 - INFO - step 7272, loss: 2.562928, best loss: 2.033727 2025-01-16 01:34:00,984 - INFO - step 7273, loss: 2.900943, best loss: 2.033727 2025-01-16 01:34:01,134 - INFO - step 7274, loss: 2.950396, best loss: 2.033727 2025-01-16 01:34:01,285 - INFO - step 7275, loss: 2.884653, best loss: 2.033727 2025-01-16 01:34:01,435 - INFO - step 7276, loss: 2.881727, best loss: 2.033727 2025-01-16 01:34:01,585 - INFO - step 7277, loss: 2.722068, best loss: 2.033727 2025-01-16 01:34:01,736 - INFO - step 7278, loss: 2.652778, best loss: 2.033727 2025-01-16 01:34:01,886 - INFO - step 7279, loss: 2.709710, best loss: 2.033727 2025-01-16 01:34:02,036 - INFO - step 7280, loss: 2.448831, best loss: 2.033727 2025-01-16 01:34:02,186 - INFO - step 7281, loss: 3.060390, best loss: 2.033727 2025-01-16 01:34:02,337 - INFO - step 7282, loss: 2.320225, best loss: 2.033727 2025-01-16 01:34:02,487 - INFO - step 7283, loss: 2.404743, best loss: 2.033727 2025-01-16 01:34:02,637 - INFO - step 7284, loss: 2.719115, best 
loss: 2.033727 2025-01-16 01:34:02,787 - INFO - step 7285, loss: 2.817273, best loss: 2.033727 2025-01-16 01:34:02,937 - INFO - step 7286, loss: 2.686678, best loss: 2.033727 2025-01-16 01:34:03,087 - INFO - step 7287, loss: 2.500586, best loss: 2.033727 2025-01-16 01:34:03,238 - INFO - step 7288, loss: 2.679414, best loss: 2.033727 2025-01-16 01:34:03,388 - INFO - step 7289, loss: 2.832627, best loss: 2.033727 2025-01-16 01:34:03,538 - INFO - step 7290, loss: 2.543379, best loss: 2.033727 2025-01-16 01:34:03,688 - INFO - step 7291, loss: 2.442665, best loss: 2.033727 2025-01-16 01:34:03,839 - INFO - step 7292, loss: 2.708768, best loss: 2.033727 2025-01-16 01:34:03,989 - INFO - step 7293, loss: 2.679065, best loss: 2.033727 2025-01-16 01:34:04,139 - INFO - step 7294, loss: 2.556847, best loss: 2.033727 2025-01-16 01:34:04,289 - INFO - step 7295, loss: 2.410059, best loss: 2.033727 2025-01-16 01:34:04,439 - INFO - step 7296, loss: 2.710726, best loss: 2.033727 2025-01-16 01:34:04,590 - INFO - step 7297, loss: 2.872201, best loss: 2.033727 2025-01-16 01:34:04,740 - INFO - step 7298, loss: 2.635202, best loss: 2.033727 2025-01-16 01:34:04,890 - INFO - step 7299, loss: 2.689896, best loss: 2.033727 2025-01-16 01:34:05,040 - INFO - step 7300, loss: 2.908377, best loss: 2.033727 2025-01-16 01:34:05,190 - INFO - step 7301, loss: 2.913659, best loss: 2.033727 2025-01-16 01:34:05,340 - INFO - step 7302, loss: 2.820601, best loss: 2.033727 2025-01-16 01:34:05,490 - INFO - step 7303, loss: 2.704348, best loss: 2.033727 2025-01-16 01:34:05,640 - INFO - step 7304, loss: 2.714977, best loss: 2.033727 2025-01-16 01:34:05,790 - INFO - step 7305, loss: 2.540689, best loss: 2.033727 2025-01-16 01:34:05,941 - INFO - step 7306, loss: 3.123211, best loss: 2.033727 2025-01-16 01:34:06,091 - INFO - step 7307, loss: 2.733054, best loss: 2.033727 2025-01-16 01:34:06,241 - INFO - step 7308, loss: 2.897626, best loss: 2.033727 2025-01-16 01:34:06,391 - INFO - step 7309, loss: 2.336557, best 
loss: 2.033727 2025-01-16 01:34:06,541 - INFO - step 7310, loss: 2.462709, best loss: 2.033727 2025-01-16 01:34:06,692 - INFO - step 7311, loss: 2.730142, best loss: 2.033727 2025-01-16 01:34:06,842 - INFO - step 7312, loss: 2.737866, best loss: 2.033727 2025-01-16 01:34:06,992 - INFO - step 7313, loss: 2.670683, best loss: 2.033727 2025-01-16 01:34:07,142 - INFO - step 7314, loss: 2.765257, best loss: 2.033727 2025-01-16 01:34:07,292 - INFO - step 7315, loss: 2.616091, best loss: 2.033727 2025-01-16 01:34:07,442 - INFO - step 7316, loss: 2.679145, best loss: 2.033727 2025-01-16 01:34:07,592 - INFO - step 7317, loss: 2.739010, best loss: 2.033727 2025-01-16 01:34:07,743 - INFO - step 7318, loss: 2.359260, best loss: 2.033727 2025-01-16 01:34:07,893 - INFO - step 7319, loss: 2.675884, best loss: 2.033727 2025-01-16 01:34:08,043 - INFO - step 7320, loss: 2.785934, best loss: 2.033727 2025-01-16 01:34:08,193 - INFO - step 7321, loss: 2.911000, best loss: 2.033727 2025-01-16 01:34:08,343 - INFO - step 7322, loss: 2.789826, best loss: 2.033727 2025-01-16 01:34:08,493 - INFO - step 7323, loss: 2.861288, best loss: 2.033727 2025-01-16 01:34:08,644 - INFO - step 7324, loss: 2.800772, best loss: 2.033727 2025-01-16 01:34:08,794 - INFO - step 7325, loss: 2.429260, best loss: 2.033727 2025-01-16 01:34:08,944 - INFO - step 7326, loss: 2.801960, best loss: 2.033727 2025-01-16 01:34:09,094 - INFO - step 7327, loss: 2.331252, best loss: 2.033727 2025-01-16 01:34:09,244 - INFO - step 7328, loss: 2.558424, best loss: 2.033727 2025-01-16 01:34:09,395 - INFO - step 7329, loss: 2.697305, best loss: 2.033727 2025-01-16 01:34:09,545 - INFO - step 7330, loss: 2.597229, best loss: 2.033727 2025-01-16 01:34:09,695 - INFO - step 7331, loss: 2.612059, best loss: 2.033727 2025-01-16 01:34:09,845 - INFO - step 7332, loss: 2.733468, best loss: 2.033727 2025-01-16 01:34:09,995 - INFO - step 7333, loss: 2.907387, best loss: 2.033727 2025-01-16 01:34:10,145 - INFO - step 7334, loss: 2.777541, best 
loss: 2.033727 2025-01-16 01:34:10,295 - INFO - step 7335, loss: 2.900530, best loss: 2.033727 2025-01-16 01:34:10,445 - INFO - step 7336, loss: 2.683697, best loss: 2.033727 2025-01-16 01:34:10,595 - INFO - step 7337, loss: 2.811374, best loss: 2.033727 2025-01-16 01:34:10,745 - INFO - step 7338, loss: 2.665922, best loss: 2.033727 2025-01-16 01:34:10,896 - INFO - step 7339, loss: 2.363001, best loss: 2.033727 2025-01-16 01:34:11,046 - INFO - step 7340, loss: 2.948869, best loss: 2.033727 2025-01-16 01:34:11,196 - INFO - step 7341, loss: 2.755904, best loss: 2.033727 2025-01-16 01:34:11,346 - INFO - step 7342, loss: 2.613798, best loss: 2.033727 2025-01-16 01:34:11,496 - INFO - step 7343, loss: 2.606841, best loss: 2.033727 2025-01-16 01:34:11,646 - INFO - step 7344, loss: 2.454522, best loss: 2.033727 2025-01-16 01:34:11,797 - INFO - step 7345, loss: 2.568757, best loss: 2.033727 2025-01-16 01:34:11,947 - INFO - step 7346, loss: 2.456745, best loss: 2.033727 2025-01-16 01:34:12,097 - INFO - step 7347, loss: 2.507704, best loss: 2.033727 2025-01-16 01:34:12,247 - INFO - step 7348, loss: 2.822100, best loss: 2.033727 2025-01-16 01:34:12,398 - INFO - step 7349, loss: 2.793772, best loss: 2.033727 2025-01-16 01:34:12,548 - INFO - step 7350, loss: 2.577120, best loss: 2.033727 2025-01-16 01:34:12,698 - INFO - step 7351, loss: 2.910795, best loss: 2.033727 2025-01-16 01:34:12,849 - INFO - step 7352, loss: 2.577216, best loss: 2.033727 2025-01-16 01:34:12,999 - INFO - step 7353, loss: 2.865314, best loss: 2.033727 2025-01-16 01:34:13,149 - INFO - step 7354, loss: 2.966263, best loss: 2.033727 2025-01-16 01:34:13,299 - INFO - step 7355, loss: 3.055897, best loss: 2.033727 2025-01-16 01:34:13,450 - INFO - step 7356, loss: 3.119863, best loss: 2.033727 2025-01-16 01:34:13,600 - INFO - step 7357, loss: 2.830338, best loss: 2.033727 2025-01-16 01:34:13,750 - INFO - step 7358, loss: 2.936255, best loss: 2.033727 2025-01-16 01:34:13,901 - INFO - step 7359, loss: 2.865279, best 
loss: 2.033727 2025-01-16 01:34:14,051 - INFO - step 7360, loss: 2.763397, best loss: 2.033727 2025-01-16 01:34:14,201 - INFO - step 7361, loss: 3.080552, best loss: 2.033727 2025-01-16 01:34:14,352 - INFO - step 7362, loss: 3.018871, best loss: 2.033727 2025-01-16 01:34:14,502 - INFO - step 7363, loss: 3.117981, best loss: 2.033727 2025-01-16 01:34:14,653 - INFO - step 7364, loss: 2.762828, best loss: 2.033727 2025-01-16 01:34:14,803 - INFO - step 7365, loss: 2.880465, best loss: 2.033727 2025-01-16 01:34:14,953 - INFO - step 7366, loss: 2.872044, best loss: 2.033727 2025-01-16 01:34:15,104 - INFO - step 7367, loss: 2.705678, best loss: 2.033727 2025-01-16 01:34:15,254 - INFO - step 7368, loss: 2.709018, best loss: 2.033727 2025-01-16 01:34:15,404 - INFO - step 7369, loss: 3.011939, best loss: 2.033727 2025-01-16 01:34:15,555 - INFO - step 7370, loss: 2.945678, best loss: 2.033727 2025-01-16 01:34:15,705 - INFO - step 7371, loss: 3.060720, best loss: 2.033727 2025-01-16 01:34:15,855 - INFO - step 7372, loss: 2.862428, best loss: 2.033727 2025-01-16 01:34:16,005 - INFO - step 7373, loss: 2.667081, best loss: 2.033727 2025-01-16 01:34:16,155 - INFO - step 7374, loss: 3.015978, best loss: 2.033727 2025-01-16 01:34:16,305 - INFO - step 7375, loss: 2.772717, best loss: 2.033727 2025-01-16 01:34:16,455 - INFO - step 7376, loss: 2.942079, best loss: 2.033727 2025-01-16 01:34:16,605 - INFO - step 7377, loss: 2.710734, best loss: 2.033727 2025-01-16 01:34:16,755 - INFO - step 7378, loss: 2.764030, best loss: 2.033727 2025-01-16 01:34:16,905 - INFO - step 7379, loss: 2.800733, best loss: 2.033727 2025-01-16 01:34:17,055 - INFO - step 7380, loss: 2.876590, best loss: 2.033727 2025-01-16 01:34:17,205 - INFO - step 7381, loss: 2.742961, best loss: 2.033727 2025-01-16 01:34:17,355 - INFO - step 7382, loss: 2.859589, best loss: 2.033727 2025-01-16 01:34:17,505 - INFO - step 7383, loss: 2.555105, best loss: 2.033727 2025-01-16 01:34:17,655 - INFO - step 7384, loss: 2.399878, best 
loss: 2.033727 2025-01-16 01:34:17,805 - INFO - step 7385, loss: 2.540859, best loss: 2.033727 2025-01-16 01:34:17,955 - INFO - step 7386, loss: 2.727781, best loss: 2.033727 2025-01-16 01:34:18,105 - INFO - step 7387, loss: 3.046852, best loss: 2.033727 2025-01-16 01:34:18,256 - INFO - step 7388, loss: 2.700637, best loss: 2.033727 2025-01-16 01:34:18,406 - INFO - step 7389, loss: 2.309261, best loss: 2.033727 2025-01-16 01:34:18,556 - INFO - step 7390, loss: 2.815022, best loss: 2.033727 2025-01-16 01:34:18,706 - INFO - step 7391, loss: 2.680217, best loss: 2.033727 2025-01-16 01:34:18,856 - INFO - step 7392, loss: 3.156264, best loss: 2.033727 2025-01-16 01:34:19,006 - INFO - step 7393, loss: 2.571795, best loss: 2.033727 2025-01-16 01:34:19,156 - INFO - step 7394, loss: 2.844039, best loss: 2.033727 2025-01-16 01:34:19,306 - INFO - step 7395, loss: 2.656102, best loss: 2.033727 2025-01-16 01:34:19,456 - INFO - step 7396, loss: 2.810892, best loss: 2.033727 2025-01-16 01:34:19,607 - INFO - step 7397, loss: 2.699913, best loss: 2.033727 2025-01-16 01:34:19,757 - INFO - step 7398, loss: 2.656823, best loss: 2.033727 2025-01-16 01:34:19,907 - INFO - step 7399, loss: 2.859013, best loss: 2.033727 2025-01-16 01:34:20,058 - INFO - step 7400, loss: 2.782180, best loss: 2.033727 2025-01-16 01:34:20,207 - INFO - step 7401, loss: 2.623428, best loss: 2.033727 2025-01-16 01:34:20,357 - INFO - step 7402, loss: 3.040003, best loss: 2.033727 2025-01-16 01:34:20,507 - INFO - step 7403, loss: 2.701217, best loss: 2.033727 2025-01-16 01:34:20,658 - INFO - step 7404, loss: 2.312178, best loss: 2.033727 2025-01-16 01:34:20,807 - INFO - step 7405, loss: 2.760334, best loss: 2.033727 2025-01-16 01:34:20,957 - INFO - step 7406, loss: 2.957262, best loss: 2.033727 2025-01-16 01:34:21,107 - INFO - step 7407, loss: 2.837179, best loss: 2.033727 2025-01-16 01:34:21,257 - INFO - step 7408, loss: 2.572496, best loss: 2.033727 2025-01-16 01:34:21,407 - INFO - step 7409, loss: 2.592297, best 
loss: 2.033727 2025-01-16 01:34:21,558 - INFO - step 7410, loss: 3.004106, best loss: 2.033727 2025-01-16 01:34:21,708 - INFO - step 7411, loss: 2.934913, best loss: 2.033727 2025-01-16 01:34:21,858 - INFO - step 7412, loss: 2.887516, best loss: 2.033727 2025-01-16 01:34:22,008 - INFO - step 7413, loss: 2.560725, best loss: 2.033727 2025-01-16 01:34:22,158 - INFO - step 7414, loss: 2.850197, best loss: 2.033727 2025-01-16 01:34:22,308 - INFO - step 7415, loss: 2.769067, best loss: 2.033727 2025-01-16 01:34:22,458 - INFO - step 7416, loss: 2.521369, best loss: 2.033727 2025-01-16 01:34:22,608 - INFO - step 7417, loss: 2.709756, best loss: 2.033727 2025-01-16 01:34:22,758 - INFO - step 7418, loss: 2.619927, best loss: 2.033727 2025-01-16 01:34:22,908 - INFO - step 7419, loss: 2.812881, best loss: 2.033727 2025-01-16 01:34:23,058 - INFO - step 7420, loss: 2.606813, best loss: 2.033727 2025-01-16 01:34:23,208 - INFO - step 7421, loss: 2.707401, best loss: 2.033727 2025-01-16 01:34:23,358 - INFO - step 7422, loss: 2.455220, best loss: 2.033727 2025-01-16 01:34:23,508 - INFO - step 7423, loss: 2.422466, best loss: 2.033727 2025-01-16 01:34:23,658 - INFO - step 7424, loss: 2.881041, best loss: 2.033727 2025-01-16 01:34:23,809 - INFO - step 7425, loss: 2.738168, best loss: 2.033727 2025-01-16 01:34:23,959 - INFO - step 7426, loss: 2.983461, best loss: 2.033727 2025-01-16 01:34:24,109 - INFO - step 7427, loss: 2.632351, best loss: 2.033727 2025-01-16 01:34:24,259 - INFO - step 7428, loss: 2.761097, best loss: 2.033727 2025-01-16 01:34:24,409 - INFO - step 7429, loss: 2.895226, best loss: 2.033727 2025-01-16 01:34:24,560 - INFO - step 7430, loss: 2.532209, best loss: 2.033727 2025-01-16 01:34:24,709 - INFO - step 7431, loss: 2.441741, best loss: 2.033727 2025-01-16 01:34:24,860 - INFO - step 7432, loss: 2.561523, best loss: 2.033727 2025-01-16 01:34:25,010 - INFO - step 7433, loss: 2.700254, best loss: 2.033727 2025-01-16 01:34:25,159 - INFO - step 7434, loss: 2.784871, best 
loss: 2.033727 2025-01-16 01:34:25,309 - INFO - step 7435, loss: 2.891107, best loss: 2.033727 2025-01-16 01:34:25,459 - INFO - step 7436, loss: 3.086633, best loss: 2.033727 2025-01-16 01:34:25,609 - INFO - step 7437, loss: 2.981877, best loss: 2.033727 2025-01-16 01:34:25,760 - INFO - step 7438, loss: 2.951422, best loss: 2.033727 2025-01-16 01:34:25,910 - INFO - step 7439, loss: 2.950552, best loss: 2.033727 2025-01-16 01:34:26,061 - INFO - step 7440, loss: 2.965005, best loss: 2.033727 2025-01-16 01:34:26,211 - INFO - step 7441, loss: 2.537006, best loss: 2.033727 2025-01-16 01:34:26,362 - INFO - step 7442, loss: 2.958344, best loss: 2.033727 2025-01-16 01:34:26,513 - INFO - step 7443, loss: 3.019406, best loss: 2.033727 2025-01-16 01:34:26,663 - INFO - step 7444, loss: 2.867918, best loss: 2.033727 2025-01-16 01:34:26,813 - INFO - step 7445, loss: 2.832060, best loss: 2.033727 2025-01-16 01:34:26,963 - INFO - step 7446, loss: 2.964201, best loss: 2.033727 2025-01-16 01:34:27,113 - INFO - step 7447, loss: 2.643156, best loss: 2.033727 2025-01-16 01:34:30,593 - INFO - step 7448, loss: 1.954166, best loss: 1.954166 2025-01-16 01:34:30,755 - INFO - step 7449, loss: 2.781860, best loss: 1.954166 2025-01-16 01:34:30,909 - INFO - step 7450, loss: 2.841021, best loss: 1.954166 2025-01-16 01:34:31,060 - INFO - step 7451, loss: 2.904269, best loss: 1.954166 2025-01-16 01:34:31,210 - INFO - step 7452, loss: 2.925325, best loss: 1.954166 2025-01-16 01:34:31,360 - INFO - step 7453, loss: 2.737036, best loss: 1.954166 2025-01-16 01:34:31,510 - INFO - step 7454, loss: 2.810555, best loss: 1.954166 2025-01-16 01:34:31,660 - INFO - step 7455, loss: 2.860004, best loss: 1.954166 2025-01-16 01:34:31,810 - INFO - step 7456, loss: 2.784523, best loss: 1.954166 2025-01-16 01:34:31,961 - INFO - step 7457, loss: 2.766240, best loss: 1.954166 2025-01-16 01:34:32,111 - INFO - step 7458, loss: 2.688286, best loss: 1.954166 2025-01-16 01:34:32,260 - INFO - step 7459, loss: 2.546919, best 
loss: 1.954166 2025-01-16 01:34:32,411 - INFO - step 7460, loss: 2.818989, best loss: 1.954166 2025-01-16 01:34:32,561 - INFO - step 7461, loss: 2.609352, best loss: 1.954166 2025-01-16 01:34:32,711 - INFO - step 7462, loss: 2.829521, best loss: 1.954166 2025-01-16 01:34:32,861 - INFO - step 7463, loss: 2.997765, best loss: 1.954166 2025-01-16 01:34:33,011 - INFO - step 7464, loss: 2.719675, best loss: 1.954166 2025-01-16 01:34:33,161 - INFO - step 7465, loss: 2.463596, best loss: 1.954166 2025-01-16 01:34:33,311 - INFO - step 7466, loss: 2.844035, best loss: 1.954166 2025-01-16 01:34:33,462 - INFO - step 7467, loss: 2.777606, best loss: 1.954166 2025-01-16 01:34:33,612 - INFO - step 7468, loss: 2.854866, best loss: 1.954166 2025-01-16 01:34:33,762 - INFO - step 7469, loss: 2.796035, best loss: 1.954166 2025-01-16 01:34:33,912 - INFO - step 7470, loss: 2.788931, best loss: 1.954166 2025-01-16 01:34:34,062 - INFO - step 7471, loss: 2.719798, best loss: 1.954166 2025-01-16 01:34:34,212 - INFO - step 7472, loss: 2.932708, best loss: 1.954166 2025-01-16 01:34:34,362 - INFO - step 7473, loss: 2.665633, best loss: 1.954166 2025-01-16 01:34:34,512 - INFO - step 7474, loss: 2.777391, best loss: 1.954166 2025-01-16 01:34:34,662 - INFO - step 7475, loss: 2.916875, best loss: 1.954166 2025-01-16 01:34:34,812 - INFO - step 7476, loss: 2.953931, best loss: 1.954166 2025-01-16 01:34:34,963 - INFO - step 7477, loss: 2.853554, best loss: 1.954166 2025-01-16 01:34:35,113 - INFO - step 7478, loss: 2.642261, best loss: 1.954166 2025-01-16 01:34:35,263 - INFO - step 7479, loss: 2.683012, best loss: 1.954166 2025-01-16 01:34:35,413 - INFO - step 7480, loss: 2.903641, best loss: 1.954166 2025-01-16 01:34:35,564 - INFO - step 7481, loss: 3.009087, best loss: 1.954166 2025-01-16 01:34:35,714 - INFO - step 7482, loss: 2.874635, best loss: 1.954166 2025-01-16 01:34:35,864 - INFO - step 7483, loss: 3.124693, best loss: 1.954166 2025-01-16 01:34:36,014 - INFO - step 7484, loss: 3.048124, best 
loss: 1.954166 2025-01-16 01:34:36,164 - INFO - step 7485, loss: 2.872926, best loss: 1.954166 2025-01-16 01:34:36,314 - INFO - step 7486, loss: 3.146805, best loss: 1.954166 2025-01-16 01:34:36,464 - INFO - step 7487, loss: 2.866528, best loss: 1.954166 2025-01-16 01:34:36,615 - INFO - step 7488, loss: 2.520845, best loss: 1.954166 2025-01-16 01:34:36,765 - INFO - step 7489, loss: 2.983069, best loss: 1.954166 2025-01-16 01:34:36,915 - INFO - step 7490, loss: 2.913529, best loss: 1.954166 2025-01-16 01:34:37,065 - INFO - step 7491, loss: 2.863761, best loss: 1.954166 2025-01-16 01:34:37,215 - INFO - step 7492, loss: 2.640999, best loss: 1.954166 2025-01-16 01:34:37,366 - INFO - step 7493, loss: 2.867768, best loss: 1.954166 2025-01-16 01:34:37,516 - INFO - step 7494, loss: 2.872696, best loss: 1.954166 2025-01-16 01:34:37,665 - INFO - step 7495, loss: 2.738056, best loss: 1.954166 2025-01-16 01:34:37,816 - INFO - step 7496, loss: 2.949614, best loss: 1.954166 2025-01-16 01:34:37,966 - INFO - step 7497, loss: 2.613424, best loss: 1.954166 2025-01-16 01:34:38,116 - INFO - step 7498, loss: 2.543465, best loss: 1.954166 2025-01-16 01:34:38,267 - INFO - step 7499, loss: 2.942314, best loss: 1.954166 2025-01-16 01:34:38,417 - INFO - step 7500, loss: 2.968379, best loss: 1.954166 2025-01-16 01:34:38,567 - INFO - step 7501, loss: 3.001382, best loss: 1.954166 2025-01-16 01:34:38,717 - INFO - step 7502, loss: 2.784924, best loss: 1.954166 2025-01-16 01:34:38,867 - INFO - step 7503, loss: 2.902479, best loss: 1.954166 2025-01-16 01:34:39,017 - INFO - step 7504, loss: 2.940704, best loss: 1.954166 2025-01-16 01:34:39,167 - INFO - step 7505, loss: 2.657002, best loss: 1.954166 2025-01-16 01:34:39,318 - INFO - step 7506, loss: 2.653444, best loss: 1.954166 2025-01-16 01:34:39,468 - INFO - step 7507, loss: 2.902269, best loss: 1.954166 2025-01-16 01:34:39,618 - INFO - step 7508, loss: 2.556682, best loss: 1.954166 2025-01-16 01:34:39,768 - INFO - step 7509, loss: 2.347296, best 
loss: 1.954166 2025-01-16 01:34:39,919 - INFO - step 7510, loss: 2.817537, best loss: 1.954166 2025-01-16 01:34:40,069 - INFO - step 7511, loss: 2.806682, best loss: 1.954166 2025-01-16 01:34:40,218 - INFO - step 7512, loss: 2.829865, best loss: 1.954166 2025-01-16 01:34:40,368 - INFO - step 7513, loss: 2.364112, best loss: 1.954166 2025-01-16 01:34:40,518 - INFO - step 7514, loss: 2.233521, best loss: 1.954166 2025-01-16 01:34:40,668 - INFO - step 7515, loss: 2.219932, best loss: 1.954166 2025-01-16 01:34:40,819 - INFO - step 7516, loss: 2.480864, best loss: 1.954166 2025-01-16 01:34:40,969 - INFO - step 7517, loss: 2.666696, best loss: 1.954166 2025-01-16 01:34:41,119 - INFO - step 7518, loss: 2.859526, best loss: 1.954166 2025-01-16 01:34:41,269 - INFO - step 7519, loss: 2.776256, best loss: 1.954166 2025-01-16 01:34:41,419 - INFO - step 7520, loss: 2.613869, best loss: 1.954166 2025-01-16 01:34:41,569 - INFO - step 7521, loss: 2.722065, best loss: 1.954166 2025-01-16 01:34:41,719 - INFO - step 7522, loss: 2.737360, best loss: 1.954166 2025-01-16 01:34:41,869 - INFO - step 7523, loss: 2.580914, best loss: 1.954166 2025-01-16 01:34:42,020 - INFO - step 7524, loss: 2.779658, best loss: 1.954166 2025-01-16 01:34:42,170 - INFO - step 7525, loss: 2.798320, best loss: 1.954166 2025-01-16 01:34:42,320 - INFO - step 7526, loss: 2.497144, best loss: 1.954166 2025-01-16 01:34:42,470 - INFO - step 7527, loss: 2.453834, best loss: 1.954166 2025-01-16 01:34:42,620 - INFO - step 7528, loss: 2.671108, best loss: 1.954166 2025-01-16 01:34:42,770 - INFO - step 7529, loss: 2.749627, best loss: 1.954166 2025-01-16 01:34:42,921 - INFO - step 7530, loss: 2.429663, best loss: 1.954166 2025-01-16 01:34:43,071 - INFO - step 7531, loss: 2.496148, best loss: 1.954166 2025-01-16 01:34:43,221 - INFO - step 7532, loss: 2.701875, best loss: 1.954166 2025-01-16 01:34:43,371 - INFO - step 7533, loss: 2.451157, best loss: 1.954166 2025-01-16 01:34:43,522 - INFO - step 7534, loss: 2.516981, best 
loss: 1.954166 2025-01-16 01:34:43,672 - INFO - step 7535, loss: 2.866262, best loss: 1.954166 2025-01-16 01:34:43,822 - INFO - step 7536, loss: 2.528336, best loss: 1.954166 2025-01-16 01:34:43,972 - INFO - step 7537, loss: 2.581604, best loss: 1.954166 2025-01-16 01:34:44,122 - INFO - step 7538, loss: 2.515035, best loss: 1.954166 2025-01-16 01:34:44,273 - INFO - step 7539, loss: 2.820364, best loss: 1.954166 2025-01-16 01:34:44,423 - INFO - step 7540, loss: 2.568862, best loss: 1.954166 2025-01-16 01:34:44,573 - INFO - step 7541, loss: 2.529054, best loss: 1.954166 2025-01-16 01:34:44,723 - INFO - step 7542, loss: 2.470934, best loss: 1.954166 2025-01-16 01:34:44,873 - INFO - step 7543, loss: 2.786280, best loss: 1.954166 2025-01-16 01:34:45,023 - INFO - step 7544, loss: 3.016582, best loss: 1.954166 2025-01-16 01:34:45,173 - INFO - step 7545, loss: 3.030431, best loss: 1.954166 2025-01-16 01:34:45,324 - INFO - step 7546, loss: 2.788558, best loss: 1.954166 2025-01-16 01:34:45,474 - INFO - step 7547, loss: 2.858747, best loss: 1.954166 2025-01-16 01:34:45,624 - INFO - step 7548, loss: 2.745813, best loss: 1.954166 2025-01-16 01:34:45,774 - INFO - step 7549, loss: 2.739804, best loss: 1.954166 2025-01-16 01:34:45,924 - INFO - step 7550, loss: 2.571892, best loss: 1.954166 2025-01-16 01:34:46,074 - INFO - step 7551, loss: 2.818419, best loss: 1.954166 2025-01-16 01:34:46,224 - INFO - step 7552, loss: 2.687752, best loss: 1.954166 2025-01-16 01:34:46,375 - INFO - step 7553, loss: 2.509995, best loss: 1.954166 2025-01-16 01:34:46,525 - INFO - step 7554, loss: 2.685744, best loss: 1.954166 2025-01-16 01:34:46,675 - INFO - step 7555, loss: 2.667971, best loss: 1.954166 2025-01-16 01:34:46,825 - INFO - step 7556, loss: 2.820264, best loss: 1.954166 2025-01-16 01:34:46,975 - INFO - step 7557, loss: 2.119962, best loss: 1.954166 2025-01-16 01:34:47,125 - INFO - step 7558, loss: 2.617912, best loss: 1.954166 2025-01-16 01:34:47,275 - INFO - step 7559, loss: 2.745055, best 
loss: 1.954166 2025-01-16 01:34:47,425 - INFO - step 7560, loss: 2.593072, best loss: 1.954166 2025-01-16 01:34:47,576 - INFO - step 7561, loss: 2.561689, best loss: 1.954166 2025-01-16 01:34:47,725 - INFO - step 7562, loss: 2.526890, best loss: 1.954166 2025-01-16 01:34:47,876 - INFO - step 7563, loss: 2.551553, best loss: 1.954166 2025-01-16 01:34:48,026 - INFO - step 7564, loss: 2.510911, best loss: 1.954166 2025-01-16 01:34:48,176 - INFO - step 7565, loss: 2.588250, best loss: 1.954166 2025-01-16 01:34:48,326 - INFO - step 7566, loss: 2.530814, best loss: 1.954166 2025-01-16 01:34:48,477 - INFO - step 7567, loss: 2.693162, best loss: 1.954166 2025-01-16 01:34:48,627 - INFO - step 7568, loss: 2.522435, best loss: 1.954166 2025-01-16 01:34:48,777 - INFO - step 7569, loss: 2.556228, best loss: 1.954166 2025-01-16 01:34:48,927 - INFO - step 7570, loss: 2.397970, best loss: 1.954166 2025-01-16 01:34:49,077 - INFO - step 7571, loss: 2.415709, best loss: 1.954166 2025-01-16 01:34:49,227 - INFO - step 7572, loss: 2.589504, best loss: 1.954166 2025-01-16 01:34:49,377 - INFO - step 7573, loss: 2.499507, best loss: 1.954166 2025-01-16 01:34:49,528 - INFO - step 7574, loss: 2.435771, best loss: 1.954166 2025-01-16 01:34:49,678 - INFO - step 7575, loss: 2.171084, best loss: 1.954166 2025-01-16 01:34:49,827 - INFO - step 7576, loss: 2.196008, best loss: 1.954166 2025-01-16 01:34:49,978 - INFO - step 7577, loss: 2.080151, best loss: 1.954166 2025-01-16 01:34:50,128 - INFO - step 7578, loss: 2.598404, best loss: 1.954166 2025-01-16 01:34:50,278 - INFO - step 7579, loss: 2.725163, best loss: 1.954166 2025-01-16 01:34:50,428 - INFO - step 7580, loss: 2.804160, best loss: 1.954166 2025-01-16 01:34:50,578 - INFO - step 7581, loss: 2.939420, best loss: 1.954166 2025-01-16 01:34:50,728 - INFO - step 7582, loss: 2.891826, best loss: 1.954166 2025-01-16 01:34:50,879 - INFO - step 7583, loss: 2.538799, best loss: 1.954166 2025-01-16 01:34:51,029 - INFO - step 7584, loss: 2.673952, best 
loss: 1.954166 2025-01-16 01:34:51,179 - INFO - step 7585, loss: 2.861641, best loss: 1.954166 2025-01-16 01:34:51,329 - INFO - step 7586, loss: 2.679158, best loss: 1.954166 2025-01-16 01:34:51,479 - INFO - step 7587, loss: 2.196738, best loss: 1.954166 2025-01-16 01:34:51,629 - INFO - step 7588, loss: 2.521504, best loss: 1.954166 2025-01-16 01:34:51,779 - INFO - step 7589, loss: 2.397847, best loss: 1.954166 2025-01-16 01:34:51,929 - INFO - step 7590, loss: 2.819457, best loss: 1.954166 2025-01-16 01:34:52,079 - INFO - step 7591, loss: 2.746707, best loss: 1.954166 2025-01-16 01:34:52,230 - INFO - step 7592, loss: 2.763017, best loss: 1.954166 2025-01-16 01:34:52,380 - INFO - step 7593, loss: 2.813419, best loss: 1.954166 2025-01-16 01:34:52,530 - INFO - step 7594, loss: 2.762214, best loss: 1.954166 2025-01-16 01:34:52,680 - INFO - step 7595, loss: 2.399453, best loss: 1.954166 2025-01-16 01:34:52,830 - INFO - step 7596, loss: 2.831269, best loss: 1.954166 2025-01-16 01:34:52,980 - INFO - step 7597, loss: 2.702758, best loss: 1.954166 2025-01-16 01:34:53,130 - INFO - step 7598, loss: 2.834439, best loss: 1.954166 2025-01-16 01:34:53,281 - INFO - step 7599, loss: 2.786607, best loss: 1.954166 2025-01-16 01:34:53,431 - INFO - step 7600, loss: 2.546150, best loss: 1.954166 2025-01-16 01:34:53,581 - INFO - step 7601, loss: 2.811196, best loss: 1.954166 2025-01-16 01:34:53,732 - INFO - step 7602, loss: 2.520422, best loss: 1.954166 2025-01-16 01:34:53,882 - INFO - step 7603, loss: 2.807319, best loss: 1.954166 2025-01-16 01:34:54,032 - INFO - step 7604, loss: 2.912080, best loss: 1.954166 2025-01-16 01:34:54,182 - INFO - step 7605, loss: 2.822229, best loss: 1.954166 2025-01-16 01:34:54,332 - INFO - step 7606, loss: 2.756765, best loss: 1.954166 2025-01-16 01:34:54,482 - INFO - step 7607, loss: 2.669982, best loss: 1.954166 2025-01-16 01:34:54,632 - INFO - step 7608, loss: 2.608656, best loss: 1.954166 2025-01-16 01:34:54,782 - INFO - step 7609, loss: 2.673522, best 
loss: 1.954166 2025-01-16 01:34:54,932 - INFO - step 7610, loss: 2.369764, best loss: 1.954166 2025-01-16 01:34:55,083 - INFO - step 7611, loss: 2.987050, best loss: 1.954166 2025-01-16 01:34:55,233 - INFO - step 7612, loss: 2.247574, best loss: 1.954166 2025-01-16 01:34:55,383 - INFO - step 7613, loss: 2.391069, best loss: 1.954166 2025-01-16 01:34:55,533 - INFO - step 7614, loss: 2.668563, best loss: 1.954166 2025-01-16 01:34:55,683 - INFO - step 7615, loss: 2.797443, best loss: 1.954166 2025-01-16 01:34:55,833 - INFO - step 7616, loss: 2.609502, best loss: 1.954166 2025-01-16 01:34:55,983 - INFO - step 7617, loss: 2.455596, best loss: 1.954166 2025-01-16 01:34:56,134 - INFO - step 7618, loss: 2.594537, best loss: 1.954166 2025-01-16 01:34:56,284 - INFO - step 7619, loss: 2.748443, best loss: 1.954166 2025-01-16 01:34:56,434 - INFO - step 7620, loss: 2.421416, best loss: 1.954166 2025-01-16 01:34:56,584 - INFO - step 7621, loss: 2.418001, best loss: 1.954166 2025-01-16 01:34:56,734 - INFO - step 7622, loss: 2.632650, best loss: 1.954166 2025-01-16 01:34:56,885 - INFO - step 7623, loss: 2.642375, best loss: 1.954166 2025-01-16 01:34:57,035 - INFO - step 7624, loss: 2.475881, best loss: 1.954166 2025-01-16 01:34:57,185 - INFO - step 7625, loss: 2.395259, best loss: 1.954166 2025-01-16 01:34:57,335 - INFO - step 7626, loss: 2.719145, best loss: 1.954166 2025-01-16 01:34:57,485 - INFO - step 7627, loss: 2.787544, best loss: 1.954166 2025-01-16 01:34:57,635 - INFO - step 7628, loss: 2.627353, best loss: 1.954166 2025-01-16 01:34:57,785 - INFO - step 7629, loss: 2.598642, best loss: 1.954166 2025-01-16 01:34:57,935 - INFO - step 7630, loss: 2.852943, best loss: 1.954166 2025-01-16 01:34:58,086 - INFO - step 7631, loss: 2.802007, best loss: 1.954166 2025-01-16 01:34:58,236 - INFO - step 7632, loss: 2.780391, best loss: 1.954166 2025-01-16 01:34:58,386 - INFO - step 7633, loss: 2.606133, best loss: 1.954166 2025-01-16 01:34:58,536 - INFO - step 7634, loss: 2.620647, best 
loss: 1.954166 2025-01-16 01:34:58,687 - INFO - step 7635, loss: 2.487564, best loss: 1.954166 2025-01-16 01:34:58,837 - INFO - step 7636, loss: 2.985135, best loss: 1.954166 2025-01-16 01:34:58,986 - INFO - step 7637, loss: 2.727305, best loss: 1.954166 2025-01-16 01:34:59,137 - INFO - step 7638, loss: 2.798132, best loss: 1.954166 2025-01-16 01:34:59,287 - INFO - step 7639, loss: 2.271469, best loss: 1.954166 2025-01-16 01:34:59,437 - INFO - step 7640, loss: 2.414205, best loss: 1.954166 2025-01-16 01:34:59,587 - INFO - step 7641, loss: 2.678077, best loss: 1.954166 2025-01-16 01:34:59,737 - INFO - step 7642, loss: 2.597382, best loss: 1.954166 2025-01-16 01:34:59,887 - INFO - step 7643, loss: 2.590731, best loss: 1.954166 2025-01-16 01:35:00,038 - INFO - step 7644, loss: 2.667295, best loss: 1.954166 2025-01-16 01:35:00,188 - INFO - step 7645, loss: 2.541973, best loss: 1.954166 2025-01-16 01:35:00,338 - INFO - step 7646, loss: 2.624183, best loss: 1.954166 2025-01-16 01:35:00,488 - INFO - step 7647, loss: 2.635646, best loss: 1.954166 2025-01-16 01:35:00,638 - INFO - step 7648, loss: 2.287719, best loss: 1.954166 2025-01-16 01:35:00,788 - INFO - step 7649, loss: 2.597685, best loss: 1.954166 2025-01-16 01:35:00,938 - INFO - step 7650, loss: 2.658823, best loss: 1.954166 2025-01-16 01:35:01,088 - INFO - step 7651, loss: 2.796709, best loss: 1.954166 2025-01-16 01:35:01,238 - INFO - step 7652, loss: 2.717939, best loss: 1.954166 2025-01-16 01:35:01,388 - INFO - step 7653, loss: 2.794542, best loss: 1.954166 2025-01-16 01:35:01,539 - INFO - step 7654, loss: 2.702307, best loss: 1.954166 2025-01-16 01:35:01,689 - INFO - step 7655, loss: 2.350911, best loss: 1.954166 2025-01-16 01:35:01,839 - INFO - step 7656, loss: 2.718209, best loss: 1.954166 2025-01-16 01:35:01,989 - INFO - step 7657, loss: 2.261590, best loss: 1.954166 2025-01-16 01:35:02,139 - INFO - step 7658, loss: 2.475655, best loss: 1.954166 2025-01-16 01:35:02,290 - INFO - step 7659, loss: 2.596400, best 
loss: 1.954166 2025-01-16 01:35:02,440 - INFO - step 7660, loss: 2.565741, best loss: 1.954166 2025-01-16 01:35:02,590 - INFO - step 7661, loss: 2.549352, best loss: 1.954166 2025-01-16 01:35:02,740 - INFO - step 7662, loss: 2.713162, best loss: 1.954166 2025-01-16 01:35:02,890 - INFO - step 7663, loss: 2.791157, best loss: 1.954166 2025-01-16 01:35:03,040 - INFO - step 7664, loss: 2.646054, best loss: 1.954166 2025-01-16 01:35:03,190 - INFO - step 7665, loss: 2.813859, best loss: 1.954166 2025-01-16 01:35:03,341 - INFO - step 7666, loss: 2.643789, best loss: 1.954166 2025-01-16 01:35:03,491 - INFO - step 7667, loss: 2.710415, best loss: 1.954166 2025-01-16 01:35:03,641 - INFO - step 7668, loss: 2.559430, best loss: 1.954166 2025-01-16 01:35:03,791 - INFO - step 7669, loss: 2.280806, best loss: 1.954166 2025-01-16 01:35:03,941 - INFO - step 7670, loss: 2.911915, best loss: 1.954166 2025-01-16 01:35:04,091 - INFO - step 7671, loss: 2.680763, best loss: 1.954166 2025-01-16 01:35:04,241 - INFO - step 7672, loss: 2.544520, best loss: 1.954166 2025-01-16 01:35:04,391 - INFO - step 7673, loss: 2.552065, best loss: 1.954166 2025-01-16 01:35:04,542 - INFO - step 7674, loss: 2.442233, best loss: 1.954166 2025-01-16 01:35:04,692 - INFO - step 7675, loss: 2.568004, best loss: 1.954166 2025-01-16 01:35:04,842 - INFO - step 7676, loss: 2.390526, best loss: 1.954166 2025-01-16 01:35:04,992 - INFO - step 7677, loss: 2.351557, best loss: 1.954166 2025-01-16 01:35:05,142 - INFO - step 7678, loss: 2.756534, best loss: 1.954166 2025-01-16 01:35:05,292 - INFO - step 7679, loss: 2.732375, best loss: 1.954166 2025-01-16 01:35:05,443 - INFO - step 7680, loss: 2.542456, best loss: 1.954166 2025-01-16 01:35:05,593 - INFO - step 7681, loss: 2.759376, best loss: 1.954166 2025-01-16 01:35:05,743 - INFO - step 7682, loss: 2.533994, best loss: 1.954166 2025-01-16 01:35:05,893 - INFO - step 7683, loss: 2.803418, best loss: 1.954166 2025-01-16 01:35:06,043 - INFO - step 7684, loss: 2.924582, best 
loss: 1.954166 2025-01-16 01:35:06,194 - INFO - step 7685, loss: 2.973535, best loss: 1.954166 2025-01-16 01:35:06,344 - INFO - step 7686, loss: 3.041763, best loss: 1.954166 2025-01-16 01:35:06,494 - INFO - step 7687, loss: 2.739941, best loss: 1.954166 2025-01-16 01:35:06,644 - INFO - step 7688, loss: 2.819007, best loss: 1.954166 2025-01-16 01:35:06,795 - INFO - step 7689, loss: 2.821494, best loss: 1.954166 2025-01-16 01:35:06,945 - INFO - step 7690, loss: 2.663400, best loss: 1.954166 2025-01-16 01:35:07,095 - INFO - step 7691, loss: 2.982258, best loss: 1.954166 2025-01-16 01:35:07,245 - INFO - step 7692, loss: 2.879013, best loss: 1.954166 2025-01-16 01:35:07,395 - INFO - step 7693, loss: 2.996445, best loss: 1.954166 2025-01-16 01:35:07,546 - INFO - step 7694, loss: 2.653953, best loss: 1.954166 2025-01-16 01:35:07,696 - INFO - step 7695, loss: 2.718400, best loss: 1.954166 2025-01-16 01:35:07,846 - INFO - step 7696, loss: 2.697494, best loss: 1.954166 2025-01-16 01:35:07,996 - INFO - step 7697, loss: 2.584300, best loss: 1.954166 2025-01-16 01:35:08,146 - INFO - step 7698, loss: 2.597145, best loss: 1.954166 2025-01-16 01:35:08,296 - INFO - step 7699, loss: 2.907609, best loss: 1.954166 2025-01-16 01:35:08,446 - INFO - step 7700, loss: 2.791144, best loss: 1.954166 2025-01-16 01:35:08,596 - INFO - step 7701, loss: 2.968377, best loss: 1.954166 2025-01-16 01:35:08,746 - INFO - step 7702, loss: 2.812754, best loss: 1.954166 2025-01-16 01:35:08,896 - INFO - step 7703, loss: 2.614574, best loss: 1.954166 2025-01-16 01:35:09,046 - INFO - step 7704, loss: 3.016555, best loss: 1.954166 2025-01-16 01:35:09,196 - INFO - step 7705, loss: 2.671684, best loss: 1.954166 2025-01-16 01:35:09,347 - INFO - step 7706, loss: 2.803954, best loss: 1.954166 2025-01-16 01:35:09,497 - INFO - step 7707, loss: 2.622824, best loss: 1.954166 2025-01-16 01:35:09,647 - INFO - step 7708, loss: 2.705395, best loss: 1.954166 2025-01-16 01:35:09,797 - INFO - step 7709, loss: 2.697745, best 
loss: 1.954166 2025-01-16 01:35:09,948 - INFO - step 7710, loss: 2.755605, best loss: 1.954166 2025-01-16 01:35:10,098 - INFO - step 7711, loss: 2.664980, best loss: 1.954166 2025-01-16 01:35:10,248 - INFO - step 7712, loss: 2.755275, best loss: 1.954166 2025-01-16 01:35:10,398 - INFO - step 7713, loss: 2.465609, best loss: 1.954166 2025-01-16 01:35:10,548 - INFO - step 7714, loss: 2.298506, best loss: 1.954166 2025-01-16 01:35:10,698 - INFO - step 7715, loss: 2.499484, best loss: 1.954166 2025-01-16 01:35:10,848 - INFO - step 7716, loss: 2.651944, best loss: 1.954166 2025-01-16 01:35:10,999 - INFO - step 7717, loss: 2.988667, best loss: 1.954166 2025-01-16 01:35:11,149 - INFO - step 7718, loss: 2.662991, best loss: 1.954166 2025-01-16 01:35:11,299 - INFO - step 7719, loss: 2.266778, best loss: 1.954166 2025-01-16 01:35:11,450 - INFO - step 7720, loss: 2.768665, best loss: 1.954166 2025-01-16 01:35:11,600 - INFO - step 7721, loss: 2.623470, best loss: 1.954166 2025-01-16 01:35:11,750 - INFO - step 7722, loss: 3.045261, best loss: 1.954166 2025-01-16 01:35:11,900 - INFO - step 7723, loss: 2.408372, best loss: 1.954166 2025-01-16 01:35:12,050 - INFO - step 7724, loss: 2.811739, best loss: 1.954166 2025-01-16 01:35:12,200 - INFO - step 7725, loss: 2.616961, best loss: 1.954166 2025-01-16 01:35:12,351 - INFO - step 7726, loss: 2.816483, best loss: 1.954166 2025-01-16 01:35:12,501 - INFO - step 7727, loss: 2.692524, best loss: 1.954166 2025-01-16 01:35:12,651 - INFO - step 7728, loss: 2.610424, best loss: 1.954166 2025-01-16 01:35:12,801 - INFO - step 7729, loss: 2.811123, best loss: 1.954166 2025-01-16 01:35:12,951 - INFO - step 7730, loss: 2.664746, best loss: 1.954166 2025-01-16 01:35:13,101 - INFO - step 7731, loss: 2.519159, best loss: 1.954166 2025-01-16 01:35:13,251 - INFO - step 7732, loss: 2.971436, best loss: 1.954166 2025-01-16 01:35:13,401 - INFO - step 7733, loss: 2.603174, best loss: 1.954166 2025-01-16 01:35:13,551 - INFO - step 7734, loss: 2.154997, best 
loss: 1.954166 2025-01-16 01:35:13,701 - INFO - step 7735, loss: 2.591958, best loss: 1.954166 2025-01-16 01:35:13,851 - INFO - step 7736, loss: 2.877330, best loss: 1.954166 2025-01-16 01:35:14,001 - INFO - step 7737, loss: 2.775330, best loss: 1.954166 2025-01-16 01:35:14,152 - INFO - step 7738, loss: 2.492347, best loss: 1.954166 2025-01-16 01:35:14,302 - INFO - step 7739, loss: 2.545088, best loss: 1.954166 2025-01-16 01:35:14,452 - INFO - step 7740, loss: 2.965724, best loss: 1.954166 2025-01-16 01:35:14,602 - INFO - step 7741, loss: 2.930240, best loss: 1.954166 2025-01-16 01:35:14,752 - INFO - step 7742, loss: 2.806515, best loss: 1.954166 2025-01-16 01:35:14,903 - INFO - step 7743, loss: 2.535290, best loss: 1.954166 2025-01-16 01:35:15,053 - INFO - step 7744, loss: 2.799584, best loss: 1.954166 2025-01-16 01:35:15,203 - INFO - step 7745, loss: 2.717996, best loss: 1.954166 2025-01-16 01:35:15,353 - INFO - step 7746, loss: 2.409918, best loss: 1.954166 2025-01-16 01:35:15,503 - INFO - step 7747, loss: 2.649945, best loss: 1.954166 2025-01-16 01:35:15,653 - INFO - step 7748, loss: 2.542722, best loss: 1.954166 2025-01-16 01:35:15,804 - INFO - step 7749, loss: 2.698056, best loss: 1.954166 2025-01-16 01:35:15,954 - INFO - step 7750, loss: 2.550247, best loss: 1.954166 2025-01-16 01:35:16,104 - INFO - step 7751, loss: 2.619961, best loss: 1.954166 2025-01-16 01:35:16,254 - INFO - step 7752, loss: 2.410308, best loss: 1.954166 2025-01-16 01:35:16,404 - INFO - step 7753, loss: 2.373666, best loss: 1.954166 2025-01-16 01:35:16,555 - INFO - step 7754, loss: 2.824083, best loss: 1.954166 2025-01-16 01:35:16,705 - INFO - step 7755, loss: 2.589635, best loss: 1.954166 2025-01-16 01:35:16,855 - INFO - step 7756, loss: 2.830624, best loss: 1.954166 2025-01-16 01:35:17,004 - INFO - step 7757, loss: 2.562227, best loss: 1.954166 2025-01-16 01:35:17,154 - INFO - step 7758, loss: 2.706751, best loss: 1.954166 2025-01-16 01:35:17,304 - INFO - step 7759, loss: 2.840664, best 
loss: 1.954166 2025-01-16 01:35:17,455 - INFO - step 7760, loss: 2.447162, best loss: 1.954166 2025-01-16 01:35:17,605 - INFO - step 7761, loss: 2.358684, best loss: 1.954166 2025-01-16 01:35:17,755 - INFO - step 7762, loss: 2.409642, best loss: 1.954166 2025-01-16 01:35:17,905 - INFO - step 7763, loss: 2.644539, best loss: 1.954166 2025-01-16 01:35:18,055 - INFO - step 7764, loss: 2.663010, best loss: 1.954166 2025-01-16 01:35:18,205 - INFO - step 7765, loss: 2.712747, best loss: 1.954166 2025-01-16 01:35:18,355 - INFO - step 7766, loss: 3.043485, best loss: 1.954166 2025-01-16 01:35:18,506 - INFO - step 7767, loss: 2.897653, best loss: 1.954166 2025-01-16 01:35:18,655 - INFO - step 7768, loss: 2.849393, best loss: 1.954166 2025-01-16 01:35:18,806 - INFO - step 7769, loss: 2.833437, best loss: 1.954166 2025-01-16 01:35:18,956 - INFO - step 7770, loss: 2.865815, best loss: 1.954166 2025-01-16 01:35:19,106 - INFO - step 7771, loss: 2.456616, best loss: 1.954166 2025-01-16 01:35:19,256 - INFO - step 7772, loss: 2.887938, best loss: 1.954166 2025-01-16 01:35:19,407 - INFO - step 7773, loss: 2.943914, best loss: 1.954166 2025-01-16 01:35:19,557 - INFO - step 7774, loss: 2.741982, best loss: 1.954166 2025-01-16 01:35:19,707 - INFO - step 7775, loss: 2.742785, best loss: 1.954166 2025-01-16 01:35:19,857 - INFO - step 7776, loss: 2.889732, best loss: 1.954166 2025-01-16 01:35:20,007 - INFO - step 7777, loss: 2.570397, best loss: 1.954166 2025-01-16 01:35:23,525 - INFO - step 7778, loss: 1.898043, best loss: 1.898043 2025-01-16 01:35:23,687 - INFO - step 7779, loss: 2.748065, best loss: 1.898043 2025-01-16 01:35:23,839 - INFO - step 7780, loss: 2.740178, best loss: 1.898043 2025-01-16 01:35:23,989 - INFO - step 7781, loss: 2.737402, best loss: 1.898043 2025-01-16 01:35:24,140 - INFO - step 7782, loss: 2.832169, best loss: 1.898043 2025-01-16 01:35:24,290 - INFO - step 7783, loss: 2.635764, best loss: 1.898043 2025-01-16 01:35:24,440 - INFO - step 7784, loss: 2.753461, best 
loss: 1.898043 2025-01-16 01:35:24,590 - INFO - step 7785, loss: 2.758449, best loss: 1.898043 2025-01-16 01:35:24,741 - INFO - step 7786, loss: 2.643760, best loss: 1.898043 2025-01-16 01:35:24,891 - INFO - step 7787, loss: 2.660084, best loss: 1.898043 2025-01-16 01:35:25,041 - INFO - step 7788, loss: 2.630867, best loss: 1.898043 2025-01-16 01:35:25,191 - INFO - step 7789, loss: 2.415869, best loss: 1.898043 2025-01-16 01:35:25,342 - INFO - step 7790, loss: 2.709144, best loss: 1.898043 2025-01-16 01:35:25,492 - INFO - step 7791, loss: 2.563952, best loss: 1.898043 2025-01-16 01:35:25,642 - INFO - step 7792, loss: 2.726699, best loss: 1.898043 2025-01-16 01:35:25,792 - INFO - step 7793, loss: 2.846206, best loss: 1.898043 2025-01-16 01:35:25,943 - INFO - step 7794, loss: 2.615028, best loss: 1.898043 2025-01-16 01:35:26,093 - INFO - step 7795, loss: 2.392302, best loss: 1.898043 2025-01-16 01:35:26,243 - INFO - step 7796, loss: 2.701271, best loss: 1.898043 2025-01-16 01:35:26,393 - INFO - step 7797, loss: 2.680630, best loss: 1.898043 2025-01-16 01:35:26,544 - INFO - step 7798, loss: 2.749617, best loss: 1.898043 2025-01-16 01:35:26,695 - INFO - step 7799, loss: 2.691491, best loss: 1.898043 2025-01-16 01:35:26,845 - INFO - step 7800, loss: 2.721671, best loss: 1.898043 2025-01-16 01:35:26,995 - INFO - step 7801, loss: 2.580355, best loss: 1.898043 2025-01-16 01:35:27,145 - INFO - step 7802, loss: 2.787086, best loss: 1.898043 2025-01-16 01:35:27,296 - INFO - step 7803, loss: 2.584535, best loss: 1.898043 2025-01-16 01:35:27,446 - INFO - step 7804, loss: 2.640557, best loss: 1.898043 2025-01-16 01:35:27,596 - INFO - step 7805, loss: 2.777494, best loss: 1.898043 2025-01-16 01:35:27,746 - INFO - step 7806, loss: 2.886578, best loss: 1.898043 2025-01-16 01:35:27,897 - INFO - step 7807, loss: 2.799835, best loss: 1.898043 2025-01-16 01:35:28,047 - INFO - step 7808, loss: 2.599871, best loss: 1.898043 2025-01-16 01:35:28,197 - INFO - step 7809, loss: 2.608030, best 
loss: 1.898043 2025-01-16 01:35:28,347 - INFO - step 7810, loss: 2.823086, best loss: 1.898043 2025-01-16 01:35:28,497 - INFO - step 7811, loss: 2.916752, best loss: 1.898043 2025-01-16 01:35:28,647 - INFO - step 7812, loss: 2.798667, best loss: 1.898043 2025-01-16 01:35:28,797 - INFO - step 7813, loss: 2.977584, best loss: 1.898043 2025-01-16 01:35:28,948 - INFO - step 7814, loss: 3.001854, best loss: 1.898043 2025-01-16 01:35:29,098 - INFO - step 7815, loss: 2.724087, best loss: 1.898043 2025-01-16 01:35:29,249 - INFO - step 7816, loss: 3.043320, best loss: 1.898043 2025-01-16 01:35:29,399 - INFO - step 7817, loss: 2.769686, best loss: 1.898043 2025-01-16 01:35:29,550 - INFO - step 7818, loss: 2.374735, best loss: 1.898043 2025-01-16 01:35:29,700 - INFO - step 7819, loss: 2.857715, best loss: 1.898043 2025-01-16 01:35:29,850 - INFO - step 7820, loss: 2.816912, best loss: 1.898043 2025-01-16 01:35:30,000 - INFO - step 7821, loss: 2.712195, best loss: 1.898043 2025-01-16 01:35:30,150 - INFO - step 7822, loss: 2.559845, best loss: 1.898043 2025-01-16 01:35:30,299 - INFO - step 7823, loss: 2.793954, best loss: 1.898043 2025-01-16 01:35:30,450 - INFO - step 7824, loss: 2.781862, best loss: 1.898043 2025-01-16 01:35:30,600 - INFO - step 7825, loss: 2.699361, best loss: 1.898043 2025-01-16 01:35:30,750 - INFO - step 7826, loss: 2.874546, best loss: 1.898043 2025-01-16 01:35:30,900 - INFO - step 7827, loss: 2.468787, best loss: 1.898043 2025-01-16 01:35:31,050 - INFO - step 7828, loss: 2.488086, best loss: 1.898043 2025-01-16 01:35:31,200 - INFO - step 7829, loss: 2.890518, best loss: 1.898043 2025-01-16 01:35:31,351 - INFO - step 7830, loss: 2.849978, best loss: 1.898043 2025-01-16 01:35:31,501 - INFO - step 7831, loss: 2.938201, best loss: 1.898043 2025-01-16 01:35:31,651 - INFO - step 7832, loss: 2.721687, best loss: 1.898043 2025-01-16 01:35:31,801 - INFO - step 7833, loss: 2.803715, best loss: 1.898043 2025-01-16 01:35:31,951 - INFO - step 7834, loss: 2.882479, best 
loss: 1.898043 2025-01-16 01:35:32,101 - INFO - step 7835, loss: 2.602626, best loss: 1.898043 2025-01-16 01:35:32,251 - INFO - step 7836, loss: 2.575473, best loss: 1.898043 2025-01-16 01:35:32,401 - INFO - step 7837, loss: 2.811486, best loss: 1.898043 2025-01-16 01:35:32,552 - INFO - step 7838, loss: 2.463331, best loss: 1.898043 2025-01-16 01:35:32,702 - INFO - step 7839, loss: 2.250591, best loss: 1.898043 2025-01-16 01:35:32,852 - INFO - step 7840, loss: 2.740826, best loss: 1.898043 2025-01-16 01:35:33,002 - INFO - step 7841, loss: 2.692027, best loss: 1.898043 2025-01-16 01:35:33,152 - INFO - step 7842, loss: 2.695415, best loss: 1.898043 2025-01-16 01:35:33,302 - INFO - step 7843, loss: 2.290468, best loss: 1.898043 2025-01-16 01:35:33,452 - INFO - step 7844, loss: 2.128849, best loss: 1.898043 2025-01-16 01:35:33,602 - INFO - step 7845, loss: 2.129360, best loss: 1.898043 2025-01-16 01:35:33,753 - INFO - step 7846, loss: 2.318518, best loss: 1.898043 2025-01-16 01:35:33,903 - INFO - step 7847, loss: 2.537625, best loss: 1.898043 2025-01-16 01:35:34,053 - INFO - step 7848, loss: 2.692640, best loss: 1.898043 2025-01-16 01:35:34,203 - INFO - step 7849, loss: 2.682081, best loss: 1.898043 2025-01-16 01:35:34,353 - INFO - step 7850, loss: 2.496388, best loss: 1.898043 2025-01-16 01:35:34,504 - INFO - step 7851, loss: 2.537176, best loss: 1.898043 2025-01-16 01:35:34,654 - INFO - step 7852, loss: 2.612742, best loss: 1.898043 2025-01-16 01:35:34,804 - INFO - step 7853, loss: 2.484542, best loss: 1.898043 2025-01-16 01:35:34,954 - INFO - step 7854, loss: 2.652396, best loss: 1.898043 2025-01-16 01:35:35,104 - INFO - step 7855, loss: 2.680445, best loss: 1.898043 2025-01-16 01:35:35,254 - INFO - step 7856, loss: 2.387213, best loss: 1.898043 2025-01-16 01:35:35,405 - INFO - step 7857, loss: 2.340520, best loss: 1.898043 2025-01-16 01:35:35,555 - INFO - step 7858, loss: 2.507022, best loss: 1.898043 2025-01-16 01:35:35,705 - INFO - step 7859, loss: 2.653903, best 
loss: 1.898043 2025-01-16 01:35:35,855 - INFO - step 7860, loss: 2.330696, best loss: 1.898043 2025-01-16 01:35:36,006 - INFO - step 7861, loss: 2.451519, best loss: 1.898043 2025-01-16 01:35:36,156 - INFO - step 7862, loss: 2.591660, best loss: 1.898043 2025-01-16 01:35:36,306 - INFO - step 7863, loss: 2.374205, best loss: 1.898043 2025-01-16 01:35:36,456 - INFO - step 7864, loss: 2.404764, best loss: 1.898043 2025-01-16 01:35:36,606 - INFO - step 7865, loss: 2.720312, best loss: 1.898043 2025-01-16 01:35:36,756 - INFO - step 7866, loss: 2.415161, best loss: 1.898043 2025-01-16 01:35:36,906 - INFO - step 7867, loss: 2.445537, best loss: 1.898043 2025-01-16 01:35:37,056 - INFO - step 7868, loss: 2.397132, best loss: 1.898043 2025-01-16 01:35:37,206 - INFO - step 7869, loss: 2.695171, best loss: 1.898043 2025-01-16 01:35:37,357 - INFO - step 7870, loss: 2.370699, best loss: 1.898043 2025-01-16 01:35:37,507 - INFO - step 7871, loss: 2.430508, best loss: 1.898043 2025-01-16 01:35:37,657 - INFO - step 7872, loss: 2.389719, best loss: 1.898043 2025-01-16 01:35:37,808 - INFO - step 7873, loss: 2.642034, best loss: 1.898043 2025-01-16 01:35:37,958 - INFO - step 7874, loss: 2.960993, best loss: 1.898043 2025-01-16 01:35:38,108 - INFO - step 7875, loss: 2.897135, best loss: 1.898043 2025-01-16 01:35:38,258 - INFO - step 7876, loss: 2.600877, best loss: 1.898043 2025-01-16 01:35:38,408 - INFO - step 7877, loss: 2.713291, best loss: 1.898043 2025-01-16 01:35:38,559 - INFO - step 7878, loss: 2.615746, best loss: 1.898043 2025-01-16 01:35:38,709 - INFO - step 7879, loss: 2.642545, best loss: 1.898043 2025-01-16 01:35:38,859 - INFO - step 7880, loss: 2.470356, best loss: 1.898043 2025-01-16 01:35:39,009 - INFO - step 7881, loss: 2.704801, best loss: 1.898043 2025-01-16 01:35:39,160 - INFO - step 7882, loss: 2.517302, best loss: 1.898043 2025-01-16 01:35:39,310 - INFO - step 7883, loss: 2.356899, best loss: 1.898043 2025-01-16 01:35:39,460 - INFO - step 7884, loss: 2.589401, best 
loss: 1.898043 2025-01-16 01:35:39,611 - INFO - step 7885, loss: 2.479141, best loss: 1.898043 2025-01-16 01:35:39,761 - INFO - step 7886, loss: 2.692097, best loss: 1.898043 2025-01-16 01:35:39,911 - INFO - step 7887, loss: 2.097525, best loss: 1.898043 2025-01-16 01:35:40,062 - INFO - step 7888, loss: 2.549121, best loss: 1.898043 2025-01-16 01:35:40,212 - INFO - step 7889, loss: 2.700751, best loss: 1.898043 2025-01-16 01:35:40,362 - INFO - step 7890, loss: 2.520769, best loss: 1.898043 2025-01-16 01:35:40,512 - INFO - step 7891, loss: 2.467469, best loss: 1.898043 2025-01-16 01:35:40,663 - INFO - step 7892, loss: 2.425763, best loss: 1.898043 2025-01-16 01:35:40,813 - INFO - step 7893, loss: 2.459478, best loss: 1.898043 2025-01-16 01:35:40,963 - INFO - step 7894, loss: 2.348516, best loss: 1.898043 2025-01-16 01:35:41,114 - INFO - step 7895, loss: 2.473065, best loss: 1.898043 2025-01-16 01:35:41,264 - INFO - step 7896, loss: 2.435863, best loss: 1.898043 2025-01-16 01:35:41,414 - INFO - step 7897, loss: 2.608196, best loss: 1.898043 2025-01-16 01:35:41,565 - INFO - step 7898, loss: 2.390409, best loss: 1.898043 2025-01-16 01:35:41,715 - INFO - step 7899, loss: 2.416209, best loss: 1.898043 2025-01-16 01:35:41,865 - INFO - step 7900, loss: 2.285302, best loss: 1.898043 2025-01-16 01:35:42,015 - INFO - step 7901, loss: 2.275613, best loss: 1.898043 2025-01-16 01:35:42,165 - INFO - step 7902, loss: 2.515532, best loss: 1.898043 2025-01-16 01:35:42,316 - INFO - step 7903, loss: 2.410486, best loss: 1.898043 2025-01-16 01:35:42,466 - INFO - step 7904, loss: 2.360228, best loss: 1.898043 2025-01-16 01:35:42,616 - INFO - step 7905, loss: 2.069000, best loss: 1.898043 2025-01-16 01:35:42,766 - INFO - step 7906, loss: 2.129908, best loss: 1.898043 2025-01-16 01:35:42,916 - INFO - step 7907, loss: 1.990395, best loss: 1.898043 2025-01-16 01:35:43,067 - INFO - step 7908, loss: 2.525508, best loss: 1.898043 2025-01-16 01:35:43,217 - INFO - step 7909, loss: 2.648621, best 
loss: 1.898043 2025-01-16 01:35:43,367 - INFO - step 7910, loss: 2.712208, best loss: 1.898043 2025-01-16 01:35:43,517 - INFO - step 7911, loss: 2.864849, best loss: 1.898043 2025-01-16 01:35:43,668 - INFO - step 7912, loss: 2.672671, best loss: 1.898043 2025-01-16 01:35:43,818 - INFO - step 7913, loss: 2.397806, best loss: 1.898043 2025-01-16 01:35:43,968 - INFO - step 7914, loss: 2.545155, best loss: 1.898043 2025-01-16 01:35:44,118 - INFO - step 7915, loss: 2.755053, best loss: 1.898043 2025-01-16 01:35:44,269 - INFO - step 7916, loss: 2.529475, best loss: 1.898043 2025-01-16 01:35:44,419 - INFO - step 7917, loss: 2.127636, best loss: 1.898043 2025-01-16 01:35:44,569 - INFO - step 7918, loss: 2.394983, best loss: 1.898043 2025-01-16 01:35:44,719 - INFO - step 7919, loss: 2.280250, best loss: 1.898043 2025-01-16 01:35:44,869 - INFO - step 7920, loss: 2.683919, best loss: 1.898043 2025-01-16 01:35:45,020 - INFO - step 7921, loss: 2.621923, best loss: 1.898043 2025-01-16 01:35:45,170 - INFO - step 7922, loss: 2.606800, best loss: 1.898043 2025-01-16 01:35:45,320 - INFO - step 7923, loss: 2.650279, best loss: 1.898043 2025-01-16 01:35:45,470 - INFO - step 7924, loss: 2.533554, best loss: 1.898043 2025-01-16 01:35:45,621 - INFO - step 7925, loss: 2.282820, best loss: 1.898043 2025-01-16 01:35:45,771 - INFO - step 7926, loss: 2.739101, best loss: 1.898043 2025-01-16 01:35:45,921 - INFO - step 7927, loss: 2.503788, best loss: 1.898043 2025-01-16 01:35:46,071 - INFO - step 7928, loss: 2.740051, best loss: 1.898043 2025-01-16 01:35:46,221 - INFO - step 7929, loss: 2.715188, best loss: 1.898043 2025-01-16 01:35:46,371 - INFO - step 7930, loss: 2.433689, best loss: 1.898043 2025-01-16 01:35:46,522 - INFO - step 7931, loss: 2.637201, best loss: 1.898043 2025-01-16 01:35:46,672 - INFO - step 7932, loss: 2.423750, best loss: 1.898043 2025-01-16 01:35:46,822 - INFO - step 7933, loss: 2.637894, best loss: 1.898043 2025-01-16 01:35:46,972 - INFO - step 7934, loss: 2.728856, best 
loss: 1.898043 2025-01-16 01:35:47,122 - INFO - step 7935, loss: 2.758127, best loss: 1.898043 2025-01-16 01:35:47,273 - INFO - step 7936, loss: 2.674824, best loss: 1.898043 2025-01-16 01:35:47,423 - INFO - step 7937, loss: 2.478327, best loss: 1.898043 2025-01-16 01:35:47,573 - INFO - step 7938, loss: 2.613351, best loss: 1.898043 2025-01-16 01:35:47,723 - INFO - step 7939, loss: 2.605914, best loss: 1.898043 2025-01-16 01:35:47,874 - INFO - step 7940, loss: 2.308713, best loss: 1.898043 2025-01-16 01:35:48,024 - INFO - step 7941, loss: 2.899554, best loss: 1.898043 2025-01-16 01:35:48,174 - INFO - step 7942, loss: 2.115514, best loss: 1.898043 2025-01-16 01:35:48,325 - INFO - step 7943, loss: 2.254264, best loss: 1.898043 2025-01-16 01:35:48,475 - INFO - step 7944, loss: 2.565697, best loss: 1.898043 2025-01-16 01:35:48,625 - INFO - step 7945, loss: 2.661259, best loss: 1.898043 2025-01-16 01:35:48,776 - INFO - step 7946, loss: 2.503675, best loss: 1.898043 2025-01-16 01:35:48,926 - INFO - step 7947, loss: 2.319657, best loss: 1.898043 2025-01-16 01:35:49,076 - INFO - step 7948, loss: 2.541181, best loss: 1.898043 2025-01-16 01:35:49,226 - INFO - step 7949, loss: 2.644715, best loss: 1.898043 2025-01-16 01:35:49,376 - INFO - step 7950, loss: 2.329476, best loss: 1.898043 2025-01-16 01:35:49,527 - INFO - step 7951, loss: 2.274909, best loss: 1.898043 2025-01-16 01:35:49,677 - INFO - step 7952, loss: 2.543318, best loss: 1.898043 2025-01-16 01:35:49,827 - INFO - step 7953, loss: 2.549184, best loss: 1.898043 2025-01-16 01:35:49,977 - INFO - step 7954, loss: 2.403149, best loss: 1.898043 2025-01-16 01:35:50,128 - INFO - step 7955, loss: 2.278026, best loss: 1.898043 2025-01-16 01:35:50,278 - INFO - step 7956, loss: 2.587507, best loss: 1.898043 2025-01-16 01:35:50,428 - INFO - step 7957, loss: 2.756898, best loss: 1.898043 2025-01-16 01:35:50,578 - INFO - step 7958, loss: 2.481176, best loss: 1.898043 2025-01-16 01:35:50,729 - INFO - step 7959, loss: 2.523732, best 
loss: 1.898043 2025-01-16 01:35:50,879 - INFO - step 7960, loss: 2.819652, best loss: 1.898043 2025-01-16 01:35:51,030 - INFO - step 7961, loss: 2.659674, best loss: 1.898043 2025-01-16 01:35:51,180 - INFO - step 7962, loss: 2.670614, best loss: 1.898043 2025-01-16 01:35:51,330 - INFO - step 7963, loss: 2.484791, best loss: 1.898043 2025-01-16 01:35:51,480 - INFO - step 7964, loss: 2.456364, best loss: 1.898043 2025-01-16 01:35:51,630 - INFO - step 7965, loss: 2.337111, best loss: 1.898043 2025-01-16 01:35:51,780 - INFO - step 7966, loss: 2.840857, best loss: 1.898043 2025-01-16 01:35:51,931 - INFO - step 7967, loss: 2.574663, best loss: 1.898043 2025-01-16 01:35:52,081 - INFO - step 7968, loss: 2.701713, best loss: 1.898043 2025-01-16 01:35:52,231 - INFO - step 7969, loss: 2.186103, best loss: 1.898043 2025-01-16 01:35:52,381 - INFO - step 7970, loss: 2.326717, best loss: 1.898043 2025-01-16 01:35:52,532 - INFO - step 7971, loss: 2.585649, best loss: 1.898043 2025-01-16 01:35:52,682 - INFO - step 7972, loss: 2.520504, best loss: 1.898043 2025-01-16 01:35:52,832 - INFO - step 7973, loss: 2.566513, best loss: 1.898043 2025-01-16 01:35:52,982 - INFO - step 7974, loss: 2.537171, best loss: 1.898043 2025-01-16 01:35:53,133 - INFO - step 7975, loss: 2.407570, best loss: 1.898043 2025-01-16 01:35:53,284 - INFO - step 7976, loss: 2.464470, best loss: 1.898043 2025-01-16 01:35:53,434 - INFO - step 7977, loss: 2.528430, best loss: 1.898043 2025-01-16 01:35:53,584 - INFO - step 7978, loss: 2.170937, best loss: 1.898043 2025-01-16 01:35:53,735 - INFO - step 7979, loss: 2.487000, best loss: 1.898043 2025-01-16 01:35:53,885 - INFO - step 7980, loss: 2.559338, best loss: 1.898043 2025-01-16 01:35:54,035 - INFO - step 7981, loss: 2.745191, best loss: 1.898043 2025-01-16 01:35:54,186 - INFO - step 7982, loss: 2.548054, best loss: 1.898043 2025-01-16 01:35:54,336 - INFO - step 7983, loss: 2.634510, best loss: 1.898043 2025-01-16 01:35:54,486 - INFO - step 7984, loss: 2.637516, best 
loss: 1.898043 2025-01-16 01:35:54,636 - INFO - step 7985, loss: 2.338672, best loss: 1.898043 2025-01-16 01:35:54,786 - INFO - step 7986, loss: 2.654299, best loss: 1.898043 2025-01-16 01:35:54,936 - INFO - step 7987, loss: 2.214437, best loss: 1.898043 2025-01-16 01:35:55,087 - INFO - step 7988, loss: 2.424003, best loss: 1.898043 2025-01-16 01:35:55,237 - INFO - step 7989, loss: 2.501452, best loss: 1.898043 2025-01-16 01:35:55,387 - INFO - step 7990, loss: 2.409929, best loss: 1.898043 2025-01-16 01:35:55,537 - INFO - step 7991, loss: 2.437320, best loss: 1.898043 2025-01-16 01:35:55,688 - INFO - step 7992, loss: 2.575142, best loss: 1.898043 2025-01-16 01:35:55,838 - INFO - step 7993, loss: 2.677072, best loss: 1.898043 2025-01-16 01:35:55,988 - INFO - step 7994, loss: 2.615480, best loss: 1.898043 2025-01-16 01:35:56,138 - INFO - step 7995, loss: 2.699876, best loss: 1.898043 2025-01-16 01:35:56,288 - INFO - step 7996, loss: 2.501384, best loss: 1.898043 2025-01-16 01:35:56,439 - INFO - step 7997, loss: 2.610202, best loss: 1.898043 2025-01-16 01:35:56,589 - INFO - step 7998, loss: 2.505703, best loss: 1.898043 2025-01-16 01:35:56,739 - INFO - step 7999, loss: 2.185213, best loss: 1.898043 2025-01-16 01:35:56,889 - INFO - step 8000, loss: 2.720631, best loss: 1.898043 2025-01-16 01:35:57,039 - INFO - step 8001, loss: 2.567087, best loss: 1.898043 2025-01-16 01:35:57,190 - INFO - step 8002, loss: 2.413224, best loss: 1.898043 2025-01-16 01:35:57,340 - INFO - step 8003, loss: 2.440217, best loss: 1.898043 2025-01-16 01:35:57,490 - INFO - step 8004, loss: 2.250604, best loss: 1.898043 2025-01-16 01:35:57,640 - INFO - step 8005, loss: 2.486317, best loss: 1.898043 2025-01-16 01:35:57,790 - INFO - step 8006, loss: 2.274830, best loss: 1.898043 2025-01-16 01:35:57,940 - INFO - step 8007, loss: 2.274392, best loss: 1.898043 2025-01-16 01:35:58,090 - INFO - step 8008, loss: 2.664970, best loss: 1.898043 2025-01-16 01:35:58,240 - INFO - step 8009, loss: 2.640819, best 
loss: 1.898043 2025-01-16 01:35:58,391 - INFO - step 8010, loss: 2.364232, best loss: 1.898043 2025-01-16 01:35:58,541 - INFO - step 8011, loss: 2.622331, best loss: 1.898043 2025-01-16 01:35:58,691 - INFO - step 8012, loss: 2.414261, best loss: 1.898043 2025-01-16 01:35:58,842 - INFO - step 8013, loss: 2.711926, best loss: 1.898043 2025-01-16 01:35:58,992 - INFO - step 8014, loss: 2.784474, best loss: 1.898043 2025-01-16 01:35:59,142 - INFO - step 8015, loss: 2.809107, best loss: 1.898043 2025-01-16 01:35:59,292 - INFO - step 8016, loss: 2.815000, best loss: 1.898043 2025-01-16 01:35:59,442 - INFO - step 8017, loss: 2.639503, best loss: 1.898043 2025-01-16 01:35:59,593 - INFO - step 8018, loss: 2.736440, best loss: 1.898043 2025-01-16 01:35:59,743 - INFO - step 8019, loss: 2.695806, best loss: 1.898043 2025-01-16 01:35:59,893 - INFO - step 8020, loss: 2.550051, best loss: 1.898043 2025-01-16 01:36:00,043 - INFO - step 8021, loss: 2.821520, best loss: 1.898043 2025-01-16 01:36:00,194 - INFO - step 8022, loss: 2.727026, best loss: 1.898043 2025-01-16 01:36:00,344 - INFO - step 8023, loss: 2.947976, best loss: 1.898043 2025-01-16 01:36:00,494 - INFO - step 8024, loss: 2.563319, best loss: 1.898043 2025-01-16 01:36:00,645 - INFO - step 8025, loss: 2.666337, best loss: 1.898043 2025-01-16 01:36:00,795 - INFO - step 8026, loss: 2.695404, best loss: 1.898043 2025-01-16 01:36:00,945 - INFO - step 8027, loss: 2.502453, best loss: 1.898043 2025-01-16 01:36:01,095 - INFO - step 8028, loss: 2.448961, best loss: 1.898043 2025-01-16 01:36:01,246 - INFO - step 8029, loss: 2.785898, best loss: 1.898043 2025-01-16 01:36:01,396 - INFO - step 8030, loss: 2.694604, best loss: 1.898043 2025-01-16 01:36:01,546 - INFO - step 8031, loss: 2.855747, best loss: 1.898043 2025-01-16 01:36:01,696 - INFO - step 8032, loss: 2.772358, best loss: 1.898043 2025-01-16 01:36:01,846 - INFO - step 8033, loss: 2.595766, best loss: 1.898043 2025-01-16 01:36:01,996 - INFO - step 8034, loss: 2.870064, best 
loss: 1.898043 2025-01-16 01:36:02,146 - INFO - step 8035, loss: 2.624974, best loss: 1.898043 2025-01-16 01:36:02,296 - INFO - step 8036, loss: 2.731084, best loss: 1.898043 2025-01-16 01:36:02,447 - INFO - step 8037, loss: 2.514796, best loss: 1.898043 2025-01-16 01:36:02,597 - INFO - step 8038, loss: 2.544309, best loss: 1.898043 2025-01-16 01:36:02,747 - INFO - step 8039, loss: 2.633221, best loss: 1.898043 2025-01-16 01:36:02,897 - INFO - step 8040, loss: 2.705194, best loss: 1.898043 2025-01-16 01:36:03,047 - INFO - step 8041, loss: 2.667316, best loss: 1.898043 2025-01-16 01:36:03,197 - INFO - step 8042, loss: 2.642108, best loss: 1.898043 2025-01-16 01:36:03,347 - INFO - step 8043, loss: 2.377237, best loss: 1.898043 2025-01-16 01:36:03,497 - INFO - step 8044, loss: 2.208854, best loss: 1.898043 2025-01-16 01:36:03,647 - INFO - step 8045, loss: 2.339272, best loss: 1.898043 2025-01-16 01:36:03,798 - INFO - step 8046, loss: 2.507483, best loss: 1.898043 2025-01-16 01:36:03,948 - INFO - step 8047, loss: 2.872313, best loss: 1.898043 2025-01-16 01:36:04,098 - INFO - step 8048, loss: 2.563464, best loss: 1.898043 2025-01-16 01:36:04,248 - INFO - step 8049, loss: 2.245486, best loss: 1.898043 2025-01-16 01:36:04,398 - INFO - step 8050, loss: 2.658223, best loss: 1.898043 2025-01-16 01:36:04,548 - INFO - step 8051, loss: 2.513550, best loss: 1.898043 2025-01-16 01:36:04,698 - INFO - step 8052, loss: 2.958999, best loss: 1.898043 2025-01-16 01:36:04,848 - INFO - step 8053, loss: 2.305922, best loss: 1.898043 2025-01-16 01:36:04,998 - INFO - step 8054, loss: 2.634131, best loss: 1.898043 2025-01-16 01:36:05,149 - INFO - step 8055, loss: 2.546048, best loss: 1.898043 2025-01-16 01:36:05,299 - INFO - step 8056, loss: 2.647336, best loss: 1.898043 2025-01-16 01:36:05,449 - INFO - step 8057, loss: 2.593832, best loss: 1.898043 2025-01-16 01:36:05,599 - INFO - step 8058, loss: 2.499472, best loss: 1.898043 2025-01-16 01:36:05,749 - INFO - step 8059, loss: 2.753951, best 
loss: 1.898043 2025-01-16 01:36:05,900 - INFO - step 8060, loss: 2.629807, best loss: 1.898043 2025-01-16 01:36:06,050 - INFO - step 8061, loss: 2.475110, best loss: 1.898043 2025-01-16 01:36:06,200 - INFO - step 8062, loss: 2.862070, best loss: 1.898043 2025-01-16 01:36:06,350 - INFO - step 8063, loss: 2.509009, best loss: 1.898043 2025-01-16 01:36:06,500 - INFO - step 8064, loss: 2.096992, best loss: 1.898043 2025-01-16 01:36:06,650 - INFO - step 8065, loss: 2.573267, best loss: 1.898043 2025-01-16 01:36:06,801 - INFO - step 8066, loss: 2.765167, best loss: 1.898043 2025-01-16 01:36:06,951 - INFO - step 8067, loss: 2.679205, best loss: 1.898043 2025-01-16 01:36:07,101 - INFO - step 8068, loss: 2.395844, best loss: 1.898043 2025-01-16 01:36:07,251 - INFO - step 8069, loss: 2.494159, best loss: 1.898043 2025-01-16 01:36:07,401 - INFO - step 8070, loss: 2.778556, best loss: 1.898043 2025-01-16 01:36:07,551 - INFO - step 8071, loss: 2.840991, best loss: 1.898043 2025-01-16 01:36:07,701 - INFO - step 8072, loss: 2.729692, best loss: 1.898043 2025-01-16 01:36:07,852 - INFO - step 8073, loss: 2.520222, best loss: 1.898043 2025-01-16 01:36:08,002 - INFO - step 8074, loss: 2.737750, best loss: 1.898043 2025-01-16 01:36:08,152 - INFO - step 8075, loss: 2.662379, best loss: 1.898043 2025-01-16 01:36:08,302 - INFO - step 8076, loss: 2.359884, best loss: 1.898043 2025-01-16 01:36:08,452 - INFO - step 8077, loss: 2.649696, best loss: 1.898043 2025-01-16 01:36:08,603 - INFO - step 8078, loss: 2.452755, best loss: 1.898043 2025-01-16 01:36:08,753 - INFO - step 8079, loss: 2.627465, best loss: 1.898043 2025-01-16 01:36:08,903 - INFO - step 8080, loss: 2.410369, best loss: 1.898043 2025-01-16 01:36:09,053 - INFO - step 8081, loss: 2.468086, best loss: 1.898043 2025-01-16 01:36:09,203 - INFO - step 8082, loss: 2.326791, best loss: 1.898043 2025-01-16 01:36:09,353 - INFO - step 8083, loss: 2.288296, best loss: 1.898043 2025-01-16 01:36:09,504 - INFO - step 8084, loss: 2.725368, best 
loss: 1.898043 2025-01-16 01:36:09,654 - INFO - step 8085, loss: 2.587527, best loss: 1.898043 2025-01-16 01:36:09,805 - INFO - step 8086, loss: 2.760856, best loss: 1.898043 2025-01-16 01:36:09,955 - INFO - step 8087, loss: 2.487293, best loss: 1.898043 2025-01-16 01:36:10,105 - INFO - step 8088, loss: 2.586617, best loss: 1.898043 2025-01-16 01:36:10,256 - INFO - step 8089, loss: 2.651548, best loss: 1.898043 2025-01-16 01:36:10,406 - INFO - step 8090, loss: 2.350887, best loss: 1.898043 2025-01-16 01:36:10,556 - INFO - step 8091, loss: 2.338636, best loss: 1.898043 2025-01-16 01:36:10,706 - INFO - step 8092, loss: 2.357795, best loss: 1.898043 2025-01-16 01:36:10,856 - INFO - step 8093, loss: 2.590600, best loss: 1.898043 2025-01-16 01:36:11,006 - INFO - step 8094, loss: 2.645004, best loss: 1.898043 2025-01-16 01:36:11,156 - INFO - step 8095, loss: 2.656840, best loss: 1.898043 2025-01-16 01:36:11,307 - INFO - step 8096, loss: 2.852108, best loss: 1.898043 2025-01-16 01:36:11,457 - INFO - step 8097, loss: 2.813166, best loss: 1.898043 2025-01-16 01:36:11,607 - INFO - step 8098, loss: 2.782643, best loss: 1.898043 2025-01-16 01:36:11,757 - INFO - step 8099, loss: 2.767245, best loss: 1.898043 2025-01-16 01:36:11,908 - INFO - step 8100, loss: 2.809611, best loss: 1.898043 2025-01-16 01:36:12,058 - INFO - step 8101, loss: 2.398958, best loss: 1.898043 2025-01-16 01:36:12,208 - INFO - step 8102, loss: 2.762085, best loss: 1.898043 2025-01-16 01:36:12,358 - INFO - step 8103, loss: 2.832579, best loss: 1.898043 2025-01-16 01:36:12,509 - INFO - step 8104, loss: 2.606914, best loss: 1.898043 2025-01-16 01:36:12,659 - INFO - step 8105, loss: 2.644098, best loss: 1.898043 2025-01-16 01:36:12,809 - INFO - step 8106, loss: 2.844574, best loss: 1.898043 2025-01-16 01:36:12,959 - INFO - step 8107, loss: 2.568681, best loss: 1.898043 2025-01-16 01:36:13,109 - INFO - step 8108, loss: 1.903812, best loss: 1.898043 2025-01-16 01:36:13,259 - INFO - step 8109, loss: 2.659982, best 
loss: 1.898043 2025-01-16 01:36:13,409 - INFO - step 8110, loss: 2.533371, best loss: 1.898043 2025-01-16 01:36:13,559 - INFO - step 8111, loss: 2.656020, best loss: 1.898043 2025-01-16 01:36:13,709 - INFO - step 8112, loss: 2.721482, best loss: 1.898043 2025-01-16 01:36:13,859 - INFO - step 8113, loss: 2.496181, best loss: 1.898043 2025-01-16 01:36:14,010 - INFO - step 8114, loss: 2.678694, best loss: 1.898043 2025-01-16 01:36:14,160 - INFO - step 8115, loss: 2.640815, best loss: 1.898043 2025-01-16 01:36:14,310 - INFO - step 8116, loss: 2.545947, best loss: 1.898043 2025-01-16 01:36:14,460 - INFO - step 8117, loss: 2.555449, best loss: 1.898043 2025-01-16 01:36:14,610 - INFO - step 8118, loss: 2.512258, best loss: 1.898043 2025-01-16 01:36:14,761 - INFO - step 8119, loss: 2.316436, best loss: 1.898043 2025-01-16 01:36:14,911 - INFO - step 8120, loss: 2.584085, best loss: 1.898043 2025-01-16 01:36:15,061 - INFO - step 8121, loss: 2.443052, best loss: 1.898043 2025-01-16 01:36:15,212 - INFO - step 8122, loss: 2.627463, best loss: 1.898043 2025-01-16 01:36:15,362 - INFO - step 8123, loss: 2.800881, best loss: 1.898043 2025-01-16 01:36:15,512 - INFO - step 8124, loss: 2.464902, best loss: 1.898043 2025-01-16 01:36:15,662 - INFO - step 8125, loss: 2.257333, best loss: 1.898043 2025-01-16 01:36:15,812 - INFO - step 8126, loss: 2.542737, best loss: 1.898043 2025-01-16 01:36:15,962 - INFO - step 8127, loss: 2.532801, best loss: 1.898043 2025-01-16 01:36:16,112 - INFO - step 8128, loss: 2.629513, best loss: 1.898043 2025-01-16 01:36:16,262 - INFO - step 8129, loss: 2.597299, best loss: 1.898043 2025-01-16 01:36:16,412 - INFO - step 8130, loss: 2.570567, best loss: 1.898043 2025-01-16 01:36:16,563 - INFO - step 8131, loss: 2.521702, best loss: 1.898043 2025-01-16 01:36:16,713 - INFO - step 8132, loss: 2.747923, best loss: 1.898043 2025-01-16 01:36:16,863 - INFO - step 8133, loss: 2.521325, best loss: 1.898043 2025-01-16 01:36:17,013 - INFO - step 8134, loss: 2.606345, best 
loss: 1.898043 2025-01-16 01:36:17,163 - INFO - step 8135, loss: 2.674417, best loss: 1.898043 2025-01-16 01:36:17,313 - INFO - step 8136, loss: 2.714470, best loss: 1.898043 2025-01-16 01:36:17,463 - INFO - step 8137, loss: 2.679704, best loss: 1.898043 2025-01-16 01:36:17,613 - INFO - step 8138, loss: 2.498285, best loss: 1.898043 2025-01-16 01:36:17,763 - INFO - step 8139, loss: 2.597051, best loss: 1.898043 2025-01-16 01:36:17,913 - INFO - step 8140, loss: 2.762491, best loss: 1.898043 2025-01-16 01:36:18,064 - INFO - step 8141, loss: 2.821772, best loss: 1.898043 2025-01-16 01:36:18,214 - INFO - step 8142, loss: 2.736974, best loss: 1.898043 2025-01-16 01:36:18,364 - INFO - step 8143, loss: 2.936333, best loss: 1.898043 2025-01-16 01:36:18,514 - INFO - step 8144, loss: 2.967635, best loss: 1.898043 2025-01-16 01:36:18,664 - INFO - step 8145, loss: 2.707593, best loss: 1.898043 2025-01-16 01:36:18,814 - INFO - step 8146, loss: 2.955184, best loss: 1.898043 2025-01-16 01:36:18,965 - INFO - step 8147, loss: 2.670102, best loss: 1.898043 2025-01-16 01:36:19,115 - INFO - step 8148, loss: 2.344978, best loss: 1.898043 2025-01-16 01:36:19,265 - INFO - step 8149, loss: 2.727271, best loss: 1.898043 2025-01-16 01:36:19,415 - INFO - step 8150, loss: 2.766158, best loss: 1.898043 2025-01-16 01:36:19,566 - INFO - step 8151, loss: 2.665544, best loss: 1.898043 2025-01-16 01:36:19,716 - INFO - step 8152, loss: 2.488849, best loss: 1.898043 2025-01-16 01:36:19,866 - INFO - step 8153, loss: 2.746481, best loss: 1.898043 2025-01-16 01:36:20,016 - INFO - step 8154, loss: 2.708085, best loss: 1.898043 2025-01-16 01:36:20,166 - INFO - step 8155, loss: 2.589863, best loss: 1.898043 2025-01-16 01:36:20,316 - INFO - step 8156, loss: 2.747283, best loss: 1.898043 2025-01-16 01:36:20,467 - INFO - step 8157, loss: 2.415295, best loss: 1.898043 2025-01-16 01:36:20,617 - INFO - step 8158, loss: 2.370669, best loss: 1.898043 2025-01-16 01:36:20,767 - INFO - step 8159, loss: 2.818588, best 
loss: 1.898043 2025-01-16 01:36:20,917 - INFO - step 8160, loss: 2.780761, best loss: 1.898043 2025-01-16 01:36:21,067 - INFO - step 8161, loss: 2.902385, best loss: 1.898043 2025-01-16 01:36:21,217 - INFO - step 8162, loss: 2.681736, best loss: 1.898043 2025-01-16 01:36:21,368 - INFO - step 8163, loss: 2.703524, best loss: 1.898043 2025-01-16 01:36:21,518 - INFO - step 8164, loss: 2.736520, best loss: 1.898043 2025-01-16 01:36:21,668 - INFO - step 8165, loss: 2.513100, best loss: 1.898043 2025-01-16 01:36:21,818 - INFO - step 8166, loss: 2.506173, best loss: 1.898043 2025-01-16 01:36:21,968 - INFO - step 8167, loss: 2.828770, best loss: 1.898043 2025-01-16 01:36:22,118 - INFO - step 8168, loss: 2.408764, best loss: 1.898043 2025-01-16 01:36:22,268 - INFO - step 8169, loss: 2.180399, best loss: 1.898043 2025-01-16 01:36:22,419 - INFO - step 8170, loss: 2.598239, best loss: 1.898043 2025-01-16 01:36:22,569 - INFO - step 8171, loss: 2.567260, best loss: 1.898043 2025-01-16 01:36:22,719 - INFO - step 8172, loss: 2.607762, best loss: 1.898043 2025-01-16 01:36:22,869 - INFO - step 8173, loss: 2.188088, best loss: 1.898043 2025-01-16 01:36:23,019 - INFO - step 8174, loss: 2.138104, best loss: 1.898043 2025-01-16 01:36:23,170 - INFO - step 8175, loss: 2.050101, best loss: 1.898043 2025-01-16 01:36:23,320 - INFO - step 8176, loss: 2.298098, best loss: 1.898043 2025-01-16 01:36:23,470 - INFO - step 8177, loss: 2.494515, best loss: 1.898043 2025-01-16 01:36:23,620 - INFO - step 8178, loss: 2.657722, best loss: 1.898043 2025-01-16 01:36:23,770 - INFO - step 8179, loss: 2.549585, best loss: 1.898043 2025-01-16 01:36:23,920 - INFO - step 8180, loss: 2.476448, best loss: 1.898043 2025-01-16 01:36:24,070 - INFO - step 8181, loss: 2.544106, best loss: 1.898043 2025-01-16 01:36:24,220 - INFO - step 8182, loss: 2.558549, best loss: 1.898043 2025-01-16 01:36:24,370 - INFO - step 8183, loss: 2.390689, best loss: 1.898043 2025-01-16 01:36:24,520 - INFO - step 8184, loss: 2.594467, best 
loss: 1.898043 2025-01-16 01:36:24,670 - INFO - step 8185, loss: 2.577471, best loss: 1.898043 2025-01-16 01:36:24,820 - INFO - step 8186, loss: 2.266294, best loss: 1.898043 2025-01-16 01:36:24,971 - INFO - step 8187, loss: 2.245736, best loss: 1.898043 2025-01-16 01:36:25,121 - INFO - step 8188, loss: 2.451193, best loss: 1.898043 2025-01-16 01:36:25,271 - INFO - step 8189, loss: 2.546882, best loss: 1.898043 2025-01-16 01:36:25,421 - INFO - step 8190, loss: 2.293366, best loss: 1.898043 2025-01-16 01:36:25,572 - INFO - step 8191, loss: 2.280660, best loss: 1.898043 2025-01-16 01:36:25,722 - INFO - step 8192, loss: 2.535041, best loss: 1.898043 2025-01-16 01:36:25,872 - INFO - step 8193, loss: 2.335531, best loss: 1.898043 2025-01-16 01:36:26,022 - INFO - step 8194, loss: 2.346785, best loss: 1.898043 2025-01-16 01:36:26,172 - INFO - step 8195, loss: 2.627633, best loss: 1.898043 2025-01-16 01:36:26,323 - INFO - step 8196, loss: 2.381591, best loss: 1.898043 2025-01-16 01:36:26,473 - INFO - step 8197, loss: 2.433379, best loss: 1.898043 2025-01-16 01:36:26,623 - INFO - step 8198, loss: 2.345399, best loss: 1.898043 2025-01-16 01:36:26,773 - INFO - step 8199, loss: 2.592472, best loss: 1.898043 2025-01-16 01:36:26,923 - INFO - step 8200, loss: 2.305514, best loss: 1.898043 2025-01-16 01:36:27,073 - INFO - step 8201, loss: 2.323610, best loss: 1.898043 2025-01-16 01:36:27,223 - INFO - step 8202, loss: 2.277042, best loss: 1.898043 2025-01-16 01:36:27,374 - INFO - step 8203, loss: 2.514874, best loss: 1.898043 2025-01-16 01:36:27,524 - INFO - step 8204, loss: 2.786075, best loss: 1.898043 2025-01-16 01:36:27,674 - INFO - step 8205, loss: 2.779551, best loss: 1.898043 2025-01-16 01:36:27,824 - INFO - step 8206, loss: 2.606184, best loss: 1.898043 2025-01-16 01:36:27,974 - INFO - step 8207, loss: 2.613853, best loss: 1.898043 2025-01-16 01:36:28,124 - INFO - step 8208, loss: 2.558317, best loss: 1.898043 2025-01-16 01:36:28,275 - INFO - step 8209, loss: 2.492501, best 
loss: 1.898043 2025-01-16 01:36:28,425 - INFO - step 8210, loss: 2.409612, best loss: 1.898043 2025-01-16 01:36:28,575 - INFO - step 8211, loss: 2.591893, best loss: 1.898043 2025-01-16 01:36:28,725 - INFO - step 8212, loss: 2.427339, best loss: 1.898043 2025-01-16 01:36:28,876 - INFO - step 8213, loss: 2.328565, best loss: 1.898043 2025-01-16 01:36:29,026 - INFO - step 8214, loss: 2.417857, best loss: 1.898043 2025-01-16 01:36:29,176 - INFO - step 8215, loss: 2.445720, best loss: 1.898043 2025-01-16 01:36:29,326 - INFO - step 8216, loss: 2.589767, best loss: 1.898043 2025-01-16 01:36:29,476 - INFO - step 8217, loss: 1.986208, best loss: 1.898043 2025-01-16 01:36:29,627 - INFO - step 8218, loss: 2.529967, best loss: 1.898043 2025-01-16 01:36:29,777 - INFO - step 8219, loss: 2.601447, best loss: 1.898043 2025-01-16 01:36:29,927 - INFO - step 8220, loss: 2.534984, best loss: 1.898043 2025-01-16 01:36:30,077 - INFO - step 8221, loss: 2.501776, best loss: 1.898043 2025-01-16 01:36:30,227 - INFO - step 8222, loss: 2.423295, best loss: 1.898043 2025-01-16 01:36:30,378 - INFO - step 8223, loss: 2.486237, best loss: 1.898043 2025-01-16 01:36:30,528 - INFO - step 8224, loss: 2.342958, best loss: 1.898043 2025-01-16 01:36:30,678 - INFO - step 8225, loss: 2.457875, best loss: 1.898043 2025-01-16 01:36:30,828 - INFO - step 8226, loss: 2.403409, best loss: 1.898043 2025-01-16 01:36:30,979 - INFO - step 8227, loss: 2.509220, best loss: 1.898043 2025-01-16 01:36:31,129 - INFO - step 8228, loss: 2.280286, best loss: 1.898043 2025-01-16 01:36:31,279 - INFO - step 8229, loss: 2.321085, best loss: 1.898043 2025-01-16 01:36:31,429 - INFO - step 8230, loss: 2.168597, best loss: 1.898043 2025-01-16 01:36:31,580 - INFO - step 8231, loss: 2.221369, best loss: 1.898043 2025-01-16 01:36:31,730 - INFO - step 8232, loss: 2.395471, best loss: 1.898043 2025-01-16 01:36:31,880 - INFO - step 8233, loss: 2.332569, best loss: 1.898043 2025-01-16 01:36:32,031 - INFO - step 8234, loss: 2.260379, best 
loss: 1.898043 2025-01-16 01:36:32,181 - INFO - step 8235, loss: 2.030152, best loss: 1.898043 2025-01-16 01:36:32,331 - INFO - step 8236, loss: 2.053298, best loss: 1.898043 2025-01-16 01:36:32,481 - INFO - step 8237, loss: 1.955431, best loss: 1.898043 2025-01-16 01:36:32,632 - INFO - step 8238, loss: 2.432402, best loss: 1.898043 2025-01-16 01:36:32,782 - INFO - step 8239, loss: 2.557771, best loss: 1.898043 2025-01-16 01:36:32,932 - INFO - step 8240, loss: 2.633959, best loss: 1.898043 2025-01-16 01:36:33,082 - INFO - step 8241, loss: 2.722066, best loss: 1.898043 2025-01-16 01:36:33,232 - INFO - step 8242, loss: 2.617816, best loss: 1.898043 2025-01-16 01:36:33,383 - INFO - step 8243, loss: 2.322066, best loss: 1.898043 2025-01-16 01:36:33,533 - INFO - step 8244, loss: 2.368082, best loss: 1.898043 2025-01-16 01:36:33,683 - INFO - step 8245, loss: 2.666558, best loss: 1.898043 2025-01-16 01:36:33,833 - INFO - step 8246, loss: 2.539960, best loss: 1.898043 2025-01-16 01:36:33,983 - INFO - step 8247, loss: 2.033228, best loss: 1.898043 2025-01-16 01:36:34,134 - INFO - step 8248, loss: 2.327111, best loss: 1.898043 2025-01-16 01:36:34,284 - INFO - step 8249, loss: 2.208218, best loss: 1.898043 2025-01-16 01:36:34,434 - INFO - step 8250, loss: 2.575136, best loss: 1.898043 2025-01-16 01:36:34,584 - INFO - step 8251, loss: 2.572561, best loss: 1.898043 2025-01-16 01:36:34,734 - INFO - step 8252, loss: 2.493976, best loss: 1.898043 2025-01-16 01:36:34,884 - INFO - step 8253, loss: 2.597985, best loss: 1.898043 2025-01-16 01:36:35,034 - INFO - step 8254, loss: 2.542584, best loss: 1.898043 2025-01-16 01:36:35,184 - INFO - step 8255, loss: 2.164364, best loss: 1.898043 2025-01-16 01:36:35,335 - INFO - step 8256, loss: 2.628898, best loss: 1.898043 2025-01-16 01:36:35,485 - INFO - step 8257, loss: 2.468230, best loss: 1.898043 2025-01-16 01:36:35,635 - INFO - step 8258, loss: 2.669162, best loss: 1.898043 2025-01-16 01:36:35,785 - INFO - step 8259, loss: 2.569144, best 
loss: 1.898043 2025-01-16 01:36:35,936 - INFO - step 8260, loss: 2.390092, best loss: 1.898043 2025-01-16 01:36:36,086 - INFO - step 8261, loss: 2.580868, best loss: 1.898043 2025-01-16 01:36:36,236 - INFO - step 8262, loss: 2.251666, best loss: 1.898043 2025-01-16 01:36:36,386 - INFO - step 8263, loss: 2.562613, best loss: 1.898043 2025-01-16 01:36:36,536 - INFO - step 8264, loss: 2.664322, best loss: 1.898043 2025-01-16 01:36:36,687 - INFO - step 8265, loss: 2.608407, best loss: 1.898043 2025-01-16 01:36:36,837 - INFO - step 8266, loss: 2.548018, best loss: 1.898043 2025-01-16 01:36:36,987 - INFO - step 8267, loss: 2.424250, best loss: 1.898043 2025-01-16 01:36:37,137 - INFO - step 8268, loss: 2.470496, best loss: 1.898043 2025-01-16 01:36:37,287 - INFO - step 8269, loss: 2.440904, best loss: 1.898043 2025-01-16 01:36:37,438 - INFO - step 8270, loss: 2.202738, best loss: 1.898043 2025-01-16 01:36:37,588 - INFO - step 8271, loss: 2.755992, best loss: 1.898043 2025-01-16 01:36:37,738 - INFO - step 8272, loss: 2.089011, best loss: 1.898043 2025-01-16 01:36:37,888 - INFO - step 8273, loss: 2.198107, best loss: 1.898043 2025-01-16 01:36:38,038 - INFO - step 8274, loss: 2.507444, best loss: 1.898043 2025-01-16 01:36:38,189 - INFO - step 8275, loss: 2.667771, best loss: 1.898043 2025-01-16 01:36:38,339 - INFO - step 8276, loss: 2.451931, best loss: 1.898043 2025-01-16 01:36:38,489 - INFO - step 8277, loss: 2.297954, best loss: 1.898043 2025-01-16 01:36:38,640 - INFO - step 8278, loss: 2.459562, best loss: 1.898043 2025-01-16 01:36:38,790 - INFO - step 8279, loss: 2.625027, best loss: 1.898043 2025-01-16 01:36:38,940 - INFO - step 8280, loss: 2.253217, best loss: 1.898043 2025-01-16 01:36:39,090 - INFO - step 8281, loss: 2.195686, best loss: 1.898043 2025-01-16 01:36:39,240 - INFO - step 8282, loss: 2.484578, best loss: 1.898043 2025-01-16 01:36:39,390 - INFO - step 8283, loss: 2.484969, best loss: 1.898043 2025-01-16 01:36:39,541 - INFO - step 8284, loss: 2.353913, best 
loss: 1.898043 2025-01-16 01:36:39,691 - INFO - step 8285, loss: 2.160861, best loss: 1.898043 2025-01-16 01:36:39,841 - INFO - step 8286, loss: 2.506897, best loss: 1.898043 2025-01-16 01:36:39,991 - INFO - step 8287, loss: 2.667391, best loss: 1.898043 2025-01-16 01:36:40,142 - INFO - step 8288, loss: 2.436094, best loss: 1.898043 2025-01-16 01:36:40,292 - INFO - step 8289, loss: 2.474604, best loss: 1.898043 2025-01-16 01:36:40,442 - INFO - step 8290, loss: 2.715514, best loss: 1.898043 2025-01-16 01:36:40,593 - INFO - step 8291, loss: 2.611729, best loss: 1.898043 2025-01-16 01:36:40,743 - INFO - step 8292, loss: 2.645482, best loss: 1.898043 2025-01-16 01:36:40,893 - INFO - step 8293, loss: 2.373666, best loss: 1.898043 2025-01-16 01:36:41,043 - INFO - step 8294, loss: 2.395503, best loss: 1.898043 2025-01-16 01:36:41,194 - INFO - step 8295, loss: 2.301897, best loss: 1.898043 2025-01-16 01:36:41,344 - INFO - step 8296, loss: 2.771864, best loss: 1.898043 2025-01-16 01:36:41,494 - INFO - step 8297, loss: 2.465297, best loss: 1.898043 2025-01-16 01:36:41,645 - INFO - step 8298, loss: 2.629446, best loss: 1.898043 2025-01-16 01:36:41,795 - INFO - step 8299, loss: 2.099644, best loss: 1.898043 2025-01-16 01:36:41,945 - INFO - step 8300, loss: 2.267405, best loss: 1.898043 2025-01-16 01:36:42,095 - INFO - step 8301, loss: 2.500116, best loss: 1.898043 2025-01-16 01:36:42,246 - INFO - step 8302, loss: 2.497458, best loss: 1.898043 2025-01-16 01:36:42,396 - INFO - step 8303, loss: 2.427458, best loss: 1.898043 2025-01-16 01:36:42,546 - INFO - step 8304, loss: 2.517250, best loss: 1.898043 2025-01-16 01:36:42,696 - INFO - step 8305, loss: 2.367904, best loss: 1.898043 2025-01-16 01:36:42,847 - INFO - step 8306, loss: 2.438876, best loss: 1.898043 2025-01-16 01:36:42,997 - INFO - step 8307, loss: 2.514272, best loss: 1.898043 2025-01-16 01:36:43,147 - INFO - step 8308, loss: 2.135535, best loss: 1.898043 2025-01-16 01:36:43,297 - INFO - step 8309, loss: 2.419138, best 
loss: 1.898043 2025-01-16 01:36:43,447 - INFO - step 8310, loss: 2.457300, best loss: 1.898043 2025-01-16 01:36:43,597 - INFO - step 8311, loss: 2.669638, best loss: 1.898043 2025-01-16 01:36:43,748 - INFO - step 8312, loss: 2.518595, best loss: 1.898043 2025-01-16 01:36:43,898 - INFO - step 8313, loss: 2.588394, best loss: 1.898043 2025-01-16 01:36:44,048 - INFO - step 8314, loss: 2.558360, best loss: 1.898043 2025-01-16 01:36:44,198 - INFO - step 8315, loss: 2.256395, best loss: 1.898043 2025-01-16 01:36:44,349 - INFO - step 8316, loss: 2.594615, best loss: 1.898043 2025-01-16 01:36:44,499 - INFO - step 8317, loss: 2.147083, best loss: 1.898043 2025-01-16 01:36:44,650 - INFO - step 8318, loss: 2.362455, best loss: 1.898043 2025-01-16 01:36:44,800 - INFO - step 8319, loss: 2.459081, best loss: 1.898043 2025-01-16 01:36:44,950 - INFO - step 8320, loss: 2.481699, best loss: 1.898043 2025-01-16 01:36:45,100 - INFO - step 8321, loss: 2.404881, best loss: 1.898043 2025-01-16 01:36:45,250 - INFO - step 8322, loss: 2.564415, best loss: 1.898043 2025-01-16 01:36:45,400 - INFO - step 8323, loss: 2.599449, best loss: 1.898043 2025-01-16 01:36:45,550 - INFO - step 8324, loss: 2.524918, best loss: 1.898043 2025-01-16 01:36:45,701 - INFO - step 8325, loss: 2.594422, best loss: 1.898043 2025-01-16 01:36:45,851 - INFO - step 8326, loss: 2.443661, best loss: 1.898043 2025-01-16 01:36:46,001 - INFO - step 8327, loss: 2.545137, best loss: 1.898043 2025-01-16 01:36:46,151 - INFO - step 8328, loss: 2.394750, best loss: 1.898043 2025-01-16 01:36:46,301 - INFO - step 8329, loss: 2.175285, best loss: 1.898043 2025-01-16 01:36:46,451 - INFO - step 8330, loss: 2.644722, best loss: 1.898043 2025-01-16 01:36:46,602 - INFO - step 8331, loss: 2.442577, best loss: 1.898043 2025-01-16 01:36:46,752 - INFO - step 8332, loss: 2.346917, best loss: 1.898043 2025-01-16 01:36:46,902 - INFO - step 8333, loss: 2.366285, best loss: 1.898043 2025-01-16 01:36:47,053 - INFO - step 8334, loss: 2.212603, best 
loss: 1.898043 2025-01-16 01:36:47,203 - INFO - step 8335, loss: 2.399898, best loss: 1.898043 2025-01-16 01:36:47,353 - INFO - step 8336, loss: 2.234700, best loss: 1.898043 2025-01-16 01:36:47,503 - INFO - step 8337, loss: 2.225231, best loss: 1.898043 2025-01-16 01:36:47,653 - INFO - step 8338, loss: 2.553954, best loss: 1.898043 2025-01-16 01:36:47,804 - INFO - step 8339, loss: 2.538187, best loss: 1.898043 2025-01-16 01:36:47,954 - INFO - step 8340, loss: 2.326964, best loss: 1.898043 2025-01-16 01:36:48,104 - INFO - step 8341, loss: 2.538440, best loss: 1.898043 2025-01-16 01:36:48,254 - INFO - step 8342, loss: 2.390502, best loss: 1.898043 2025-01-16 01:36:48,404 - INFO - step 8343, loss: 2.619100, best loss: 1.898043 2025-01-16 01:36:48,554 - INFO - step 8344, loss: 2.688278, best loss: 1.898043 2025-01-16 01:36:48,705 - INFO - step 8345, loss: 2.732699, best loss: 1.898043 2025-01-16 01:36:48,855 - INFO - step 8346, loss: 2.773382, best loss: 1.898043 2025-01-16 01:36:49,005 - INFO - step 8347, loss: 2.523839, best loss: 1.898043 2025-01-16 01:36:49,156 - INFO - step 8348, loss: 2.688822, best loss: 1.898043 2025-01-16 01:36:49,306 - INFO - step 8349, loss: 2.599660, best loss: 1.898043 2025-01-16 01:36:49,456 - INFO - step 8350, loss: 2.495570, best loss: 1.898043 2025-01-16 01:36:49,607 - INFO - step 8351, loss: 2.744824, best loss: 1.898043 2025-01-16 01:36:49,757 - INFO - step 8352, loss: 2.694374, best loss: 1.898043 2025-01-16 01:36:49,907 - INFO - step 8353, loss: 2.859247, best loss: 1.898043 2025-01-16 01:36:50,057 - INFO - step 8354, loss: 2.420712, best loss: 1.898043 2025-01-16 01:36:50,208 - INFO - step 8355, loss: 2.533748, best loss: 1.898043 2025-01-16 01:36:50,358 - INFO - step 8356, loss: 2.557395, best loss: 1.898043 2025-01-16 01:36:50,508 - INFO - step 8357, loss: 2.414107, best loss: 1.898043 2025-01-16 01:36:50,658 - INFO - step 8358, loss: 2.390566, best loss: 1.898043 2025-01-16 01:36:50,809 - INFO - step 8359, loss: 2.741094, best 
loss: 1.898043 2025-01-16 01:36:50,958 - INFO - step 8360, loss: 2.601908, best loss: 1.898043 2025-01-16 01:36:51,109 - INFO - step 8361, loss: 2.763715, best loss: 1.898043 2025-01-16 01:36:51,259 - INFO - step 8362, loss: 2.585800, best loss: 1.898043 2025-01-16 01:36:51,409 - INFO - step 8363, loss: 2.457976, best loss: 1.898043 2025-01-16 01:36:51,559 - INFO - step 8364, loss: 2.787955, best loss: 1.898043 2025-01-16 01:36:51,710 - INFO - step 8365, loss: 2.534845, best loss: 1.898043 2025-01-16 01:36:51,860 - INFO - step 8366, loss: 2.643580, best loss: 1.898043 2025-01-16 01:36:52,010 - INFO - step 8367, loss: 2.430594, best loss: 1.898043 2025-01-16 01:36:52,161 - INFO - step 8368, loss: 2.459114, best loss: 1.898043 2025-01-16 01:36:52,311 - INFO - step 8369, loss: 2.552176, best loss: 1.898043 2025-01-16 01:36:52,461 - INFO - step 8370, loss: 2.609858, best loss: 1.898043 2025-01-16 01:36:52,611 - INFO - step 8371, loss: 2.566607, best loss: 1.898043 2025-01-16 01:36:52,761 - INFO - step 8372, loss: 2.570037, best loss: 1.898043 2025-01-16 01:36:52,911 - INFO - step 8373, loss: 2.304307, best loss: 1.898043 2025-01-16 01:36:53,062 - INFO - step 8374, loss: 2.132561, best loss: 1.898043 2025-01-16 01:36:53,212 - INFO - step 8375, loss: 2.238724, best loss: 1.898043 2025-01-16 01:36:53,362 - INFO - step 8376, loss: 2.404216, best loss: 1.898043 2025-01-16 01:36:53,512 - INFO - step 8377, loss: 2.763906, best loss: 1.898043 2025-01-16 01:36:53,663 - INFO - step 8378, loss: 2.451471, best loss: 1.898043 2025-01-16 01:36:53,813 - INFO - step 8379, loss: 2.056082, best loss: 1.898043 2025-01-16 01:36:53,963 - INFO - step 8380, loss: 2.530839, best loss: 1.898043 2025-01-16 01:36:54,113 - INFO - step 8381, loss: 2.456320, best loss: 1.898043 2025-01-16 01:36:54,263 - INFO - step 8382, loss: 2.860151, best loss: 1.898043 2025-01-16 01:36:54,414 - INFO - step 8383, loss: 2.248626, best loss: 1.898043 2025-01-16 01:36:54,564 - INFO - step 8384, loss: 2.602427, best 
loss: 1.898043 2025-01-16 01:36:54,714 - INFO - step 8385, loss: 2.448922, best loss: 1.898043 2025-01-16 01:36:54,865 - INFO - step 8386, loss: 2.481920, best loss: 1.898043 2025-01-16 01:36:55,015 - INFO - step 8387, loss: 2.499515, best loss: 1.898043 2025-01-16 01:36:55,165 - INFO - step 8388, loss: 2.381240, best loss: 1.898043 2025-01-16 01:36:55,315 - INFO - step 8389, loss: 2.561337, best loss: 1.898043 2025-01-16 01:36:55,465 - INFO - step 8390, loss: 2.518466, best loss: 1.898043 2025-01-16 01:36:55,615 - INFO - step 8391, loss: 2.409244, best loss: 1.898043 2025-01-16 01:36:55,765 - INFO - step 8392, loss: 2.819228, best loss: 1.898043 2025-01-16 01:36:55,916 - INFO - step 8393, loss: 2.370434, best loss: 1.898043 2025-01-16 01:36:56,066 - INFO - step 8394, loss: 2.054378, best loss: 1.898043 2025-01-16 01:36:56,216 - INFO - step 8395, loss: 2.528941, best loss: 1.898043 2025-01-16 01:36:56,367 - INFO - step 8396, loss: 2.658605, best loss: 1.898043 2025-01-16 01:36:56,517 - INFO - step 8397, loss: 2.518473, best loss: 1.898043 2025-01-16 01:36:56,667 - INFO - step 8398, loss: 2.360109, best loss: 1.898043 2025-01-16 01:36:56,818 - INFO - step 8399, loss: 2.380724, best loss: 1.898043 2025-01-16 01:36:56,968 - INFO - step 8400, loss: 2.708809, best loss: 1.898043 2025-01-16 01:36:57,118 - INFO - step 8401, loss: 2.737460, best loss: 1.898043 2025-01-16 01:36:57,268 - INFO - step 8402, loss: 2.584123, best loss: 1.898043 2025-01-16 01:36:57,419 - INFO - step 8403, loss: 2.382731, best loss: 1.898043 2025-01-16 01:36:57,569 - INFO - step 8404, loss: 2.628745, best loss: 1.898043 2025-01-16 01:36:57,719 - INFO - step 8405, loss: 2.526221, best loss: 1.898043 2025-01-16 01:36:57,869 - INFO - step 8406, loss: 2.349523, best loss: 1.898043 2025-01-16 01:36:58,019 - INFO - step 8407, loss: 2.545148, best loss: 1.898043 2025-01-16 01:36:58,170 - INFO - step 8408, loss: 2.412327, best loss: 1.898043 2025-01-16 01:36:58,320 - INFO - step 8409, loss: 2.562420, best 
loss: 1.898043 2025-01-16 01:36:58,470 - INFO - step 8410, loss: 2.362276, best loss: 1.898043 2025-01-16 01:36:58,621 - INFO - step 8411, loss: 2.381560, best loss: 1.898043 2025-01-16 01:36:58,771 - INFO - step 8412, loss: 2.278030, best loss: 1.898043 2025-01-16 01:36:58,921 - INFO - step 8413, loss: 2.149438, best loss: 1.898043 2025-01-16 01:36:59,071 - INFO - step 8414, loss: 2.582668, best loss: 1.898043 2025-01-16 01:36:59,221 - INFO - step 8415, loss: 2.446530, best loss: 1.898043 2025-01-16 01:36:59,372 - INFO - step 8416, loss: 2.688490, best loss: 1.898043 2025-01-16 01:36:59,522 - INFO - step 8417, loss: 2.410725, best loss: 1.898043 2025-01-16 01:36:59,672 - INFO - step 8418, loss: 2.510693, best loss: 1.898043 2025-01-16 01:36:59,822 - INFO - step 8419, loss: 2.621459, best loss: 1.898043 2025-01-16 01:36:59,973 - INFO - step 8420, loss: 2.400118, best loss: 1.898043 2025-01-16 01:37:00,123 - INFO - step 8421, loss: 2.293329, best loss: 1.898043 2025-01-16 01:37:00,273 - INFO - step 8422, loss: 2.310175, best loss: 1.898043 2025-01-16 01:37:00,424 - INFO - step 8423, loss: 2.461597, best loss: 1.898043 2025-01-16 01:37:00,574 - INFO - step 8424, loss: 2.514684, best loss: 1.898043 2025-01-16 01:37:00,724 - INFO - step 8425, loss: 2.527544, best loss: 1.898043 2025-01-16 01:37:00,874 - INFO - step 8426, loss: 2.697319, best loss: 1.898043 2025-01-16 01:37:01,025 - INFO - step 8427, loss: 2.685982, best loss: 1.898043 2025-01-16 01:37:01,175 - INFO - step 8428, loss: 2.612417, best loss: 1.898043 2025-01-16 01:37:01,325 - INFO - step 8429, loss: 2.664081, best loss: 1.898043 2025-01-16 01:37:01,475 - INFO - step 8430, loss: 2.744036, best loss: 1.898043 2025-01-16 01:37:01,626 - INFO - step 8431, loss: 2.368064, best loss: 1.898043 2025-01-16 01:37:01,776 - INFO - step 8432, loss: 2.645359, best loss: 1.898043 2025-01-16 01:37:01,926 - INFO - step 8433, loss: 2.801574, best loss: 1.898043 2025-01-16 01:37:02,076 - INFO - step 8434, loss: 2.554063, best 
loss: 1.898043 2025-01-16 01:37:02,226 - INFO - step 8435, loss: 2.559461, best loss: 1.898043 2025-01-16 01:37:02,377 - INFO - step 8436, loss: 2.709999, best loss: 1.898043 2025-01-16 01:37:02,527 - INFO - step 8437, loss: 2.476680, best loss: 1.898043 2025-01-16 01:37:06,059 - INFO - step 8438, loss: 1.838395, best loss: 1.838395 2025-01-16 01:37:06,219 - INFO - step 8439, loss: 2.550906, best loss: 1.838395 2025-01-16 01:37:06,370 - INFO - step 8440, loss: 2.537724, best loss: 1.838395 2025-01-16 01:37:06,520 - INFO - step 8441, loss: 2.576846, best loss: 1.838395 2025-01-16 01:37:06,671 - INFO - step 8442, loss: 2.668577, best loss: 1.838395 2025-01-16 01:37:06,821 - INFO - step 8443, loss: 2.441977, best loss: 1.838395 2025-01-16 01:37:06,971 - INFO - step 8444, loss: 2.629882, best loss: 1.838395 2025-01-16 01:37:07,121 - INFO - step 8445, loss: 2.555298, best loss: 1.838395 2025-01-16 01:37:07,271 - INFO - step 8446, loss: 2.518140, best loss: 1.838395 2025-01-16 01:37:07,421 - INFO - step 8447, loss: 2.476199, best loss: 1.838395 2025-01-16 01:37:07,572 - INFO - step 8448, loss: 2.460676, best loss: 1.838395 2025-01-16 01:37:07,722 - INFO - step 8449, loss: 2.228030, best loss: 1.838395 2025-01-16 01:37:07,872 - INFO - step 8450, loss: 2.431732, best loss: 1.838395 2025-01-16 01:37:08,022 - INFO - step 8451, loss: 2.321938, best loss: 1.838395 2025-01-16 01:37:08,172 - INFO - step 8452, loss: 2.620801, best loss: 1.838395 2025-01-16 01:37:08,322 - INFO - step 8453, loss: 2.663173, best loss: 1.838395 2025-01-16 01:37:08,473 - INFO - step 8454, loss: 2.422652, best loss: 1.838395 2025-01-16 01:37:08,623 - INFO - step 8455, loss: 2.182667, best loss: 1.838395 2025-01-16 01:37:08,773 - INFO - step 8456, loss: 2.489318, best loss: 1.838395 2025-01-16 01:37:08,923 - INFO - step 8457, loss: 2.488392, best loss: 1.838395 2025-01-16 01:37:09,073 - INFO - step 8458, loss: 2.526540, best loss: 1.838395 2025-01-16 01:37:09,224 - INFO - step 8459, loss: 2.520807, best 
loss: 1.838395 2025-01-16 01:37:09,374 - INFO - step 8460, loss: 2.503762, best loss: 1.838395 2025-01-16 01:37:09,525 - INFO - step 8461, loss: 2.410782, best loss: 1.838395 2025-01-16 01:37:09,675 - INFO - step 8462, loss: 2.569111, best loss: 1.838395 2025-01-16 01:37:09,825 - INFO - step 8463, loss: 2.414548, best loss: 1.838395 2025-01-16 01:37:09,976 - INFO - step 8464, loss: 2.489980, best loss: 1.838395 2025-01-16 01:37:10,127 - INFO - step 8465, loss: 2.597554, best loss: 1.838395 2025-01-16 01:37:10,278 - INFO - step 8466, loss: 2.669935, best loss: 1.838395 2025-01-16 01:37:10,428 - INFO - step 8467, loss: 2.649178, best loss: 1.838395 2025-01-16 01:37:10,578 - INFO - step 8468, loss: 2.403392, best loss: 1.838395 2025-01-16 01:37:10,728 - INFO - step 8469, loss: 2.463623, best loss: 1.838395 2025-01-16 01:37:10,878 - INFO - step 8470, loss: 2.667901, best loss: 1.838395 2025-01-16 01:37:11,029 - INFO - step 8471, loss: 2.754883, best loss: 1.838395 2025-01-16 01:37:11,179 - INFO - step 8472, loss: 2.594444, best loss: 1.838395 2025-01-16 01:37:11,330 - INFO - step 8473, loss: 2.800123, best loss: 1.838395 2025-01-16 01:37:11,480 - INFO - step 8474, loss: 2.899068, best loss: 1.838395 2025-01-16 01:37:11,630 - INFO - step 8475, loss: 2.500171, best loss: 1.838395 2025-01-16 01:37:11,780 - INFO - step 8476, loss: 2.805840, best loss: 1.838395 2025-01-16 01:37:11,930 - INFO - step 8477, loss: 2.514893, best loss: 1.838395 2025-01-16 01:37:12,081 - INFO - step 8478, loss: 2.225307, best loss: 1.838395 2025-01-16 01:37:12,231 - INFO - step 8479, loss: 2.690227, best loss: 1.838395 2025-01-16 01:37:12,381 - INFO - step 8480, loss: 2.628457, best loss: 1.838395 2025-01-16 01:37:12,532 - INFO - step 8481, loss: 2.526708, best loss: 1.838395 2025-01-16 01:37:12,682 - INFO - step 8482, loss: 2.334027, best loss: 1.838395 2025-01-16 01:37:12,833 - INFO - step 8483, loss: 2.612015, best loss: 1.838395 2025-01-16 01:37:12,983 - INFO - step 8484, loss: 2.633316, best 
loss: 1.838395 2025-01-16 01:37:13,133 - INFO - step 8485, loss: 2.475015, best loss: 1.838395 2025-01-16 01:37:13,283 - INFO - step 8486, loss: 2.700767, best loss: 1.838395 2025-01-16 01:37:13,434 - INFO - step 8487, loss: 2.345282, best loss: 1.838395 2025-01-16 01:37:13,584 - INFO - step 8488, loss: 2.241682, best loss: 1.838395 2025-01-16 01:37:13,734 - INFO - step 8489, loss: 2.702366, best loss: 1.838395 2025-01-16 01:37:13,884 - INFO - step 8490, loss: 2.743467, best loss: 1.838395 2025-01-16 01:37:14,035 - INFO - step 8491, loss: 2.832593, best loss: 1.838395 2025-01-16 01:37:14,185 - INFO - step 8492, loss: 2.621429, best loss: 1.838395 2025-01-16 01:37:14,335 - INFO - step 8493, loss: 2.608720, best loss: 1.838395 2025-01-16 01:37:14,485 - INFO - step 8494, loss: 2.660996, best loss: 1.838395 2025-01-16 01:37:14,636 - INFO - step 8495, loss: 2.444126, best loss: 1.838395 2025-01-16 01:37:14,786 - INFO - step 8496, loss: 2.405145, best loss: 1.838395 2025-01-16 01:37:14,936 - INFO - step 8497, loss: 2.690850, best loss: 1.838395 2025-01-16 01:37:15,086 - INFO - step 8498, loss: 2.359952, best loss: 1.838395 2025-01-16 01:37:15,236 - INFO - step 8499, loss: 2.158394, best loss: 1.838395 2025-01-16 01:37:15,387 - INFO - step 8500, loss: 2.511674, best loss: 1.838395 2025-01-16 01:37:15,537 - INFO - step 8501, loss: 2.475652, best loss: 1.838395 2025-01-16 01:37:15,687 - INFO - step 8502, loss: 2.573149, best loss: 1.838395 2025-01-16 01:37:15,837 - INFO - step 8503, loss: 2.173130, best loss: 1.838395 2025-01-16 01:37:15,988 - INFO - step 8504, loss: 2.012540, best loss: 1.838395 2025-01-16 01:37:16,138 - INFO - step 8505, loss: 1.997057, best loss: 1.838395 2025-01-16 01:37:16,288 - INFO - step 8506, loss: 2.222584, best loss: 1.838395 2025-01-16 01:37:16,438 - INFO - step 8507, loss: 2.411284, best loss: 1.838395 2025-01-16 01:37:16,588 - INFO - step 8508, loss: 2.578720, best loss: 1.838395 2025-01-16 01:37:16,739 - INFO - step 8509, loss: 2.541152, best 
loss: 1.838395 2025-01-16 01:37:16,889 - INFO - step 8510, loss: 2.375870, best loss: 1.838395 2025-01-16 01:37:17,039 - INFO - step 8511, loss: 2.433843, best loss: 1.838395 2025-01-16 01:37:17,190 - INFO - step 8512, loss: 2.478752, best loss: 1.838395 2025-01-16 01:37:17,340 - INFO - step 8513, loss: 2.322651, best loss: 1.838395 2025-01-16 01:37:17,490 - INFO - step 8514, loss: 2.492134, best loss: 1.838395 2025-01-16 01:37:17,640 - INFO - step 8515, loss: 2.491327, best loss: 1.838395 2025-01-16 01:37:17,790 - INFO - step 8516, loss: 2.205928, best loss: 1.838395 2025-01-16 01:37:17,941 - INFO - step 8517, loss: 2.153211, best loss: 1.838395 2025-01-16 01:37:18,091 - INFO - step 8518, loss: 2.315540, best loss: 1.838395 2025-01-16 01:37:18,241 - INFO - step 8519, loss: 2.422966, best loss: 1.838395 2025-01-16 01:37:18,391 - INFO - step 8520, loss: 2.219235, best loss: 1.838395 2025-01-16 01:37:18,541 - INFO - step 8521, loss: 2.230711, best loss: 1.838395 2025-01-16 01:37:18,691 - INFO - step 8522, loss: 2.375081, best loss: 1.838395 2025-01-16 01:37:18,841 - INFO - step 8523, loss: 2.210305, best loss: 1.838395 2025-01-16 01:37:18,992 - INFO - step 8524, loss: 2.291485, best loss: 1.838395 2025-01-16 01:37:19,142 - INFO - step 8525, loss: 2.511676, best loss: 1.838395 2025-01-16 01:37:19,292 - INFO - step 8526, loss: 2.217601, best loss: 1.838395 2025-01-16 01:37:19,442 - INFO - step 8527, loss: 2.275217, best loss: 1.838395 2025-01-16 01:37:19,592 - INFO - step 8528, loss: 2.218309, best loss: 1.838395 2025-01-16 01:37:19,743 - INFO - step 8529, loss: 2.499235, best loss: 1.838395 2025-01-16 01:37:19,893 - INFO - step 8530, loss: 2.185205, best loss: 1.838395 2025-01-16 01:37:20,043 - INFO - step 8531, loss: 2.250221, best loss: 1.838395 2025-01-16 01:37:20,193 - INFO - step 8532, loss: 2.198390, best loss: 1.838395 2025-01-16 01:37:20,343 - INFO - step 8533, loss: 2.430079, best loss: 1.838395 2025-01-16 01:37:20,493 - INFO - step 8534, loss: 2.637371, best 
loss: 1.838395 2025-01-16 01:37:20,643 - INFO - step 8535, loss: 2.527908, best loss: 1.838395 2025-01-16 01:37:20,793 - INFO - step 8536, loss: 2.423876, best loss: 1.838395 2025-01-16 01:37:20,944 - INFO - step 8537, loss: 2.478172, best loss: 1.838395 2025-01-16 01:37:21,094 - INFO - step 8538, loss: 2.442551, best loss: 1.838395 2025-01-16 01:37:21,244 - INFO - step 8539, loss: 2.421334, best loss: 1.838395 2025-01-16 01:37:21,394 - INFO - step 8540, loss: 2.264294, best loss: 1.838395 2025-01-16 01:37:21,544 - INFO - step 8541, loss: 2.444350, best loss: 1.838395 2025-01-16 01:37:21,694 - INFO - step 8542, loss: 2.306625, best loss: 1.838395 2025-01-16 01:37:21,844 - INFO - step 8543, loss: 2.201123, best loss: 1.838395 2025-01-16 01:37:21,994 - INFO - step 8544, loss: 2.394461, best loss: 1.838395 2025-01-16 01:37:22,144 - INFO - step 8545, loss: 2.329626, best loss: 1.838395 2025-01-16 01:37:22,294 - INFO - step 8546, loss: 2.522902, best loss: 1.838395 2025-01-16 01:37:22,445 - INFO - step 8547, loss: 1.963727, best loss: 1.838395 2025-01-16 01:37:22,595 - INFO - step 8548, loss: 2.439205, best loss: 1.838395 2025-01-16 01:37:22,745 - INFO - step 8549, loss: 2.455299, best loss: 1.838395 2025-01-16 01:37:22,895 - INFO - step 8550, loss: 2.332600, best loss: 1.838395 2025-01-16 01:37:23,045 - INFO - step 8551, loss: 2.402316, best loss: 1.838395 2025-01-16 01:37:23,195 - INFO - step 8552, loss: 2.316394, best loss: 1.838395 2025-01-16 01:37:23,346 - INFO - step 8553, loss: 2.385984, best loss: 1.838395 2025-01-16 01:37:23,496 - INFO - step 8554, loss: 2.258612, best loss: 1.838395 2025-01-16 01:37:23,646 - INFO - step 8555, loss: 2.332611, best loss: 1.838395 2025-01-16 01:37:23,796 - INFO - step 8556, loss: 2.261322, best loss: 1.838395 2025-01-16 01:37:23,946 - INFO - step 8557, loss: 2.439163, best loss: 1.838395 2025-01-16 01:37:24,096 - INFO - step 8558, loss: 2.188854, best loss: 1.838395 2025-01-16 01:37:24,246 - INFO - step 8559, loss: 2.240361, best 
loss: 1.838395 2025-01-16 01:37:24,396 - INFO - step 8560, loss: 2.126137, best loss: 1.838395 2025-01-16 01:37:24,546 - INFO - step 8561, loss: 2.154346, best loss: 1.838395 2025-01-16 01:37:24,697 - INFO - step 8562, loss: 2.306817, best loss: 1.838395 2025-01-16 01:37:24,847 - INFO - step 8563, loss: 2.262588, best loss: 1.838395 2025-01-16 01:37:24,997 - INFO - step 8564, loss: 2.260241, best loss: 1.838395 2025-01-16 01:37:25,148 - INFO - step 8565, loss: 1.875355, best loss: 1.838395 2025-01-16 01:37:25,298 - INFO - step 8566, loss: 1.998847, best loss: 1.838395 2025-01-16 01:37:25,448 - INFO - step 8567, loss: 1.849964, best loss: 1.838395 2025-01-16 01:37:25,598 - INFO - step 8568, loss: 2.385435, best loss: 1.838395 2025-01-16 01:37:25,748 - INFO - step 8569, loss: 2.514573, best loss: 1.838395 2025-01-16 01:37:25,898 - INFO - step 8570, loss: 2.503973, best loss: 1.838395 2025-01-16 01:37:26,048 - INFO - step 8571, loss: 2.570850, best loss: 1.838395 2025-01-16 01:37:26,198 - INFO - step 8572, loss: 2.483060, best loss: 1.838395 2025-01-16 01:37:26,349 - INFO - step 8573, loss: 2.237010, best loss: 1.838395 2025-01-16 01:37:26,499 - INFO - step 8574, loss: 2.342669, best loss: 1.838395 2025-01-16 01:37:26,649 - INFO - step 8575, loss: 2.545278, best loss: 1.838395 2025-01-16 01:37:26,799 - INFO - step 8576, loss: 2.406279, best loss: 1.838395 2025-01-16 01:37:26,949 - INFO - step 8577, loss: 1.942563, best loss: 1.838395 2025-01-16 01:37:27,099 - INFO - step 8578, loss: 2.251490, best loss: 1.838395 2025-01-16 01:37:27,249 - INFO - step 8579, loss: 2.092997, best loss: 1.838395 2025-01-16 01:37:27,399 - INFO - step 8580, loss: 2.488417, best loss: 1.838395 2025-01-16 01:37:27,549 - INFO - step 8581, loss: 2.432677, best loss: 1.838395 2025-01-16 01:37:27,699 - INFO - step 8582, loss: 2.448064, best loss: 1.838395 2025-01-16 01:37:27,849 - INFO - step 8583, loss: 2.600467, best loss: 1.838395 2025-01-16 01:37:27,999 - INFO - step 8584, loss: 2.499783, best 
loss: 1.838395 2025-01-16 01:37:28,149 - INFO - step 8585, loss: 2.108053, best loss: 1.838395 2025-01-16 01:37:28,299 - INFO - step 8586, loss: 2.574871, best loss: 1.838395 2025-01-16 01:37:28,449 - INFO - step 8587, loss: 2.399735, best loss: 1.838395 2025-01-16 01:37:28,599 - INFO - step 8588, loss: 2.629868, best loss: 1.838395 2025-01-16 01:37:28,749 - INFO - step 8589, loss: 2.583326, best loss: 1.838395 2025-01-16 01:37:28,899 - INFO - step 8590, loss: 2.289412, best loss: 1.838395 2025-01-16 01:37:29,050 - INFO - step 8591, loss: 2.411615, best loss: 1.838395 2025-01-16 01:37:29,199 - INFO - step 8592, loss: 2.173622, best loss: 1.838395 2025-01-16 01:37:29,349 - INFO - step 8593, loss: 2.434924, best loss: 1.838395 2025-01-16 01:37:29,500 - INFO - step 8594, loss: 2.594072, best loss: 1.838395 2025-01-16 01:37:29,650 - INFO - step 8595, loss: 2.525743, best loss: 1.838395 2025-01-16 01:37:29,800 - INFO - step 8596, loss: 2.460719, best loss: 1.838395 2025-01-16 01:37:29,950 - INFO - step 8597, loss: 2.362139, best loss: 1.838395 2025-01-16 01:37:30,100 - INFO - step 8598, loss: 2.400976, best loss: 1.838395 2025-01-16 01:37:30,250 - INFO - step 8599, loss: 2.392231, best loss: 1.838395 2025-01-16 01:37:30,400 - INFO - step 8600, loss: 2.186741, best loss: 1.838395 2025-01-16 01:37:30,550 - INFO - step 8601, loss: 2.649804, best loss: 1.838395 2025-01-16 01:37:30,700 - INFO - step 8602, loss: 2.060103, best loss: 1.838395 2025-01-16 01:37:30,850 - INFO - step 8603, loss: 2.172179, best loss: 1.838395 2025-01-16 01:37:31,000 - INFO - step 8604, loss: 2.425352, best loss: 1.838395 2025-01-16 01:37:31,150 - INFO - step 8605, loss: 2.545390, best loss: 1.838395 2025-01-16 01:37:31,300 - INFO - step 8606, loss: 2.404076, best loss: 1.838395 2025-01-16 01:37:31,450 - INFO - step 8607, loss: 2.220205, best loss: 1.838395 2025-01-16 01:37:31,600 - INFO - step 8608, loss: 2.380671, best loss: 1.838395 2025-01-16 01:37:31,750 - INFO - step 8609, loss: 2.553551, best 
loss: 1.838395 2025-01-16 01:37:31,901 - INFO - step 8610, loss: 2.244735, best loss: 1.838395 2025-01-16 01:37:32,051 - INFO - step 8611, loss: 2.117446, best loss: 1.838395 2025-01-16 01:37:32,201 - INFO - step 8612, loss: 2.373702, best loss: 1.838395 2025-01-16 01:37:32,351 - INFO - step 8613, loss: 2.419621, best loss: 1.838395 2025-01-16 01:37:32,501 - INFO - step 8614, loss: 2.299599, best loss: 1.838395 2025-01-16 01:37:32,651 - INFO - step 8615, loss: 2.117370, best loss: 1.838395 2025-01-16 01:37:32,801 - INFO - step 8616, loss: 2.423448, best loss: 1.838395 2025-01-16 01:37:32,951 - INFO - step 8617, loss: 2.643934, best loss: 1.838395 2025-01-16 01:37:33,101 - INFO - step 8618, loss: 2.385235, best loss: 1.838395 2025-01-16 01:37:33,251 - INFO - step 8619, loss: 2.329108, best loss: 1.838395 2025-01-16 01:37:33,401 - INFO - step 8620, loss: 2.589162, best loss: 1.838395 2025-01-16 01:37:33,551 - INFO - step 8621, loss: 2.513451, best loss: 1.838395 2025-01-16 01:37:33,702 - INFO - step 8622, loss: 2.528259, best loss: 1.838395 2025-01-16 01:37:33,852 - INFO - step 8623, loss: 2.341616, best loss: 1.838395 2025-01-16 01:37:34,002 - INFO - step 8624, loss: 2.352156, best loss: 1.838395 2025-01-16 01:37:34,152 - INFO - step 8625, loss: 2.289942, best loss: 1.838395 2025-01-16 01:37:34,302 - INFO - step 8626, loss: 2.651520, best loss: 1.838395 2025-01-16 01:37:34,452 - INFO - step 8627, loss: 2.437943, best loss: 1.838395 2025-01-16 01:37:34,602 - INFO - step 8628, loss: 2.506262, best loss: 1.838395 2025-01-16 01:37:34,753 - INFO - step 8629, loss: 2.009096, best loss: 1.838395 2025-01-16 01:37:34,903 - INFO - step 8630, loss: 2.224809, best loss: 1.838395 2025-01-16 01:37:35,054 - INFO - step 8631, loss: 2.464669, best loss: 1.838395 2025-01-16 01:37:35,204 - INFO - step 8632, loss: 2.379817, best loss: 1.838395 2025-01-16 01:37:35,354 - INFO - step 8633, loss: 2.406041, best loss: 1.838395 2025-01-16 01:37:35,505 - INFO - step 8634, loss: 2.455070, best 
loss: 1.838395 2025-01-16 01:37:35,655 - INFO - step 8635, loss: 2.293090, best loss: 1.838395 2025-01-16 01:37:35,805 - INFO - step 8636, loss: 2.366671, best loss: 1.838395 2025-01-16 01:37:35,955 - INFO - step 8637, loss: 2.414194, best loss: 1.838395 2025-01-16 01:37:36,105 - INFO - step 8638, loss: 2.051713, best loss: 1.838395 2025-01-16 01:37:36,255 - INFO - step 8639, loss: 2.365171, best loss: 1.838395 2025-01-16 01:37:36,405 - INFO - step 8640, loss: 2.447060, best loss: 1.838395 2025-01-16 01:37:36,556 - INFO - step 8641, loss: 2.630379, best loss: 1.838395 2025-01-16 01:37:36,706 - INFO - step 8642, loss: 2.420974, best loss: 1.838395 2025-01-16 01:37:36,856 - INFO - step 8643, loss: 2.555197, best loss: 1.838395 2025-01-16 01:37:37,006 - INFO - step 8644, loss: 2.446351, best loss: 1.838395 2025-01-16 01:37:37,156 - INFO - step 8645, loss: 2.173034, best loss: 1.838395 2025-01-16 01:37:37,306 - INFO - step 8646, loss: 2.513541, best loss: 1.838395 2025-01-16 01:37:37,456 - INFO - step 8647, loss: 2.057067, best loss: 1.838395 2025-01-16 01:37:37,606 - INFO - step 8648, loss: 2.281755, best loss: 1.838395 2025-01-16 01:37:37,756 - INFO - step 8649, loss: 2.408461, best loss: 1.838395 2025-01-16 01:37:37,906 - INFO - step 8650, loss: 2.336146, best loss: 1.838395 2025-01-16 01:37:38,056 - INFO - step 8651, loss: 2.286366, best loss: 1.838395 2025-01-16 01:37:38,206 - INFO - step 8652, loss: 2.480485, best loss: 1.838395 2025-01-16 01:37:38,356 - INFO - step 8653, loss: 2.505171, best loss: 1.838395 2025-01-16 01:37:38,506 - INFO - step 8654, loss: 2.486025, best loss: 1.838395 2025-01-16 01:37:38,656 - INFO - step 8655, loss: 2.514551, best loss: 1.838395 2025-01-16 01:37:38,807 - INFO - step 8656, loss: 2.332776, best loss: 1.838395 2025-01-16 01:37:38,957 - INFO - step 8657, loss: 2.461348, best loss: 1.838395 2025-01-16 01:37:39,107 - INFO - step 8658, loss: 2.403665, best loss: 1.838395 2025-01-16 01:37:39,257 - INFO - step 8659, loss: 2.093620, best 
loss: 1.838395 2025-01-16 01:37:39,408 - INFO - step 8660, loss: 2.510776, best loss: 1.838395 2025-01-16 01:37:39,558 - INFO - step 8661, loss: 2.381171, best loss: 1.838395 2025-01-16 01:37:39,708 - INFO - step 8662, loss: 2.365827, best loss: 1.838395 2025-01-16 01:37:39,858 - INFO - step 8663, loss: 2.330783, best loss: 1.838395 2025-01-16 01:37:40,008 - INFO - step 8664, loss: 2.120457, best loss: 1.838395 2025-01-16 01:37:40,158 - INFO - step 8665, loss: 2.311951, best loss: 1.838395 2025-01-16 01:37:40,308 - INFO - step 8666, loss: 2.186358, best loss: 1.838395 2025-01-16 01:37:40,458 - INFO - step 8667, loss: 2.131291, best loss: 1.838395 2025-01-16 01:37:40,608 - INFO - step 8668, loss: 2.536906, best loss: 1.838395 2025-01-16 01:37:40,758 - INFO - step 8669, loss: 2.456885, best loss: 1.838395 2025-01-16 01:37:40,909 - INFO - step 8670, loss: 2.238103, best loss: 1.838395 2025-01-16 01:37:41,059 - INFO - step 8671, loss: 2.556744, best loss: 1.838395 2025-01-16 01:37:41,209 - INFO - step 8672, loss: 2.344091, best loss: 1.838395 2025-01-16 01:37:41,359 - INFO - step 8673, loss: 2.524032, best loss: 1.838395 2025-01-16 01:37:41,509 - INFO - step 8674, loss: 2.608812, best loss: 1.838395 2025-01-16 01:37:41,660 - INFO - step 8675, loss: 2.679603, best loss: 1.838395 2025-01-16 01:37:41,810 - INFO - step 8676, loss: 2.622741, best loss: 1.838395 2025-01-16 01:37:41,960 - INFO - step 8677, loss: 2.427869, best loss: 1.838395 2025-01-16 01:37:42,110 - INFO - step 8678, loss: 2.554799, best loss: 1.838395 2025-01-16 01:37:42,260 - INFO - step 8679, loss: 2.477781, best loss: 1.838395 2025-01-16 01:37:42,410 - INFO - step 8680, loss: 2.410771, best loss: 1.838395 2025-01-16 01:37:42,561 - INFO - step 8681, loss: 2.673795, best loss: 1.838395 2025-01-16 01:37:42,711 - INFO - step 8682, loss: 2.553040, best loss: 1.838395 2025-01-16 01:37:42,861 - INFO - step 8683, loss: 2.721159, best loss: 1.838395 2025-01-16 01:37:43,011 - INFO - step 8684, loss: 2.424556, best 
loss: 1.838395 2025-01-16 01:37:43,161 - INFO - step 8685, loss: 2.465339, best loss: 1.838395 2025-01-16 01:37:43,312 - INFO - step 8686, loss: 2.505178, best loss: 1.838395 2025-01-16 01:37:43,462 - INFO - step 8687, loss: 2.327817, best loss: 1.838395 2025-01-16 01:37:43,612 - INFO - step 8688, loss: 2.327857, best loss: 1.838395 2025-01-16 01:37:43,762 - INFO - step 8689, loss: 2.679291, best loss: 1.838395 2025-01-16 01:37:43,912 - INFO - step 8690, loss: 2.574435, best loss: 1.838395 2025-01-16 01:37:44,063 - INFO - step 8691, loss: 2.716364, best loss: 1.838395 2025-01-16 01:37:44,213 - INFO - step 8692, loss: 2.521895, best loss: 1.838395 2025-01-16 01:37:44,363 - INFO - step 8693, loss: 2.343169, best loss: 1.838395 2025-01-16 01:37:44,513 - INFO - step 8694, loss: 2.720057, best loss: 1.838395 2025-01-16 01:37:44,663 - INFO - step 8695, loss: 2.474229, best loss: 1.838395 2025-01-16 01:37:44,813 - INFO - step 8696, loss: 2.587868, best loss: 1.838395 2025-01-16 01:37:44,963 - INFO - step 8697, loss: 2.391187, best loss: 1.838395 2025-01-16 01:37:45,113 - INFO - step 8698, loss: 2.416275, best loss: 1.838395 2025-01-16 01:37:45,263 - INFO - step 8699, loss: 2.418828, best loss: 1.838395 2025-01-16 01:37:45,413 - INFO - step 8700, loss: 2.566136, best loss: 1.838395 2025-01-16 01:37:45,563 - INFO - step 8701, loss: 2.539370, best loss: 1.838395 2025-01-16 01:37:45,713 - INFO - step 8702, loss: 2.432145, best loss: 1.838395 2025-01-16 01:37:45,863 - INFO - step 8703, loss: 2.247668, best loss: 1.838395 2025-01-16 01:37:46,013 - INFO - step 8704, loss: 2.080763, best loss: 1.838395 2025-01-16 01:37:46,164 - INFO - step 8705, loss: 2.172315, best loss: 1.838395 2025-01-16 01:37:46,314 - INFO - step 8706, loss: 2.384711, best loss: 1.838395 2025-01-16 01:37:46,464 - INFO - step 8707, loss: 2.719852, best loss: 1.838395 2025-01-16 01:37:46,614 - INFO - step 8708, loss: 2.352516, best loss: 1.838395 2025-01-16 01:37:46,764 - INFO - step 8709, loss: 2.036566, best 
loss: 1.838395 2025-01-16 01:37:46,915 - INFO - step 8710, loss: 2.452925, best loss: 1.838395 2025-01-16 01:37:47,065 - INFO - step 8711, loss: 2.370485, best loss: 1.838395 2025-01-16 01:37:47,215 - INFO - step 8712, loss: 2.733499, best loss: 1.838395 2025-01-16 01:37:47,365 - INFO - step 8713, loss: 2.174571, best loss: 1.838395 2025-01-16 01:37:47,516 - INFO - step 8714, loss: 2.465138, best loss: 1.838395 2025-01-16 01:37:47,666 - INFO - step 8715, loss: 2.346543, best loss: 1.838395 2025-01-16 01:37:47,816 - INFO - step 8716, loss: 2.391980, best loss: 1.838395 2025-01-16 01:37:47,967 - INFO - step 8717, loss: 2.433168, best loss: 1.838395 2025-01-16 01:37:48,117 - INFO - step 8718, loss: 2.320524, best loss: 1.838395 2025-01-16 01:37:48,267 - INFO - step 8719, loss: 2.500138, best loss: 1.838395 2025-01-16 01:37:48,417 - INFO - step 8720, loss: 2.441822, best loss: 1.838395 2025-01-16 01:37:48,567 - INFO - step 8721, loss: 2.368016, best loss: 1.838395 2025-01-16 01:37:48,717 - INFO - step 8722, loss: 2.715431, best loss: 1.838395 2025-01-16 01:37:48,868 - INFO - step 8723, loss: 2.247552, best loss: 1.838395 2025-01-16 01:37:49,018 - INFO - step 8724, loss: 1.909791, best loss: 1.838395 2025-01-16 01:37:49,168 - INFO - step 8725, loss: 2.427731, best loss: 1.838395 2025-01-16 01:37:49,319 - INFO - step 8726, loss: 2.563496, best loss: 1.838395 2025-01-16 01:37:49,469 - INFO - step 8727, loss: 2.428978, best loss: 1.838395 2025-01-16 01:37:49,619 - INFO - step 8728, loss: 2.163100, best loss: 1.838395 2025-01-16 01:37:49,770 - INFO - step 8729, loss: 2.237286, best loss: 1.838395 2025-01-16 01:37:49,920 - INFO - step 8730, loss: 2.565086, best loss: 1.838395 2025-01-16 01:37:50,070 - INFO - step 8731, loss: 2.610005, best loss: 1.838395 2025-01-16 01:37:50,220 - INFO - step 8732, loss: 2.496758, best loss: 1.838395 2025-01-16 01:37:50,371 - INFO - step 8733, loss: 2.330240, best loss: 1.838395 2025-01-16 01:37:50,521 - INFO - step 8734, loss: 2.514624, best 
loss: 1.838395 2025-01-16 01:37:50,671 - INFO - step 8735, loss: 2.445411, best loss: 1.838395 2025-01-16 01:37:50,821 - INFO - step 8736, loss: 2.241532, best loss: 1.838395 2025-01-16 01:37:50,971 - INFO - step 8737, loss: 2.447092, best loss: 1.838395 2025-01-16 01:37:51,122 - INFO - step 8738, loss: 2.274794, best loss: 1.838395 2025-01-16 01:37:51,272 - INFO - step 8739, loss: 2.433362, best loss: 1.838395 2025-01-16 01:37:51,422 - INFO - step 8740, loss: 2.259520, best loss: 1.838395 2025-01-16 01:37:51,572 - INFO - step 8741, loss: 2.418556, best loss: 1.838395 2025-01-16 01:37:51,722 - INFO - step 8742, loss: 2.150977, best loss: 1.838395 2025-01-16 01:37:51,872 - INFO - step 8743, loss: 2.151968, best loss: 1.838395 2025-01-16 01:37:52,022 - INFO - step 8744, loss: 2.505091, best loss: 1.838395 2025-01-16 01:37:52,172 - INFO - step 8745, loss: 2.443838, best loss: 1.838395 2025-01-16 01:37:52,322 - INFO - step 8746, loss: 2.645687, best loss: 1.838395 2025-01-16 01:37:52,472 - INFO - step 8747, loss: 2.359403, best loss: 1.838395 2025-01-16 01:37:52,622 - INFO - step 8748, loss: 2.483825, best loss: 1.838395 2025-01-16 01:37:52,772 - INFO - step 8749, loss: 2.526218, best loss: 1.838395 2025-01-16 01:37:52,922 - INFO - step 8750, loss: 2.339045, best loss: 1.838395 2025-01-16 01:37:53,072 - INFO - step 8751, loss: 2.235453, best loss: 1.838395 2025-01-16 01:37:53,222 - INFO - step 8752, loss: 2.319776, best loss: 1.838395 2025-01-16 01:37:53,372 - INFO - step 8753, loss: 2.422086, best loss: 1.838395 2025-01-16 01:37:53,522 - INFO - step 8754, loss: 2.458120, best loss: 1.838395 2025-01-16 01:37:53,672 - INFO - step 8755, loss: 2.560281, best loss: 1.838395 2025-01-16 01:37:53,822 - INFO - step 8756, loss: 2.673400, best loss: 1.838395 2025-01-16 01:37:53,972 - INFO - step 8757, loss: 2.647237, best loss: 1.838395 2025-01-16 01:37:54,122 - INFO - step 8758, loss: 2.557707, best loss: 1.838395 2025-01-16 01:37:54,273 - INFO - step 8759, loss: 2.549325, best 
loss: 1.838395 2025-01-16 01:37:54,423 - INFO - step 8760, loss: 2.663491, best loss: 1.838395 2025-01-16 01:37:54,573 - INFO - step 8761, loss: 2.261884, best loss: 1.838395 2025-01-16 01:37:54,723 - INFO - step 8762, loss: 2.581945, best loss: 1.838395 2025-01-16 01:37:54,873 - INFO - step 8763, loss: 2.711194, best loss: 1.838395 2025-01-16 01:37:55,023 - INFO - step 8764, loss: 2.531612, best loss: 1.838395 2025-01-16 01:37:55,174 - INFO - step 8765, loss: 2.440763, best loss: 1.838395 2025-01-16 01:37:55,324 - INFO - step 8766, loss: 2.617367, best loss: 1.838395 2025-01-16 01:37:55,474 - INFO - step 8767, loss: 2.376798, best loss: 1.838395 2025-01-16 01:37:58,898 - INFO - step 8768, loss: 1.725199, best loss: 1.725199 2025-01-16 01:37:59,059 - INFO - step 8769, loss: 2.525320, best loss: 1.725199 2025-01-16 01:37:59,210 - INFO - step 8770, loss: 2.464441, best loss: 1.725199 2025-01-16 01:37:59,360 - INFO - step 8771, loss: 2.497604, best loss: 1.725199 2025-01-16 01:37:59,510 - INFO - step 8772, loss: 2.604284, best loss: 1.725199 2025-01-16 01:37:59,660 - INFO - step 8773, loss: 2.360283, best loss: 1.725199 2025-01-16 01:37:59,810 - INFO - step 8774, loss: 2.495234, best loss: 1.725199 2025-01-16 01:37:59,961 - INFO - step 8775, loss: 2.528901, best loss: 1.725199 2025-01-16 01:38:00,111 - INFO - step 8776, loss: 2.405297, best loss: 1.725199 2025-01-16 01:38:00,261 - INFO - step 8777, loss: 2.374891, best loss: 1.725199 2025-01-16 01:38:00,411 - INFO - step 8778, loss: 2.314714, best loss: 1.725199 2025-01-16 01:38:00,561 - INFO - step 8779, loss: 2.231740, best loss: 1.725199 2025-01-16 01:38:00,711 - INFO - step 8780, loss: 2.409107, best loss: 1.725199 2025-01-16 01:38:00,861 - INFO - step 8781, loss: 2.292904, best loss: 1.725199 2025-01-16 01:38:01,011 - INFO - step 8782, loss: 2.457645, best loss: 1.725199 2025-01-16 01:38:01,162 - INFO - step 8783, loss: 2.593766, best loss: 1.725199 2025-01-16 01:38:01,312 - INFO - step 8784, loss: 2.350307, best 
loss: 1.725199 2025-01-16 01:38:01,462 - INFO - step 8785, loss: 2.228709, best loss: 1.725199 2025-01-16 01:38:01,612 - INFO - step 8786, loss: 2.387531, best loss: 1.725199 2025-01-16 01:38:01,763 - INFO - step 8787, loss: 2.401864, best loss: 1.725199 2025-01-16 01:38:01,913 - INFO - step 8788, loss: 2.444849, best loss: 1.725199 2025-01-16 01:38:02,063 - INFO - step 8789, loss: 2.426240, best loss: 1.725199 2025-01-16 01:38:02,213 - INFO - step 8790, loss: 2.340868, best loss: 1.725199 2025-01-16 01:38:02,363 - INFO - step 8791, loss: 2.340613, best loss: 1.725199 2025-01-16 01:38:02,514 - INFO - step 8792, loss: 2.383499, best loss: 1.725199 2025-01-16 01:38:02,664 - INFO - step 8793, loss: 2.345407, best loss: 1.725199 2025-01-16 01:38:02,814 - INFO - step 8794, loss: 2.396387, best loss: 1.725199 2025-01-16 01:38:02,964 - INFO - step 8795, loss: 2.485364, best loss: 1.725199 2025-01-16 01:38:03,114 - INFO - step 8796, loss: 2.518961, best loss: 1.725199 2025-01-16 01:38:03,264 - INFO - step 8797, loss: 2.477433, best loss: 1.725199 2025-01-16 01:38:03,414 - INFO - step 8798, loss: 2.320590, best loss: 1.725199 2025-01-16 01:38:03,564 - INFO - step 8799, loss: 2.318341, best loss: 1.725199 2025-01-16 01:38:03,714 - INFO - step 8800, loss: 2.545412, best loss: 1.725199 2025-01-16 01:38:03,865 - INFO - step 8801, loss: 2.626199, best loss: 1.725199 2025-01-16 01:38:04,015 - INFO - step 8802, loss: 2.512437, best loss: 1.725199 2025-01-16 01:38:04,165 - INFO - step 8803, loss: 2.683897, best loss: 1.725199 2025-01-16 01:38:04,315 - INFO - step 8804, loss: 2.782368, best loss: 1.725199 2025-01-16 01:38:04,465 - INFO - step 8805, loss: 2.376827, best loss: 1.725199 2025-01-16 01:38:04,615 - INFO - step 8806, loss: 2.650896, best loss: 1.725199 2025-01-16 01:38:04,765 - INFO - step 8807, loss: 2.437345, best loss: 1.725199 2025-01-16 01:38:04,915 - INFO - step 8808, loss: 2.152251, best loss: 1.725199 2025-01-16 01:38:05,066 - INFO - step 8809, loss: 2.616604, best 
loss: 1.725199 2025-01-16 01:38:05,216 - INFO - step 8810, loss: 2.573987, best loss: 1.725199 2025-01-16 01:38:05,366 - INFO - step 8811, loss: 2.476610, best loss: 1.725199 2025-01-16 01:38:05,516 - INFO - step 8812, loss: 2.224684, best loss: 1.725199 2025-01-16 01:38:05,666 - INFO - step 8813, loss: 2.525672, best loss: 1.725199 2025-01-16 01:38:05,817 - INFO - step 8814, loss: 2.488786, best loss: 1.725199 2025-01-16 01:38:05,967 - INFO - step 8815, loss: 2.332417, best loss: 1.725199 2025-01-16 01:38:06,117 - INFO - step 8816, loss: 2.521636, best loss: 1.725199 2025-01-16 01:38:06,268 - INFO - step 8817, loss: 2.274401, best loss: 1.725199 2025-01-16 01:38:06,432 - INFO - step 8818, loss: 2.150034, best loss: 1.725199 2025-01-16 01:38:06,583 - INFO - step 8819, loss: 2.650108, best loss: 1.725199 2025-01-16 01:38:06,733 - INFO - step 8820, loss: 2.649776, best loss: 1.725199 2025-01-16 01:38:06,883 - INFO - step 8821, loss: 2.699533, best loss: 1.725199 2025-01-16 01:38:07,033 - INFO - step 8822, loss: 2.510760, best loss: 1.725199 2025-01-16 01:38:07,183 - INFO - step 8823, loss: 2.508085, best loss: 1.725199 2025-01-16 01:38:07,333 - INFO - step 8824, loss: 2.549984, best loss: 1.725199 2025-01-16 01:38:07,483 - INFO - step 8825, loss: 2.346132, best loss: 1.725199 2025-01-16 01:38:07,633 - INFO - step 8826, loss: 2.294142, best loss: 1.725199 2025-01-16 01:38:07,783 - INFO - step 8827, loss: 2.612155, best loss: 1.725199 2025-01-16 01:38:07,933 - INFO - step 8828, loss: 2.190148, best loss: 1.725199 2025-01-16 01:38:08,084 - INFO - step 8829, loss: 1.987586, best loss: 1.725199 2025-01-16 01:38:08,234 - INFO - step 8830, loss: 2.426660, best loss: 1.725199 2025-01-16 01:38:08,384 - INFO - step 8831, loss: 2.348868, best loss: 1.725199 2025-01-16 01:38:08,534 - INFO - step 8832, loss: 2.415222, best loss: 1.725199 2025-01-16 01:38:08,684 - INFO - step 8833, loss: 2.015487, best loss: 1.725199 2025-01-16 01:38:08,834 - INFO - step 8834, loss: 1.951997, best 
loss: 1.725199 2025-01-16 01:38:08,984 - INFO - step 8835, loss: 1.896017, best loss: 1.725199 2025-01-16 01:38:09,135 - INFO - step 8836, loss: 2.101949, best loss: 1.725199 2025-01-16 01:38:09,285 - INFO - step 8837, loss: 2.323145, best loss: 1.725199 2025-01-16 01:38:09,435 - INFO - step 8838, loss: 2.528993, best loss: 1.725199 2025-01-16 01:38:09,586 - INFO - step 8839, loss: 2.414460, best loss: 1.725199 2025-01-16 01:38:09,736 - INFO - step 8840, loss: 2.296486, best loss: 1.725199 2025-01-16 01:38:09,886 - INFO - step 8841, loss: 2.329962, best loss: 1.725199 2025-01-16 01:38:10,036 - INFO - step 8842, loss: 2.345343, best loss: 1.725199 2025-01-16 01:38:10,186 - INFO - step 8843, loss: 2.212075, best loss: 1.725199 2025-01-16 01:38:10,336 - INFO - step 8844, loss: 2.346988, best loss: 1.725199 2025-01-16 01:38:10,486 - INFO - step 8845, loss: 2.364042, best loss: 1.725199 2025-01-16 01:38:10,636 - INFO - step 8846, loss: 2.139182, best loss: 1.725199 2025-01-16 01:38:10,786 - INFO - step 8847, loss: 2.096327, best loss: 1.725199 2025-01-16 01:38:10,936 - INFO - step 8848, loss: 2.189405, best loss: 1.725199 2025-01-16 01:38:11,087 - INFO - step 8849, loss: 2.347345, best loss: 1.725199 2025-01-16 01:38:11,237 - INFO - step 8850, loss: 2.098578, best loss: 1.725199 2025-01-16 01:38:11,387 - INFO - step 8851, loss: 2.123527, best loss: 1.725199 2025-01-16 01:38:11,537 - INFO - step 8852, loss: 2.300043, best loss: 1.725199 2025-01-16 01:38:11,687 - INFO - step 8853, loss: 2.083989, best loss: 1.725199 2025-01-16 01:38:11,837 - INFO - step 8854, loss: 2.217573, best loss: 1.725199 2025-01-16 01:38:11,987 - INFO - step 8855, loss: 2.386060, best loss: 1.725199 2025-01-16 01:38:12,138 - INFO - step 8856, loss: 2.150179, best loss: 1.725199 2025-01-16 01:38:12,288 - INFO - step 8857, loss: 2.149646, best loss: 1.725199 2025-01-16 01:38:12,438 - INFO - step 8858, loss: 2.214304, best loss: 1.725199 2025-01-16 01:38:12,588 - INFO - step 8859, loss: 2.399466, best 
loss: 1.725199 2025-01-16 01:38:12,738 - INFO - step 8860, loss: 2.168406, best loss: 1.725199 2025-01-16 01:38:12,889 - INFO - step 8861, loss: 2.141931, best loss: 1.725199 2025-01-16 01:38:13,039 - INFO - step 8862, loss: 2.115415, best loss: 1.725199 2025-01-16 01:38:13,189 - INFO - step 8863, loss: 2.250536, best loss: 1.725199 2025-01-16 01:38:13,339 - INFO - step 8864, loss: 2.517134, best loss: 1.725199 2025-01-16 01:38:13,489 - INFO - step 8865, loss: 2.511452, best loss: 1.725199 2025-01-16 01:38:13,640 - INFO - step 8866, loss: 2.390428, best loss: 1.725199 2025-01-16 01:38:13,790 - INFO - step 8867, loss: 2.374691, best loss: 1.725199 2025-01-16 01:38:13,940 - INFO - step 8868, loss: 2.366420, best loss: 1.725199 2025-01-16 01:38:14,090 - INFO - step 8869, loss: 2.314528, best loss: 1.725199 2025-01-16 01:38:14,241 - INFO - step 8870, loss: 2.200911, best loss: 1.725199 2025-01-16 01:38:14,391 - INFO - step 8871, loss: 2.346405, best loss: 1.725199 2025-01-16 01:38:14,541 - INFO - step 8872, loss: 2.208195, best loss: 1.725199 2025-01-16 01:38:14,691 - INFO - step 8873, loss: 2.118723, best loss: 1.725199 2025-01-16 01:38:14,842 - INFO - step 8874, loss: 2.286701, best loss: 1.725199 2025-01-16 01:38:14,992 - INFO - step 8875, loss: 2.222588, best loss: 1.725199 2025-01-16 01:38:15,142 - INFO - step 8876, loss: 2.358716, best loss: 1.725199 2025-01-16 01:38:15,292 - INFO - step 8877, loss: 1.836358, best loss: 1.725199 2025-01-16 01:38:15,442 - INFO - step 8878, loss: 2.284258, best loss: 1.725199 2025-01-16 01:38:15,592 - INFO - step 8879, loss: 2.421691, best loss: 1.725199 2025-01-16 01:38:15,743 - INFO - step 8880, loss: 2.258611, best loss: 1.725199 2025-01-16 01:38:15,893 - INFO - step 8881, loss: 2.252976, best loss: 1.725199 2025-01-16 01:38:16,043 - INFO - step 8882, loss: 2.237527, best loss: 1.725199 2025-01-16 01:38:16,193 - INFO - step 8883, loss: 2.290768, best loss: 1.725199 2025-01-16 01:38:16,343 - INFO - step 8884, loss: 2.146564, best 
loss: 1.725199 2025-01-16 01:38:16,493 - INFO - step 8885, loss: 2.162245, best loss: 1.725199 2025-01-16 01:38:16,643 - INFO - step 8886, loss: 2.142405, best loss: 1.725199 2025-01-16 01:38:16,793 - INFO - step 8887, loss: 2.313473, best loss: 1.725199 2025-01-16 01:38:16,943 - INFO - step 8888, loss: 2.109972, best loss: 1.725199 2025-01-16 01:38:17,093 - INFO - step 8889, loss: 2.131409, best loss: 1.725199 2025-01-16 01:38:17,243 - INFO - step 8890, loss: 2.029039, best loss: 1.725199 2025-01-16 01:38:17,393 - INFO - step 8891, loss: 2.034644, best loss: 1.725199 2025-01-16 01:38:17,543 - INFO - step 8892, loss: 2.250326, best loss: 1.725199 2025-01-16 01:38:17,693 - INFO - step 8893, loss: 2.136741, best loss: 1.725199 2025-01-16 01:38:17,843 - INFO - step 8894, loss: 2.130127, best loss: 1.725199 2025-01-16 01:38:17,993 - INFO - step 8895, loss: 1.886472, best loss: 1.725199 2025-01-16 01:38:18,143 - INFO - step 8896, loss: 1.902645, best loss: 1.725199 2025-01-16 01:38:18,293 - INFO - step 8897, loss: 1.770471, best loss: 1.725199 2025-01-16 01:38:18,443 - INFO - step 8898, loss: 2.282305, best loss: 1.725199 2025-01-16 01:38:18,593 - INFO - step 8899, loss: 2.313311, best loss: 1.725199 2025-01-16 01:38:18,743 - INFO - step 8900, loss: 2.431160, best loss: 1.725199 2025-01-16 01:38:18,893 - INFO - step 8901, loss: 2.546382, best loss: 1.725199 2025-01-16 01:38:19,043 - INFO - step 8902, loss: 2.451069, best loss: 1.725199 2025-01-16 01:38:19,193 - INFO - step 8903, loss: 2.125304, best loss: 1.725199 2025-01-16 01:38:19,343 - INFO - step 8904, loss: 2.193340, best loss: 1.725199 2025-01-16 01:38:19,493 - INFO - step 8905, loss: 2.468240, best loss: 1.725199 2025-01-16 01:38:19,644 - INFO - step 8906, loss: 2.379725, best loss: 1.725199 2025-01-16 01:38:19,794 - INFO - step 8907, loss: 1.775217, best loss: 1.725199 2025-01-16 01:38:19,944 - INFO - step 8908, loss: 2.113276, best loss: 1.725199 2025-01-16 01:38:20,094 - INFO - step 8909, loss: 1.947729, best 
loss: 1.725199 2025-01-16 01:38:20,244 - INFO - step 8910, loss: 2.385917, best loss: 1.725199 2025-01-16 01:38:20,394 - INFO - step 8911, loss: 2.338930, best loss: 1.725199 2025-01-16 01:38:20,544 - INFO - step 8912, loss: 2.326045, best loss: 1.725199 2025-01-16 01:38:20,694 - INFO - step 8913, loss: 2.434842, best loss: 1.725199 2025-01-16 01:38:20,844 - INFO - step 8914, loss: 2.346300, best loss: 1.725199 2025-01-16 01:38:20,994 - INFO - step 8915, loss: 2.057341, best loss: 1.725199 2025-01-16 01:38:21,144 - INFO - step 8916, loss: 2.596727, best loss: 1.725199 2025-01-16 01:38:21,294 - INFO - step 8917, loss: 2.342995, best loss: 1.725199 2025-01-16 01:38:21,444 - INFO - step 8918, loss: 2.520411, best loss: 1.725199 2025-01-16 01:38:21,595 - INFO - step 8919, loss: 2.488711, best loss: 1.725199 2025-01-16 01:38:21,745 - INFO - step 8920, loss: 2.299117, best loss: 1.725199 2025-01-16 01:38:21,895 - INFO - step 8921, loss: 2.362801, best loss: 1.725199 2025-01-16 01:38:22,045 - INFO - step 8922, loss: 2.067396, best loss: 1.725199 2025-01-16 01:38:22,195 - INFO - step 8923, loss: 2.452832, best loss: 1.725199 2025-01-16 01:38:22,345 - INFO - step 8924, loss: 2.508966, best loss: 1.725199 2025-01-16 01:38:22,495 - INFO - step 8925, loss: 2.471153, best loss: 1.725199 2025-01-16 01:38:22,645 - INFO - step 8926, loss: 2.368935, best loss: 1.725199 2025-01-16 01:38:22,795 - INFO - step 8927, loss: 2.258286, best loss: 1.725199 2025-01-16 01:38:22,945 - INFO - step 8928, loss: 2.292824, best loss: 1.725199 2025-01-16 01:38:23,095 - INFO - step 8929, loss: 2.316085, best loss: 1.725199 2025-01-16 01:38:23,246 - INFO - step 8930, loss: 2.094102, best loss: 1.725199 2025-01-16 01:38:23,396 - INFO - step 8931, loss: 2.576431, best loss: 1.725199 2025-01-16 01:38:23,546 - INFO - step 8932, loss: 2.039679, best loss: 1.725199 2025-01-16 01:38:23,696 - INFO - step 8933, loss: 2.065447, best loss: 1.725199 2025-01-16 01:38:23,846 - INFO - step 8934, loss: 2.368095, best 
loss: 1.725199 2025-01-16 01:38:23,996 - INFO - step 8935, loss: 2.418564, best loss: 1.725199 2025-01-16 01:38:24,146 - INFO - step 8936, loss: 2.303119, best loss: 1.725199 2025-01-16 01:38:24,296 - INFO - step 8937, loss: 2.071274, best loss: 1.725199 2025-01-16 01:38:24,446 - INFO - step 8938, loss: 2.246242, best loss: 1.725199 2025-01-16 01:38:24,596 - INFO - step 8939, loss: 2.398947, best loss: 1.725199 2025-01-16 01:38:24,747 - INFO - step 8940, loss: 2.113294, best loss: 1.725199 2025-01-16 01:38:24,897 - INFO - step 8941, loss: 1.967740, best loss: 1.725199 2025-01-16 01:38:25,047 - INFO - step 8942, loss: 2.318679, best loss: 1.725199 2025-01-16 01:38:25,197 - INFO - step 8943, loss: 2.294916, best loss: 1.725199 2025-01-16 01:38:25,347 - INFO - step 8944, loss: 2.148955, best loss: 1.725199 2025-01-16 01:38:25,497 - INFO - step 8945, loss: 1.994550, best loss: 1.725199 2025-01-16 01:38:25,646 - INFO - step 8946, loss: 2.358327, best loss: 1.725199 2025-01-16 01:38:25,796 - INFO - step 8947, loss: 2.576557, best loss: 1.725199 2025-01-16 01:38:25,947 - INFO - step 8948, loss: 2.369885, best loss: 1.725199 2025-01-16 01:38:26,096 - INFO - step 8949, loss: 2.223608, best loss: 1.725199 2025-01-16 01:38:26,246 - INFO - step 8950, loss: 2.451247, best loss: 1.725199 2025-01-16 01:38:26,397 - INFO - step 8951, loss: 2.415413, best loss: 1.725199 2025-01-16 01:38:26,547 - INFO - step 8952, loss: 2.429744, best loss: 1.725199 2025-01-16 01:38:26,697 - INFO - step 8953, loss: 2.172311, best loss: 1.725199 2025-01-16 01:38:26,847 - INFO - step 8954, loss: 2.267370, best loss: 1.725199 2025-01-16 01:38:26,997 - INFO - step 8955, loss: 2.095498, best loss: 1.725199 2025-01-16 01:38:27,146 - INFO - step 8956, loss: 2.569810, best loss: 1.725199 2025-01-16 01:38:27,296 - INFO - step 8957, loss: 2.380616, best loss: 1.725199 2025-01-16 01:38:27,447 - INFO - step 8958, loss: 2.424246, best loss: 1.725199 2025-01-16 01:38:27,597 - INFO - step 8959, loss: 1.968005, best 
loss: 1.725199 2025-01-16 01:38:27,747 - INFO - step 8960, loss: 2.146385, best loss: 1.725199 2025-01-16 01:38:27,897 - INFO - step 8961, loss: 2.315880, best loss: 1.725199 2025-01-16 01:38:28,047 - INFO - step 8962, loss: 2.271998, best loss: 1.725199 2025-01-16 01:38:28,197 - INFO - step 8963, loss: 2.297530, best loss: 1.725199 2025-01-16 01:38:28,347 - INFO - step 8964, loss: 2.337958, best loss: 1.725199 2025-01-16 01:38:28,497 - INFO - step 8965, loss: 2.164093, best loss: 1.725199 2025-01-16 01:38:28,647 - INFO - step 8966, loss: 2.302483, best loss: 1.725199 2025-01-16 01:38:28,798 - INFO - step 8967, loss: 2.292476, best loss: 1.725199 2025-01-16 01:38:28,948 - INFO - step 8968, loss: 1.964291, best loss: 1.725199 2025-01-16 01:38:29,098 - INFO - step 8969, loss: 2.374427, best loss: 1.725199 2025-01-16 01:38:29,248 - INFO - step 8970, loss: 2.344132, best loss: 1.725199 2025-01-16 01:38:29,398 - INFO - step 8971, loss: 2.416024, best loss: 1.725199 2025-01-16 01:38:29,548 - INFO - step 8972, loss: 2.351820, best loss: 1.725199 2025-01-16 01:38:29,699 - INFO - step 8973, loss: 2.423588, best loss: 1.725199 2025-01-16 01:38:29,849 - INFO - step 8974, loss: 2.383606, best loss: 1.725199 2025-01-16 01:38:29,999 - INFO - step 8975, loss: 2.046149, best loss: 1.725199 2025-01-16 01:38:30,149 - INFO - step 8976, loss: 2.409076, best loss: 1.725199 2025-01-16 01:38:30,299 - INFO - step 8977, loss: 1.978508, best loss: 1.725199 2025-01-16 01:38:30,449 - INFO - step 8978, loss: 2.203400, best loss: 1.725199 2025-01-16 01:38:30,599 - INFO - step 8979, loss: 2.279032, best loss: 1.725199 2025-01-16 01:38:30,749 - INFO - step 8980, loss: 2.246216, best loss: 1.725199 2025-01-16 01:38:30,899 - INFO - step 8981, loss: 2.242831, best loss: 1.725199 2025-01-16 01:38:31,049 - INFO - step 8982, loss: 2.346271, best loss: 1.725199 2025-01-16 01:38:31,199 - INFO - step 8983, loss: 2.450628, best loss: 1.725199 2025-01-16 01:38:31,349 - INFO - step 8984, loss: 2.337806, best 
loss: 1.725199 2025-01-16 01:38:31,499 - INFO - step 8985, loss: 2.484560, best loss: 1.725199 2025-01-16 01:38:31,650 - INFO - step 8986, loss: 2.326894, best loss: 1.725199 2025-01-16 01:38:31,800 - INFO - step 8987, loss: 2.355500, best loss: 1.725199 2025-01-16 01:38:31,950 - INFO - step 8988, loss: 2.292098, best loss: 1.725199 2025-01-16 01:38:32,100 - INFO - step 8989, loss: 2.017943, best loss: 1.725199 2025-01-16 01:38:32,250 - INFO - step 8990, loss: 2.419167, best loss: 1.725199 2025-01-16 01:38:32,399 - INFO - step 8991, loss: 2.228553, best loss: 1.725199 2025-01-16 01:38:32,550 - INFO - step 8992, loss: 2.224241, best loss: 1.725199 2025-01-16 01:38:32,700 - INFO - step 8993, loss: 2.214833, best loss: 1.725199 2025-01-16 01:38:32,850 - INFO - step 8994, loss: 2.078529, best loss: 1.725199 2025-01-16 01:38:33,000 - INFO - step 8995, loss: 2.222718, best loss: 1.725199 2025-01-16 01:38:33,150 - INFO - step 8996, loss: 2.081676, best loss: 1.725199 2025-01-16 01:38:33,300 - INFO - step 8997, loss: 2.022828, best loss: 1.725199 2025-01-16 01:38:33,450 - INFO - step 8998, loss: 2.353050, best loss: 1.725199 2025-01-16 01:38:33,600 - INFO - step 8999, loss: 2.353982, best loss: 1.725199 2025-01-16 01:38:33,750 - INFO - step 9000, loss: 2.185991, best loss: 1.725199 2025-01-16 01:38:33,901 - INFO - step 9001, loss: 2.455095, best loss: 1.725199 2025-01-16 01:38:34,051 - INFO - step 9002, loss: 2.199243, best loss: 1.725199 2025-01-16 01:38:34,201 - INFO - step 9003, loss: 2.451817, best loss: 1.725199 2025-01-16 01:38:34,351 - INFO - step 9004, loss: 2.482212, best loss: 1.725199 2025-01-16 01:38:34,501 - INFO - step 9005, loss: 2.536001, best loss: 1.725199 2025-01-16 01:38:34,651 - INFO - step 9006, loss: 2.535597, best loss: 1.725199 2025-01-16 01:38:34,802 - INFO - step 9007, loss: 2.344640, best loss: 1.725199 2025-01-16 01:38:34,951 - INFO - step 9008, loss: 2.422388, best loss: 1.725199 2025-01-16 01:38:35,102 - INFO - step 9009, loss: 2.414300, best 
loss: 1.725199 2025-01-16 01:38:35,251 - INFO - step 9010, loss: 2.308162, best loss: 1.725199 2025-01-16 01:38:35,401 - INFO - step 9011, loss: 2.568786, best loss: 1.725199 2025-01-16 01:38:35,551 - INFO - step 9012, loss: 2.391546, best loss: 1.725199 2025-01-16 01:38:35,701 - INFO - step 9013, loss: 2.608465, best loss: 1.725199 2025-01-16 01:38:35,851 - INFO - step 9014, loss: 2.319267, best loss: 1.725199 2025-01-16 01:38:36,001 - INFO - step 9015, loss: 2.376613, best loss: 1.725199 2025-01-16 01:38:36,151 - INFO - step 9016, loss: 2.392264, best loss: 1.725199 2025-01-16 01:38:36,301 - INFO - step 9017, loss: 2.215002, best loss: 1.725199 2025-01-16 01:38:36,451 - INFO - step 9018, loss: 2.268035, best loss: 1.725199 2025-01-16 01:38:36,601 - INFO - step 9019, loss: 2.502321, best loss: 1.725199 2025-01-16 01:38:36,751 - INFO - step 9020, loss: 2.336190, best loss: 1.725199 2025-01-16 01:38:36,901 - INFO - step 9021, loss: 2.565452, best loss: 1.725199 2025-01-16 01:38:37,051 - INFO - step 9022, loss: 2.444980, best loss: 1.725199 2025-01-16 01:38:37,201 - INFO - step 9023, loss: 2.270144, best loss: 1.725199 2025-01-16 01:38:37,352 - INFO - step 9024, loss: 2.551335, best loss: 1.725199 2025-01-16 01:38:37,502 - INFO - step 9025, loss: 2.346956, best loss: 1.725199 2025-01-16 01:38:37,652 - INFO - step 9026, loss: 2.379020, best loss: 1.725199 2025-01-16 01:38:37,803 - INFO - step 9027, loss: 2.240204, best loss: 1.725199 2025-01-16 01:38:37,953 - INFO - step 9028, loss: 2.257406, best loss: 1.725199 2025-01-16 01:38:38,103 - INFO - step 9029, loss: 2.339488, best loss: 1.725199 2025-01-16 01:38:38,253 - INFO - step 9030, loss: 2.426507, best loss: 1.725199 2025-01-16 01:38:38,403 - INFO - step 9031, loss: 2.386528, best loss: 1.725199 2025-01-16 01:38:38,554 - INFO - step 9032, loss: 2.319509, best loss: 1.725199 2025-01-16 01:38:38,704 - INFO - step 9033, loss: 2.126920, best loss: 1.725199 2025-01-16 01:38:38,854 - INFO - step 9034, loss: 1.981652, best 
loss: 1.725199 2025-01-16 01:38:39,005 - INFO - step 9035, loss: 2.050352, best loss: 1.725199 2025-01-16 01:38:39,155 - INFO - step 9036, loss: 2.238874, best loss: 1.725199 2025-01-16 01:38:39,305 - INFO - step 9037, loss: 2.605960, best loss: 1.725199 2025-01-16 01:38:39,455 - INFO - step 9038, loss: 2.251364, best loss: 1.725199 2025-01-16 01:38:39,606 - INFO - step 9039, loss: 1.896008, best loss: 1.725199 2025-01-16 01:38:39,756 - INFO - step 9040, loss: 2.392277, best loss: 1.725199 2025-01-16 01:38:39,906 - INFO - step 9041, loss: 2.343411, best loss: 1.725199 2025-01-16 01:38:40,056 - INFO - step 9042, loss: 2.619090, best loss: 1.725199 2025-01-16 01:38:40,206 - INFO - step 9043, loss: 2.117500, best loss: 1.725199 2025-01-16 01:38:40,356 - INFO - step 9044, loss: 2.404290, best loss: 1.725199 2025-01-16 01:38:40,507 - INFO - step 9045, loss: 2.213915, best loss: 1.725199 2025-01-16 01:38:40,657 - INFO - step 9046, loss: 2.314229, best loss: 1.725199 2025-01-16 01:38:40,807 - INFO - step 9047, loss: 2.331507, best loss: 1.725199 2025-01-16 01:38:40,957 - INFO - step 9048, loss: 2.252775, best loss: 1.725199 2025-01-16 01:38:41,107 - INFO - step 9049, loss: 2.354659, best loss: 1.725199 2025-01-16 01:38:41,257 - INFO - step 9050, loss: 2.329177, best loss: 1.725199 2025-01-16 01:38:41,408 - INFO - step 9051, loss: 2.283692, best loss: 1.725199 2025-01-16 01:38:41,558 - INFO - step 9052, loss: 2.589717, best loss: 1.725199 2025-01-16 01:38:41,708 - INFO - step 9053, loss: 2.141752, best loss: 1.725199 2025-01-16 01:38:41,858 - INFO - step 9054, loss: 1.845479, best loss: 1.725199 2025-01-16 01:38:42,008 - INFO - step 9055, loss: 2.261017, best loss: 1.725199 2025-01-16 01:38:42,158 - INFO - step 9056, loss: 2.457494, best loss: 1.725199 2025-01-16 01:38:42,309 - INFO - step 9057, loss: 2.361769, best loss: 1.725199 2025-01-16 01:38:42,459 - INFO - step 9058, loss: 2.068273, best loss: 1.725199 2025-01-16 01:38:42,609 - INFO - step 9059, loss: 2.139456, best 
loss: 1.725199 2025-01-16 01:38:42,759 - INFO - step 9060, loss: 2.456891, best loss: 1.725199 2025-01-16 01:38:42,909 - INFO - step 9061, loss: 2.437793, best loss: 1.725199 2025-01-16 01:38:43,059 - INFO - step 9062, loss: 2.336217, best loss: 1.725199 2025-01-16 01:38:43,209 - INFO - step 9063, loss: 2.160536, best loss: 1.725199 2025-01-16 01:38:43,360 - INFO - step 9064, loss: 2.442308, best loss: 1.725199 2025-01-16 01:38:43,510 - INFO - step 9065, loss: 2.326205, best loss: 1.725199 2025-01-16 01:38:43,660 - INFO - step 9066, loss: 2.150963, best loss: 1.725199 2025-01-16 01:38:43,810 - INFO - step 9067, loss: 2.309472, best loss: 1.725199 2025-01-16 01:38:43,960 - INFO - step 9068, loss: 2.184974, best loss: 1.725199 2025-01-16 01:38:44,110 - INFO - step 9069, loss: 2.337205, best loss: 1.725199 2025-01-16 01:38:44,260 - INFO - step 9070, loss: 2.200404, best loss: 1.725199 2025-01-16 01:38:44,411 - INFO - step 9071, loss: 2.208744, best loss: 1.725199 2025-01-16 01:38:44,561 - INFO - step 9072, loss: 2.076510, best loss: 1.725199 2025-01-16 01:38:44,711 - INFO - step 9073, loss: 2.025987, best loss: 1.725199 2025-01-16 01:38:44,861 - INFO - step 9074, loss: 2.402212, best loss: 1.725199 2025-01-16 01:38:45,012 - INFO - step 9075, loss: 2.232567, best loss: 1.725199 2025-01-16 01:38:45,162 - INFO - step 9076, loss: 2.525656, best loss: 1.725199 2025-01-16 01:38:45,312 - INFO - step 9077, loss: 2.260666, best loss: 1.725199 2025-01-16 01:38:45,462 - INFO - step 9078, loss: 2.320776, best loss: 1.725199 2025-01-16 01:38:45,612 - INFO - step 9079, loss: 2.457084, best loss: 1.725199 2025-01-16 01:38:45,763 - INFO - step 9080, loss: 2.152154, best loss: 1.725199 2025-01-16 01:38:45,913 - INFO - step 9081, loss: 2.160115, best loss: 1.725199 2025-01-16 01:38:46,063 - INFO - step 9082, loss: 2.131102, best loss: 1.725199 2025-01-16 01:38:46,213 - INFO - step 9083, loss: 2.292852, best loss: 1.725199 2025-01-16 01:38:46,363 - INFO - step 9084, loss: 2.334335, best 
loss: 1.725199 2025-01-16 01:38:46,514 - INFO - step 9085, loss: 2.381977, best loss: 1.725199 2025-01-16 01:38:46,664 - INFO - step 9086, loss: 2.566175, best loss: 1.725199 2025-01-16 01:38:46,814 - INFO - step 9087, loss: 2.551846, best loss: 1.725199 2025-01-16 01:38:46,964 - INFO - step 9088, loss: 2.437873, best loss: 1.725199 2025-01-16 01:38:47,114 - INFO - step 9089, loss: 2.420825, best loss: 1.725199 2025-01-16 01:38:47,265 - INFO - step 9090, loss: 2.520813, best loss: 1.725199 2025-01-16 01:38:47,415 - INFO - step 9091, loss: 2.170938, best loss: 1.725199 2025-01-16 01:38:47,565 - INFO - step 9092, loss: 2.448036, best loss: 1.725199 2025-01-16 01:38:47,715 - INFO - step 9093, loss: 2.540434, best loss: 1.725199 2025-01-16 01:38:47,865 - INFO - step 9094, loss: 2.334823, best loss: 1.725199 2025-01-16 01:38:48,015 - INFO - step 9095, loss: 2.296863, best loss: 1.725199 2025-01-16 01:38:48,165 - INFO - step 9096, loss: 2.544052, best loss: 1.725199 2025-01-16 01:38:48,315 - INFO - step 9097, loss: 2.289920, best loss: 1.725199 2025-01-16 01:38:51,873 - INFO - step 9098, loss: 1.701074, best loss: 1.701074 2025-01-16 01:38:52,032 - INFO - step 9099, loss: 2.351941, best loss: 1.701074 2025-01-16 01:38:52,183 - INFO - step 9100, loss: 2.376363, best loss: 1.701074 2025-01-16 01:38:52,334 - INFO - step 9101, loss: 2.413211, best loss: 1.701074 2025-01-16 01:38:52,484 - INFO - step 9102, loss: 2.507035, best loss: 1.701074 2025-01-16 01:38:52,634 - INFO - step 9103, loss: 2.304284, best loss: 1.701074 2025-01-16 01:38:52,784 - INFO - step 9104, loss: 2.352357, best loss: 1.701074 2025-01-16 01:38:52,935 - INFO - step 9105, loss: 2.427918, best loss: 1.701074 2025-01-16 01:38:53,085 - INFO - step 9106, loss: 2.329961, best loss: 1.701074 2025-01-16 01:38:53,235 - INFO - step 9107, loss: 2.279321, best loss: 1.701074 2025-01-16 01:38:53,385 - INFO - step 9108, loss: 2.220139, best loss: 1.701074 2025-01-16 01:38:53,535 - INFO - step 9109, loss: 2.130977, best 
loss: 1.701074 2025-01-16 01:38:53,685 - INFO - step 9110, loss: 2.307244, best loss: 1.701074 2025-01-16 01:38:53,836 - INFO - step 9111, loss: 2.212417, best loss: 1.701074 2025-01-16 01:38:53,986 - INFO - step 9112, loss: 2.377722, best loss: 1.701074 2025-01-16 01:38:54,136 - INFO - step 9113, loss: 2.440234, best loss: 1.701074 2025-01-16 01:38:54,286 - INFO - step 9114, loss: 2.252757, best loss: 1.701074 2025-01-16 01:38:54,437 - INFO - step 9115, loss: 2.082098, best loss: 1.701074 2025-01-16 01:38:54,587 - INFO - step 9116, loss: 2.246588, best loss: 1.701074 2025-01-16 01:38:54,737 - INFO - step 9117, loss: 2.274495, best loss: 1.701074 2025-01-16 01:38:54,887 - INFO - step 9118, loss: 2.304449, best loss: 1.701074 2025-01-16 01:38:55,037 - INFO - step 9119, loss: 2.370533, best loss: 1.701074 2025-01-16 01:38:55,188 - INFO - step 9120, loss: 2.265342, best loss: 1.701074 2025-01-16 01:38:55,338 - INFO - step 9121, loss: 2.203747, best loss: 1.701074 2025-01-16 01:38:55,488 - INFO - step 9122, loss: 2.332206, best loss: 1.701074 2025-01-16 01:38:55,638 - INFO - step 9123, loss: 2.203912, best loss: 1.701074 2025-01-16 01:38:55,788 - INFO - step 9124, loss: 2.290957, best loss: 1.701074 2025-01-16 01:38:55,939 - INFO - step 9125, loss: 2.383110, best loss: 1.701074 2025-01-16 01:38:56,089 - INFO - step 9126, loss: 2.471499, best loss: 1.701074 2025-01-16 01:38:56,239 - INFO - step 9127, loss: 2.379444, best loss: 1.701074 2025-01-16 01:38:56,389 - INFO - step 9128, loss: 2.239379, best loss: 1.701074 2025-01-16 01:38:56,540 - INFO - step 9129, loss: 2.257661, best loss: 1.701074 2025-01-16 01:38:56,690 - INFO - step 9130, loss: 2.423009, best loss: 1.701074 2025-01-16 01:38:56,840 - INFO - step 9131, loss: 2.603276, best loss: 1.701074 2025-01-16 01:38:56,990 - INFO - step 9132, loss: 2.386404, best loss: 1.701074 2025-01-16 01:38:57,140 - INFO - step 9133, loss: 2.579751, best loss: 1.701074 2025-01-16 01:38:57,291 - INFO - step 9134, loss: 2.642494, best 
loss: 1.701074 2025-01-16 01:38:57,441 - INFO - step 9135, loss: 2.305861, best loss: 1.701074 2025-01-16 01:38:57,591 - INFO - step 9136, loss: 2.558137, best loss: 1.701074 2025-01-16 01:38:57,741 - INFO - step 9137, loss: 2.307816, best loss: 1.701074 2025-01-16 01:38:57,892 - INFO - step 9138, loss: 2.050147, best loss: 1.701074 2025-01-16 01:38:58,042 - INFO - step 9139, loss: 2.467444, best loss: 1.701074 2025-01-16 01:38:58,192 - INFO - step 9140, loss: 2.484300, best loss: 1.701074 2025-01-16 01:38:58,342 - INFO - step 9141, loss: 2.269784, best loss: 1.701074 2025-01-16 01:38:58,492 - INFO - step 9142, loss: 2.059568, best loss: 1.701074 2025-01-16 01:38:58,642 - INFO - step 9143, loss: 2.386780, best loss: 1.701074 2025-01-16 01:38:58,793 - INFO - step 9144, loss: 2.374445, best loss: 1.701074 2025-01-16 01:38:58,943 - INFO - step 9145, loss: 2.270778, best loss: 1.701074 2025-01-16 01:38:59,093 - INFO - step 9146, loss: 2.400121, best loss: 1.701074 2025-01-16 01:38:59,243 - INFO - step 9147, loss: 2.176326, best loss: 1.701074 2025-01-16 01:38:59,393 - INFO - step 9148, loss: 2.015623, best loss: 1.701074 2025-01-16 01:38:59,544 - INFO - step 9149, loss: 2.537182, best loss: 1.701074 2025-01-16 01:38:59,694 - INFO - step 9150, loss: 2.486511, best loss: 1.701074 2025-01-16 01:38:59,844 - INFO - step 9151, loss: 2.549991, best loss: 1.701074 2025-01-16 01:38:59,994 - INFO - step 9152, loss: 2.350123, best loss: 1.701074 2025-01-16 01:39:00,143 - INFO - step 9153, loss: 2.354605, best loss: 1.701074 2025-01-16 01:39:00,293 - INFO - step 9154, loss: 2.412023, best loss: 1.701074 2025-01-16 01:39:00,444 - INFO - step 9155, loss: 2.152768, best loss: 1.701074 2025-01-16 01:39:00,594 - INFO - step 9156, loss: 2.142810, best loss: 1.701074 2025-01-16 01:39:00,744 - INFO - step 9157, loss: 2.504714, best loss: 1.701074 2025-01-16 01:39:00,894 - INFO - step 9158, loss: 2.061768, best loss: 1.701074 2025-01-16 01:39:01,044 - INFO - step 9159, loss: 1.917626, best 
loss: 1.701074 2025-01-16 01:39:01,194 - INFO - step 9160, loss: 2.239114, best loss: 1.701074 2025-01-16 01:39:01,344 - INFO - step 9161, loss: 2.279151, best loss: 1.701074 2025-01-16 01:39:01,494 - INFO - step 9162, loss: 2.310949, best loss: 1.701074 2025-01-16 01:39:01,644 - INFO - step 9163, loss: 1.921176, best loss: 1.701074 2025-01-16 01:39:01,794 - INFO - step 9164, loss: 1.876253, best loss: 1.701074 2025-01-16 01:39:01,945 - INFO - step 9165, loss: 1.772200, best loss: 1.701074 2025-01-16 01:39:02,095 - INFO - step 9166, loss: 2.033774, best loss: 1.701074 2025-01-16 01:39:02,245 - INFO - step 9167, loss: 2.226075, best loss: 1.701074 2025-01-16 01:39:02,395 - INFO - step 9168, loss: 2.349083, best loss: 1.701074 2025-01-16 01:39:02,545 - INFO - step 9169, loss: 2.265678, best loss: 1.701074 2025-01-16 01:39:02,695 - INFO - step 9170, loss: 2.245346, best loss: 1.701074 2025-01-16 01:39:02,845 - INFO - step 9171, loss: 2.214055, best loss: 1.701074 2025-01-16 01:39:02,995 - INFO - step 9172, loss: 2.251942, best loss: 1.701074 2025-01-16 01:39:03,144 - INFO - step 9173, loss: 2.033494, best loss: 1.701074 2025-01-16 01:39:03,294 - INFO - step 9174, loss: 2.274032, best loss: 1.701074 2025-01-16 01:39:03,444 - INFO - step 9175, loss: 2.230356, best loss: 1.701074 2025-01-16 01:39:03,594 - INFO - step 9176, loss: 1.991312, best loss: 1.701074 2025-01-16 01:39:03,744 - INFO - step 9177, loss: 1.970011, best loss: 1.701074 2025-01-16 01:39:03,894 - INFO - step 9178, loss: 2.064931, best loss: 1.701074 2025-01-16 01:39:04,044 - INFO - step 9179, loss: 2.225171, best loss: 1.701074 2025-01-16 01:39:04,194 - INFO - step 9180, loss: 1.976425, best loss: 1.701074 2025-01-16 01:39:04,344 - INFO - step 9181, loss: 2.036432, best loss: 1.701074 2025-01-16 01:39:04,493 - INFO - step 9182, loss: 2.139152, best loss: 1.701074 2025-01-16 01:39:04,643 - INFO - step 9183, loss: 2.014455, best loss: 1.701074 2025-01-16 01:39:04,794 - INFO - step 9184, loss: 2.078275, best 
loss: 1.701074 2025-01-16 01:39:04,944 - INFO - step 9185, loss: 2.280144, best loss: 1.701074 2025-01-16 01:39:05,094 - INFO - step 9186, loss: 2.069927, best loss: 1.701074 2025-01-16 01:39:05,244 - INFO - step 9187, loss: 2.149363, best loss: 1.701074 2025-01-16 01:39:05,394 - INFO - step 9188, loss: 2.093342, best loss: 1.701074 2025-01-16 01:39:05,544 - INFO - step 9189, loss: 2.281864, best loss: 1.701074 2025-01-16 01:39:05,694 - INFO - step 9190, loss: 2.089078, best loss: 1.701074 2025-01-16 01:39:05,844 - INFO - step 9191, loss: 2.033550, best loss: 1.701074 2025-01-16 01:39:05,994 - INFO - step 9192, loss: 2.043699, best loss: 1.701074 2025-01-16 01:39:06,144 - INFO - step 9193, loss: 2.224941, best loss: 1.701074 2025-01-16 01:39:06,295 - INFO - step 9194, loss: 2.429835, best loss: 1.701074 2025-01-16 01:39:06,445 - INFO - step 9195, loss: 2.372440, best loss: 1.701074 2025-01-16 01:39:06,595 - INFO - step 9196, loss: 2.250709, best loss: 1.701074 2025-01-16 01:39:06,745 - INFO - step 9197, loss: 2.302110, best loss: 1.701074 2025-01-16 01:39:06,895 - INFO - step 9198, loss: 2.276365, best loss: 1.701074 2025-01-16 01:39:07,045 - INFO - step 9199, loss: 2.204346, best loss: 1.701074 2025-01-16 01:39:07,195 - INFO - step 9200, loss: 2.068494, best loss: 1.701074 2025-01-16 01:39:07,346 - INFO - step 9201, loss: 2.245325, best loss: 1.701074 2025-01-16 01:39:07,496 - INFO - step 9202, loss: 2.066486, best loss: 1.701074 2025-01-16 01:39:07,646 - INFO - step 9203, loss: 2.005502, best loss: 1.701074 2025-01-16 01:39:07,796 - INFO - step 9204, loss: 2.116015, best loss: 1.701074 2025-01-16 01:39:07,946 - INFO - step 9205, loss: 2.134938, best loss: 1.701074 2025-01-16 01:39:08,096 - INFO - step 9206, loss: 2.274927, best loss: 1.701074 2025-01-16 01:39:08,247 - INFO - step 9207, loss: 1.767701, best loss: 1.701074 2025-01-16 01:39:08,397 - INFO - step 9208, loss: 2.257342, best loss: 1.701074 2025-01-16 01:39:08,547 - INFO - step 9209, loss: 2.253488, best 
loss: 1.701074 2025-01-16 01:39:08,697 - INFO - step 9210, loss: 2.173563, best loss: 1.701074 2025-01-16 01:39:08,848 - INFO - step 9211, loss: 2.199015, best loss: 1.701074 2025-01-16 01:39:08,999 - INFO - step 9212, loss: 2.075799, best loss: 1.701074 2025-01-16 01:39:09,149 - INFO - step 9213, loss: 2.195705, best loss: 1.701074 2025-01-16 01:39:09,300 - INFO - step 9214, loss: 2.042409, best loss: 1.701074 2025-01-16 01:39:09,450 - INFO - step 9215, loss: 2.083961, best loss: 1.701074 2025-01-16 01:39:09,601 - INFO - step 9216, loss: 2.073594, best loss: 1.701074 2025-01-16 01:39:09,751 - INFO - step 9217, loss: 2.204205, best loss: 1.701074 2025-01-16 01:39:09,901 - INFO - step 9218, loss: 2.049368, best loss: 1.701074 2025-01-16 01:39:10,052 - INFO - step 9219, loss: 2.099355, best loss: 1.701074 2025-01-16 01:39:10,202 - INFO - step 9220, loss: 1.965479, best loss: 1.701074 2025-01-16 01:39:10,352 - INFO - step 9221, loss: 1.945536, best loss: 1.701074 2025-01-16 01:39:10,502 - INFO - step 9222, loss: 2.150023, best loss: 1.701074 2025-01-16 01:39:10,652 - INFO - step 9223, loss: 2.059273, best loss: 1.701074 2025-01-16 01:39:10,802 - INFO - step 9224, loss: 2.063866, best loss: 1.701074 2025-01-16 01:39:10,952 - INFO - step 9225, loss: 1.790104, best loss: 1.701074 2025-01-16 01:39:11,103 - INFO - step 9226, loss: 1.777264, best loss: 1.701074 2025-01-16 01:39:11,253 - INFO - step 9227, loss: 1.704464, best loss: 1.701074 2025-01-16 01:39:11,403 - INFO - step 9228, loss: 2.206567, best loss: 1.701074 2025-01-16 01:39:11,554 - INFO - step 9229, loss: 2.162040, best loss: 1.701074 2025-01-16 01:39:11,704 - INFO - step 9230, loss: 2.308955, best loss: 1.701074 2025-01-16 01:39:11,854 - INFO - step 9231, loss: 2.392045, best loss: 1.701074 2025-01-16 01:39:12,004 - INFO - step 9232, loss: 2.255125, best loss: 1.701074 2025-01-16 01:39:12,154 - INFO - step 9233, loss: 2.018746, best loss: 1.701074 2025-01-16 01:39:12,304 - INFO - step 9234, loss: 2.105690, best 
loss: 1.701074 2025-01-16 01:39:12,454 - INFO - step 9235, loss: 2.337318, best loss: 1.701074 2025-01-16 01:39:12,604 - INFO - step 9236, loss: 2.253645, best loss: 1.701074 2025-01-16 01:39:16,023 - INFO - step 9237, loss: 1.661517, best loss: 1.661517 2025-01-16 01:39:16,173 - INFO - step 9238, loss: 2.073863, best loss: 1.661517 2025-01-16 01:39:16,323 - INFO - step 9239, loss: 1.922688, best loss: 1.661517 2025-01-16 01:39:16,474 - INFO - step 9240, loss: 2.315726, best loss: 1.661517 2025-01-16 01:39:16,624 - INFO - step 9241, loss: 2.224417, best loss: 1.661517 2025-01-16 01:39:16,774 - INFO - step 9242, loss: 2.234059, best loss: 1.661517 2025-01-16 01:39:16,925 - INFO - step 9243, loss: 2.333075, best loss: 1.661517 2025-01-16 01:39:17,075 - INFO - step 9244, loss: 2.270090, best loss: 1.661517 2025-01-16 01:39:17,225 - INFO - step 9245, loss: 1.962448, best loss: 1.661517 2025-01-16 01:39:17,375 - INFO - step 9246, loss: 2.470251, best loss: 1.661517 2025-01-16 01:39:17,525 - INFO - step 9247, loss: 2.244987, best loss: 1.661517 2025-01-16 01:39:17,676 - INFO - step 9248, loss: 2.387850, best loss: 1.661517 2025-01-16 01:39:17,826 - INFO - step 9249, loss: 2.391679, best loss: 1.661517 2025-01-16 01:39:17,976 - INFO - step 9250, loss: 2.160854, best loss: 1.661517 2025-01-16 01:39:18,126 - INFO - step 9251, loss: 2.244857, best loss: 1.661517 2025-01-16 01:39:18,276 - INFO - step 9252, loss: 2.019992, best loss: 1.661517 2025-01-16 01:39:18,427 - INFO - step 9253, loss: 2.344022, best loss: 1.661517 2025-01-16 01:39:18,577 - INFO - step 9254, loss: 2.451700, best loss: 1.661517 2025-01-16 01:39:18,727 - INFO - step 9255, loss: 2.414917, best loss: 1.661517 2025-01-16 01:39:18,877 - INFO - step 9256, loss: 2.249285, best loss: 1.661517 2025-01-16 01:39:19,027 - INFO - step 9257, loss: 2.140344, best loss: 1.661517 2025-01-16 01:39:19,178 - INFO - step 9258, loss: 2.237074, best loss: 1.661517 2025-01-16 01:39:19,328 - INFO - step 9259, loss: 2.210724, best 
loss: 1.661517 2025-01-16 01:39:19,478 - INFO - step 9260, loss: 2.013969, best loss: 1.661517 2025-01-16 01:39:19,628 - INFO - step 9261, loss: 2.500490, best loss: 1.661517 2025-01-16 01:39:19,779 - INFO - step 9262, loss: 1.851217, best loss: 1.661517 2025-01-16 01:39:19,929 - INFO - step 9263, loss: 2.029834, best loss: 1.661517 2025-01-16 01:39:20,079 - INFO - step 9264, loss: 2.282715, best loss: 1.661517 2025-01-16 01:39:20,229 - INFO - step 9265, loss: 2.384597, best loss: 1.661517 2025-01-16 01:39:20,380 - INFO - step 9266, loss: 2.220034, best loss: 1.661517 2025-01-16 01:39:20,531 - INFO - step 9267, loss: 2.048906, best loss: 1.661517 2025-01-16 01:39:20,681 - INFO - step 9268, loss: 2.239870, best loss: 1.661517 2025-01-16 01:39:20,831 - INFO - step 9269, loss: 2.318947, best loss: 1.661517 2025-01-16 01:39:20,981 - INFO - step 9270, loss: 2.046602, best loss: 1.661517 2025-01-16 01:39:21,131 - INFO - step 9271, loss: 1.992516, best loss: 1.661517 2025-01-16 01:39:21,281 - INFO - step 9272, loss: 2.247311, best loss: 1.661517 2025-01-16 01:39:21,431 - INFO - step 9273, loss: 2.244154, best loss: 1.661517 2025-01-16 01:39:21,581 - INFO - step 9274, loss: 2.060287, best loss: 1.661517 2025-01-16 01:39:21,731 - INFO - step 9275, loss: 1.924436, best loss: 1.661517 2025-01-16 01:39:21,881 - INFO - step 9276, loss: 2.244170, best loss: 1.661517 2025-01-16 01:39:22,032 - INFO - step 9277, loss: 2.357088, best loss: 1.661517 2025-01-16 01:39:22,182 - INFO - step 9278, loss: 2.232442, best loss: 1.661517 2025-01-16 01:39:22,370 - INFO - step 9279, loss: 2.117268, best loss: 1.661517 2025-01-16 01:39:22,520 - INFO - step 9280, loss: 2.416309, best loss: 1.661517 2025-01-16 01:39:22,671 - INFO - step 9281, loss: 2.352809, best loss: 1.661517 2025-01-16 01:39:22,821 - INFO - step 9282, loss: 2.289052, best loss: 1.661517 2025-01-16 01:39:22,971 - INFO - step 9283, loss: 2.130059, best loss: 1.661517 2025-01-16 01:39:23,121 - INFO - step 9284, loss: 2.162825, best 
loss: 1.661517 2025-01-16 01:39:23,271 - INFO - step 9285, loss: 2.111118, best loss: 1.661517 2025-01-16 01:39:23,421 - INFO - step 9286, loss: 2.525249, best loss: 1.661517 2025-01-16 01:39:23,571 - INFO - step 9287, loss: 2.335713, best loss: 1.661517 2025-01-16 01:39:23,721 - INFO - step 9288, loss: 2.321897, best loss: 1.661517 2025-01-16 01:39:23,871 - INFO - step 9289, loss: 1.884946, best loss: 1.661517 2025-01-16 01:39:24,021 - INFO - step 9290, loss: 2.018115, best loss: 1.661517 2025-01-16 01:39:24,171 - INFO - step 9291, loss: 2.236040, best loss: 1.661517 2025-01-16 01:39:24,321 - INFO - step 9292, loss: 2.203741, best loss: 1.661517 2025-01-16 01:39:24,471 - INFO - step 9293, loss: 2.197857, best loss: 1.661517 2025-01-16 01:39:24,622 - INFO - step 9294, loss: 2.328176, best loss: 1.661517 2025-01-16 01:39:24,772 - INFO - step 9295, loss: 2.101938, best loss: 1.661517 2025-01-16 01:39:24,922 - INFO - step 9296, loss: 2.153707, best loss: 1.661517 2025-01-16 01:39:25,073 - INFO - step 9297, loss: 2.257522, best loss: 1.661517 2025-01-16 01:39:25,223 - INFO - step 9298, loss: 1.918999, best loss: 1.661517 2025-01-16 01:39:25,373 - INFO - step 9299, loss: 2.258520, best loss: 1.661517 2025-01-16 01:39:25,523 - INFO - step 9300, loss: 2.301625, best loss: 1.661517 2025-01-16 01:39:25,673 - INFO - step 9301, loss: 2.358346, best loss: 1.661517 2025-01-16 01:39:25,823 - INFO - step 9302, loss: 2.316722, best loss: 1.661517 2025-01-16 01:39:25,974 - INFO - step 9303, loss: 2.348849, best loss: 1.661517 2025-01-16 01:39:26,123 - INFO - step 9304, loss: 2.342592, best loss: 1.661517 2025-01-16 01:39:26,274 - INFO - step 9305, loss: 2.049676, best loss: 1.661517 2025-01-16 01:39:26,424 - INFO - step 9306, loss: 2.377175, best loss: 1.661517 2025-01-16 01:39:26,574 - INFO - step 9307, loss: 1.926592, best loss: 1.661517 2025-01-16 01:39:26,724 - INFO - step 9308, loss: 2.107348, best loss: 1.661517 2025-01-16 01:39:26,875 - INFO - step 9309, loss: 2.211139, best 
loss: 1.661517 2025-01-16 01:39:27,025 - INFO - step 9310, loss: 2.164022, best loss: 1.661517 2025-01-16 01:39:27,175 - INFO - step 9311, loss: 2.240890, best loss: 1.661517 2025-01-16 01:39:27,325 - INFO - step 9312, loss: 2.310061, best loss: 1.661517 2025-01-16 01:39:27,475 - INFO - step 9313, loss: 2.389675, best loss: 1.661517 2025-01-16 01:39:27,625 - INFO - step 9314, loss: 2.299663, best loss: 1.661517 2025-01-16 01:39:27,775 - INFO - step 9315, loss: 2.395731, best loss: 1.661517 2025-01-16 01:39:27,925 - INFO - step 9316, loss: 2.258055, best loss: 1.661517 2025-01-16 01:39:28,075 - INFO - step 9317, loss: 2.237195, best loss: 1.661517 2025-01-16 01:39:28,226 - INFO - step 9318, loss: 2.111804, best loss: 1.661517 2025-01-16 01:39:28,376 - INFO - step 9319, loss: 1.944365, best loss: 1.661517 2025-01-16 01:39:28,526 - INFO - step 9320, loss: 2.393399, best loss: 1.661517 2025-01-16 01:39:28,676 - INFO - step 9321, loss: 2.162470, best loss: 1.661517 2025-01-16 01:39:28,826 - INFO - step 9322, loss: 2.199831, best loss: 1.661517 2025-01-16 01:39:28,976 - INFO - step 9323, loss: 2.096278, best loss: 1.661517 2025-01-16 01:39:29,126 - INFO - step 9324, loss: 1.961734, best loss: 1.661517 2025-01-16 01:39:29,276 - INFO - step 9325, loss: 2.171335, best loss: 1.661517 2025-01-16 01:39:29,426 - INFO - step 9326, loss: 1.996764, best loss: 1.661517 2025-01-16 01:39:29,577 - INFO - step 9327, loss: 1.995031, best loss: 1.661517 2025-01-16 01:39:29,727 - INFO - step 9328, loss: 2.286223, best loss: 1.661517 2025-01-16 01:39:29,877 - INFO - step 9329, loss: 2.316584, best loss: 1.661517 2025-01-16 01:39:30,027 - INFO - step 9330, loss: 2.193790, best loss: 1.661517 2025-01-16 01:39:30,176 - INFO - step 9331, loss: 2.368505, best loss: 1.661517 2025-01-16 01:39:30,326 - INFO - step 9332, loss: 2.076246, best loss: 1.661517 2025-01-16 01:39:30,476 - INFO - step 9333, loss: 2.371336, best loss: 1.661517 2025-01-16 01:39:30,626 - INFO - step 9334, loss: 2.419954, best 
loss: 1.661517 2025-01-16 01:39:30,776 - INFO - step 9335, loss: 2.472852, best loss: 1.661517 2025-01-16 01:39:30,927 - INFO - step 9336, loss: 2.454813, best loss: 1.661517 2025-01-16 01:39:31,077 - INFO - step 9337, loss: 2.219732, best loss: 1.661517 2025-01-16 01:39:31,227 - INFO - step 9338, loss: 2.303470, best loss: 1.661517 2025-01-16 01:39:31,377 - INFO - step 9339, loss: 2.297785, best loss: 1.661517 2025-01-16 01:39:31,527 - INFO - step 9340, loss: 2.240536, best loss: 1.661517 2025-01-16 01:39:31,677 - INFO - step 9341, loss: 2.422437, best loss: 1.661517 2025-01-16 01:39:31,827 - INFO - step 9342, loss: 2.250667, best loss: 1.661517 2025-01-16 01:39:31,977 - INFO - step 9343, loss: 2.552535, best loss: 1.661517 2025-01-16 01:39:32,127 - INFO - step 9344, loss: 2.205882, best loss: 1.661517 2025-01-16 01:39:32,277 - INFO - step 9345, loss: 2.201649, best loss: 1.661517 2025-01-16 01:39:32,427 - INFO - step 9346, loss: 2.276951, best loss: 1.661517 2025-01-16 01:39:32,577 - INFO - step 9347, loss: 2.191811, best loss: 1.661517 2025-01-16 01:39:32,728 - INFO - step 9348, loss: 2.202886, best loss: 1.661517 2025-01-16 01:39:32,877 - INFO - step 9349, loss: 2.411693, best loss: 1.661517 2025-01-16 01:39:33,027 - INFO - step 9350, loss: 2.264958, best loss: 1.661517 2025-01-16 01:39:33,177 - INFO - step 9351, loss: 2.525405, best loss: 1.661517 2025-01-16 01:39:33,327 - INFO - step 9352, loss: 2.314428, best loss: 1.661517 2025-01-16 01:39:33,477 - INFO - step 9353, loss: 2.160770, best loss: 1.661517 2025-01-16 01:39:33,627 - INFO - step 9354, loss: 2.483678, best loss: 1.661517 2025-01-16 01:39:33,777 - INFO - step 9355, loss: 2.298761, best loss: 1.661517 2025-01-16 01:39:33,927 - INFO - step 9356, loss: 2.280729, best loss: 1.661517 2025-01-16 01:39:34,077 - INFO - step 9357, loss: 2.216903, best loss: 1.661517 2025-01-16 01:39:34,228 - INFO - step 9358, loss: 2.208586, best loss: 1.661517 2025-01-16 01:39:34,378 - INFO - step 9359, loss: 2.264307, best 
loss: 1.661517 2025-01-16 01:39:34,528 - INFO - step 9360, loss: 2.322121, best loss: 1.661517 2025-01-16 01:39:34,678 - INFO - step 9361, loss: 2.303263, best loss: 1.661517 2025-01-16 01:39:34,828 - INFO - step 9362, loss: 2.298091, best loss: 1.661517 2025-01-16 01:39:34,978 - INFO - step 9363, loss: 2.046175, best loss: 1.661517 2025-01-16 01:39:35,128 - INFO - step 9364, loss: 2.001344, best loss: 1.661517 2025-01-16 01:39:35,278 - INFO - step 9365, loss: 2.003695, best loss: 1.661517 2025-01-16 01:39:35,428 - INFO - step 9366, loss: 2.220916, best loss: 1.661517 2025-01-16 01:39:35,578 - INFO - step 9367, loss: 2.534645, best loss: 1.661517 2025-01-16 01:39:35,728 - INFO - step 9368, loss: 2.140543, best loss: 1.661517 2025-01-16 01:39:35,878 - INFO - step 9369, loss: 1.782811, best loss: 1.661517 2025-01-16 01:39:36,028 - INFO - step 9370, loss: 2.283186, best loss: 1.661517 2025-01-16 01:39:36,179 - INFO - step 9371, loss: 2.256135, best loss: 1.661517 2025-01-16 01:39:36,328 - INFO - step 9372, loss: 2.515530, best loss: 1.661517 2025-01-16 01:39:36,479 - INFO - step 9373, loss: 1.981352, best loss: 1.661517 2025-01-16 01:39:36,629 - INFO - step 9374, loss: 2.356784, best loss: 1.661517 2025-01-16 01:39:36,779 - INFO - step 9375, loss: 2.121251, best loss: 1.661517 2025-01-16 01:39:36,929 - INFO - step 9376, loss: 2.120928, best loss: 1.661517 2025-01-16 01:39:37,079 - INFO - step 9377, loss: 2.202650, best loss: 1.661517 2025-01-16 01:39:37,229 - INFO - step 9378, loss: 2.134168, best loss: 1.661517 2025-01-16 01:39:37,379 - INFO - step 9379, loss: 2.251353, best loss: 1.661517 2025-01-16 01:39:37,528 - INFO - step 9380, loss: 2.263860, best loss: 1.661517 2025-01-16 01:39:37,678 - INFO - step 9381, loss: 2.157970, best loss: 1.661517 2025-01-16 01:39:37,828 - INFO - step 9382, loss: 2.459993, best loss: 1.661517 2025-01-16 01:39:37,978 - INFO - step 9383, loss: 2.039100, best loss: 1.661517 2025-01-16 01:39:38,128 - INFO - step 9384, loss: 1.696311, best 
loss: 1.661517 2025-01-16 01:39:38,278 - INFO - step 9385, loss: 2.199703, best loss: 1.661517 2025-01-16 01:39:38,428 - INFO - step 9386, loss: 2.423853, best loss: 1.661517 2025-01-16 01:39:38,578 - INFO - step 9387, loss: 2.223951, best loss: 1.661517 2025-01-16 01:39:38,728 - INFO - step 9388, loss: 1.951547, best loss: 1.661517 2025-01-16 01:39:38,878 - INFO - step 9389, loss: 2.033283, best loss: 1.661517 2025-01-16 01:39:39,028 - INFO - step 9390, loss: 2.315329, best loss: 1.661517 2025-01-16 01:39:39,178 - INFO - step 9391, loss: 2.279613, best loss: 1.661517 2025-01-16 01:39:39,328 - INFO - step 9392, loss: 2.208955, best loss: 1.661517 2025-01-16 01:39:39,478 - INFO - step 9393, loss: 2.069892, best loss: 1.661517 2025-01-16 01:39:39,628 - INFO - step 9394, loss: 2.309928, best loss: 1.661517 2025-01-16 01:39:39,778 - INFO - step 9395, loss: 2.280652, best loss: 1.661517 2025-01-16 01:39:39,928 - INFO - step 9396, loss: 2.040087, best loss: 1.661517 2025-01-16 01:39:40,078 - INFO - step 9397, loss: 2.189286, best loss: 1.661517 2025-01-16 01:39:40,229 - INFO - step 9398, loss: 2.107512, best loss: 1.661517 2025-01-16 01:39:40,379 - INFO - step 9399, loss: 2.220676, best loss: 1.661517 2025-01-16 01:39:40,529 - INFO - step 9400, loss: 2.032771, best loss: 1.661517 2025-01-16 01:39:40,680 - INFO - step 9401, loss: 2.132595, best loss: 1.661517 2025-01-16 01:39:40,830 - INFO - step 9402, loss: 1.961379, best loss: 1.661517 2025-01-16 01:39:40,980 - INFO - step 9403, loss: 1.889779, best loss: 1.661517 2025-01-16 01:39:41,130 - INFO - step 9404, loss: 2.247873, best loss: 1.661517 2025-01-16 01:39:41,280 - INFO - step 9405, loss: 2.164854, best loss: 1.661517 2025-01-16 01:39:41,431 - INFO - step 9406, loss: 2.433670, best loss: 1.661517 2025-01-16 01:39:41,581 - INFO - step 9407, loss: 2.158266, best loss: 1.661517 2025-01-16 01:39:41,731 - INFO - step 9408, loss: 2.251400, best loss: 1.661517 2025-01-16 01:39:41,881 - INFO - step 9409, loss: 2.312567, best 
loss: 1.661517 2025-01-16 01:39:42,031 - INFO - step 9410, loss: 2.011539, best loss: 1.661517 2025-01-16 01:39:42,181 - INFO - step 9411, loss: 1.994931, best loss: 1.661517 2025-01-16 01:39:42,331 - INFO - step 9412, loss: 2.069672, best loss: 1.661517 2025-01-16 01:39:42,481 - INFO - step 9413, loss: 2.223751, best loss: 1.661517 2025-01-16 01:39:42,631 - INFO - step 9414, loss: 2.216119, best loss: 1.661517 2025-01-16 01:39:42,781 - INFO - step 9415, loss: 2.211125, best loss: 1.661517 2025-01-16 01:39:42,931 - INFO - step 9416, loss: 2.511001, best loss: 1.661517 2025-01-16 01:39:43,081 - INFO - step 9417, loss: 2.468449, best loss: 1.661517 2025-01-16 01:39:43,231 - INFO - step 9418, loss: 2.334179, best loss: 1.661517 2025-01-16 01:39:43,381 - INFO - step 9419, loss: 2.377432, best loss: 1.661517 2025-01-16 01:39:43,531 - INFO - step 9420, loss: 2.474674, best loss: 1.661517 2025-01-16 01:39:43,681 - INFO - step 9421, loss: 2.105733, best loss: 1.661517 2025-01-16 01:39:43,830 - INFO - step 9422, loss: 2.382328, best loss: 1.661517 2025-01-16 01:39:43,980 - INFO - step 9423, loss: 2.470596, best loss: 1.661517 2025-01-16 01:39:44,130 - INFO - step 9424, loss: 2.250531, best loss: 1.661517 2025-01-16 01:39:44,280 - INFO - step 9425, loss: 2.229833, best loss: 1.661517 2025-01-16 01:39:44,430 - INFO - step 9426, loss: 2.444427, best loss: 1.661517 2025-01-16 01:39:44,580 - INFO - step 9427, loss: 2.205935, best loss: 1.661517 2025-01-16 01:39:48,030 - INFO - step 9428, loss: 1.659492, best loss: 1.659492 2025-01-16 01:39:48,191 - INFO - step 9429, loss: 2.280672, best loss: 1.659492 2025-01-16 01:39:48,342 - INFO - step 9430, loss: 2.193903, best loss: 1.659492 2025-01-16 01:39:48,492 - INFO - step 9431, loss: 2.326900, best loss: 1.659492 2025-01-16 01:39:48,642 - INFO - step 9432, loss: 2.361686, best loss: 1.659492 2025-01-16 01:39:48,792 - INFO - step 9433, loss: 2.173695, best loss: 1.659492 2025-01-16 01:39:48,943 - INFO - step 9434, loss: 2.323455, best 
loss: 1.659492 2025-01-16 01:39:49,093 - INFO - step 9435, loss: 2.345275, best loss: 1.659492 2025-01-16 01:39:49,243 - INFO - step 9436, loss: 2.235187, best loss: 1.659492 2025-01-16 01:39:49,393 - INFO - step 9437, loss: 2.152153, best loss: 1.659492 2025-01-16 01:39:49,543 - INFO - step 9438, loss: 2.171896, best loss: 1.659492 2025-01-16 01:39:49,693 - INFO - step 9439, loss: 2.028571, best loss: 1.659492 2025-01-16 01:39:49,843 - INFO - step 9440, loss: 2.174887, best loss: 1.659492 2025-01-16 01:39:49,993 - INFO - step 9441, loss: 2.122171, best loss: 1.659492 2025-01-16 01:39:50,143 - INFO - step 9442, loss: 2.266875, best loss: 1.659492 2025-01-16 01:39:50,293 - INFO - step 9443, loss: 2.358264, best loss: 1.659492 2025-01-16 01:39:50,443 - INFO - step 9444, loss: 2.180831, best loss: 1.659492 2025-01-16 01:39:50,593 - INFO - step 9445, loss: 2.029390, best loss: 1.659492 2025-01-16 01:39:50,743 - INFO - step 9446, loss: 2.257792, best loss: 1.659492 2025-01-16 01:39:50,893 - INFO - step 9447, loss: 2.191161, best loss: 1.659492 2025-01-16 01:39:51,043 - INFO - step 9448, loss: 2.196538, best loss: 1.659492 2025-01-16 01:39:51,193 - INFO - step 9449, loss: 2.284096, best loss: 1.659492 2025-01-16 01:39:51,343 - INFO - step 9450, loss: 2.228724, best loss: 1.659492 2025-01-16 01:39:51,493 - INFO - step 9451, loss: 2.100068, best loss: 1.659492 2025-01-16 01:39:51,643 - INFO - step 9452, loss: 2.277410, best loss: 1.659492 2025-01-16 01:39:51,793 - INFO - step 9453, loss: 2.136186, best loss: 1.659492 2025-01-16 01:39:51,943 - INFO - step 9454, loss: 2.199864, best loss: 1.659492 2025-01-16 01:39:52,093 - INFO - step 9455, loss: 2.314551, best loss: 1.659492 2025-01-16 01:39:52,243 - INFO - step 9456, loss: 2.444650, best loss: 1.659492 2025-01-16 01:39:52,392 - INFO - step 9457, loss: 2.375807, best loss: 1.659492 2025-01-16 01:39:52,542 - INFO - step 9458, loss: 2.180210, best loss: 1.659492 2025-01-16 01:39:52,692 - INFO - step 9459, loss: 2.198467, best 
loss: 1.659492 2025-01-16 01:39:52,842 - INFO - step 9460, loss: 2.434376, best loss: 1.659492 2025-01-16 01:39:52,993 - INFO - step 9461, loss: 2.399293, best loss: 1.659492 2025-01-16 01:39:53,143 - INFO - step 9462, loss: 2.421210, best loss: 1.659492 2025-01-16 01:39:53,293 - INFO - step 9463, loss: 2.465379, best loss: 1.659492 2025-01-16 01:39:53,443 - INFO - step 9464, loss: 2.542986, best loss: 1.659492 2025-01-16 01:39:53,593 - INFO - step 9465, loss: 2.207714, best loss: 1.659492 2025-01-16 01:39:53,743 - INFO - step 9466, loss: 2.480529, best loss: 1.659492 2025-01-16 01:39:53,893 - INFO - step 9467, loss: 2.161731, best loss: 1.659492 2025-01-16 01:39:54,043 - INFO - step 9468, loss: 1.930367, best loss: 1.659492 2025-01-16 01:39:54,193 - INFO - step 9469, loss: 2.395738, best loss: 1.659492 2025-01-16 01:39:54,343 - INFO - step 9470, loss: 2.398411, best loss: 1.659492 2025-01-16 01:39:54,493 - INFO - step 9471, loss: 2.130797, best loss: 1.659492 2025-01-16 01:39:54,643 - INFO - step 9472, loss: 1.979991, best loss: 1.659492 2025-01-16 01:39:54,793 - INFO - step 9473, loss: 2.278917, best loss: 1.659492 2025-01-16 01:39:54,943 - INFO - step 9474, loss: 2.320696, best loss: 1.659492 2025-01-16 01:39:55,093 - INFO - step 9475, loss: 2.215223, best loss: 1.659492 2025-01-16 01:39:55,243 - INFO - step 9476, loss: 2.332249, best loss: 1.659492 2025-01-16 01:39:55,393 - INFO - step 9477, loss: 2.133180, best loss: 1.659492 2025-01-16 01:39:55,543 - INFO - step 9478, loss: 1.982763, best loss: 1.659492 2025-01-16 01:39:55,693 - INFO - step 9479, loss: 2.467762, best loss: 1.659492 2025-01-16 01:39:55,843 - INFO - step 9480, loss: 2.425829, best loss: 1.659492 2025-01-16 01:39:55,994 - INFO - step 9481, loss: 2.502854, best loss: 1.659492 2025-01-16 01:39:56,144 - INFO - step 9482, loss: 2.361619, best loss: 1.659492 2025-01-16 01:39:56,294 - INFO - step 9483, loss: 2.272797, best loss: 1.659492 2025-01-16 01:39:56,444 - INFO - step 9484, loss: 2.354519, best 
loss: 1.659492 2025-01-16 01:39:56,594 - INFO - step 9485, loss: 2.136102, best loss: 1.659492 2025-01-16 01:39:56,744 - INFO - step 9486, loss: 2.057843, best loss: 1.659492 2025-01-16 01:39:56,894 - INFO - step 9487, loss: 2.347217, best loss: 1.659492 2025-01-16 01:39:57,044 - INFO - step 9488, loss: 1.966931, best loss: 1.659492 2025-01-16 01:39:57,194 - INFO - step 9489, loss: 1.789304, best loss: 1.659492 2025-01-16 01:39:57,344 - INFO - step 9490, loss: 2.268444, best loss: 1.659492 2025-01-16 01:39:57,494 - INFO - step 9491, loss: 2.157051, best loss: 1.659492 2025-01-16 01:39:57,644 - INFO - step 9492, loss: 2.278158, best loss: 1.659492 2025-01-16 01:39:57,794 - INFO - step 9493, loss: 1.833762, best loss: 1.659492 2025-01-16 01:39:57,944 - INFO - step 9494, loss: 1.793635, best loss: 1.659492 2025-01-16 01:39:58,094 - INFO - step 9495, loss: 1.663751, best loss: 1.659492 2025-01-16 01:39:58,244 - INFO - step 9496, loss: 1.966367, best loss: 1.659492 2025-01-16 01:39:58,394 - INFO - step 9497, loss: 2.138013, best loss: 1.659492 2025-01-16 01:39:58,544 - INFO - step 9498, loss: 2.205795, best loss: 1.659492 2025-01-16 01:39:58,694 - INFO - step 9499, loss: 2.209901, best loss: 1.659492 2025-01-16 01:39:58,844 - INFO - step 9500, loss: 2.160581, best loss: 1.659492 2025-01-16 01:39:58,994 - INFO - step 9501, loss: 2.113910, best loss: 1.659492 2025-01-16 01:39:59,144 - INFO - step 9502, loss: 2.158745, best loss: 1.659492 2025-01-16 01:39:59,294 - INFO - step 9503, loss: 1.942799, best loss: 1.659492 2025-01-16 01:39:59,445 - INFO - step 9504, loss: 2.124151, best loss: 1.659492 2025-01-16 01:39:59,595 - INFO - step 9505, loss: 2.153943, best loss: 1.659492 2025-01-16 01:39:59,745 - INFO - step 9506, loss: 1.887425, best loss: 1.659492 2025-01-16 01:39:59,895 - INFO - step 9507, loss: 1.867231, best loss: 1.659492 2025-01-16 01:40:00,045 - INFO - step 9508, loss: 2.019221, best loss: 1.659492 2025-01-16 01:40:00,195 - INFO - step 9509, loss: 2.178053, best 
loss: 1.659492 2025-01-16 01:40:00,345 - INFO - step 9510, loss: 1.895388, best loss: 1.659492 2025-01-16 01:40:00,495 - INFO - step 9511, loss: 1.947065, best loss: 1.659492 2025-01-16 01:40:00,645 - INFO - step 9512, loss: 2.082875, best loss: 1.659492 2025-01-16 01:40:00,795 - INFO - step 9513, loss: 1.875706, best loss: 1.659492 2025-01-16 01:40:00,945 - INFO - step 9514, loss: 2.018365, best loss: 1.659492 2025-01-16 01:40:01,095 - INFO - step 9515, loss: 2.277225, best loss: 1.659492 2025-01-16 01:40:01,244 - INFO - step 9516, loss: 2.007063, best loss: 1.659492 2025-01-16 01:40:01,395 - INFO - step 9517, loss: 2.087082, best loss: 1.659492 2025-01-16 01:40:01,545 - INFO - step 9518, loss: 1.998500, best loss: 1.659492 2025-01-16 01:40:01,695 - INFO - step 9519, loss: 2.259203, best loss: 1.659492 2025-01-16 01:40:01,845 - INFO - step 9520, loss: 1.962884, best loss: 1.659492 2025-01-16 01:40:01,995 - INFO - step 9521, loss: 1.969713, best loss: 1.659492 2025-01-16 01:40:02,144 - INFO - step 9522, loss: 1.959544, best loss: 1.659492 2025-01-16 01:40:02,295 - INFO - step 9523, loss: 2.160241, best loss: 1.659492 2025-01-16 01:40:02,445 - INFO - step 9524, loss: 2.405971, best loss: 1.659492 2025-01-16 01:40:02,595 - INFO - step 9525, loss: 2.312682, best loss: 1.659492 2025-01-16 01:40:02,745 - INFO - step 9526, loss: 2.236374, best loss: 1.659492 2025-01-16 01:40:02,896 - INFO - step 9527, loss: 2.249576, best loss: 1.659492 2025-01-16 01:40:03,045 - INFO - step 9528, loss: 2.181771, best loss: 1.659492 2025-01-16 01:40:03,195 - INFO - step 9529, loss: 2.150192, best loss: 1.659492 2025-01-16 01:40:03,345 - INFO - step 9530, loss: 2.043310, best loss: 1.659492 2025-01-16 01:40:03,494 - INFO - step 9531, loss: 2.105755, best loss: 1.659492 2025-01-16 01:40:03,644 - INFO - step 9532, loss: 1.976348, best loss: 1.659492 2025-01-16 01:40:03,795 - INFO - step 9533, loss: 1.920294, best loss: 1.659492 2025-01-16 01:40:03,945 - INFO - step 9534, loss: 2.067158, best 
loss: 1.659492 2025-01-16 01:40:04,095 - INFO - step 9535, loss: 2.050815, best loss: 1.659492 2025-01-16 01:40:04,245 - INFO - step 9536, loss: 2.155730, best loss: 1.659492 2025-01-16 01:40:04,396 - INFO - step 9537, loss: 1.711103, best loss: 1.659492 2025-01-16 01:40:04,546 - INFO - step 9538, loss: 2.179203, best loss: 1.659492 2025-01-16 01:40:04,696 - INFO - step 9539, loss: 2.153918, best loss: 1.659492 2025-01-16 01:40:04,847 - INFO - step 9540, loss: 2.065568, best loss: 1.659492 2025-01-16 01:40:04,997 - INFO - step 9541, loss: 2.063568, best loss: 1.659492 2025-01-16 01:40:05,147 - INFO - step 9542, loss: 1.998978, best loss: 1.659492 2025-01-16 01:40:05,297 - INFO - step 9543, loss: 2.080805, best loss: 1.659492 2025-01-16 01:40:05,448 - INFO - step 9544, loss: 1.929357, best loss: 1.659492 2025-01-16 01:40:05,598 - INFO - step 9545, loss: 2.005401, best loss: 1.659492 2025-01-16 01:40:05,748 - INFO - step 9546, loss: 1.998644, best loss: 1.659492 2025-01-16 01:40:05,899 - INFO - step 9547, loss: 2.087080, best loss: 1.659492 2025-01-16 01:40:06,049 - INFO - step 9548, loss: 2.011575, best loss: 1.659492 2025-01-16 01:40:06,199 - INFO - step 9549, loss: 2.011068, best loss: 1.659492 2025-01-16 01:40:06,349 - INFO - step 9550, loss: 1.835009, best loss: 1.659492 2025-01-16 01:40:06,499 - INFO - step 9551, loss: 1.868716, best loss: 1.659492 2025-01-16 01:40:06,649 - INFO - step 9552, loss: 2.013839, best loss: 1.659492 2025-01-16 01:40:06,799 - INFO - step 9553, loss: 1.970504, best loss: 1.659492 2025-01-16 01:40:06,949 - INFO - step 9554, loss: 1.966859, best loss: 1.659492 2025-01-16 01:40:07,099 - INFO - step 9555, loss: 1.725928, best loss: 1.659492 2025-01-16 01:40:10,604 - INFO - step 9556, loss: 1.657997, best loss: 1.657997 2025-01-16 01:40:14,153 - INFO - step 9557, loss: 1.657102, best loss: 1.657102 2025-01-16 01:40:14,303 - INFO - step 9558, loss: 2.071379, best loss: 1.657102 2025-01-16 01:40:14,453 - INFO - step 9559, loss: 2.098553, best 
loss: 1.657102 2025-01-16 01:40:14,603 - INFO - step 9560, loss: 2.238473, best loss: 1.657102 2025-01-16 01:40:14,753 - INFO - step 9561, loss: 2.370748, best loss: 1.657102 2025-01-16 01:40:14,903 - INFO - step 9562, loss: 2.180346, best loss: 1.657102 2025-01-16 01:40:15,053 - INFO - step 9563, loss: 1.942883, best loss: 1.657102 2025-01-16 01:40:15,204 - INFO - step 9564, loss: 1.979333, best loss: 1.657102 2025-01-16 01:40:15,354 - INFO - step 9565, loss: 2.182418, best loss: 1.657102 2025-01-16 01:40:15,505 - INFO - step 9566, loss: 2.123828, best loss: 1.657102 2025-01-16 01:40:19,217 - INFO - step 9567, loss: 1.629223, best loss: 1.629223 2025-01-16 01:40:19,367 - INFO - step 9568, loss: 1.881729, best loss: 1.629223 2025-01-16 01:40:19,517 - INFO - step 9569, loss: 1.854132, best loss: 1.629223 2025-01-16 01:40:19,667 - INFO - step 9570, loss: 2.173594, best loss: 1.629223 2025-01-16 01:40:19,818 - INFO - step 9571, loss: 2.191448, best loss: 1.629223 2025-01-16 01:40:19,968 - INFO - step 9572, loss: 2.218401, best loss: 1.629223 2025-01-16 01:40:20,118 - INFO - step 9573, loss: 2.261947, best loss: 1.629223 2025-01-16 01:40:20,268 - INFO - step 9574, loss: 2.220874, best loss: 1.629223 2025-01-16 01:40:20,418 - INFO - step 9575, loss: 1.895899, best loss: 1.629223 2025-01-16 01:40:20,569 - INFO - step 9576, loss: 2.338668, best loss: 1.629223 2025-01-16 01:40:20,719 - INFO - step 9577, loss: 2.177716, best loss: 1.629223 2025-01-16 01:40:20,869 - INFO - step 9578, loss: 2.316938, best loss: 1.629223 2025-01-16 01:40:21,019 - INFO - step 9579, loss: 2.333199, best loss: 1.629223 2025-01-16 01:40:21,170 - INFO - step 9580, loss: 1.998310, best loss: 1.629223 2025-01-16 01:40:21,321 - INFO - step 9581, loss: 2.052407, best loss: 1.629223 2025-01-16 01:40:21,471 - INFO - step 9582, loss: 1.890007, best loss: 1.629223 2025-01-16 01:40:21,621 - INFO - step 9583, loss: 2.204558, best loss: 1.629223 2025-01-16 01:40:21,771 - INFO - step 9584, loss: 2.344018, best 
loss: 1.629223 2025-01-16 01:40:21,922 - INFO - step 9585, loss: 2.257822, best loss: 1.629223 2025-01-16 01:40:22,072 - INFO - step 9586, loss: 2.198313, best loss: 1.629223 2025-01-16 01:40:22,222 - INFO - step 9587, loss: 2.086509, best loss: 1.629223 2025-01-16 01:40:22,372 - INFO - step 9588, loss: 2.130193, best loss: 1.629223 2025-01-16 01:40:22,523 - INFO - step 9589, loss: 2.093558, best loss: 1.629223 2025-01-16 01:40:22,673 - INFO - step 9590, loss: 1.949015, best loss: 1.629223 2025-01-16 01:40:22,823 - INFO - step 9591, loss: 2.346904, best loss: 1.629223 2025-01-16 01:40:22,973 - INFO - step 9592, loss: 1.725576, best loss: 1.629223 2025-01-16 01:40:23,123 - INFO - step 9593, loss: 1.961315, best loss: 1.629223 2025-01-16 01:40:23,273 - INFO - step 9594, loss: 2.207083, best loss: 1.629223 2025-01-16 01:40:23,423 - INFO - step 9595, loss: 2.276223, best loss: 1.629223 2025-01-16 01:40:23,574 - INFO - step 9596, loss: 2.073396, best loss: 1.629223 2025-01-16 01:40:23,724 - INFO - step 9597, loss: 1.926725, best loss: 1.629223 2025-01-16 01:40:23,874 - INFO - step 9598, loss: 2.106409, best loss: 1.629223 2025-01-16 01:40:24,024 - INFO - step 9599, loss: 2.242373, best loss: 1.629223 2025-01-16 01:40:24,175 - INFO - step 9600, loss: 1.946137, best loss: 1.629223 2025-01-16 01:40:24,325 - INFO - step 9601, loss: 1.935263, best loss: 1.629223 2025-01-16 01:40:24,475 - INFO - step 9602, loss: 2.136619, best loss: 1.629223 2025-01-16 01:40:24,625 - INFO - step 9603, loss: 2.125599, best loss: 1.629223 2025-01-16 01:40:24,775 - INFO - step 9604, loss: 2.014420, best loss: 1.629223 2025-01-16 01:40:24,925 - INFO - step 9605, loss: 1.834085, best loss: 1.629223 2025-01-16 01:40:25,075 - INFO - step 9606, loss: 2.155371, best loss: 1.629223 2025-01-16 01:40:25,225 - INFO - step 9607, loss: 2.269814, best loss: 1.629223 2025-01-16 01:40:25,376 - INFO - step 9608, loss: 2.128967, best loss: 1.629223 2025-01-16 01:40:25,526 - INFO - step 9609, loss: 2.073401, best 
loss: 1.629223 2025-01-16 01:40:25,676 - INFO - step 9610, loss: 2.239959, best loss: 1.629223 2025-01-16 01:40:25,826 - INFO - step 9611, loss: 2.263391, best loss: 1.629223 2025-01-16 01:40:25,976 - INFO - step 9612, loss: 2.229403, best loss: 1.629223 2025-01-16 01:40:26,126 - INFO - step 9613, loss: 2.044685, best loss: 1.629223 2025-01-16 01:40:26,276 - INFO - step 9614, loss: 2.022943, best loss: 1.629223 2025-01-16 01:40:26,427 - INFO - step 9615, loss: 2.003525, best loss: 1.629223 2025-01-16 01:40:26,577 - INFO - step 9616, loss: 2.388243, best loss: 1.629223 2025-01-16 01:40:26,727 - INFO - step 9617, loss: 2.216265, best loss: 1.629223 2025-01-16 01:40:26,877 - INFO - step 9618, loss: 2.215235, best loss: 1.629223 2025-01-16 01:40:27,027 - INFO - step 9619, loss: 1.797493, best loss: 1.629223 2025-01-16 01:40:27,177 - INFO - step 9620, loss: 1.943457, best loss: 1.629223 2025-01-16 01:40:27,327 - INFO - step 9621, loss: 2.154578, best loss: 1.629223 2025-01-16 01:40:27,477 - INFO - step 9622, loss: 2.140391, best loss: 1.629223 2025-01-16 01:40:27,628 - INFO - step 9623, loss: 2.160929, best loss: 1.629223 2025-01-16 01:40:27,778 - INFO - step 9624, loss: 2.148343, best loss: 1.629223 2025-01-16 01:40:27,929 - INFO - step 9625, loss: 2.076204, best loss: 1.629223 2025-01-16 01:40:28,079 - INFO - step 9626, loss: 2.048077, best loss: 1.629223 2025-01-16 01:40:28,229 - INFO - step 9627, loss: 2.106105, best loss: 1.629223 2025-01-16 01:40:28,379 - INFO - step 9628, loss: 1.784787, best loss: 1.629223 2025-01-16 01:40:28,530 - INFO - step 9629, loss: 2.176763, best loss: 1.629223 2025-01-16 01:40:28,680 - INFO - step 9630, loss: 2.198343, best loss: 1.629223 2025-01-16 01:40:28,830 - INFO - step 9631, loss: 2.277640, best loss: 1.629223 2025-01-16 01:40:28,980 - INFO - step 9632, loss: 2.150904, best loss: 1.629223 2025-01-16 01:40:29,130 - INFO - step 9633, loss: 2.240626, best loss: 1.629223 2025-01-16 01:40:29,281 - INFO - step 9634, loss: 2.203479, best 
loss: 1.629223 2025-01-16 01:40:29,431 - INFO - step 9635, loss: 1.873189, best loss: 1.629223 2025-01-16 01:40:29,581 - INFO - step 9636, loss: 2.216895, best loss: 1.629223 2025-01-16 01:40:29,732 - INFO - step 9637, loss: 1.832942, best loss: 1.629223 2025-01-16 01:40:29,882 - INFO - step 9638, loss: 2.025434, best loss: 1.629223 2025-01-16 01:40:30,032 - INFO - step 9639, loss: 2.038771, best loss: 1.629223 2025-01-16 01:40:30,183 - INFO - step 9640, loss: 1.993115, best loss: 1.629223 2025-01-16 01:40:30,333 - INFO - step 9641, loss: 2.076873, best loss: 1.629223 2025-01-16 01:40:30,483 - INFO - step 9642, loss: 2.192861, best loss: 1.629223 2025-01-16 01:40:30,633 - INFO - step 9643, loss: 2.298642, best loss: 1.629223 2025-01-16 01:40:30,783 - INFO - step 9644, loss: 2.172228, best loss: 1.629223 2025-01-16 01:40:30,933 - INFO - step 9645, loss: 2.220311, best loss: 1.629223 2025-01-16 01:40:31,084 - INFO - step 9646, loss: 2.094722, best loss: 1.629223 2025-01-16 01:40:31,234 - INFO - step 9647, loss: 2.143844, best loss: 1.629223 2025-01-16 01:40:31,384 - INFO - step 9648, loss: 2.056152, best loss: 1.629223 2025-01-16 01:40:31,534 - INFO - step 9649, loss: 1.854695, best loss: 1.629223 2025-01-16 01:40:31,684 - INFO - step 9650, loss: 2.241019, best loss: 1.629223 2025-01-16 01:40:31,834 - INFO - step 9651, loss: 2.030821, best loss: 1.629223 2025-01-16 01:40:31,985 - INFO - step 9652, loss: 2.048975, best loss: 1.629223 2025-01-16 01:40:32,135 - INFO - step 9653, loss: 2.007193, best loss: 1.629223 2025-01-16 01:40:32,285 - INFO - step 9654, loss: 1.901173, best loss: 1.629223 2025-01-16 01:40:32,435 - INFO - step 9655, loss: 2.008079, best loss: 1.629223 2025-01-16 01:40:32,585 - INFO - step 9656, loss: 1.833951, best loss: 1.629223 2025-01-16 01:40:32,736 - INFO - step 9657, loss: 1.870692, best loss: 1.629223 2025-01-16 01:40:32,886 - INFO - step 9658, loss: 2.085060, best loss: 1.629223 2025-01-16 01:40:33,036 - INFO - step 9659, loss: 2.161555, best 
loss: 1.629223 2025-01-16 01:40:33,186 - INFO - step 9660, loss: 2.014949, best loss: 1.629223 2025-01-16 01:40:33,337 - INFO - step 9661, loss: 2.178766, best loss: 1.629223 2025-01-16 01:40:33,487 - INFO - step 9662, loss: 2.021447, best loss: 1.629223 2025-01-16 01:40:33,637 - INFO - step 9663, loss: 2.257785, best loss: 1.629223 2025-01-16 01:40:33,787 - INFO - step 9664, loss: 2.305856, best loss: 1.629223 2025-01-16 01:40:33,938 - INFO - step 9665, loss: 2.333627, best loss: 1.629223 2025-01-16 01:40:34,088 - INFO - step 9666, loss: 2.326005, best loss: 1.629223 2025-01-16 01:40:34,238 - INFO - step 9667, loss: 2.085622, best loss: 1.629223 2025-01-16 01:40:34,388 - INFO - step 9668, loss: 2.207547, best loss: 1.629223 2025-01-16 01:40:34,539 - INFO - step 9669, loss: 2.215160, best loss: 1.629223 2025-01-16 01:40:34,689 - INFO - step 9670, loss: 2.154613, best loss: 1.629223 2025-01-16 01:40:34,840 - INFO - step 9671, loss: 2.377940, best loss: 1.629223 2025-01-16 01:40:34,990 - INFO - step 9672, loss: 2.194894, best loss: 1.629223 2025-01-16 01:40:35,141 - INFO - step 9673, loss: 2.462719, best loss: 1.629223 2025-01-16 01:40:35,291 - INFO - step 9674, loss: 2.093875, best loss: 1.629223 2025-01-16 01:40:35,441 - INFO - step 9675, loss: 2.086422, best loss: 1.629223 2025-01-16 01:40:35,591 - INFO - step 9676, loss: 2.174225, best loss: 1.629223 2025-01-16 01:40:35,741 - INFO - step 9677, loss: 2.156713, best loss: 1.629223 2025-01-16 01:40:35,892 - INFO - step 9678, loss: 2.114533, best loss: 1.629223 2025-01-16 01:40:36,042 - INFO - step 9679, loss: 2.297635, best loss: 1.629223 2025-01-16 01:40:36,192 - INFO - step 9680, loss: 2.238472, best loss: 1.629223 2025-01-16 01:40:36,343 - INFO - step 9681, loss: 2.366347, best loss: 1.629223 2025-01-16 01:40:36,493 - INFO - step 9682, loss: 2.163882, best loss: 1.629223 2025-01-16 01:40:36,643 - INFO - step 9683, loss: 2.093959, best loss: 1.629223 2025-01-16 01:40:36,793 - INFO - step 9684, loss: 2.363073, best 
loss: 1.629223 2025-01-16 01:40:36,943 - INFO - step 9685, loss: 2.206090, best loss: 1.629223 2025-01-16 01:40:37,094 - INFO - step 9686, loss: 2.172819, best loss: 1.629223 2025-01-16 01:40:37,244 - INFO - step 9687, loss: 2.149051, best loss: 1.629223 2025-01-16 01:40:37,394 - INFO - step 9688, loss: 2.077682, best loss: 1.629223 2025-01-16 01:40:37,544 - INFO - step 9689, loss: 2.193859, best loss: 1.629223 2025-01-16 01:40:37,694 - INFO - step 9690, loss: 2.272865, best loss: 1.629223 2025-01-16 01:40:37,844 - INFO - step 9691, loss: 2.220881, best loss: 1.629223 2025-01-16 01:40:37,994 - INFO - step 9692, loss: 2.143232, best loss: 1.629223 2025-01-16 01:40:38,144 - INFO - step 9693, loss: 1.997166, best loss: 1.629223 2025-01-16 01:40:38,294 - INFO - step 9694, loss: 1.875782, best loss: 1.629223 2025-01-16 01:40:38,445 - INFO - step 9695, loss: 1.990293, best loss: 1.629223 2025-01-16 01:40:38,595 - INFO - step 9696, loss: 2.123833, best loss: 1.629223 2025-01-16 01:40:38,745 - INFO - step 9697, loss: 2.397121, best loss: 1.629223 2025-01-16 01:40:38,895 - INFO - step 9698, loss: 2.118874, best loss: 1.629223 2025-01-16 01:40:39,045 - INFO - step 9699, loss: 1.740420, best loss: 1.629223 2025-01-16 01:40:39,195 - INFO - step 9700, loss: 2.160337, best loss: 1.629223 2025-01-16 01:40:39,346 - INFO - step 9701, loss: 2.197434, best loss: 1.629223 2025-01-16 01:40:39,496 - INFO - step 9702, loss: 2.457634, best loss: 1.629223 2025-01-16 01:40:39,646 - INFO - step 9703, loss: 1.904818, best loss: 1.629223 2025-01-16 01:40:39,796 - INFO - step 9704, loss: 2.248121, best loss: 1.629223 2025-01-16 01:40:39,946 - INFO - step 9705, loss: 2.022643, best loss: 1.629223 2025-01-16 01:40:40,096 - INFO - step 9706, loss: 2.063265, best loss: 1.629223 2025-01-16 01:40:40,246 - INFO - step 9707, loss: 2.077796, best loss: 1.629223 2025-01-16 01:40:40,397 - INFO - step 9708, loss: 2.140156, best loss: 1.629223 2025-01-16 01:40:40,547 - INFO - step 9709, loss: 2.166317, best 
loss: 1.629223 2025-01-16 01:40:40,697 - INFO - step 9710, loss: 2.190195, best loss: 1.629223 2025-01-16 01:40:40,847 - INFO - step 9711, loss: 2.057269, best loss: 1.629223 2025-01-16 01:40:40,997 - INFO - step 9712, loss: 2.337323, best loss: 1.629223 2025-01-16 01:40:41,147 - INFO - step 9713, loss: 1.899497, best loss: 1.629223 2025-01-16 01:40:41,297 - INFO - step 9714, loss: 1.719925, best loss: 1.629223 2025-01-16 01:40:41,447 - INFO - step 9715, loss: 2.093425, best loss: 1.629223 2025-01-16 01:40:41,597 - INFO - step 9716, loss: 2.354099, best loss: 1.629223 2025-01-16 01:40:41,748 - INFO - step 9717, loss: 2.145487, best loss: 1.629223 2025-01-16 01:40:41,898 - INFO - step 9718, loss: 1.907618, best loss: 1.629223 2025-01-16 01:40:42,048 - INFO - step 9719, loss: 2.072573, best loss: 1.629223 2025-01-16 01:40:42,198 - INFO - step 9720, loss: 2.295696, best loss: 1.629223 2025-01-16 01:40:42,348 - INFO - step 9721, loss: 2.297415, best loss: 1.629223 2025-01-16 01:40:42,498 - INFO - step 9722, loss: 2.192866, best loss: 1.629223 2025-01-16 01:40:42,649 - INFO - step 9723, loss: 1.989790, best loss: 1.629223 2025-01-16 01:40:42,799 - INFO - step 9724, loss: 2.232769, best loss: 1.629223 2025-01-16 01:40:42,949 - INFO - step 9725, loss: 2.168239, best loss: 1.629223 2025-01-16 01:40:43,100 - INFO - step 9726, loss: 1.988216, best loss: 1.629223 2025-01-16 01:40:43,250 - INFO - step 9727, loss: 2.156901, best loss: 1.629223 2025-01-16 01:40:43,400 - INFO - step 9728, loss: 2.074075, best loss: 1.629223 2025-01-16 01:40:43,550 - INFO - step 9729, loss: 2.148746, best loss: 1.629223 2025-01-16 01:40:43,701 - INFO - step 9730, loss: 2.007961, best loss: 1.629223 2025-01-16 01:40:43,851 - INFO - step 9731, loss: 2.073245, best loss: 1.629223 2025-01-16 01:40:44,001 - INFO - step 9732, loss: 1.919161, best loss: 1.629223 2025-01-16 01:40:44,151 - INFO - step 9733, loss: 1.821865, best loss: 1.629223 2025-01-16 01:40:44,301 - INFO - step 9734, loss: 2.148726, best 
loss: 1.629223 2025-01-16 01:40:44,451 - INFO - step 9735, loss: 2.083715, best loss: 1.629223 2025-01-16 01:40:44,602 - INFO - step 9736, loss: 2.325214, best loss: 1.629223 2025-01-16 01:40:44,752 - INFO - step 9737, loss: 2.063533, best loss: 1.629223 2025-01-16 01:40:44,902 - INFO - step 9738, loss: 2.193524, best loss: 1.629223 2025-01-16 01:40:45,052 - INFO - step 9739, loss: 2.195200, best loss: 1.629223 2025-01-16 01:40:45,202 - INFO - step 9740, loss: 1.952151, best loss: 1.629223 2025-01-16 01:40:45,353 - INFO - step 9741, loss: 1.898143, best loss: 1.629223 2025-01-16 01:40:45,503 - INFO - step 9742, loss: 1.933746, best loss: 1.629223 2025-01-16 01:40:45,653 - INFO - step 9743, loss: 2.160886, best loss: 1.629223 2025-01-16 01:40:45,804 - INFO - step 9744, loss: 2.152739, best loss: 1.629223 2025-01-16 01:40:45,954 - INFO - step 9745, loss: 2.162232, best loss: 1.629223 2025-01-16 01:40:46,104 - INFO - step 9746, loss: 2.446802, best loss: 1.629223 2025-01-16 01:40:46,254 - INFO - step 9747, loss: 2.299261, best loss: 1.629223 2025-01-16 01:40:46,404 - INFO - step 9748, loss: 2.291137, best loss: 1.629223 2025-01-16 01:40:46,554 - INFO - step 9749, loss: 2.253717, best loss: 1.629223 2025-01-16 01:40:46,705 - INFO - step 9750, loss: 2.420931, best loss: 1.629223 2025-01-16 01:40:46,855 - INFO - step 9751, loss: 2.027498, best loss: 1.629223 2025-01-16 01:40:47,005 - INFO - step 9752, loss: 2.289316, best loss: 1.629223 2025-01-16 01:40:47,155 - INFO - step 9753, loss: 2.316254, best loss: 1.629223 2025-01-16 01:40:47,305 - INFO - step 9754, loss: 2.171696, best loss: 1.629223 2025-01-16 01:40:47,455 - INFO - step 9755, loss: 2.166307, best loss: 1.629223 2025-01-16 01:40:47,605 - INFO - step 9756, loss: 2.323407, best loss: 1.629223 2025-01-16 01:40:47,755 - INFO - step 9757, loss: 2.153364, best loss: 1.629223 2025-01-16 01:40:51,057 - INFO - step 9758, loss: 1.532623, best loss: 1.532623 2025-01-16 01:40:51,218 - INFO - step 9759, loss: 2.171752, best 
loss: 1.532623 2025-01-16 01:40:51,369 - INFO - step 9760, loss: 2.111829, best loss: 1.532623 2025-01-16 01:40:51,520 - INFO - step 9761, loss: 2.211746, best loss: 1.532623 2025-01-16 01:40:51,670 - INFO - step 9762, loss: 2.258671, best loss: 1.532623 2025-01-16 01:40:51,820 - INFO - step 9763, loss: 2.081398, best loss: 1.532623 2025-01-16 01:40:51,970 - INFO - step 9764, loss: 2.192096, best loss: 1.532623 2025-01-16 01:40:52,120 - INFO - step 9765, loss: 2.224197, best loss: 1.532623 2025-01-16 01:40:52,271 - INFO - step 9766, loss: 2.120509, best loss: 1.532623 2025-01-16 01:40:52,421 - INFO - step 9767, loss: 2.058697, best loss: 1.532623 2025-01-16 01:40:52,571 - INFO - step 9768, loss: 2.068226, best loss: 1.532623 2025-01-16 01:40:52,721 - INFO - step 9769, loss: 1.963494, best loss: 1.532623 2025-01-16 01:40:52,871 - INFO - step 9770, loss: 2.096200, best loss: 1.532623 2025-01-16 01:40:53,022 - INFO - step 9771, loss: 2.060208, best loss: 1.532623 2025-01-16 01:40:53,172 - INFO - step 9772, loss: 2.154383, best loss: 1.532623 2025-01-16 01:40:53,322 - INFO - step 9773, loss: 2.247623, best loss: 1.532623 2025-01-16 01:40:53,473 - INFO - step 9774, loss: 2.100549, best loss: 1.532623 2025-01-16 01:40:53,622 - INFO - step 9775, loss: 1.903310, best loss: 1.532623 2025-01-16 01:40:53,773 - INFO - step 9776, loss: 2.062531, best loss: 1.532623 2025-01-16 01:40:53,923 - INFO - step 9777, loss: 2.098378, best loss: 1.532623 2025-01-16 01:40:54,073 - INFO - step 9778, loss: 2.105131, best loss: 1.532623 2025-01-16 01:40:54,223 - INFO - step 9779, loss: 2.149275, best loss: 1.532623 2025-01-16 01:40:54,373 - INFO - step 9780, loss: 2.104352, best loss: 1.532623 2025-01-16 01:40:54,523 - INFO - step 9781, loss: 1.997181, best loss: 1.532623 2025-01-16 01:40:54,674 - INFO - step 9782, loss: 2.077487, best loss: 1.532623 2025-01-16 01:40:54,824 - INFO - step 9783, loss: 2.093057, best loss: 1.532623 2025-01-16 01:40:54,974 - INFO - step 9784, loss: 2.132803, best 
loss: 1.532623 2025-01-16 01:40:55,127 - INFO - step 9785, loss: 2.286799, best loss: 1.532623 2025-01-16 01:40:55,277 - INFO - step 9786, loss: 2.368615, best loss: 1.532623 2025-01-16 01:40:55,427 - INFO - step 9787, loss: 2.208918, best loss: 1.532623 2025-01-16 01:40:55,577 - INFO - step 9788, loss: 2.109875, best loss: 1.532623 2025-01-16 01:40:55,727 - INFO - step 9789, loss: 2.103646, best loss: 1.532623 2025-01-16 01:40:55,878 - INFO - step 9790, loss: 2.306316, best loss: 1.532623 2025-01-16 01:40:56,028 - INFO - step 9791, loss: 2.403909, best loss: 1.532623 2025-01-16 01:40:56,178 - INFO - step 9792, loss: 2.308817, best loss: 1.532623 2025-01-16 01:40:56,328 - INFO - step 9793, loss: 2.400657, best loss: 1.532623 2025-01-16 01:40:56,478 - INFO - step 9794, loss: 2.456796, best loss: 1.532623 2025-01-16 01:40:56,631 - INFO - step 9795, loss: 2.135879, best loss: 1.532623 2025-01-16 01:40:56,781 - INFO - step 9796, loss: 2.369530, best loss: 1.532623 2025-01-16 01:40:56,931 - INFO - step 9797, loss: 2.028284, best loss: 1.532623 2025-01-16 01:40:57,081 - INFO - step 9798, loss: 1.884609, best loss: 1.532623 2025-01-16 01:40:57,231 - INFO - step 9799, loss: 2.337008, best loss: 1.532623 2025-01-16 01:40:57,381 - INFO - step 9800, loss: 2.307749, best loss: 1.532623 2025-01-16 01:40:57,531 - INFO - step 9801, loss: 2.049947, best loss: 1.532623 2025-01-16 01:40:57,681 - INFO - step 9802, loss: 1.891136, best loss: 1.532623 2025-01-16 01:40:57,832 - INFO - step 9803, loss: 2.261636, best loss: 1.532623 2025-01-16 01:40:57,982 - INFO - step 9804, loss: 2.216216, best loss: 1.532623 2025-01-16 01:40:58,132 - INFO - step 9805, loss: 2.160100, best loss: 1.532623 2025-01-16 01:40:58,282 - INFO - step 9806, loss: 2.264808, best loss: 1.532623 2025-01-16 01:40:58,432 - INFO - step 9807, loss: 1.982215, best loss: 1.532623 2025-01-16 01:40:58,582 - INFO - step 9808, loss: 1.855990, best loss: 1.532623 2025-01-16 01:40:58,732 - INFO - step 9809, loss: 2.332506, best 
loss: 1.532623 2025-01-16 01:40:58,882 - INFO - step 9810, loss: 2.406984, best loss: 1.532623 2025-01-16 01:40:59,032 - INFO - step 9811, loss: 2.461030, best loss: 1.532623 2025-01-16 01:40:59,182 - INFO - step 9812, loss: 2.233085, best loss: 1.532623 2025-01-16 01:40:59,332 - INFO - step 9813, loss: 2.253797, best loss: 1.532623 2025-01-16 01:40:59,482 - INFO - step 9814, loss: 2.317042, best loss: 1.532623 2025-01-16 01:40:59,632 - INFO - step 9815, loss: 2.078353, best loss: 1.532623 2025-01-16 01:40:59,782 - INFO - step 9816, loss: 2.003092, best loss: 1.532623 2025-01-16 01:40:59,932 - INFO - step 9817, loss: 2.271583, best loss: 1.532623 2025-01-16 01:41:00,082 - INFO - step 9818, loss: 1.925209, best loss: 1.532623 2025-01-16 01:41:00,232 - INFO - step 9819, loss: 1.753824, best loss: 1.532623 2025-01-16 01:41:00,382 - INFO - step 9820, loss: 2.061754, best loss: 1.532623 2025-01-16 01:41:00,532 - INFO - step 9821, loss: 2.079507, best loss: 1.532623 2025-01-16 01:41:00,682 - INFO - step 9822, loss: 2.184016, best loss: 1.532623 2025-01-16 01:41:00,832 - INFO - step 9823, loss: 1.774375, best loss: 1.532623 2025-01-16 01:41:00,982 - INFO - step 9824, loss: 1.732440, best loss: 1.532623 2025-01-16 01:41:01,132 - INFO - step 9825, loss: 1.628601, best loss: 1.532623 2025-01-16 01:41:01,282 - INFO - step 9826, loss: 1.853132, best loss: 1.532623 2025-01-16 01:41:01,432 - INFO - step 9827, loss: 2.124397, best loss: 1.532623 2025-01-16 01:41:01,582 - INFO - step 9828, loss: 2.197677, best loss: 1.532623 2025-01-16 01:41:01,732 - INFO - step 9829, loss: 2.098632, best loss: 1.532623 2025-01-16 01:41:01,882 - INFO - step 9830, loss: 2.100448, best loss: 1.532623 2025-01-16 01:41:02,032 - INFO - step 9831, loss: 2.019548, best loss: 1.532623 2025-01-16 01:41:02,182 - INFO - step 9832, loss: 2.063346, best loss: 1.532623 2025-01-16 01:41:02,332 - INFO - step 9833, loss: 1.863282, best loss: 1.532623 2025-01-16 01:41:02,482 - INFO - step 9834, loss: 2.088312, best 
loss: 1.532623 2025-01-16 01:41:02,632 - INFO - step 9835, loss: 2.043503, best loss: 1.532623 2025-01-16 01:41:02,783 - INFO - step 9836, loss: 1.785412, best loss: 1.532623 2025-01-16 01:41:02,933 - INFO - step 9837, loss: 1.811310, best loss: 1.532623 2025-01-16 01:41:03,083 - INFO - step 9838, loss: 1.856465, best loss: 1.532623 2025-01-16 01:41:03,234 - INFO - step 9839, loss: 2.047021, best loss: 1.532623 2025-01-16 01:41:03,384 - INFO - step 9840, loss: 1.788645, best loss: 1.532623 2025-01-16 01:41:03,534 - INFO - step 9841, loss: 1.866083, best loss: 1.532623 2025-01-16 01:41:03,684 - INFO - step 9842, loss: 1.994681, best loss: 1.532623 2025-01-16 01:41:03,834 - INFO - step 9843, loss: 1.755482, best loss: 1.532623 2025-01-16 01:41:03,984 - INFO - step 9844, loss: 1.910345, best loss: 1.532623 2025-01-16 01:41:04,134 - INFO - step 9845, loss: 2.126549, best loss: 1.532623 2025-01-16 01:41:04,284 - INFO - step 9846, loss: 1.976559, best loss: 1.532623 2025-01-16 01:41:04,434 - INFO - step 9847, loss: 1.987742, best loss: 1.532623 2025-01-16 01:41:04,584 - INFO - step 9848, loss: 1.873899, best loss: 1.532623 2025-01-16 01:41:04,734 - INFO - step 9849, loss: 2.142303, best loss: 1.532623 2025-01-16 01:41:04,884 - INFO - step 9850, loss: 1.850906, best loss: 1.532623 2025-01-16 01:41:05,034 - INFO - step 9851, loss: 1.916225, best loss: 1.532623 2025-01-16 01:41:05,184 - INFO - step 9852, loss: 1.935465, best loss: 1.532623 2025-01-16 01:41:05,334 - INFO - step 9853, loss: 2.115136, best loss: 1.532623 2025-01-16 01:41:05,485 - INFO - step 9854, loss: 2.292754, best loss: 1.532623 2025-01-16 01:41:05,635 - INFO - step 9855, loss: 2.184461, best loss: 1.532623 2025-01-16 01:41:05,785 - INFO - step 9856, loss: 2.087061, best loss: 1.532623 2025-01-16 01:41:05,935 - INFO - step 9857, loss: 2.074302, best loss: 1.532623 2025-01-16 01:41:06,085 - INFO - step 9858, loss: 2.086382, best loss: 1.532623 2025-01-16 01:41:06,235 - INFO - step 9859, loss: 2.061962, best 
loss: 1.532623 2025-01-16 01:41:06,385 - INFO - step 9860, loss: 1.982640, best loss: 1.532623 2025-01-16 01:41:06,535 - INFO - step 9861, loss: 2.009253, best loss: 1.532623 2025-01-16 01:41:06,685 - INFO - step 9862, loss: 1.900718, best loss: 1.532623 2025-01-16 01:41:06,835 - INFO - step 9863, loss: 1.885455, best loss: 1.532623 2025-01-16 01:41:06,985 - INFO - step 9864, loss: 2.023234, best loss: 1.532623 2025-01-16 01:41:07,135 - INFO - step 9865, loss: 1.921371, best loss: 1.532623 2025-01-16 01:41:07,285 - INFO - step 9866, loss: 2.134367, best loss: 1.532623 2025-01-16 01:41:07,436 - INFO - step 9867, loss: 1.643212, best loss: 1.532623 2025-01-16 01:41:07,586 - INFO - step 9868, loss: 2.073733, best loss: 1.532623 2025-01-16 01:41:07,736 - INFO - step 9869, loss: 2.022563, best loss: 1.532623 2025-01-16 01:41:07,886 - INFO - step 9870, loss: 1.965365, best loss: 1.532623 2025-01-16 01:41:08,036 - INFO - step 9871, loss: 1.987373, best loss: 1.532623 2025-01-16 01:41:08,186 - INFO - step 9872, loss: 1.959811, best loss: 1.532623 2025-01-16 01:41:08,336 - INFO - step 9873, loss: 2.006507, best loss: 1.532623 2025-01-16 01:41:08,486 - INFO - step 9874, loss: 1.874722, best loss: 1.532623 2025-01-16 01:41:08,636 - INFO - step 9875, loss: 1.934752, best loss: 1.532623 2025-01-16 01:41:08,787 - INFO - step 9876, loss: 1.919590, best loss: 1.532623 2025-01-16 01:41:08,937 - INFO - step 9877, loss: 2.059968, best loss: 1.532623 2025-01-16 01:41:09,087 - INFO - step 9878, loss: 1.937754, best loss: 1.532623 2025-01-16 01:41:09,237 - INFO - step 9879, loss: 1.926700, best loss: 1.532623 2025-01-16 01:41:09,387 - INFO - step 9880, loss: 1.807390, best loss: 1.532623 2025-01-16 01:41:09,537 - INFO - step 9881, loss: 1.820434, best loss: 1.532623 2025-01-16 01:41:09,687 - INFO - step 9882, loss: 2.002127, best loss: 1.532623 2025-01-16 01:41:09,837 - INFO - step 9883, loss: 1.820609, best loss: 1.532623 2025-01-16 01:41:09,987 - INFO - step 9884, loss: 1.868389, best 
loss: 1.532623 2025-01-16 01:41:10,138 - INFO - step 9885, loss: 1.638314, best loss: 1.532623 2025-01-16 01:41:10,288 - INFO - step 9886, loss: 1.616605, best loss: 1.532623 2025-01-16 01:41:10,438 - INFO - step 9887, loss: 1.591100, best loss: 1.532623 2025-01-16 01:41:10,588 - INFO - step 9888, loss: 2.007764, best loss: 1.532623 2025-01-16 01:41:10,738 - INFO - step 9889, loss: 2.006432, best loss: 1.532623 2025-01-16 01:41:10,889 - INFO - step 9890, loss: 2.089404, best loss: 1.532623 2025-01-16 01:41:11,039 - INFO - step 9891, loss: 2.133442, best loss: 1.532623 2025-01-16 01:41:11,189 - INFO - step 9892, loss: 2.109577, best loss: 1.532623 2025-01-16 01:41:11,339 - INFO - step 9893, loss: 1.843825, best loss: 1.532623 2025-01-16 01:41:11,489 - INFO - step 9894, loss: 1.923531, best loss: 1.532623 2025-01-16 01:41:11,639 - INFO - step 9895, loss: 2.127621, best loss: 1.532623 2025-01-16 01:41:11,789 - INFO - step 9896, loss: 2.100721, best loss: 1.532623 2025-01-16 01:41:15,399 - INFO - step 9897, loss: 1.514293, best loss: 1.514293 2025-01-16 01:41:15,551 - INFO - step 9898, loss: 1.770036, best loss: 1.514293 2025-01-16 01:41:15,701 - INFO - step 9899, loss: 1.720309, best loss: 1.514293 2025-01-16 01:41:15,851 - INFO - step 9900, loss: 2.067408, best loss: 1.514293 2025-01-16 01:41:16,001 - INFO - step 9901, loss: 2.061151, best loss: 1.514293 2025-01-16 01:41:16,151 - INFO - step 9902, loss: 2.012024, best loss: 1.514293 2025-01-16 01:41:16,302 - INFO - step 9903, loss: 2.133719, best loss: 1.514293 2025-01-16 01:41:16,452 - INFO - step 9904, loss: 2.070456, best loss: 1.514293 2025-01-16 01:41:16,602 - INFO - step 9905, loss: 1.773275, best loss: 1.514293 2025-01-16 01:41:16,752 - INFO - step 9906, loss: 2.197627, best loss: 1.514293 2025-01-16 01:41:16,902 - INFO - step 9907, loss: 2.108436, best loss: 1.514293 2025-01-16 01:41:17,052 - INFO - step 9908, loss: 2.246236, best loss: 1.514293 2025-01-16 01:41:17,202 - INFO - step 9909, loss: 2.198537, best 
loss: 1.514293 2025-01-16 01:41:17,353 - INFO - step 9910, loss: 1.977660, best loss: 1.514293 2025-01-16 01:41:17,503 - INFO - step 9911, loss: 1.972298, best loss: 1.514293 2025-01-16 01:41:17,653 - INFO - step 9912, loss: 1.828928, best loss: 1.514293 2025-01-16 01:41:17,803 - INFO - step 9913, loss: 2.157301, best loss: 1.514293 2025-01-16 01:41:17,954 - INFO - step 9914, loss: 2.246064, best loss: 1.514293 2025-01-16 01:41:18,104 - INFO - step 9915, loss: 2.249110, best loss: 1.514293 2025-01-16 01:41:18,254 - INFO - step 9916, loss: 2.125949, best loss: 1.514293 2025-01-16 01:41:18,404 - INFO - step 9917, loss: 2.001108, best loss: 1.514293 2025-01-16 01:41:18,554 - INFO - step 9918, loss: 2.010811, best loss: 1.514293 2025-01-16 01:41:18,705 - INFO - step 9919, loss: 1.989986, best loss: 1.514293 2025-01-16 01:41:18,855 - INFO - step 9920, loss: 1.848321, best loss: 1.514293 2025-01-16 01:41:19,006 - INFO - step 9921, loss: 2.295790, best loss: 1.514293 2025-01-16 01:41:19,156 - INFO - step 9922, loss: 1.718097, best loss: 1.514293 2025-01-16 01:41:19,306 - INFO - step 9923, loss: 1.856688, best loss: 1.514293 2025-01-16 01:41:19,456 - INFO - step 9924, loss: 2.103388, best loss: 1.514293 2025-01-16 01:41:19,607 - INFO - step 9925, loss: 2.235560, best loss: 1.514293 2025-01-16 01:41:19,757 - INFO - step 9926, loss: 2.080634, best loss: 1.514293 2025-01-16 01:41:19,908 - INFO - step 9927, loss: 1.907054, best loss: 1.514293 2025-01-16 01:41:20,057 - INFO - step 9928, loss: 2.082797, best loss: 1.514293 2025-01-16 01:41:20,208 - INFO - step 9929, loss: 2.177351, best loss: 1.514293 2025-01-16 01:41:20,358 - INFO - step 9930, loss: 1.873752, best loss: 1.514293 2025-01-16 01:41:20,508 - INFO - step 9931, loss: 1.814621, best loss: 1.514293 2025-01-16 01:41:20,658 - INFO - step 9932, loss: 2.045021, best loss: 1.514293 2025-01-16 01:41:20,808 - INFO - step 9933, loss: 2.053318, best loss: 1.514293 2025-01-16 01:41:20,959 - INFO - step 9934, loss: 1.890680, best 
loss: 1.514293 2025-01-16 01:41:21,109 - INFO - step 9935, loss: 1.743822, best loss: 1.514293 2025-01-16 01:41:21,259 - INFO - step 9936, loss: 2.055764, best loss: 1.514293 2025-01-16 01:41:21,409 - INFO - step 9937, loss: 2.138301, best loss: 1.514293 2025-01-16 01:41:21,559 - INFO - step 9938, loss: 2.014139, best loss: 1.514293 2025-01-16 01:41:21,710 - INFO - step 9939, loss: 1.990277, best loss: 1.514293 2025-01-16 01:41:21,860 - INFO - step 9940, loss: 2.195292, best loss: 1.514293 2025-01-16 01:41:22,010 - INFO - step 9941, loss: 2.094821, best loss: 1.514293 2025-01-16 01:41:22,160 - INFO - step 9942, loss: 2.149971, best loss: 1.514293 2025-01-16 01:41:22,310 - INFO - step 9943, loss: 2.044744, best loss: 1.514293 2025-01-16 01:41:22,461 - INFO - step 9944, loss: 1.953130, best loss: 1.514293 2025-01-16 01:41:22,611 - INFO - step 9945, loss: 1.988805, best loss: 1.514293 2025-01-16 01:41:22,761 - INFO - step 9946, loss: 2.336648, best loss: 1.514293 2025-01-16 01:41:22,911 - INFO - step 9947, loss: 2.080903, best loss: 1.514293 2025-01-16 01:41:23,062 - INFO - step 9948, loss: 2.095859, best loss: 1.514293 2025-01-16 01:41:23,212 - INFO - step 9949, loss: 1.722491, best loss: 1.514293 2025-01-16 01:41:23,362 - INFO - step 9950, loss: 1.812225, best loss: 1.514293 2025-01-16 01:41:23,513 - INFO - step 9951, loss: 2.063530, best loss: 1.514293 2025-01-16 01:41:23,663 - INFO - step 9952, loss: 2.059287, best loss: 1.514293 2025-01-16 01:41:23,814 - INFO - step 9953, loss: 1.960672, best loss: 1.514293 2025-01-16 01:41:23,964 - INFO - step 9954, loss: 2.096082, best loss: 1.514293 2025-01-16 01:41:24,114 - INFO - step 9955, loss: 1.906999, best loss: 1.514293 2025-01-16 01:41:24,264 - INFO - step 9956, loss: 1.967720, best loss: 1.514293 2025-01-16 01:41:24,414 - INFO - step 9957, loss: 1.998519, best loss: 1.514293 2025-01-16 01:41:24,565 - INFO - step 9958, loss: 1.668959, best loss: 1.514293 2025-01-16 01:41:24,715 - INFO - step 9959, loss: 2.082654, best 
loss: 1.514293 2025-01-16 01:41:24,865 - INFO - step 9960, loss: 2.125726, best loss: 1.514293 2025-01-16 01:41:25,015 - INFO - step 9961, loss: 2.141396, best loss: 1.514293 2025-01-16 01:41:25,165 - INFO - step 9962, loss: 2.077921, best loss: 1.514293 2025-01-16 01:41:25,315 - INFO - step 9963, loss: 2.108657, best loss: 1.514293 2025-01-16 01:41:25,465 - INFO - step 9964, loss: 2.078542, best loss: 1.514293 2025-01-16 01:41:25,615 - INFO - step 9965, loss: 1.797752, best loss: 1.514293 2025-01-16 01:41:25,765 - INFO - step 9966, loss: 2.119694, best loss: 1.514293 2025-01-16 01:41:25,915 - INFO - step 9967, loss: 1.793869, best loss: 1.514293 2025-01-16 01:41:26,066 - INFO - step 9968, loss: 1.918631, best loss: 1.514293 2025-01-16 01:41:26,216 - INFO - step 9969, loss: 2.007424, best loss: 1.514293 2025-01-16 01:41:26,365 - INFO - step 9970, loss: 1.933635, best loss: 1.514293 2025-01-16 01:41:26,516 - INFO - step 9971, loss: 2.002778, best loss: 1.514293 2025-01-16 01:41:26,666 - INFO - step 9972, loss: 2.051610, best loss: 1.514293 2025-01-16 01:41:26,816 - INFO - step 9973, loss: 2.165205, best loss: 1.514293 2025-01-16 01:41:26,966 - INFO - step 9974, loss: 2.135882, best loss: 1.514293 2025-01-16 01:41:27,116 - INFO - step 9975, loss: 2.110149, best loss: 1.514293 2025-01-16 01:41:27,266 - INFO - step 9976, loss: 1.987673, best loss: 1.514293 2025-01-16 01:41:27,416 - INFO - step 9977, loss: 2.150829, best loss: 1.514293 2025-01-16 01:41:27,567 - INFO - step 9978, loss: 1.981981, best loss: 1.514293 2025-01-16 01:41:27,717 - INFO - step 9979, loss: 1.790207, best loss: 1.514293 2025-01-16 01:41:27,867 - INFO - step 9980, loss: 2.174578, best loss: 1.514293 2025-01-16 01:41:28,017 - INFO - step 9981, loss: 1.933563, best loss: 1.514293 2025-01-16 01:41:28,167 - INFO - step 9982, loss: 1.920080, best loss: 1.514293 2025-01-16 01:41:28,317 - INFO - step 9983, loss: 1.940640, best loss: 1.514293 2025-01-16 01:41:28,467 - INFO - step 9984, loss: 1.871631, best 
loss: 1.514293 2025-01-16 01:41:28,618 - INFO - step 9985, loss: 1.950784, best loss: 1.514293 2025-01-16 01:41:28,768 - INFO - step 9986, loss: 1.741247, best loss: 1.514293 2025-01-16 01:41:28,918 - INFO - step 9987, loss: 1.842158, best loss: 1.514293 2025-01-16 01:41:29,068 - INFO - step 9988, loss: 2.035088, best loss: 1.514293 2025-01-16 01:41:29,218 - INFO - step 9989, loss: 2.098204, best loss: 1.514293 2025-01-16 01:41:29,368 - INFO - step 9990, loss: 1.893036, best loss: 1.514293 2025-01-16 01:41:29,519 - INFO - step 9991, loss: 2.091869, best loss: 1.514293 2025-01-16 01:41:29,669 - INFO - step 9992, loss: 1.841504, best loss: 1.514293 2025-01-16 01:41:29,819 - INFO - step 9993, loss: 2.221980, best loss: 1.514293 2025-01-16 01:41:29,969 - INFO - step 9994, loss: 2.261462, best loss: 1.514293 2025-01-16 01:41:30,119 - INFO - step 9995, loss: 2.272823, best loss: 1.514293 2025-01-16 01:41:30,269 - INFO - step 9996, loss: 2.277964, best loss: 1.514293 2025-01-16 01:41:30,420 - INFO - step 9997, loss: 2.040378, best loss: 1.514293 2025-01-16 01:41:30,570 - INFO - step 9998, loss: 2.096784, best loss: 1.514293 2025-01-16 01:41:30,720 - INFO - step 9999, loss: 2.061828, best loss: 1.514293 2025-01-16 01:41:30,870 - INFO - step 10000, loss: 2.068230, best loss: 1.514293 2025-01-16 01:41:31,020 - INFO - step 10001, loss: 2.268491, best loss: 1.514293 2025-01-16 01:41:31,170 - INFO - step 10002, loss: 2.096524, best loss: 1.514293 2025-01-16 01:41:31,320 - INFO - step 10003, loss: 2.364818, best loss: 1.514293 2025-01-16 01:41:31,470 - INFO - step 10004, loss: 2.021621, best loss: 1.514293 2025-01-16 01:41:31,620 - INFO - step 10005, loss: 2.027090, best loss: 1.514293 2025-01-16 01:41:31,770 - INFO - step 10006, loss: 2.032808, best loss: 1.514293 2025-01-16 01:41:31,920 - INFO - step 10007, loss: 2.022294, best loss: 1.514293 2025-01-16 01:41:32,070 - INFO - step 10008, loss: 2.032682, best loss: 1.514293 2025-01-16 01:41:32,221 - INFO - step 10009, loss: 
2.181999, best loss: 1.514293 2025-01-16 01:41:32,371 - INFO - step 10010, loss: 2.064499, best loss: 1.514293 2025-01-16 01:41:32,521 - INFO - step 10011, loss: 2.242463, best loss: 1.514293 2025-01-16 01:41:32,671 - INFO - step 10012, loss: 2.083057, best loss: 1.514293 2025-01-16 01:41:32,821 - INFO - step 10013, loss: 2.070580, best loss: 1.514293 2025-01-16 01:41:32,971 - INFO - step 10014, loss: 2.280210, best loss: 1.514293 2025-01-16 01:41:33,121 - INFO - step 10015, loss: 2.133152, best loss: 1.514293 2025-01-16 01:41:33,271 - INFO - step 10016, loss: 2.193641, best loss: 1.514293 2025-01-16 01:41:33,421 - INFO - step 10017, loss: 2.072880, best loss: 1.514293 2025-01-16 01:41:33,571 - INFO - step 10018, loss: 2.002789, best loss: 1.514293 2025-01-16 01:41:33,722 - INFO - step 10019, loss: 2.162090, best loss: 1.514293 2025-01-16 01:41:33,872 - INFO - step 10020, loss: 2.173663, best loss: 1.514293 2025-01-16 01:41:34,022 - INFO - step 10021, loss: 2.136496, best loss: 1.514293 2025-01-16 01:41:34,172 - INFO - step 10022, loss: 2.113017, best loss: 1.514293 2025-01-16 01:41:34,322 - INFO - step 10023, loss: 1.966801, best loss: 1.514293 2025-01-16 01:41:34,473 - INFO - step 10024, loss: 1.849795, best loss: 1.514293 2025-01-16 01:41:34,623 - INFO - step 10025, loss: 1.858951, best loss: 1.514293 2025-01-16 01:41:34,773 - INFO - step 10026, loss: 2.028168, best loss: 1.514293 2025-01-16 01:41:34,923 - INFO - step 10027, loss: 2.260455, best loss: 1.514293 2025-01-16 01:41:35,073 - INFO - step 10028, loss: 1.944944, best loss: 1.514293 2025-01-16 01:41:35,224 - INFO - step 10029, loss: 1.636787, best loss: 1.514293 2025-01-16 01:41:35,374 - INFO - step 10030, loss: 2.034808, best loss: 1.514293 2025-01-16 01:41:35,524 - INFO - step 10031, loss: 2.084542, best loss: 1.514293 2025-01-16 01:41:35,674 - INFO - step 10032, loss: 2.342165, best loss: 1.514293 2025-01-16 01:41:35,824 - INFO - step 10033, loss: 1.877531, best loss: 1.514293 2025-01-16 01:41:35,975 - 
INFO - step 10034, loss: 2.181905, best loss: 1.514293 2025-01-16 01:41:36,125 - INFO - step 10035, loss: 1.906044, best loss: 1.514293 2025-01-16 01:41:36,275 - INFO - step 10036, loss: 1.908664, best loss: 1.514293 2025-01-16 01:41:36,425 - INFO - step 10037, loss: 2.015911, best loss: 1.514293 2025-01-16 01:41:36,576 - INFO - step 10038, loss: 2.000199, best loss: 1.514293 2025-01-16 01:41:36,726 - INFO - step 10039, loss: 2.121357, best loss: 1.514293 2025-01-16 01:41:36,876 - INFO - step 10040, loss: 2.068731, best loss: 1.514293 2025-01-16 01:41:37,026 - INFO - step 10041, loss: 2.004504, best loss: 1.514293 2025-01-16 01:41:37,176 - INFO - step 10042, loss: 2.275451, best loss: 1.514293 2025-01-16 01:41:37,327 - INFO - step 10043, loss: 1.898631, best loss: 1.514293 2025-01-16 01:41:37,477 - INFO - step 10044, loss: 1.586843, best loss: 1.514293 2025-01-16 01:41:37,627 - INFO - step 10045, loss: 2.038221, best loss: 1.514293 2025-01-16 01:41:37,777 - INFO - step 10046, loss: 2.268962, best loss: 1.514293 2025-01-16 01:41:37,927 - INFO - step 10047, loss: 2.052838, best loss: 1.514293 2025-01-16 01:41:38,078 - INFO - step 10048, loss: 1.851437, best loss: 1.514293 2025-01-16 01:41:38,228 - INFO - step 10049, loss: 1.977660, best loss: 1.514293 2025-01-16 01:41:38,378 - INFO - step 10050, loss: 2.131669, best loss: 1.514293 2025-01-16 01:41:38,528 - INFO - step 10051, loss: 2.156050, best loss: 1.514293 2025-01-16 01:41:38,678 - INFO - step 10052, loss: 2.096923, best loss: 1.514293 2025-01-16 01:41:38,828 - INFO - step 10053, loss: 1.883482, best loss: 1.514293 2025-01-16 01:41:38,979 - INFO - step 10054, loss: 2.102918, best loss: 1.514293 2025-01-16 01:41:39,129 - INFO - step 10055, loss: 2.082548, best loss: 1.514293 2025-01-16 01:41:39,279 - INFO - step 10056, loss: 1.977044, best loss: 1.514293 2025-01-16 01:41:39,429 - INFO - step 10057, loss: 2.025966, best loss: 1.514293 2025-01-16 01:41:39,580 - INFO - step 10058, loss: 1.977465, best loss: 1.514293 
2025-01-16 01:41:39,730 - INFO - step 10059, loss: 2.102316, best loss: 1.514293 2025-01-16 01:41:39,880 - INFO - step 10060, loss: 1.923504, best loss: 1.514293 2025-01-16 01:41:40,030 - INFO - step 10061, loss: 1.929074, best loss: 1.514293 2025-01-16 01:41:40,180 - INFO - step 10062, loss: 1.879475, best loss: 1.514293 2025-01-16 01:41:40,331 - INFO - step 10063, loss: 1.713244, best loss: 1.514293 2025-01-16 01:41:40,481 - INFO - step 10064, loss: 2.088288, best loss: 1.514293 2025-01-16 01:41:40,631 - INFO - step 10065, loss: 2.018937, best loss: 1.514293 2025-01-16 01:41:40,782 - INFO - step 10066, loss: 2.229245, best loss: 1.514293 2025-01-16 01:41:40,932 - INFO - step 10067, loss: 2.064508, best loss: 1.514293 2025-01-16 01:41:41,082 - INFO - step 10068, loss: 2.055062, best loss: 1.514293 2025-01-16 01:41:41,233 - INFO - step 10069, loss: 2.195435, best loss: 1.514293 2025-01-16 01:41:41,383 - INFO - step 10070, loss: 1.988841, best loss: 1.514293 2025-01-16 01:41:41,533 - INFO - step 10071, loss: 1.840680, best loss: 1.514293 2025-01-16 01:41:41,683 - INFO - step 10072, loss: 1.867179, best loss: 1.514293 2025-01-16 01:41:41,833 - INFO - step 10073, loss: 2.044227, best loss: 1.514293 2025-01-16 01:41:41,983 - INFO - step 10074, loss: 2.074589, best loss: 1.514293 2025-01-16 01:41:42,134 - INFO - step 10075, loss: 2.124228, best loss: 1.514293 2025-01-16 01:41:42,284 - INFO - step 10076, loss: 2.335285, best loss: 1.514293 2025-01-16 01:41:42,434 - INFO - step 10077, loss: 2.246099, best loss: 1.514293 2025-01-16 01:41:42,584 - INFO - step 10078, loss: 2.221692, best loss: 1.514293 2025-01-16 01:41:42,735 - INFO - step 10079, loss: 2.122195, best loss: 1.514293 2025-01-16 01:41:42,885 - INFO - step 10080, loss: 2.346940, best loss: 1.514293 2025-01-16 01:41:43,035 - INFO - step 10081, loss: 1.945969, best loss: 1.514293 2025-01-16 01:41:43,185 - INFO - step 10082, loss: 2.184572, best loss: 1.514293 2025-01-16 01:41:43,335 - INFO - step 10083, loss: 
2.272613, best loss: 1.514293 2025-01-16 01:41:43,486 - INFO - step 10084, loss: 2.068829, best loss: 1.514293 2025-01-16 01:41:43,636 - INFO - step 10085, loss: 2.054083, best loss: 1.514293 2025-01-16 01:41:43,786 - INFO - step 10086, loss: 2.196833, best loss: 1.514293 2025-01-16 01:41:43,936 - INFO - step 10087, loss: 2.039690, best loss: 1.514293 2025-01-16 01:41:47,594 - INFO - step 10088, loss: 1.505040, best loss: 1.505040 2025-01-16 01:41:47,753 - INFO - step 10089, loss: 2.111918, best loss: 1.505040 2025-01-16 01:41:47,904 - INFO - step 10090, loss: 1.981172, best loss: 1.505040 2025-01-16 01:41:48,055 - INFO - step 10091, loss: 2.172937, best loss: 1.505040 2025-01-16 01:41:48,205 - INFO - step 10092, loss: 2.210284, best loss: 1.505040 2025-01-16 01:41:48,355 - INFO - step 10093, loss: 2.069325, best loss: 1.505040 2025-01-16 01:41:48,505 - INFO - step 10094, loss: 2.129472, best loss: 1.505040 2025-01-16 01:41:48,655 - INFO - step 10095, loss: 2.193586, best loss: 1.505040 2025-01-16 01:41:48,806 - INFO - step 10096, loss: 2.068991, best loss: 1.505040 2025-01-16 01:41:48,956 - INFO - step 10097, loss: 2.044050, best loss: 1.505040 2025-01-16 01:41:49,106 - INFO - step 10098, loss: 1.997958, best loss: 1.505040 2025-01-16 01:41:49,256 - INFO - step 10099, loss: 1.840596, best loss: 1.505040 2025-01-16 01:41:49,406 - INFO - step 10100, loss: 2.014793, best loss: 1.505040 2025-01-16 01:41:49,557 - INFO - step 10101, loss: 1.933479, best loss: 1.505040 2025-01-16 01:41:49,707 - INFO - step 10102, loss: 2.132700, best loss: 1.505040 2025-01-16 01:41:49,857 - INFO - step 10103, loss: 2.191191, best loss: 1.505040 2025-01-16 01:41:50,007 - INFO - step 10104, loss: 2.014804, best loss: 1.505040 2025-01-16 01:41:50,157 - INFO - step 10105, loss: 1.853654, best loss: 1.505040 2025-01-16 01:41:50,308 - INFO - step 10106, loss: 1.945122, best loss: 1.505040 2025-01-16 01:41:50,458 - INFO - step 10107, loss: 2.007042, best loss: 1.505040 2025-01-16 01:41:50,608 - 
INFO - step 10108, loss: 2.113931, best loss: 1.505040 2025-01-16 01:41:50,760 - INFO - step 10109, loss: 2.046004, best loss: 1.505040 2025-01-16 01:41:50,910 - INFO - step 10110, loss: 2.069691, best loss: 1.505040 2025-01-16 01:41:51,060 - INFO - step 10111, loss: 1.896558, best loss: 1.505040 2025-01-16 01:41:51,210 - INFO - step 10112, loss: 1.985072, best loss: 1.505040 2025-01-16 01:41:51,360 - INFO - step 10113, loss: 2.020325, best loss: 1.505040 2025-01-16 01:41:51,510 - INFO - step 10114, loss: 2.098748, best loss: 1.505040 2025-01-16 01:41:51,660 - INFO - step 10115, loss: 2.153342, best loss: 1.505040 2025-01-16 01:41:51,811 - INFO - step 10116, loss: 2.243638, best loss: 1.505040 2025-01-16 01:41:51,961 - INFO - step 10117, loss: 2.089998, best loss: 1.505040 2025-01-16 01:41:52,111 - INFO - step 10118, loss: 1.965648, best loss: 1.505040 2025-01-16 01:41:52,261 - INFO - step 10119, loss: 2.067676, best loss: 1.505040 2025-01-16 01:41:52,411 - INFO - step 10120, loss: 2.236818, best loss: 1.505040 2025-01-16 01:41:52,562 - INFO - step 10121, loss: 2.279066, best loss: 1.505040 2025-01-16 01:41:52,712 - INFO - step 10122, loss: 2.236349, best loss: 1.505040 2025-01-16 01:41:52,862 - INFO - step 10123, loss: 2.303446, best loss: 1.505040 2025-01-16 01:41:53,012 - INFO - step 10124, loss: 2.416332, best loss: 1.505040 2025-01-16 01:41:53,163 - INFO - step 10125, loss: 2.061329, best loss: 1.505040 2025-01-16 01:41:53,313 - INFO - step 10126, loss: 2.239621, best loss: 1.505040 2025-01-16 01:41:53,463 - INFO - step 10127, loss: 1.957193, best loss: 1.505040 2025-01-16 01:41:53,614 - INFO - step 10128, loss: 1.832185, best loss: 1.505040 2025-01-16 01:41:53,764 - INFO - step 10129, loss: 2.210641, best loss: 1.505040 2025-01-16 01:41:53,914 - INFO - step 10130, loss: 2.195828, best loss: 1.505040 2025-01-16 01:41:54,065 - INFO - step 10131, loss: 1.960392, best loss: 1.505040 2025-01-16 01:41:54,215 - INFO - step 10132, loss: 1.869897, best loss: 1.505040 
2025-01-16 01:41:54,366 - INFO - step 10133, loss: 2.162811, best loss: 1.505040 2025-01-16 01:41:54,516 - INFO - step 10134, loss: 2.212255, best loss: 1.505040 2025-01-16 01:41:54,667 - INFO - step 10135, loss: 2.062442, best loss: 1.505040 2025-01-16 01:41:54,817 - INFO - step 10136, loss: 2.175577, best loss: 1.505040 2025-01-16 01:41:54,967 - INFO - step 10137, loss: 1.972592, best loss: 1.505040 2025-01-16 01:41:55,118 - INFO - step 10138, loss: 1.767594, best loss: 1.505040 2025-01-16 01:41:55,268 - INFO - step 10139, loss: 2.267094, best loss: 1.505040 2025-01-16 01:41:55,418 - INFO - step 10140, loss: 2.309951, best loss: 1.505040 2025-01-16 01:41:55,568 - INFO - step 10141, loss: 2.319669, best loss: 1.505040 2025-01-16 01:41:55,718 - INFO - step 10142, loss: 2.170661, best loss: 1.505040 2025-01-16 01:41:55,869 - INFO - step 10143, loss: 2.176549, best loss: 1.505040 2025-01-16 01:41:56,020 - INFO - step 10144, loss: 2.247515, best loss: 1.505040 2025-01-16 01:41:56,170 - INFO - step 10145, loss: 2.044413, best loss: 1.505040 2025-01-16 01:41:56,320 - INFO - step 10146, loss: 2.012339, best loss: 1.505040 2025-01-16 01:41:56,470 - INFO - step 10147, loss: 2.317333, best loss: 1.505040 2025-01-16 01:41:56,620 - INFO - step 10148, loss: 1.853409, best loss: 1.505040 2025-01-16 01:41:56,770 - INFO - step 10149, loss: 1.733864, best loss: 1.505040 2025-01-16 01:41:56,920 - INFO - step 10150, loss: 2.004223, best loss: 1.505040 2025-01-16 01:41:57,070 - INFO - step 10151, loss: 1.964818, best loss: 1.505040 2025-01-16 01:41:57,220 - INFO - step 10152, loss: 2.102599, best loss: 1.505040 2025-01-16 01:41:57,370 - INFO - step 10153, loss: 1.697714, best loss: 1.505040 2025-01-16 01:41:57,521 - INFO - step 10154, loss: 1.674994, best loss: 1.505040 2025-01-16 01:41:57,671 - INFO - step 10155, loss: 1.553371, best loss: 1.505040 2025-01-16 01:41:57,821 - INFO - step 10156, loss: 1.871037, best loss: 1.505040 2025-01-16 01:41:57,971 - INFO - step 10157, loss: 
2.005361, best loss: 1.505040 2025-01-16 01:41:58,121 - INFO - step 10158, loss: 2.103652, best loss: 1.505040 2025-01-16 01:41:58,271 - INFO - step 10159, loss: 2.070966, best loss: 1.505040 2025-01-16 01:41:58,421 - INFO - step 10160, loss: 2.067276, best loss: 1.505040 2025-01-16 01:41:58,571 - INFO - step 10161, loss: 1.975084, best loss: 1.505040 2025-01-16 01:41:58,721 - INFO - step 10162, loss: 2.128169, best loss: 1.505040 2025-01-16 01:41:58,871 - INFO - step 10163, loss: 1.798170, best loss: 1.505040 2025-01-16 01:41:59,021 - INFO - step 10164, loss: 2.007186, best loss: 1.505040 2025-01-16 01:41:59,171 - INFO - step 10165, loss: 2.004221, best loss: 1.505040 2025-01-16 01:41:59,321 - INFO - step 10166, loss: 1.766793, best loss: 1.505040 2025-01-16 01:41:59,472 - INFO - step 10167, loss: 1.701394, best loss: 1.505040 2025-01-16 01:41:59,622 - INFO - step 10168, loss: 1.828691, best loss: 1.505040 2025-01-16 01:41:59,772 - INFO - step 10169, loss: 1.992447, best loss: 1.505040 2025-01-16 01:41:59,922 - INFO - step 10170, loss: 1.800161, best loss: 1.505040 2025-01-16 01:42:00,072 - INFO - step 10171, loss: 1.893902, best loss: 1.505040 2025-01-16 01:42:00,223 - INFO - step 10172, loss: 1.860947, best loss: 1.505040 2025-01-16 01:42:00,373 - INFO - step 10173, loss: 1.739282, best loss: 1.505040 2025-01-16 01:42:00,523 - INFO - step 10174, loss: 1.889616, best loss: 1.505040 2025-01-16 01:42:00,673 - INFO - step 10175, loss: 2.093827, best loss: 1.505040 2025-01-16 01:42:00,823 - INFO - step 10176, loss: 1.861958, best loss: 1.505040 2025-01-16 01:42:00,974 - INFO - step 10177, loss: 1.941419, best loss: 1.505040 2025-01-16 01:42:01,124 - INFO - step 10178, loss: 1.883257, best loss: 1.505040 2025-01-16 01:42:01,274 - INFO - step 10179, loss: 2.034847, best loss: 1.505040 2025-01-16 01:42:01,424 - INFO - step 10180, loss: 1.825482, best loss: 1.505040 2025-01-16 01:42:01,575 - INFO - step 10181, loss: 1.852188, best loss: 1.505040 2025-01-16 01:42:01,725 - 
INFO - step 10182, loss: 1.842539, best loss: 1.505040 2025-01-16 01:42:01,875 - INFO - step 10183, loss: 2.009793, best loss: 1.505040 2025-01-16 01:42:02,025 - INFO - step 10184, loss: 2.212192, best loss: 1.505040 2025-01-16 01:42:02,175 - INFO - step 10185, loss: 2.136491, best loss: 1.505040 2025-01-16 01:42:02,326 - INFO - step 10186, loss: 2.087045, best loss: 1.505040 2025-01-16 01:42:02,476 - INFO - step 10187, loss: 2.061174, best loss: 1.505040 2025-01-16 01:42:02,626 - INFO - step 10188, loss: 2.069351, best loss: 1.505040 2025-01-16 01:42:02,776 - INFO - step 10189, loss: 1.935302, best loss: 1.505040 2025-01-16 01:42:02,926 - INFO - step 10190, loss: 1.867701, best loss: 1.505040 2025-01-16 01:42:03,076 - INFO - step 10191, loss: 1.942759, best loss: 1.505040 2025-01-16 01:42:03,226 - INFO - step 10192, loss: 1.879164, best loss: 1.505040 2025-01-16 01:42:03,377 - INFO - step 10193, loss: 1.784928, best loss: 1.505040 2025-01-16 01:42:03,527 - INFO - step 10194, loss: 1.904454, best loss: 1.505040 2025-01-16 01:42:03,677 - INFO - step 10195, loss: 1.941586, best loss: 1.505040 2025-01-16 01:42:03,827 - INFO - step 10196, loss: 2.056967, best loss: 1.505040 2025-01-16 01:42:03,977 - INFO - step 10197, loss: 1.604008, best loss: 1.505040 2025-01-16 01:42:04,127 - INFO - step 10198, loss: 1.962955, best loss: 1.505040 2025-01-16 01:42:04,277 - INFO - step 10199, loss: 1.964431, best loss: 1.505040 2025-01-16 01:42:04,427 - INFO - step 10200, loss: 1.892308, best loss: 1.505040 2025-01-16 01:42:04,577 - INFO - step 10201, loss: 1.902746, best loss: 1.505040 2025-01-16 01:42:04,727 - INFO - step 10202, loss: 1.870483, best loss: 1.505040 2025-01-16 01:42:04,877 - INFO - step 10203, loss: 1.954871, best loss: 1.505040 2025-01-16 01:42:05,027 - INFO - step 10204, loss: 1.812491, best loss: 1.505040 2025-01-16 01:42:05,177 - INFO - step 10205, loss: 1.896453, best loss: 1.505040 2025-01-16 01:42:05,328 - INFO - step 10206, loss: 1.856577, best loss: 1.505040 
2025-01-16 01:42:05,478 - INFO - step 10207, loss: 2.020361, best loss: 1.505040 2025-01-16 01:42:05,628 - INFO - step 10208, loss: 1.846070, best loss: 1.505040 2025-01-16 01:42:05,778 - INFO - step 10209, loss: 1.876454, best loss: 1.505040 2025-01-16 01:42:05,928 - INFO - step 10210, loss: 1.797215, best loss: 1.505040 2025-01-16 01:42:06,078 - INFO - step 10211, loss: 1.737446, best loss: 1.505040 2025-01-16 01:42:06,229 - INFO - step 10212, loss: 1.930560, best loss: 1.505040 2025-01-16 01:42:06,379 - INFO - step 10213, loss: 1.807681, best loss: 1.505040 2025-01-16 01:42:06,529 - INFO - step 10214, loss: 1.825490, best loss: 1.505040 2025-01-16 01:42:06,679 - INFO - step 10215, loss: 1.589997, best loss: 1.505040 2025-01-16 01:42:06,830 - INFO - step 10216, loss: 1.562180, best loss: 1.505040 2025-01-16 01:42:06,980 - INFO - step 10217, loss: 1.534457, best loss: 1.505040 2025-01-16 01:42:07,130 - INFO - step 10218, loss: 1.936415, best loss: 1.505040 2025-01-16 01:42:07,280 - INFO - step 10219, loss: 1.930854, best loss: 1.505040 2025-01-16 01:42:07,430 - INFO - step 10220, loss: 2.013414, best loss: 1.505040 2025-01-16 01:42:07,580 - INFO - step 10221, loss: 2.069178, best loss: 1.505040 2025-01-16 01:42:07,731 - INFO - step 10222, loss: 2.009562, best loss: 1.505040 2025-01-16 01:42:07,881 - INFO - step 10223, loss: 1.788839, best loss: 1.505040 2025-01-16 01:42:08,031 - INFO - step 10224, loss: 1.860388, best loss: 1.505040 2025-01-16 01:42:08,181 - INFO - step 10225, loss: 2.132228, best loss: 1.505040 2025-01-16 01:42:08,331 - INFO - step 10226, loss: 2.006677, best loss: 1.505040 2025-01-16 01:42:08,481 - INFO - step 10227, loss: 1.542741, best loss: 1.505040 2025-01-16 01:42:08,631 - INFO - step 10228, loss: 1.795284, best loss: 1.505040 2025-01-16 01:42:08,782 - INFO - step 10229, loss: 1.658005, best loss: 1.505040 2025-01-16 01:42:08,932 - INFO - step 10230, loss: 1.968504, best loss: 1.505040 2025-01-16 01:42:09,082 - INFO - step 10231, loss: 
1.978466, best loss: 1.505040 2025-01-16 01:42:09,232 - INFO - step 10232, loss: 1.961722, best loss: 1.505040 2025-01-16 01:42:09,382 - INFO - step 10233, loss: 2.000859, best loss: 1.505040 2025-01-16 01:42:09,532 - INFO - step 10234, loss: 1.965475, best loss: 1.505040 2025-01-16 01:42:09,682 - INFO - step 10235, loss: 1.773935, best loss: 1.505040 2025-01-16 01:42:09,833 - INFO - step 10236, loss: 2.164684, best loss: 1.505040 2025-01-16 01:42:09,983 - INFO - step 10237, loss: 1.925053, best loss: 1.505040 2025-01-16 01:42:10,133 - INFO - step 10238, loss: 2.106505, best loss: 1.505040 2025-01-16 01:42:10,283 - INFO - step 10239, loss: 2.149451, best loss: 1.505040 2025-01-16 01:42:10,433 - INFO - step 10240, loss: 1.923520, best loss: 1.505040 2025-01-16 01:42:10,583 - INFO - step 10241, loss: 1.975799, best loss: 1.505040 2025-01-16 01:42:10,733 - INFO - step 10242, loss: 1.803098, best loss: 1.505040 2025-01-16 01:42:10,884 - INFO - step 10243, loss: 2.091683, best loss: 1.505040 2025-01-16 01:42:11,034 - INFO - step 10244, loss: 2.166380, best loss: 1.505040 2025-01-16 01:42:11,184 - INFO - step 10245, loss: 2.083718, best loss: 1.505040 2025-01-16 01:42:11,334 - INFO - step 10246, loss: 2.133343, best loss: 1.505040 2025-01-16 01:42:11,484 - INFO - step 10247, loss: 1.933661, best loss: 1.505040 2025-01-16 01:42:11,634 - INFO - step 10248, loss: 1.978318, best loss: 1.505040 2025-01-16 01:42:11,784 - INFO - step 10249, loss: 1.899281, best loss: 1.505040 2025-01-16 01:42:11,934 - INFO - step 10250, loss: 1.811659, best loss: 1.505040 2025-01-16 01:42:12,085 - INFO - step 10251, loss: 2.196218, best loss: 1.505040 2025-01-16 01:42:12,235 - INFO - step 10252, loss: 1.632949, best loss: 1.505040 2025-01-16 01:42:12,385 - INFO - step 10253, loss: 1.885628, best loss: 1.505040 2025-01-16 01:42:12,536 - INFO - step 10254, loss: 2.016899, best loss: 1.505040 2025-01-16 01:42:12,686 - INFO - step 10255, loss: 2.073235, best loss: 1.505040 2025-01-16 01:42:12,835 - 
INFO - step 10256, loss: 1.951799, best loss: 1.505040 2025-01-16 01:42:12,985 - INFO - step 10257, loss: 1.789925, best loss: 1.505040 2025-01-16 01:42:13,136 - INFO - step 10258, loss: 1.943990, best loss: 1.505040 2025-01-16 01:42:13,286 - INFO - step 10259, loss: 2.096799, best loss: 1.505040 2025-01-16 01:42:13,436 - INFO - step 10260, loss: 1.772392, best loss: 1.505040 2025-01-16 01:42:13,586 - INFO - step 10261, loss: 1.773016, best loss: 1.505040 2025-01-16 01:42:13,736 - INFO - step 10262, loss: 2.075257, best loss: 1.505040 2025-01-16 01:42:13,887 - INFO - step 10263, loss: 2.027539, best loss: 1.505040 2025-01-16 01:42:14,037 - INFO - step 10264, loss: 1.866388, best loss: 1.505040 2025-01-16 01:42:14,187 - INFO - step 10265, loss: 1.738319, best loss: 1.505040 2025-01-16 01:42:14,337 - INFO - step 10266, loss: 2.015416, best loss: 1.505040 2025-01-16 01:42:14,487 - INFO - step 10267, loss: 2.109908, best loss: 1.505040 2025-01-16 01:42:14,637 - INFO - step 10268, loss: 2.012657, best loss: 1.505040 2025-01-16 01:42:14,787 - INFO - step 10269, loss: 1.893594, best loss: 1.505040 2025-01-16 01:42:14,937 - INFO - step 10270, loss: 2.088014, best loss: 1.505040 2025-01-16 01:42:15,087 - INFO - step 10271, loss: 2.066556, best loss: 1.505040 2025-01-16 01:42:15,238 - INFO - step 10272, loss: 2.053000, best loss: 1.505040 2025-01-16 01:42:15,388 - INFO - step 10273, loss: 1.937637, best loss: 1.505040 2025-01-16 01:42:15,538 - INFO - step 10274, loss: 1.908775, best loss: 1.505040 2025-01-16 01:42:15,689 - INFO - step 10275, loss: 1.828223, best loss: 1.505040 2025-01-16 01:42:15,839 - INFO - step 10276, loss: 2.232309, best loss: 1.505040 2025-01-16 01:42:15,989 - INFO - step 10277, loss: 2.028139, best loss: 1.505040 2025-01-16 01:42:16,139 - INFO - step 10278, loss: 2.060316, best loss: 1.505040 2025-01-16 01:42:16,289 - INFO - step 10279, loss: 1.656079, best loss: 1.505040 2025-01-16 01:42:16,439 - INFO - step 10280, loss: 1.880110, best loss: 1.505040 
2025-01-16 01:42:16,589 - INFO - step 10281, loss: 1.986626, best loss: 1.505040 2025-01-16 01:42:16,739 - INFO - step 10282, loss: 1.963110, best loss: 1.505040 2025-01-16 01:42:16,889 - INFO - step 10283, loss: 1.985598, best loss: 1.505040 2025-01-16 01:42:17,040 - INFO - step 10284, loss: 2.047024, best loss: 1.505040 2025-01-16 01:42:17,190 - INFO - step 10285, loss: 1.924985, best loss: 1.505040 2025-01-16 01:42:17,341 - INFO - step 10286, loss: 1.952412, best loss: 1.505040 2025-01-16 01:42:17,491 - INFO - step 10287, loss: 1.932296, best loss: 1.505040 2025-01-16 01:42:17,641 - INFO - step 10288, loss: 1.657460, best loss: 1.505040 2025-01-16 01:42:17,791 - INFO - step 10289, loss: 2.006972, best loss: 1.505040 2025-01-16 01:42:17,941 - INFO - step 10290, loss: 2.088078, best loss: 1.505040 2025-01-16 01:42:18,091 - INFO - step 10291, loss: 2.084995, best loss: 1.505040 2025-01-16 01:42:18,242 - INFO - step 10292, loss: 2.086894, best loss: 1.505040 2025-01-16 01:42:18,392 - INFO - step 10293, loss: 2.109276, best loss: 1.505040 2025-01-16 01:42:18,542 - INFO - step 10294, loss: 2.031543, best loss: 1.505040 2025-01-16 01:42:18,692 - INFO - step 10295, loss: 1.770282, best loss: 1.505040 2025-01-16 01:42:18,842 - INFO - step 10296, loss: 2.051955, best loss: 1.505040 2025-01-16 01:42:18,992 - INFO - step 10297, loss: 1.734814, best loss: 1.505040 2025-01-16 01:42:19,142 - INFO - step 10298, loss: 1.870047, best loss: 1.505040 2025-01-16 01:42:19,292 - INFO - step 10299, loss: 1.901628, best loss: 1.505040 2025-01-16 01:42:19,442 - INFO - step 10300, loss: 1.911213, best loss: 1.505040 2025-01-16 01:42:19,592 - INFO - step 10301, loss: 1.937727, best loss: 1.505040 2025-01-16 01:42:19,743 - INFO - step 10302, loss: 2.043088, best loss: 1.505040 2025-01-16 01:42:19,893 - INFO - step 10303, loss: 2.086321, best loss: 1.505040 2025-01-16 01:42:20,043 - INFO - step 10304, loss: 2.055211, best loss: 1.505040 2025-01-16 01:42:20,193 - INFO - step 10305, loss: 
2.106011, best loss: 1.505040 2025-01-16 01:42:20,343 - INFO - step 10306, loss: 1.927300, best loss: 1.505040 2025-01-16 01:42:20,493 - INFO - step 10307, loss: 2.042127, best loss: 1.505040 2025-01-16 01:42:20,644 - INFO - step 10308, loss: 1.952368, best loss: 1.505040 2025-01-16 01:42:20,794 - INFO - step 10309, loss: 1.816833, best loss: 1.505040 2025-01-16 01:42:20,944 - INFO - step 10310, loss: 2.107038, best loss: 1.505040 2025-01-16 01:42:21,094 - INFO - step 10311, loss: 1.957194, best loss: 1.505040 2025-01-16 01:42:21,244 - INFO - step 10312, loss: 1.902625, best loss: 1.505040 2025-01-16 01:42:21,394 - INFO - step 10313, loss: 1.844584, best loss: 1.505040 2025-01-16 01:42:21,544 - INFO - step 10314, loss: 1.814765, best loss: 1.505040 2025-01-16 01:42:21,695 - INFO - step 10315, loss: 1.900143, best loss: 1.505040 2025-01-16 01:42:21,845 - INFO - step 10316, loss: 1.730105, best loss: 1.505040 2025-01-16 01:42:21,995 - INFO - step 10317, loss: 1.831764, best loss: 1.505040 2025-01-16 01:42:22,145 - INFO - step 10318, loss: 2.008513, best loss: 1.505040 2025-01-16 01:42:22,295 - INFO - step 10319, loss: 2.016901, best loss: 1.505040 2025-01-16 01:42:22,445 - INFO - step 10320, loss: 1.884125, best loss: 1.505040 2025-01-16 01:42:22,595 - INFO - step 10321, loss: 1.951585, best loss: 1.505040 2025-01-16 01:42:22,746 - INFO - step 10322, loss: 1.805577, best loss: 1.505040 2025-01-16 01:42:22,896 - INFO - step 10323, loss: 2.159084, best loss: 1.505040 2025-01-16 01:42:23,046 - INFO - step 10324, loss: 2.163699, best loss: 1.505040 2025-01-16 01:42:23,197 - INFO - step 10325, loss: 2.148345, best loss: 1.505040 2025-01-16 01:42:23,347 - INFO - step 10326, loss: 2.183207, best loss: 1.505040 2025-01-16 01:42:23,497 - INFO - step 10327, loss: 1.929543, best loss: 1.505040 2025-01-16 01:42:23,647 - INFO - step 10328, loss: 2.040504, best loss: 1.505040 2025-01-16 01:42:23,797 - INFO - step 10329, loss: 2.042663, best loss: 1.505040 2025-01-16 01:42:23,947 - 
INFO - step 10330, loss: 1.971523, best loss: 1.505040 2025-01-16 01:42:24,097 - INFO - step 10331, loss: 2.098875, best loss: 1.505040 2025-01-16 01:42:24,247 - INFO - step 10332, loss: 2.004832, best loss: 1.505040 2025-01-16 01:42:24,398 - INFO - step 10333, loss: 2.286132, best loss: 1.505040 2025-01-16 01:42:24,548 - INFO - step 10334, loss: 1.942437, best loss: 1.505040 2025-01-16 01:42:24,698 - INFO - step 10335, loss: 1.960637, best loss: 1.505040 2025-01-16 01:42:24,848 - INFO - step 10336, loss: 2.000051, best loss: 1.505040 2025-01-16 01:42:24,998 - INFO - step 10337, loss: 1.969816, best loss: 1.505040 2025-01-16 01:42:25,148 - INFO - step 10338, loss: 1.884189, best loss: 1.505040 2025-01-16 01:42:25,299 - INFO - step 10339, loss: 2.149449, best loss: 1.505040 2025-01-16 01:42:25,449 - INFO - step 10340, loss: 2.021140, best loss: 1.505040 2025-01-16 01:42:25,599 - INFO - step 10341, loss: 2.145563, best loss: 1.505040 2025-01-16 01:42:25,749 - INFO - step 10342, loss: 2.051178, best loss: 1.505040 2025-01-16 01:42:25,899 - INFO - step 10343, loss: 1.972897, best loss: 1.505040 2025-01-16 01:42:26,049 - INFO - step 10344, loss: 2.248309, best loss: 1.505040 2025-01-16 01:42:26,200 - INFO - step 10345, loss: 2.035861, best loss: 1.505040 2025-01-16 01:42:26,350 - INFO - step 10346, loss: 2.058780, best loss: 1.505040 2025-01-16 01:42:26,500 - INFO - step 10347, loss: 2.096349, best loss: 1.505040 2025-01-16 01:42:26,650 - INFO - step 10348, loss: 1.986906, best loss: 1.505040 2025-01-16 01:42:26,800 - INFO - step 10349, loss: 2.074317, best loss: 1.505040 2025-01-16 01:42:26,950 - INFO - step 10350, loss: 2.122674, best loss: 1.505040 2025-01-16 01:42:27,100 - INFO - step 10351, loss: 2.031162, best loss: 1.505040 2025-01-16 01:42:27,250 - INFO - step 10352, loss: 2.015678, best loss: 1.505040 2025-01-16 01:42:27,401 - INFO - step 10353, loss: 1.808861, best loss: 1.505040 2025-01-16 01:42:27,551 - INFO - step 10354, loss: 1.794132, best loss: 1.505040 
2025-01-16 01:42:27,701 - INFO - step 10355, loss: 1.855742, best loss: 1.505040 2025-01-16 01:42:27,852 - INFO - step 10356, loss: 2.038626, best loss: 1.505040 2025-01-16 01:42:28,002 - INFO - step 10357, loss: 2.326176, best loss: 1.505040 2025-01-16 01:42:28,152 - INFO - step 10358, loss: 1.883445, best loss: 1.505040 2025-01-16 01:42:28,302 - INFO - step 10359, loss: 1.589802, best loss: 1.505040 2025-01-16 01:42:28,452 - INFO - step 10360, loss: 2.079275, best loss: 1.505040 2025-01-16 01:42:28,602 - INFO - step 10361, loss: 2.030304, best loss: 1.505040 2025-01-16 01:42:28,752 - INFO - step 10362, loss: 2.281379, best loss: 1.505040 2025-01-16 01:42:28,902 - INFO - step 10363, loss: 1.792691, best loss: 1.505040 2025-01-16 01:42:29,052 - INFO - step 10364, loss: 2.099941, best loss: 1.505040 2025-01-16 01:42:29,202 - INFO - step 10365, loss: 1.919066, best loss: 1.505040 2025-01-16 01:42:29,352 - INFO - step 10366, loss: 1.846120, best loss: 1.505040 2025-01-16 01:42:29,503 - INFO - step 10367, loss: 2.026775, best loss: 1.505040 2025-01-16 01:42:29,653 - INFO - step 10368, loss: 1.963888, best loss: 1.505040 2025-01-16 01:42:29,803 - INFO - step 10369, loss: 2.114734, best loss: 1.505040 2025-01-16 01:42:29,953 - INFO - step 10370, loss: 2.021843, best loss: 1.505040 2025-01-16 01:42:30,103 - INFO - step 10371, loss: 1.968504, best loss: 1.505040 2025-01-16 01:42:30,253 - INFO - step 10372, loss: 2.134950, best loss: 1.505040 2025-01-16 01:42:30,403 - INFO - step 10373, loss: 1.847254, best loss: 1.505040 2025-01-16 01:42:30,553 - INFO - step 10374, loss: 1.537677, best loss: 1.505040 2025-01-16 01:42:30,704 - INFO - step 10375, loss: 1.970868, best loss: 1.505040 2025-01-16 01:42:30,854 - INFO - step 10376, loss: 2.279727, best loss: 1.505040 2025-01-16 01:42:31,004 - INFO - step 10377, loss: 2.048460, best loss: 1.505040 2025-01-16 01:42:31,154 - INFO - step 10378, loss: 1.843996, best loss: 1.505040 2025-01-16 01:42:31,304 - INFO - step 10379, loss: 
1.877940, best loss: 1.505040 2025-01-16 01:42:31,455 - INFO - step 10380, loss: 2.118603, best loss: 1.505040 2025-01-16 01:42:31,605 - INFO - step 10381, loss: 2.135835, best loss: 1.505040 2025-01-16 01:42:31,755 - INFO - step 10382, loss: 2.000592, best loss: 1.505040 2025-01-16 01:42:31,905 - INFO - step 10383, loss: 1.860513, best loss: 1.505040 2025-01-16 01:42:32,055 - INFO - step 10384, loss: 2.085658, best loss: 1.505040 2025-01-16 01:42:32,205 - INFO - step 10385, loss: 2.030837, best loss: 1.505040 2025-01-16 01:42:32,355 - INFO - step 10386, loss: 1.802126, best loss: 1.505040 2025-01-16 01:42:32,505 - INFO - step 10387, loss: 1.969134, best loss: 1.505040 2025-01-16 01:42:32,655 - INFO - step 10388, loss: 1.850727, best loss: 1.505040 2025-01-16 01:42:32,805 - INFO - step 10389, loss: 2.112946, best loss: 1.505040 2025-01-16 01:42:32,955 - INFO - step 10390, loss: 1.936119, best loss: 1.505040 2025-01-16 01:42:33,105 - INFO - step 10391, loss: 1.889706, best loss: 1.505040 2025-01-16 01:42:33,255 - INFO - step 10392, loss: 1.768468, best loss: 1.505040 2025-01-16 01:42:33,405 - INFO - step 10393, loss: 1.709961, best loss: 1.505040 2025-01-16 01:42:33,556 - INFO - step 10394, loss: 1.978074, best loss: 1.505040 2025-01-16 01:42:33,706 - INFO - step 10395, loss: 1.922717, best loss: 1.505040 2025-01-16 01:42:33,856 - INFO - step 10396, loss: 2.137654, best loss: 1.505040 2025-01-16 01:42:34,006 - INFO - step 10397, loss: 1.973719, best loss: 1.505040 2025-01-16 01:42:34,156 - INFO - step 10398, loss: 1.971186, best loss: 1.505040 2025-01-16 01:42:34,307 - INFO - step 10399, loss: 2.160132, best loss: 1.505040 2025-01-16 01:42:34,457 - INFO - step 10400, loss: 1.869629, best loss: 1.505040 2025-01-16 01:42:34,608 - INFO - step 10401, loss: 1.785340, best loss: 1.505040 2025-01-16 01:42:34,758 - INFO - step 10402, loss: 1.829543, best loss: 1.505040 2025-01-16 01:42:34,908 - INFO - step 10403, loss: 2.018136, best loss: 1.505040 2025-01-16 01:42:35,058 - 
INFO - step 10404, loss: 2.003943, best loss: 1.505040 2025-01-16 01:42:35,208 - INFO - step 10405, loss: 2.076170, best loss: 1.505040 2025-01-16 01:42:35,358 - INFO - step 10406, loss: 2.259451, best loss: 1.505040 2025-01-16 01:42:35,509 - INFO - step 10407, loss: 2.217927, best loss: 1.505040 2025-01-16 01:42:35,659 - INFO - step 10408, loss: 2.164824, best loss: 1.505040 2025-01-16 01:42:35,809 - INFO - step 10409, loss: 2.109427, best loss: 1.505040 2025-01-16 01:42:35,959 - INFO - step 10410, loss: 2.249987, best loss: 1.505040 2025-01-16 01:42:36,109 - INFO - step 10411, loss: 1.879971, best loss: 1.505040 2025-01-16 01:42:36,259 - INFO - step 10412, loss: 2.154472, best loss: 1.505040 2025-01-16 01:42:36,409 - INFO - step 10413, loss: 2.156137, best loss: 1.505040 2025-01-16 01:42:36,560 - INFO - step 10414, loss: 2.018339, best loss: 1.505040 2025-01-16 01:42:36,710 - INFO - step 10415, loss: 2.002177, best loss: 1.505040 2025-01-16 01:42:36,860 - INFO - step 10416, loss: 2.151662, best loss: 1.505040 2025-01-16 01:42:37,010 - INFO - step 10417, loss: 1.948438, best loss: 1.505040 2025-01-16 01:42:40,580 - INFO - step 10418, loss: 1.458850, best loss: 1.458850 2025-01-16 01:42:40,740 - INFO - step 10419, loss: 2.055037, best loss: 1.458850 2025-01-16 01:42:40,891 - INFO - step 10420, loss: 1.903229, best loss: 1.458850 2025-01-16 01:42:41,041 - INFO - step 10421, loss: 2.066787, best loss: 1.458850 2025-01-16 01:42:41,191 - INFO - step 10422, loss: 2.088413, best loss: 1.458850 2025-01-16 01:42:41,341 - INFO - step 10423, loss: 1.944594, best loss: 1.458850 2025-01-16 01:42:41,492 - INFO - step 10424, loss: 2.055158, best loss: 1.458850 2025-01-16 01:42:41,642 - INFO - step 10425, loss: 2.046773, best loss: 1.458850 2025-01-16 01:42:41,792 - INFO - step 10426, loss: 1.942473, best loss: 1.458850 2025-01-16 01:42:41,942 - INFO - step 10427, loss: 1.895424, best loss: 1.458850 2025-01-16 01:42:42,092 - INFO - step 10428, loss: 1.928988, best loss: 1.458850 
2025-01-16 01:42:42,242 - INFO - step 10429, loss: 1.764818, best loss: 1.458850 2025-01-16 01:42:42,393 - INFO - step 10430, loss: 1.908469, best loss: 1.458850 2025-01-16 01:42:42,543 - INFO - step 10431, loss: 1.915291, best loss: 1.458850 2025-01-16 01:42:42,693 - INFO - step 10432, loss: 2.000532, best loss: 1.458850 2025-01-16 01:42:42,843 - INFO - step 10433, loss: 2.065379, best loss: 1.458850 2025-01-16 01:42:42,993 - INFO - step 10434, loss: 1.889059, best loss: 1.458850 2025-01-16 01:42:43,143 - INFO - step 10435, loss: 1.803522, best loss: 1.458850 2025-01-16 01:42:43,294 - INFO - step 10436, loss: 1.911738, best loss: 1.458850 2025-01-16 01:42:43,444 - INFO - step 10437, loss: 1.900962, best loss: 1.458850 2025-01-16 01:42:43,594 - INFO - step 10438, loss: 1.970720, best loss: 1.458850 2025-01-16 01:42:43,744 - INFO - step 10439, loss: 1.943268, best loss: 1.458850 2025-01-16 01:42:43,894 - INFO - step 10440, loss: 1.966941, best loss: 1.458850 2025-01-16 01:42:44,044 - INFO - step 10441, loss: 1.787654, best loss: 1.458850 2025-01-16 01:42:44,194 - INFO - step 10442, loss: 1.871268, best loss: 1.458850 2025-01-16 01:42:44,345 - INFO - step 10443, loss: 1.908858, best loss: 1.458850 2025-01-16 01:42:44,495 - INFO - step 10444, loss: 1.935806, best loss: 1.458850 2025-01-16 01:42:44,645 - INFO - step 10445, loss: 2.015204, best loss: 1.458850 2025-01-16 01:42:44,796 - INFO - step 10446, loss: 2.075924, best loss: 1.458850 2025-01-16 01:42:44,946 - INFO - step 10447, loss: 1.993780, best loss: 1.458850 2025-01-16 01:42:45,096 - INFO - step 10448, loss: 1.842083, best loss: 1.458850 2025-01-16 01:42:45,246 - INFO - step 10449, loss: 1.894594, best loss: 1.458850 2025-01-16 01:42:45,396 - INFO - step 10450, loss: 2.055708, best loss: 1.458850 2025-01-16 01:42:45,547 - INFO - step 10451, loss: 2.157680, best loss: 1.458850 2025-01-16 01:42:45,697 - INFO - step 10452, loss: 2.075116, best loss: 1.458850 2025-01-16 01:42:45,847 - INFO - step 10453, loss: 
2.218493, best loss: 1.458850 2025-01-16 01:42:45,997 - INFO - step 10454, loss: 2.307758, best loss: 1.458850 2025-01-16 01:42:46,148 - INFO - step 10455, loss: 1.948539, best loss: 1.458850 2025-01-16 01:42:46,298 - INFO - step 10456, loss: 2.181601, best loss: 1.458850 2025-01-16 01:42:46,448 - INFO - step 10457, loss: 1.932170, best loss: 1.458850 2025-01-16 01:42:46,598 - INFO - step 10458, loss: 1.742063, best loss: 1.458850 2025-01-16 01:42:46,748 - INFO - step 10459, loss: 2.096598, best loss: 1.458850 2025-01-16 01:42:46,898 - INFO - step 10460, loss: 2.068635, best loss: 1.458850 2025-01-16 01:42:47,048 - INFO - step 10461, loss: 1.858176, best loss: 1.458850 2025-01-16 01:42:47,199 - INFO - step 10462, loss: 1.761784, best loss: 1.458850 2025-01-16 01:42:47,349 - INFO - step 10463, loss: 2.051392, best loss: 1.458850 2025-01-16 01:42:47,499 - INFO - step 10464, loss: 2.039835, best loss: 1.458850 2025-01-16 01:42:47,649 - INFO - step 10465, loss: 1.970123, best loss: 1.458850 2025-01-16 01:42:47,799 - INFO - step 10466, loss: 2.129889, best loss: 1.458850 2025-01-16 01:42:47,949 - INFO - step 10467, loss: 1.864070, best loss: 1.458850 2025-01-16 01:42:48,099 - INFO - step 10468, loss: 1.681661, best loss: 1.458850 2025-01-16 01:42:48,249 - INFO - step 10469, loss: 2.174719, best loss: 1.458850 2025-01-16 01:42:48,399 - INFO - step 10470, loss: 2.232186, best loss: 1.458850 2025-01-16 01:42:48,550 - INFO - step 10471, loss: 2.282753, best loss: 1.458850 2025-01-16 01:42:48,700 - INFO - step 10472, loss: 2.042411, best loss: 1.458850 2025-01-16 01:42:48,850 - INFO - step 10473, loss: 1.988017, best loss: 1.458850 2025-01-16 01:42:49,001 - INFO - step 10474, loss: 2.102321, best loss: 1.458850 2025-01-16 01:42:49,151 - INFO - step 10475, loss: 1.899256, best loss: 1.458850 2025-01-16 01:42:49,301 - INFO - step 10476, loss: 1.906390, best loss: 1.458850 2025-01-16 01:42:49,452 - INFO - step 10477, loss: 2.173540, best loss: 1.458850 2025-01-16 01:42:49,603 - 
INFO - step 10478, loss: 1.752760, best loss: 1.458850 2025-01-16 01:42:49,753 - INFO - step 10479, loss: 1.614110, best loss: 1.458850 2025-01-16 01:42:49,903 - INFO - step 10480, loss: 2.012209, best loss: 1.458850 2025-01-16 01:42:50,053 - INFO - step 10481, loss: 1.938257, best loss: 1.458850 2025-01-16 01:42:50,203 - INFO - step 10482, loss: 2.049692, best loss: 1.458850 2025-01-16 01:42:50,353 - INFO - step 10483, loss: 1.654527, best loss: 1.458850 2025-01-16 01:42:50,503 - INFO - step 10484, loss: 1.600344, best loss: 1.458850 2025-01-16 01:42:50,653 - INFO - step 10485, loss: 1.533121, best loss: 1.458850 2025-01-16 01:42:50,803 - INFO - step 10486, loss: 1.689740, best loss: 1.458850 2025-01-16 01:42:50,953 - INFO - step 10487, loss: 1.916122, best loss: 1.458850 2025-01-16 01:42:51,103 - INFO - step 10488, loss: 2.006930, best loss: 1.458850 2025-01-16 01:42:51,253 - INFO - step 10489, loss: 2.047657, best loss: 1.458850 2025-01-16 01:42:51,403 - INFO - step 10490, loss: 2.004600, best loss: 1.458850 2025-01-16 01:42:51,553 - INFO - step 10491, loss: 1.981963, best loss: 1.458850 2025-01-16 01:42:51,703 - INFO - step 10492, loss: 2.034831, best loss: 1.458850 2025-01-16 01:42:51,853 - INFO - step 10493, loss: 1.805887, best loss: 1.458850 2025-01-16 01:42:52,003 - INFO - step 10494, loss: 2.017539, best loss: 1.458850 2025-01-16 01:42:52,153 - INFO - step 10495, loss: 1.921114, best loss: 1.458850 2025-01-16 01:42:52,303 - INFO - step 10496, loss: 1.766652, best loss: 1.458850 2025-01-16 01:42:52,453 - INFO - step 10497, loss: 1.665373, best loss: 1.458850 2025-01-16 01:42:52,603 - INFO - step 10498, loss: 1.716007, best loss: 1.458850 2025-01-16 01:42:52,753 - INFO - step 10499, loss: 1.868229, best loss: 1.458850 2025-01-16 01:42:52,903 - INFO - step 10500, loss: 1.699609, best loss: 1.458850 2025-01-16 01:42:53,053 - INFO - step 10501, loss: 1.758557, best loss: 1.458850 2025-01-16 01:42:53,203 - INFO - step 10502, loss: 1.798057, best loss: 1.458850 
2025-01-16 01:42:53,352 - INFO - step 10503, loss: 1.632759, best loss: 1.458850 2025-01-16 01:42:53,502 - INFO - step 10504, loss: 1.776896, best loss: 1.458850 2025-01-16 01:42:53,652 - INFO - step 10505, loss: 1.982862, best loss: 1.458850 2025-01-16 01:42:53,802 - INFO - step 10506, loss: 1.814627, best loss: 1.458850 2025-01-16 01:42:53,952 - INFO - step 10507, loss: 1.866999, best loss: 1.458850 2025-01-16 01:42:54,102 - INFO - step 10508, loss: 1.818371, best loss: 1.458850 2025-01-16 01:42:54,252 - INFO - step 10509, loss: 1.981406, best loss: 1.458850 2025-01-16 01:42:54,402 - INFO - step 10510, loss: 1.779818, best loss: 1.458850 2025-01-16 01:42:54,552 - INFO - step 10511, loss: 1.737997, best loss: 1.458850 2025-01-16 01:42:54,703 - INFO - step 10512, loss: 1.830131, best loss: 1.458850 2025-01-16 01:42:54,853 - INFO - step 10513, loss: 1.892835, best loss: 1.458850 2025-01-16 01:42:55,003 - INFO - step 10514, loss: 2.114071, best loss: 1.458850 2025-01-16 01:42:55,153 - INFO - step 10515, loss: 2.016452, best loss: 1.458850 2025-01-16 01:42:55,302 - INFO - step 10516, loss: 2.014726, best loss: 1.458850 2025-01-16 01:42:55,452 - INFO - step 10517, loss: 1.958683, best loss: 1.458850 2025-01-16 01:42:55,603 - INFO - step 10518, loss: 1.997629, best loss: 1.458850 2025-01-16 01:42:55,753 - INFO - step 10519, loss: 1.913294, best loss: 1.458850 2025-01-16 01:42:55,903 - INFO - step 10520, loss: 1.815839, best loss: 1.458850 2025-01-16 01:42:56,053 - INFO - step 10521, loss: 1.931249, best loss: 1.458850 2025-01-16 01:42:56,203 - INFO - step 10522, loss: 1.859360, best loss: 1.458850 2025-01-16 01:42:56,353 - INFO - step 10523, loss: 1.784266, best loss: 1.458850 2025-01-16 01:42:56,503 - INFO - step 10524, loss: 1.880250, best loss: 1.458850 2025-01-16 01:42:56,654 - INFO - step 10525, loss: 1.779774, best loss: 1.458850 2025-01-16 01:42:56,804 - INFO - step 10526, loss: 1.937622, best loss: 1.458850 2025-01-16 01:43:00,314 - INFO - step 10527, loss: 
1.458663, best loss: 1.458663 2025-01-16 01:43:00,465 - INFO - step 10528, loss: 1.890511, best loss: 1.458663 2025-01-16 01:43:00,615 - INFO - step 10529, loss: 1.896309, best loss: 1.458663 2025-01-16 01:43:00,765 - INFO - step 10530, loss: 1.726018, best loss: 1.458663 2025-01-16 01:43:00,916 - INFO - step 10531, loss: 1.840666, best loss: 1.458663 2025-01-16 01:43:01,066 - INFO - step 10532, loss: 1.741163, best loss: 1.458663 2025-01-16 01:43:01,216 - INFO - step 10533, loss: 1.835485, best loss: 1.458663 2025-01-16 01:43:01,367 - INFO - step 10534, loss: 1.758771, best loss: 1.458663 2025-01-16 01:43:01,517 - INFO - step 10535, loss: 1.759015, best loss: 1.458663 2025-01-16 01:43:01,667 - INFO - step 10536, loss: 1.800732, best loss: 1.458663 2025-01-16 01:43:01,817 - INFO - step 10537, loss: 1.905011, best loss: 1.458663 2025-01-16 01:43:01,967 - INFO - step 10538, loss: 1.795497, best loss: 1.458663 2025-01-16 01:43:02,117 - INFO - step 10539, loss: 1.774363, best loss: 1.458663 2025-01-16 01:43:02,267 - INFO - step 10540, loss: 1.636447, best loss: 1.458663 2025-01-16 01:43:02,417 - INFO - step 10541, loss: 1.685589, best loss: 1.458663 2025-01-16 01:43:02,567 - INFO - step 10542, loss: 1.855304, best loss: 1.458663 2025-01-16 01:43:02,717 - INFO - step 10543, loss: 1.733691, best loss: 1.458663 2025-01-16 01:43:02,867 - INFO - step 10544, loss: 1.784122, best loss: 1.458663 2025-01-16 01:43:03,017 - INFO - step 10545, loss: 1.496599, best loss: 1.458663 2025-01-16 01:43:03,168 - INFO - step 10546, loss: 1.499099, best loss: 1.458663 2025-01-16 01:43:06,662 - INFO - step 10547, loss: 1.445518, best loss: 1.445518 2025-01-16 01:43:06,812 - INFO - step 10548, loss: 1.874563, best loss: 1.445518 2025-01-16 01:43:06,962 - INFO - step 10549, loss: 1.831473, best loss: 1.445518 2025-01-16 01:43:07,112 - INFO - step 10550, loss: 1.949887, best loss: 1.445518 2025-01-16 01:43:07,262 - INFO - step 10551, loss: 2.018134, best loss: 1.445518 2025-01-16 01:43:07,412 - 
INFO - step 10552, loss: 1.895540, best loss: 1.445518 2025-01-16 01:43:07,562 - INFO - step 10553, loss: 1.623497, best loss: 1.445518 2025-01-16 01:43:07,713 - INFO - step 10554, loss: 1.741276, best loss: 1.445518 2025-01-16 01:43:07,862 - INFO - step 10555, loss: 1.965941, best loss: 1.445518 2025-01-16 01:43:08,013 - INFO - step 10556, loss: 1.893555, best loss: 1.445518 2025-01-16 01:43:08,163 - INFO - step 10557, loss: 1.465718, best loss: 1.445518 2025-01-16 01:43:08,313 - INFO - step 10558, loss: 1.705020, best loss: 1.445518 2025-01-16 01:43:08,463 - INFO - step 10559, loss: 1.614090, best loss: 1.445518 2025-01-16 01:43:08,613 - INFO - step 10560, loss: 1.872423, best loss: 1.445518 2025-01-16 01:43:08,763 - INFO - step 10561, loss: 1.923330, best loss: 1.445518 2025-01-16 01:43:08,913 - INFO - step 10562, loss: 1.943138, best loss: 1.445518 2025-01-16 01:43:09,063 - INFO - step 10563, loss: 1.974927, best loss: 1.445518 2025-01-16 01:43:09,213 - INFO - step 10564, loss: 1.948819, best loss: 1.445518 2025-01-16 01:43:09,362 - INFO - step 10565, loss: 1.634099, best loss: 1.445518 2025-01-16 01:43:09,513 - INFO - step 10566, loss: 2.040241, best loss: 1.445518 2025-01-16 01:43:09,663 - INFO - step 10567, loss: 1.823190, best loss: 1.445518 2025-01-16 01:43:09,813 - INFO - step 10568, loss: 2.053376, best loss: 1.445518 2025-01-16 01:43:09,963 - INFO - step 10569, loss: 2.012651, best loss: 1.445518 2025-01-16 01:43:10,113 - INFO - step 10570, loss: 1.785002, best loss: 1.445518 2025-01-16 01:43:10,263 - INFO - step 10571, loss: 1.838688, best loss: 1.445518 2025-01-16 01:43:10,413 - INFO - step 10572, loss: 1.659752, best loss: 1.445518 2025-01-16 01:43:10,563 - INFO - step 10573, loss: 2.027650, best loss: 1.445518 2025-01-16 01:43:10,714 - INFO - step 10574, loss: 2.064050, best loss: 1.445518 2025-01-16 01:43:10,864 - INFO - step 10575, loss: 2.003112, best loss: 1.445518 2025-01-16 01:43:11,014 - INFO - step 10576, loss: 2.008823, best loss: 1.445518 
2025-01-16 01:43:11,164 - INFO - step 10577, loss: 1.870769, best loss: 1.445518 2025-01-16 01:43:11,314 - INFO - step 10578, loss: 1.911175, best loss: 1.445518 2025-01-16 01:43:11,465 - INFO - step 10579, loss: 1.874573, best loss: 1.445518 2025-01-16 01:43:11,614 - INFO - step 10580, loss: 1.730317, best loss: 1.445518 2025-01-16 01:43:11,765 - INFO - step 10581, loss: 2.081269, best loss: 1.445518 2025-01-16 01:43:11,915 - INFO - step 10582, loss: 1.639289, best loss: 1.445518 2025-01-16 01:43:12,065 - INFO - step 10583, loss: 1.751678, best loss: 1.445518 2025-01-16 01:43:12,215 - INFO - step 10584, loss: 1.884538, best loss: 1.445518 2025-01-16 01:43:12,365 - INFO - step 10585, loss: 2.080525, best loss: 1.445518 2025-01-16 01:43:12,515 - INFO - step 10586, loss: 1.874163, best loss: 1.445518 2025-01-16 01:43:12,665 - INFO - step 10587, loss: 1.785069, best loss: 1.445518 2025-01-16 01:43:12,815 - INFO - step 10588, loss: 1.898075, best loss: 1.445518 2025-01-16 01:43:12,965 - INFO - step 10589, loss: 1.975857, best loss: 1.445518 2025-01-16 01:43:13,116 - INFO - step 10590, loss: 1.744799, best loss: 1.445518 2025-01-16 01:43:13,266 - INFO - step 10591, loss: 1.703898, best loss: 1.445518 2025-01-16 01:43:13,416 - INFO - step 10592, loss: 1.895207, best loss: 1.445518 2025-01-16 01:43:13,566 - INFO - step 10593, loss: 1.962841, best loss: 1.445518 2025-01-16 01:43:13,716 - INFO - step 10594, loss: 1.697568, best loss: 1.445518 2025-01-16 01:43:13,866 - INFO - step 10595, loss: 1.659901, best loss: 1.445518 2025-01-16 01:43:14,016 - INFO - step 10596, loss: 1.931680, best loss: 1.445518 2025-01-16 01:43:14,166 - INFO - step 10597, loss: 2.093341, best loss: 1.445518 2025-01-16 01:43:14,317 - INFO - step 10598, loss: 1.916094, best loss: 1.445518 2025-01-16 01:43:14,467 - INFO - step 10599, loss: 1.797176, best loss: 1.445518 2025-01-16 01:43:14,617 - INFO - step 10600, loss: 2.000426, best loss: 1.445518 2025-01-16 01:43:14,767 - INFO - step 10601, loss: 
1.936359, best loss: 1.445518 2025-01-16 01:43:14,917 - INFO - step 10602, loss: 1.953411, best loss: 1.445518 2025-01-16 01:43:15,067 - INFO - step 10603, loss: 1.813260, best loss: 1.445518 2025-01-16 01:43:15,217 - INFO - step 10604, loss: 1.790366, best loss: 1.445518 2025-01-16 01:43:15,367 - INFO - step 10605, loss: 1.782766, best loss: 1.445518 2025-01-16 01:43:15,517 - INFO - step 10606, loss: 2.155684, best loss: 1.445518 2025-01-16 01:43:15,667 - INFO - step 10607, loss: 1.974820, best loss: 1.445518 2025-01-16 01:43:15,818 - INFO - step 10608, loss: 1.930446, best loss: 1.445518 2025-01-16 01:43:15,968 - INFO - step 10609, loss: 1.611307, best loss: 1.445518 2025-01-16 01:43:16,118 - INFO - step 10610, loss: 1.681960, best loss: 1.445518 2025-01-16 01:43:16,268 - INFO - step 10611, loss: 1.888124, best loss: 1.445518 2025-01-16 01:43:16,418 - INFO - step 10612, loss: 1.879625, best loss: 1.445518 2025-01-16 01:43:16,568 - INFO - step 10613, loss: 1.889173, best loss: 1.445518 2025-01-16 01:43:16,718 - INFO - step 10614, loss: 1.964906, best loss: 1.445518 2025-01-16 01:43:16,868 - INFO - step 10615, loss: 1.756853, best loss: 1.445518 2025-01-16 01:43:17,018 - INFO - step 10616, loss: 1.873798, best loss: 1.445518 2025-01-16 01:43:17,168 - INFO - step 10617, loss: 1.842470, best loss: 1.445518 2025-01-16 01:43:17,318 - INFO - step 10618, loss: 1.530671, best loss: 1.445518 2025-01-16 01:43:17,469 - INFO - step 10619, loss: 1.860245, best loss: 1.445518 2025-01-16 01:43:17,619 - INFO - step 10620, loss: 1.915096, best loss: 1.445518 2025-01-16 01:43:17,769 - INFO - step 10621, loss: 2.010013, best loss: 1.445518 2025-01-16 01:43:17,919 - INFO - step 10622, loss: 1.928875, best loss: 1.445518 2025-01-16 01:43:18,069 - INFO - step 10623, loss: 1.912178, best loss: 1.445518 2025-01-16 01:43:18,219 - INFO - step 10624, loss: 1.910160, best loss: 1.445518 2025-01-16 01:43:18,369 - INFO - step 10625, loss: 1.619133, best loss: 1.445518 2025-01-16 01:43:18,519 - 
INFO - step 10626, loss: 1.943703, best loss: 1.445518 2025-01-16 01:43:18,669 - INFO - step 10627, loss: 1.612191, best loss: 1.445518 2025-01-16 01:43:18,820 - INFO - step 10628, loss: 1.804158, best loss: 1.445518 2025-01-16 01:43:18,970 - INFO - step 10629, loss: 1.769083, best loss: 1.445518 2025-01-16 01:43:19,120 - INFO - step 10630, loss: 1.798131, best loss: 1.445518 2025-01-16 01:43:19,270 - INFO - step 10631, loss: 1.822464, best loss: 1.445518 2025-01-16 01:43:19,420 - INFO - step 10632, loss: 1.947630, best loss: 1.445518 2025-01-16 01:43:19,571 - INFO - step 10633, loss: 1.965795, best loss: 1.445518 2025-01-16 01:43:19,721 - INFO - step 10634, loss: 1.919944, best loss: 1.445518 2025-01-16 01:43:19,871 - INFO - step 10635, loss: 1.960798, best loss: 1.445518 2025-01-16 01:43:20,022 - INFO - step 10636, loss: 1.900179, best loss: 1.445518 2025-01-16 01:43:20,172 - INFO - step 10637, loss: 1.933614, best loss: 1.445518 2025-01-16 01:43:20,322 - INFO - step 10638, loss: 1.859327, best loss: 1.445518 2025-01-16 01:43:20,472 - INFO - step 10639, loss: 1.665113, best loss: 1.445518 2025-01-16 01:43:20,622 - INFO - step 10640, loss: 1.947600, best loss: 1.445518 2025-01-16 01:43:20,772 - INFO - step 10641, loss: 1.821214, best loss: 1.445518 2025-01-16 01:43:20,923 - INFO - step 10642, loss: 1.737476, best loss: 1.445518 2025-01-16 01:43:21,073 - INFO - step 10643, loss: 1.761638, best loss: 1.445518 2025-01-16 01:43:21,223 - INFO - step 10644, loss: 1.680323, best loss: 1.445518 2025-01-16 01:43:21,373 - INFO - step 10645, loss: 1.858417, best loss: 1.445518 2025-01-16 01:43:21,524 - INFO - step 10646, loss: 1.661833, best loss: 1.445518 2025-01-16 01:43:21,674 - INFO - step 10647, loss: 1.687556, best loss: 1.445518 2025-01-16 01:43:21,824 - INFO - step 10648, loss: 1.834970, best loss: 1.445518 2025-01-16 01:43:21,974 - INFO - step 10649, loss: 1.874028, best loss: 1.445518 2025-01-16 01:43:22,124 - INFO - step 10650, loss: 1.748517, best loss: 1.445518 
2025-01-16 01:43:22,274 - INFO - step 10651, loss: 1.822558, best loss: 1.445518 2025-01-16 01:43:22,424 - INFO - step 10652, loss: 1.752137, best loss: 1.445518 2025-01-16 01:43:22,575 - INFO - step 10653, loss: 2.049760, best loss: 1.445518 2025-01-16 01:43:22,725 - INFO - step 10654, loss: 2.042780, best loss: 1.445518 2025-01-16 01:43:22,875 - INFO - step 10655, loss: 2.057872, best loss: 1.445518 2025-01-16 01:43:23,025 - INFO - step 10656, loss: 2.076916, best loss: 1.445518 2025-01-16 01:43:23,175 - INFO - step 10657, loss: 1.847661, best loss: 1.445518 2025-01-16 01:43:23,325 - INFO - step 10658, loss: 1.943709, best loss: 1.445518 2025-01-16 01:43:23,475 - INFO - step 10659, loss: 1.904797, best loss: 1.445518 2025-01-16 01:43:23,626 - INFO - step 10660, loss: 1.830340, best loss: 1.445518 2025-01-16 01:43:23,776 - INFO - step 10661, loss: 1.972912, best loss: 1.445518 2025-01-16 01:43:23,926 - INFO - step 10662, loss: 1.856160, best loss: 1.445518 2025-01-16 01:43:24,076 - INFO - step 10663, loss: 2.102538, best loss: 1.445518 2025-01-16 01:43:24,226 - INFO - step 10664, loss: 1.784396, best loss: 1.445518 2025-01-16 01:43:24,376 - INFO - step 10665, loss: 1.784458, best loss: 1.445518 2025-01-16 01:43:24,526 - INFO - step 10666, loss: 1.841309, best loss: 1.445518 2025-01-16 01:43:24,676 - INFO - step 10667, loss: 1.782410, best loss: 1.445518 2025-01-16 01:43:24,826 - INFO - step 10668, loss: 1.809380, best loss: 1.445518 2025-01-16 01:43:24,976 - INFO - step 10669, loss: 1.984401, best loss: 1.445518 2025-01-16 01:43:25,127 - INFO - step 10670, loss: 1.915696, best loss: 1.445518 2025-01-16 01:43:25,277 - INFO - step 10671, loss: 2.086844, best loss: 1.445518 2025-01-16 01:43:25,427 - INFO - step 10672, loss: 1.913635, best loss: 1.445518 2025-01-16 01:43:25,577 - INFO - step 10673, loss: 1.854609, best loss: 1.445518 2025-01-16 01:43:25,727 - INFO - step 10674, loss: 2.058993, best loss: 1.445518 2025-01-16 01:43:25,877 - INFO - step 10675, loss: 
1.904773, best loss: 1.445518 2025-01-16 01:43:26,027 - INFO - step 10676, loss: 1.851444, best loss: 1.445518 2025-01-16 01:43:26,177 - INFO - step 10677, loss: 1.875369, best loss: 1.445518 2025-01-16 01:43:26,327 - INFO - step 10678, loss: 1.821480, best loss: 1.445518 2025-01-16 01:43:26,478 - INFO - step 10679, loss: 1.898489, best loss: 1.445518 2025-01-16 01:43:26,628 - INFO - step 10680, loss: 1.960382, best loss: 1.445518 2025-01-16 01:43:26,778 - INFO - step 10681, loss: 1.853379, best loss: 1.445518 2025-01-16 01:43:26,928 - INFO - step 10682, loss: 1.986484, best loss: 1.445518 2025-01-16 01:43:27,078 - INFO - step 10683, loss: 1.736357, best loss: 1.445518 2025-01-16 01:43:27,228 - INFO - step 10684, loss: 1.688766, best loss: 1.445518 2025-01-16 01:43:27,378 - INFO - step 10685, loss: 1.731811, best loss: 1.445518 2025-01-16 01:43:27,529 - INFO - step 10686, loss: 1.913349, best loss: 1.445518 2025-01-16 01:43:27,679 - INFO - step 10687, loss: 2.180637, best loss: 1.445518 2025-01-16 01:43:27,829 - INFO - step 10688, loss: 1.810486, best loss: 1.445518 2025-01-16 01:43:27,979 - INFO - step 10689, loss: 1.514831, best loss: 1.445518 2025-01-16 01:43:28,129 - INFO - step 10690, loss: 1.922579, best loss: 1.445518 2025-01-16 01:43:28,280 - INFO - step 10691, loss: 1.919869, best loss: 1.445518 2025-01-16 01:43:28,430 - INFO - step 10692, loss: 2.178594, best loss: 1.445518 2025-01-16 01:43:28,580 - INFO - step 10693, loss: 1.639860, best loss: 1.445518 2025-01-16 01:43:28,730 - INFO - step 10694, loss: 2.010376, best loss: 1.445518 2025-01-16 01:43:28,880 - INFO - step 10695, loss: 1.720567, best loss: 1.445518 2025-01-16 01:43:29,031 - INFO - step 10696, loss: 1.719921, best loss: 1.445518 2025-01-16 01:43:29,181 - INFO - step 10697, loss: 1.931144, best loss: 1.445518 2025-01-16 01:43:29,331 - INFO - step 10698, loss: 1.866043, best loss: 1.445518 2025-01-16 01:43:29,481 - INFO - step 10699, loss: 1.981816, best loss: 1.445518 2025-01-16 01:43:29,631 - 
INFO - step 10700, loss: 1.924322, best loss: 1.445518 2025-01-16 01:43:29,782 - INFO - step 10701, loss: 1.873072, best loss: 1.445518 2025-01-16 01:43:29,932 - INFO - step 10702, loss: 2.064095, best loss: 1.445518 2025-01-16 01:43:30,082 - INFO - step 10703, loss: 1.770975, best loss: 1.445518 2025-01-16 01:43:30,232 - INFO - step 10704, loss: 1.459651, best loss: 1.445518 2025-01-16 01:43:30,382 - INFO - step 10705, loss: 1.845472, best loss: 1.445518 2025-01-16 01:43:30,533 - INFO - step 10706, loss: 2.122169, best loss: 1.445518 2025-01-16 01:43:30,683 - INFO - step 10707, loss: 1.859901, best loss: 1.445518 2025-01-16 01:43:30,833 - INFO - step 10708, loss: 1.722856, best loss: 1.445518 2025-01-16 01:43:30,984 - INFO - step 10709, loss: 1.787471, best loss: 1.445518 2025-01-16 01:43:31,133 - INFO - step 10710, loss: 2.036051, best loss: 1.445518 2025-01-16 01:43:31,284 - INFO - step 10711, loss: 2.002414, best loss: 1.445518 2025-01-16 01:43:31,434 - INFO - step 10712, loss: 1.847251, best loss: 1.445518 2025-01-16 01:43:31,584 - INFO - step 10713, loss: 1.783780, best loss: 1.445518 2025-01-16 01:43:31,734 - INFO - step 10714, loss: 1.962722, best loss: 1.445518 2025-01-16 01:43:31,884 - INFO - step 10715, loss: 1.954656, best loss: 1.445518 2025-01-16 01:43:32,034 - INFO - step 10716, loss: 1.710159, best loss: 1.445518 2025-01-16 01:43:32,184 - INFO - step 10717, loss: 1.904144, best loss: 1.445518 2025-01-16 01:43:32,334 - INFO - step 10718, loss: 1.828018, best loss: 1.445518 2025-01-16 01:43:32,484 - INFO - step 10719, loss: 1.967035, best loss: 1.445518 2025-01-16 01:43:32,634 - INFO - step 10720, loss: 1.823475, best loss: 1.445518 2025-01-16 01:43:32,785 - INFO - step 10721, loss: 1.781334, best loss: 1.445518 2025-01-16 01:43:32,935 - INFO - step 10722, loss: 1.699484, best loss: 1.445518 2025-01-16 01:43:33,085 - INFO - step 10723, loss: 1.569680, best loss: 1.445518 2025-01-16 01:43:33,235 - INFO - step 10724, loss: 1.862749, best loss: 1.445518 
2025-01-16 01:43:33,385 - INFO - step 10725, loss: 1.812253, best loss: 1.445518 2025-01-16 01:43:33,535 - INFO - step 10726, loss: 2.040392, best loss: 1.445518 2025-01-16 01:43:33,685 - INFO - step 10727, loss: 1.873975, best loss: 1.445518 2025-01-16 01:43:33,835 - INFO - step 10728, loss: 1.916031, best loss: 1.445518 2025-01-16 01:43:33,985 - INFO - step 10729, loss: 1.970667, best loss: 1.445518 2025-01-16 01:43:34,135 - INFO - step 10730, loss: 1.754023, best loss: 1.445518 2025-01-16 01:43:34,285 - INFO - step 10731, loss: 1.660943, best loss: 1.445518 2025-01-16 01:43:34,435 - INFO - step 10732, loss: 1.796039, best loss: 1.445518 2025-01-16 01:43:34,586 - INFO - step 10733, loss: 1.913360, best loss: 1.445518 2025-01-16 01:43:34,736 - INFO - step 10734, loss: 1.910790, best loss: 1.445518 2025-01-16 01:43:34,886 - INFO - step 10735, loss: 1.895332, best loss: 1.445518 2025-01-16 01:43:35,036 - INFO - step 10736, loss: 2.120402, best loss: 1.445518 2025-01-16 01:43:35,186 - INFO - step 10737, loss: 2.102096, best loss: 1.445518 2025-01-16 01:43:35,336 - INFO - step 10738, loss: 2.025268, best loss: 1.445518 2025-01-16 01:43:35,486 - INFO - step 10739, loss: 1.995910, best loss: 1.445518 2025-01-16 01:43:35,636 - INFO - step 10740, loss: 2.137555, best loss: 1.445518 2025-01-16 01:43:35,786 - INFO - step 10741, loss: 1.804388, best loss: 1.445518 2025-01-16 01:43:35,936 - INFO - step 10742, loss: 2.088841, best loss: 1.445518 2025-01-16 01:43:36,086 - INFO - step 10743, loss: 2.062171, best loss: 1.445518 2025-01-16 01:43:36,236 - INFO - step 10744, loss: 1.969573, best loss: 1.445518 2025-01-16 01:43:36,386 - INFO - step 10745, loss: 1.898897, best loss: 1.445518 2025-01-16 01:43:36,536 - INFO - step 10746, loss: 2.028079, best loss: 1.445518 2025-01-16 01:43:36,686 - INFO - step 10747, loss: 1.865515, best loss: 1.445518 2025-01-16 01:43:40,209 - INFO - step 10748, loss: 1.435994, best loss: 1.435994 2025-01-16 01:43:40,371 - INFO - step 10749, loss: 
1.925574, best loss: 1.435994 2025-01-16 01:43:40,524 - INFO - step 10750, loss: 1.812093, best loss: 1.435994 2025-01-16 01:43:40,674 - INFO - step 10751, loss: 1.969412, best loss: 1.435994 2025-01-16 01:43:40,825 - INFO - step 10752, loss: 2.079440, best loss: 1.435994 2025-01-16 01:43:40,975 - INFO - step 10753, loss: 1.834892, best loss: 1.435994 2025-01-16 01:43:41,125 - INFO - step 10754, loss: 1.922368, best loss: 1.435994 2025-01-16 01:43:41,275 - INFO - step 10755, loss: 1.947136, best loss: 1.435994 2025-01-16 01:43:41,425 - INFO - step 10756, loss: 1.892552, best loss: 1.435994 2025-01-16 01:43:41,575 - INFO - step 10757, loss: 1.870997, best loss: 1.435994 2025-01-16 01:43:41,725 - INFO - step 10758, loss: 1.885560, best loss: 1.435994 2025-01-16 01:43:41,875 - INFO - step 10759, loss: 1.659497, best loss: 1.435994 2025-01-16 01:43:42,026 - INFO - step 10760, loss: 1.827427, best loss: 1.435994 2025-01-16 01:43:42,176 - INFO - step 10761, loss: 1.878698, best loss: 1.435994 2025-01-16 01:43:42,326 - INFO - step 10762, loss: 1.906357, best loss: 1.435994 2025-01-16 01:43:42,477 - INFO - step 10763, loss: 1.941419, best loss: 1.435994 2025-01-16 01:43:42,627 - INFO - step 10764, loss: 1.855395, best loss: 1.435994 2025-01-16 01:43:42,777 - INFO - step 10765, loss: 1.700745, best loss: 1.435994 2025-01-16 01:43:42,927 - INFO - step 10766, loss: 1.813703, best loss: 1.435994 2025-01-16 01:43:43,077 - INFO - step 10767, loss: 1.762844, best loss: 1.435994 2025-01-16 01:43:43,227 - INFO - step 10768, loss: 1.837544, best loss: 1.435994 2025-01-16 01:43:43,378 - INFO - step 10769, loss: 1.804865, best loss: 1.435994 2025-01-16 01:43:43,528 - INFO - step 10770, loss: 1.853736, best loss: 1.435994 2025-01-16 01:43:43,678 - INFO - step 10771, loss: 1.715631, best loss: 1.435994 2025-01-16 01:43:43,828 - INFO - step 10772, loss: 1.779785, best loss: 1.435994 2025-01-16 01:43:43,978 - INFO - step 10773, loss: 1.811108, best loss: 1.435994 2025-01-16 01:43:44,128 - 
INFO - step 10774, loss: 1.834642, best loss: 1.435994 2025-01-16 01:43:44,278 - INFO - step 10775, loss: 2.002497, best loss: 1.435994 2025-01-16 01:43:44,428 - INFO - step 10776, loss: 2.039607, best loss: 1.435994 2025-01-16 01:43:44,578 - INFO - step 10777, loss: 1.899597, best loss: 1.435994 2025-01-16 01:43:44,729 - INFO - step 10778, loss: 1.855011, best loss: 1.435994 2025-01-16 01:43:44,879 - INFO - step 10779, loss: 1.893831, best loss: 1.435994 2025-01-16 01:43:45,029 - INFO - step 10780, loss: 2.003550, best loss: 1.435994 2025-01-16 01:43:45,179 - INFO - step 10781, loss: 2.019326, best loss: 1.435994 2025-01-16 01:43:45,329 - INFO - step 10782, loss: 1.974281, best loss: 1.435994 2025-01-16 01:43:45,479 - INFO - step 10783, loss: 2.076078, best loss: 1.435994 2025-01-16 01:43:45,629 - INFO - step 10784, loss: 2.170719, best loss: 1.435994 2025-01-16 01:43:45,779 - INFO - step 10785, loss: 1.836776, best loss: 1.435994 2025-01-16 01:43:45,929 - INFO - step 10786, loss: 2.018486, best loss: 1.435994 2025-01-16 01:43:46,079 - INFO - step 10787, loss: 1.787709, best loss: 1.435994 2025-01-16 01:43:46,229 - INFO - step 10788, loss: 1.672525, best loss: 1.435994 2025-01-16 01:43:46,379 - INFO - step 10789, loss: 2.037034, best loss: 1.435994 2025-01-16 01:43:46,529 - INFO - step 10790, loss: 2.042494, best loss: 1.435994 2025-01-16 01:43:46,679 - INFO - step 10791, loss: 1.696958, best loss: 1.435994 2025-01-16 01:43:46,829 - INFO - step 10792, loss: 1.731450, best loss: 1.435994 2025-01-16 01:43:46,979 - INFO - step 10793, loss: 1.939461, best loss: 1.435994 2025-01-16 01:43:47,130 - INFO - step 10794, loss: 1.995605, best loss: 1.435994 2025-01-16 01:43:47,279 - INFO - step 10795, loss: 1.869568, best loss: 1.435994 2025-01-16 01:43:47,429 - INFO - step 10796, loss: 1.987351, best loss: 1.435994 2025-01-16 01:43:47,579 - INFO - step 10797, loss: 1.732680, best loss: 1.435994 2025-01-16 01:43:47,729 - INFO - step 10798, loss: 1.614946, best loss: 1.435994 
2025-01-16 01:43:47,879 - INFO - step 10799, loss: 2.074698, best loss: 1.435994 2025-01-16 01:43:48,029 - INFO - step 10800, loss: 2.188763, best loss: 1.435994 2025-01-16 01:43:48,178 - INFO - step 10801, loss: 2.184359, best loss: 1.435994 2025-01-16 01:43:48,328 - INFO - step 10802, loss: 2.011331, best loss: 1.435994 2025-01-16 01:43:48,477 - INFO - step 10803, loss: 1.847684, best loss: 1.435994 2025-01-16 01:43:48,627 - INFO - step 10804, loss: 2.012056, best loss: 1.435994 2025-01-16 01:43:48,778 - INFO - step 10805, loss: 1.851391, best loss: 1.435994 2025-01-16 01:43:48,928 - INFO - step 10806, loss: 1.763608, best loss: 1.435994 2025-01-16 01:43:49,077 - INFO - step 10807, loss: 2.090125, best loss: 1.435994 2025-01-16 01:43:49,227 - INFO - step 10808, loss: 1.615110, best loss: 1.435994 2025-01-16 01:43:49,377 - INFO - step 10809, loss: 1.494312, best loss: 1.435994 2025-01-16 01:43:49,528 - INFO - step 10810, loss: 1.880138, best loss: 1.435994 2025-01-16 01:43:49,678 - INFO - step 10811, loss: 1.829240, best loss: 1.435994 2025-01-16 01:43:49,828 - INFO - step 10812, loss: 1.888229, best loss: 1.435994 2025-01-16 01:43:49,978 - INFO - step 10813, loss: 1.614581, best loss: 1.435994 2025-01-16 01:43:50,128 - INFO - step 10814, loss: 1.553380, best loss: 1.435994 2025-01-16 01:43:53,637 - INFO - step 10815, loss: 1.432995, best loss: 1.432995 2025-01-16 01:43:53,788 - INFO - step 10816, loss: 1.641933, best loss: 1.432995 2025-01-16 01:43:53,938 - INFO - step 10817, loss: 1.915055, best loss: 1.432995 2025-01-16 01:43:54,088 - INFO - step 10818, loss: 1.958981, best loss: 1.432995 2025-01-16 01:43:54,238 - INFO - step 10819, loss: 1.872704, best loss: 1.432995 2025-01-16 01:43:54,388 - INFO - step 10820, loss: 1.862493, best loss: 1.432995 2025-01-16 01:43:54,538 - INFO - step 10821, loss: 1.776323, best loss: 1.432995 2025-01-16 01:43:54,688 - INFO - step 10822, loss: 1.903843, best loss: 1.432995 2025-01-16 01:43:54,838 - INFO - step 10823, loss: 
1.662378, best loss: 1.432995 2025-01-16 01:43:54,988 - INFO - step 10824, loss: 1.886309, best loss: 1.432995 2025-01-16 01:43:55,139 - INFO - step 10825, loss: 1.765101, best loss: 1.432995 2025-01-16 01:43:55,289 - INFO - step 10826, loss: 1.674994, best loss: 1.432995 2025-01-16 01:43:55,439 - INFO - step 10827, loss: 1.596200, best loss: 1.432995 2025-01-16 01:43:55,589 - INFO - step 10828, loss: 1.733221, best loss: 1.432995 2025-01-16 01:43:55,739 - INFO - step 10829, loss: 1.866921, best loss: 1.432995 2025-01-16 01:43:55,889 - INFO - step 10830, loss: 1.649598, best loss: 1.432995 2025-01-16 01:43:56,039 - INFO - step 10831, loss: 1.676656, best loss: 1.432995 2025-01-16 01:43:56,189 - INFO - step 10832, loss: 1.634878, best loss: 1.432995 2025-01-16 01:43:56,340 - INFO - step 10833, loss: 1.577189, best loss: 1.432995 2025-01-16 01:43:56,490 - INFO - step 10834, loss: 1.675866, best loss: 1.432995 2025-01-16 01:43:56,640 - INFO - step 10835, loss: 1.878983, best loss: 1.432995 2025-01-16 01:43:56,790 - INFO - step 10836, loss: 1.619544, best loss: 1.432995 2025-01-16 01:43:56,941 - INFO - step 10837, loss: 1.748987, best loss: 1.432995 2025-01-16 01:43:57,091 - INFO - step 10838, loss: 1.712654, best loss: 1.432995 2025-01-16 01:43:57,240 - INFO - step 10839, loss: 1.895380, best loss: 1.432995 2025-01-16 01:43:57,390 - INFO - step 10840, loss: 1.691490, best loss: 1.432995 2025-01-16 01:43:57,540 - INFO - step 10841, loss: 1.688239, best loss: 1.432995 2025-01-16 01:43:57,691 - INFO - step 10842, loss: 1.724903, best loss: 1.432995 2025-01-16 01:43:57,841 - INFO - step 10843, loss: 1.823930, best loss: 1.432995 2025-01-16 01:43:57,991 - INFO - step 10844, loss: 2.095655, best loss: 1.432995 2025-01-16 01:43:58,141 - INFO - step 10845, loss: 1.910347, best loss: 1.432995 2025-01-16 01:43:58,291 - INFO - step 10846, loss: 1.844992, best loss: 1.432995 2025-01-16 01:43:58,441 - INFO - step 10847, loss: 1.852062, best loss: 1.432995 2025-01-16 01:43:58,591 - 
INFO - step 10848, loss: 1.884359, best loss: 1.432995 2025-01-16 01:43:58,741 - INFO - step 10849, loss: 1.853543, best loss: 1.432995 2025-01-16 01:43:58,892 - INFO - step 10850, loss: 1.678610, best loss: 1.432995 2025-01-16 01:43:59,042 - INFO - step 10851, loss: 1.746217, best loss: 1.432995 2025-01-16 01:43:59,192 - INFO - step 10852, loss: 1.716148, best loss: 1.432995 2025-01-16 01:43:59,342 - INFO - step 10853, loss: 1.713798, best loss: 1.432995 2025-01-16 01:43:59,493 - INFO - step 10854, loss: 1.779465, best loss: 1.432995 2025-01-16 01:43:59,643 - INFO - step 10855, loss: 1.717304, best loss: 1.432995 2025-01-16 01:43:59,794 - INFO - step 10856, loss: 1.879926, best loss: 1.432995 2025-01-16 01:43:59,944 - INFO - step 10857, loss: 1.442938, best loss: 1.432995 2025-01-16 01:44:00,094 - INFO - step 10858, loss: 1.798910, best loss: 1.432995 2025-01-16 01:44:00,245 - INFO - step 10859, loss: 1.847058, best loss: 1.432995 2025-01-16 01:44:00,394 - INFO - step 10860, loss: 1.665056, best loss: 1.432995 2025-01-16 01:44:00,544 - INFO - step 10861, loss: 1.860763, best loss: 1.432995 2025-01-16 01:44:00,695 - INFO - step 10862, loss: 1.659252, best loss: 1.432995 2025-01-16 01:44:00,845 - INFO - step 10863, loss: 1.688626, best loss: 1.432995 2025-01-16 01:44:00,995 - INFO - step 10864, loss: 1.620719, best loss: 1.432995 2025-01-16 01:44:01,145 - INFO - step 10865, loss: 1.584068, best loss: 1.432995 2025-01-16 01:44:01,295 - INFO - step 10866, loss: 1.632850, best loss: 1.432995 2025-01-16 01:44:01,445 - INFO - step 10867, loss: 1.768469, best loss: 1.432995 2025-01-16 01:44:01,595 - INFO - step 10868, loss: 1.664839, best loss: 1.432995 2025-01-16 01:44:01,745 - INFO - step 10869, loss: 1.651403, best loss: 1.432995 2025-01-16 01:44:01,895 - INFO - step 10870, loss: 1.561751, best loss: 1.432995 2025-01-16 01:44:02,045 - INFO - step 10871, loss: 1.541110, best loss: 1.432995 2025-01-16 01:44:02,195 - INFO - step 10872, loss: 1.803098, best loss: 1.432995 
2025-01-16 01:44:02,346 - INFO - step 10873, loss: 1.627249, best loss: 1.432995 2025-01-16 01:44:02,496 - INFO - step 10874, loss: 1.670730, best loss: 1.432995 2025-01-16 01:44:05,983 - INFO - step 10875, loss: 1.404500, best loss: 1.404500 2025-01-16 01:44:06,133 - INFO - step 10876, loss: 1.411537, best loss: 1.404500 2025-01-16 01:44:09,658 - INFO - step 10877, loss: 1.370657, best loss: 1.370657 2025-01-16 01:44:09,809 - INFO - step 10878, loss: 1.761055, best loss: 1.370657 2025-01-16 01:44:09,959 - INFO - step 10879, loss: 1.713147, best loss: 1.370657 2025-01-16 01:44:10,109 - INFO - step 10880, loss: 1.817278, best loss: 1.370657 2025-01-16 01:44:10,259 - INFO - step 10881, loss: 1.939480, best loss: 1.370657 2025-01-16 01:44:10,409 - INFO - step 10882, loss: 1.794116, best loss: 1.370657 2025-01-16 01:44:10,559 - INFO - step 10883, loss: 1.561094, best loss: 1.370657 2025-01-16 01:44:10,709 - INFO - step 10884, loss: 1.761194, best loss: 1.370657 2025-01-16 01:44:10,859 - INFO - step 10885, loss: 1.866850, best loss: 1.370657 2025-01-16 01:44:11,009 - INFO - step 10886, loss: 1.799364, best loss: 1.370657 2025-01-16 01:44:11,159 - INFO - step 10887, loss: 1.381168, best loss: 1.370657 2025-01-16 01:44:11,309 - INFO - step 10888, loss: 1.572835, best loss: 1.370657 2025-01-16 01:44:11,459 - INFO - step 10889, loss: 1.505239, best loss: 1.370657 2025-01-16 01:44:11,610 - INFO - step 10890, loss: 1.768222, best loss: 1.370657 2025-01-16 01:44:11,760 - INFO - step 10891, loss: 1.779255, best loss: 1.370657 2025-01-16 01:44:11,910 - INFO - step 10892, loss: 1.834178, best loss: 1.370657 2025-01-16 01:44:12,060 - INFO - step 10893, loss: 1.800635, best loss: 1.370657 2025-01-16 01:44:12,210 - INFO - step 10894, loss: 1.795489, best loss: 1.370657 2025-01-16 01:44:12,360 - INFO - step 10895, loss: 1.549764, best loss: 1.370657 2025-01-16 01:44:12,511 - INFO - step 10896, loss: 1.929006, best loss: 1.370657 2025-01-16 01:44:12,661 - INFO - step 10897, loss: 
1.775952, best loss: 1.370657 2025-01-16 01:44:12,811 - INFO - step 10898, loss: 1.981444, best loss: 1.370657 2025-01-16 01:44:12,961 - INFO - step 10899, loss: 1.949340, best loss: 1.370657 2025-01-16 01:44:13,112 - INFO - step 10900, loss: 1.682977, best loss: 1.370657 2025-01-16 01:44:13,262 - INFO - step 10901, loss: 1.742773, best loss: 1.370657 2025-01-16 01:44:13,412 - INFO - step 10902, loss: 1.540645, best loss: 1.370657 2025-01-16 01:44:13,562 - INFO - step 10903, loss: 1.885478, best loss: 1.370657 2025-01-16 01:44:13,712 - INFO - step 10904, loss: 1.980293, best loss: 1.370657 2025-01-16 01:44:13,863 - INFO - step 10905, loss: 1.891770, best loss: 1.370657 2025-01-16 01:44:14,013 - INFO - step 10906, loss: 1.890016, best loss: 1.370657 2025-01-16 01:44:14,163 - INFO - step 10907, loss: 1.823235, best loss: 1.370657 2025-01-16 01:44:14,313 - INFO - step 10908, loss: 1.820815, best loss: 1.370657 2025-01-16 01:44:14,463 - INFO - step 10909, loss: 1.713647, best loss: 1.370657 2025-01-16 01:44:14,614 - INFO - step 10910, loss: 1.637406, best loss: 1.370657 2025-01-16 01:44:14,764 - INFO - step 10911, loss: 2.028477, best loss: 1.370657 2025-01-16 01:44:14,914 - INFO - step 10912, loss: 1.497828, best loss: 1.370657 2025-01-16 01:44:15,064 - INFO - step 10913, loss: 1.603449, best loss: 1.370657 2025-01-16 01:44:15,214 - INFO - step 10914, loss: 1.777523, best loss: 1.370657 2025-01-16 01:44:15,365 - INFO - step 10915, loss: 1.884473, best loss: 1.370657 2025-01-16 01:44:15,515 - INFO - step 10916, loss: 1.758575, best loss: 1.370657 2025-01-16 01:44:15,665 - INFO - step 10917, loss: 1.638735, best loss: 1.370657 2025-01-16 01:44:15,815 - INFO - step 10918, loss: 1.750999, best loss: 1.370657 2025-01-16 01:44:15,966 - INFO - step 10919, loss: 1.795226, best loss: 1.370657 2025-01-16 01:44:16,116 - INFO - step 10920, loss: 1.593146, best loss: 1.370657 2025-01-16 01:44:16,267 - INFO - step 10921, loss: 1.574418, best loss: 1.370657 2025-01-16 01:44:16,417 - 
INFO - step 10922, loss: 1.808442, best loss: 1.370657 2025-01-16 01:44:16,567 - INFO - step 10923, loss: 1.808554, best loss: 1.370657 2025-01-16 01:44:16,717 - INFO - step 10924, loss: 1.653458, best loss: 1.370657 2025-01-16 01:44:16,868 - INFO - step 10925, loss: 1.526258, best loss: 1.370657 2025-01-16 01:44:17,018 - INFO - step 10926, loss: 1.868971, best loss: 1.370657 2025-01-16 01:44:17,168 - INFO - step 10927, loss: 1.941417, best loss: 1.370657 2025-01-16 01:44:17,318 - INFO - step 10928, loss: 1.793009, best loss: 1.370657 2025-01-16 01:44:17,469 - INFO - step 10929, loss: 1.653275, best loss: 1.370657 2025-01-16 01:44:17,619 - INFO - step 10930, loss: 1.958575, best loss: 1.370657 2025-01-16 01:44:17,769 - INFO - step 10931, loss: 1.833868, best loss: 1.370657 2025-01-16 01:44:17,920 - INFO - step 10932, loss: 1.842595, best loss: 1.370657 2025-01-16 01:44:18,070 - INFO - step 10933, loss: 1.722823, best loss: 1.370657 2025-01-16 01:44:18,220 - INFO - step 10934, loss: 1.694110, best loss: 1.370657 2025-01-16 01:44:18,370 - INFO - step 10935, loss: 1.642838, best loss: 1.370657 2025-01-16 01:44:18,521 - INFO - step 10936, loss: 2.017353, best loss: 1.370657 2025-01-16 01:44:18,671 - INFO - step 10937, loss: 1.826427, best loss: 1.370657 2025-01-16 01:44:18,821 - INFO - step 10938, loss: 1.764354, best loss: 1.370657 2025-01-16 01:44:18,971 - INFO - step 10939, loss: 1.510696, best loss: 1.370657 2025-01-16 01:44:19,121 - INFO - step 10940, loss: 1.614447, best loss: 1.370657 2025-01-16 01:44:19,272 - INFO - step 10941, loss: 1.823180, best loss: 1.370657 2025-01-16 01:44:19,422 - INFO - step 10942, loss: 1.829465, best loss: 1.370657 2025-01-16 01:44:19,573 - INFO - step 10943, loss: 1.763608, best loss: 1.370657 2025-01-16 01:44:19,723 - INFO - step 10944, loss: 1.807761, best loss: 1.370657 2025-01-16 01:44:19,873 - INFO - step 10945, loss: 1.693278, best loss: 1.370657 2025-01-16 01:44:20,023 - INFO - step 10946, loss: 1.810139, best loss: 1.370657 
2025-01-16 01:44:20,174 - INFO - step 10947, loss: 1.743072, best loss: 1.370657 2025-01-16 01:44:20,324 - INFO - step 10948, loss: 1.475526, best loss: 1.370657 2025-01-16 01:44:20,474 - INFO - step 10949, loss: 1.777838, best loss: 1.370657 2025-01-16 01:44:20,624 - INFO - step 10950, loss: 1.839120, best loss: 1.370657 2025-01-16 01:44:20,774 - INFO - step 10951, loss: 1.904841, best loss: 1.370657 2025-01-16 01:44:20,925 - INFO - step 10952, loss: 1.907046, best loss: 1.370657 2025-01-16 01:44:21,075 - INFO - step 10953, loss: 1.834744, best loss: 1.370657 2025-01-16 01:44:21,225 - INFO - step 10954, loss: 1.807474, best loss: 1.370657 2025-01-16 01:44:21,375 - INFO - step 10955, loss: 1.519462, best loss: 1.370657 2025-01-16 01:44:21,525 - INFO - step 10956, loss: 1.830539, best loss: 1.370657 2025-01-16 01:44:21,675 - INFO - step 10957, loss: 1.551585, best loss: 1.370657 2025-01-16 01:44:21,825 - INFO - step 10958, loss: 1.708164, best loss: 1.370657 2025-01-16 01:44:21,975 - INFO - step 10959, loss: 1.695963, best loss: 1.370657 2025-01-16 01:44:22,125 - INFO - step 10960, loss: 1.664552, best loss: 1.370657 2025-01-16 01:44:22,276 - INFO - step 10961, loss: 1.775138, best loss: 1.370657 2025-01-16 01:44:22,426 - INFO - step 10962, loss: 1.886815, best loss: 1.370657 2025-01-16 01:44:22,576 - INFO - step 10963, loss: 1.950657, best loss: 1.370657 2025-01-16 01:44:22,726 - INFO - step 10964, loss: 1.871027, best loss: 1.370657 2025-01-16 01:44:22,877 - INFO - step 10965, loss: 1.898072, best loss: 1.370657 2025-01-16 01:44:23,027 - INFO - step 10966, loss: 1.700073, best loss: 1.370657 2025-01-16 01:44:23,177 - INFO - step 10967, loss: 1.828251, best loss: 1.370657 2025-01-16 01:44:23,327 - INFO - step 10968, loss: 1.762835, best loss: 1.370657 2025-01-16 01:44:23,478 - INFO - step 10969, loss: 1.633030, best loss: 1.370657 2025-01-16 01:44:23,628 - INFO - step 10970, loss: 1.880080, best loss: 1.370657 2025-01-16 01:44:23,778 - INFO - step 10971, loss: 
1.709065, best loss: 1.370657 2025-01-16 01:44:23,928 - INFO - step 10972, loss: 1.607870, best loss: 1.370657 2025-01-16 01:44:24,078 - INFO - step 10973, loss: 1.626511, best loss: 1.370657 2025-01-16 01:44:24,228 - INFO - step 10974, loss: 1.640106, best loss: 1.370657 2025-01-16 01:44:24,378 - INFO - step 10975, loss: 1.761696, best loss: 1.370657 2025-01-16 01:44:24,529 - INFO - step 10976, loss: 1.550962, best loss: 1.370657 2025-01-16 01:44:24,679 - INFO - step 10977, loss: 1.574905, best loss: 1.370657 2025-01-16 01:44:24,829 - INFO - step 10978, loss: 1.723069, best loss: 1.370657 2025-01-16 01:44:24,979 - INFO - step 10979, loss: 1.794649, best loss: 1.370657 2025-01-16 01:44:25,129 - INFO - step 10980, loss: 1.657333, best loss: 1.370657 2025-01-16 01:44:25,280 - INFO - step 10981, loss: 1.832329, best loss: 1.370657 2025-01-16 01:44:25,430 - INFO - step 10982, loss: 1.732420, best loss: 1.370657 2025-01-16 01:44:25,580 - INFO - step 10983, loss: 1.916562, best loss: 1.370657 2025-01-16 01:44:25,730 - INFO - step 10984, loss: 1.914963, best loss: 1.370657 2025-01-16 01:44:25,880 - INFO - step 10985, loss: 1.941717, best loss: 1.370657 2025-01-16 01:44:26,031 - INFO - step 10986, loss: 1.897730, best loss: 1.370657 2025-01-16 01:44:26,181 - INFO - step 10987, loss: 1.769880, best loss: 1.370657 2025-01-16 01:44:26,331 - INFO - step 10988, loss: 1.781322, best loss: 1.370657 2025-01-16 01:44:26,481 - INFO - step 10989, loss: 1.850117, best loss: 1.370657 2025-01-16 01:44:26,631 - INFO - step 10990, loss: 1.813947, best loss: 1.370657 2025-01-16 01:44:26,782 - INFO - step 10991, loss: 1.894716, best loss: 1.370657 2025-01-16 01:44:26,932 - INFO - step 10992, loss: 1.825279, best loss: 1.370657 2025-01-16 01:44:27,082 - INFO - step 10993, loss: 2.025927, best loss: 1.370657 2025-01-16 01:44:27,232 - INFO - step 10994, loss: 1.727087, best loss: 1.370657 2025-01-16 01:44:27,382 - INFO - step 10995, loss: 1.754769, best loss: 1.370657 2025-01-16 01:44:27,532 - 
INFO - step 10996, loss: 1.788950, best loss: 1.370657 2025-01-16 01:44:27,682 - INFO - step 10997, loss: 1.700880, best loss: 1.370657 2025-01-16 01:44:27,832 - INFO - step 10998, loss: 1.694830, best loss: 1.370657 2025-01-16 01:44:27,983 - INFO - step 10999, loss: 1.935634, best loss: 1.370657 2025-01-16 01:44:28,133 - INFO - step 11000, loss: 1.785437, best loss: 1.370657 2025-01-16 01:44:28,283 - INFO - step 11001, loss: 1.958757, best loss: 1.370657 2025-01-16 01:44:28,433 - INFO - step 11002, loss: 1.765275, best loss: 1.370657 2025-01-16 01:44:28,583 - INFO - step 11003, loss: 1.764765, best loss: 1.370657 2025-01-16 01:44:28,733 - INFO - step 11004, loss: 1.937592, best loss: 1.370657 2025-01-16 01:44:28,884 - INFO - step 11005, loss: 1.777067, best loss: 1.370657 2025-01-16 01:44:29,034 - INFO - step 11006, loss: 1.880606, best loss: 1.370657 2025-01-16 01:44:29,184 - INFO - step 11007, loss: 1.826585, best loss: 1.370657 2025-01-16 01:44:29,334 - INFO - step 11008, loss: 1.678396, best loss: 1.370657 2025-01-16 01:44:29,484 - INFO - step 11009, loss: 1.799713, best loss: 1.370657 2025-01-16 01:44:29,634 - INFO - step 11010, loss: 1.869458, best loss: 1.370657 2025-01-16 01:44:29,784 - INFO - step 11011, loss: 1.759644, best loss: 1.370657 2025-01-16 01:44:29,934 - INFO - step 11012, loss: 1.816630, best loss: 1.370657 2025-01-16 01:44:30,084 - INFO - step 11013, loss: 1.631332, best loss: 1.370657 2025-01-16 01:44:30,234 - INFO - step 11014, loss: 1.541859, best loss: 1.370657 2025-01-16 01:44:30,384 - INFO - step 11015, loss: 1.655621, best loss: 1.370657 2025-01-16 01:44:30,534 - INFO - step 11016, loss: 1.805493, best loss: 1.370657 2025-01-16 01:44:30,685 - INFO - step 11017, loss: 2.041803, best loss: 1.370657 2025-01-16 01:44:30,835 - INFO - step 11018, loss: 1.733496, best loss: 1.370657 2025-01-16 01:44:30,985 - INFO - step 11019, loss: 1.456474, best loss: 1.370657 2025-01-16 01:44:31,135 - INFO - step 11020, loss: 1.802662, best loss: 1.370657 
2025-01-16 01:44:31,285 - INFO - step 11021, loss: 1.785862, best loss: 1.370657 2025-01-16 01:44:31,435 - INFO - step 11022, loss: 2.090906, best loss: 1.370657 2025-01-16 01:44:31,585 - INFO - step 11023, loss: 1.502561, best loss: 1.370657 2025-01-16 01:44:31,735 - INFO - step 11024, loss: 1.861279, best loss: 1.370657 2025-01-16 01:44:31,885 - INFO - step 11025, loss: 1.635636, best loss: 1.370657 2025-01-16 01:44:32,036 - INFO - step 11026, loss: 1.557986, best loss: 1.370657 2025-01-16 01:44:32,186 - INFO - step 11027, loss: 1.772896, best loss: 1.370657 2025-01-16 01:44:32,336 - INFO - step 11028, loss: 1.716064, best loss: 1.370657 2025-01-16 01:44:32,486 - INFO - step 11029, loss: 1.805404, best loss: 1.370657 2025-01-16 01:44:32,636 - INFO - step 11030, loss: 1.862926, best loss: 1.370657 2025-01-16 01:44:32,786 - INFO - step 11031, loss: 1.795993, best loss: 1.370657 2025-01-16 01:44:32,936 - INFO - step 11032, loss: 1.947713, best loss: 1.370657 2025-01-16 01:44:33,086 - INFO - step 11033, loss: 1.686165, best loss: 1.370657 2025-01-16 01:44:33,236 - INFO - step 11034, loss: 1.374945, best loss: 1.370657 2025-01-16 01:44:33,387 - INFO - step 11035, loss: 1.728911, best loss: 1.370657 2025-01-16 01:44:33,537 - INFO - step 11036, loss: 2.034288, best loss: 1.370657 2025-01-16 01:44:33,687 - INFO - step 11037, loss: 1.816460, best loss: 1.370657 2025-01-16 01:44:33,837 - INFO - step 11038, loss: 1.571823, best loss: 1.370657 2025-01-16 01:44:33,987 - INFO - step 11039, loss: 1.742823, best loss: 1.370657 2025-01-16 01:44:34,137 - INFO - step 11040, loss: 1.877372, best loss: 1.370657 2025-01-16 01:44:34,287 - INFO - step 11041, loss: 1.913924, best loss: 1.370657 2025-01-16 01:44:34,437 - INFO - step 11042, loss: 1.800085, best loss: 1.370657 2025-01-16 01:44:34,587 - INFO - step 11043, loss: 1.682167, best loss: 1.370657 2025-01-16 01:44:34,738 - INFO - step 11044, loss: 1.886342, best loss: 1.370657 2025-01-16 01:44:34,888 - INFO - step 11045, loss: 
1.906089, best loss: 1.370657 2025-01-16 01:44:35,038 - INFO - step 11046, loss: 1.662169, best loss: 1.370657 2025-01-16 01:44:35,188 - INFO - step 11047, loss: 1.793840, best loss: 1.370657 2025-01-16 01:44:35,338 - INFO - step 11048, loss: 1.784314, best loss: 1.370657 2025-01-16 01:44:35,489 - INFO - step 11049, loss: 1.826200, best loss: 1.370657 2025-01-16 01:44:35,639 - INFO - step 11050, loss: 1.666205, best loss: 1.370657 2025-01-16 01:44:35,790 - INFO - step 11051, loss: 1.684814, best loss: 1.370657 2025-01-16 01:44:35,940 - INFO - step 11052, loss: 1.593529, best loss: 1.370657 2025-01-16 01:44:36,090 - INFO - step 11053, loss: 1.477422, best loss: 1.370657 2025-01-16 01:44:36,240 - INFO - step 11054, loss: 1.756177, best loss: 1.370657 2025-01-16 01:44:36,390 - INFO - step 11055, loss: 1.718239, best loss: 1.370657 2025-01-16 01:44:36,540 - INFO - step 11056, loss: 2.014443, best loss: 1.370657 2025-01-16 01:44:36,690 - INFO - step 11057, loss: 1.808331, best loss: 1.370657 2025-01-16 01:44:36,840 - INFO - step 11058, loss: 1.806354, best loss: 1.370657 2025-01-16 01:44:36,991 - INFO - step 11059, loss: 1.878016, best loss: 1.370657 2025-01-16 01:44:37,141 - INFO - step 11060, loss: 1.647197, best loss: 1.370657 2025-01-16 01:44:37,291 - INFO - step 11061, loss: 1.582176, best loss: 1.370657 2025-01-16 01:44:37,441 - INFO - step 11062, loss: 1.668173, best loss: 1.370657 2025-01-16 01:44:37,591 - INFO - step 11063, loss: 1.853883, best loss: 1.370657 2025-01-16 01:44:37,741 - INFO - step 11064, loss: 1.830299, best loss: 1.370657 2025-01-16 01:44:37,892 - INFO - step 11065, loss: 1.738700, best loss: 1.370657 2025-01-16 01:44:38,042 - INFO - step 11066, loss: 1.981136, best loss: 1.370657 2025-01-16 01:44:38,192 - INFO - step 11067, loss: 2.020975, best loss: 1.370657 2025-01-16 01:44:38,342 - INFO - step 11068, loss: 1.926762, best loss: 1.370657 2025-01-16 01:44:38,492 - INFO - step 11069, loss: 1.863296, best loss: 1.370657 2025-01-16 01:44:38,642 - 
INFO - step 11070, loss: 2.035623, best loss: 1.370657 2025-01-16 01:44:38,793 - INFO - step 11071, loss: 1.718217, best loss: 1.370657 2025-01-16 01:44:38,942 - INFO - step 11072, loss: 1.912308, best loss: 1.370657 2025-01-16 01:44:39,092 - INFO - step 11073, loss: 1.941556, best loss: 1.370657 2025-01-16 01:44:39,243 - INFO - step 11074, loss: 1.818611, best loss: 1.370657 2025-01-16 01:44:39,393 - INFO - step 11075, loss: 1.778995, best loss: 1.370657 2025-01-16 01:44:39,543 - INFO - step 11076, loss: 1.899225, best loss: 1.370657 2025-01-16 01:44:39,693 - INFO - step 11077, loss: 1.739803, best loss: 1.370657 2025-01-16 01:44:42,930 - INFO - step 11078, loss: 1.313118, best loss: 1.313118 2025-01-16 01:44:43,093 - INFO - step 11079, loss: 1.879521, best loss: 1.313118 2025-01-16 01:44:43,245 - INFO - step 11080, loss: 1.782554, best loss: 1.313118 2025-01-16 01:44:43,395 - INFO - step 11081, loss: 1.867423, best loss: 1.313118 2025-01-16 01:44:43,546 - INFO - step 11082, loss: 1.968035, best loss: 1.313118 2025-01-16 01:44:43,696 - INFO - step 11083, loss: 1.783710, best loss: 1.313118 2025-01-16 01:44:43,846 - INFO - step 11084, loss: 1.795419, best loss: 1.313118 2025-01-16 01:44:43,996 - INFO - step 11085, loss: 1.840347, best loss: 1.313118 2025-01-16 01:44:44,146 - INFO - step 11086, loss: 1.845283, best loss: 1.313118 2025-01-16 01:44:44,296 - INFO - step 11087, loss: 1.734090, best loss: 1.313118 2025-01-16 01:44:44,446 - INFO - step 11088, loss: 1.722879, best loss: 1.313118 2025-01-16 01:44:44,596 - INFO - step 11089, loss: 1.596519, best loss: 1.313118 2025-01-16 01:44:44,746 - INFO - step 11090, loss: 1.699944, best loss: 1.313118 2025-01-16 01:44:44,896 - INFO - step 11091, loss: 1.776531, best loss: 1.313118 2025-01-16 01:44:45,047 - INFO - step 11092, loss: 1.785286, best loss: 1.313118 2025-01-16 01:44:45,197 - INFO - step 11093, loss: 1.902643, best loss: 1.313118 2025-01-16 01:44:45,347 - INFO - step 11094, loss: 1.786026, best loss: 1.313118 
2025-01-16 01:44:45,497 - INFO - step 11095, loss: 1.639119, best loss: 1.313118 2025-01-16 01:44:45,647 - INFO - step 11096, loss: 1.782763, best loss: 1.313118 2025-01-16 01:44:45,798 - INFO - step 11097, loss: 1.787854, best loss: 1.313118 2025-01-16 01:44:45,948 - INFO - step 11098, loss: 1.703818, best loss: 1.313118 2025-01-16 01:44:46,098 - INFO - step 11099, loss: 1.800576, best loss: 1.313118 2025-01-16 01:44:46,248 - INFO - step 11100, loss: 1.713268, best loss: 1.313118 2025-01-16 01:44:46,398 - INFO - step 11101, loss: 1.651748, best loss: 1.313118 2025-01-16 01:44:46,548 - INFO - step 11102, loss: 1.733554, best loss: 1.313118 2025-01-16 01:44:46,698 - INFO - step 11103, loss: 1.714487, best loss: 1.313118 2025-01-16 01:44:46,848 - INFO - step 11104, loss: 1.814921, best loss: 1.313118 2025-01-16 01:44:46,998 - INFO - step 11105, loss: 1.897931, best loss: 1.313118 2025-01-16 01:44:47,148 - INFO - step 11106, loss: 1.890241, best loss: 1.313118 2025-01-16 01:44:47,299 - INFO - step 11107, loss: 1.873668, best loss: 1.313118 2025-01-16 01:44:47,450 - INFO - step 11108, loss: 1.725006, best loss: 1.313118 2025-01-16 01:44:47,600 - INFO - step 11109, loss: 1.705453, best loss: 1.313118 2025-01-16 01:44:47,750 - INFO - step 11110, loss: 1.854749, best loss: 1.313118 2025-01-16 01:44:47,900 - INFO - step 11111, loss: 1.934445, best loss: 1.313118 2025-01-16 01:44:48,050 - INFO - step 11112, loss: 1.907124, best loss: 1.313118 2025-01-16 01:44:48,200 - INFO - step 11113, loss: 2.071814, best loss: 1.313118 2025-01-16 01:44:48,350 - INFO - step 11114, loss: 2.046287, best loss: 1.313118 2025-01-16 01:44:48,500 - INFO - step 11115, loss: 1.707414, best loss: 1.313118 2025-01-16 01:44:48,650 - INFO - step 11116, loss: 1.935342, best loss: 1.313118 2025-01-16 01:44:48,800 - INFO - step 11117, loss: 1.696839, best loss: 1.313118 2025-01-16 01:44:48,950 - INFO - step 11118, loss: 1.523203, best loss: 1.313118 2025-01-16 01:44:49,100 - INFO - step 11119, loss: 
1.881968, best loss: 1.313118 2025-01-16 01:44:49,250 - INFO - step 11120, loss: 1.817134, best loss: 1.313118 2025-01-16 01:44:49,400 - INFO - step 11121, loss: 1.573324, best loss: 1.313118 2025-01-16 01:44:49,550 - INFO - step 11122, loss: 1.637820, best loss: 1.313118 2025-01-16 01:44:49,701 - INFO - step 11123, loss: 1.867099, best loss: 1.313118 2025-01-16 01:44:49,851 - INFO - step 11124, loss: 1.856173, best loss: 1.313118 2025-01-16 01:44:50,001 - INFO - step 11125, loss: 1.770726, best loss: 1.313118 2025-01-16 01:44:50,151 - INFO - step 11126, loss: 1.867810, best loss: 1.313118 2025-01-16 01:44:50,302 - INFO - step 11127, loss: 1.708709, best loss: 1.313118 2025-01-16 01:44:50,451 - INFO - step 11128, loss: 1.514105, best loss: 1.313118 2025-01-16 01:44:50,601 - INFO - step 11129, loss: 1.841323, best loss: 1.313118 2025-01-16 01:44:50,752 - INFO - step 11130, loss: 2.006307, best loss: 1.313118 2025-01-16 01:44:50,902 - INFO - step 11131, loss: 2.045439, best loss: 1.313118 2025-01-16 01:44:51,051 - INFO - step 11132, loss: 1.812788, best loss: 1.313118 2025-01-16 01:44:51,201 - INFO - step 11133, loss: 1.741383, best loss: 1.313118 2025-01-16 01:44:51,351 - INFO - step 11134, loss: 1.900065, best loss: 1.313118 2025-01-16 01:44:51,502 - INFO - step 11135, loss: 1.742251, best loss: 1.313118 2025-01-16 01:44:51,652 - INFO - step 11136, loss: 1.686351, best loss: 1.313118 2025-01-16 01:44:51,802 - INFO - step 11137, loss: 1.849653, best loss: 1.313118 2025-01-16 01:44:51,952 - INFO - step 11138, loss: 1.527985, best loss: 1.313118 2025-01-16 01:44:52,103 - INFO - step 11139, loss: 1.417707, best loss: 1.313118 2025-01-16 01:44:52,253 - INFO - step 11140, loss: 1.759385, best loss: 1.313118 2025-01-16 01:44:52,403 - INFO - step 11141, loss: 1.808448, best loss: 1.313118 2025-01-16 01:44:52,553 - INFO - step 11142, loss: 1.814127, best loss: 1.313118 2025-01-16 01:44:52,703 - INFO - step 11143, loss: 1.558480, best loss: 1.313118 2025-01-16 01:44:52,853 - 
INFO - step 11144, loss: 1.483007, best loss: 1.313118 2025-01-16 01:44:53,003 - INFO - step 11145, loss: 1.418580, best loss: 1.313118 2025-01-16 01:44:53,153 - INFO - step 11146, loss: 1.601970, best loss: 1.313118 2025-01-16 01:44:53,303 - INFO - step 11147, loss: 1.763693, best loss: 1.313118 2025-01-16 01:44:53,453 - INFO - step 11148, loss: 1.795240, best loss: 1.313118 2025-01-16 01:44:53,603 - INFO - step 11149, loss: 1.783761, best loss: 1.313118 2025-01-16 01:44:53,754 - INFO - step 11150, loss: 1.725904, best loss: 1.313118 2025-01-16 01:44:53,904 - INFO - step 11151, loss: 1.711628, best loss: 1.313118 2025-01-16 01:44:54,054 - INFO - step 11152, loss: 1.763081, best loss: 1.313118 2025-01-16 01:44:54,204 - INFO - step 11153, loss: 1.534036, best loss: 1.313118 2025-01-16 01:44:54,355 - INFO - step 11154, loss: 1.714523, best loss: 1.313118 2025-01-16 01:44:54,505 - INFO - step 11155, loss: 1.718989, best loss: 1.313118 2025-01-16 01:44:54,655 - INFO - step 11156, loss: 1.540222, best loss: 1.313118 2025-01-16 01:44:54,805 - INFO - step 11157, loss: 1.517582, best loss: 1.313118 2025-01-16 01:44:54,955 - INFO - step 11158, loss: 1.520179, best loss: 1.313118 2025-01-16 01:44:55,105 - INFO - step 11159, loss: 1.714485, best loss: 1.313118 2025-01-16 01:44:55,255 - INFO - step 11160, loss: 1.601715, best loss: 1.313118 2025-01-16 01:44:55,405 - INFO - step 11161, loss: 1.591740, best loss: 1.313118 2025-01-16 01:44:55,556 - INFO - step 11162, loss: 1.654805, best loss: 1.313118 2025-01-16 01:44:55,706 - INFO - step 11163, loss: 1.571097, best loss: 1.313118 2025-01-16 01:44:55,856 - INFO - step 11164, loss: 1.664057, best loss: 1.313118 2025-01-16 01:44:56,005 - INFO - step 11165, loss: 1.799340, best loss: 1.313118 2025-01-16 01:44:56,155 - INFO - step 11166, loss: 1.573025, best loss: 1.313118 2025-01-16 01:44:56,306 - INFO - step 11167, loss: 1.709004, best loss: 1.313118 2025-01-16 01:44:56,456 - INFO - step 11168, loss: 1.639143, best loss: 1.313118 
2025-01-16 01:44:56,606 - INFO - step 11169, loss: 1.811880, best loss: 1.313118 2025-01-16 01:44:56,756 - INFO - step 11170, loss: 1.618660, best loss: 1.313118 2025-01-16 01:44:56,906 - INFO - step 11171, loss: 1.586151, best loss: 1.313118 2025-01-16 01:44:57,056 - INFO - step 11172, loss: 1.673846, best loss: 1.313118 2025-01-16 01:44:57,207 - INFO - step 11173, loss: 1.734103, best loss: 1.313118 2025-01-16 01:44:57,357 - INFO - step 11174, loss: 1.917587, best loss: 1.313118 2025-01-16 01:44:57,507 - INFO - step 11175, loss: 1.903538, best loss: 1.313118 2025-01-16 01:44:57,657 - INFO - step 11176, loss: 1.813205, best loss: 1.313118 2025-01-16 01:44:57,807 - INFO - step 11177, loss: 1.782711, best loss: 1.313118 2025-01-16 01:44:57,957 - INFO - step 11178, loss: 1.945803, best loss: 1.313118 2025-01-16 01:44:58,107 - INFO - step 11179, loss: 1.754684, best loss: 1.313118 2025-01-16 01:44:58,257 - INFO - step 11180, loss: 1.657874, best loss: 1.313118 2025-01-16 01:44:58,408 - INFO - step 11181, loss: 1.696713, best loss: 1.313118 2025-01-16 01:44:58,558 - INFO - step 11182, loss: 1.690649, best loss: 1.313118 2025-01-16 01:44:58,708 - INFO - step 11183, loss: 1.635858, best loss: 1.313118 2025-01-16 01:44:58,858 - INFO - step 11184, loss: 1.729897, best loss: 1.313118 2025-01-16 01:44:59,008 - INFO - step 11185, loss: 1.685555, best loss: 1.313118 2025-01-16 01:44:59,158 - INFO - step 11186, loss: 1.796069, best loss: 1.313118 2025-01-16 01:44:59,308 - INFO - step 11187, loss: 1.450032, best loss: 1.313118 2025-01-16 01:44:59,458 - INFO - step 11188, loss: 1.810167, best loss: 1.313118 2025-01-16 01:44:59,609 - INFO - step 11189, loss: 1.764246, best loss: 1.313118 2025-01-16 01:44:59,759 - INFO - step 11190, loss: 1.711408, best loss: 1.313118 2025-01-16 01:44:59,909 - INFO - step 11191, loss: 1.814384, best loss: 1.313118 2025-01-16 01:45:00,059 - INFO - step 11192, loss: 1.655671, best loss: 1.313118 2025-01-16 01:45:00,210 - INFO - step 11193, loss: 
1.693109, best loss: 1.313118 2025-01-16 01:45:00,360 - INFO - step 11194, loss: 1.625921, best loss: 1.313118 2025-01-16 01:45:00,510 - INFO - step 11195, loss: 1.602665, best loss: 1.313118 2025-01-16 01:45:00,660 - INFO - step 11196, loss: 1.552575, best loss: 1.313118 2025-01-16 01:45:00,810 - INFO - step 11197, loss: 1.752979, best loss: 1.313118 2025-01-16 01:45:00,960 - INFO - step 11198, loss: 1.637465, best loss: 1.313118 2025-01-16 01:45:01,110 - INFO - step 11199, loss: 1.617621, best loss: 1.313118 2025-01-16 01:45:01,260 - INFO - step 11200, loss: 1.470811, best loss: 1.313118 2025-01-16 01:45:01,410 - INFO - step 11201, loss: 1.527286, best loss: 1.313118 2025-01-16 01:45:01,561 - INFO - step 11202, loss: 1.745581, best loss: 1.313118 2025-01-16 01:45:01,711 - INFO - step 11203, loss: 1.579475, best loss: 1.313118 2025-01-16 01:45:01,861 - INFO - step 11204, loss: 1.635097, best loss: 1.313118 2025-01-16 01:45:02,011 - INFO - step 11205, loss: 1.352108, best loss: 1.313118 2025-01-16 01:45:02,160 - INFO - step 11206, loss: 1.405922, best loss: 1.313118 2025-01-16 01:45:02,310 - INFO - step 11207, loss: 1.342339, best loss: 1.313118 2025-01-16 01:45:02,460 - INFO - step 11208, loss: 1.691967, best loss: 1.313118 2025-01-16 01:45:02,611 - INFO - step 11209, loss: 1.684908, best loss: 1.313118 2025-01-16 01:45:02,761 - INFO - step 11210, loss: 1.740036, best loss: 1.313118 2025-01-16 01:45:02,911 - INFO - step 11211, loss: 1.763867, best loss: 1.313118 2025-01-16 01:45:03,061 - INFO - step 11212, loss: 1.736238, best loss: 1.313118 2025-01-16 01:45:03,211 - INFO - step 11213, loss: 1.526777, best loss: 1.313118 2025-01-16 01:45:03,361 - INFO - step 11214, loss: 1.583683, best loss: 1.313118 2025-01-16 01:45:03,511 - INFO - step 11215, loss: 1.792201, best loss: 1.313118 2025-01-16 01:45:03,661 - INFO - step 11216, loss: 1.741939, best loss: 1.313118 2025-01-16 01:45:07,139 - INFO - step 11217, loss: 1.272360, best loss: 1.272360 2025-01-16 01:45:07,289 - 
INFO - step 11218, loss: 1.503223, best loss: 1.272360 2025-01-16 01:45:07,440 - INFO - step 11219, loss: 1.366979, best loss: 1.272360 2025-01-16 01:45:07,590 - INFO - step 11220, loss: 1.659651, best loss: 1.272360 2025-01-16 01:45:07,740 - INFO - step 11221, loss: 1.722230, best loss: 1.272360 2025-01-16 01:45:07,890 - INFO - step 11222, loss: 1.683408, best loss: 1.272360 2025-01-16 01:45:08,040 - INFO - step 11223, loss: 1.760428, best loss: 1.272360 2025-01-16 01:45:08,190 - INFO - step 11224, loss: 1.731219, best loss: 1.272360 2025-01-16 01:45:08,340 - INFO - step 11225, loss: 1.511771, best loss: 1.272360 2025-01-16 01:45:08,490 - INFO - step 11226, loss: 1.794511, best loss: 1.272360 2025-01-16 01:45:08,640 - INFO - step 11227, loss: 1.691822, best loss: 1.272360 2025-01-16 01:45:08,791 - INFO - step 11228, loss: 1.876498, best loss: 1.272360 2025-01-16 01:45:08,941 - INFO - step 11229, loss: 1.839295, best loss: 1.272360 2025-01-16 01:45:09,091 - INFO - step 11230, loss: 1.641743, best loss: 1.272360 2025-01-16 01:45:09,241 - INFO - step 11231, loss: 1.644303, best loss: 1.272360 2025-01-16 01:45:09,391 - INFO - step 11232, loss: 1.511993, best loss: 1.272360 2025-01-16 01:45:09,541 - INFO - step 11233, loss: 1.799985, best loss: 1.272360 2025-01-16 01:45:09,691 - INFO - step 11234, loss: 1.849239, best loss: 1.272360 2025-01-16 01:45:09,841 - INFO - step 11235, loss: 1.926027, best loss: 1.272360 2025-01-16 01:45:09,991 - INFO - step 11236, loss: 1.786516, best loss: 1.272360 2025-01-16 01:45:10,141 - INFO - step 11237, loss: 1.649151, best loss: 1.272360 2025-01-16 01:45:10,291 - INFO - step 11238, loss: 1.766207, best loss: 1.272360 2025-01-16 01:45:10,442 - INFO - step 11239, loss: 1.699099, best loss: 1.272360 2025-01-16 01:45:10,592 - INFO - step 11240, loss: 1.612250, best loss: 1.272360 2025-01-16 01:45:10,742 - INFO - step 11241, loss: 1.929349, best loss: 1.272360 2025-01-16 01:45:10,893 - INFO - step 11242, loss: 1.463739, best loss: 1.272360 
2025-01-16 01:45:11,043 - INFO - step 11243, loss: 1.572234, best loss: 1.272360 2025-01-16 01:45:11,193 - INFO - step 11244, loss: 1.767229, best loss: 1.272360 2025-01-16 01:45:11,343 - INFO - step 11245, loss: 1.839326, best loss: 1.272360 2025-01-16 01:45:11,493 - INFO - step 11246, loss: 1.713578, best loss: 1.272360 2025-01-16 01:45:11,643 - INFO - step 11247, loss: 1.529353, best loss: 1.272360 2025-01-16 01:45:11,793 - INFO - step 11248, loss: 1.676988, best loss: 1.272360 2025-01-16 01:45:11,943 - INFO - step 11249, loss: 1.756406, best loss: 1.272360 2025-01-16 01:45:12,093 - INFO - step 11250, loss: 1.494524, best loss: 1.272360 2025-01-16 01:45:12,243 - INFO - step 11251, loss: 1.498093, best loss: 1.272360 2025-01-16 01:45:12,393 - INFO - step 11252, loss: 1.701336, best loss: 1.272360 2025-01-16 01:45:12,544 - INFO - step 11253, loss: 1.748940, best loss: 1.272360 2025-01-16 01:45:12,694 - INFO - step 11254, loss: 1.529590, best loss: 1.272360 2025-01-16 01:45:12,844 - INFO - step 11255, loss: 1.472196, best loss: 1.272360 2025-01-16 01:45:12,994 - INFO - step 11256, loss: 1.785051, best loss: 1.272360 2025-01-16 01:45:13,144 - INFO - step 11257, loss: 1.780445, best loss: 1.272360 2025-01-16 01:45:13,294 - INFO - step 11258, loss: 1.714484, best loss: 1.272360 2025-01-16 01:45:13,444 - INFO - step 11259, loss: 1.618272, best loss: 1.272360 2025-01-16 01:45:13,594 - INFO - step 11260, loss: 1.830215, best loss: 1.272360 2025-01-16 01:45:13,745 - INFO - step 11261, loss: 1.797971, best loss: 1.272360 2025-01-16 01:45:13,895 - INFO - step 11262, loss: 1.745392, best loss: 1.272360 2025-01-16 01:45:14,045 - INFO - step 11263, loss: 1.602350, best loss: 1.272360 2025-01-16 01:45:14,195 - INFO - step 11264, loss: 1.603914, best loss: 1.272360 2025-01-16 01:45:14,345 - INFO - step 11265, loss: 1.621417, best loss: 1.272360 2025-01-16 01:45:14,495 - INFO - step 11266, loss: 1.883511, best loss: 1.272360 2025-01-16 01:45:14,646 - INFO - step 11267, loss: 
1.818238, best loss: 1.272360 2025-01-16 01:45:14,796 - INFO - step 11268, loss: 1.696875, best loss: 1.272360 2025-01-16 01:45:14,946 - INFO - step 11269, loss: 1.524299, best loss: 1.272360 2025-01-16 01:45:15,096 - INFO - step 11270, loss: 1.560249, best loss: 1.272360 2025-01-16 01:45:15,246 - INFO - step 11271, loss: 1.662384, best loss: 1.272360 2025-01-16 01:45:15,396 - INFO - step 11272, loss: 1.705456, best loss: 1.272360 2025-01-16 01:45:15,546 - INFO - step 11273, loss: 1.703664, best loss: 1.272360 2025-01-16 01:45:15,696 - INFO - step 11274, loss: 1.671334, best loss: 1.272360 2025-01-16 01:45:15,846 - INFO - step 11275, loss: 1.638231, best loss: 1.272360 2025-01-16 01:45:15,997 - INFO - step 11276, loss: 1.732575, best loss: 1.272360 2025-01-16 01:45:16,147 - INFO - step 11277, loss: 1.681247, best loss: 1.272360 2025-01-16 01:45:16,297 - INFO - step 11278, loss: 1.414529, best loss: 1.272360 2025-01-16 01:45:16,447 - INFO - step 11279, loss: 1.698501, best loss: 1.272360 2025-01-16 01:45:16,597 - INFO - step 11280, loss: 1.774676, best loss: 1.272360 2025-01-16 01:45:16,746 - INFO - step 11281, loss: 1.873900, best loss: 1.272360 2025-01-16 01:45:16,896 - INFO - step 11282, loss: 1.814665, best loss: 1.272360 2025-01-16 01:45:17,046 - INFO - step 11283, loss: 1.820875, best loss: 1.272360 2025-01-16 01:45:17,196 - INFO - step 11284, loss: 1.813416, best loss: 1.272360 2025-01-16 01:45:17,346 - INFO - step 11285, loss: 1.492669, best loss: 1.272360 2025-01-16 01:45:17,496 - INFO - step 11286, loss: 1.871242, best loss: 1.272360 2025-01-16 01:45:17,646 - INFO - step 11287, loss: 1.562245, best loss: 1.272360 2025-01-16 01:45:17,796 - INFO - step 11288, loss: 1.653495, best loss: 1.272360 2025-01-16 01:45:17,946 - INFO - step 11289, loss: 1.634184, best loss: 1.272360 2025-01-16 01:45:18,096 - INFO - step 11290, loss: 1.637019, best loss: 1.272360 2025-01-16 01:45:18,246 - INFO - step 11291, loss: 1.660717, best loss: 1.272360 2025-01-16 01:45:18,396 - 
INFO - step 11292, loss: 1.862858, best loss: 1.272360 2025-01-16 01:45:18,547 - INFO - step 11293, loss: 1.745106, best loss: 1.272360 2025-01-16 01:45:18,697 - INFO - step 11294, loss: 1.787987, best loss: 1.272360 2025-01-16 01:45:18,847 - INFO - step 11295, loss: 1.796306, best loss: 1.272360 2025-01-16 01:45:18,997 - INFO - step 11296, loss: 1.650111, best loss: 1.272360 2025-01-16 01:45:19,147 - INFO - step 11297, loss: 1.807775, best loss: 1.272360 2025-01-16 01:45:19,298 - INFO - step 11298, loss: 1.651286, best loss: 1.272360 2025-01-16 01:45:19,448 - INFO - step 11299, loss: 1.584007, best loss: 1.272360 2025-01-16 01:45:19,598 - INFO - step 11300, loss: 1.733741, best loss: 1.272360 2025-01-16 01:45:19,748 - INFO - step 11301, loss: 1.651080, best loss: 1.272360 2025-01-16 01:45:19,898 - INFO - step 11302, loss: 1.509794, best loss: 1.272360 2025-01-16 01:45:20,049 - INFO - step 11303, loss: 1.547738, best loss: 1.272360 2025-01-16 01:45:20,199 - INFO - step 11304, loss: 1.542026, best loss: 1.272360 2025-01-16 01:45:20,349 - INFO - step 11305, loss: 1.623384, best loss: 1.272360 2025-01-16 01:45:20,499 - INFO - step 11306, loss: 1.503571, best loss: 1.272360 2025-01-16 01:45:20,649 - INFO - step 11307, loss: 1.564583, best loss: 1.272360 2025-01-16 01:45:20,800 - INFO - step 11308, loss: 1.675124, best loss: 1.272360 2025-01-16 01:45:20,950 - INFO - step 11309, loss: 1.736577, best loss: 1.272360 2025-01-16 01:45:21,100 - INFO - step 11310, loss: 1.540293, best loss: 1.272360 2025-01-16 01:45:21,250 - INFO - step 11311, loss: 1.754595, best loss: 1.272360 2025-01-16 01:45:21,400 - INFO - step 11312, loss: 1.582471, best loss: 1.272360 2025-01-16 01:45:21,550 - INFO - step 11313, loss: 1.807634, best loss: 1.272360 2025-01-16 01:45:21,700 - INFO - step 11314, loss: 1.789453, best loss: 1.272360 2025-01-16 01:45:21,850 - INFO - step 11315, loss: 1.869227, best loss: 1.272360 2025-01-16 01:45:22,001 - INFO - step 11316, loss: 1.849976, best loss: 1.272360 
2025-01-16 01:45:22,151 - INFO - step 11317, loss: 1.571318, best loss: 1.272360 2025-01-16 01:45:22,301 - INFO - step 11318, loss: 1.676590, best loss: 1.272360 2025-01-16 01:45:22,451 - INFO - step 11319, loss: 1.784091, best loss: 1.272360 2025-01-16 01:45:22,601 - INFO - step 11320, loss: 1.674199, best loss: 1.272360 2025-01-16 01:45:22,751 - INFO - step 11321, loss: 1.703698, best loss: 1.272360 2025-01-16 01:45:22,901 - INFO - step 11322, loss: 1.709637, best loss: 1.272360 2025-01-16 01:45:23,052 - INFO - step 11323, loss: 1.986287, best loss: 1.272360 2025-01-16 01:45:23,202 - INFO - step 11324, loss: 1.663923, best loss: 1.272360 2025-01-16 01:45:23,352 - INFO - step 11325, loss: 1.634295, best loss: 1.272360 2025-01-16 01:45:23,502 - INFO - step 11326, loss: 1.681457, best loss: 1.272360 2025-01-16 01:45:23,652 - INFO - step 11327, loss: 1.602857, best loss: 1.272360 2025-01-16 01:45:23,802 - INFO - step 11328, loss: 1.621256, best loss: 1.272360 2025-01-16 01:45:23,952 - INFO - step 11329, loss: 1.815785, best loss: 1.272360 2025-01-16 01:45:24,102 - INFO - step 11330, loss: 1.701537, best loss: 1.272360 2025-01-16 01:45:24,252 - INFO - step 11331, loss: 1.857164, best loss: 1.272360 2025-01-16 01:45:24,402 - INFO - step 11332, loss: 1.699433, best loss: 1.272360 2025-01-16 01:45:24,553 - INFO - step 11333, loss: 1.709455, best loss: 1.272360 2025-01-16 01:45:24,703 - INFO - step 11334, loss: 1.883882, best loss: 1.272360 2025-01-16 01:45:24,853 - INFO - step 11335, loss: 1.715019, best loss: 1.272360 2025-01-16 01:45:25,003 - INFO - step 11336, loss: 1.778673, best loss: 1.272360 2025-01-16 01:45:25,154 - INFO - step 11337, loss: 1.749941, best loss: 1.272360 2025-01-16 01:45:25,304 - INFO - step 11338, loss: 1.629828, best loss: 1.272360 2025-01-16 01:45:25,455 - INFO - step 11339, loss: 1.726446, best loss: 1.272360 2025-01-16 01:45:25,605 - INFO - step 11340, loss: 1.774306, best loss: 1.272360 2025-01-16 01:45:25,755 - INFO - step 11341, loss: 
1.725105, best loss: 1.272360 2025-01-16 01:45:25,905 - INFO - step 11342, loss: 1.671198, best loss: 1.272360 2025-01-16 01:45:26,055 - INFO - step 11343, loss: 1.543272, best loss: 1.272360 2025-01-16 01:45:26,205 - INFO - step 11344, loss: 1.458389, best loss: 1.272360 2025-01-16 01:45:26,355 - INFO - step 11345, loss: 1.582581, best loss: 1.272360 2025-01-16 01:45:26,506 - INFO - step 11346, loss: 1.722430, best loss: 1.272360 2025-01-16 01:45:26,656 - INFO - step 11347, loss: 1.940506, best loss: 1.272360 2025-01-16 01:45:26,806 - INFO - step 11348, loss: 1.651270, best loss: 1.272360 2025-01-16 01:45:26,956 - INFO - step 11349, loss: 1.513287, best loss: 1.272360 2025-01-16 01:45:27,106 - INFO - step 11350, loss: 1.702698, best loss: 1.272360 2025-01-16 01:45:27,256 - INFO - step 11351, loss: 1.757901, best loss: 1.272360 2025-01-16 01:45:27,406 - INFO - step 11352, loss: 2.020343, best loss: 1.272360 2025-01-16 01:45:27,556 - INFO - step 11353, loss: 1.450815, best loss: 1.272360 2025-01-16 01:45:27,707 - INFO - step 11354, loss: 1.786424, best loss: 1.272360 2025-01-16 01:45:27,857 - INFO - step 11355, loss: 1.513984, best loss: 1.272360 2025-01-16 01:45:28,007 - INFO - step 11356, loss: 1.605926, best loss: 1.272360 2025-01-16 01:45:28,157 - INFO - step 11357, loss: 1.744414, best loss: 1.272360 2025-01-16 01:45:28,307 - INFO - step 11358, loss: 1.730234, best loss: 1.272360 2025-01-16 01:45:28,457 - INFO - step 11359, loss: 1.727153, best loss: 1.272360 2025-01-16 01:45:28,608 - INFO - step 11360, loss: 1.748689, best loss: 1.272360 2025-01-16 01:45:28,758 - INFO - step 11361, loss: 1.680878, best loss: 1.272360 2025-01-16 01:45:28,908 - INFO - step 11362, loss: 1.818679, best loss: 1.272360 2025-01-16 01:45:29,058 - INFO - step 11363, loss: 1.591124, best loss: 1.272360 2025-01-16 01:45:29,208 - INFO - step 11364, loss: 1.293437, best loss: 1.272360 2025-01-16 01:45:29,358 - INFO - step 11365, loss: 1.627593, best loss: 1.272360 2025-01-16 01:45:29,508 - 
INFO - step 11366, loss: 1.913239, best loss: 1.272360 2025-01-16 01:45:29,658 - INFO - step 11367, loss: 1.686940, best loss: 1.272360 2025-01-16 01:45:29,808 - INFO - step 11368, loss: 1.571982, best loss: 1.272360 2025-01-16 01:45:29,959 - INFO - step 11369, loss: 1.655336, best loss: 1.272360 2025-01-16 01:45:30,109 - INFO - step 11370, loss: 1.800792, best loss: 1.272360 2025-01-16 01:45:30,259 - INFO - step 11371, loss: 1.839783, best loss: 1.272360 2025-01-16 01:45:30,409 - INFO - step 11372, loss: 1.720493, best loss: 1.272360 2025-01-16 01:45:30,559 - INFO - step 11373, loss: 1.613885, best loss: 1.272360 2025-01-16 01:45:30,709 - INFO - step 11374, loss: 1.850391, best loss: 1.272360 2025-01-16 01:45:30,859 - INFO - step 11375, loss: 1.801514, best loss: 1.272360 2025-01-16 01:45:31,009 - INFO - step 11376, loss: 1.664681, best loss: 1.272360 2025-01-16 01:45:31,159 - INFO - step 11377, loss: 1.714625, best loss: 1.272360 2025-01-16 01:45:31,309 - INFO - step 11378, loss: 1.678506, best loss: 1.272360 2025-01-16 01:45:31,459 - INFO - step 11379, loss: 1.813343, best loss: 1.272360 2025-01-16 01:45:31,609 - INFO - step 11380, loss: 1.666023, best loss: 1.272360 2025-01-16 01:45:31,759 - INFO - step 11381, loss: 1.663028, best loss: 1.272360 2025-01-16 01:45:31,910 - INFO - step 11382, loss: 1.562780, best loss: 1.272360 2025-01-16 01:45:32,060 - INFO - step 11383, loss: 1.451865, best loss: 1.272360 2025-01-16 01:45:32,210 - INFO - step 11384, loss: 1.717002, best loss: 1.272360 2025-01-16 01:45:32,360 - INFO - step 11385, loss: 1.685197, best loss: 1.272360 2025-01-16 01:45:32,510 - INFO - step 11386, loss: 1.905873, best loss: 1.272360 2025-01-16 01:45:32,660 - INFO - step 11387, loss: 1.654468, best loss: 1.272360 2025-01-16 01:45:32,810 - INFO - step 11388, loss: 1.742178, best loss: 1.272360 2025-01-16 01:45:32,961 - INFO - step 11389, loss: 1.819265, best loss: 1.272360 2025-01-16 01:45:33,111 - INFO - step 11390, loss: 1.600424, best loss: 1.272360 
2025-01-16 01:45:33,261 - INFO - step 11391, loss: 1.480478, best loss: 1.272360 2025-01-16 01:45:33,411 - INFO - step 11392, loss: 1.609410, best loss: 1.272360 2025-01-16 01:45:33,561 - INFO - step 11393, loss: 1.737439, best loss: 1.272360 2025-01-16 01:45:33,711 - INFO - step 11394, loss: 1.752061, best loss: 1.272360 2025-01-16 01:45:33,861 - INFO - step 11395, loss: 1.653788, best loss: 1.272360 2025-01-16 01:45:34,011 - INFO - step 11396, loss: 1.918371, best loss: 1.272360 2025-01-16 01:45:34,161 - INFO - step 11397, loss: 1.926450, best loss: 1.272360 2025-01-16 01:45:34,311 - INFO - step 11398, loss: 1.859109, best loss: 1.272360 2025-01-16 01:45:34,461 - INFO - step 11399, loss: 1.735129, best loss: 1.272360 2025-01-16 01:45:34,611 - INFO - step 11400, loss: 1.960704, best loss: 1.272360 2025-01-16 01:45:34,761 - INFO - step 11401, loss: 1.614590, best loss: 1.272360 2025-01-16 01:45:34,911 - INFO - step 11402, loss: 1.863353, best loss: 1.272360 2025-01-16 01:45:35,061 - INFO - step 11403, loss: 1.858401, best loss: 1.272360 2025-01-16 01:45:35,211 - INFO - step 11404, loss: 1.795506, best loss: 1.272360 2025-01-16 01:45:35,361 - INFO - step 11405, loss: 1.743011, best loss: 1.272360 2025-01-16 01:45:35,511 - INFO - step 11406, loss: 1.866419, best loss: 1.272360 2025-01-16 01:45:35,661 - INFO - step 11407, loss: 1.687919, best loss: 1.272360 2025-01-16 01:45:35,811 - INFO - step 11408, loss: 1.281266, best loss: 1.272360 2025-01-16 01:45:35,961 - INFO - step 11409, loss: 1.810512, best loss: 1.272360 2025-01-16 01:45:36,111 - INFO - step 11410, loss: 1.648151, best loss: 1.272360 2025-01-16 01:45:36,261 - INFO - step 11411, loss: 1.736294, best loss: 1.272360 2025-01-16 01:45:36,411 - INFO - step 11412, loss: 1.834324, best loss: 1.272360 2025-01-16 01:45:36,562 - INFO - step 11413, loss: 1.676579, best loss: 1.272360 2025-01-16 01:45:36,712 - INFO - step 11414, loss: 1.761599, best loss: 1.272360 2025-01-16 01:45:36,861 - INFO - step 11415, loss: 
1.733265, best loss: 1.272360 2025-01-16 01:45:37,011 - INFO - step 11416, loss: 1.815690, best loss: 1.272360 2025-01-16 01:45:37,161 - INFO - step 11417, loss: 1.647779, best loss: 1.272360 2025-01-16 01:45:37,311 - INFO - step 11418, loss: 1.644372, best loss: 1.272360 2025-01-16 01:45:37,460 - INFO - step 11419, loss: 1.586481, best loss: 1.272360 2025-01-16 01:45:37,610 - INFO - step 11420, loss: 1.621281, best loss: 1.272360 2025-01-16 01:45:37,759 - INFO - step 11421, loss: 1.692365, best loss: 1.272360 2025-01-16 01:45:37,909 - INFO - step 11422, loss: 1.747383, best loss: 1.272360 2025-01-16 01:45:38,058 - INFO - step 11423, loss: 1.773084, best loss: 1.272360 2025-01-16 01:45:38,208 - INFO - step 11424, loss: 1.651209, best loss: 1.272360 2025-01-16 01:45:38,357 - INFO - step 11425, loss: 1.522240, best loss: 1.272360 2025-01-16 01:45:38,507 - INFO - step 11426, loss: 1.652830, best loss: 1.272360 2025-01-16 01:45:38,656 - INFO - step 11427, loss: 1.679038, best loss: 1.272360 2025-01-16 01:45:38,806 - INFO - step 11428, loss: 1.629943, best loss: 1.272360 2025-01-16 01:45:38,956 - INFO - step 11429, loss: 1.680391, best loss: 1.272360 2025-01-16 01:45:39,106 - INFO - step 11430, loss: 1.634730, best loss: 1.272360 2025-01-16 01:45:39,256 - INFO - step 11431, loss: 1.497492, best loss: 1.272360 2025-01-16 01:45:39,406 - INFO - step 11432, loss: 1.608547, best loss: 1.272360 2025-01-16 01:45:39,556 - INFO - step 11433, loss: 1.592573, best loss: 1.272360 2025-01-16 01:45:39,706 - INFO - step 11434, loss: 1.613322, best loss: 1.272360 2025-01-16 01:45:39,856 - INFO - step 11435, loss: 1.770659, best loss: 1.272360 2025-01-16 01:45:40,007 - INFO - step 11436, loss: 1.749622, best loss: 1.272360 2025-01-16 01:45:40,157 - INFO - step 11437, loss: 1.716168, best loss: 1.272360 2025-01-16 01:45:40,307 - INFO - step 11438, loss: 1.538256, best loss: 1.272360 2025-01-16 01:45:40,457 - INFO - step 11439, loss: 1.649915, best loss: 1.272360 2025-01-16 01:45:40,607 - 
INFO - step 11440, loss: 1.815292, best loss: 1.272360 2025-01-16 01:45:40,757 - INFO - step 11441, loss: 1.807197, best loss: 1.272360 2025-01-16 01:45:40,907 - INFO - step 11442, loss: 1.846384, best loss: 1.272360 2025-01-16 01:45:41,057 - INFO - step 11443, loss: 1.912115, best loss: 1.272360 2025-01-16 01:45:41,207 - INFO - step 11444, loss: 1.975954, best loss: 1.272360 2025-01-16 01:45:41,357 - INFO - step 11445, loss: 1.695866, best loss: 1.272360 2025-01-16 01:45:41,508 - INFO - step 11446, loss: 1.843202, best loss: 1.272360 2025-01-16 01:45:41,658 - INFO - step 11447, loss: 1.543380, best loss: 1.272360 2025-01-16 01:45:41,808 - INFO - step 11448, loss: 1.462243, best loss: 1.272360 2025-01-16 01:45:41,958 - INFO - step 11449, loss: 1.726075, best loss: 1.272360 2025-01-16 01:45:42,108 - INFO - step 11450, loss: 1.755914, best loss: 1.272360 2025-01-16 01:45:42,258 - INFO - step 11451, loss: 1.494189, best loss: 1.272360 2025-01-16 01:45:42,408 - INFO - step 11452, loss: 1.512436, best loss: 1.272360 2025-01-16 01:45:42,558 - INFO - step 11453, loss: 1.762558, best loss: 1.272360 2025-01-16 01:45:42,708 - INFO - step 11454, loss: 1.785933, best loss: 1.272360 2025-01-16 01:45:42,859 - INFO - step 11455, loss: 1.682101, best loss: 1.272360 2025-01-16 01:45:43,009 - INFO - step 11456, loss: 1.756417, best loss: 1.272360 2025-01-16 01:45:43,159 - INFO - step 11457, loss: 1.534667, best loss: 1.272360 2025-01-16 01:45:43,309 - INFO - step 11458, loss: 1.441829, best loss: 1.272360 2025-01-16 01:45:43,459 - INFO - step 11459, loss: 1.731979, best loss: 1.272360 2025-01-16 01:45:43,609 - INFO - step 11460, loss: 1.940778, best loss: 1.272360 2025-01-16 01:45:43,759 - INFO - step 11461, loss: 1.928018, best loss: 1.272360 2025-01-16 01:45:43,910 - INFO - step 11462, loss: 1.761524, best loss: 1.272360 2025-01-16 01:45:44,060 - INFO - step 11463, loss: 1.649658, best loss: 1.272360 2025-01-16 01:45:44,210 - INFO - step 11464, loss: 1.808562, best loss: 1.272360 
2025-01-16 01:45:44,360 - INFO - step 11465, loss: 1.670015, best loss: 1.272360 2025-01-16 01:45:44,510 - INFO - step 11466, loss: 1.653095, best loss: 1.272360 2025-01-16 01:45:44,660 - INFO - step 11467, loss: 1.708828, best loss: 1.272360 2025-01-16 01:45:44,810 - INFO - step 11468, loss: 1.455325, best loss: 1.272360 2025-01-16 01:45:44,961 - INFO - step 11469, loss: 1.308231, best loss: 1.272360 2025-01-16 01:45:45,111 - INFO - step 11470, loss: 1.630248, best loss: 1.272360 2025-01-16 01:45:45,261 - INFO - step 11471, loss: 1.674950, best loss: 1.272360 2025-01-16 01:45:45,411 - INFO - step 11472, loss: 1.684888, best loss: 1.272360 2025-01-16 01:45:45,561 - INFO - step 11473, loss: 1.391326, best loss: 1.272360 2025-01-16 01:45:45,711 - INFO - step 11474, loss: 1.410055, best loss: 1.272360 2025-01-16 01:45:45,861 - INFO - step 11475, loss: 1.276340, best loss: 1.272360 2025-01-16 01:45:46,012 - INFO - step 11476, loss: 1.515968, best loss: 1.272360 2025-01-16 01:45:46,162 - INFO - step 11477, loss: 1.703384, best loss: 1.272360 2025-01-16 01:45:46,312 - INFO - step 11478, loss: 1.686483, best loss: 1.272360 2025-01-16 01:45:46,462 - INFO - step 11479, loss: 1.720254, best loss: 1.272360 2025-01-16 01:45:46,612 - INFO - step 11480, loss: 1.669585, best loss: 1.272360 2025-01-16 01:45:46,762 - INFO - step 11481, loss: 1.551148, best loss: 1.272360 2025-01-16 01:45:46,913 - INFO - step 11482, loss: 1.669835, best loss: 1.272360 2025-01-16 01:45:47,063 - INFO - step 11483, loss: 1.443821, best loss: 1.272360 2025-01-16 01:45:47,213 - INFO - step 11484, loss: 1.544670, best loss: 1.272360 2025-01-16 01:45:47,363 - INFO - step 11485, loss: 1.579592, best loss: 1.272360 2025-01-16 01:45:47,513 - INFO - step 11486, loss: 1.478142, best loss: 1.272360 2025-01-16 01:45:47,664 - INFO - step 11487, loss: 1.447989, best loss: 1.272360 2025-01-16 01:45:47,814 - INFO - step 11488, loss: 1.476307, best loss: 1.272360 2025-01-16 01:45:47,964 - INFO - step 11489, loss: 
1.647453, best loss: 1.272360 2025-01-16 01:45:48,114 - INFO - step 11490, loss: 1.503151, best loss: 1.272360 2025-01-16 01:45:48,264 - INFO - step 11491, loss: 1.500908, best loss: 1.272360 2025-01-16 01:45:48,414 - INFO - step 11492, loss: 1.509808, best loss: 1.272360 2025-01-16 01:45:48,564 - INFO - step 11493, loss: 1.491892, best loss: 1.272360 2025-01-16 01:45:48,714 - INFO - step 11494, loss: 1.536638, best loss: 1.272360 2025-01-16 01:45:48,864 - INFO - step 11495, loss: 1.821862, best loss: 1.272360 2025-01-16 01:45:49,015 - INFO - step 11496, loss: 1.509001, best loss: 1.272360 2025-01-16 01:45:49,165 - INFO - step 11497, loss: 1.596535, best loss: 1.272360 2025-01-16 01:45:49,315 - INFO - step 11498, loss: 1.574970, best loss: 1.272360 2025-01-16 01:45:49,465 - INFO - step 11499, loss: 1.750035, best loss: 1.272360 2025-01-16 01:45:49,616 - INFO - step 11500, loss: 1.539103, best loss: 1.272360 2025-01-16 01:45:49,766 - INFO - step 11501, loss: 1.454935, best loss: 1.272360 2025-01-16 01:45:49,916 - INFO - step 11502, loss: 1.567931, best loss: 1.272360 2025-01-16 01:45:50,066 - INFO - step 11503, loss: 1.582125, best loss: 1.272360 2025-01-16 01:45:50,216 - INFO - step 11504, loss: 1.806577, best loss: 1.272360 2025-01-16 01:45:50,366 - INFO - step 11505, loss: 1.699626, best loss: 1.272360 2025-01-16 01:45:50,516 - INFO - step 11506, loss: 1.642775, best loss: 1.272360 2025-01-16 01:45:50,666 - INFO - step 11507, loss: 1.626109, best loss: 1.272360 2025-01-16 01:45:50,817 - INFO - step 11508, loss: 1.778707, best loss: 1.272360 2025-01-16 01:45:50,967 - INFO - step 11509, loss: 1.660712, best loss: 1.272360 2025-01-16 01:45:51,117 - INFO - step 11510, loss: 1.557111, best loss: 1.272360 2025-01-16 01:45:51,267 - INFO - step 11511, loss: 1.640754, best loss: 1.272360 2025-01-16 01:45:51,417 - INFO - step 11512, loss: 1.575763, best loss: 1.272360 2025-01-16 01:45:51,567 - INFO - step 11513, loss: 1.474916, best loss: 1.272360 2025-01-16 01:45:51,717 - 
INFO - step 11514, loss: 1.637136, best loss: 1.272360 2025-01-16 01:45:51,867 - INFO - step 11515, loss: 1.526003, best loss: 1.272360 2025-01-16 01:45:52,017 - INFO - step 11516, loss: 1.602255, best loss: 1.272360 2025-01-16 01:45:52,167 - INFO - step 11517, loss: 1.274537, best loss: 1.272360 2025-01-16 01:45:52,318 - INFO - step 11518, loss: 1.639103, best loss: 1.272360 2025-01-16 01:45:52,468 - INFO - step 11519, loss: 1.592492, best loss: 1.272360 2025-01-16 01:45:52,618 - INFO - step 11520, loss: 1.573954, best loss: 1.272360 2025-01-16 01:45:52,768 - INFO - step 11521, loss: 1.707023, best loss: 1.272360 2025-01-16 01:45:52,918 - INFO - step 11522, loss: 1.536687, best loss: 1.272360 2025-01-16 01:45:53,068 - INFO - step 11523, loss: 1.675405, best loss: 1.272360 2025-01-16 01:45:53,218 - INFO - step 11524, loss: 1.588771, best loss: 1.272360 2025-01-16 01:45:53,368 - INFO - step 11525, loss: 1.545366, best loss: 1.272360 2025-01-16 01:45:53,518 - INFO - step 11526, loss: 1.508743, best loss: 1.272360 2025-01-16 01:45:53,668 - INFO - step 11527, loss: 1.660700, best loss: 1.272360 2025-01-16 01:45:53,818 - INFO - step 11528, loss: 1.539588, best loss: 1.272360 2025-01-16 01:45:53,968 - INFO - step 11529, loss: 1.487502, best loss: 1.272360 2025-01-16 01:45:54,118 - INFO - step 11530, loss: 1.448472, best loss: 1.272360 2025-01-16 01:45:54,269 - INFO - step 11531, loss: 1.439049, best loss: 1.272360 2025-01-16 01:45:54,419 - INFO - step 11532, loss: 1.718644, best loss: 1.272360 2025-01-16 01:45:54,569 - INFO - step 11533, loss: 1.498274, best loss: 1.272360 2025-01-16 01:45:54,719 - INFO - step 11534, loss: 1.496008, best loss: 1.272360 2025-01-16 01:45:54,869 - INFO - step 11535, loss: 1.306712, best loss: 1.272360 2025-01-16 01:45:55,019 - INFO - step 11536, loss: 1.358696, best loss: 1.272360 2025-01-16 01:45:58,544 - INFO - step 11537, loss: 1.255408, best loss: 1.255408 2025-01-16 01:45:58,706 - INFO - step 11538, loss: 1.652491, best loss: 1.255408 
2025-01-16 01:45:58,858 - INFO - step 11539, loss: 1.583473, best loss: 1.255408 2025-01-16 01:45:59,008 - INFO - step 11540, loss: 1.732804, best loss: 1.255408 2025-01-16 01:45:59,158 - INFO - step 11541, loss: 1.671361, best loss: 1.255408 2025-01-16 01:45:59,308 - INFO - step 11542, loss: 1.640876, best loss: 1.255408 2025-01-16 01:45:59,458 - INFO - step 11543, loss: 1.464287, best loss: 1.255408 2025-01-16 01:45:59,609 - INFO - step 11544, loss: 1.451585, best loss: 1.255408 2025-01-16 01:45:59,759 - INFO - step 11545, loss: 1.711938, best loss: 1.255408 2025-01-16 01:45:59,909 - INFO - step 11546, loss: 1.665484, best loss: 1.255408 2025-01-16 01:46:03,420 - INFO - step 11547, loss: 1.185902, best loss: 1.185902 2025-01-16 01:46:03,570 - INFO - step 11548, loss: 1.423978, best loss: 1.185902 2025-01-16 01:46:03,720 - INFO - step 11549, loss: 1.282525, best loss: 1.185902 2025-01-16 01:46:03,871 - INFO - step 11550, loss: 1.611521, best loss: 1.185902 2025-01-16 01:46:04,021 - INFO - step 11551, loss: 1.581981, best loss: 1.185902 2025-01-16 01:46:04,171 - INFO - step 11552, loss: 1.623717, best loss: 1.185902 2025-01-16 01:46:04,321 - INFO - step 11553, loss: 1.667162, best loss: 1.185902 2025-01-16 01:46:04,471 - INFO - step 11554, loss: 1.606339, best loss: 1.185902 2025-01-16 01:46:04,621 - INFO - step 11555, loss: 1.377890, best loss: 1.185902 2025-01-16 01:46:04,771 - INFO - step 11556, loss: 1.710398, best loss: 1.185902 2025-01-16 01:46:04,921 - INFO - step 11557, loss: 1.616247, best loss: 1.185902 2025-01-16 01:46:05,072 - INFO - step 11558, loss: 1.780864, best loss: 1.185902 2025-01-16 01:46:05,222 - INFO - step 11559, loss: 1.706003, best loss: 1.185902 2025-01-16 01:46:05,372 - INFO - step 11560, loss: 1.539974, best loss: 1.185902 2025-01-16 01:46:05,522 - INFO - step 11561, loss: 1.629008, best loss: 1.185902 2025-01-16 01:46:05,672 - INFO - step 11562, loss: 1.417509, best loss: 1.185902 2025-01-16 01:46:05,823 - INFO - step 11563, loss: 
1.744808, best loss: 1.185902 2025-01-16 01:46:05,973 - INFO - step 11564, loss: 1.789855, best loss: 1.185902 2025-01-16 01:46:06,123 - INFO - step 11565, loss: 1.765280, best loss: 1.185902 2025-01-16 01:46:06,273 - INFO - step 11566, loss: 1.657820, best loss: 1.185902 2025-01-16 01:46:06,423 - INFO - step 11567, loss: 1.535817, best loss: 1.185902 2025-01-16 01:46:06,574 - INFO - step 11568, loss: 1.593757, best loss: 1.185902 2025-01-16 01:46:06,724 - INFO - step 11569, loss: 1.625197, best loss: 1.185902 2025-01-16 01:46:06,874 - INFO - step 11570, loss: 1.528558, best loss: 1.185902 2025-01-16 01:46:07,024 - INFO - step 11571, loss: 1.756367, best loss: 1.185902 2025-01-16 01:46:07,174 - INFO - step 11572, loss: 1.425851, best loss: 1.185902 2025-01-16 01:46:07,324 - INFO - step 11573, loss: 1.500678, best loss: 1.185902 2025-01-16 01:46:07,474 - INFO - step 11574, loss: 1.622217, best loss: 1.185902 2025-01-16 01:46:07,625 - INFO - step 11575, loss: 1.740281, best loss: 1.185902 2025-01-16 01:46:07,775 - INFO - step 11576, loss: 1.673215, best loss: 1.185902 2025-01-16 01:46:07,925 - INFO - step 11577, loss: 1.433010, best loss: 1.185902 2025-01-16 01:46:08,075 - INFO - step 11578, loss: 1.640399, best loss: 1.185902 2025-01-16 01:46:08,225 - INFO - step 11579, loss: 1.718475, best loss: 1.185902 2025-01-16 01:46:08,376 - INFO - step 11580, loss: 1.477962, best loss: 1.185902 2025-01-16 01:46:08,526 - INFO - step 11581, loss: 1.498303, best loss: 1.185902 2025-01-16 01:46:08,676 - INFO - step 11582, loss: 1.551303, best loss: 1.185902 2025-01-16 01:46:08,826 - INFO - step 11583, loss: 1.624847, best loss: 1.185902 2025-01-16 01:46:08,976 - INFO - step 11584, loss: 1.410651, best loss: 1.185902 2025-01-16 01:46:09,126 - INFO - step 11585, loss: 1.360457, best loss: 1.185902 2025-01-16 01:46:09,276 - INFO - step 11586, loss: 1.642724, best loss: 1.185902 2025-01-16 01:46:09,427 - INFO - step 11587, loss: 1.699988, best loss: 1.185902 2025-01-16 01:46:09,577 - 
INFO - step 11588, loss: 1.571992, best loss: 1.185902 2025-01-16 01:46:09,727 - INFO - step 11589, loss: 1.457845, best loss: 1.185902 2025-01-16 01:46:09,877 - INFO - step 11590, loss: 1.713357, best loss: 1.185902 2025-01-16 01:46:10,027 - INFO - step 11591, loss: 1.669693, best loss: 1.185902 2025-01-16 01:46:10,177 - INFO - step 11592, loss: 1.709985, best loss: 1.185902 2025-01-16 01:46:10,327 - INFO - step 11593, loss: 1.529577, best loss: 1.185902 2025-01-16 01:46:10,477 - INFO - step 11594, loss: 1.480838, best loss: 1.185902 2025-01-16 01:46:10,628 - INFO - step 11595, loss: 1.511773, best loss: 1.185902 2025-01-16 01:46:10,778 - INFO - step 11596, loss: 1.777933, best loss: 1.185902 2025-01-16 01:46:10,928 - INFO - step 11597, loss: 1.663024, best loss: 1.185902 2025-01-16 01:46:11,078 - INFO - step 11598, loss: 1.690361, best loss: 1.185902 2025-01-16 01:46:11,228 - INFO - step 11599, loss: 1.386391, best loss: 1.185902 2025-01-16 01:46:11,379 - INFO - step 11600, loss: 1.485447, best loss: 1.185902 2025-01-16 01:46:11,529 - INFO - step 11601, loss: 1.612581, best loss: 1.185902 2025-01-16 01:46:11,679 - INFO - step 11602, loss: 1.618295, best loss: 1.185902 2025-01-16 01:46:11,829 - INFO - step 11603, loss: 1.613252, best loss: 1.185902 2025-01-16 01:46:11,979 - INFO - step 11604, loss: 1.661502, best loss: 1.185902 2025-01-16 01:46:12,129 - INFO - step 11605, loss: 1.532544, best loss: 1.185902 2025-01-16 01:46:12,280 - INFO - step 11606, loss: 1.578906, best loss: 1.185902 2025-01-16 01:46:12,430 - INFO - step 11607, loss: 1.572319, best loss: 1.185902 2025-01-16 01:46:12,580 - INFO - step 11608, loss: 1.307715, best loss: 1.185902 2025-01-16 01:46:12,730 - INFO - step 11609, loss: 1.668500, best loss: 1.185902 2025-01-16 01:46:12,880 - INFO - step 11610, loss: 1.610636, best loss: 1.185902 2025-01-16 01:46:13,030 - INFO - step 11611, loss: 1.784611, best loss: 1.185902 2025-01-16 01:46:13,180 - INFO - step 11612, loss: 1.742009, best loss: 1.185902 
2025-01-16 01:46:13,330 - INFO - step 11613, loss: 1.681783, best loss: 1.185902 2025-01-16 01:46:13,480 - INFO - step 11614, loss: 1.663592, best loss: 1.185902 2025-01-16 01:46:13,630 - INFO - step 11615, loss: 1.398834, best loss: 1.185902 2025-01-16 01:46:13,781 - INFO - step 11616, loss: 1.695987, best loss: 1.185902 2025-01-16 01:46:13,931 - INFO - step 11617, loss: 1.411880, best loss: 1.185902 2025-01-16 01:46:14,081 - INFO - step 11618, loss: 1.488728, best loss: 1.185902 2025-01-16 01:46:14,231 - INFO - step 11619, loss: 1.608185, best loss: 1.185902 2025-01-16 01:46:14,381 - INFO - step 11620, loss: 1.580974, best loss: 1.185902 2025-01-16 01:46:14,531 - INFO - step 11621, loss: 1.605231, best loss: 1.185902 2025-01-16 01:46:14,682 - INFO - step 11622, loss: 1.634948, best loss: 1.185902 2025-01-16 01:46:14,832 - INFO - step 11623, loss: 1.681872, best loss: 1.185902 2025-01-16 01:46:14,982 - INFO - step 11624, loss: 1.717043, best loss: 1.185902 2025-01-16 01:46:15,132 - INFO - step 11625, loss: 1.669608, best loss: 1.185902 2025-01-16 01:46:15,282 - INFO - step 11626, loss: 1.566904, best loss: 1.185902 2025-01-16 01:46:15,432 - INFO - step 11627, loss: 1.672670, best loss: 1.185902 2025-01-16 01:46:15,582 - INFO - step 11628, loss: 1.555527, best loss: 1.185902 2025-01-16 01:46:15,732 - INFO - step 11629, loss: 1.448558, best loss: 1.185902 2025-01-16 01:46:15,882 - INFO - step 11630, loss: 1.623988, best loss: 1.185902 2025-01-16 01:46:16,032 - INFO - step 11631, loss: 1.493905, best loss: 1.185902 2025-01-16 01:46:16,182 - INFO - step 11632, loss: 1.497390, best loss: 1.185902 2025-01-16 01:46:16,332 - INFO - step 11633, loss: 1.435788, best loss: 1.185902 2025-01-16 01:46:16,482 - INFO - step 11634, loss: 1.420704, best loss: 1.185902 2025-01-16 01:46:16,632 - INFO - step 11635, loss: 1.562151, best loss: 1.185902 2025-01-16 01:46:16,782 - INFO - step 11636, loss: 1.352666, best loss: 1.185902 2025-01-16 01:46:16,932 - INFO - step 11637, loss: 
1.468271, best loss: 1.185902 2025-01-16 01:46:17,082 - INFO - step 11638, loss: 1.560886, best loss: 1.185902 2025-01-16 01:46:17,232 - INFO - step 11639, loss: 1.617658, best loss: 1.185902 2025-01-16 01:46:17,382 - INFO - step 11640, loss: 1.498972, best loss: 1.185902 2025-01-16 01:46:17,532 - INFO - step 11641, loss: 1.698024, best loss: 1.185902 2025-01-16 01:46:17,683 - INFO - step 11642, loss: 1.526098, best loss: 1.185902 2025-01-16 01:46:17,833 - INFO - step 11643, loss: 1.772148, best loss: 1.185902 2025-01-16 01:46:17,983 - INFO - step 11644, loss: 1.752271, best loss: 1.185902 2025-01-16 01:46:18,133 - INFO - step 11645, loss: 1.719576, best loss: 1.185902 2025-01-16 01:46:18,283 - INFO - step 11646, loss: 1.742159, best loss: 1.185902 2025-01-16 01:46:18,433 - INFO - step 11647, loss: 1.499017, best loss: 1.185902 2025-01-16 01:46:18,583 - INFO - step 11648, loss: 1.600118, best loss: 1.185902 2025-01-16 01:46:18,733 - INFO - step 11649, loss: 1.645260, best loss: 1.185902 2025-01-16 01:46:18,883 - INFO - step 11650, loss: 1.568242, best loss: 1.185902 2025-01-16 01:46:19,033 - INFO - step 11651, loss: 1.636767, best loss: 1.185902 2025-01-16 01:46:19,183 - INFO - step 11652, loss: 1.598455, best loss: 1.185902 2025-01-16 01:46:19,333 - INFO - step 11653, loss: 1.816105, best loss: 1.185902 2025-01-16 01:46:19,484 - INFO - step 11654, loss: 1.500512, best loss: 1.185902 2025-01-16 01:46:19,634 - INFO - step 11655, loss: 1.511829, best loss: 1.185902 2025-01-16 01:46:19,784 - INFO - step 11656, loss: 1.571168, best loss: 1.185902 2025-01-16 01:46:19,934 - INFO - step 11657, loss: 1.537834, best loss: 1.185902 2025-01-16 01:46:20,084 - INFO - step 11658, loss: 1.563777, best loss: 1.185902 2025-01-16 01:46:20,235 - INFO - step 11659, loss: 1.701673, best loss: 1.185902 2025-01-16 01:46:20,385 - INFO - step 11660, loss: 1.586543, best loss: 1.185902 2025-01-16 01:46:20,535 - INFO - step 11661, loss: 1.780052, best loss: 1.185902 2025-01-16 01:46:20,685 - 
INFO - step 11662, loss: 1.590893, best loss: 1.185902 2025-01-16 01:46:20,835 - INFO - step 11663, loss: 1.548133, best loss: 1.185902 2025-01-16 01:46:20,985 - INFO - step 11664, loss: 1.728563, best loss: 1.185902 2025-01-16 01:46:21,135 - INFO - step 11665, loss: 1.612880, best loss: 1.185902 2025-01-16 01:46:21,285 - INFO - step 11666, loss: 1.597015, best loss: 1.185902 2025-01-16 01:46:21,435 - INFO - step 11667, loss: 1.617109, best loss: 1.185902 2025-01-16 01:46:21,585 - INFO - step 11668, loss: 1.499744, best loss: 1.185902 2025-01-16 01:46:21,735 - INFO - step 11669, loss: 1.589100, best loss: 1.185902 2025-01-16 01:46:21,885 - INFO - step 11670, loss: 1.651743, best loss: 1.185902 2025-01-16 01:46:22,035 - INFO - step 11671, loss: 1.569026, best loss: 1.185902 2025-01-16 01:46:22,185 - INFO - step 11672, loss: 1.665868, best loss: 1.185902 2025-01-16 01:46:22,335 - INFO - step 11673, loss: 1.445200, best loss: 1.185902 2025-01-16 01:46:22,485 - INFO - step 11674, loss: 1.391487, best loss: 1.185902 2025-01-16 01:46:22,636 - INFO - step 11675, loss: 1.413279, best loss: 1.185902 2025-01-16 01:46:22,786 - INFO - step 11676, loss: 1.581160, best loss: 1.185902 2025-01-16 01:46:22,936 - INFO - step 11677, loss: 1.790860, best loss: 1.185902 2025-01-16 01:46:23,085 - INFO - step 11678, loss: 1.527174, best loss: 1.185902 2025-01-16 01:46:23,235 - INFO - step 11679, loss: 1.260893, best loss: 1.185902 2025-01-16 01:46:23,385 - INFO - step 11680, loss: 1.591474, best loss: 1.185902 2025-01-16 01:46:23,535 - INFO - step 11681, loss: 1.665397, best loss: 1.185902 2025-01-16 01:46:23,685 - INFO - step 11682, loss: 1.800180, best loss: 1.185902 2025-01-16 01:46:23,835 - INFO - step 11683, loss: 1.349933, best loss: 1.185902 2025-01-16 01:46:23,985 - INFO - step 11684, loss: 1.604903, best loss: 1.185902 2025-01-16 01:46:24,135 - INFO - step 11685, loss: 1.422576, best loss: 1.185902 2025-01-16 01:46:24,286 - INFO - step 11686, loss: 1.391481, best loss: 1.185902 
2025-01-16 01:46:24,436 - INFO - step 11687, loss: 1.580942, best loss: 1.185902 2025-01-16 01:46:24,586 - INFO - step 11688, loss: 1.534541, best loss: 1.185902 2025-01-16 01:46:24,736 - INFO - step 11689, loss: 1.604172, best loss: 1.185902 2025-01-16 01:46:24,886 - INFO - step 11690, loss: 1.632293, best loss: 1.185902 2025-01-16 01:46:25,036 - INFO - step 11691, loss: 1.621860, best loss: 1.185902 2025-01-16 01:46:25,186 - INFO - step 11692, loss: 1.751537, best loss: 1.185902 2025-01-16 01:46:25,336 - INFO - step 11693, loss: 1.449527, best loss: 1.185902 2025-01-16 01:46:28,859 - INFO - step 11694, loss: 1.177307, best loss: 1.177307 2025-01-16 01:46:29,010 - INFO - step 11695, loss: 1.513431, best loss: 1.177307 2025-01-16 01:46:29,160 - INFO - step 11696, loss: 1.791603, best loss: 1.177307 2025-01-16 01:46:29,310 - INFO - step 11697, loss: 1.560638, best loss: 1.177307 2025-01-16 01:46:29,461 - INFO - step 11698, loss: 1.437354, best loss: 1.177307 2025-01-16 01:46:29,611 - INFO - step 11699, loss: 1.477709, best loss: 1.177307 2025-01-16 01:46:29,762 - INFO - step 11700, loss: 1.704814, best loss: 1.177307 2025-01-16 01:46:29,912 - INFO - step 11701, loss: 1.696068, best loss: 1.177307 2025-01-16 01:46:30,062 - INFO - step 11702, loss: 1.606178, best loss: 1.177307 2025-01-16 01:46:30,212 - INFO - step 11703, loss: 1.458396, best loss: 1.177307 2025-01-16 01:46:30,362 - INFO - step 11704, loss: 1.664070, best loss: 1.177307 2025-01-16 01:46:30,512 - INFO - step 11705, loss: 1.674362, best loss: 1.177307 2025-01-16 01:46:30,662 - INFO - step 11706, loss: 1.516905, best loss: 1.177307 2025-01-16 01:46:30,813 - INFO - step 11707, loss: 1.556288, best loss: 1.177307 2025-01-16 01:46:30,963 - INFO - step 11708, loss: 1.587692, best loss: 1.177307 2025-01-16 01:46:31,113 - INFO - step 11709, loss: 1.695413, best loss: 1.177307 2025-01-16 01:46:31,263 - INFO - step 11710, loss: 1.521506, best loss: 1.177307 2025-01-16 01:46:31,413 - INFO - step 11711, loss: 
1.542257, best loss: 1.177307 2025-01-16 01:46:31,563 - INFO - step 11712, loss: 1.437273, best loss: 1.177307 2025-01-16 01:46:31,714 - INFO - step 11713, loss: 1.327006, best loss: 1.177307 2025-01-16 01:46:31,864 - INFO - step 11714, loss: 1.614034, best loss: 1.177307 2025-01-16 01:46:32,014 - INFO - step 11715, loss: 1.515844, best loss: 1.177307 2025-01-16 01:46:32,164 - INFO - step 11716, loss: 1.787406, best loss: 1.177307 2025-01-16 01:46:32,314 - INFO - step 11717, loss: 1.633796, best loss: 1.177307 2025-01-16 01:46:32,464 - INFO - step 11718, loss: 1.622265, best loss: 1.177307 2025-01-16 01:46:32,614 - INFO - step 11719, loss: 1.729970, best loss: 1.177307 2025-01-16 01:46:32,764 - INFO - step 11720, loss: 1.489838, best loss: 1.177307 2025-01-16 01:46:32,914 - INFO - step 11721, loss: 1.381944, best loss: 1.177307 2025-01-16 01:46:33,065 - INFO - step 11722, loss: 1.469083, best loss: 1.177307 2025-01-16 01:46:33,215 - INFO - step 11723, loss: 1.560886, best loss: 1.177307 2025-01-16 01:46:33,365 - INFO - step 11724, loss: 1.601117, best loss: 1.177307 2025-01-16 01:46:33,515 - INFO - step 11725, loss: 1.493007, best loss: 1.177307 2025-01-16 01:46:33,665 - INFO - step 11726, loss: 1.794189, best loss: 1.177307 2025-01-16 01:46:33,815 - INFO - step 11727, loss: 1.794930, best loss: 1.177307 2025-01-16 01:46:33,966 - INFO - step 11728, loss: 1.697395, best loss: 1.177307 2025-01-16 01:46:34,116 - INFO - step 11729, loss: 1.654397, best loss: 1.177307 2025-01-16 01:46:34,266 - INFO - step 11730, loss: 1.810199, best loss: 1.177307 2025-01-16 01:46:34,416 - INFO - step 11731, loss: 1.555521, best loss: 1.177307 2025-01-16 01:46:34,566 - INFO - step 11732, loss: 1.709886, best loss: 1.177307 2025-01-16 01:46:34,717 - INFO - step 11733, loss: 1.762915, best loss: 1.177307 2025-01-16 01:46:34,867 - INFO - step 11734, loss: 1.575051, best loss: 1.177307 2025-01-16 01:46:35,017 - INFO - step 11735, loss: 1.566093, best loss: 1.177307 2025-01-16 01:46:35,167 - 
INFO - step 11736, loss: 1.749756, best loss: 1.177307 2025-01-16 01:46:35,317 - INFO - step 11737, loss: 1.544300, best loss: 1.177307 2025-01-16 01:46:35,467 - INFO - step 11738, loss: 1.187274, best loss: 1.177307 2025-01-16 01:46:35,617 - INFO - step 11739, loss: 1.641957, best loss: 1.177307 2025-01-16 01:46:35,767 - INFO - step 11740, loss: 1.537672, best loss: 1.177307 2025-01-16 01:46:35,917 - INFO - step 11741, loss: 1.620624, best loss: 1.177307 2025-01-16 01:46:36,067 - INFO - step 11742, loss: 1.677582, best loss: 1.177307 2025-01-16 01:46:36,217 - INFO - step 11743, loss: 1.549975, best loss: 1.177307 2025-01-16 01:46:36,367 - INFO - step 11744, loss: 1.560079, best loss: 1.177307 2025-01-16 01:46:36,518 - INFO - step 11745, loss: 1.638050, best loss: 1.177307 2025-01-16 01:46:36,668 - INFO - step 11746, loss: 1.624307, best loss: 1.177307 2025-01-16 01:46:36,818 - INFO - step 11747, loss: 1.529449, best loss: 1.177307 2025-01-16 01:46:36,969 - INFO - step 11748, loss: 1.504047, best loss: 1.177307 2025-01-16 01:46:37,119 - INFO - step 11749, loss: 1.380235, best loss: 1.177307 2025-01-16 01:46:37,269 - INFO - step 11750, loss: 1.511466, best loss: 1.177307 2025-01-16 01:46:37,419 - INFO - step 11751, loss: 1.494606, best loss: 1.177307 2025-01-16 01:46:37,570 - INFO - step 11752, loss: 1.580523, best loss: 1.177307 2025-01-16 01:46:37,720 - INFO - step 11753, loss: 1.652183, best loss: 1.177307 2025-01-16 01:46:37,870 - INFO - step 11754, loss: 1.576795, best loss: 1.177307 2025-01-16 01:46:38,020 - INFO - step 11755, loss: 1.421971, best loss: 1.177307 2025-01-16 01:46:38,170 - INFO - step 11756, loss: 1.480553, best loss: 1.177307 2025-01-16 01:46:38,320 - INFO - step 11757, loss: 1.464082, best loss: 1.177307 2025-01-16 01:46:38,470 - INFO - step 11758, loss: 1.569961, best loss: 1.177307 2025-01-16 01:46:38,620 - INFO - step 11759, loss: 1.622667, best loss: 1.177307 2025-01-16 01:46:38,770 - INFO - step 11760, loss: 1.582933, best loss: 1.177307 
2025-01-16 01:46:38,920 - INFO - step 11761, loss: 1.374539, best loss: 1.177307 2025-01-16 01:46:39,070 - INFO - step 11762, loss: 1.422609, best loss: 1.177307 2025-01-16 01:46:39,220 - INFO - step 11763, loss: 1.493896, best loss: 1.177307 2025-01-16 01:46:39,370 - INFO - step 11764, loss: 1.561786, best loss: 1.177307 2025-01-16 01:46:39,520 - INFO - step 11765, loss: 1.616638, best loss: 1.177307 2025-01-16 01:46:39,670 - INFO - step 11766, loss: 1.668045, best loss: 1.177307 2025-01-16 01:46:39,821 - INFO - step 11767, loss: 1.649060, best loss: 1.177307 2025-01-16 01:46:39,971 - INFO - step 11768, loss: 1.501719, best loss: 1.177307 2025-01-16 01:46:40,121 - INFO - step 11769, loss: 1.518440, best loss: 1.177307 2025-01-16 01:46:40,271 - INFO - step 11770, loss: 1.672620, best loss: 1.177307 2025-01-16 01:46:40,421 - INFO - step 11771, loss: 1.664502, best loss: 1.177307 2025-01-16 01:46:40,572 - INFO - step 11772, loss: 1.719421, best loss: 1.177307 2025-01-16 01:46:40,722 - INFO - step 11773, loss: 1.745738, best loss: 1.177307 2025-01-16 01:46:40,872 - INFO - step 11774, loss: 1.859355, best loss: 1.177307 2025-01-16 01:46:41,022 - INFO - step 11775, loss: 1.540148, best loss: 1.177307 2025-01-16 01:46:41,172 - INFO - step 11776, loss: 1.672545, best loss: 1.177307 2025-01-16 01:46:41,322 - INFO - step 11777, loss: 1.471426, best loss: 1.177307 2025-01-16 01:46:41,472 - INFO - step 11778, loss: 1.328569, best loss: 1.177307 2025-01-16 01:46:41,622 - INFO - step 11779, loss: 1.622531, best loss: 1.177307 2025-01-16 01:46:41,772 - INFO - step 11780, loss: 1.632587, best loss: 1.177307 2025-01-16 01:46:41,922 - INFO - step 11781, loss: 1.316655, best loss: 1.177307 2025-01-16 01:46:42,073 - INFO - step 11782, loss: 1.390892, best loss: 1.177307 2025-01-16 01:46:42,222 - INFO - step 11783, loss: 1.678151, best loss: 1.177307 2025-01-16 01:46:42,373 - INFO - step 11784, loss: 1.706010, best loss: 1.177307 2025-01-16 01:46:42,523 - INFO - step 11785, loss: 
1.526653, best loss: 1.177307 2025-01-16 01:46:42,673 - INFO - step 11786, loss: 1.672518, best loss: 1.177307 2025-01-16 01:46:42,823 - INFO - step 11787, loss: 1.456985, best loss: 1.177307 2025-01-16 01:46:42,973 - INFO - step 11788, loss: 1.344365, best loss: 1.177307 2025-01-16 01:46:43,124 - INFO - step 11789, loss: 1.684681, best loss: 1.177307 2025-01-16 01:46:43,274 - INFO - step 11790, loss: 1.863894, best loss: 1.177307 2025-01-16 01:46:43,424 - INFO - step 11791, loss: 1.813352, best loss: 1.177307 2025-01-16 01:46:43,574 - INFO - step 11792, loss: 1.597890, best loss: 1.177307 2025-01-16 01:46:43,724 - INFO - step 11793, loss: 1.569715, best loss: 1.177307 2025-01-16 01:46:43,874 - INFO - step 11794, loss: 1.673236, best loss: 1.177307 2025-01-16 01:46:44,024 - INFO - step 11795, loss: 1.548271, best loss: 1.177307 2025-01-16 01:46:44,174 - INFO - step 11796, loss: 1.503892, best loss: 1.177307 2025-01-16 01:46:44,324 - INFO - step 11797, loss: 1.661550, best loss: 1.177307 2025-01-16 01:46:44,474 - INFO - step 11798, loss: 1.382528, best loss: 1.177307 2025-01-16 01:46:44,625 - INFO - step 11799, loss: 1.258265, best loss: 1.177307 2025-01-16 01:46:44,775 - INFO - step 11800, loss: 1.474692, best loss: 1.177307 2025-01-16 01:46:44,925 - INFO - step 11801, loss: 1.499305, best loss: 1.177307 2025-01-16 01:46:45,075 - INFO - step 11802, loss: 1.541569, best loss: 1.177307 2025-01-16 01:46:45,225 - INFO - step 11803, loss: 1.284311, best loss: 1.177307 2025-01-16 01:46:45,375 - INFO - step 11804, loss: 1.285549, best loss: 1.177307 2025-01-16 01:46:45,525 - INFO - step 11805, loss: 1.179965, best loss: 1.177307 2025-01-16 01:46:45,676 - INFO - step 11806, loss: 1.383474, best loss: 1.177307 2025-01-16 01:46:45,826 - INFO - step 11807, loss: 1.521532, best loss: 1.177307 2025-01-16 01:46:45,976 - INFO - step 11808, loss: 1.610463, best loss: 1.177307 2025-01-16 01:46:46,126 - INFO - step 11809, loss: 1.695516, best loss: 1.177307 2025-01-16 01:46:46,276 - 
INFO - step 11810, loss: 1.552855, best loss: 1.177307 2025-01-16 01:46:46,426 - INFO - step 11811, loss: 1.469309, best loss: 1.177307 2025-01-16 01:46:46,576 - INFO - step 11812, loss: 1.549275, best loss: 1.177307 2025-01-16 01:46:46,726 - INFO - step 11813, loss: 1.364733, best loss: 1.177307 2025-01-16 01:46:46,876 - INFO - step 11814, loss: 1.442558, best loss: 1.177307 2025-01-16 01:46:47,026 - INFO - step 11815, loss: 1.426313, best loss: 1.177307 2025-01-16 01:46:47,177 - INFO - step 11816, loss: 1.326092, best loss: 1.177307 2025-01-16 01:46:47,327 - INFO - step 11817, loss: 1.284489, best loss: 1.177307 2025-01-16 01:46:47,477 - INFO - step 11818, loss: 1.392677, best loss: 1.177307 2025-01-16 01:46:47,627 - INFO - step 11819, loss: 1.536595, best loss: 1.177307 2025-01-16 01:46:47,777 - INFO - step 11820, loss: 1.395099, best loss: 1.177307 2025-01-16 01:46:47,927 - INFO - step 11821, loss: 1.393288, best loss: 1.177307 2025-01-16 01:46:48,077 - INFO - step 11822, loss: 1.417672, best loss: 1.177307 2025-01-16 01:46:48,227 - INFO - step 11823, loss: 1.359257, best loss: 1.177307 2025-01-16 01:46:48,378 - INFO - step 11824, loss: 1.469412, best loss: 1.177307 2025-01-16 01:46:48,528 - INFO - step 11825, loss: 1.636906, best loss: 1.177307 2025-01-16 01:46:48,678 - INFO - step 11826, loss: 1.419789, best loss: 1.177307 2025-01-16 01:46:48,828 - INFO - step 11827, loss: 1.510820, best loss: 1.177307 2025-01-16 01:46:48,978 - INFO - step 11828, loss: 1.490862, best loss: 1.177307 2025-01-16 01:46:49,129 - INFO - step 11829, loss: 1.560809, best loss: 1.177307 2025-01-16 01:46:49,279 - INFO - step 11830, loss: 1.346113, best loss: 1.177307 2025-01-16 01:46:49,429 - INFO - step 11831, loss: 1.373280, best loss: 1.177307 2025-01-16 01:46:49,580 - INFO - step 11832, loss: 1.432551, best loss: 1.177307 2025-01-16 01:46:49,730 - INFO - step 11833, loss: 1.488113, best loss: 1.177307 2025-01-16 01:46:49,880 - INFO - step 11834, loss: 1.728238, best loss: 1.177307 
2025-01-16 01:46:50,031 - INFO - step 11835, loss: 1.550895, best loss: 1.177307 2025-01-16 01:46:50,181 - INFO - step 11836, loss: 1.596920, best loss: 1.177307 2025-01-16 01:46:50,331 - INFO - step 11837, loss: 1.507962, best loss: 1.177307 2025-01-16 01:46:50,481 - INFO - step 11838, loss: 1.598426, best loss: 1.177307 2025-01-16 01:46:50,631 - INFO - step 11839, loss: 1.527663, best loss: 1.177307 2025-01-16 01:46:50,781 - INFO - step 11840, loss: 1.451503, best loss: 1.177307 2025-01-16 01:46:50,932 - INFO - step 11841, loss: 1.512660, best loss: 1.177307 2025-01-16 01:46:51,082 - INFO - step 11842, loss: 1.541207, best loss: 1.177307 2025-01-16 01:46:51,232 - INFO - step 11843, loss: 1.508255, best loss: 1.177307 2025-01-16 01:46:51,382 - INFO - step 11844, loss: 1.564621, best loss: 1.177307 2025-01-16 01:46:51,532 - INFO - step 11845, loss: 1.515942, best loss: 1.177307 2025-01-16 01:46:51,682 - INFO - step 11846, loss: 1.611574, best loss: 1.177307 2025-01-16 01:46:51,832 - INFO - step 11847, loss: 1.253927, best loss: 1.177307 2025-01-16 01:46:51,983 - INFO - step 11848, loss: 1.636836, best loss: 1.177307 2025-01-16 01:46:52,133 - INFO - step 11849, loss: 1.553811, best loss: 1.177307 2025-01-16 01:46:52,283 - INFO - step 11850, loss: 1.489136, best loss: 1.177307 2025-01-16 01:46:52,434 - INFO - step 11851, loss: 1.522288, best loss: 1.177307 2025-01-16 01:46:52,584 - INFO - step 11852, loss: 1.457299, best loss: 1.177307 2025-01-16 01:46:52,734 - INFO - step 11853, loss: 1.467878, best loss: 1.177307 2025-01-16 01:46:52,885 - INFO - step 11854, loss: 1.462069, best loss: 1.177307 2025-01-16 01:46:53,035 - INFO - step 11855, loss: 1.426421, best loss: 1.177307 2025-01-16 01:46:53,185 - INFO - step 11856, loss: 1.367646, best loss: 1.177307 2025-01-16 01:46:53,335 - INFO - step 11857, loss: 1.646667, best loss: 1.177307 2025-01-16 01:46:53,485 - INFO - step 11858, loss: 1.488811, best loss: 1.177307 2025-01-16 01:46:53,635 - INFO - step 11859, loss: 
1.509467, best loss: 1.177307 2025-01-16 01:46:53,786 - INFO - step 11860, loss: 1.393804, best loss: 1.177307 2025-01-16 01:46:53,936 - INFO - step 11861, loss: 1.354310, best loss: 1.177307 2025-01-16 01:46:54,086 - INFO - step 11862, loss: 1.535292, best loss: 1.177307 2025-01-16 01:46:54,236 - INFO - step 11863, loss: 1.391963, best loss: 1.177307 2025-01-16 01:46:54,386 - INFO - step 11864, loss: 1.440228, best loss: 1.177307 2025-01-16 01:46:54,536 - INFO - step 11865, loss: 1.216058, best loss: 1.177307 2025-01-16 01:46:54,687 - INFO - step 11866, loss: 1.236966, best loss: 1.177307 2025-01-16 01:46:57,795 - INFO - step 11867, loss: 1.153915, best loss: 1.153915 2025-01-16 01:46:57,957 - INFO - step 11868, loss: 1.573609, best loss: 1.153915 2025-01-16 01:46:58,114 - INFO - step 11869, loss: 1.492063, best loss: 1.153915 2025-01-16 01:46:58,265 - INFO - step 11870, loss: 1.584199, best loss: 1.153915 2025-01-16 01:46:58,415 - INFO - step 11871, loss: 1.628665, best loss: 1.153915 2025-01-16 01:46:58,566 - INFO - step 11872, loss: 1.503974, best loss: 1.153915 2025-01-16 01:46:58,716 - INFO - step 11873, loss: 1.348213, best loss: 1.153915 2025-01-16 01:46:58,866 - INFO - step 11874, loss: 1.380289, best loss: 1.153915 2025-01-16 01:46:59,016 - INFO - step 11875, loss: 1.566857, best loss: 1.153915 2025-01-16 01:46:59,166 - INFO - step 11876, loss: 1.532459, best loss: 1.153915 2025-01-16 01:47:02,789 - INFO - step 11877, loss: 1.110608, best loss: 1.110608 2025-01-16 01:47:02,939 - INFO - step 11878, loss: 1.336762, best loss: 1.110608 2025-01-16 01:47:03,090 - INFO - step 11879, loss: 1.202428, best loss: 1.110608 2025-01-16 01:47:03,240 - INFO - step 11880, loss: 1.402489, best loss: 1.110608 2025-01-16 01:47:03,390 - INFO - step 11881, loss: 1.536889, best loss: 1.110608 2025-01-16 01:47:03,540 - INFO - step 11882, loss: 1.438339, best loss: 1.110608 2025-01-16 01:47:03,690 - INFO - step 11883, loss: 1.551227, best loss: 1.110608 2025-01-16 01:47:03,840 - 
INFO - step 11884, loss: 1.488104, best loss: 1.110608 2025-01-16 01:47:03,990 - INFO - step 11885, loss: 1.357958, best loss: 1.110608 2025-01-16 01:47:04,141 - INFO - step 11886, loss: 1.580527, best loss: 1.110608 2025-01-16 01:47:04,291 - INFO - step 11887, loss: 1.595766, best loss: 1.110608 2025-01-16 01:47:04,441 - INFO - step 11888, loss: 1.647109, best loss: 1.110608 2025-01-16 01:47:04,591 - INFO - step 11889, loss: 1.587727, best loss: 1.110608 2025-01-16 01:47:04,742 - INFO - step 11890, loss: 1.388571, best loss: 1.110608 2025-01-16 01:47:04,892 - INFO - step 11891, loss: 1.400869, best loss: 1.110608 2025-01-16 01:47:05,042 - INFO - step 11892, loss: 1.278681, best loss: 1.110608 2025-01-16 01:47:05,192 - INFO - step 11893, loss: 1.559697, best loss: 1.110608 2025-01-16 01:47:05,342 - INFO - step 11894, loss: 1.660793, best loss: 1.110608 2025-01-16 01:47:05,492 - INFO - step 11895, loss: 1.684520, best loss: 1.110608 2025-01-16 01:47:05,643 - INFO - step 11896, loss: 1.673771, best loss: 1.110608 2025-01-16 01:47:05,793 - INFO - step 11897, loss: 1.467612, best loss: 1.110608 2025-01-16 01:47:05,943 - INFO - step 11898, loss: 1.523970, best loss: 1.110608 2025-01-16 01:47:06,093 - INFO - step 11899, loss: 1.473338, best loss: 1.110608 2025-01-16 01:47:06,243 - INFO - step 11900, loss: 1.437213, best loss: 1.110608 2025-01-16 01:47:06,393 - INFO - step 11901, loss: 1.708144, best loss: 1.110608 2025-01-16 01:47:06,543 - INFO - step 11902, loss: 1.303937, best loss: 1.110608 2025-01-16 01:47:06,693 - INFO - step 11903, loss: 1.397635, best loss: 1.110608 2025-01-16 01:47:06,843 - INFO - step 11904, loss: 1.519896, best loss: 1.110608 2025-01-16 01:47:06,993 - INFO - step 11905, loss: 1.663252, best loss: 1.110608 2025-01-16 01:47:07,143 - INFO - step 11906, loss: 1.542623, best loss: 1.110608 2025-01-16 01:47:07,293 - INFO - step 11907, loss: 1.327191, best loss: 1.110608 2025-01-16 01:47:07,443 - INFO - step 11908, loss: 1.561044, best loss: 1.110608 
2025-01-16 01:47:07,594 - INFO - step 11909, loss: 1.566345, best loss: 1.110608 2025-01-16 01:47:07,744 - INFO - step 11910, loss: 1.300841, best loss: 1.110608 2025-01-16 01:47:07,894 - INFO - step 11911, loss: 1.407313, best loss: 1.110608 2025-01-16 01:47:08,044 - INFO - step 11912, loss: 1.455212, best loss: 1.110608 2025-01-16 01:47:08,194 - INFO - step 11913, loss: 1.590092, best loss: 1.110608 2025-01-16 01:47:08,344 - INFO - step 11914, loss: 1.290721, best loss: 1.110608 2025-01-16 01:47:08,494 - INFO - step 11915, loss: 1.315249, best loss: 1.110608 2025-01-16 01:47:08,644 - INFO - step 11916, loss: 1.678445, best loss: 1.110608 2025-01-16 01:47:08,794 - INFO - step 11917, loss: 1.613646, best loss: 1.110608 2025-01-16 01:47:08,944 - INFO - step 11918, loss: 1.539758, best loss: 1.110608 2025-01-16 01:47:09,095 - INFO - step 11919, loss: 1.381797, best loss: 1.110608 2025-01-16 01:47:09,245 - INFO - step 11920, loss: 1.618505, best loss: 1.110608 2025-01-16 01:47:09,395 - INFO - step 11921, loss: 1.572600, best loss: 1.110608 2025-01-16 01:47:09,546 - INFO - step 11922, loss: 1.580179, best loss: 1.110608 2025-01-16 01:47:09,696 - INFO - step 11923, loss: 1.378113, best loss: 1.110608 2025-01-16 01:47:09,846 - INFO - step 11924, loss: 1.477382, best loss: 1.110608 2025-01-16 01:47:09,996 - INFO - step 11925, loss: 1.483754, best loss: 1.110608 2025-01-16 01:47:10,146 - INFO - step 11926, loss: 1.669686, best loss: 1.110608 2025-01-16 01:47:10,296 - INFO - step 11927, loss: 1.571462, best loss: 1.110608 2025-01-16 01:47:10,446 - INFO - step 11928, loss: 1.526724, best loss: 1.110608 2025-01-16 01:47:10,596 - INFO - step 11929, loss: 1.267006, best loss: 1.110608 2025-01-16 01:47:10,746 - INFO - step 11930, loss: 1.366982, best loss: 1.110608 2025-01-16 01:47:10,896 - INFO - step 11931, loss: 1.517155, best loss: 1.110608 2025-01-16 01:47:11,046 - INFO - step 11932, loss: 1.528512, best loss: 1.110608 2025-01-16 01:47:11,196 - INFO - step 11933, loss: 
1.463149, best loss: 1.110608 2025-01-16 01:47:11,346 - INFO - step 11934, loss: 1.570548, best loss: 1.110608 2025-01-16 01:47:11,496 - INFO - step 11935, loss: 1.406696, best loss: 1.110608 2025-01-16 01:47:11,647 - INFO - step 11936, loss: 1.483285, best loss: 1.110608 2025-01-16 01:47:11,797 - INFO - step 11937, loss: 1.460413, best loss: 1.110608 2025-01-16 01:47:11,947 - INFO - step 11938, loss: 1.332436, best loss: 1.110608 2025-01-16 01:47:12,097 - INFO - step 11939, loss: 1.552488, best loss: 1.110608 2025-01-16 01:47:12,247 - INFO - step 11940, loss: 1.517369, best loss: 1.110608 2025-01-16 01:47:12,398 - INFO - step 11941, loss: 1.691848, best loss: 1.110608 2025-01-16 01:47:12,548 - INFO - step 11942, loss: 1.576003, best loss: 1.110608 2025-01-16 01:47:12,698 - INFO - step 11943, loss: 1.593203, best loss: 1.110608 2025-01-16 01:47:12,848 - INFO - step 11944, loss: 1.540782, best loss: 1.110608 2025-01-16 01:47:12,998 - INFO - step 11945, loss: 1.362586, best loss: 1.110608 2025-01-16 01:47:13,148 - INFO - step 11946, loss: 1.570830, best loss: 1.110608 2025-01-16 01:47:13,298 - INFO - step 11947, loss: 1.394846, best loss: 1.110608 2025-01-16 01:47:13,449 - INFO - step 11948, loss: 1.426613, best loss: 1.110608 2025-01-16 01:47:13,599 - INFO - step 11949, loss: 1.501501, best loss: 1.110608 2025-01-16 01:47:13,749 - INFO - step 11950, loss: 1.509902, best loss: 1.110608 2025-01-16 01:47:13,899 - INFO - step 11951, loss: 1.450625, best loss: 1.110608 2025-01-16 01:47:14,049 - INFO - step 11952, loss: 1.495457, best loss: 1.110608 2025-01-16 01:47:14,199 - INFO - step 11953, loss: 1.530086, best loss: 1.110608 2025-01-16 01:47:14,349 - INFO - step 11954, loss: 1.591162, best loss: 1.110608 2025-01-16 01:47:14,499 - INFO - step 11955, loss: 1.592218, best loss: 1.110608 2025-01-16 01:47:14,649 - INFO - step 11956, loss: 1.461245, best loss: 1.110608 2025-01-16 01:47:14,799 - INFO - step 11957, loss: 1.590392, best loss: 1.110608 2025-01-16 01:47:14,949 - 
INFO - step 11958, loss: 1.544637, best loss: 1.110608 2025-01-16 01:47:15,099 - INFO - step 11959, loss: 1.355526, best loss: 1.110608 2025-01-16 01:47:15,249 - INFO - step 11960, loss: 1.584534, best loss: 1.110608 2025-01-16 01:47:15,399 - INFO - step 11961, loss: 1.386822, best loss: 1.110608 2025-01-16 01:47:15,549 - INFO - step 11962, loss: 1.378490, best loss: 1.110608 2025-01-16 01:47:15,699 - INFO - step 11963, loss: 1.344341, best loss: 1.110608 2025-01-16 01:47:15,850 - INFO - step 11964, loss: 1.388887, best loss: 1.110608 2025-01-16 01:47:16,000 - INFO - step 11965, loss: 1.482253, best loss: 1.110608 2025-01-16 01:47:16,150 - INFO - step 11966, loss: 1.332980, best loss: 1.110608 2025-01-16 01:47:16,300 - INFO - step 11967, loss: 1.438945, best loss: 1.110608 2025-01-16 01:47:16,450 - INFO - step 11968, loss: 1.419692, best loss: 1.110608 2025-01-16 01:47:16,600 - INFO - step 11969, loss: 1.565232, best loss: 1.110608 2025-01-16 01:47:16,750 - INFO - step 11970, loss: 1.341367, best loss: 1.110608 2025-01-16 01:47:16,900 - INFO - step 11971, loss: 1.500590, best loss: 1.110608 2025-01-16 01:47:17,050 - INFO - step 11972, loss: 1.451891, best loss: 1.110608 2025-01-16 01:47:17,200 - INFO - step 11973, loss: 1.573981, best loss: 1.110608 2025-01-16 01:47:17,350 - INFO - step 11974, loss: 1.610977, best loss: 1.110608 2025-01-16 01:47:17,500 - INFO - step 11975, loss: 1.658429, best loss: 1.110608 2025-01-16 01:47:17,650 - INFO - step 11976, loss: 1.574575, best loss: 1.110608 2025-01-16 01:47:17,800 - INFO - step 11977, loss: 1.425355, best loss: 1.110608 2025-01-16 01:47:17,950 - INFO - step 11978, loss: 1.513317, best loss: 1.110608 2025-01-16 01:47:18,100 - INFO - step 11979, loss: 1.598875, best loss: 1.110608 2025-01-16 01:47:18,250 - INFO - step 11980, loss: 1.460570, best loss: 1.110608 2025-01-16 01:47:18,401 - INFO - step 11981, loss: 1.619695, best loss: 1.110608 2025-01-16 01:47:18,551 - INFO - step 11982, loss: 1.459253, best loss: 1.110608 
2025-01-16 01:47:18,701 - INFO - step 11983, loss: 1.703829, best loss: 1.110608 2025-01-16 01:47:18,851 - INFO - step 11984, loss: 1.445671, best loss: 1.110608 2025-01-16 01:47:19,001 - INFO - step 11985, loss: 1.407341, best loss: 1.110608 2025-01-16 01:47:19,151 - INFO - step 11986, loss: 1.493263, best loss: 1.110608 2025-01-16 01:47:19,301 - INFO - step 11987, loss: 1.457513, best loss: 1.110608 2025-01-16 01:47:19,452 - INFO - step 11988, loss: 1.423837, best loss: 1.110608 2025-01-16 01:47:19,603 - INFO - step 11989, loss: 1.642751, best loss: 1.110608 2025-01-16 01:47:19,753 - INFO - step 11990, loss: 1.477995, best loss: 1.110608 2025-01-16 01:47:19,903 - INFO - step 11991, loss: 1.602643, best loss: 1.110608 2025-01-16 01:47:20,053 - INFO - step 11992, loss: 1.494100, best loss: 1.110608 2025-01-16 01:47:20,203 - INFO - step 11993, loss: 1.472042, best loss: 1.110608 2025-01-16 01:47:20,352 - INFO - step 11994, loss: 1.659194, best loss: 1.110608 2025-01-16 01:47:20,502 - INFO - step 11995, loss: 1.467959, best loss: 1.110608 2025-01-16 01:47:20,652 - INFO - step 11996, loss: 1.598985, best loss: 1.110608 2025-01-16 01:47:20,802 - INFO - step 11997, loss: 1.534191, best loss: 1.110608 2025-01-16 01:47:20,952 - INFO - step 11998, loss: 1.439967, best loss: 1.110608 2025-01-16 01:47:21,102 - INFO - step 11999, loss: 1.578910, best loss: 1.110608 2025-01-16 01:47:21,252 - INFO - step 12000, loss: 1.573571, best loss: 1.110608 2025-01-16 01:47:21,403 - INFO - step 12001, loss: 1.458480, best loss: 1.110608 2025-01-16 01:47:21,553 - INFO - step 12002, loss: 1.531108, best loss: 1.110608 2025-01-16 01:47:21,703 - INFO - step 12003, loss: 1.369634, best loss: 1.110608 2025-01-16 01:47:21,853 - INFO - step 12004, loss: 1.313114, best loss: 1.110608 2025-01-16 01:47:22,003 - INFO - step 12005, loss: 1.351027, best loss: 1.110608 2025-01-16 01:47:22,153 - INFO - step 12006, loss: 1.454175, best loss: 1.110608 2025-01-16 01:47:22,303 - INFO - step 12007, loss: 
1.701854, best loss: 1.110608 2025-01-16 01:47:22,453 - INFO - step 12008, loss: 1.374835, best loss: 1.110608 2025-01-16 01:47:22,603 - INFO - step 12009, loss: 1.204533, best loss: 1.110608 2025-01-16 01:47:22,753 - INFO - step 12010, loss: 1.580058, best loss: 1.110608 2025-01-16 01:47:22,903 - INFO - step 12011, loss: 1.521898, best loss: 1.110608 2025-01-16 01:47:23,053 - INFO - step 12012, loss: 1.675251, best loss: 1.110608 2025-01-16 01:47:23,203 - INFO - step 12013, loss: 1.331279, best loss: 1.110608 2025-01-16 01:47:23,354 - INFO - step 12014, loss: 1.550189, best loss: 1.110608 2025-01-16 01:47:23,504 - INFO - step 12015, loss: 1.389785, best loss: 1.110608 2025-01-16 01:47:23,654 - INFO - step 12016, loss: 1.359784, best loss: 1.110608 2025-01-16 01:47:23,804 - INFO - step 12017, loss: 1.481197, best loss: 1.110608 2025-01-16 01:47:23,954 - INFO - step 12018, loss: 1.359545, best loss: 1.110608 2025-01-16 01:47:24,104 - INFO - step 12019, loss: 1.520596, best loss: 1.110608 2025-01-16 01:47:24,254 - INFO - step 12020, loss: 1.475698, best loss: 1.110608 2025-01-16 01:47:24,404 - INFO - step 12021, loss: 1.449024, best loss: 1.110608 2025-01-16 01:47:24,555 - INFO - step 12022, loss: 1.675360, best loss: 1.110608 2025-01-16 01:47:24,705 - INFO - step 12023, loss: 1.326359, best loss: 1.110608 2025-01-16 01:47:24,855 - INFO - step 12024, loss: 1.152588, best loss: 1.110608 2025-01-16 01:47:25,005 - INFO - step 12025, loss: 1.465250, best loss: 1.110608 2025-01-16 01:47:25,155 - INFO - step 12026, loss: 1.657244, best loss: 1.110608 2025-01-16 01:47:25,305 - INFO - step 12027, loss: 1.469873, best loss: 1.110608 2025-01-16 01:47:25,455 - INFO - step 12028, loss: 1.429346, best loss: 1.110608 2025-01-16 01:47:25,605 - INFO - step 12029, loss: 1.456338, best loss: 1.110608 2025-01-16 01:47:25,755 - INFO - step 12030, loss: 1.578813, best loss: 1.110608 2025-01-16 01:47:25,905 - INFO - step 12031, loss: 1.612376, best loss: 1.110608 2025-01-16 01:47:26,056 - 
INFO - step 12032, loss: 1.486351, best loss: 1.110608 2025-01-16 01:47:26,206 - INFO - step 12033, loss: 1.380210, best loss: 1.110608 2025-01-16 01:47:26,356 - INFO - step 12034, loss: 1.623289, best loss: 1.110608 2025-01-16 01:47:26,506 - INFO - step 12035, loss: 1.521829, best loss: 1.110608 2025-01-16 01:47:26,656 - INFO - step 12036, loss: 1.431110, best loss: 1.110608 2025-01-16 01:47:26,806 - INFO - step 12037, loss: 1.492039, best loss: 1.110608 2025-01-16 01:47:26,956 - INFO - step 12038, loss: 1.457776, best loss: 1.110608 2025-01-16 01:47:27,106 - INFO - step 12039, loss: 1.526792, best loss: 1.110608 2025-01-16 01:47:27,257 - INFO - step 12040, loss: 1.458222, best loss: 1.110608 2025-01-16 01:47:27,407 - INFO - step 12041, loss: 1.490626, best loss: 1.110608 2025-01-16 01:47:27,557 - INFO - step 12042, loss: 1.300589, best loss: 1.110608 2025-01-16 01:47:27,707 - INFO - step 12043, loss: 1.250255, best loss: 1.110608 2025-01-16 01:47:27,857 - INFO - step 12044, loss: 1.496746, best loss: 1.110608 2025-01-16 01:47:28,007 - INFO - step 12045, loss: 1.471512, best loss: 1.110608 2025-01-16 01:47:28,157 - INFO - step 12046, loss: 1.723722, best loss: 1.110608 2025-01-16 01:47:28,307 - INFO - step 12047, loss: 1.470725, best loss: 1.110608 2025-01-16 01:47:28,457 - INFO - step 12048, loss: 1.541106, best loss: 1.110608 2025-01-16 01:47:28,607 - INFO - step 12049, loss: 1.676923, best loss: 1.110608 2025-01-16 01:47:28,757 - INFO - step 12050, loss: 1.448672, best loss: 1.110608 2025-01-16 01:47:28,907 - INFO - step 12051, loss: 1.306304, best loss: 1.110608 2025-01-16 01:47:29,057 - INFO - step 12052, loss: 1.413075, best loss: 1.110608 2025-01-16 01:47:29,207 - INFO - step 12053, loss: 1.517826, best loss: 1.110608 2025-01-16 01:47:29,357 - INFO - step 12054, loss: 1.515192, best loss: 1.110608 2025-01-16 01:47:29,508 - INFO - step 12055, loss: 1.416375, best loss: 1.110608 2025-01-16 01:47:29,658 - INFO - step 12056, loss: 1.759792, best loss: 1.110608 
2025-01-16 01:47:29,808 - INFO - step 12057, loss: 1.699123, best loss: 1.110608 2025-01-16 01:47:29,957 - INFO - step 12058, loss: 1.603595, best loss: 1.110608 2025-01-16 01:47:30,107 - INFO - step 12059, loss: 1.575904, best loss: 1.110608 2025-01-16 01:47:30,257 - INFO - step 12060, loss: 1.778181, best loss: 1.110608 2025-01-16 01:47:30,407 - INFO - step 12061, loss: 1.464782, best loss: 1.110608 2025-01-16 01:47:30,557 - INFO - step 12062, loss: 1.656659, best loss: 1.110608 2025-01-16 01:47:30,707 - INFO - step 12063, loss: 1.665015, best loss: 1.110608 2025-01-16 01:47:30,857 - INFO - step 12064, loss: 1.437076, best loss: 1.110608 2025-01-16 01:47:31,007 - INFO - step 12065, loss: 1.434265, best loss: 1.110608 2025-01-16 01:47:31,158 - INFO - step 12066, loss: 1.619175, best loss: 1.110608 2025-01-16 01:47:31,309 - INFO - step 12067, loss: 1.507505, best loss: 1.110608 2025-01-16 01:47:31,459 - INFO - step 12068, loss: 1.127175, best loss: 1.110608 2025-01-16 01:47:31,609 - INFO - step 12069, loss: 1.574944, best loss: 1.110608 2025-01-16 01:47:31,759 - INFO - step 12070, loss: 1.457506, best loss: 1.110608 2025-01-16 01:47:31,909 - INFO - step 12071, loss: 1.444025, best loss: 1.110608 2025-01-16 01:47:32,059 - INFO - step 12072, loss: 1.708272, best loss: 1.110608 2025-01-16 01:47:32,209 - INFO - step 12073, loss: 1.469297, best loss: 1.110608 2025-01-16 01:47:32,359 - INFO - step 12074, loss: 1.462187, best loss: 1.110608 2025-01-16 01:47:32,509 - INFO - step 12075, loss: 1.626075, best loss: 1.110608 2025-01-16 01:47:32,660 - INFO - step 12076, loss: 1.493564, best loss: 1.110608 2025-01-16 01:47:32,810 - INFO - step 12077, loss: 1.393445, best loss: 1.110608 2025-01-16 01:47:32,959 - INFO - step 12078, loss: 1.354277, best loss: 1.110608 2025-01-16 01:47:33,110 - INFO - step 12079, loss: 1.324397, best loss: 1.110608 2025-01-16 01:47:33,260 - INFO - step 12080, loss: 1.359680, best loss: 1.110608 2025-01-16 01:47:33,410 - INFO - step 12081, loss: 
1.434883, best loss: 1.110608 2025-01-16 01:47:33,560 - INFO - step 12082, loss: 1.450795, best loss: 1.110608 2025-01-16 01:47:33,710 - INFO - step 12083, loss: 1.525871, best loss: 1.110608 2025-01-16 01:47:33,860 - INFO - step 12084, loss: 1.525658, best loss: 1.110608 2025-01-16 01:47:34,010 - INFO - step 12085, loss: 1.380357, best loss: 1.110608 2025-01-16 01:47:34,160 - INFO - step 12086, loss: 1.408556, best loss: 1.110608 2025-01-16 01:47:34,311 - INFO - step 12087, loss: 1.412665, best loss: 1.110608 2025-01-16 01:47:34,460 - INFO - step 12088, loss: 1.495471, best loss: 1.110608 2025-01-16 01:47:34,610 - INFO - step 12089, loss: 1.511915, best loss: 1.110608 2025-01-16 01:47:34,761 - INFO - step 12090, loss: 1.448312, best loss: 1.110608 2025-01-16 01:47:34,911 - INFO - step 12091, loss: 1.289623, best loss: 1.110608 2025-01-16 01:47:35,061 - INFO - step 12092, loss: 1.372881, best loss: 1.110608 2025-01-16 01:47:35,211 - INFO - step 12093, loss: 1.505810, best loss: 1.110608 2025-01-16 01:47:35,361 - INFO - step 12094, loss: 1.525647, best loss: 1.110608 2025-01-16 01:47:35,511 - INFO - step 12095, loss: 1.556097, best loss: 1.110608 2025-01-16 01:47:35,661 - INFO - step 12096, loss: 1.622016, best loss: 1.110608 2025-01-16 01:47:35,811 - INFO - step 12097, loss: 1.500935, best loss: 1.110608 2025-01-16 01:47:35,961 - INFO - step 12098, loss: 1.429676, best loss: 1.110608 2025-01-16 01:47:36,111 - INFO - step 12099, loss: 1.490670, best loss: 1.110608 2025-01-16 01:47:36,261 - INFO - step 12100, loss: 1.592164, best loss: 1.110608 2025-01-16 01:47:36,411 - INFO - step 12101, loss: 1.626417, best loss: 1.110608 2025-01-16 01:47:36,561 - INFO - step 12102, loss: 1.607813, best loss: 1.110608 2025-01-16 01:47:36,711 - INFO - step 12103, loss: 1.741585, best loss: 1.110608 2025-01-16 01:47:36,861 - INFO - step 12104, loss: 1.806079, best loss: 1.110608 2025-01-16 01:47:37,011 - INFO - step 12105, loss: 1.441412, best loss: 1.110608 2025-01-16 01:47:37,161 - 
INFO - step 12106, loss: 1.597172, best loss: 1.110608 2025-01-16 01:47:37,311 - INFO - step 12107, loss: 1.363766, best loss: 1.110608 2025-01-16 01:47:37,461 - INFO - step 12108, loss: 1.273334, best loss: 1.110608 2025-01-16 01:47:37,611 - INFO - step 12109, loss: 1.574055, best loss: 1.110608 2025-01-16 01:47:37,761 - INFO - step 12110, loss: 1.539466, best loss: 1.110608 2025-01-16 01:47:37,911 - INFO - step 12111, loss: 1.228753, best loss: 1.110608 2025-01-16 01:47:38,061 - INFO - step 12112, loss: 1.336304, best loss: 1.110608 2025-01-16 01:47:38,211 - INFO - step 12113, loss: 1.565853, best loss: 1.110608 2025-01-16 01:47:38,361 - INFO - step 12114, loss: 1.644789, best loss: 1.110608 2025-01-16 01:47:38,511 - INFO - step 12115, loss: 1.462908, best loss: 1.110608 2025-01-16 01:47:38,661 - INFO - step 12116, loss: 1.563343, best loss: 1.110608 2025-01-16 01:47:38,811 - INFO - step 12117, loss: 1.367473, best loss: 1.110608 2025-01-16 01:47:38,962 - INFO - step 12118, loss: 1.217055, best loss: 1.110608 2025-01-16 01:47:39,112 - INFO - step 12119, loss: 1.623139, best loss: 1.110608 2025-01-16 01:47:39,262 - INFO - step 12120, loss: 1.724603, best loss: 1.110608 2025-01-16 01:47:39,412 - INFO - step 12121, loss: 1.770244, best loss: 1.110608 2025-01-16 01:47:39,562 - INFO - step 12122, loss: 1.538612, best loss: 1.110608 2025-01-16 01:47:39,712 - INFO - step 12123, loss: 1.506142, best loss: 1.110608 2025-01-16 01:47:39,862 - INFO - step 12124, loss: 1.569166, best loss: 1.110608 2025-01-16 01:47:40,012 - INFO - step 12125, loss: 1.522032, best loss: 1.110608 2025-01-16 01:47:40,162 - INFO - step 12126, loss: 1.467128, best loss: 1.110608 2025-01-16 01:47:40,312 - INFO - step 12127, loss: 1.586817, best loss: 1.110608 2025-01-16 01:47:40,462 - INFO - step 12128, loss: 1.385968, best loss: 1.110608 2025-01-16 01:47:40,612 - INFO - step 12129, loss: 1.117867, best loss: 1.110608 2025-01-16 01:47:40,762 - INFO - step 12130, loss: 1.433246, best loss: 1.110608 
2025-01-16 01:47:40,912 - INFO - step 12131, loss: 1.420178, best loss: 1.110608 2025-01-16 01:47:41,062 - INFO - step 12132, loss: 1.489529, best loss: 1.110608 2025-01-16 01:47:41,212 - INFO - step 12133, loss: 1.203552, best loss: 1.110608 2025-01-16 01:47:41,362 - INFO - step 12134, loss: 1.263874, best loss: 1.110608 2025-01-16 01:47:44,821 - INFO - step 12135, loss: 1.100134, best loss: 1.100134 2025-01-16 01:47:44,984 - INFO - step 12136, loss: 1.323776, best loss: 1.100134 2025-01-16 01:47:45,137 - INFO - step 12137, loss: 1.467898, best loss: 1.100134 2025-01-16 01:47:45,287 - INFO - step 12138, loss: 1.540488, best loss: 1.100134 2025-01-16 01:47:45,437 - INFO - step 12139, loss: 1.491536, best loss: 1.100134 2025-01-16 01:47:45,588 - INFO - step 12140, loss: 1.494165, best loss: 1.100134 2025-01-16 01:47:45,738 - INFO - step 12141, loss: 1.407406, best loss: 1.100134 2025-01-16 01:47:45,888 - INFO - step 12142, loss: 1.548491, best loss: 1.100134 2025-01-16 01:47:46,038 - INFO - step 12143, loss: 1.322853, best loss: 1.100134 2025-01-16 01:47:46,188 - INFO - step 12144, loss: 1.410386, best loss: 1.100134 2025-01-16 01:47:46,338 - INFO - step 12145, loss: 1.338085, best loss: 1.100134 2025-01-16 01:47:46,488 - INFO - step 12146, loss: 1.349154, best loss: 1.100134 2025-01-16 01:47:46,638 - INFO - step 12147, loss: 1.230779, best loss: 1.100134 2025-01-16 01:47:46,788 - INFO - step 12148, loss: 1.320618, best loss: 1.100134 2025-01-16 01:47:46,938 - INFO - step 12149, loss: 1.375878, best loss: 1.100134 2025-01-16 01:47:47,088 - INFO - step 12150, loss: 1.329257, best loss: 1.100134 2025-01-16 01:47:47,238 - INFO - step 12151, loss: 1.337597, best loss: 1.100134 2025-01-16 01:47:47,388 - INFO - step 12152, loss: 1.353749, best loss: 1.100134 2025-01-16 01:47:47,538 - INFO - step 12153, loss: 1.316114, best loss: 1.100134 2025-01-16 01:47:47,689 - INFO - step 12154, loss: 1.393837, best loss: 1.100134 2025-01-16 01:47:47,838 - INFO - step 12155, loss: 
1.602848, best loss: 1.100134 2025-01-16 01:47:47,989 - INFO - step 12156, loss: 1.300237, best loss: 1.100134 2025-01-16 01:47:48,139 - INFO - step 12157, loss: 1.410016, best loss: 1.100134 2025-01-16 01:47:48,289 - INFO - step 12158, loss: 1.345739, best loss: 1.100134 2025-01-16 01:47:48,439 - INFO - step 12159, loss: 1.470864, best loss: 1.100134 2025-01-16 01:47:48,590 - INFO - step 12160, loss: 1.369056, best loss: 1.100134 2025-01-16 01:47:48,740 - INFO - step 12161, loss: 1.353253, best loss: 1.100134 2025-01-16 01:47:48,890 - INFO - step 12162, loss: 1.415861, best loss: 1.100134 2025-01-16 01:47:49,040 - INFO - step 12163, loss: 1.372191, best loss: 1.100134 2025-01-16 01:47:49,190 - INFO - step 12164, loss: 1.540615, best loss: 1.100134 2025-01-16 01:47:49,340 - INFO - step 12165, loss: 1.441524, best loss: 1.100134 2025-01-16 01:47:49,490 - INFO - step 12166, loss: 1.468148, best loss: 1.100134 2025-01-16 01:47:49,641 - INFO - step 12167, loss: 1.435037, best loss: 1.100134 2025-01-16 01:47:49,791 - INFO - step 12168, loss: 1.563395, best loss: 1.100134 2025-01-16 01:47:49,941 - INFO - step 12169, loss: 1.427812, best loss: 1.100134 2025-01-16 01:47:50,091 - INFO - step 12170, loss: 1.347833, best loss: 1.100134 2025-01-16 01:47:50,241 - INFO - step 12171, loss: 1.469170, best loss: 1.100134 2025-01-16 01:47:50,391 - INFO - step 12172, loss: 1.370401, best loss: 1.100134 2025-01-16 01:47:50,541 - INFO - step 12173, loss: 1.334671, best loss: 1.100134 2025-01-16 01:47:50,691 - INFO - step 12174, loss: 1.403381, best loss: 1.100134 2025-01-16 01:47:50,841 - INFO - step 12175, loss: 1.357365, best loss: 1.100134 2025-01-16 01:47:50,991 - INFO - step 12176, loss: 1.483488, best loss: 1.100134 2025-01-16 01:47:51,142 - INFO - step 12177, loss: 1.179892, best loss: 1.100134 2025-01-16 01:47:51,292 - INFO - step 12178, loss: 1.489344, best loss: 1.100134 2025-01-16 01:47:51,442 - INFO - step 12179, loss: 1.453709, best loss: 1.100134 2025-01-16 01:47:51,592 - 
INFO - step 12180, loss: 1.446310, best loss: 1.100134 2025-01-16 01:47:51,742 - INFO - step 12181, loss: 1.449825, best loss: 1.100134 2025-01-16 01:47:51,892 - INFO - step 12182, loss: 1.336007, best loss: 1.100134 2025-01-16 01:47:52,043 - INFO - step 12183, loss: 1.443674, best loss: 1.100134 2025-01-16 01:47:52,192 - INFO - step 12184, loss: 1.348892, best loss: 1.100134 2025-01-16 01:47:52,342 - INFO - step 12185, loss: 1.307482, best loss: 1.100134 2025-01-16 01:47:52,492 - INFO - step 12186, loss: 1.338683, best loss: 1.100134 2025-01-16 01:47:52,642 - INFO - step 12187, loss: 1.458762, best loss: 1.100134 2025-01-16 01:47:52,793 - INFO - step 12188, loss: 1.396413, best loss: 1.100134 2025-01-16 01:47:52,943 - INFO - step 12189, loss: 1.409294, best loss: 1.100134 2025-01-16 01:47:53,093 - INFO - step 12190, loss: 1.282958, best loss: 1.100134 2025-01-16 01:47:53,243 - INFO - step 12191, loss: 1.283774, best loss: 1.100134 2025-01-16 01:47:53,393 - INFO - step 12192, loss: 1.453662, best loss: 1.100134 2025-01-16 01:47:53,543 - INFO - step 12193, loss: 1.300535, best loss: 1.100134 2025-01-16 01:47:53,693 - INFO - step 12194, loss: 1.381504, best loss: 1.100134 2025-01-16 01:47:53,843 - INFO - step 12195, loss: 1.147698, best loss: 1.100134 2025-01-16 01:47:53,993 - INFO - step 12196, loss: 1.127623, best loss: 1.100134 2025-01-16 01:47:54,144 - INFO - step 12197, loss: 1.122323, best loss: 1.100134 2025-01-16 01:47:54,294 - INFO - step 12198, loss: 1.472772, best loss: 1.100134 2025-01-16 01:47:54,443 - INFO - step 12199, loss: 1.433105, best loss: 1.100134 2025-01-16 01:47:54,594 - INFO - step 12200, loss: 1.482069, best loss: 1.100134 2025-01-16 01:47:54,744 - INFO - step 12201, loss: 1.569997, best loss: 1.100134 2025-01-16 01:47:54,894 - INFO - step 12202, loss: 1.500738, best loss: 1.100134 2025-01-16 01:47:55,044 - INFO - step 12203, loss: 1.245824, best loss: 1.100134 2025-01-16 01:47:55,194 - INFO - step 12204, loss: 1.309172, best loss: 1.100134 
2025-01-16 01:47:55,344 - INFO - step 12205, loss: 1.501887, best loss: 1.100134 2025-01-16 01:47:55,495 - INFO - step 12206, loss: 1.485939, best loss: 1.100134 2025-01-16 01:47:58,993 - INFO - step 12207, loss: 1.075299, best loss: 1.075299 2025-01-16 01:47:59,143 - INFO - step 12208, loss: 1.209224, best loss: 1.075299 2025-01-16 01:47:59,294 - INFO - step 12209, loss: 1.217622, best loss: 1.075299 2025-01-16 01:47:59,444 - INFO - step 12210, loss: 1.316204, best loss: 1.075299 2025-01-16 01:47:59,594 - INFO - step 12211, loss: 1.449063, best loss: 1.075299 2025-01-16 01:47:59,744 - INFO - step 12212, loss: 1.319721, best loss: 1.075299 2025-01-16 01:47:59,895 - INFO - step 12213, loss: 1.380759, best loss: 1.075299 2025-01-16 01:48:00,045 - INFO - step 12214, loss: 1.363086, best loss: 1.075299 2025-01-16 01:48:00,195 - INFO - step 12215, loss: 1.222622, best loss: 1.075299 2025-01-16 01:48:00,345 - INFO - step 12216, loss: 1.414751, best loss: 1.075299 2025-01-16 01:48:00,495 - INFO - step 12217, loss: 1.457810, best loss: 1.075299 2025-01-16 01:48:00,646 - INFO - step 12218, loss: 1.491078, best loss: 1.075299 2025-01-16 01:48:00,796 - INFO - step 12219, loss: 1.534852, best loss: 1.075299 2025-01-16 01:48:00,946 - INFO - step 12220, loss: 1.288831, best loss: 1.075299 2025-01-16 01:48:01,096 - INFO - step 12221, loss: 1.308737, best loss: 1.075299 2025-01-16 01:48:01,247 - INFO - step 12222, loss: 1.240669, best loss: 1.075299 2025-01-16 01:48:01,397 - INFO - step 12223, loss: 1.509252, best loss: 1.075299 2025-01-16 01:48:01,547 - INFO - step 12224, loss: 1.488520, best loss: 1.075299 2025-01-16 01:48:01,697 - INFO - step 12225, loss: 1.535580, best loss: 1.075299 2025-01-16 01:48:01,848 - INFO - step 12226, loss: 1.471627, best loss: 1.075299 2025-01-16 01:48:01,998 - INFO - step 12227, loss: 1.423989, best loss: 1.075299 2025-01-16 01:48:02,148 - INFO - step 12228, loss: 1.357440, best loss: 1.075299 2025-01-16 01:48:02,298 - INFO - step 12229, loss: 
1.353168, best loss: 1.075299 2025-01-16 01:48:02,448 - INFO - step 12230, loss: 1.337246, best loss: 1.075299 2025-01-16 01:48:02,598 - INFO - step 12231, loss: 1.533532, best loss: 1.075299 2025-01-16 01:48:02,748 - INFO - step 12232, loss: 1.230325, best loss: 1.075299 2025-01-16 01:48:02,899 - INFO - step 12233, loss: 1.369670, best loss: 1.075299 2025-01-16 01:48:03,049 - INFO - step 12234, loss: 1.415799, best loss: 1.075299 2025-01-16 01:48:03,199 - INFO - step 12235, loss: 1.555319, best loss: 1.075299 2025-01-16 01:48:03,349 - INFO - step 12236, loss: 1.418123, best loss: 1.075299 2025-01-16 01:48:03,499 - INFO - step 12237, loss: 1.259137, best loss: 1.075299 2025-01-16 01:48:03,650 - INFO - step 12238, loss: 1.470323, best loss: 1.075299 2025-01-16 01:48:03,800 - INFO - step 12239, loss: 1.520392, best loss: 1.075299 2025-01-16 01:48:03,950 - INFO - step 12240, loss: 1.213344, best loss: 1.075299 2025-01-16 01:48:04,100 - INFO - step 12241, loss: 1.321498, best loss: 1.075299 2025-01-16 01:48:04,250 - INFO - step 12242, loss: 1.354881, best loss: 1.075299 2025-01-16 01:48:04,400 - INFO - step 12243, loss: 1.378554, best loss: 1.075299 2025-01-16 01:48:04,550 - INFO - step 12244, loss: 1.244705, best loss: 1.075299 2025-01-16 01:48:04,700 - INFO - step 12245, loss: 1.257219, best loss: 1.075299 2025-01-16 01:48:04,850 - INFO - step 12246, loss: 1.566153, best loss: 1.075299 2025-01-16 01:48:05,001 - INFO - step 12247, loss: 1.487412, best loss: 1.075299 2025-01-16 01:48:05,151 - INFO - step 12248, loss: 1.424552, best loss: 1.075299 2025-01-16 01:48:05,301 - INFO - step 12249, loss: 1.248772, best loss: 1.075299 2025-01-16 01:48:05,451 - INFO - step 12250, loss: 1.448714, best loss: 1.075299 2025-01-16 01:48:05,601 - INFO - step 12251, loss: 1.456904, best loss: 1.075299 2025-01-16 01:48:05,752 - INFO - step 12252, loss: 1.457517, best loss: 1.075299 2025-01-16 01:48:05,902 - INFO - step 12253, loss: 1.256430, best loss: 1.075299 2025-01-16 01:48:06,052 - 
INFO - step 12254, loss: 1.342203, best loss: 1.075299 2025-01-16 01:48:06,202 - INFO - step 12255, loss: 1.332239, best loss: 1.075299 2025-01-16 01:48:06,352 - INFO - step 12256, loss: 1.525714, best loss: 1.075299 2025-01-16 01:48:06,502 - INFO - step 12257, loss: 1.431494, best loss: 1.075299 2025-01-16 01:48:06,652 - INFO - step 12258, loss: 1.476729, best loss: 1.075299 2025-01-16 01:48:06,802 - INFO - step 12259, loss: 1.212642, best loss: 1.075299 2025-01-16 01:48:06,953 - INFO - step 12260, loss: 1.258852, best loss: 1.075299 2025-01-16 01:48:07,103 - INFO - step 12261, loss: 1.393260, best loss: 1.075299 2025-01-16 01:48:07,253 - INFO - step 12262, loss: 1.460171, best loss: 1.075299 2025-01-16 01:48:07,403 - INFO - step 12263, loss: 1.427823, best loss: 1.075299 2025-01-16 01:48:07,553 - INFO - step 12264, loss: 1.447586, best loss: 1.075299 2025-01-16 01:48:07,703 - INFO - step 12265, loss: 1.382180, best loss: 1.075299 2025-01-16 01:48:07,854 - INFO - step 12266, loss: 1.496145, best loss: 1.075299 2025-01-16 01:48:08,004 - INFO - step 12267, loss: 1.407865, best loss: 1.075299 2025-01-16 01:48:08,154 - INFO - step 12268, loss: 1.182284, best loss: 1.075299 2025-01-16 01:48:08,304 - INFO - step 12269, loss: 1.498589, best loss: 1.075299 2025-01-16 01:48:08,454 - INFO - step 12270, loss: 1.464317, best loss: 1.075299 2025-01-16 01:48:08,604 - INFO - step 12271, loss: 1.614948, best loss: 1.075299 2025-01-16 01:48:08,754 - INFO - step 12272, loss: 1.567987, best loss: 1.075299 2025-01-16 01:48:08,905 - INFO - step 12273, loss: 1.498385, best loss: 1.075299 2025-01-16 01:48:09,055 - INFO - step 12274, loss: 1.431664, best loss: 1.075299 2025-01-16 01:48:09,205 - INFO - step 12275, loss: 1.233361, best loss: 1.075299 2025-01-16 01:48:09,355 - INFO - step 12276, loss: 1.480459, best loss: 1.075299 2025-01-16 01:48:09,505 - INFO - step 12277, loss: 1.272938, best loss: 1.075299 2025-01-16 01:48:09,655 - INFO - step 12278, loss: 1.326685, best loss: 1.075299 
2025-01-16 01:48:09,805 - INFO - step 12279, loss: 1.366548, best loss: 1.075299 2025-01-16 01:48:09,955 - INFO - step 12280, loss: 1.433400, best loss: 1.075299 2025-01-16 01:48:10,105 - INFO - step 12281, loss: 1.411321, best loss: 1.075299 2025-01-16 01:48:10,255 - INFO - step 12282, loss: 1.474925, best loss: 1.075299 2025-01-16 01:48:10,405 - INFO - step 12283, loss: 1.448600, best loss: 1.075299 2025-01-16 01:48:10,555 - INFO - step 12284, loss: 1.460176, best loss: 1.075299 2025-01-16 01:48:10,705 - INFO - step 12285, loss: 1.473310, best loss: 1.075299 2025-01-16 01:48:10,856 - INFO - step 12286, loss: 1.356318, best loss: 1.075299 2025-01-16 01:48:11,006 - INFO - step 12287, loss: 1.549873, best loss: 1.075299 2025-01-16 01:48:11,156 - INFO - step 12288, loss: 1.479776, best loss: 1.075299 2025-01-16 01:48:11,306 - INFO - step 12289, loss: 1.280451, best loss: 1.075299 2025-01-16 01:48:11,456 - INFO - step 12290, loss: 1.471559, best loss: 1.075299 2025-01-16 01:48:11,606 - INFO - step 12291, loss: 1.319144, best loss: 1.075299 2025-01-16 01:48:11,756 - INFO - step 12292, loss: 1.294323, best loss: 1.075299 2025-01-16 01:48:11,906 - INFO - step 12293, loss: 1.314520, best loss: 1.075299 2025-01-16 01:48:12,056 - INFO - step 12294, loss: 1.376016, best loss: 1.075299 2025-01-16 01:48:12,206 - INFO - step 12295, loss: 1.440937, best loss: 1.075299 2025-01-16 01:48:12,356 - INFO - step 12296, loss: 1.239882, best loss: 1.075299 2025-01-16 01:48:12,506 - INFO - step 12297, loss: 1.277110, best loss: 1.075299 2025-01-16 01:48:12,656 - INFO - step 12298, loss: 1.367739, best loss: 1.075299 2025-01-16 01:48:12,806 - INFO - step 12299, loss: 1.441200, best loss: 1.075299 2025-01-16 01:48:12,956 - INFO - step 12300, loss: 1.312177, best loss: 1.075299 2025-01-16 01:48:13,106 - INFO - step 12301, loss: 1.402581, best loss: 1.075299 2025-01-16 01:48:13,256 - INFO - step 12302, loss: 1.377038, best loss: 1.075299 2025-01-16 01:48:13,406 - INFO - step 12303, loss: 
1.477596, best loss: 1.075299 2025-01-16 01:48:13,557 - INFO - step 12304, loss: 1.567867, best loss: 1.075299 2025-01-16 01:48:13,707 - INFO - step 12305, loss: 1.627502, best loss: 1.075299 2025-01-16 01:48:13,857 - INFO - step 12306, loss: 1.615360, best loss: 1.075299 2025-01-16 01:48:14,007 - INFO - step 12307, loss: 1.365780, best loss: 1.075299 2025-01-16 01:48:14,157 - INFO - step 12308, loss: 1.452868, best loss: 1.075299 2025-01-16 01:48:14,307 - INFO - step 12309, loss: 1.475551, best loss: 1.075299 2025-01-16 01:48:14,457 - INFO - step 12310, loss: 1.408154, best loss: 1.075299 2025-01-16 01:48:14,607 - INFO - step 12311, loss: 1.456689, best loss: 1.075299 2025-01-16 01:48:14,757 - INFO - step 12312, loss: 1.393049, best loss: 1.075299 2025-01-16 01:48:14,908 - INFO - step 12313, loss: 1.598993, best loss: 1.075299 2025-01-16 01:48:15,058 - INFO - step 12314, loss: 1.329696, best loss: 1.075299 2025-01-16 01:48:15,208 - INFO - step 12315, loss: 1.376292, best loss: 1.075299 2025-01-16 01:48:15,358 - INFO - step 12316, loss: 1.448036, best loss: 1.075299 2025-01-16 01:48:15,508 - INFO - step 12317, loss: 1.280514, best loss: 1.075299 2025-01-16 01:48:15,658 - INFO - step 12318, loss: 1.367028, best loss: 1.075299 2025-01-16 01:48:15,808 - INFO - step 12319, loss: 1.552577, best loss: 1.075299 2025-01-16 01:48:15,958 - INFO - step 12320, loss: 1.394037, best loss: 1.075299 2025-01-16 01:48:16,109 - INFO - step 12321, loss: 1.481323, best loss: 1.075299 2025-01-16 01:48:16,259 - INFO - step 12322, loss: 1.452041, best loss: 1.075299 2025-01-16 01:48:16,409 - INFO - step 12323, loss: 1.390739, best loss: 1.075299 2025-01-16 01:48:16,559 - INFO - step 12324, loss: 1.622509, best loss: 1.075299 2025-01-16 01:48:16,709 - INFO - step 12325, loss: 1.425968, best loss: 1.075299 2025-01-16 01:48:16,859 - INFO - step 12326, loss: 1.425004, best loss: 1.075299 2025-01-16 01:48:17,009 - INFO - step 12327, loss: 1.511769, best loss: 1.075299 2025-01-16 01:48:17,160 - 
INFO - step 12328, loss: 1.330446, best loss: 1.075299 2025-01-16 01:48:17,310 - INFO - step 12329, loss: 1.427829, best loss: 1.075299 2025-01-16 01:48:17,460 - INFO - step 12330, loss: 1.467995, best loss: 1.075299 2025-01-16 01:48:17,610 - INFO - step 12331, loss: 1.370075, best loss: 1.075299 2025-01-16 01:48:17,760 - INFO - step 12332, loss: 1.482822, best loss: 1.075299 2025-01-16 01:48:17,910 - INFO - step 12333, loss: 1.307325, best loss: 1.075299 2025-01-16 01:48:18,060 - INFO - step 12334, loss: 1.293463, best loss: 1.075299 2025-01-16 01:48:18,210 - INFO - step 12335, loss: 1.281922, best loss: 1.075299 2025-01-16 01:48:18,361 - INFO - step 12336, loss: 1.391943, best loss: 1.075299 2025-01-16 01:48:18,511 - INFO - step 12337, loss: 1.595016, best loss: 1.075299 2025-01-16 01:48:18,661 - INFO - step 12338, loss: 1.294866, best loss: 1.075299 2025-01-16 01:48:18,811 - INFO - step 12339, loss: 1.151909, best loss: 1.075299 2025-01-16 01:48:18,961 - INFO - step 12340, loss: 1.346256, best loss: 1.075299 2025-01-16 01:48:19,111 - INFO - step 12341, loss: 1.437979, best loss: 1.075299 2025-01-16 01:48:19,261 - INFO - step 12342, loss: 1.592514, best loss: 1.075299 2025-01-16 01:48:19,411 - INFO - step 12343, loss: 1.211074, best loss: 1.075299 2025-01-16 01:48:19,562 - INFO - step 12344, loss: 1.518744, best loss: 1.075299 2025-01-16 01:48:19,712 - INFO - step 12345, loss: 1.260372, best loss: 1.075299 2025-01-16 01:48:19,862 - INFO - step 12346, loss: 1.281082, best loss: 1.075299 2025-01-16 01:48:20,012 - INFO - step 12347, loss: 1.420244, best loss: 1.075299 2025-01-16 01:48:20,162 - INFO - step 12348, loss: 1.360890, best loss: 1.075299 2025-01-16 01:48:20,312 - INFO - step 12349, loss: 1.456478, best loss: 1.075299 2025-01-16 01:48:20,463 - INFO - step 12350, loss: 1.484766, best loss: 1.075299 2025-01-16 01:48:20,613 - INFO - step 12351, loss: 1.435988, best loss: 1.075299 2025-01-16 01:48:20,763 - INFO - step 12352, loss: 1.539784, best loss: 1.075299 
2025-01-16 01:48:20,913 - INFO - step 12353, loss: 1.278675, best loss: 1.075299 2025-01-16 01:48:21,063 - INFO - step 12354, loss: 1.096401, best loss: 1.075299 2025-01-16 01:48:21,213 - INFO - step 12355, loss: 1.365480, best loss: 1.075299 2025-01-16 01:48:21,363 - INFO - step 12356, loss: 1.658827, best loss: 1.075299 2025-01-16 01:48:21,514 - INFO - step 12357, loss: 1.501323, best loss: 1.075299 2025-01-16 01:48:21,664 - INFO - step 12358, loss: 1.319204, best loss: 1.075299 2025-01-16 01:48:21,814 - INFO - step 12359, loss: 1.343791, best loss: 1.075299 2025-01-16 01:48:21,964 - INFO - step 12360, loss: 1.516338, best loss: 1.075299 2025-01-16 01:48:22,114 - INFO - step 12361, loss: 1.507626, best loss: 1.075299 2025-01-16 01:48:22,264 - INFO - step 12362, loss: 1.423747, best loss: 1.075299 2025-01-16 01:48:22,414 - INFO - step 12363, loss: 1.257352, best loss: 1.075299 2025-01-16 01:48:22,565 - INFO - step 12364, loss: 1.516370, best loss: 1.075299 2025-01-16 01:48:22,715 - INFO - step 12365, loss: 1.491414, best loss: 1.075299 2025-01-16 01:48:22,865 - INFO - step 12366, loss: 1.410835, best loss: 1.075299 2025-01-16 01:48:23,015 - INFO - step 12367, loss: 1.404286, best loss: 1.075299 2025-01-16 01:48:23,165 - INFO - step 12368, loss: 1.391982, best loss: 1.075299 2025-01-16 01:48:23,315 - INFO - step 12369, loss: 1.428289, best loss: 1.075299 2025-01-16 01:48:23,465 - INFO - step 12370, loss: 1.353836, best loss: 1.075299 2025-01-16 01:48:23,615 - INFO - step 12371, loss: 1.359230, best loss: 1.075299 2025-01-16 01:48:23,765 - INFO - step 12372, loss: 1.280258, best loss: 1.075299 2025-01-16 01:48:23,916 - INFO - step 12373, loss: 1.114323, best loss: 1.075299 2025-01-16 01:48:24,066 - INFO - step 12374, loss: 1.463459, best loss: 1.075299 2025-01-16 01:48:24,216 - INFO - step 12375, loss: 1.297908, best loss: 1.075299 2025-01-16 01:48:24,366 - INFO - step 12376, loss: 1.630801, best loss: 1.075299 2025-01-16 01:48:24,517 - INFO - step 12377, loss: 
1.375572, best loss: 1.075299 2025-01-16 01:48:24,667 - INFO - step 12378, loss: 1.433561, best loss: 1.075299 2025-01-16 01:48:24,817 - INFO - step 12379, loss: 1.466980, best loss: 1.075299 2025-01-16 01:48:24,967 - INFO - step 12380, loss: 1.345361, best loss: 1.075299 2025-01-16 01:48:25,117 - INFO - step 12381, loss: 1.201182, best loss: 1.075299 2025-01-16 01:48:25,267 - INFO - step 12382, loss: 1.341547, best loss: 1.075299 2025-01-16 01:48:25,418 - INFO - step 12383, loss: 1.377828, best loss: 1.075299 2025-01-16 01:48:25,568 - INFO - step 12384, loss: 1.440648, best loss: 1.075299 2025-01-16 01:48:25,718 - INFO - step 12385, loss: 1.336516, best loss: 1.075299 2025-01-16 01:48:25,868 - INFO - step 12386, loss: 1.553725, best loss: 1.075299 2025-01-16 01:48:26,018 - INFO - step 12387, loss: 1.573169, best loss: 1.075299 2025-01-16 01:48:26,168 - INFO - step 12388, loss: 1.485573, best loss: 1.075299 2025-01-16 01:48:26,318 - INFO - step 12389, loss: 1.476606, best loss: 1.075299 2025-01-16 01:48:26,468 - INFO - step 12390, loss: 1.635588, best loss: 1.075299 2025-01-16 01:48:26,618 - INFO - step 12391, loss: 1.363673, best loss: 1.075299 2025-01-16 01:48:26,769 - INFO - step 12392, loss: 1.587837, best loss: 1.075299 2025-01-16 01:48:26,919 - INFO - step 12393, loss: 1.552567, best loss: 1.075299 2025-01-16 01:48:27,069 - INFO - step 12394, loss: 1.435394, best loss: 1.075299 2025-01-16 01:48:27,219 - INFO - step 12395, loss: 1.374447, best loss: 1.075299 2025-01-16 01:48:27,370 - INFO - step 12396, loss: 1.611852, best loss: 1.075299 2025-01-16 01:48:27,520 - INFO - step 12397, loss: 1.404346, best loss: 1.075299 2025-01-16 01:48:30,788 - INFO - step 12398, loss: 1.047481, best loss: 1.047481 2025-01-16 01:48:30,952 - INFO - step 12399, loss: 1.471846, best loss: 1.047481 2025-01-16 01:48:31,106 - INFO - step 12400, loss: 1.385691, best loss: 1.047481 2025-01-16 01:48:31,256 - INFO - step 12401, loss: 1.425617, best loss: 1.047481 2025-01-16 01:48:31,406 - 
INFO - step 12402, loss: 1.479277, best loss: 1.047481 2025-01-16 01:48:31,556 - INFO - step 12403, loss: 1.366222, best loss: 1.047481 2025-01-16 01:48:31,707 - INFO - step 12404, loss: 1.360845, best loss: 1.047481 2025-01-16 01:48:31,857 - INFO - step 12405, loss: 1.507982, best loss: 1.047481 2025-01-16 01:48:32,007 - INFO - step 12406, loss: 1.494511, best loss: 1.047481 2025-01-16 01:48:32,158 - INFO - step 12407, loss: 1.409786, best loss: 1.047481 2025-01-16 01:48:32,308 - INFO - step 12408, loss: 1.325294, best loss: 1.047481 2025-01-16 01:48:32,458 - INFO - step 12409, loss: 1.300508, best loss: 1.047481 2025-01-16 01:48:32,608 - INFO - step 12410, loss: 1.323587, best loss: 1.047481 2025-01-16 01:48:32,758 - INFO - step 12411, loss: 1.432497, best loss: 1.047481 2025-01-16 01:48:32,908 - INFO - step 12412, loss: 1.388056, best loss: 1.047481 2025-01-16 01:48:33,058 - INFO - step 12413, loss: 1.493118, best loss: 1.047481 2025-01-16 01:48:33,208 - INFO - step 12414, loss: 1.461927, best loss: 1.047481 2025-01-16 01:48:33,358 - INFO - step 12415, loss: 1.327790, best loss: 1.047481 2025-01-16 01:48:33,509 - INFO - step 12416, loss: 1.344388, best loss: 1.047481 2025-01-16 01:48:33,659 - INFO - step 12417, loss: 1.387038, best loss: 1.047481 2025-01-16 01:48:33,809 - INFO - step 12418, loss: 1.462950, best loss: 1.047481 2025-01-16 01:48:33,960 - INFO - step 12419, loss: 1.448654, best loss: 1.047481 2025-01-16 01:48:34,110 - INFO - step 12420, loss: 1.468455, best loss: 1.047481 2025-01-16 01:48:34,261 - INFO - step 12421, loss: 1.275203, best loss: 1.047481 2025-01-16 01:48:34,411 - INFO - step 12422, loss: 1.263891, best loss: 1.047481 2025-01-16 01:48:34,561 - INFO - step 12423, loss: 1.385937, best loss: 1.047481 2025-01-16 01:48:34,711 - INFO - step 12424, loss: 1.466597, best loss: 1.047481 2025-01-16 01:48:34,861 - INFO - step 12425, loss: 1.490751, best loss: 1.047481 2025-01-16 01:48:35,012 - INFO - step 12426, loss: 1.495888, best loss: 1.047481 
2025-01-16 01:48:35,162 - INFO - step 12427, loss: 1.512136, best loss: 1.047481 2025-01-16 01:48:35,312 - INFO - step 12428, loss: 1.358872, best loss: 1.047481 2025-01-16 01:48:35,462 - INFO - step 12429, loss: 1.452051, best loss: 1.047481 2025-01-16 01:48:35,612 - INFO - step 12430, loss: 1.533056, best loss: 1.047481 2025-01-16 01:48:35,762 - INFO - step 12431, loss: 1.513094, best loss: 1.047481 2025-01-16 01:48:35,912 - INFO - step 12432, loss: 1.548571, best loss: 1.047481 2025-01-16 01:48:36,062 - INFO - step 12433, loss: 1.662247, best loss: 1.047481 2025-01-16 01:48:36,212 - INFO - step 12434, loss: 1.625513, best loss: 1.047481 2025-01-16 01:48:36,362 - INFO - step 12435, loss: 1.440786, best loss: 1.047481 2025-01-16 01:48:36,512 - INFO - step 12436, loss: 1.548383, best loss: 1.047481 2025-01-16 01:48:36,662 - INFO - step 12437, loss: 1.334467, best loss: 1.047481 2025-01-16 01:48:36,812 - INFO - step 12438, loss: 1.197187, best loss: 1.047481 2025-01-16 01:48:36,963 - INFO - step 12439, loss: 1.416535, best loss: 1.047481 2025-01-16 01:48:37,113 - INFO - step 12440, loss: 1.437581, best loss: 1.047481 2025-01-16 01:48:37,263 - INFO - step 12441, loss: 1.180660, best loss: 1.047481 2025-01-16 01:48:37,413 - INFO - step 12442, loss: 1.230937, best loss: 1.047481 2025-01-16 01:48:37,563 - INFO - step 12443, loss: 1.444598, best loss: 1.047481 2025-01-16 01:48:37,713 - INFO - step 12444, loss: 1.537043, best loss: 1.047481 2025-01-16 01:48:37,863 - INFO - step 12445, loss: 1.460293, best loss: 1.047481 2025-01-16 01:48:38,013 - INFO - step 12446, loss: 1.464016, best loss: 1.047481 2025-01-16 01:48:38,163 - INFO - step 12447, loss: 1.257114, best loss: 1.047481 2025-01-16 01:48:38,313 - INFO - step 12448, loss: 1.119126, best loss: 1.047481 2025-01-16 01:48:38,463 - INFO - step 12449, loss: 1.573603, best loss: 1.047481 2025-01-16 01:48:38,613 - INFO - step 12450, loss: 1.548048, best loss: 1.047481 2025-01-16 01:48:38,764 - INFO - step 12451, loss: 
1.595360, best loss: 1.047481 2025-01-16 01:48:38,914 - INFO - step 12452, loss: 1.447739, best loss: 1.047481 2025-01-16 01:48:39,064 - INFO - step 12453, loss: 1.426815, best loss: 1.047481 2025-01-16 01:48:39,214 - INFO - step 12454, loss: 1.499707, best loss: 1.047481 2025-01-16 01:48:39,364 - INFO - step 12455, loss: 1.374507, best loss: 1.047481 2025-01-16 01:48:39,514 - INFO - step 12456, loss: 1.339191, best loss: 1.047481 2025-01-16 01:48:39,664 - INFO - step 12457, loss: 1.457199, best loss: 1.047481 2025-01-16 01:48:39,815 - INFO - step 12458, loss: 1.247226, best loss: 1.047481 2025-01-16 01:48:39,965 - INFO - step 12459, loss: 1.073690, best loss: 1.047481 2025-01-16 01:48:40,115 - INFO - step 12460, loss: 1.417294, best loss: 1.047481 2025-01-16 01:48:40,265 - INFO - step 12461, loss: 1.336863, best loss: 1.047481 2025-01-16 01:48:40,415 - INFO - step 12462, loss: 1.396090, best loss: 1.047481 2025-01-16 01:48:40,565 - INFO - step 12463, loss: 1.123062, best loss: 1.047481 2025-01-16 01:48:40,716 - INFO - step 12464, loss: 1.194154, best loss: 1.047481 2025-01-16 01:48:40,866 - INFO - step 12465, loss: 1.051980, best loss: 1.047481 2025-01-16 01:48:41,016 - INFO - step 12466, loss: 1.224668, best loss: 1.047481 2025-01-16 01:48:41,166 - INFO - step 12467, loss: 1.454852, best loss: 1.047481 2025-01-16 01:48:41,316 - INFO - step 12468, loss: 1.397245, best loss: 1.047481 2025-01-16 01:48:41,466 - INFO - step 12469, loss: 1.469586, best loss: 1.047481 2025-01-16 01:48:41,616 - INFO - step 12470, loss: 1.405513, best loss: 1.047481 2025-01-16 01:48:41,767 - INFO - step 12471, loss: 1.348452, best loss: 1.047481 2025-01-16 01:48:41,917 - INFO - step 12472, loss: 1.403396, best loss: 1.047481 2025-01-16 01:48:42,067 - INFO - step 12473, loss: 1.234205, best loss: 1.047481 2025-01-16 01:48:42,217 - INFO - step 12474, loss: 1.175411, best loss: 1.047481 2025-01-16 01:48:42,367 - INFO - step 12475, loss: 1.348127, best loss: 1.047481 2025-01-16 01:48:42,517 - 
INFO - step 12476, loss: 1.219090, best loss: 1.047481 2025-01-16 01:48:42,667 - INFO - step 12477, loss: 1.113666, best loss: 1.047481 2025-01-16 01:48:42,817 - INFO - step 12478, loss: 1.198062, best loss: 1.047481 2025-01-16 01:48:42,967 - INFO - step 12479, loss: 1.296561, best loss: 1.047481 2025-01-16 01:48:43,117 - INFO - step 12480, loss: 1.286422, best loss: 1.047481 2025-01-16 01:48:43,267 - INFO - step 12481, loss: 1.226161, best loss: 1.047481 2025-01-16 01:48:43,417 - INFO - step 12482, loss: 1.150804, best loss: 1.047481 2025-01-16 01:48:43,567 - INFO - step 12483, loss: 1.133345, best loss: 1.047481 2025-01-16 01:48:43,717 - INFO - step 12484, loss: 1.253991, best loss: 1.047481 2025-01-16 01:48:43,868 - INFO - step 12485, loss: 1.379371, best loss: 1.047481 2025-01-16 01:48:44,018 - INFO - step 12486, loss: 1.178132, best loss: 1.047481 2025-01-16 01:48:44,168 - INFO - step 12487, loss: 1.251878, best loss: 1.047481 2025-01-16 01:48:44,318 - INFO - step 12488, loss: 1.276712, best loss: 1.047481 2025-01-16 01:48:44,468 - INFO - step 12489, loss: 1.352860, best loss: 1.047481 2025-01-16 01:48:44,618 - INFO - step 12490, loss: 1.277618, best loss: 1.047481 2025-01-16 01:48:44,768 - INFO - step 12491, loss: 1.203358, best loss: 1.047481 2025-01-16 01:48:44,918 - INFO - step 12492, loss: 1.287913, best loss: 1.047481 2025-01-16 01:48:45,068 - INFO - step 12493, loss: 1.253500, best loss: 1.047481 2025-01-16 01:48:45,218 - INFO - step 12494, loss: 1.471911, best loss: 1.047481 2025-01-16 01:48:45,368 - INFO - step 12495, loss: 1.326623, best loss: 1.047481 2025-01-16 01:48:45,519 - INFO - step 12496, loss: 1.377790, best loss: 1.047481 2025-01-16 01:48:45,669 - INFO - step 12497, loss: 1.311888, best loss: 1.047481 2025-01-16 01:48:45,819 - INFO - step 12498, loss: 1.351139, best loss: 1.047481 2025-01-16 01:48:45,969 - INFO - step 12499, loss: 1.294207, best loss: 1.047481 2025-01-16 01:48:46,119 - INFO - step 12500, loss: 1.179854, best loss: 1.047481 
2025-01-16 01:48:46,269 - INFO - step 12501, loss: 1.320862, best loss: 1.047481 2025-01-16 01:48:46,420 - INFO - step 12502, loss: 1.249548, best loss: 1.047481 2025-01-16 01:48:46,570 - INFO - step 12503, loss: 1.189059, best loss: 1.047481 2025-01-16 01:48:46,720 - INFO - step 12504, loss: 1.325958, best loss: 1.047481 2025-01-16 01:48:46,870 - INFO - step 12505, loss: 1.292317, best loss: 1.047481 2025-01-16 01:48:47,020 - INFO - step 12506, loss: 1.364544, best loss: 1.047481 2025-01-16 01:48:47,170 - INFO - step 12507, loss: 1.088749, best loss: 1.047481 2025-01-16 01:48:47,320 - INFO - step 12508, loss: 1.408571, best loss: 1.047481 2025-01-16 01:48:47,470 - INFO - step 12509, loss: 1.328206, best loss: 1.047481 2025-01-16 01:48:47,620 - INFO - step 12510, loss: 1.369978, best loss: 1.047481 2025-01-16 01:48:47,771 - INFO - step 12511, loss: 1.328524, best loss: 1.047481 2025-01-16 01:48:47,921 - INFO - step 12512, loss: 1.273513, best loss: 1.047481 2025-01-16 01:48:48,071 - INFO - step 12513, loss: 1.321280, best loss: 1.047481 2025-01-16 01:48:48,221 - INFO - step 12514, loss: 1.211461, best loss: 1.047481 2025-01-16 01:48:48,371 - INFO - step 12515, loss: 1.155389, best loss: 1.047481 2025-01-16 01:48:48,521 - INFO - step 12516, loss: 1.172713, best loss: 1.047481 2025-01-16 01:48:48,672 - INFO - step 12517, loss: 1.348164, best loss: 1.047481 2025-01-16 01:48:48,822 - INFO - step 12518, loss: 1.307721, best loss: 1.047481 2025-01-16 01:48:48,972 - INFO - step 12519, loss: 1.243703, best loss: 1.047481 2025-01-16 01:48:49,122 - INFO - step 12520, loss: 1.223733, best loss: 1.047481 2025-01-16 01:48:49,272 - INFO - step 12521, loss: 1.260074, best loss: 1.047481 2025-01-16 01:48:49,423 - INFO - step 12522, loss: 1.404457, best loss: 1.047481 2025-01-16 01:48:49,574 - INFO - step 12523, loss: 1.280248, best loss: 1.047481 2025-01-16 01:48:49,724 - INFO - step 12524, loss: 1.338409, best loss: 1.047481 2025-01-16 01:48:49,874 - INFO - step 12525, loss: 
1.168157, best loss: 1.047481 2025-01-16 01:48:50,024 - INFO - step 12526, loss: 1.115010, best loss: 1.047481 2025-01-16 01:48:50,175 - INFO - step 12527, loss: 1.095230, best loss: 1.047481 2025-01-16 01:48:50,325 - INFO - step 12528, loss: 1.351326, best loss: 1.047481 2025-01-16 01:48:50,475 - INFO - step 12529, loss: 1.295688, best loss: 1.047481 2025-01-16 01:48:50,625 - INFO - step 12530, loss: 1.408929, best loss: 1.047481 2025-01-16 01:48:50,775 - INFO - step 12531, loss: 1.382398, best loss: 1.047481 2025-01-16 01:48:50,926 - INFO - step 12532, loss: 1.268014, best loss: 1.047481 2025-01-16 01:48:51,076 - INFO - step 12533, loss: 1.225315, best loss: 1.047481 2025-01-16 01:48:51,226 - INFO - step 12534, loss: 1.228809, best loss: 1.047481 2025-01-16 01:48:51,376 - INFO - step 12535, loss: 1.405796, best loss: 1.047481 2025-01-16 01:48:51,526 - INFO - step 12536, loss: 1.404795, best loss: 1.047481 2025-01-16 01:48:55,049 - INFO - step 12537, loss: 0.997637, best loss: 0.997637 2025-01-16 01:48:55,199 - INFO - step 12538, loss: 1.242063, best loss: 0.997637 2025-01-16 01:48:55,349 - INFO - step 12539, loss: 1.070154, best loss: 0.997637 2025-01-16 01:48:55,499 - INFO - step 12540, loss: 1.340008, best loss: 0.997637 2025-01-16 01:48:55,649 - INFO - step 12541, loss: 1.357527, best loss: 0.997637 2025-01-16 01:48:55,800 - INFO - step 12542, loss: 1.219183, best loss: 0.997637 2025-01-16 01:48:55,950 - INFO - step 12543, loss: 1.238596, best loss: 0.997637 2025-01-16 01:48:56,100 - INFO - step 12544, loss: 1.274415, best loss: 0.997637 2025-01-16 01:48:56,250 - INFO - step 12545, loss: 1.135076, best loss: 0.997637 2025-01-16 01:48:56,400 - INFO - step 12546, loss: 1.386964, best loss: 0.997637 2025-01-16 01:48:56,550 - INFO - step 12547, loss: 1.359671, best loss: 0.997637 2025-01-16 01:48:56,700 - INFO - step 12548, loss: 1.416413, best loss: 0.997637 2025-01-16 01:48:56,850 - INFO - step 12549, loss: 1.382341, best loss: 0.997637 2025-01-16 01:48:57,001 - 
INFO - step 12550, loss: 1.208737, best loss: 0.997637 2025-01-16 01:48:57,151 - INFO - step 12551, loss: 1.256206, best loss: 0.997637 2025-01-16 01:48:57,301 - INFO - step 12552, loss: 1.078182, best loss: 0.997637 2025-01-16 01:48:57,451 - INFO - step 12553, loss: 1.371597, best loss: 0.997637 2025-01-16 01:48:57,601 - INFO - step 12554, loss: 1.363646, best loss: 0.997637 2025-01-16 01:48:57,751 - INFO - step 12555, loss: 1.408424, best loss: 0.997637 2025-01-16 01:48:57,902 - INFO - step 12556, loss: 1.321299, best loss: 0.997637 2025-01-16 01:48:58,052 - INFO - step 12557, loss: 1.307540, best loss: 0.997637 2025-01-16 01:48:58,202 - INFO - step 12558, loss: 1.420232, best loss: 0.997637 2025-01-16 01:48:58,352 - INFO - step 12559, loss: 1.260400, best loss: 0.997637 2025-01-16 01:48:58,502 - INFO - step 12560, loss: 1.217885, best loss: 0.997637 2025-01-16 01:48:58,652 - INFO - step 12561, loss: 1.437164, best loss: 0.997637 2025-01-16 01:48:58,802 - INFO - step 12562, loss: 1.149371, best loss: 0.997637 2025-01-16 01:48:58,952 - INFO - step 12563, loss: 1.186110, best loss: 0.997637 2025-01-16 01:48:59,102 - INFO - step 12564, loss: 1.411922, best loss: 0.997637 2025-01-16 01:48:59,253 - INFO - step 12565, loss: 1.486801, best loss: 0.997637 2025-01-16 01:48:59,403 - INFO - step 12566, loss: 1.366912, best loss: 0.997637 2025-01-16 01:48:59,553 - INFO - step 12567, loss: 1.184140, best loss: 0.997637 2025-01-16 01:48:59,703 - INFO - step 12568, loss: 1.351862, best loss: 0.997637 2025-01-16 01:48:59,853 - INFO - step 12569, loss: 1.424860, best loss: 0.997637 2025-01-16 01:49:00,003 - INFO - step 12570, loss: 1.151596, best loss: 0.997637 2025-01-16 01:49:00,153 - INFO - step 12571, loss: 1.245486, best loss: 0.997637 2025-01-16 01:49:00,303 - INFO - step 12572, loss: 1.257936, best loss: 0.997637 2025-01-16 01:49:00,453 - INFO - step 12573, loss: 1.325887, best loss: 0.997637 2025-01-16 01:49:00,603 - INFO - step 12574, loss: 1.151916, best loss: 0.997637 
2025-01-16 01:49:00,754 - INFO - step 12575, loss: 1.129532, best loss: 0.997637 2025-01-16 01:49:00,904 - INFO - step 12576, loss: 1.419069, best loss: 0.997637 2025-01-16 01:49:01,054 - INFO - step 12577, loss: 1.369258, best loss: 0.997637 2025-01-16 01:49:01,204 - INFO - step 12578, loss: 1.340260, best loss: 0.997637 2025-01-16 01:49:01,354 - INFO - step 12579, loss: 1.192545, best loss: 0.997637 2025-01-16 01:49:01,504 - INFO - step 12580, loss: 1.393776, best loss: 0.997637 2025-01-16 01:49:01,654 - INFO - step 12581, loss: 1.382252, best loss: 0.997637 2025-01-16 01:49:01,804 - INFO - step 12582, loss: 1.345513, best loss: 0.997637 2025-01-16 01:49:01,954 - INFO - step 12583, loss: 1.225429, best loss: 0.997637 2025-01-16 01:49:02,105 - INFO - step 12584, loss: 1.269017, best loss: 0.997637 2025-01-16 01:49:02,255 - INFO - step 12585, loss: 1.255001, best loss: 0.997637 2025-01-16 01:49:02,405 - INFO - step 12586, loss: 1.452079, best loss: 0.997637 2025-01-16 01:49:02,555 - INFO - step 12587, loss: 1.306979, best loss: 0.997637 2025-01-16 01:49:02,705 - INFO - step 12588, loss: 1.292676, best loss: 0.997637 2025-01-16 01:49:02,855 - INFO - step 12589, loss: 1.152928, best loss: 0.997637 2025-01-16 01:49:03,005 - INFO - step 12590, loss: 1.197547, best loss: 0.997637 2025-01-16 01:49:03,155 - INFO - step 12591, loss: 1.329359, best loss: 0.997637 2025-01-16 01:49:03,306 - INFO - step 12592, loss: 1.329411, best loss: 0.997637 2025-01-16 01:49:03,456 - INFO - step 12593, loss: 1.390681, best loss: 0.997637 2025-01-16 01:49:03,606 - INFO - step 12594, loss: 1.276297, best loss: 0.997637 2025-01-16 01:49:03,756 - INFO - step 12595, loss: 1.273250, best loss: 0.997637 2025-01-16 01:49:03,906 - INFO - step 12596, loss: 1.313951, best loss: 0.997637 2025-01-16 01:49:04,056 - INFO - step 12597, loss: 1.306973, best loss: 0.997637 2025-01-16 01:49:04,207 - INFO - step 12598, loss: 1.100948, best loss: 0.997637 2025-01-16 01:49:04,357 - INFO - step 12599, loss: 
1.300388, best loss: 0.997637 2025-01-16 01:49:04,507 - INFO - step 12600, loss: 1.378746, best loss: 0.997637 2025-01-16 01:49:04,657 - INFO - step 12601, loss: 1.522469, best loss: 0.997637 2025-01-16 01:49:04,807 - INFO - step 12602, loss: 1.478563, best loss: 0.997637 2025-01-16 01:49:04,957 - INFO - step 12603, loss: 1.347097, best loss: 0.997637 2025-01-16 01:49:05,107 - INFO - step 12604, loss: 1.374818, best loss: 0.997637 2025-01-16 01:49:05,257 - INFO - step 12605, loss: 1.158339, best loss: 0.997637 2025-01-16 01:49:05,408 - INFO - step 12606, loss: 1.380933, best loss: 0.997637 2025-01-16 01:49:05,558 - INFO - step 12607, loss: 1.212397, best loss: 0.997637 2025-01-16 01:49:05,708 - INFO - step 12608, loss: 1.190256, best loss: 0.997637 2025-01-16 01:49:05,859 - INFO - step 12609, loss: 1.299943, best loss: 0.997637 2025-01-16 01:49:06,009 - INFO - step 12610, loss: 1.344245, best loss: 0.997637 2025-01-16 01:49:06,159 - INFO - step 12611, loss: 1.328790, best loss: 0.997637 2025-01-16 01:49:06,309 - INFO - step 12612, loss: 1.368410, best loss: 0.997637 2025-01-16 01:49:06,459 - INFO - step 12613, loss: 1.357634, best loss: 0.997637 2025-01-16 01:49:06,609 - INFO - step 12614, loss: 1.363586, best loss: 0.997637 2025-01-16 01:49:06,759 - INFO - step 12615, loss: 1.441697, best loss: 0.997637 2025-01-16 01:49:06,909 - INFO - step 12616, loss: 1.260761, best loss: 0.997637 2025-01-16 01:49:07,060 - INFO - step 12617, loss: 1.422143, best loss: 0.997637 2025-01-16 01:49:07,210 - INFO - step 12618, loss: 1.296333, best loss: 0.997637 2025-01-16 01:49:07,360 - INFO - step 12619, loss: 1.184230, best loss: 0.997637 2025-01-16 01:49:07,510 - INFO - step 12620, loss: 1.355434, best loss: 0.997637 2025-01-16 01:49:07,661 - INFO - step 12621, loss: 1.234525, best loss: 0.997637 2025-01-16 01:49:07,811 - INFO - step 12622, loss: 1.215046, best loss: 0.997637 2025-01-16 01:49:07,961 - INFO - step 12623, loss: 1.271765, best loss: 0.997637 2025-01-16 01:49:08,111 - 
INFO - step 12624, loss: 1.256472, best loss: 0.997637 2025-01-16 01:49:08,261 - INFO - step 12625, loss: 1.289001, best loss: 0.997637 2025-01-16 01:49:08,411 - INFO - step 12626, loss: 1.148472, best loss: 0.997637 2025-01-16 01:49:08,562 - INFO - step 12627, loss: 1.193039, best loss: 0.997637 2025-01-16 01:49:08,712 - INFO - step 12628, loss: 1.240697, best loss: 0.997637 2025-01-16 01:49:08,862 - INFO - step 12629, loss: 1.283449, best loss: 0.997637 2025-01-16 01:49:09,012 - INFO - step 12630, loss: 1.184041, best loss: 0.997637 2025-01-16 01:49:09,162 - INFO - step 12631, loss: 1.462112, best loss: 0.997637 2025-01-16 01:49:09,312 - INFO - step 12632, loss: 1.149706, best loss: 0.997637 2025-01-16 01:49:09,462 - INFO - step 12633, loss: 1.360801, best loss: 0.997637 2025-01-16 01:49:09,613 - INFO - step 12634, loss: 1.399674, best loss: 0.997637 2025-01-16 01:49:09,763 - INFO - step 12635, loss: 1.502091, best loss: 0.997637 2025-01-16 01:49:09,914 - INFO - step 12636, loss: 1.398192, best loss: 0.997637 2025-01-16 01:49:10,064 - INFO - step 12637, loss: 1.234955, best loss: 0.997637 2025-01-16 01:49:10,214 - INFO - step 12638, loss: 1.298550, best loss: 0.997637 2025-01-16 01:49:10,364 - INFO - step 12639, loss: 1.345401, best loss: 0.997637 2025-01-16 01:49:10,514 - INFO - step 12640, loss: 1.299501, best loss: 0.997637 2025-01-16 01:49:10,664 - INFO - step 12641, loss: 1.355840, best loss: 0.997637 2025-01-16 01:49:10,815 - INFO - step 12642, loss: 1.307064, best loss: 0.997637 2025-01-16 01:49:10,965 - INFO - step 12643, loss: 1.552910, best loss: 0.997637 2025-01-16 01:49:11,115 - INFO - step 12644, loss: 1.264742, best loss: 0.997637 2025-01-16 01:49:11,265 - INFO - step 12645, loss: 1.182883, best loss: 0.997637 2025-01-16 01:49:11,415 - INFO - step 12646, loss: 1.396136, best loss: 0.997637 2025-01-16 01:49:11,565 - INFO - step 12647, loss: 1.236159, best loss: 0.997637 2025-01-16 01:49:11,716 - INFO - step 12648, loss: 1.217592, best loss: 0.997637 
2025-01-16 01:49:11,866 - INFO - step 12649, loss: 1.415274, best loss: 0.997637 2025-01-16 01:49:12,016 - INFO - step 12650, loss: 1.315286, best loss: 0.997637 2025-01-16 01:49:12,166 - INFO - step 12651, loss: 1.486364, best loss: 0.997637 2025-01-16 01:49:12,316 - INFO - step 12652, loss: 1.263561, best loss: 0.997637 2025-01-16 01:49:12,466 - INFO - step 12653, loss: 1.335587, best loss: 0.997637 2025-01-16 01:49:12,616 - INFO - step 12654, loss: 1.439076, best loss: 0.997637 2025-01-16 01:49:12,766 - INFO - step 12655, loss: 1.310650, best loss: 0.997637 2025-01-16 01:49:12,916 - INFO - step 12656, loss: 1.363392, best loss: 0.997637 2025-01-16 01:49:13,066 - INFO - step 12657, loss: 1.383922, best loss: 0.997637 2025-01-16 01:49:13,217 - INFO - step 12658, loss: 1.238639, best loss: 0.997637 2025-01-16 01:49:13,367 - INFO - step 12659, loss: 1.337035, best loss: 0.997637 2025-01-16 01:49:13,517 - INFO - step 12660, loss: 1.365346, best loss: 0.997637 2025-01-16 01:49:13,667 - INFO - step 12661, loss: 1.328797, best loss: 0.997637 2025-01-16 01:49:13,817 - INFO - step 12662, loss: 1.300098, best loss: 0.997637 2025-01-16 01:49:13,968 - INFO - step 12663, loss: 1.203501, best loss: 0.997637 2025-01-16 01:49:14,118 - INFO - step 12664, loss: 1.193972, best loss: 0.997637 2025-01-16 01:49:14,268 - INFO - step 12665, loss: 1.228757, best loss: 0.997637 2025-01-16 01:49:14,418 - INFO - step 12666, loss: 1.316808, best loss: 0.997637 2025-01-16 01:49:14,569 - INFO - step 12667, loss: 1.469214, best loss: 0.997637 2025-01-16 01:49:14,719 - INFO - step 12668, loss: 1.236190, best loss: 0.997637 2025-01-16 01:49:14,869 - INFO - step 12669, loss: 1.075891, best loss: 0.997637 2025-01-16 01:49:15,019 - INFO - step 12670, loss: 1.319862, best loss: 0.997637 2025-01-16 01:49:15,169 - INFO - step 12671, loss: 1.309225, best loss: 0.997637 2025-01-16 01:49:15,319 - INFO - step 12672, loss: 1.524743, best loss: 0.997637 2025-01-16 01:49:15,470 - INFO - step 12673, loss: 
1.127628, best loss: 0.997637 2025-01-16 01:49:15,620 - INFO - step 12674, loss: 1.340368, best loss: 0.997637 2025-01-16 01:49:15,770 - INFO - step 12675, loss: 1.128684, best loss: 0.997637 2025-01-16 01:49:15,920 - INFO - step 12676, loss: 1.132443, best loss: 0.997637 2025-01-16 01:49:16,070 - INFO - step 12677, loss: 1.317442, best loss: 0.997637 2025-01-16 01:49:16,220 - INFO - step 12678, loss: 1.308484, best loss: 0.997637 2025-01-16 01:49:16,370 - INFO - step 12679, loss: 1.338325, best loss: 0.997637 2025-01-16 01:49:16,520 - INFO - step 12680, loss: 1.397551, best loss: 0.997637 2025-01-16 01:49:16,670 - INFO - step 12681, loss: 1.259216, best loss: 0.997637 2025-01-16 01:49:16,821 - INFO - step 12682, loss: 1.356158, best loss: 0.997637 2025-01-16 01:49:16,971 - INFO - step 12683, loss: 1.224421, best loss: 0.997637 2025-01-16 01:49:20,503 - INFO - step 12684, loss: 0.958429, best loss: 0.958429 2025-01-16 01:49:20,665 - INFO - step 12685, loss: 1.243337, best loss: 0.958429 2025-01-16 01:49:20,817 - INFO - step 12686, loss: 1.501348, best loss: 0.958429 2025-01-16 01:49:20,968 - INFO - step 12687, loss: 1.341530, best loss: 0.958429 2025-01-16 01:49:21,118 - INFO - step 12688, loss: 1.243016, best loss: 0.958429 2025-01-16 01:49:21,268 - INFO - step 12689, loss: 1.244274, best loss: 0.958429 2025-01-16 01:49:21,418 - INFO - step 12690, loss: 1.406858, best loss: 0.958429 2025-01-16 01:49:21,569 - INFO - step 12691, loss: 1.418227, best loss: 0.958429 2025-01-16 01:49:21,719 - INFO - step 12692, loss: 1.342179, best loss: 0.958429 2025-01-16 01:49:21,869 - INFO - step 12693, loss: 1.225664, best loss: 0.958429 2025-01-16 01:49:22,019 - INFO - step 12694, loss: 1.400542, best loss: 0.958429 2025-01-16 01:49:22,169 - INFO - step 12695, loss: 1.419369, best loss: 0.958429 2025-01-16 01:49:22,319 - INFO - step 12696, loss: 1.329898, best loss: 0.958429 2025-01-16 01:49:22,469 - INFO - step 12697, loss: 1.313753, best loss: 0.958429 2025-01-16 01:49:22,620 - 
INFO - step 12698, loss: 1.328376, best loss: 0.958429 2025-01-16 01:49:22,770 - INFO - step 12699, loss: 1.340594, best loss: 0.958429 2025-01-16 01:49:22,920 - INFO - step 12700, loss: 1.242733, best loss: 0.958429 2025-01-16 01:49:23,070 - INFO - step 12701, loss: 1.297851, best loss: 0.958429 2025-01-16 01:49:23,220 - INFO - step 12702, loss: 1.219839, best loss: 0.958429 2025-01-16 01:49:23,370 - INFO - step 12703, loss: 1.065486, best loss: 0.958429 2025-01-16 01:49:23,521 - INFO - step 12704, loss: 1.315989, best loss: 0.958429 2025-01-16 01:49:23,671 - INFO - step 12705, loss: 1.231458, best loss: 0.958429 2025-01-16 01:49:23,821 - INFO - step 12706, loss: 1.498506, best loss: 0.958429 2025-01-16 01:49:23,971 - INFO - step 12707, loss: 1.283141, best loss: 0.958429 2025-01-16 01:49:24,121 - INFO - step 12708, loss: 1.302658, best loss: 0.958429 2025-01-16 01:49:24,271 - INFO - step 12709, loss: 1.364460, best loss: 0.958429 2025-01-16 01:49:24,421 - INFO - step 12710, loss: 1.261526, best loss: 0.958429 2025-01-16 01:49:24,572 - INFO - step 12711, loss: 1.132625, best loss: 0.958429 2025-01-16 01:49:24,722 - INFO - step 12712, loss: 1.262041, best loss: 0.958429 2025-01-16 01:49:24,872 - INFO - step 12713, loss: 1.338307, best loss: 0.958429 2025-01-16 01:49:25,022 - INFO - step 12714, loss: 1.370268, best loss: 0.958429 2025-01-16 01:49:25,173 - INFO - step 12715, loss: 1.285195, best loss: 0.958429 2025-01-16 01:49:25,323 - INFO - step 12716, loss: 1.502477, best loss: 0.958429 2025-01-16 01:49:25,473 - INFO - step 12717, loss: 1.449523, best loss: 0.958429 2025-01-16 01:49:25,623 - INFO - step 12718, loss: 1.427215, best loss: 0.958429 2025-01-16 01:49:25,774 - INFO - step 12719, loss: 1.425731, best loss: 0.958429 2025-01-16 01:49:25,924 - INFO - step 12720, loss: 1.546900, best loss: 0.958429 2025-01-16 01:49:26,074 - INFO - step 12721, loss: 1.293588, best loss: 0.958429 2025-01-16 01:49:26,224 - INFO - step 12722, loss: 1.420976, best loss: 0.958429 
2025-01-16 01:49:26,375 - INFO - step 12723, loss: 1.521513, best loss: 0.958429 2025-01-16 01:49:26,525 - INFO - step 12724, loss: 1.349572, best loss: 0.958429 2025-01-16 01:49:26,675 - INFO - step 12725, loss: 1.327908, best loss: 0.958429 2025-01-16 01:49:26,825 - INFO - step 12726, loss: 1.516107, best loss: 0.958429 2025-01-16 01:49:26,975 - INFO - step 12727, loss: 1.449655, best loss: 0.958429 2025-01-16 01:49:27,126 - INFO - step 12728, loss: 1.034106, best loss: 0.958429 2025-01-16 01:49:27,276 - INFO - step 12729, loss: 1.397427, best loss: 0.958429 2025-01-16 01:49:27,426 - INFO - step 12730, loss: 1.345184, best loss: 0.958429 2025-01-16 01:49:27,576 - INFO - step 12731, loss: 1.333467, best loss: 0.958429 2025-01-16 01:49:27,726 - INFO - step 12732, loss: 1.496800, best loss: 0.958429 2025-01-16 01:49:27,876 - INFO - step 12733, loss: 1.354885, best loss: 0.958429 2025-01-16 01:49:28,027 - INFO - step 12734, loss: 1.368628, best loss: 0.958429 2025-01-16 01:49:28,177 - INFO - step 12735, loss: 1.408906, best loss: 0.958429 2025-01-16 01:49:28,327 - INFO - step 12736, loss: 1.427649, best loss: 0.958429 2025-01-16 01:49:28,477 - INFO - step 12737, loss: 1.235559, best loss: 0.958429 2025-01-16 01:49:28,628 - INFO - step 12738, loss: 1.275495, best loss: 0.958429 2025-01-16 01:49:28,778 - INFO - step 12739, loss: 1.167588, best loss: 0.958429 2025-01-16 01:49:28,928 - INFO - step 12740, loss: 1.332050, best loss: 0.958429 2025-01-16 01:49:29,078 - INFO - step 12741, loss: 1.277208, best loss: 0.958429 2025-01-16 01:49:29,229 - INFO - step 12742, loss: 1.325452, best loss: 0.958429 2025-01-16 01:49:29,379 - INFO - step 12743, loss: 1.381300, best loss: 0.958429 2025-01-16 01:49:29,529 - INFO - step 12744, loss: 1.319727, best loss: 0.958429 2025-01-16 01:49:29,679 - INFO - step 12745, loss: 1.232039, best loss: 0.958429 2025-01-16 01:49:29,829 - INFO - step 12746, loss: 1.121693, best loss: 0.958429 2025-01-16 01:49:29,980 - INFO - step 12747, loss: 
1.251112, best loss: 0.958429 2025-01-16 01:49:30,130 - INFO - step 12748, loss: 1.327727, best loss: 0.958429 2025-01-16 01:49:30,280 - INFO - step 12749, loss: 1.349038, best loss: 0.958429 2025-01-16 01:49:30,430 - INFO - step 12750, loss: 1.360114, best loss: 0.958429 2025-01-16 01:49:30,580 - INFO - step 12751, loss: 1.194696, best loss: 0.958429 2025-01-16 01:49:30,730 - INFO - step 12752, loss: 1.174869, best loss: 0.958429 2025-01-16 01:49:30,880 - INFO - step 12753, loss: 1.193576, best loss: 0.958429 2025-01-16 01:49:31,030 - INFO - step 12754, loss: 1.308937, best loss: 0.958429 2025-01-16 01:49:31,180 - INFO - step 12755, loss: 1.329002, best loss: 0.958429 2025-01-16 01:49:31,330 - INFO - step 12756, loss: 1.431053, best loss: 0.958429 2025-01-16 01:49:31,481 - INFO - step 12757, loss: 1.391143, best loss: 0.958429 2025-01-16 01:49:31,631 - INFO - step 12758, loss: 1.276593, best loss: 0.958429 2025-01-16 01:49:31,781 - INFO - step 12759, loss: 1.317859, best loss: 0.958429 2025-01-16 01:49:31,931 - INFO - step 12760, loss: 1.356261, best loss: 0.958429 2025-01-16 01:49:32,082 - INFO - step 12761, loss: 1.433282, best loss: 0.958429 2025-01-16 01:49:32,232 - INFO - step 12762, loss: 1.447484, best loss: 0.958429 2025-01-16 01:49:32,382 - INFO - step 12763, loss: 1.526520, best loss: 0.958429 2025-01-16 01:49:32,532 - INFO - step 12764, loss: 1.571715, best loss: 0.958429 2025-01-16 01:49:32,682 - INFO - step 12765, loss: 1.286416, best loss: 0.958429 2025-01-16 01:49:32,832 - INFO - step 12766, loss: 1.394620, best loss: 0.958429 2025-01-16 01:49:32,982 - INFO - step 12767, loss: 1.209630, best loss: 0.958429 2025-01-16 01:49:33,133 - INFO - step 12768, loss: 1.124147, best loss: 0.958429 2025-01-16 01:49:33,283 - INFO - step 12769, loss: 1.351741, best loss: 0.958429 2025-01-16 01:49:33,433 - INFO - step 12770, loss: 1.333429, best loss: 0.958429 2025-01-16 01:49:33,583 - INFO - step 12771, loss: 1.090459, best loss: 0.958429 2025-01-16 01:49:33,733 - 
INFO - step 12772, loss: 1.127507, best loss: 0.958429 2025-01-16 01:49:33,884 - INFO - step 12773, loss: 1.364469, best loss: 0.958429 2025-01-16 01:49:34,034 - INFO - step 12774, loss: 1.475593, best loss: 0.958429 2025-01-16 01:49:34,184 - INFO - step 12775, loss: 1.326966, best loss: 0.958429 2025-01-16 01:49:34,334 - INFO - step 12776, loss: 1.357063, best loss: 0.958429 2025-01-16 01:49:34,484 - INFO - step 12777, loss: 1.240414, best loss: 0.958429 2025-01-16 01:49:34,635 - INFO - step 12778, loss: 1.084200, best loss: 0.958429 2025-01-16 01:49:34,785 - INFO - step 12779, loss: 1.405575, best loss: 0.958429 2025-01-16 01:49:34,935 - INFO - step 12780, loss: 1.561461, best loss: 0.958429 2025-01-16 01:49:35,085 - INFO - step 12781, loss: 1.484794, best loss: 0.958429 2025-01-16 01:49:35,235 - INFO - step 12782, loss: 1.327952, best loss: 0.958429 2025-01-16 01:49:35,386 - INFO - step 12783, loss: 1.229435, best loss: 0.958429 2025-01-16 01:49:35,536 - INFO - step 12784, loss: 1.405412, best loss: 0.958429 2025-01-16 01:49:35,686 - INFO - step 12785, loss: 1.305250, best loss: 0.958429 2025-01-16 01:49:35,836 - INFO - step 12786, loss: 1.246152, best loss: 0.958429 2025-01-16 01:49:35,987 - INFO - step 12787, loss: 1.364098, best loss: 0.958429 2025-01-16 01:49:36,137 - INFO - step 12788, loss: 1.212885, best loss: 0.958429 2025-01-16 01:49:36,287 - INFO - step 12789, loss: 0.981581, best loss: 0.958429 2025-01-16 01:49:36,438 - INFO - step 12790, loss: 1.295574, best loss: 0.958429 2025-01-16 01:49:36,588 - INFO - step 12791, loss: 1.239939, best loss: 0.958429 2025-01-16 01:49:36,738 - INFO - step 12792, loss: 1.300421, best loss: 0.958429 2025-01-16 01:49:36,888 - INFO - step 12793, loss: 1.015153, best loss: 0.958429 2025-01-16 01:49:37,039 - INFO - step 12794, loss: 1.102528, best loss: 0.958429 2025-01-16 01:49:37,189 - INFO - step 12795, loss: 0.972740, best loss: 0.958429 2025-01-16 01:49:37,339 - INFO - step 12796, loss: 1.148742, best loss: 0.958429 
2025-01-16 01:49:37,489 - INFO - step 12797, loss: 1.271331, best loss: 0.958429 2025-01-16 01:49:37,639 - INFO - step 12798, loss: 1.258873, best loss: 0.958429 2025-01-16 01:49:37,789 - INFO - step 12799, loss: 1.316745, best loss: 0.958429 2025-01-16 01:49:37,940 - INFO - step 12800, loss: 1.285034, best loss: 0.958429 2025-01-16 01:49:38,090 - INFO - step 12801, loss: 1.224513, best loss: 0.958429 2025-01-16 01:49:38,240 - INFO - step 12802, loss: 1.320113, best loss: 0.958429 2025-01-16 01:49:38,390 - INFO - step 12803, loss: 1.108283, best loss: 0.958429 2025-01-16 01:49:38,540 - INFO - step 12804, loss: 1.127359, best loss: 0.958429 2025-01-16 01:49:38,690 - INFO - step 12805, loss: 1.218240, best loss: 0.958429 2025-01-16 01:49:38,840 - INFO - step 12806, loss: 1.048683, best loss: 0.958429 2025-01-16 01:49:38,991 - INFO - step 12807, loss: 1.039644, best loss: 0.958429 2025-01-16 01:49:39,141 - INFO - step 12808, loss: 1.080204, best loss: 0.958429 2025-01-16 01:49:39,291 - INFO - step 12809, loss: 1.200382, best loss: 0.958429 2025-01-16 01:49:39,441 - INFO - step 12810, loss: 1.161445, best loss: 0.958429 2025-01-16 01:49:39,591 - INFO - step 12811, loss: 1.093167, best loss: 0.958429 2025-01-16 01:49:39,742 - INFO - step 12812, loss: 1.136220, best loss: 0.958429 2025-01-16 01:49:39,892 - INFO - step 12813, loss: 1.080761, best loss: 0.958429 2025-01-16 01:49:40,042 - INFO - step 12814, loss: 1.174264, best loss: 0.958429 2025-01-16 01:49:40,192 - INFO - step 12815, loss: 1.345505, best loss: 0.958429 2025-01-16 01:49:40,343 - INFO - step 12816, loss: 1.089412, best loss: 0.958429 2025-01-16 01:49:40,493 - INFO - step 12817, loss: 1.156176, best loss: 0.958429 2025-01-16 01:49:40,643 - INFO - step 12818, loss: 1.135593, best loss: 0.958429 2025-01-16 01:49:40,794 - INFO - step 12819, loss: 1.316730, best loss: 0.958429 2025-01-16 01:49:40,944 - INFO - step 12820, loss: 1.224305, best loss: 0.958429 2025-01-16 01:49:41,094 - INFO - step 12821, loss: 
1.093891, best loss: 0.958429 2025-01-16 01:49:41,244 - INFO - step 12822, loss: 1.230030, best loss: 0.958429 2025-01-16 01:49:41,395 - INFO - step 12823, loss: 1.219725, best loss: 0.958429 2025-01-16 01:49:41,545 - INFO - step 12824, loss: 1.349275, best loss: 0.958429 2025-01-16 01:49:41,695 - INFO - step 12825, loss: 1.299263, best loss: 0.958429 2025-01-16 01:49:41,845 - INFO - step 12826, loss: 1.338210, best loss: 0.958429 2025-01-16 01:49:41,995 - INFO - step 12827, loss: 1.236425, best loss: 0.958429 2025-01-16 01:49:42,146 - INFO - step 12828, loss: 1.265957, best loss: 0.958429 2025-01-16 01:49:42,296 - INFO - step 12829, loss: 1.208025, best loss: 0.958429 2025-01-16 01:49:42,446 - INFO - step 12830, loss: 1.158293, best loss: 0.958429 2025-01-16 01:49:42,596 - INFO - step 12831, loss: 1.203624, best loss: 0.958429 2025-01-16 01:49:42,747 - INFO - step 12832, loss: 1.207999, best loss: 0.958429 2025-01-16 01:49:42,897 - INFO - step 12833, loss: 1.109346, best loss: 0.958429 2025-01-16 01:49:43,047 - INFO - step 12834, loss: 1.183385, best loss: 0.958429 2025-01-16 01:49:43,197 - INFO - step 12835, loss: 1.167510, best loss: 0.958429 2025-01-16 01:49:43,348 - INFO - step 12836, loss: 1.283678, best loss: 0.958429 2025-01-16 01:49:43,498 - INFO - step 12837, loss: 1.032409, best loss: 0.958429 2025-01-16 01:49:43,648 - INFO - step 12838, loss: 1.285993, best loss: 0.958429 2025-01-16 01:49:43,798 - INFO - step 12839, loss: 1.229789, best loss: 0.958429 2025-01-16 01:49:43,948 - INFO - step 12840, loss: 1.212732, best loss: 0.958429 2025-01-16 01:49:44,098 - INFO - step 12841, loss: 1.230090, best loss: 0.958429 2025-01-16 01:49:44,248 - INFO - step 12842, loss: 1.151193, best loss: 0.958429 2025-01-16 01:49:44,398 - INFO - step 12843, loss: 1.203936, best loss: 0.958429 2025-01-16 01:49:44,549 - INFO - step 12844, loss: 1.179768, best loss: 0.958429 2025-01-16 01:49:44,699 - INFO - step 12845, loss: 1.113451, best loss: 0.958429 2025-01-16 01:49:44,849 - 
INFO - step 12846, loss: 1.089182, best loss: 0.958429 2025-01-16 01:49:45,000 - INFO - step 12847, loss: 1.180735, best loss: 0.958429 2025-01-16 01:49:45,150 - INFO - step 12848, loss: 1.142674, best loss: 0.958429 2025-01-16 01:49:45,300 - INFO - step 12849, loss: 1.150341, best loss: 0.958429 2025-01-16 01:49:45,450 - INFO - step 12850, loss: 1.069449, best loss: 0.958429 2025-01-16 01:49:45,600 - INFO - step 12851, loss: 1.194312, best loss: 0.958429 2025-01-16 01:49:45,750 - INFO - step 12852, loss: 1.328049, best loss: 0.958429 2025-01-16 01:49:45,900 - INFO - step 12853, loss: 1.213245, best loss: 0.958429 2025-01-16 01:49:46,051 - INFO - step 12854, loss: 1.230620, best loss: 0.958429 2025-01-16 01:49:46,201 - INFO - step 12855, loss: 1.027391, best loss: 0.958429 2025-01-16 01:49:46,351 - INFO - step 12856, loss: 1.037368, best loss: 0.958429 2025-01-16 01:49:46,501 - INFO - step 12857, loss: 1.018422, best loss: 0.958429 2025-01-16 01:49:46,652 - INFO - step 12858, loss: 1.250358, best loss: 0.958429 2025-01-16 01:49:46,802 - INFO - step 12859, loss: 1.253329, best loss: 0.958429 2025-01-16 01:49:46,952 - INFO - step 12860, loss: 1.315329, best loss: 0.958429 2025-01-16 01:49:47,102 - INFO - step 12861, loss: 1.243414, best loss: 0.958429 2025-01-16 01:49:47,252 - INFO - step 12862, loss: 1.213581, best loss: 0.958429 2025-01-16 01:49:47,403 - INFO - step 12863, loss: 1.127575, best loss: 0.958429 2025-01-16 01:49:47,553 - INFO - step 12864, loss: 1.109607, best loss: 0.958429 2025-01-16 01:49:47,703 - INFO - step 12865, loss: 1.307753, best loss: 0.958429 2025-01-16 01:49:47,853 - INFO - step 12866, loss: 1.369441, best loss: 0.958429 2025-01-16 01:49:51,432 - INFO - step 12867, loss: 0.940304, best loss: 0.940304 2025-01-16 01:49:51,582 - INFO - step 12868, loss: 1.068635, best loss: 0.940304 2025-01-16 01:49:51,732 - INFO - step 12869, loss: 1.057016, best loss: 0.940304 2025-01-16 01:49:51,882 - INFO - step 12870, loss: 1.241528, best loss: 0.940304 
2025-01-16 01:49:52,033 - INFO - step 12871, loss: 1.261536, best loss: 0.940304 2025-01-16 01:49:52,183 - INFO - step 12872, loss: 1.263545, best loss: 0.940304 2025-01-16 01:49:52,333 - INFO - step 12873, loss: 1.316484, best loss: 0.940304 2025-01-16 01:49:52,483 - INFO - step 12874, loss: 1.254465, best loss: 0.940304 2025-01-16 01:49:52,633 - INFO - step 12875, loss: 1.086186, best loss: 0.940304 2025-01-16 01:49:52,783 - INFO - step 12876, loss: 1.332309, best loss: 0.940304 2025-01-16 01:49:52,934 - INFO - step 12877, loss: 1.213638, best loss: 0.940304 2025-01-16 01:49:53,084 - INFO - step 12878, loss: 1.360481, best loss: 0.940304 2025-01-16 01:49:53,234 - INFO - step 12879, loss: 1.301310, best loss: 0.940304 2025-01-16 01:49:53,384 - INFO - step 12880, loss: 1.130734, best loss: 0.940304 2025-01-16 01:49:53,534 - INFO - step 12881, loss: 1.167221, best loss: 0.940304 2025-01-16 01:49:53,684 - INFO - step 12882, loss: 1.076995, best loss: 0.940304 2025-01-16 01:49:53,835 - INFO - step 12883, loss: 1.291357, best loss: 0.940304 2025-01-16 01:49:53,985 - INFO - step 12884, loss: 1.312832, best loss: 0.940304 2025-01-16 01:49:54,135 - INFO - step 12885, loss: 1.344570, best loss: 0.940304 2025-01-16 01:49:54,285 - INFO - step 12886, loss: 1.379819, best loss: 0.940304 2025-01-16 01:49:54,435 - INFO - step 12887, loss: 1.244741, best loss: 0.940304 2025-01-16 01:49:54,586 - INFO - step 12888, loss: 1.290331, best loss: 0.940304 2025-01-16 01:49:54,736 - INFO - step 12889, loss: 1.242097, best loss: 0.940304 2025-01-16 01:49:54,886 - INFO - step 12890, loss: 1.147052, best loss: 0.940304 2025-01-16 01:49:55,036 - INFO - step 12891, loss: 1.347445, best loss: 0.940304 2025-01-16 01:49:55,186 - INFO - step 12892, loss: 1.115653, best loss: 0.940304 2025-01-16 01:49:55,336 - INFO - step 12893, loss: 1.141034, best loss: 0.940304 2025-01-16 01:49:55,486 - INFO - step 12894, loss: 1.300691, best loss: 0.940304 2025-01-16 01:49:55,636 - INFO - step 12895, loss: 
1.388933, best loss: 0.940304 2025-01-16 01:49:55,786 - INFO - step 12896, loss: 1.326439, best loss: 0.940304 2025-01-16 01:49:55,936 - INFO - step 12897, loss: 1.107449, best loss: 0.940304 2025-01-16 01:49:56,086 - INFO - step 12898, loss: 1.270744, best loss: 0.940304 2025-01-16 01:49:56,236 - INFO - step 12899, loss: 1.361089, best loss: 0.940304 2025-01-16 01:49:56,386 - INFO - step 12900, loss: 1.148104, best loss: 0.940304 2025-01-16 01:49:56,536 - INFO - step 12901, loss: 1.125049, best loss: 0.940304 2025-01-16 01:49:56,686 - INFO - step 12902, loss: 1.248870, best loss: 0.940304 2025-01-16 01:49:56,837 - INFO - step 12903, loss: 1.269994, best loss: 0.940304 2025-01-16 01:49:56,987 - INFO - step 12904, loss: 1.060189, best loss: 0.940304 2025-01-16 01:49:57,137 - INFO - step 12905, loss: 1.029799, best loss: 0.940304 2025-01-16 01:49:57,287 - INFO - step 12906, loss: 1.300521, best loss: 0.940304 2025-01-16 01:49:57,438 - INFO - step 12907, loss: 1.235189, best loss: 0.940304 2025-01-16 01:49:57,588 - INFO - step 12908, loss: 1.196284, best loss: 0.940304 2025-01-16 01:49:57,738 - INFO - step 12909, loss: 1.077547, best loss: 0.940304 2025-01-16 01:49:57,888 - INFO - step 12910, loss: 1.280729, best loss: 0.940304 2025-01-16 01:49:58,039 - INFO - step 12911, loss: 1.284219, best loss: 0.940304 2025-01-16 01:49:58,189 - INFO - step 12912, loss: 1.263480, best loss: 0.940304 2025-01-16 01:49:58,339 - INFO - step 12913, loss: 1.123187, best loss: 0.940304 2025-01-16 01:49:58,489 - INFO - step 12914, loss: 1.236525, best loss: 0.940304 2025-01-16 01:49:58,639 - INFO - step 12915, loss: 1.131892, best loss: 0.940304 2025-01-16 01:49:58,789 - INFO - step 12916, loss: 1.267368, best loss: 0.940304 2025-01-16 01:49:58,939 - INFO - step 12917, loss: 1.287599, best loss: 0.940304 2025-01-16 01:49:59,089 - INFO - step 12918, loss: 1.230875, best loss: 0.940304 2025-01-16 01:49:59,239 - INFO - step 12919, loss: 1.000277, best loss: 0.940304 2025-01-16 01:49:59,389 - 
INFO - step 12920, loss: 1.165443, best loss: 0.940304 2025-01-16 01:49:59,540 - INFO - step 12921, loss: 1.230469, best loss: 0.940304 2025-01-16 01:49:59,690 - INFO - step 12922, loss: 1.270253, best loss: 0.940304 2025-01-16 01:49:59,840 - INFO - step 12923, loss: 1.313718, best loss: 0.940304 2025-01-16 01:49:59,990 - INFO - step 12924, loss: 1.228695, best loss: 0.940304 2025-01-16 01:50:00,140 - INFO - step 12925, loss: 1.236524, best loss: 0.940304 2025-01-16 01:50:00,290 - INFO - step 12926, loss: 1.269357, best loss: 0.940304 2025-01-16 01:50:00,440 - INFO - step 12927, loss: 1.229908, best loss: 0.940304 2025-01-16 01:50:00,591 - INFO - step 12928, loss: 1.107052, best loss: 0.940304 2025-01-16 01:50:00,741 - INFO - step 12929, loss: 1.303039, best loss: 0.940304 2025-01-16 01:50:00,891 - INFO - step 12930, loss: 1.315577, best loss: 0.940304 2025-01-16 01:50:01,041 - INFO - step 12931, loss: 1.472828, best loss: 0.940304 2025-01-16 01:50:01,191 - INFO - step 12932, loss: 1.378684, best loss: 0.940304 2025-01-16 01:50:01,341 - INFO - step 12933, loss: 1.294421, best loss: 0.940304 2025-01-16 01:50:01,491 - INFO - step 12934, loss: 1.266379, best loss: 0.940304 2025-01-16 01:50:01,641 - INFO - step 12935, loss: 1.134313, best loss: 0.940304 2025-01-16 01:50:01,791 - INFO - step 12936, loss: 1.335816, best loss: 0.940304 2025-01-16 01:50:01,941 - INFO - step 12937, loss: 1.171509, best loss: 0.940304 2025-01-16 01:50:02,091 - INFO - step 12938, loss: 1.173134, best loss: 0.940304 2025-01-16 01:50:02,241 - INFO - step 12939, loss: 1.234837, best loss: 0.940304 2025-01-16 01:50:02,391 - INFO - step 12940, loss: 1.245261, best loss: 0.940304 2025-01-16 01:50:02,541 - INFO - step 12941, loss: 1.200640, best loss: 0.940304 2025-01-16 01:50:02,691 - INFO - step 12942, loss: 1.285469, best loss: 0.940304 2025-01-16 01:50:02,841 - INFO - step 12943, loss: 1.357217, best loss: 0.940304 2025-01-16 01:50:02,991 - INFO - step 12944, loss: 1.324365, best loss: 0.940304 
2025-01-16 01:50:03,141 - INFO - step 12945, loss: 1.249975, best loss: 0.940304 2025-01-16 01:50:03,291 - INFO - step 12946, loss: 1.259877, best loss: 0.940304 2025-01-16 01:50:03,442 - INFO - step 12947, loss: 1.270822, best loss: 0.940304 2025-01-16 01:50:03,592 - INFO - step 12948, loss: 1.258298, best loss: 0.940304 2025-01-16 01:50:03,742 - INFO - step 12949, loss: 1.170275, best loss: 0.940304 2025-01-16 01:50:03,892 - INFO - step 12950, loss: 1.368357, best loss: 0.940304 2025-01-16 01:50:04,042 - INFO - step 12951, loss: 1.179254, best loss: 0.940304 2025-01-16 01:50:04,192 - INFO - step 12952, loss: 1.104409, best loss: 0.940304 2025-01-16 01:50:04,342 - INFO - step 12953, loss: 1.145706, best loss: 0.940304 2025-01-16 01:50:04,492 - INFO - step 12954, loss: 1.184836, best loss: 0.940304 2025-01-16 01:50:04,643 - INFO - step 12955, loss: 1.221962, best loss: 0.940304 2025-01-16 01:50:04,792 - INFO - step 12956, loss: 1.056478, best loss: 0.940304 2025-01-16 01:50:04,943 - INFO - step 12957, loss: 1.123889, best loss: 0.940304 2025-01-16 01:50:05,093 - INFO - step 12958, loss: 1.182130, best loss: 0.940304 2025-01-16 01:50:05,243 - INFO - step 12959, loss: 1.187522, best loss: 0.940304 2025-01-16 01:50:05,393 - INFO - step 12960, loss: 1.079105, best loss: 0.940304 2025-01-16 01:50:05,543 - INFO - step 12961, loss: 1.348575, best loss: 0.940304 2025-01-16 01:50:05,693 - INFO - step 12962, loss: 1.071893, best loss: 0.940304 2025-01-16 01:50:05,843 - INFO - step 12963, loss: 1.252368, best loss: 0.940304 2025-01-16 01:50:05,993 - INFO - step 12964, loss: 1.308956, best loss: 0.940304 2025-01-16 01:50:06,143 - INFO - step 12965, loss: 1.457004, best loss: 0.940304 2025-01-16 01:50:06,294 - INFO - step 12966, loss: 1.460369, best loss: 0.940304 2025-01-16 01:50:06,444 - INFO - step 12967, loss: 1.239958, best loss: 0.940304 2025-01-16 01:50:06,594 - INFO - step 12968, loss: 1.224156, best loss: 0.940304 2025-01-16 01:50:06,744 - INFO - step 12969, loss: 
1.238787, best loss: 0.940304 2025-01-16 01:50:06,894 - INFO - step 12970, loss: 1.217125, best loss: 0.940304 2025-01-16 01:50:07,044 - INFO - step 12971, loss: 1.287930, best loss: 0.940304 2025-01-16 01:50:07,194 - INFO - step 12972, loss: 1.246271, best loss: 0.940304 2025-01-16 01:50:07,344 - INFO - step 12973, loss: 1.451301, best loss: 0.940304 2025-01-16 01:50:07,494 - INFO - step 12974, loss: 1.174301, best loss: 0.940304 2025-01-16 01:50:07,644 - INFO - step 12975, loss: 1.112389, best loss: 0.940304 2025-01-16 01:50:07,794 - INFO - step 12976, loss: 1.246931, best loss: 0.940304 2025-01-16 01:50:07,945 - INFO - step 12977, loss: 1.248618, best loss: 0.940304 2025-01-16 01:50:08,096 - INFO - step 12978, loss: 1.156528, best loss: 0.940304 2025-01-16 01:50:08,246 - INFO - step 12979, loss: 1.323575, best loss: 0.940304 2025-01-16 01:50:08,396 - INFO - step 12980, loss: 1.214299, best loss: 0.940304 2025-01-16 01:50:08,546 - INFO - step 12981, loss: 1.341050, best loss: 0.940304 2025-01-16 01:50:08,697 - INFO - step 12982, loss: 1.257319, best loss: 0.940304 2025-01-16 01:50:08,847 - INFO - step 12983, loss: 1.158864, best loss: 0.940304 2025-01-16 01:50:08,997 - INFO - step 12984, loss: 1.413437, best loss: 0.940304 2025-01-16 01:50:09,147 - INFO - step 12985, loss: 1.282518, best loss: 0.940304 2025-01-16 01:50:09,297 - INFO - step 12986, loss: 1.276945, best loss: 0.940304 2025-01-16 01:50:09,447 - INFO - step 12987, loss: 1.258402, best loss: 0.940304 2025-01-16 01:50:09,597 - INFO - step 12988, loss: 1.176441, best loss: 0.940304 2025-01-16 01:50:09,748 - INFO - step 12989, loss: 1.193189, best loss: 0.940304 2025-01-16 01:50:09,898 - INFO - step 12990, loss: 1.221863, best loss: 0.940304 2025-01-16 01:50:10,048 - INFO - step 12991, loss: 1.175301, best loss: 0.940304 2025-01-16 01:50:10,198 - INFO - step 12992, loss: 1.275781, best loss: 0.940304 2025-01-16 01:50:10,348 - INFO - step 12993, loss: 1.125173, best loss: 0.940304 2025-01-16 01:50:10,498 - 
INFO - step 12994, loss: 1.084802, best loss: 0.940304 2025-01-16 01:50:10,649 - INFO - step 12995, loss: 1.145142, best loss: 0.940304 2025-01-16 01:50:10,799 - INFO - step 12996, loss: 1.230380, best loss: 0.940304 2025-01-16 01:50:10,949 - INFO - step 12997, loss: 1.416229, best loss: 0.940304 2025-01-16 01:50:11,099 - INFO - step 12998, loss: 1.115677, best loss: 0.940304 2025-01-16 01:50:11,249 - INFO - step 12999, loss: 1.027046, best loss: 0.940304 2025-01-16 01:50:11,399 - INFO - step 13000, loss: 1.202648, best loss: 0.940304 2025-01-16 01:50:11,549 - INFO - step 13001, loss: 1.303497, best loss: 0.940304 2025-01-16 01:50:11,699 - INFO - step 13002, loss: 1.466718, best loss: 0.940304 2025-01-16 01:50:11,849 - INFO - step 13003, loss: 1.087374, best loss: 0.940304 2025-01-16 01:50:12,000 - INFO - step 13004, loss: 1.334418, best loss: 0.940304 2025-01-16 01:50:12,150 - INFO - step 13005, loss: 1.150279, best loss: 0.940304 2025-01-16 01:50:12,300 - INFO - step 13006, loss: 1.043360, best loss: 0.940304 2025-01-16 01:50:12,450 - INFO - step 13007, loss: 1.206832, best loss: 0.940304 2025-01-16 01:50:12,600 - INFO - step 13008, loss: 1.164257, best loss: 0.940304 2025-01-16 01:50:12,751 - INFO - step 13009, loss: 1.207537, best loss: 0.940304 2025-01-16 01:50:12,901 - INFO - step 13010, loss: 1.295470, best loss: 0.940304 2025-01-16 01:50:13,051 - INFO - step 13011, loss: 1.163456, best loss: 0.940304 2025-01-16 01:50:13,201 - INFO - step 13012, loss: 1.302529, best loss: 0.940304 2025-01-16 01:50:13,351 - INFO - step 13013, loss: 1.163503, best loss: 0.940304 2025-01-16 01:50:13,501 - INFO - step 13014, loss: 0.943802, best loss: 0.940304 2025-01-16 01:50:13,651 - INFO - step 13015, loss: 1.182607, best loss: 0.940304 2025-01-16 01:50:13,801 - INFO - step 13016, loss: 1.382106, best loss: 0.940304 2025-01-16 01:50:13,951 - INFO - step 13017, loss: 1.252893, best loss: 0.940304 2025-01-16 01:50:14,101 - INFO - step 13018, loss: 1.135798, best loss: 0.940304 
2025-01-16 01:50:14,251 - INFO - step 13019, loss: 1.109609, best loss: 0.940304 2025-01-16 01:50:14,401 - INFO - step 13020, loss: 1.230369, best loss: 0.940304 2025-01-16 01:50:14,552 - INFO - step 13021, loss: 1.307062, best loss: 0.940304 2025-01-16 01:50:14,702 - INFO - step 13022, loss: 1.252779, best loss: 0.940304 2025-01-16 01:50:14,852 - INFO - step 13023, loss: 1.132605, best loss: 0.940304 2025-01-16 01:50:15,002 - INFO - step 13024, loss: 1.388503, best loss: 0.940304 2025-01-16 01:50:15,152 - INFO - step 13025, loss: 1.353268, best loss: 0.940304 2025-01-16 01:50:15,302 - INFO - step 13026, loss: 1.280053, best loss: 0.940304 2025-01-16 01:50:15,452 - INFO - step 13027, loss: 1.234725, best loss: 0.940304 2025-01-16 01:50:15,602 - INFO - step 13028, loss: 1.246029, best loss: 0.940304 2025-01-16 01:50:15,752 - INFO - step 13029, loss: 1.305845, best loss: 0.940304 2025-01-16 01:50:15,903 - INFO - step 13030, loss: 1.176922, best loss: 0.940304 2025-01-16 01:50:16,053 - INFO - step 13031, loss: 1.162363, best loss: 0.940304 2025-01-16 01:50:16,203 - INFO - step 13032, loss: 1.089063, best loss: 0.940304 2025-01-16 01:50:16,353 - INFO - step 13033, loss: 0.992797, best loss: 0.940304 2025-01-16 01:50:16,503 - INFO - step 13034, loss: 1.204795, best loss: 0.940304 2025-01-16 01:50:16,653 - INFO - step 13035, loss: 1.223509, best loss: 0.940304 2025-01-16 01:50:16,803 - INFO - step 13036, loss: 1.472156, best loss: 0.940304 2025-01-16 01:50:16,954 - INFO - step 13037, loss: 1.201566, best loss: 0.940304 2025-01-16 01:50:17,104 - INFO - step 13038, loss: 1.251709, best loss: 0.940304 2025-01-16 01:50:17,254 - INFO - step 13039, loss: 1.318664, best loss: 0.940304 2025-01-16 01:50:17,404 - INFO - step 13040, loss: 1.185391, best loss: 0.940304 2025-01-16 01:50:17,554 - INFO - step 13041, loss: 1.106399, best loss: 0.940304 2025-01-16 01:50:17,704 - INFO - step 13042, loss: 1.186329, best loss: 0.940304 2025-01-16 01:50:17,854 - INFO - step 13043, loss: 
1.323270, best loss: 0.940304 2025-01-16 01:50:18,005 - INFO - step 13044, loss: 1.317299, best loss: 0.940304 2025-01-16 01:50:18,155 - INFO - step 13045, loss: 1.191514, best loss: 0.940304 2025-01-16 01:50:18,305 - INFO - step 13046, loss: 1.386154, best loss: 0.940304 2025-01-16 01:50:18,455 - INFO - step 13047, loss: 1.426534, best loss: 0.940304 2025-01-16 01:50:18,605 - INFO - step 13048, loss: 1.299699, best loss: 0.940304 2025-01-16 01:50:18,755 - INFO - step 13049, loss: 1.284514, best loss: 0.940304 2025-01-16 01:50:18,905 - INFO - step 13050, loss: 1.383602, best loss: 0.940304 2025-01-16 01:50:19,055 - INFO - step 13051, loss: 1.197865, best loss: 0.940304 2025-01-16 01:50:19,206 - INFO - step 13052, loss: 1.328170, best loss: 0.940304 2025-01-16 01:50:19,356 - INFO - step 13053, loss: 1.410413, best loss: 0.940304 2025-01-16 01:50:19,506 - INFO - step 13054, loss: 1.315403, best loss: 0.940304 2025-01-16 01:50:19,656 - INFO - step 13055, loss: 1.260698, best loss: 0.940304 2025-01-16 01:50:19,806 - INFO - step 13056, loss: 1.388932, best loss: 0.940304 2025-01-16 01:50:19,956 - INFO - step 13057, loss: 1.333373, best loss: 0.940304 2025-01-16 01:50:20,106 - INFO - step 13058, loss: 1.004453, best loss: 0.940304 2025-01-16 01:50:20,256 - INFO - step 13059, loss: 1.391107, best loss: 0.940304 2025-01-16 01:50:20,406 - INFO - step 13060, loss: 1.244233, best loss: 0.940304 2025-01-16 01:50:20,556 - INFO - step 13061, loss: 1.264428, best loss: 0.940304 2025-01-16 01:50:20,707 - INFO - step 13062, loss: 1.350530, best loss: 0.940304 2025-01-16 01:50:20,857 - INFO - step 13063, loss: 1.270249, best loss: 0.940304 2025-01-16 01:50:21,007 - INFO - step 13064, loss: 1.291017, best loss: 0.940304 2025-01-16 01:50:21,157 - INFO - step 13065, loss: 1.331307, best loss: 0.940304 2025-01-16 01:50:21,307 - INFO - step 13066, loss: 1.382770, best loss: 0.940304 2025-01-16 01:50:21,457 - INFO - step 13067, loss: 1.249755, best loss: 0.940304 2025-01-16 01:50:21,607 - 
INFO - step 13068, loss: 1.261538, best loss: 0.940304 2025-01-16 01:50:21,757 - INFO - step 13069, loss: 1.142119, best loss: 0.940304 2025-01-16 01:50:21,907 - INFO - step 13070, loss: 1.209499, best loss: 0.940304 2025-01-16 01:50:22,057 - INFO - step 13071, loss: 1.218525, best loss: 0.940304 2025-01-16 01:50:22,207 - INFO - step 13072, loss: 1.285531, best loss: 0.940304 2025-01-16 01:50:22,358 - INFO - step 13073, loss: 1.298080, best loss: 0.940304 2025-01-16 01:50:22,508 - INFO - step 13074, loss: 1.228546, best loss: 0.940304 2025-01-16 01:50:22,658 - INFO - step 13075, loss: 1.212694, best loss: 0.940304 2025-01-16 01:50:22,808 - INFO - step 13076, loss: 1.127987, best loss: 0.940304 2025-01-16 01:50:22,958 - INFO - step 13077, loss: 1.168779, best loss: 0.940304 2025-01-16 01:50:23,108 - INFO - step 13078, loss: 1.274337, best loss: 0.940304 2025-01-16 01:50:23,258 - INFO - step 13079, loss: 1.332237, best loss: 0.940304 2025-01-16 01:50:23,409 - INFO - step 13080, loss: 1.223074, best loss: 0.940304 2025-01-16 01:50:23,559 - INFO - step 13081, loss: 1.169266, best loss: 0.940304 2025-01-16 01:50:23,709 - INFO - step 13082, loss: 1.154811, best loss: 0.940304 2025-01-16 01:50:23,859 - INFO - step 13083, loss: 1.189336, best loss: 0.940304 2025-01-16 01:50:24,009 - INFO - step 13084, loss: 1.315677, best loss: 0.940304 2025-01-16 01:50:24,159 - INFO - step 13085, loss: 1.290716, best loss: 0.940304 2025-01-16 01:50:24,310 - INFO - step 13086, loss: 1.372754, best loss: 0.940304 2025-01-16 01:50:24,460 - INFO - step 13087, loss: 1.285777, best loss: 0.940304 2025-01-16 01:50:24,610 - INFO - step 13088, loss: 1.255419, best loss: 0.940304 2025-01-16 01:50:24,760 - INFO - step 13089, loss: 1.244223, best loss: 0.940304 2025-01-16 01:50:24,910 - INFO - step 13090, loss: 1.352856, best loss: 0.940304 2025-01-16 01:50:25,060 - INFO - step 13091, loss: 1.437300, best loss: 0.940304 2025-01-16 01:50:25,211 - INFO - step 13092, loss: 1.501344, best loss: 0.940304 
2025-01-16 01:50:25,361 - INFO - step 13093, loss: 1.450122, best loss: 0.940304 2025-01-16 01:50:25,511 - INFO - step 13094, loss: 1.463262, best loss: 0.940304 2025-01-16 01:50:25,661 - INFO - step 13095, loss: 1.250848, best loss: 0.940304 2025-01-16 01:50:25,811 - INFO - step 13096, loss: 1.322644, best loss: 0.940304 2025-01-16 01:50:25,961 - INFO - step 13097, loss: 1.183404, best loss: 0.940304 2025-01-16 01:50:26,112 - INFO - step 13098, loss: 1.115906, best loss: 0.940304 2025-01-16 01:50:26,262 - INFO - step 13099, loss: 1.328920, best loss: 0.940304 2025-01-16 01:50:26,412 - INFO - step 13100, loss: 1.277987, best loss: 0.940304 2025-01-16 01:50:26,562 - INFO - step 13101, loss: 1.027222, best loss: 0.940304 2025-01-16 01:50:26,712 - INFO - step 13102, loss: 1.087196, best loss: 0.940304 2025-01-16 01:50:26,862 - INFO - step 13103, loss: 1.323264, best loss: 0.940304 2025-01-16 01:50:27,012 - INFO - step 13104, loss: 1.416248, best loss: 0.940304 2025-01-16 01:50:27,162 - INFO - step 13105, loss: 1.218420, best loss: 0.940304 2025-01-16 01:50:27,313 - INFO - step 13106, loss: 1.252256, best loss: 0.940304 2025-01-16 01:50:27,463 - INFO - step 13107, loss: 1.184907, best loss: 0.940304 2025-01-16 01:50:27,613 - INFO - step 13108, loss: 0.983231, best loss: 0.940304 2025-01-16 01:50:27,763 - INFO - step 13109, loss: 1.347050, best loss: 0.940304 2025-01-16 01:50:27,913 - INFO - step 13110, loss: 1.480804, best loss: 0.940304 2025-01-16 01:50:28,063 - INFO - step 13111, loss: 1.459044, best loss: 0.940304 2025-01-16 01:50:28,213 - INFO - step 13112, loss: 1.240145, best loss: 0.940304 2025-01-16 01:50:28,363 - INFO - step 13113, loss: 1.203554, best loss: 0.940304 2025-01-16 01:50:28,514 - INFO - step 13114, loss: 1.347106, best loss: 0.940304 2025-01-16 01:50:28,664 - INFO - step 13115, loss: 1.209896, best loss: 0.940304 2025-01-16 01:50:28,814 - INFO - step 13116, loss: 1.148128, best loss: 0.940304 2025-01-16 01:50:28,964 - INFO - step 13117, loss: 
1.330840, best loss: 0.940304 2025-01-16 01:50:29,114 - INFO - step 13118, loss: 1.097234, best loss: 0.940304 2025-01-16 01:50:29,264 - INFO - step 13119, loss: 0.966835, best loss: 0.940304 2025-01-16 01:50:29,415 - INFO - step 13120, loss: 1.075839, best loss: 0.940304 2025-01-16 01:50:29,565 - INFO - step 13121, loss: 1.176131, best loss: 0.940304 2025-01-16 01:50:29,715 - INFO - step 13122, loss: 1.194569, best loss: 0.940304 2025-01-16 01:50:29,865 - INFO - step 13123, loss: 0.987797, best loss: 0.940304 2025-01-16 01:50:30,015 - INFO - step 13124, loss: 1.008422, best loss: 0.940304 2025-01-16 01:50:30,165 - INFO - step 13125, loss: 0.987474, best loss: 0.940304 2025-01-16 01:50:30,315 - INFO - step 13126, loss: 1.140395, best loss: 0.940304 2025-01-16 01:50:30,465 - INFO - step 13127, loss: 1.267561, best loss: 0.940304 2025-01-16 01:50:30,615 - INFO - step 13128, loss: 1.232415, best loss: 0.940304 2025-01-16 01:50:30,765 - INFO - step 13129, loss: 1.275115, best loss: 0.940304 2025-01-16 01:50:30,915 - INFO - step 13130, loss: 1.269480, best loss: 0.940304 2025-01-16 01:50:31,066 - INFO - step 13131, loss: 1.125195, best loss: 0.940304 2025-01-16 01:50:31,216 - INFO - step 13132, loss: 1.252406, best loss: 0.940304 2025-01-16 01:50:31,366 - INFO - step 13133, loss: 1.032830, best loss: 0.940304 2025-01-16 01:50:31,516 - INFO - step 13134, loss: 1.177948, best loss: 0.940304 2025-01-16 01:50:31,666 - INFO - step 13135, loss: 1.097207, best loss: 0.940304 2025-01-16 01:50:31,816 - INFO - step 13136, loss: 1.059672, best loss: 0.940304 2025-01-16 01:50:31,966 - INFO - step 13137, loss: 1.035072, best loss: 0.940304 2025-01-16 01:50:32,116 - INFO - step 13138, loss: 1.068396, best loss: 0.940304 2025-01-16 01:50:32,266 - INFO - step 13139, loss: 1.121691, best loss: 0.940304 2025-01-16 01:50:32,416 - INFO - step 13140, loss: 1.058475, best loss: 0.940304 2025-01-16 01:50:32,566 - INFO - step 13141, loss: 1.069523, best loss: 0.940304 2025-01-16 01:50:32,716 - 
INFO - step 13142, loss: 0.998251, best loss: 0.940304 2025-01-16 01:50:32,866 - INFO - step 13143, loss: 0.981316, best loss: 0.940304 2025-01-16 01:50:33,017 - INFO - step 13144, loss: 1.103702, best loss: 0.940304 2025-01-16 01:50:33,167 - INFO - step 13145, loss: 1.216737, best loss: 0.940304 2025-01-16 01:50:33,317 - INFO - step 13146, loss: 1.060163, best loss: 0.940304 2025-01-16 01:50:33,467 - INFO - step 13147, loss: 1.064652, best loss: 0.940304 2025-01-16 01:50:33,618 - INFO - step 13148, loss: 1.128014, best loss: 0.940304 2025-01-16 01:50:33,768 - INFO - step 13149, loss: 1.221885, best loss: 0.940304 2025-01-16 01:50:33,918 - INFO - step 13150, loss: 1.175847, best loss: 0.940304 2025-01-16 01:50:34,068 - INFO - step 13151, loss: 1.035376, best loss: 0.940304 2025-01-16 01:50:34,218 - INFO - step 13152, loss: 1.131524, best loss: 0.940304 2025-01-16 01:50:34,368 - INFO - step 13153, loss: 1.138097, best loss: 0.940304 2025-01-16 01:50:34,518 - INFO - step 13154, loss: 1.296409, best loss: 0.940304 2025-01-16 01:50:34,668 - INFO - step 13155, loss: 1.174396, best loss: 0.940304 2025-01-16 01:50:34,819 - INFO - step 13156, loss: 1.223129, best loss: 0.940304 2025-01-16 01:50:34,969 - INFO - step 13157, loss: 1.076780, best loss: 0.940304 2025-01-16 01:50:35,119 - INFO - step 13158, loss: 1.146935, best loss: 0.940304 2025-01-16 01:50:35,269 - INFO - step 13159, loss: 1.109546, best loss: 0.940304 2025-01-16 01:50:35,419 - INFO - step 13160, loss: 1.104070, best loss: 0.940304 2025-01-16 01:50:35,569 - INFO - step 13161, loss: 1.079177, best loss: 0.940304 2025-01-16 01:50:35,719 - INFO - step 13162, loss: 1.084701, best loss: 0.940304 2025-01-16 01:50:35,870 - INFO - step 13163, loss: 1.062089, best loss: 0.940304 2025-01-16 01:50:36,020 - INFO - step 13164, loss: 1.111938, best loss: 0.940304 2025-01-16 01:50:36,170 - INFO - step 13165, loss: 1.060166, best loss: 0.940304 2025-01-16 01:50:36,320 - INFO - step 13166, loss: 1.156157, best loss: 0.940304 
2025-01-16 01:50:39,850 - INFO - step 13167, loss: 0.933363, best loss: 0.933363 2025-01-16 01:50:40,013 - INFO - step 13168, loss: 1.225159, best loss: 0.933363 2025-01-16 01:50:40,170 - INFO - step 13169, loss: 1.125317, best loss: 0.933363 2025-01-16 01:50:40,321 - INFO - step 13170, loss: 1.090264, best loss: 0.933363 2025-01-16 01:50:40,471 - INFO - step 13171, loss: 1.137583, best loss: 0.933363 2025-01-16 01:50:40,621 - INFO - step 13172, loss: 1.013811, best loss: 0.933363 2025-01-16 01:50:40,772 - INFO - step 13173, loss: 1.081494, best loss: 0.933363 2025-01-16 01:50:40,922 - INFO - step 13174, loss: 1.011572, best loss: 0.933363 2025-01-16 01:50:41,072 - INFO - step 13175, loss: 1.059695, best loss: 0.933363 2025-01-16 01:50:41,222 - INFO - step 13176, loss: 1.040604, best loss: 0.933363 2025-01-16 01:50:41,382 - INFO - step 13177, loss: 1.141882, best loss: 0.933363 2025-01-16 01:50:41,532 - INFO - step 13178, loss: 1.094514, best loss: 0.933363 2025-01-16 01:50:41,682 - INFO - step 13179, loss: 1.059552, best loss: 0.933363 2025-01-16 01:50:41,832 - INFO - step 13180, loss: 1.028887, best loss: 0.933363 2025-01-16 01:50:41,982 - INFO - step 13181, loss: 1.095338, best loss: 0.933363 2025-01-16 01:50:42,132 - INFO - step 13182, loss: 1.218777, best loss: 0.933363 2025-01-16 01:50:42,282 - INFO - step 13183, loss: 1.147491, best loss: 0.933363 2025-01-16 01:50:42,433 - INFO - step 13184, loss: 1.124026, best loss: 0.933363 2025-01-16 01:50:42,583 - INFO - step 13185, loss: 0.974539, best loss: 0.933363 2025-01-16 01:50:42,733 - INFO - step 13186, loss: 0.963886, best loss: 0.933363 2025-01-16 01:50:46,306 - INFO - step 13187, loss: 0.908593, best loss: 0.908593 2025-01-16 01:50:46,456 - INFO - step 13188, loss: 1.206997, best loss: 0.908593 2025-01-16 01:50:46,606 - INFO - step 13189, loss: 1.137547, best loss: 0.908593 2025-01-16 01:50:46,756 - INFO - step 13190, loss: 1.246697, best loss: 0.908593 2025-01-16 01:50:46,906 - INFO - step 13191, loss: 
1.181667, best loss: 0.908593 2025-01-16 01:50:47,056 - INFO - step 13192, loss: 1.088777, best loss: 0.908593 2025-01-16 01:50:47,206 - INFO - step 13193, loss: 1.034386, best loss: 0.908593 2025-01-16 01:50:47,357 - INFO - step 13194, loss: 1.066618, best loss: 0.908593 2025-01-16 01:50:47,507 - INFO - step 13195, loss: 1.158508, best loss: 0.908593 2025-01-16 01:50:47,657 - INFO - step 13196, loss: 1.227076, best loss: 0.908593 2025-01-16 01:50:53,944 - INFO - step 13197, loss: 0.873384, best loss: 0.873384 2025-01-16 01:50:54,095 - INFO - step 13198, loss: 0.989060, best loss: 0.873384 2025-01-16 01:50:54,245 - INFO - step 13199, loss: 0.966007, best loss: 0.873384 2025-01-16 01:50:54,395 - INFO - step 13200, loss: 1.119032, best loss: 0.873384 2025-01-16 01:50:54,546 - INFO - step 13201, loss: 1.119530, best loss: 0.873384 2025-01-16 01:50:54,696 - INFO - step 13202, loss: 1.105279, best loss: 0.873384 2025-01-16 01:50:54,846 - INFO - step 13203, loss: 1.186541, best loss: 0.873384 2025-01-16 01:50:54,996 - INFO - step 13204, loss: 1.079908, best loss: 0.873384 2025-01-16 01:50:55,146 - INFO - step 13205, loss: 0.982854, best loss: 0.873384 2025-01-16 01:50:55,296 - INFO - step 13206, loss: 1.188034, best loss: 0.873384 2025-01-16 01:50:55,446 - INFO - step 13207, loss: 1.113805, best loss: 0.873384 2025-01-16 01:50:55,596 - INFO - step 13208, loss: 1.191554, best loss: 0.873384 2025-01-16 01:50:55,747 - INFO - step 13209, loss: 1.170051, best loss: 0.873384 2025-01-16 01:50:55,897 - INFO - step 13210, loss: 1.031389, best loss: 0.873384 2025-01-16 01:50:56,047 - INFO - step 13211, loss: 1.033695, best loss: 0.873384 2025-01-16 01:50:56,197 - INFO - step 13212, loss: 0.971587, best loss: 0.873384 2025-01-16 01:50:56,347 - INFO - step 13213, loss: 1.210253, best loss: 0.873384 2025-01-16 01:50:56,497 - INFO - step 13214, loss: 1.240144, best loss: 0.873384 2025-01-16 01:50:56,647 - INFO - step 13215, loss: 1.181116, best loss: 0.873384 2025-01-16 01:50:56,797 - 
INFO - step 13216, loss: 1.220494, best loss: 0.873384 2025-01-16 01:50:56,948 - INFO - step 13217, loss: 1.145360, best loss: 0.873384 2025-01-16 01:50:57,098 - INFO - step 13218, loss: 1.163271, best loss: 0.873384 2025-01-16 01:50:57,248 - INFO - step 13219, loss: 1.107397, best loss: 0.873384 2025-01-16 01:50:57,398 - INFO - step 13220, loss: 1.091862, best loss: 0.873384 2025-01-16 01:50:57,548 - INFO - step 13221, loss: 1.318994, best loss: 0.873384 2025-01-16 01:50:57,698 - INFO - step 13222, loss: 0.960891, best loss: 0.873384 2025-01-16 01:50:57,848 - INFO - step 13223, loss: 1.035684, best loss: 0.873384 2025-01-16 01:50:57,999 - INFO - step 13224, loss: 1.198922, best loss: 0.873384 2025-01-16 01:50:58,149 - INFO - step 13225, loss: 1.243398, best loss: 0.873384 2025-01-16 01:50:58,299 - INFO - step 13226, loss: 1.137447, best loss: 0.873384 2025-01-16 01:50:58,449 - INFO - step 13227, loss: 1.075317, best loss: 0.873384 2025-01-16 01:50:58,599 - INFO - step 13228, loss: 1.216259, best loss: 0.873384 2025-01-16 01:50:58,750 - INFO - step 13229, loss: 1.232825, best loss: 0.873384 2025-01-16 01:50:58,900 - INFO - step 13230, loss: 0.987650, best loss: 0.873384 2025-01-16 01:50:59,050 - INFO - step 13231, loss: 1.085904, best loss: 0.873384 2025-01-16 01:50:59,200 - INFO - step 13232, loss: 1.117933, best loss: 0.873384 2025-01-16 01:50:59,351 - INFO - step 13233, loss: 1.165656, best loss: 0.873384 2025-01-16 01:50:59,501 - INFO - step 13234, loss: 1.010036, best loss: 0.873384 2025-01-16 01:50:59,651 - INFO - step 13235, loss: 0.963650, best loss: 0.873384 2025-01-16 01:50:59,802 - INFO - step 13236, loss: 1.192824, best loss: 0.873384 2025-01-16 01:50:59,952 - INFO - step 13237, loss: 1.182683, best loss: 0.873384 2025-01-16 01:51:00,102 - INFO - step 13238, loss: 1.124974, best loss: 0.873384 2025-01-16 01:51:00,252 - INFO - step 13239, loss: 1.020293, best loss: 0.873384 2025-01-16 01:51:00,403 - INFO - step 13240, loss: 1.165282, best loss: 0.873384 
2025-01-16 01:51:00,553 - INFO - step 13241, loss: 1.181088, best loss: 0.873384 2025-01-16 01:51:00,703 - INFO - step 13242, loss: 1.169975, best loss: 0.873384 2025-01-16 01:51:00,853 - INFO - step 13243, loss: 1.041263, best loss: 0.873384 2025-01-16 01:51:01,003 - INFO - step 13244, loss: 1.095595, best loss: 0.873384 2025-01-16 01:51:01,153 - INFO - step 13245, loss: 1.012365, best loss: 0.873384 2025-01-16 01:51:01,303 - INFO - step 13246, loss: 1.220966, best loss: 0.873384 2025-01-16 01:51:01,453 - INFO - step 13247, loss: 1.150817, best loss: 0.873384 2025-01-16 01:51:01,604 - INFO - step 13248, loss: 1.119338, best loss: 0.873384 2025-01-16 01:51:01,754 - INFO - step 13249, loss: 0.961498, best loss: 0.873384 2025-01-16 01:51:01,904 - INFO - step 13250, loss: 1.069435, best loss: 0.873384 2025-01-16 01:51:02,054 - INFO - step 13251, loss: 1.113039, best loss: 0.873384 2025-01-16 01:51:02,204 - INFO - step 13252, loss: 1.192580, best loss: 0.873384 2025-01-16 01:51:02,355 - INFO - step 13253, loss: 1.129053, best loss: 0.873384 2025-01-16 01:51:02,505 - INFO - step 13254, loss: 1.121358, best loss: 0.873384 2025-01-16 01:51:02,655 - INFO - step 13255, loss: 1.052737, best loss: 0.873384 2025-01-16 01:51:02,805 - INFO - step 13256, loss: 1.089038, best loss: 0.873384 2025-01-16 01:51:02,955 - INFO - step 13257, loss: 1.067570, best loss: 0.873384 2025-01-16 01:51:03,105 - INFO - step 13258, loss: 0.941711, best loss: 0.873384 2025-01-16 01:51:03,255 - INFO - step 13259, loss: 1.154578, best loss: 0.873384 2025-01-16 01:51:03,405 - INFO - step 13260, loss: 1.205205, best loss: 0.873384 2025-01-16 01:51:03,555 - INFO - step 13261, loss: 1.306713, best loss: 0.873384 2025-01-16 01:51:03,705 - INFO - step 13262, loss: 1.233076, best loss: 0.873384 2025-01-16 01:51:03,855 - INFO - step 13263, loss: 1.194016, best loss: 0.873384 2025-01-16 01:51:04,006 - INFO - step 13264, loss: 1.269699, best loss: 0.873384 2025-01-16 01:51:04,156 - INFO - step 13265, loss: 
0.993334, best loss: 0.873384 2025-01-16 01:51:04,306 - INFO - step 13266, loss: 1.169806, best loss: 0.873384 2025-01-16 01:51:04,456 - INFO - step 13267, loss: 1.095336, best loss: 0.873384 2025-01-16 01:51:04,606 - INFO - step 13268, loss: 1.095042, best loss: 0.873384 2025-01-16 01:51:04,757 - INFO - step 13269, loss: 1.070633, best loss: 0.873384 2025-01-16 01:51:04,907 - INFO - step 13270, loss: 1.097343, best loss: 0.873384 2025-01-16 01:51:05,057 - INFO - step 13271, loss: 1.115956, best loss: 0.873384 2025-01-16 01:51:05,207 - INFO - step 13272, loss: 1.175116, best loss: 0.873384 2025-01-16 01:51:05,357 - INFO - step 13273, loss: 1.145059, best loss: 0.873384 2025-01-16 01:51:05,507 - INFO - step 13274, loss: 1.265095, best loss: 0.873384 2025-01-16 01:51:05,657 - INFO - step 13275, loss: 1.164735, best loss: 0.873384 2025-01-16 01:51:05,808 - INFO - step 13276, loss: 1.090132, best loss: 0.873384 2025-01-16 01:51:05,958 - INFO - step 13277, loss: 1.214776, best loss: 0.873384 2025-01-16 01:51:06,108 - INFO - step 13278, loss: 1.154427, best loss: 0.873384 2025-01-16 01:51:06,258 - INFO - step 13279, loss: 1.002532, best loss: 0.873384 2025-01-16 01:51:06,408 - INFO - step 13280, loss: 1.193741, best loss: 0.873384 2025-01-16 01:51:06,558 - INFO - step 13281, loss: 1.048132, best loss: 0.873384 2025-01-16 01:51:06,709 - INFO - step 13282, loss: 1.017695, best loss: 0.873384 2025-01-16 01:51:06,859 - INFO - step 13283, loss: 1.049003, best loss: 0.873384 2025-01-16 01:51:07,009 - INFO - step 13284, loss: 1.096217, best loss: 0.873384 2025-01-16 01:51:07,159 - INFO - step 13285, loss: 1.111531, best loss: 0.873384 2025-01-16 01:51:07,309 - INFO - step 13286, loss: 0.946160, best loss: 0.873384 2025-01-16 01:51:07,459 - INFO - step 13287, loss: 0.994176, best loss: 0.873384 2025-01-16 01:51:07,609 - INFO - step 13288, loss: 1.080787, best loss: 0.873384 2025-01-16 01:51:07,760 - INFO - step 13289, loss: 1.054293, best loss: 0.873384 2025-01-16 01:51:07,910 - 
INFO - step 13290, loss: 0.978252, best loss: 0.873384 2025-01-16 01:51:08,060 - INFO - step 13291, loss: 1.097382, best loss: 0.873384 2025-01-16 01:51:08,210 - INFO - step 13292, loss: 1.035010, best loss: 0.873384 2025-01-16 01:51:08,360 - INFO - step 13293, loss: 1.128886, best loss: 0.873384 2025-01-16 01:51:08,510 - INFO - step 13294, loss: 1.198364, best loss: 0.873384 2025-01-16 01:51:08,660 - INFO - step 13295, loss: 1.246382, best loss: 0.873384 2025-01-16 01:51:08,810 - INFO - step 13296, loss: 1.234923, best loss: 0.873384 2025-01-16 01:51:08,960 - INFO - step 13297, loss: 1.083656, best loss: 0.873384 2025-01-16 01:51:09,111 - INFO - step 13298, loss: 1.027309, best loss: 0.873384 2025-01-16 01:51:09,261 - INFO - step 13299, loss: 1.194125, best loss: 0.873384 2025-01-16 01:51:09,411 - INFO - step 13300, loss: 1.121931, best loss: 0.873384 2025-01-16 01:51:09,561 - INFO - step 13301, loss: 1.168990, best loss: 0.873384 2025-01-16 01:51:09,711 - INFO - step 13302, loss: 1.070571, best loss: 0.873384 2025-01-16 01:51:09,862 - INFO - step 13303, loss: 1.356073, best loss: 0.873384 2025-01-16 01:51:10,012 - INFO - step 13304, loss: 1.112319, best loss: 0.873384 2025-01-16 01:51:10,162 - INFO - step 13305, loss: 1.127090, best loss: 0.873384 2025-01-16 01:51:10,312 - INFO - step 13306, loss: 1.122865, best loss: 0.873384 2025-01-16 01:51:10,462 - INFO - step 13307, loss: 1.052526, best loss: 0.873384 2025-01-16 01:51:10,613 - INFO - step 13308, loss: 1.088067, best loss: 0.873384 2025-01-16 01:51:10,763 - INFO - step 13309, loss: 1.205159, best loss: 0.873384 2025-01-16 01:51:10,913 - INFO - step 13310, loss: 1.088346, best loss: 0.873384 2025-01-16 01:51:11,063 - INFO - step 13311, loss: 1.186157, best loss: 0.873384 2025-01-16 01:51:11,213 - INFO - step 13312, loss: 1.137587, best loss: 0.873384 2025-01-16 01:51:11,364 - INFO - step 13313, loss: 1.088949, best loss: 0.873384 2025-01-16 01:51:11,514 - INFO - step 13314, loss: 1.235689, best loss: 0.873384 
2025-01-16 01:51:11,664 - INFO - step 13315, loss: 1.152731, best loss: 0.873384 2025-01-16 01:51:11,814 - INFO - step 13316, loss: 1.082641, best loss: 0.873384 2025-01-16 01:51:11,964 - INFO - step 13317, loss: 1.117026, best loss: 0.873384 2025-01-16 01:51:12,114 - INFO - step 13318, loss: 1.024482, best loss: 0.873384 2025-01-16 01:51:12,264 - INFO - step 13319, loss: 1.104304, best loss: 0.873384 2025-01-16 01:51:12,414 - INFO - step 13320, loss: 1.171880, best loss: 0.873384 2025-01-16 01:51:12,565 - INFO - step 13321, loss: 1.129311, best loss: 0.873384 2025-01-16 01:51:12,715 - INFO - step 13322, loss: 1.115750, best loss: 0.873384 2025-01-16 01:51:12,865 - INFO - step 13323, loss: 0.991749, best loss: 0.873384 2025-01-16 01:51:13,015 - INFO - step 13324, loss: 0.981124, best loss: 0.873384 2025-01-16 01:51:13,165 - INFO - step 13325, loss: 1.005475, best loss: 0.873384 2025-01-16 01:51:13,315 - INFO - step 13326, loss: 1.081036, best loss: 0.873384 2025-01-16 01:51:13,465 - INFO - step 13327, loss: 1.151472, best loss: 0.873384 2025-01-16 01:51:13,615 - INFO - step 13328, loss: 0.969042, best loss: 0.873384 2025-01-16 01:51:13,765 - INFO - step 13329, loss: 0.945101, best loss: 0.873384 2025-01-16 01:51:13,916 - INFO - step 13330, loss: 1.035549, best loss: 0.873384 2025-01-16 01:51:14,066 - INFO - step 13331, loss: 1.146826, best loss: 0.873384 2025-01-16 01:51:14,216 - INFO - step 13332, loss: 1.302708, best loss: 0.873384 2025-01-16 01:51:14,366 - INFO - step 13333, loss: 0.917051, best loss: 0.873384 2025-01-16 01:51:14,516 - INFO - step 13334, loss: 1.250213, best loss: 0.873384 2025-01-16 01:51:14,666 - INFO - step 13335, loss: 1.010973, best loss: 0.873384 2025-01-16 01:51:14,816 - INFO - step 13336, loss: 0.929372, best loss: 0.873384 2025-01-16 01:51:14,966 - INFO - step 13337, loss: 1.069540, best loss: 0.873384 2025-01-16 01:51:15,116 - INFO - step 13338, loss: 1.087862, best loss: 0.873384 2025-01-16 01:51:15,267 - INFO - step 13339, loss: 
1.105311, best loss: 0.873384 2025-01-16 01:51:15,417 - INFO - step 13340, loss: 1.179844, best loss: 0.873384 2025-01-16 01:51:15,567 - INFO - step 13341, loss: 1.046777, best loss: 0.873384 2025-01-16 01:51:15,717 - INFO - step 13342, loss: 1.191720, best loss: 0.873384 2025-01-16 01:51:15,867 - INFO - step 13343, loss: 0.974428, best loss: 0.873384 2025-01-16 01:51:19,369 - INFO - step 13344, loss: 0.846069, best loss: 0.846069 2025-01-16 01:51:19,519 - INFO - step 13345, loss: 1.090977, best loss: 0.846069 2025-01-16 01:51:19,670 - INFO - step 13346, loss: 1.276770, best loss: 0.846069 2025-01-16 01:51:19,820 - INFO - step 13347, loss: 1.152052, best loss: 0.846069 2025-01-16 01:51:19,970 - INFO - step 13348, loss: 1.004606, best loss: 0.846069 2025-01-16 01:51:20,120 - INFO - step 13349, loss: 1.113899, best loss: 0.846069 2025-01-16 01:51:20,270 - INFO - step 13350, loss: 1.141273, best loss: 0.846069 2025-01-16 01:51:20,420 - INFO - step 13351, loss: 1.181966, best loss: 0.846069 2025-01-16 01:51:20,570 - INFO - step 13352, loss: 1.126390, best loss: 0.846069 2025-01-16 01:51:20,721 - INFO - step 13353, loss: 1.039021, best loss: 0.846069 2025-01-16 01:51:20,871 - INFO - step 13354, loss: 1.243918, best loss: 0.846069 2025-01-16 01:51:21,021 - INFO - step 13355, loss: 1.151293, best loss: 0.846069 2025-01-16 01:51:21,171 - INFO - step 13356, loss: 1.103774, best loss: 0.846069 2025-01-16 01:51:21,321 - INFO - step 13357, loss: 1.076610, best loss: 0.846069 2025-01-16 01:51:21,471 - INFO - step 13358, loss: 1.091560, best loss: 0.846069 2025-01-16 01:51:21,622 - INFO - step 13359, loss: 1.184488, best loss: 0.846069 2025-01-16 01:51:21,771 - INFO - step 13360, loss: 1.059435, best loss: 0.846069 2025-01-16 01:51:21,921 - INFO - step 13361, loss: 1.096539, best loss: 0.846069 2025-01-16 01:51:22,071 - INFO - step 13362, loss: 1.063533, best loss: 0.846069 2025-01-16 01:51:22,222 - INFO - step 13363, loss: 0.848494, best loss: 0.846069 2025-01-16 01:51:22,372 - 
INFO - step 13364, loss: 1.085130, best loss: 0.846069 2025-01-16 01:51:22,522 - INFO - step 13365, loss: 1.101934, best loss: 0.846069 2025-01-16 01:51:22,672 - INFO - step 13366, loss: 1.208611, best loss: 0.846069 2025-01-16 01:51:22,822 - INFO - step 13367, loss: 1.118662, best loss: 0.846069 2025-01-16 01:51:22,972 - INFO - step 13368, loss: 1.098387, best loss: 0.846069 2025-01-16 01:51:23,122 - INFO - step 13369, loss: 1.128627, best loss: 0.846069 2025-01-16 01:51:23,273 - INFO - step 13370, loss: 1.092928, best loss: 0.846069 2025-01-16 01:51:23,423 - INFO - step 13371, loss: 1.063883, best loss: 0.846069 2025-01-16 01:51:23,573 - INFO - step 13372, loss: 1.131411, best loss: 0.846069 2025-01-16 01:51:23,723 - INFO - step 13373, loss: 1.141356, best loss: 0.846069 2025-01-16 01:51:23,873 - INFO - step 13374, loss: 1.211280, best loss: 0.846069 2025-01-16 01:51:24,023 - INFO - step 13375, loss: 1.035936, best loss: 0.846069 2025-01-16 01:51:24,173 - INFO - step 13376, loss: 1.275283, best loss: 0.846069 2025-01-16 01:51:24,323 - INFO - step 13377, loss: 1.221241, best loss: 0.846069 2025-01-16 01:51:24,473 - INFO - step 13378, loss: 1.189821, best loss: 0.846069 2025-01-16 01:51:24,624 - INFO - step 13379, loss: 1.135116, best loss: 0.846069 2025-01-16 01:51:24,774 - INFO - step 13380, loss: 1.332673, best loss: 0.846069 2025-01-16 01:51:24,924 - INFO - step 13381, loss: 1.121886, best loss: 0.846069 2025-01-16 01:51:25,074 - INFO - step 13382, loss: 1.144773, best loss: 0.846069 2025-01-16 01:51:25,224 - INFO - step 13383, loss: 1.198725, best loss: 0.846069 2025-01-16 01:51:25,374 - INFO - step 13384, loss: 1.140225, best loss: 0.846069 2025-01-16 01:51:25,524 - INFO - step 13385, loss: 1.024557, best loss: 0.846069 2025-01-16 01:51:25,674 - INFO - step 13386, loss: 1.191453, best loss: 0.846069 2025-01-16 01:51:25,825 - INFO - step 13387, loss: 1.187624, best loss: 0.846069 2025-01-16 01:51:25,975 - INFO - step 13388, loss: 0.866994, best loss: 0.846069 
2025-01-16 01:51:26,125 - INFO - step 13389, loss: 1.224317, best loss: 0.846069 2025-01-16 01:51:26,275 - INFO - step 13390, loss: 1.094566, best loss: 0.846069 2025-01-16 01:51:26,425 - INFO - step 13391, loss: 1.091882, best loss: 0.846069 2025-01-16 01:51:26,575 - INFO - step 13392, loss: 1.164537, best loss: 0.846069 2025-01-16 01:51:26,725 - INFO - step 13393, loss: 1.096551, best loss: 0.846069 2025-01-16 01:51:26,875 - INFO - step 13394, loss: 1.198480, best loss: 0.846069 2025-01-16 01:51:27,025 - INFO - step 13395, loss: 1.224791, best loss: 0.846069 2025-01-16 01:51:27,175 - INFO - step 13396, loss: 1.261815, best loss: 0.846069 2025-01-16 01:51:27,325 - INFO - step 13397, loss: 1.099184, best loss: 0.846069 2025-01-16 01:51:27,475 - INFO - step 13398, loss: 1.102594, best loss: 0.846069 2025-01-16 01:51:27,626 - INFO - step 13399, loss: 0.957422, best loss: 0.846069 2025-01-16 01:51:27,776 - INFO - step 13400, loss: 1.042009, best loss: 0.846069 2025-01-16 01:51:27,926 - INFO - step 13401, loss: 1.100900, best loss: 0.846069 2025-01-16 01:51:28,076 - INFO - step 13402, loss: 1.166899, best loss: 0.846069 2025-01-16 01:51:28,226 - INFO - step 13403, loss: 1.185203, best loss: 0.846069 2025-01-16 01:51:28,376 - INFO - step 13404, loss: 1.165230, best loss: 0.846069 2025-01-16 01:51:28,526 - INFO - step 13405, loss: 1.078445, best loss: 0.846069 2025-01-16 01:51:28,676 - INFO - step 13406, loss: 1.071825, best loss: 0.846069 2025-01-16 01:51:28,827 - INFO - step 13407, loss: 1.029742, best loss: 0.846069 2025-01-16 01:51:28,977 - INFO - step 13408, loss: 1.082068, best loss: 0.846069 2025-01-16 01:51:29,127 - INFO - step 13409, loss: 1.168576, best loss: 0.846069 2025-01-16 01:51:29,277 - INFO - step 13410, loss: 1.034657, best loss: 0.846069 2025-01-16 01:51:29,427 - INFO - step 13411, loss: 1.006030, best loss: 0.846069 2025-01-16 01:51:29,577 - INFO - step 13412, loss: 1.023940, best loss: 0.846069 2025-01-16 01:51:29,727 - INFO - step 13413, loss: 
1.045324, best loss: 0.846069 2025-01-16 01:51:29,877 - INFO - step 13414, loss: 1.163768, best loss: 0.846069 2025-01-16 01:51:30,027 - INFO - step 13415, loss: 1.174577, best loss: 0.846069 2025-01-16 01:51:30,178 - INFO - step 13416, loss: 1.131726, best loss: 0.846069 2025-01-16 01:51:30,328 - INFO - step 13417, loss: 1.167372, best loss: 0.846069 2025-01-16 01:51:30,478 - INFO - step 13418, loss: 1.043703, best loss: 0.846069 2025-01-16 01:51:30,628 - INFO - step 13419, loss: 1.109687, best loss: 0.846069 2025-01-16 01:51:30,778 - INFO - step 13420, loss: 1.181788, best loss: 0.846069 2025-01-16 01:51:30,928 - INFO - step 13421, loss: 1.201180, best loss: 0.846069 2025-01-16 01:51:31,078 - INFO - step 13422, loss: 1.274435, best loss: 0.846069 2025-01-16 01:51:31,228 - INFO - step 13423, loss: 1.297999, best loss: 0.846069 2025-01-16 01:51:31,378 - INFO - step 13424, loss: 1.330413, best loss: 0.846069 2025-01-16 01:51:31,528 - INFO - step 13425, loss: 1.023002, best loss: 0.846069 2025-01-16 01:51:31,679 - INFO - step 13426, loss: 1.184609, best loss: 0.846069 2025-01-16 01:51:31,829 - INFO - step 13427, loss: 1.011535, best loss: 0.846069 2025-01-16 01:51:31,979 - INFO - step 13428, loss: 0.960856, best loss: 0.846069 2025-01-16 01:51:32,129 - INFO - step 13429, loss: 1.158480, best loss: 0.846069 2025-01-16 01:51:32,279 - INFO - step 13430, loss: 1.136974, best loss: 0.846069 2025-01-16 01:51:32,429 - INFO - step 13431, loss: 1.023464, best loss: 0.846069 2025-01-16 01:51:32,579 - INFO - step 13432, loss: 1.046095, best loss: 0.846069 2025-01-16 01:51:32,729 - INFO - step 13433, loss: 1.211731, best loss: 0.846069 2025-01-16 01:51:32,879 - INFO - step 13434, loss: 1.272659, best loss: 0.846069 2025-01-16 01:51:33,029 - INFO - step 13435, loss: 1.086379, best loss: 0.846069 2025-01-16 01:51:33,179 - INFO - step 13436, loss: 1.082255, best loss: 0.846069 2025-01-16 01:51:33,330 - INFO - step 13437, loss: 1.047449, best loss: 0.846069 2025-01-16 01:51:33,480 - 
INFO - step 13438, loss: 0.854122, best loss: 0.846069 2025-01-16 01:51:33,630 - INFO - step 13439, loss: 1.117281, best loss: 0.846069 2025-01-16 01:51:33,780 - INFO - step 13440, loss: 1.318237, best loss: 0.846069 2025-01-16 01:51:33,930 - INFO - step 13441, loss: 1.352644, best loss: 0.846069 2025-01-16 01:51:34,080 - INFO - step 13442, loss: 1.182806, best loss: 0.846069 2025-01-16 01:51:34,230 - INFO - step 13443, loss: 1.039086, best loss: 0.846069 2025-01-16 01:51:34,380 - INFO - step 13444, loss: 1.188974, best loss: 0.846069 2025-01-16 01:51:34,530 - INFO - step 13445, loss: 1.119962, best loss: 0.846069 2025-01-16 01:51:34,681 - INFO - step 13446, loss: 1.014594, best loss: 0.846069 2025-01-16 01:51:34,831 - INFO - step 13447, loss: 1.152624, best loss: 0.846069 2025-01-16 01:51:34,981 - INFO - step 13448, loss: 1.029451, best loss: 0.846069 2025-01-16 01:51:35,131 - INFO - step 13449, loss: 0.879850, best loss: 0.846069 2025-01-16 01:51:35,281 - INFO - step 13450, loss: 1.065330, best loss: 0.846069 2025-01-16 01:51:35,431 - INFO - step 13451, loss: 1.028665, best loss: 0.846069 2025-01-16 01:51:35,581 - INFO - step 13452, loss: 1.059767, best loss: 0.846069 2025-01-16 01:51:35,731 - INFO - step 13453, loss: 0.867792, best loss: 0.846069 2025-01-16 01:51:35,881 - INFO - step 13454, loss: 0.957951, best loss: 0.846069 2025-01-16 01:51:39,405 - INFO - step 13455, loss: 0.842948, best loss: 0.842948 2025-01-16 01:51:39,567 - INFO - step 13456, loss: 0.974449, best loss: 0.842948 2025-01-16 01:51:39,722 - INFO - step 13457, loss: 1.141709, best loss: 0.842948 2025-01-16 01:51:39,872 - INFO - step 13458, loss: 1.083926, best loss: 0.842948 2025-01-16 01:51:40,022 - INFO - step 13459, loss: 1.153857, best loss: 0.842948 2025-01-16 01:51:40,172 - INFO - step 13460, loss: 1.116008, best loss: 0.842948 2025-01-16 01:51:40,322 - INFO - step 13461, loss: 0.997742, best loss: 0.842948 2025-01-16 01:51:40,473 - INFO - step 13462, loss: 1.075689, best loss: 0.842948 
2025-01-16 01:51:40,623 - INFO - step 13463, loss: 0.971691, best loss: 0.842948 2025-01-16 01:51:40,773 - INFO - step 13464, loss: 0.976176, best loss: 0.842948 2025-01-16 01:51:40,923 - INFO - step 13465, loss: 1.023747, best loss: 0.842948 2025-01-16 01:51:41,073 - INFO - step 13466, loss: 0.896280, best loss: 0.842948 2025-01-16 01:51:41,223 - INFO - step 13467, loss: 0.864813, best loss: 0.842948 2025-01-16 01:51:41,373 - INFO - step 13468, loss: 0.962747, best loss: 0.842948 2025-01-16 01:51:41,524 - INFO - step 13469, loss: 1.027042, best loss: 0.842948 2025-01-16 01:51:41,674 - INFO - step 13470, loss: 1.097744, best loss: 0.842948 2025-01-16 01:51:41,825 - INFO - step 13471, loss: 0.974972, best loss: 0.842948 2025-01-16 01:51:41,975 - INFO - step 13472, loss: 1.005869, best loss: 0.842948 2025-01-16 01:51:42,125 - INFO - step 13473, loss: 0.911699, best loss: 0.842948 2025-01-16 01:51:42,275 - INFO - step 13474, loss: 0.970913, best loss: 0.842948 2025-01-16 01:51:42,426 - INFO - step 13475, loss: 1.066976, best loss: 0.842948 2025-01-16 01:51:42,576 - INFO - step 13476, loss: 0.908757, best loss: 0.842948 2025-01-16 01:51:42,726 - INFO - step 13477, loss: 0.965616, best loss: 0.842948 2025-01-16 01:51:42,877 - INFO - step 13478, loss: 1.046221, best loss: 0.842948 2025-01-16 01:51:43,027 - INFO - step 13479, loss: 1.029996, best loss: 0.842948 2025-01-16 01:51:43,177 - INFO - step 13480, loss: 1.128620, best loss: 0.842948 2025-01-16 01:51:43,327 - INFO - step 13481, loss: 0.943328, best loss: 0.842948 2025-01-16 01:51:43,477 - INFO - step 13482, loss: 1.063591, best loss: 0.842948 2025-01-16 01:51:43,627 - INFO - step 13483, loss: 1.067191, best loss: 0.842948 2025-01-16 01:51:43,778 - INFO - step 13484, loss: 1.149356, best loss: 0.842948 2025-01-16 01:51:43,928 - INFO - step 13485, loss: 0.973318, best loss: 0.842948 2025-01-16 01:51:44,078 - INFO - step 13486, loss: 1.069556, best loss: 0.842948 2025-01-16 01:51:44,229 - INFO - step 13487, loss: 
0.978935, best loss: 0.842948 2025-01-16 01:51:44,379 - INFO - step 13488, loss: 1.040829, best loss: 0.842948 2025-01-16 01:51:44,529 - INFO - step 13489, loss: 1.056507, best loss: 0.842948 2025-01-16 01:51:44,679 - INFO - step 13490, loss: 1.018157, best loss: 0.842948 2025-01-16 01:51:44,829 - INFO - step 13491, loss: 1.031660, best loss: 0.842948 2025-01-16 01:51:44,979 - INFO - step 13492, loss: 1.001898, best loss: 0.842948 2025-01-16 01:51:45,130 - INFO - step 13493, loss: 0.935865, best loss: 0.842948 2025-01-16 01:51:45,280 - INFO - step 13494, loss: 1.004344, best loss: 0.842948 2025-01-16 01:51:45,430 - INFO - step 13495, loss: 1.034262, best loss: 0.842948 2025-01-16 01:51:45,580 - INFO - step 13496, loss: 1.057324, best loss: 0.842948 2025-01-16 01:51:45,730 - INFO - step 13497, loss: 0.873627, best loss: 0.842948 2025-01-16 01:51:45,881 - INFO - step 13498, loss: 1.067598, best loss: 0.842948 2025-01-16 01:51:46,031 - INFO - step 13499, loss: 1.010654, best loss: 0.842948 2025-01-16 01:51:46,181 - INFO - step 13500, loss: 0.951111, best loss: 0.842948 2025-01-16 01:51:46,331 - INFO - step 13501, loss: 0.999556, best loss: 0.842948 2025-01-16 01:51:46,481 - INFO - step 13502, loss: 0.915277, best loss: 0.842948 2025-01-16 01:51:46,632 - INFO - step 13503, loss: 0.962284, best loss: 0.842948 2025-01-16 01:51:46,782 - INFO - step 13504, loss: 0.967201, best loss: 0.842948 2025-01-16 01:51:46,932 - INFO - step 13505, loss: 0.965427, best loss: 0.842948 2025-01-16 01:51:47,083 - INFO - step 13506, loss: 0.878877, best loss: 0.842948 2025-01-16 01:51:47,233 - INFO - step 13507, loss: 1.076452, best loss: 0.842948 2025-01-16 01:51:47,383 - INFO - step 13508, loss: 0.955449, best loss: 0.842948 2025-01-16 01:51:47,533 - INFO - step 13509, loss: 0.897420, best loss: 0.842948 2025-01-16 01:51:47,683 - INFO - step 13510, loss: 0.850680, best loss: 0.842948 2025-01-16 01:51:47,833 - INFO - step 13511, loss: 0.931081, best loss: 0.842948 2025-01-16 01:51:47,983 - 
INFO - step 13512, loss: 1.108829, best loss: 0.842948 2025-01-16 01:51:48,134 - INFO - step 13513, loss: 0.978852, best loss: 0.842948 2025-01-16 01:51:48,283 - INFO - step 13514, loss: 1.041572, best loss: 0.842948 2025-01-16 01:51:48,434 - INFO - step 13515, loss: 0.854758, best loss: 0.842948 2025-01-16 01:51:51,527 - INFO - step 13516, loss: 0.807222, best loss: 0.807222 2025-01-16 01:51:51,677 - INFO - step 13517, loss: 0.854389, best loss: 0.807222 2025-01-16 01:51:51,827 - INFO - step 13518, loss: 1.057140, best loss: 0.807222 2025-01-16 01:51:51,977 - INFO - step 13519, loss: 1.078074, best loss: 0.807222 2025-01-16 01:51:52,127 - INFO - step 13520, loss: 1.076731, best loss: 0.807222 2025-01-16 01:51:52,277 - INFO - step 13521, loss: 1.034509, best loss: 0.807222 2025-01-16 01:51:52,427 - INFO - step 13522, loss: 0.947099, best loss: 0.807222 2025-01-16 01:51:52,577 - INFO - step 13523, loss: 0.884753, best loss: 0.807222 2025-01-16 01:51:52,728 - INFO - step 13524, loss: 0.952260, best loss: 0.807222 2025-01-16 01:51:52,878 - INFO - step 13525, loss: 1.079704, best loss: 0.807222 2025-01-16 01:51:53,028 - INFO - step 13526, loss: 1.092440, best loss: 0.807222 2025-01-16 01:51:53,178 - INFO - step 13527, loss: 0.827168, best loss: 0.807222 2025-01-16 01:51:53,328 - INFO - step 13528, loss: 0.902327, best loss: 0.807222 2025-01-16 01:51:53,479 - INFO - step 13529, loss: 0.857681, best loss: 0.807222 2025-01-16 01:51:53,629 - INFO - step 13530, loss: 1.047173, best loss: 0.807222 2025-01-16 01:51:53,779 - INFO - step 13531, loss: 1.044802, best loss: 0.807222 2025-01-16 01:51:53,929 - INFO - step 13532, loss: 1.020805, best loss: 0.807222 2025-01-16 01:51:54,079 - INFO - step 13533, loss: 1.089031, best loss: 0.807222 2025-01-16 01:51:54,230 - INFO - step 13534, loss: 1.022403, best loss: 0.807222 2025-01-16 01:51:54,380 - INFO - step 13535, loss: 0.882351, best loss: 0.807222 2025-01-16 01:51:54,530 - INFO - step 13536, loss: 0.997425, best loss: 0.807222 
2025-01-16 01:51:54,680 - INFO - step 13537, loss: 1.060654, best loss: 0.807222 2025-01-16 01:51:54,830 - INFO - step 13538, loss: 1.041282, best loss: 0.807222 2025-01-16 01:51:54,981 - INFO - step 13539, loss: 1.122481, best loss: 0.807222 2025-01-16 01:51:55,131 - INFO - step 13540, loss: 0.956151, best loss: 0.807222 2025-01-16 01:51:55,281 - INFO - step 13541, loss: 0.931612, best loss: 0.807222 2025-01-16 01:51:55,431 - INFO - step 13542, loss: 0.830741, best loss: 0.807222 2025-01-16 01:51:55,581 - INFO - step 13543, loss: 1.073078, best loss: 0.807222 2025-01-16 01:51:55,731 - INFO - step 13544, loss: 1.134863, best loss: 0.807222 2025-01-16 01:51:55,881 - INFO - step 13545, loss: 1.060452, best loss: 0.807222 2025-01-16 01:51:56,031 - INFO - step 13546, loss: 1.078321, best loss: 0.807222 2025-01-16 01:51:56,182 - INFO - step 13547, loss: 0.975280, best loss: 0.807222 2025-01-16 01:51:56,332 - INFO - step 13548, loss: 1.084892, best loss: 0.807222 2025-01-16 01:51:56,482 - INFO - step 13549, loss: 1.011633, best loss: 0.807222 2025-01-16 01:51:56,632 - INFO - step 13550, loss: 0.998487, best loss: 0.807222 2025-01-16 01:51:56,782 - INFO - step 13551, loss: 1.142353, best loss: 0.807222 2025-01-16 01:51:56,932 - INFO - step 13552, loss: 0.899755, best loss: 0.807222 2025-01-16 01:51:57,082 - INFO - step 13553, loss: 0.917640, best loss: 0.807222 2025-01-16 01:51:57,232 - INFO - step 13554, loss: 1.120911, best loss: 0.807222 2025-01-16 01:51:57,382 - INFO - step 13555, loss: 1.132986, best loss: 0.807222 2025-01-16 01:51:57,533 - INFO - step 13556, loss: 1.055709, best loss: 0.807222 2025-01-16 01:51:57,683 - INFO - step 13557, loss: 0.968544, best loss: 0.807222 2025-01-16 01:51:57,834 - INFO - step 13558, loss: 1.049756, best loss: 0.807222 2025-01-16 01:51:57,984 - INFO - step 13559, loss: 1.158254, best loss: 0.807222 2025-01-16 01:51:58,134 - INFO - step 13560, loss: 0.932602, best loss: 0.807222 2025-01-16 01:51:58,284 - INFO - step 13561, loss: 
0.987039, best loss: 0.807222 2025-01-16 01:51:58,434 - INFO - step 13562, loss: 1.036554, best loss: 0.807222 2025-01-16 01:51:58,585 - INFO - step 13563, loss: 1.058537, best loss: 0.807222 2025-01-16 01:51:58,735 - INFO - step 13564, loss: 0.929188, best loss: 0.807222 2025-01-16 01:51:58,885 - INFO - step 13565, loss: 0.820231, best loss: 0.807222 2025-01-16 01:51:59,035 - INFO - step 13566, loss: 1.134002, best loss: 0.807222 2025-01-16 01:51:59,185 - INFO - step 13567, loss: 0.998901, best loss: 0.807222 2025-01-16 01:51:59,335 - INFO - step 13568, loss: 1.076381, best loss: 0.807222 2025-01-16 01:51:59,485 - INFO - step 13569, loss: 0.906223, best loss: 0.807222 2025-01-16 01:51:59,636 - INFO - step 13570, loss: 1.110045, best loss: 0.807222 2025-01-16 01:51:59,786 - INFO - step 13571, loss: 1.005354, best loss: 0.807222 2025-01-16 01:51:59,936 - INFO - step 13572, loss: 1.046671, best loss: 0.807222 2025-01-16 01:52:00,086 - INFO - step 13573, loss: 0.949257, best loss: 0.807222 2025-01-16 01:52:00,236 - INFO - step 13574, loss: 0.961994, best loss: 0.807222 2025-01-16 01:52:00,386 - INFO - step 13575, loss: 0.904193, best loss: 0.807222 2025-01-16 01:52:00,536 - INFO - step 13576, loss: 1.062222, best loss: 0.807222 2025-01-16 01:52:00,686 - INFO - step 13577, loss: 1.029055, best loss: 0.807222 2025-01-16 01:52:00,836 - INFO - step 13578, loss: 0.985151, best loss: 0.807222 2025-01-16 01:52:00,986 - INFO - step 13579, loss: 0.857023, best loss: 0.807222 2025-01-16 01:52:01,137 - INFO - step 13580, loss: 0.948576, best loss: 0.807222 2025-01-16 01:52:01,287 - INFO - step 13581, loss: 1.012823, best loss: 0.807222 2025-01-16 01:52:01,437 - INFO - step 13582, loss: 1.034399, best loss: 0.807222 2025-01-16 01:52:01,587 - INFO - step 13583, loss: 0.998705, best loss: 0.807222 2025-01-16 01:52:01,737 - INFO - step 13584, loss: 1.022107, best loss: 0.807222 2025-01-16 01:52:01,887 - INFO - step 13585, loss: 0.927468, best loss: 0.807222 2025-01-16 01:52:02,037 - 
INFO - step 13586, loss: 0.991509, best loss: 0.807222 2025-01-16 01:52:02,188 - INFO - step 13587, loss: 0.981247, best loss: 0.807222 2025-01-16 01:52:02,338 - INFO - step 13588, loss: 0.849163, best loss: 0.807222 2025-01-16 01:52:02,488 - INFO - step 13589, loss: 1.067505, best loss: 0.807222 2025-01-16 01:52:02,638 - INFO - step 13590, loss: 1.051216, best loss: 0.807222 2025-01-16 01:52:02,788 - INFO - step 13591, loss: 1.162177, best loss: 0.807222 2025-01-16 01:52:02,938 - INFO - step 13592, loss: 1.103342, best loss: 0.807222 2025-01-16 01:52:03,088 - INFO - step 13593, loss: 1.176537, best loss: 0.807222 2025-01-16 01:52:03,238 - INFO - step 13594, loss: 1.071977, best loss: 0.807222 2025-01-16 01:52:03,388 - INFO - step 13595, loss: 0.901451, best loss: 0.807222 2025-01-16 01:52:03,539 - INFO - step 13596, loss: 1.125696, best loss: 0.807222 2025-01-16 01:52:03,689 - INFO - step 13597, loss: 1.004575, best loss: 0.807222 2025-01-16 01:52:03,839 - INFO - step 13598, loss: 0.964725, best loss: 0.807222 2025-01-16 01:52:03,989 - INFO - step 13599, loss: 1.012906, best loss: 0.807222 2025-01-16 01:52:04,139 - INFO - step 13600, loss: 1.083463, best loss: 0.807222 2025-01-16 01:52:04,290 - INFO - step 13601, loss: 0.972435, best loss: 0.807222 2025-01-16 01:52:04,440 - INFO - step 13602, loss: 1.041263, best loss: 0.807222 2025-01-16 01:52:04,589 - INFO - step 13603, loss: 1.088051, best loss: 0.807222 2025-01-16 01:52:04,740 - INFO - step 13604, loss: 1.131997, best loss: 0.807222 2025-01-16 01:52:04,890 - INFO - step 13605, loss: 1.074954, best loss: 0.807222 2025-01-16 01:52:05,040 - INFO - step 13606, loss: 1.052490, best loss: 0.807222 2025-01-16 01:52:05,190 - INFO - step 13607, loss: 1.049694, best loss: 0.807222 2025-01-16 01:52:05,340 - INFO - step 13608, loss: 1.038892, best loss: 0.807222 2025-01-16 01:52:05,490 - INFO - step 13609, loss: 0.932755, best loss: 0.807222 2025-01-16 01:52:05,641 - INFO - step 13610, loss: 1.075736, best loss: 0.807222 
2025-01-16 01:52:05,791 - INFO - step 13611, loss: 0.911317, best loss: 0.807222 2025-01-16 01:52:05,941 - INFO - step 13612, loss: 0.930259, best loss: 0.807222 2025-01-16 01:52:06,092 - INFO - step 13613, loss: 0.957366, best loss: 0.807222 2025-01-16 01:52:06,242 - INFO - step 13614, loss: 0.997912, best loss: 0.807222 2025-01-16 01:52:06,392 - INFO - step 13615, loss: 0.954127, best loss: 0.807222 2025-01-16 01:52:06,542 - INFO - step 13616, loss: 0.812396, best loss: 0.807222 2025-01-16 01:52:06,692 - INFO - step 13617, loss: 0.903177, best loss: 0.807222 2025-01-16 01:52:06,842 - INFO - step 13618, loss: 0.980829, best loss: 0.807222 2025-01-16 01:52:06,992 - INFO - step 13619, loss: 0.955374, best loss: 0.807222 2025-01-16 01:52:07,142 - INFO - step 13620, loss: 0.879599, best loss: 0.807222 2025-01-16 01:52:07,292 - INFO - step 13621, loss: 0.980997, best loss: 0.807222 2025-01-16 01:52:07,443 - INFO - step 13622, loss: 0.942287, best loss: 0.807222 2025-01-16 01:52:07,593 - INFO - step 13623, loss: 1.104797, best loss: 0.807222 2025-01-16 01:52:07,743 - INFO - step 13624, loss: 1.049845, best loss: 0.807222 2025-01-16 01:52:07,893 - INFO - step 13625, loss: 1.127083, best loss: 0.807222 2025-01-16 01:52:08,043 - INFO - step 13626, loss: 1.129319, best loss: 0.807222 2025-01-16 01:52:08,193 - INFO - step 13627, loss: 0.898985, best loss: 0.807222 2025-01-16 01:52:08,343 - INFO - step 13628, loss: 0.890537, best loss: 0.807222 2025-01-16 01:52:08,494 - INFO - step 13629, loss: 1.097503, best loss: 0.807222 2025-01-16 01:52:08,644 - INFO - step 13630, loss: 0.953213, best loss: 0.807222 2025-01-16 01:52:08,794 - INFO - step 13631, loss: 0.954094, best loss: 0.807222 2025-01-16 01:52:08,944 - INFO - step 13632, loss: 0.945193, best loss: 0.807222 2025-01-16 01:52:09,094 - INFO - step 13633, loss: 1.205092, best loss: 0.807222 2025-01-16 01:52:09,244 - INFO - step 13634, loss: 0.986260, best loss: 0.807222 2025-01-16 01:52:09,394 - INFO - step 13635, loss: 
0.953792, best loss: 0.807222 2025-01-16 01:52:09,544 - INFO - step 13636, loss: 1.076867, best loss: 0.807222 2025-01-16 01:52:09,695 - INFO - step 13637, loss: 0.968605, best loss: 0.807222 2025-01-16 01:52:09,845 - INFO - step 13638, loss: 0.879217, best loss: 0.807222 2025-01-16 01:52:09,995 - INFO - step 13639, loss: 1.132092, best loss: 0.807222 2025-01-16 01:52:10,146 - INFO - step 13640, loss: 0.942645, best loss: 0.807222 2025-01-16 01:52:10,295 - INFO - step 13641, loss: 1.011303, best loss: 0.807222 2025-01-16 01:52:10,446 - INFO - step 13642, loss: 0.961142, best loss: 0.807222 2025-01-16 01:52:10,596 - INFO - step 13643, loss: 0.919542, best loss: 0.807222 2025-01-16 01:52:10,746 - INFO - step 13644, loss: 1.094433, best loss: 0.807222 2025-01-16 01:52:10,896 - INFO - step 13645, loss: 0.966091, best loss: 0.807222 2025-01-16 01:52:11,046 - INFO - step 13646, loss: 0.968702, best loss: 0.807222 2025-01-16 01:52:11,196 - INFO - step 13647, loss: 1.127612, best loss: 0.807222 2025-01-16 01:52:11,346 - INFO - step 13648, loss: 0.932606, best loss: 0.807222 2025-01-16 01:52:11,497 - INFO - step 13649, loss: 1.030874, best loss: 0.807222 2025-01-16 01:52:11,647 - INFO - step 13650, loss: 1.035040, best loss: 0.807222 2025-01-16 01:52:11,797 - INFO - step 13651, loss: 1.034799, best loss: 0.807222 2025-01-16 01:52:11,947 - INFO - step 13652, loss: 1.010269, best loss: 0.807222 2025-01-16 01:52:12,097 - INFO - step 13653, loss: 0.932333, best loss: 0.807222 2025-01-16 01:52:12,247 - INFO - step 13654, loss: 0.924422, best loss: 0.807222 2025-01-16 01:52:12,397 - INFO - step 13655, loss: 0.990555, best loss: 0.807222 2025-01-16 01:52:12,548 - INFO - step 13656, loss: 0.990318, best loss: 0.807222 2025-01-16 01:52:12,698 - INFO - step 13657, loss: 1.147030, best loss: 0.807222 2025-01-16 01:52:12,848 - INFO - step 13658, loss: 0.916301, best loss: 0.807222 2025-01-16 01:52:12,998 - INFO - step 13659, loss: 0.845995, best loss: 0.807222 2025-01-16 01:52:13,148 - 
INFO - step 13660, loss: 0.974109, best loss: 0.807222 2025-01-16 01:52:13,298 - INFO - step 13661, loss: 1.004111, best loss: 0.807222 2025-01-16 01:52:13,449 - INFO - step 13662, loss: 1.136890, best loss: 0.807222 2025-01-16 01:52:13,599 - INFO - step 13663, loss: 0.864466, best loss: 0.807222 2025-01-16 01:52:13,749 - INFO - step 13664, loss: 1.015424, best loss: 0.807222 2025-01-16 01:52:13,899 - INFO - step 13665, loss: 0.943845, best loss: 0.807222 2025-01-16 01:52:14,050 - INFO - step 13666, loss: 0.848742, best loss: 0.807222 2025-01-16 01:52:14,200 - INFO - step 13667, loss: 1.016165, best loss: 0.807222 2025-01-16 01:52:14,351 - INFO - step 13668, loss: 0.997785, best loss: 0.807222 2025-01-16 01:52:14,501 - INFO - step 13669, loss: 1.081405, best loss: 0.807222 2025-01-16 01:52:14,651 - INFO - step 13670, loss: 1.037882, best loss: 0.807222 2025-01-16 01:52:14,801 - INFO - step 13671, loss: 0.981595, best loss: 0.807222 2025-01-16 01:52:14,951 - INFO - step 13672, loss: 1.068764, best loss: 0.807222 2025-01-16 01:52:15,101 - INFO - step 13673, loss: 0.831969, best loss: 0.807222 2025-01-16 01:52:18,679 - INFO - step 13674, loss: 0.748546, best loss: 0.748546 2025-01-16 01:52:18,829 - INFO - step 13675, loss: 0.926088, best loss: 0.748546 2025-01-16 01:52:18,979 - INFO - step 13676, loss: 1.140525, best loss: 0.748546 2025-01-16 01:52:19,129 - INFO - step 13677, loss: 1.013965, best loss: 0.748546 2025-01-16 01:52:19,280 - INFO - step 13678, loss: 0.905400, best loss: 0.748546 2025-01-16 01:52:19,430 - INFO - step 13679, loss: 0.922923, best loss: 0.748546 2025-01-16 01:52:19,581 - INFO - step 13680, loss: 1.067649, best loss: 0.748546 2025-01-16 01:52:19,731 - INFO - step 13681, loss: 1.103119, best loss: 0.748546 2025-01-16 01:52:19,881 - INFO - step 13682, loss: 1.000042, best loss: 0.748546 2025-01-16 01:52:20,032 - INFO - step 13683, loss: 0.915248, best loss: 0.748546 2025-01-16 01:52:20,182 - INFO - step 13684, loss: 1.096951, best loss: 0.748546 
2025-01-16 01:52:20,332 - INFO - step 13685, loss: 1.069451, best loss: 0.748546 2025-01-16 01:52:20,482 - INFO - step 13686, loss: 1.031215, best loss: 0.748546 2025-01-16 01:52:20,632 - INFO - step 13687, loss: 0.998814, best loss: 0.748546 2025-01-16 01:52:20,782 - INFO - step 13688, loss: 1.048044, best loss: 0.748546 2025-01-16 01:52:20,933 - INFO - step 13689, loss: 1.032498, best loss: 0.748546 2025-01-16 01:52:21,083 - INFO - step 13690, loss: 0.953155, best loss: 0.748546 2025-01-16 01:52:21,233 - INFO - step 13691, loss: 1.013530, best loss: 0.748546 2025-01-16 01:52:21,383 - INFO - step 13692, loss: 0.874864, best loss: 0.748546 2025-01-16 01:52:21,533 - INFO - step 13693, loss: 0.780547, best loss: 0.748546 2025-01-16 01:52:21,683 - INFO - step 13694, loss: 1.047843, best loss: 0.748546 2025-01-16 01:52:21,833 - INFO - step 13695, loss: 0.945428, best loss: 0.748546 2025-01-16 01:52:21,983 - INFO - step 13696, loss: 1.139102, best loss: 0.748546 2025-01-16 01:52:22,134 - INFO - step 13697, loss: 1.043008, best loss: 0.748546 2025-01-16 01:52:22,284 - INFO - step 13698, loss: 1.075996, best loss: 0.748546 2025-01-16 01:52:22,434 - INFO - step 13699, loss: 1.067621, best loss: 0.748546 2025-01-16 01:52:22,584 - INFO - step 13700, loss: 0.953183, best loss: 0.748546 2025-01-16 01:52:22,735 - INFO - step 13701, loss: 0.907436, best loss: 0.748546 2025-01-16 01:52:22,885 - INFO - step 13702, loss: 0.964745, best loss: 0.748546 2025-01-16 01:52:23,035 - INFO - step 13703, loss: 0.976338, best loss: 0.748546 2025-01-16 01:52:23,185 - INFO - step 13704, loss: 1.049498, best loss: 0.748546 2025-01-16 01:52:23,336 - INFO - step 13705, loss: 0.916230, best loss: 0.748546 2025-01-16 01:52:23,486 - INFO - step 13706, loss: 1.138505, best loss: 0.748546 2025-01-16 01:52:23,636 - INFO - step 13707, loss: 1.139846, best loss: 0.748546 2025-01-16 01:52:23,786 - INFO - step 13708, loss: 1.133951, best loss: 0.748546 2025-01-16 01:52:23,936 - INFO - step 13709, loss: 
1.001599, best loss: 0.748546 2025-01-16 01:52:24,087 - INFO - step 13710, loss: 1.206067, best loss: 0.748546 2025-01-16 01:52:24,237 - INFO - step 13711, loss: 0.972565, best loss: 0.748546 2025-01-16 01:52:24,387 - INFO - step 13712, loss: 1.076634, best loss: 0.748546 2025-01-16 01:52:24,537 - INFO - step 13713, loss: 1.076385, best loss: 0.748546 2025-01-16 01:52:24,687 - INFO - step 13714, loss: 1.014345, best loss: 0.748546 2025-01-16 01:52:24,837 - INFO - step 13715, loss: 0.918166, best loss: 0.748546 2025-01-16 01:52:24,987 - INFO - step 13716, loss: 1.107898, best loss: 0.748546 2025-01-16 01:52:25,138 - INFO - step 13717, loss: 1.086697, best loss: 0.748546 2025-01-16 01:52:25,288 - INFO - step 13718, loss: 0.813388, best loss: 0.748546 2025-01-16 01:52:25,438 - INFO - step 13719, loss: 1.116339, best loss: 0.748546 2025-01-16 01:52:25,588 - INFO - step 13720, loss: 0.960661, best loss: 0.748546 2025-01-16 01:52:25,738 - INFO - step 13721, loss: 1.018882, best loss: 0.748546 2025-01-16 01:52:25,889 - INFO - step 13722, loss: 1.076159, best loss: 0.748546 2025-01-16 01:52:26,039 - INFO - step 13723, loss: 1.027837, best loss: 0.748546 2025-01-16 01:52:26,189 - INFO - step 13724, loss: 1.030743, best loss: 0.748546 2025-01-16 01:52:26,339 - INFO - step 13725, loss: 1.089072, best loss: 0.748546 2025-01-16 01:52:26,489 - INFO - step 13726, loss: 1.098797, best loss: 0.748546 2025-01-16 01:52:26,639 - INFO - step 13727, loss: 1.020760, best loss: 0.748546 2025-01-16 01:52:26,790 - INFO - step 13728, loss: 0.920869, best loss: 0.748546 2025-01-16 01:52:26,940 - INFO - step 13729, loss: 0.850756, best loss: 0.748546 2025-01-16 01:52:27,090 - INFO - step 13730, loss: 0.925778, best loss: 0.748546 2025-01-16 01:52:27,240 - INFO - step 13731, loss: 0.989493, best loss: 0.748546 2025-01-16 01:52:27,390 - INFO - step 13732, loss: 1.015405, best loss: 0.748546 2025-01-16 01:52:27,540 - INFO - step 13733, loss: 1.100509, best loss: 0.748546 2025-01-16 01:52:27,690 - 
INFO - step 13734, loss: 1.015113, best loss: 0.748546 2025-01-16 01:52:27,840 - INFO - step 13735, loss: 0.998115, best loss: 0.748546 2025-01-16 01:52:27,990 - INFO - step 13736, loss: 0.883901, best loss: 0.748546 2025-01-16 01:52:28,141 - INFO - step 13737, loss: 0.943956, best loss: 0.748546 2025-01-16 01:52:28,291 - INFO - step 13738, loss: 0.997738, best loss: 0.748546 2025-01-16 01:52:28,441 - INFO - step 13739, loss: 0.976613, best loss: 0.748546 2025-01-16 01:52:28,591 - INFO - step 13740, loss: 0.969833, best loss: 0.748546 2025-01-16 01:52:28,741 - INFO - step 13741, loss: 0.874572, best loss: 0.748546 2025-01-16 01:52:28,891 - INFO - step 13742, loss: 0.946733, best loss: 0.748546 2025-01-16 01:52:29,041 - INFO - step 13743, loss: 0.942096, best loss: 0.748546 2025-01-16 01:52:29,191 - INFO - step 13744, loss: 1.018262, best loss: 0.748546 2025-01-16 01:52:29,342 - INFO - step 13745, loss: 0.991035, best loss: 0.748546 2025-01-16 01:52:29,492 - INFO - step 13746, loss: 1.065553, best loss: 0.748546 2025-01-16 01:52:29,642 - INFO - step 13747, loss: 1.049067, best loss: 0.748546 2025-01-16 01:52:29,793 - INFO - step 13748, loss: 0.978921, best loss: 0.748546 2025-01-16 01:52:29,943 - INFO - step 13749, loss: 1.008791, best loss: 0.748546 2025-01-16 01:52:30,093 - INFO - step 13750, loss: 1.158117, best loss: 0.748546 2025-01-16 01:52:30,243 - INFO - step 13751, loss: 1.116786, best loss: 0.748546 2025-01-16 01:52:30,393 - INFO - step 13752, loss: 1.124589, best loss: 0.748546 2025-01-16 01:52:30,543 - INFO - step 13753, loss: 1.192426, best loss: 0.748546 2025-01-16 01:52:30,693 - INFO - step 13754, loss: 1.173103, best loss: 0.748546 2025-01-16 01:52:30,843 - INFO - step 13755, loss: 1.022545, best loss: 0.748546 2025-01-16 01:52:30,994 - INFO - step 13756, loss: 1.041761, best loss: 0.748546 2025-01-16 01:52:31,144 - INFO - step 13757, loss: 0.933316, best loss: 0.748546 2025-01-16 01:52:31,294 - INFO - step 13758, loss: 0.895555, best loss: 0.748546 
2025-01-16 01:52:31,443 - INFO - step 13759, loss: 1.061556, best loss: 0.748546 2025-01-16 01:52:31,593 - INFO - step 13760, loss: 1.108296, best loss: 0.748546 2025-01-16 01:52:31,743 - INFO - step 13761, loss: 0.838770, best loss: 0.748546 2025-01-16 01:52:31,892 - INFO - step 13762, loss: 0.911301, best loss: 0.748546 2025-01-16 01:52:32,043 - INFO - step 13763, loss: 1.113254, best loss: 0.748546 2025-01-16 01:52:32,193 - INFO - step 13764, loss: 1.200806, best loss: 0.748546 2025-01-16 01:52:32,343 - INFO - step 13765, loss: 1.037363, best loss: 0.748546 2025-01-16 01:52:32,493 - INFO - step 13766, loss: 1.082230, best loss: 0.748546 2025-01-16 01:52:32,643 - INFO - step 13767, loss: 0.962649, best loss: 0.748546 2025-01-16 01:52:32,794 - INFO - step 13768, loss: 0.877016, best loss: 0.748546 2025-01-16 01:52:32,944 - INFO - step 13769, loss: 1.067906, best loss: 0.748546 2025-01-16 01:52:33,094 - INFO - step 13770, loss: 1.257968, best loss: 0.748546 2025-01-16 01:52:33,244 - INFO - step 13771, loss: 1.283923, best loss: 0.748546 2025-01-16 01:52:33,394 - INFO - step 13772, loss: 1.153231, best loss: 0.748546 2025-01-16 01:52:33,544 - INFO - step 13773, loss: 1.022036, best loss: 0.748546 2025-01-16 01:52:33,694 - INFO - step 13774, loss: 1.115101, best loss: 0.748546 2025-01-16 01:52:33,845 - INFO - step 13775, loss: 1.054664, best loss: 0.748546 2025-01-16 01:52:33,995 - INFO - step 13776, loss: 0.983196, best loss: 0.748546 2025-01-16 01:52:34,145 - INFO - step 13777, loss: 1.087484, best loss: 0.748546 2025-01-16 01:52:34,295 - INFO - step 13778, loss: 0.930108, best loss: 0.748546 2025-01-16 01:52:34,445 - INFO - step 13779, loss: 0.803482, best loss: 0.748546 2025-01-16 01:52:34,595 - INFO - step 13780, loss: 0.977952, best loss: 0.748546 2025-01-16 01:52:34,745 - INFO - step 13781, loss: 1.053187, best loss: 0.748546 2025-01-16 01:52:34,895 - INFO - step 13782, loss: 0.991137, best loss: 0.748546 2025-01-16 01:52:35,045 - INFO - step 13783, loss: 
0.834870, best loss: 0.748546 2025-01-16 01:52:35,196 - INFO - step 13784, loss: 0.871602, best loss: 0.748546 2025-01-16 01:52:35,346 - INFO - step 13785, loss: 0.807608, best loss: 0.748546 2025-01-16 01:52:35,496 - INFO - step 13786, loss: 0.840509, best loss: 0.748546 2025-01-16 01:52:35,646 - INFO - step 13787, loss: 1.072727, best loss: 0.748546 2025-01-16 01:52:35,797 - INFO - step 13788, loss: 0.937289, best loss: 0.748546 2025-01-16 01:52:35,947 - INFO - step 13789, loss: 1.112051, best loss: 0.748546 2025-01-16 01:52:36,097 - INFO - step 13790, loss: 0.985872, best loss: 0.748546 2025-01-16 01:52:36,247 - INFO - step 13791, loss: 0.929481, best loss: 0.748546 2025-01-16 01:52:36,397 - INFO - step 13792, loss: 1.028971, best loss: 0.748546 2025-01-16 01:52:36,547 - INFO - step 13793, loss: 0.830189, best loss: 0.748546 2025-01-16 01:52:36,698 - INFO - step 13794, loss: 0.919067, best loss: 0.748546 2025-01-16 01:52:36,848 - INFO - step 13795, loss: 0.915395, best loss: 0.748546 2025-01-16 01:52:36,998 - INFO - step 13796, loss: 0.917443, best loss: 0.748546 2025-01-16 01:52:37,148 - INFO - step 13797, loss: 0.807814, best loss: 0.748546 2025-01-16 01:52:37,299 - INFO - step 13798, loss: 0.922968, best loss: 0.748546 2025-01-16 01:52:37,449 - INFO - step 13799, loss: 0.955574, best loss: 0.748546 2025-01-16 01:52:37,599 - INFO - step 13800, loss: 0.938756, best loss: 0.748546 2025-01-16 01:52:37,749 - INFO - step 13801, loss: 0.934682, best loss: 0.748546 2025-01-16 01:52:37,899 - INFO - step 13802, loss: 0.886343, best loss: 0.748546 2025-01-16 01:52:38,049 - INFO - step 13803, loss: 0.822393, best loss: 0.748546 2025-01-16 01:52:38,199 - INFO - step 13804, loss: 0.965234, best loss: 0.748546 2025-01-16 01:52:38,349 - INFO - step 13805, loss: 1.034974, best loss: 0.748546 2025-01-16 01:52:38,499 - INFO - step 13806, loss: 0.906862, best loss: 0.748546 2025-01-16 01:52:38,650 - INFO - step 13807, loss: 0.945382, best loss: 0.748546 2025-01-16 01:52:38,800 - 
INFO - step 13808, loss: 0.985468, best loss: 0.748546 2025-01-16 01:52:38,950 - INFO - step 13809, loss: 1.006414, best loss: 0.748546 2025-01-16 01:52:39,100 - INFO - step 13810, loss: 0.959930, best loss: 0.748546 2025-01-16 01:52:39,250 - INFO - step 13811, loss: 0.826567, best loss: 0.748546 2025-01-16 01:52:39,400 - INFO - step 13812, loss: 1.007419, best loss: 0.748546 2025-01-16 01:52:39,550 - INFO - step 13813, loss: 0.928712, best loss: 0.748546 2025-01-16 01:52:39,700 - INFO - step 13814, loss: 1.136547, best loss: 0.748546 2025-01-16 01:52:39,850 - INFO - step 13815, loss: 1.076028, best loss: 0.748546 2025-01-16 01:52:40,000 - INFO - step 13816, loss: 1.023731, best loss: 0.748546 2025-01-16 01:52:40,150 - INFO - step 13817, loss: 0.948733, best loss: 0.748546 2025-01-16 01:52:40,300 - INFO - step 13818, loss: 0.975685, best loss: 0.748546 2025-01-16 01:52:40,450 - INFO - step 13819, loss: 0.953636, best loss: 0.748546 2025-01-16 01:52:40,600 - INFO - step 13820, loss: 0.961106, best loss: 0.748546 2025-01-16 01:52:40,750 - INFO - step 13821, loss: 0.924389, best loss: 0.748546 2025-01-16 01:52:40,900 - INFO - step 13822, loss: 0.931554, best loss: 0.748546 2025-01-16 01:52:41,051 - INFO - step 13823, loss: 0.905048, best loss: 0.748546 2025-01-16 01:52:41,201 - INFO - step 13824, loss: 0.872740, best loss: 0.748546 2025-01-16 01:52:41,351 - INFO - step 13825, loss: 0.881373, best loss: 0.748546 2025-01-16 01:52:41,501 - INFO - step 13826, loss: 0.941122, best loss: 0.748546 2025-01-16 01:52:41,651 - INFO - step 13827, loss: 0.861859, best loss: 0.748546 2025-01-16 01:52:41,802 - INFO - step 13828, loss: 1.014468, best loss: 0.748546 2025-01-16 01:52:41,952 - INFO - step 13829, loss: 1.025110, best loss: 0.748546 2025-01-16 01:52:42,101 - INFO - step 13830, loss: 0.951625, best loss: 0.748546 2025-01-16 01:52:42,252 - INFO - step 13831, loss: 0.935731, best loss: 0.748546 2025-01-16 01:52:42,402 - INFO - step 13832, loss: 0.924354, best loss: 0.748546 
2025-01-16 01:52:42,552 - INFO - step 13833, loss: 0.926492, best loss: 0.748546 2025-01-16 01:52:42,702 - INFO - step 13834, loss: 0.942262, best loss: 0.748546 2025-01-16 01:52:42,852 - INFO - step 13835, loss: 0.846581, best loss: 0.748546 2025-01-16 01:52:43,002 - INFO - step 13836, loss: 0.821341, best loss: 0.748546 2025-01-16 01:52:43,152 - INFO - step 13837, loss: 0.943334, best loss: 0.748546 2025-01-16 01:52:43,302 - INFO - step 13838, loss: 0.891704, best loss: 0.748546 2025-01-16 01:52:43,453 - INFO - step 13839, loss: 0.930176, best loss: 0.748546 2025-01-16 01:52:43,603 - INFO - step 13840, loss: 0.851447, best loss: 0.748546 2025-01-16 01:52:43,753 - INFO - step 13841, loss: 0.864175, best loss: 0.748546 2025-01-16 01:52:43,903 - INFO - step 13842, loss: 1.067621, best loss: 0.748546 2025-01-16 01:52:44,053 - INFO - step 13843, loss: 0.913910, best loss: 0.748546 2025-01-16 01:52:44,203 - INFO - step 13844, loss: 0.928243, best loss: 0.748546 2025-01-16 01:52:44,353 - INFO - step 13845, loss: 0.839285, best loss: 0.748546 2025-01-16 01:52:44,503 - INFO - step 13846, loss: 0.775356, best loss: 0.748546 2025-01-16 01:52:44,654 - INFO - step 13847, loss: 0.804659, best loss: 0.748546 2025-01-16 01:52:44,804 - INFO - step 13848, loss: 0.934645, best loss: 0.748546 2025-01-16 01:52:44,953 - INFO - step 13849, loss: 0.921301, best loss: 0.748546 2025-01-16 01:52:45,104 - INFO - step 13850, loss: 1.004699, best loss: 0.748546 2025-01-16 01:52:45,254 - INFO - step 13851, loss: 1.015463, best loss: 0.748546 2025-01-16 01:52:45,404 - INFO - step 13852, loss: 0.936162, best loss: 0.748546 2025-01-16 01:52:45,555 - INFO - step 13853, loss: 0.881021, best loss: 0.748546 2025-01-16 01:52:45,705 - INFO - step 13854, loss: 0.834187, best loss: 0.748546 2025-01-16 01:52:45,856 - INFO - step 13855, loss: 0.976505, best loss: 0.748546 2025-01-16 01:52:46,006 - INFO - step 13856, loss: 1.076624, best loss: 0.748546 2025-01-16 01:52:49,525 - INFO - step 13857, loss: 
0.713048, best loss: 0.713048 2025-01-16 01:52:49,687 - INFO - step 13858, loss: 0.841937, best loss: 0.713048 2025-01-16 01:52:49,839 - INFO - step 13859, loss: 0.792177, best loss: 0.713048 2025-01-16 01:52:49,989 - INFO - step 13860, loss: 0.933417, best loss: 0.713048 2025-01-16 01:52:50,139 - INFO - step 13861, loss: 0.995683, best loss: 0.713048 2025-01-16 01:52:50,289 - INFO - step 13862, loss: 0.914277, best loss: 0.713048 2025-01-16 01:52:50,439 - INFO - step 13863, loss: 1.051510, best loss: 0.713048 2025-01-16 01:52:50,589 - INFO - step 13864, loss: 0.899463, best loss: 0.713048 2025-01-16 01:52:50,740 - INFO - step 13865, loss: 0.826386, best loss: 0.713048 2025-01-16 01:52:50,890 - INFO - step 13866, loss: 0.936866, best loss: 0.713048 2025-01-16 01:52:51,040 - INFO - step 13867, loss: 0.974932, best loss: 0.713048 2025-01-16 01:52:51,190 - INFO - step 13868, loss: 0.996793, best loss: 0.713048 2025-01-16 01:52:51,340 - INFO - step 13869, loss: 1.001139, best loss: 0.713048 2025-01-16 01:52:51,490 - INFO - step 13870, loss: 0.888197, best loss: 0.713048 2025-01-16 01:52:51,641 - INFO - step 13871, loss: 0.860271, best loss: 0.713048 2025-01-16 01:52:51,791 - INFO - step 13872, loss: 0.812658, best loss: 0.713048 2025-01-16 01:52:51,941 - INFO - step 13873, loss: 1.018609, best loss: 0.713048 2025-01-16 01:52:52,091 - INFO - step 13874, loss: 1.110507, best loss: 0.713048 2025-01-16 01:52:52,241 - INFO - step 13875, loss: 1.013169, best loss: 0.713048 2025-01-16 01:52:52,392 - INFO - step 13876, loss: 1.015914, best loss: 0.713048 2025-01-16 01:52:52,542 - INFO - step 13877, loss: 0.904302, best loss: 0.713048 2025-01-16 01:52:52,692 - INFO - step 13878, loss: 0.993719, best loss: 0.713048 2025-01-16 01:52:52,842 - INFO - step 13879, loss: 0.923888, best loss: 0.713048 2025-01-16 01:52:52,992 - INFO - step 13880, loss: 0.886880, best loss: 0.713048 2025-01-16 01:52:53,142 - INFO - step 13881, loss: 1.022738, best loss: 0.713048 2025-01-16 01:52:53,293 - 
INFO - step 13882, loss: 0.884762, best loss: 0.713048 2025-01-16 01:52:53,443 - INFO - step 13883, loss: 0.929718, best loss: 0.713048 2025-01-16 01:52:53,593 - INFO - step 13884, loss: 1.025830, best loss: 0.713048 2025-01-16 01:52:53,743 - INFO - step 13885, loss: 1.104125, best loss: 0.713048 2025-01-16 01:52:53,894 - INFO - step 13886, loss: 0.961616, best loss: 0.713048 2025-01-16 01:52:54,044 - INFO - step 13887, loss: 0.862116, best loss: 0.713048 2025-01-16 01:52:54,194 - INFO - step 13888, loss: 0.974049, best loss: 0.713048 2025-01-16 01:52:54,344 - INFO - step 13889, loss: 1.028255, best loss: 0.713048 2025-01-16 01:52:54,494 - INFO - step 13890, loss: 0.845400, best loss: 0.713048 2025-01-16 01:52:54,645 - INFO - step 13891, loss: 0.864114, best loss: 0.713048 2025-01-16 01:52:54,795 - INFO - step 13892, loss: 0.955122, best loss: 0.713048 2025-01-16 01:52:54,945 - INFO - step 13893, loss: 0.903667, best loss: 0.713048 2025-01-16 01:52:55,095 - INFO - step 13894, loss: 0.798753, best loss: 0.713048 2025-01-16 01:52:55,245 - INFO - step 13895, loss: 0.806651, best loss: 0.713048 2025-01-16 01:52:55,396 - INFO - step 13896, loss: 1.011131, best loss: 0.713048 2025-01-16 01:52:55,546 - INFO - step 13897, loss: 1.022527, best loss: 0.713048 2025-01-16 01:52:55,695 - INFO - step 13898, loss: 0.905345, best loss: 0.713048 2025-01-16 01:52:55,846 - INFO - step 13899, loss: 0.831834, best loss: 0.713048 2025-01-16 01:52:55,996 - INFO - step 13900, loss: 1.054988, best loss: 0.713048 2025-01-16 01:52:56,146 - INFO - step 13901, loss: 0.944681, best loss: 0.713048 2025-01-16 01:52:56,296 - INFO - step 13902, loss: 0.982795, best loss: 0.713048 2025-01-16 01:52:56,446 - INFO - step 13903, loss: 0.912567, best loss: 0.713048 2025-01-16 01:52:56,596 - INFO - step 13904, loss: 0.848179, best loss: 0.713048 2025-01-16 01:52:56,746 - INFO - step 13905, loss: 0.815291, best loss: 0.713048 2025-01-16 01:52:56,896 - INFO - step 13906, loss: 0.880708, best loss: 0.713048 
2025-01-16 01:52:57,046 - INFO - step 13907, loss: 0.928675, best loss: 0.713048 2025-01-16 01:52:57,196 - INFO - step 13908, loss: 0.855721, best loss: 0.713048 2025-01-16 01:52:57,346 - INFO - step 13909, loss: 0.784320, best loss: 0.713048 2025-01-16 01:52:57,496 - INFO - step 13910, loss: 0.832807, best loss: 0.713048 2025-01-16 01:52:57,646 - INFO - step 13911, loss: 0.888795, best loss: 0.713048 2025-01-16 01:52:57,796 - INFO - step 13912, loss: 0.994325, best loss: 0.713048 2025-01-16 01:52:57,946 - INFO - step 13913, loss: 0.986879, best loss: 0.713048 2025-01-16 01:52:58,096 - INFO - step 13914, loss: 0.918014, best loss: 0.713048 2025-01-16 01:52:58,246 - INFO - step 13915, loss: 0.793116, best loss: 0.713048 2025-01-16 01:52:58,396 - INFO - step 13916, loss: 0.965845, best loss: 0.713048 2025-01-16 01:52:58,547 - INFO - step 13917, loss: 0.908608, best loss: 0.713048 2025-01-16 01:52:58,697 - INFO - step 13918, loss: 0.782140, best loss: 0.713048 2025-01-16 01:52:58,846 - INFO - step 13919, loss: 0.947765, best loss: 0.713048 2025-01-16 01:52:58,997 - INFO - step 13920, loss: 1.013349, best loss: 0.713048 2025-01-16 01:52:59,147 - INFO - step 13921, loss: 1.061711, best loss: 0.713048 2025-01-16 01:52:59,297 - INFO - step 13922, loss: 1.091087, best loss: 0.713048 2025-01-16 01:52:59,447 - INFO - step 13923, loss: 1.014553, best loss: 0.713048 2025-01-16 01:52:59,597 - INFO - step 13924, loss: 0.938880, best loss: 0.713048 2025-01-16 01:52:59,747 - INFO - step 13925, loss: 0.775001, best loss: 0.713048 2025-01-16 01:52:59,897 - INFO - step 13926, loss: 1.077622, best loss: 0.713048 2025-01-16 01:53:00,047 - INFO - step 13927, loss: 0.933709, best loss: 0.713048 2025-01-16 01:53:00,197 - INFO - step 13928, loss: 0.860220, best loss: 0.713048 2025-01-16 01:53:00,347 - INFO - step 13929, loss: 0.992745, best loss: 0.713048 2025-01-16 01:53:00,497 - INFO - step 13930, loss: 0.955111, best loss: 0.713048 2025-01-16 01:53:00,647 - INFO - step 13931, loss: 
0.902229, best loss: 0.713048 2025-01-16 01:53:00,797 - INFO - step 13932, loss: 0.955964, best loss: 0.713048 2025-01-16 01:53:00,947 - INFO - step 13933, loss: 0.933205, best loss: 0.713048 2025-01-16 01:53:01,097 - INFO - step 13934, loss: 1.024424, best loss: 0.713048 2025-01-16 01:53:01,247 - INFO - step 13935, loss: 0.944223, best loss: 0.713048 2025-01-16 01:53:01,397 - INFO - step 13936, loss: 0.908289, best loss: 0.713048 2025-01-16 01:53:01,547 - INFO - step 13937, loss: 0.937459, best loss: 0.713048 2025-01-16 01:53:01,697 - INFO - step 13938, loss: 0.983680, best loss: 0.713048 2025-01-16 01:53:01,847 - INFO - step 13939, loss: 0.847921, best loss: 0.713048 2025-01-16 01:53:01,998 - INFO - step 13940, loss: 0.978053, best loss: 0.713048 2025-01-16 01:53:02,148 - INFO - step 13941, loss: 0.892929, best loss: 0.713048 2025-01-16 01:53:02,298 - INFO - step 13942, loss: 0.818974, best loss: 0.713048 2025-01-16 01:53:02,448 - INFO - step 13943, loss: 0.889306, best loss: 0.713048 2025-01-16 01:53:02,598 - INFO - step 13944, loss: 0.881912, best loss: 0.713048 2025-01-16 01:53:02,748 - INFO - step 13945, loss: 0.932342, best loss: 0.713048 2025-01-16 01:53:02,898 - INFO - step 13946, loss: 0.738149, best loss: 0.713048 2025-01-16 01:53:03,048 - INFO - step 13947, loss: 0.867518, best loss: 0.713048 2025-01-16 01:53:03,198 - INFO - step 13948, loss: 0.856219, best loss: 0.713048 2025-01-16 01:53:03,348 - INFO - step 13949, loss: 0.859782, best loss: 0.713048 2025-01-16 01:53:03,498 - INFO - step 13950, loss: 0.812310, best loss: 0.713048 2025-01-16 01:53:03,648 - INFO - step 13951, loss: 0.875691, best loss: 0.713048 2025-01-16 01:53:03,798 - INFO - step 13952, loss: 0.876099, best loss: 0.713048 2025-01-16 01:53:03,948 - INFO - step 13953, loss: 0.871037, best loss: 0.713048 2025-01-16 01:53:04,098 - INFO - step 13954, loss: 0.964943, best loss: 0.713048 2025-01-16 01:53:04,248 - INFO - step 13955, loss: 0.997197, best loss: 0.713048 2025-01-16 01:53:04,398 - 
INFO - step 13956, loss: 1.015148, best loss: 0.713048 2025-01-16 01:53:04,549 - INFO - step 13957, loss: 0.846548, best loss: 0.713048 2025-01-16 01:53:04,699 - INFO - step 13958, loss: 0.834830, best loss: 0.713048 2025-01-16 01:53:04,849 - INFO - step 13959, loss: 0.895712, best loss: 0.713048 2025-01-16 01:53:04,999 - INFO - step 13960, loss: 0.924467, best loss: 0.713048 2025-01-16 01:53:05,149 - INFO - step 13961, loss: 0.907291, best loss: 0.713048 2025-01-16 01:53:05,299 - INFO - step 13962, loss: 0.891697, best loss: 0.713048 2025-01-16 01:53:05,449 - INFO - step 13963, loss: 1.086742, best loss: 0.713048 2025-01-16 01:53:05,599 - INFO - step 13964, loss: 0.869236, best loss: 0.713048 2025-01-16 01:53:05,749 - INFO - step 13965, loss: 0.820844, best loss: 0.713048 2025-01-16 01:53:05,899 - INFO - step 13966, loss: 0.993217, best loss: 0.713048 2025-01-16 01:53:06,049 - INFO - step 13967, loss: 0.880442, best loss: 0.713048 2025-01-16 01:53:06,199 - INFO - step 13968, loss: 0.879087, best loss: 0.713048 2025-01-16 01:53:06,349 - INFO - step 13969, loss: 1.031411, best loss: 0.713048 2025-01-16 01:53:06,499 - INFO - step 13970, loss: 0.890998, best loss: 0.713048 2025-01-16 01:53:06,650 - INFO - step 13971, loss: 0.914658, best loss: 0.713048 2025-01-16 01:53:06,800 - INFO - step 13972, loss: 0.855954, best loss: 0.713048 2025-01-16 01:53:06,950 - INFO - step 13973, loss: 0.839342, best loss: 0.713048 2025-01-16 01:53:07,100 - INFO - step 13974, loss: 0.944555, best loss: 0.713048 2025-01-16 01:53:07,250 - INFO - step 13975, loss: 0.889957, best loss: 0.713048 2025-01-16 01:53:07,400 - INFO - step 13976, loss: 0.841924, best loss: 0.713048 2025-01-16 01:53:07,550 - INFO - step 13977, loss: 0.890365, best loss: 0.713048 2025-01-16 01:53:07,700 - INFO - step 13978, loss: 0.835687, best loss: 0.713048 2025-01-16 01:53:07,850 - INFO - step 13979, loss: 1.003153, best loss: 0.713048 2025-01-16 01:53:08,001 - INFO - step 13980, loss: 0.845148, best loss: 0.713048 
2025-01-16 01:53:08,151 - INFO - step 13981, loss: 0.929736, best loss: 0.713048 2025-01-16 01:53:08,301 - INFO - step 13982, loss: 0.890481, best loss: 0.713048 2025-01-16 01:53:08,451 - INFO - step 13983, loss: 0.814654, best loss: 0.713048 2025-01-16 01:53:08,601 - INFO - step 13984, loss: 0.841594, best loss: 0.713048 2025-01-16 01:53:08,751 - INFO - step 13985, loss: 0.832420, best loss: 0.713048 2025-01-16 01:53:08,901 - INFO - step 13986, loss: 0.846279, best loss: 0.713048 2025-01-16 01:53:09,051 - INFO - step 13987, loss: 0.979677, best loss: 0.713048 2025-01-16 01:53:09,201 - INFO - step 13988, loss: 0.838374, best loss: 0.713048 2025-01-16 01:53:09,351 - INFO - step 13989, loss: 0.830495, best loss: 0.713048 2025-01-16 01:53:09,501 - INFO - step 13990, loss: 0.851222, best loss: 0.713048 2025-01-16 01:53:09,651 - INFO - step 13991, loss: 0.932906, best loss: 0.713048 2025-01-16 01:53:09,801 - INFO - step 13992, loss: 1.061283, best loss: 0.713048 2025-01-16 01:53:09,951 - INFO - step 13993, loss: 0.788923, best loss: 0.713048 2025-01-16 01:53:10,101 - INFO - step 13994, loss: 1.009724, best loss: 0.713048 2025-01-16 01:53:10,251 - INFO - step 13995, loss: 0.811721, best loss: 0.713048 2025-01-16 01:53:10,401 - INFO - step 13996, loss: 0.769513, best loss: 0.713048 2025-01-16 01:53:10,551 - INFO - step 13997, loss: 0.919275, best loss: 0.713048 2025-01-16 01:53:10,701 - INFO - step 13998, loss: 0.883518, best loss: 0.713048 2025-01-16 01:53:10,851 - INFO - step 13999, loss: 0.911659, best loss: 0.713048 2025-01-16 01:53:11,001 - INFO - step 14000, loss: 0.951238, best loss: 0.713048 2025-01-16 01:53:11,151 - INFO - step 14001, loss: 0.858192, best loss: 0.713048 2025-01-16 01:53:11,301 - INFO - step 14002, loss: 0.963064, best loss: 0.713048 2025-01-16 01:53:11,451 - INFO - step 14003, loss: 0.790777, best loss: 0.713048 2025-01-16 01:53:14,958 - INFO - step 14004, loss: 0.688905, best loss: 0.688905 2025-01-16 01:53:15,109 - INFO - step 14005, loss: 
0.916345, best loss: 0.688905 2025-01-16 01:53:15,258 - INFO - step 14006, loss: 0.982580, best loss: 0.688905 2025-01-16 01:53:15,409 - INFO - step 14007, loss: 0.931621, best loss: 0.688905 2025-01-16 01:53:15,559 - INFO - step 14008, loss: 0.810186, best loss: 0.688905 2025-01-16 01:53:15,709 - INFO - step 14009, loss: 0.795392, best loss: 0.688905 2025-01-16 01:53:15,859 - INFO - step 14010, loss: 0.865260, best loss: 0.688905 2025-01-16 01:53:16,009 - INFO - step 14011, loss: 0.894038, best loss: 0.688905 2025-01-16 01:53:16,160 - INFO - step 14012, loss: 0.934619, best loss: 0.688905 2025-01-16 01:53:16,310 - INFO - step 14013, loss: 0.823554, best loss: 0.688905 2025-01-16 01:53:16,460 - INFO - step 14014, loss: 1.007871, best loss: 0.688905 2025-01-16 01:53:16,610 - INFO - step 14015, loss: 0.967687, best loss: 0.688905 2025-01-16 01:53:16,761 - INFO - step 14016, loss: 0.965638, best loss: 0.688905 2025-01-16 01:53:16,912 - INFO - step 14017, loss: 0.864758, best loss: 0.688905 2025-01-16 01:53:17,062 - INFO - step 14018, loss: 0.864180, best loss: 0.688905 2025-01-16 01:53:17,212 - INFO - step 14019, loss: 0.899844, best loss: 0.688905 2025-01-16 01:53:17,362 - INFO - step 14020, loss: 0.820622, best loss: 0.688905 2025-01-16 01:53:17,512 - INFO - step 14021, loss: 0.852891, best loss: 0.688905 2025-01-16 01:53:17,662 - INFO - step 14022, loss: 0.865605, best loss: 0.688905 2025-01-16 01:53:17,812 - INFO - step 14023, loss: 0.711262, best loss: 0.688905 2025-01-16 01:53:17,962 - INFO - step 14024, loss: 0.931759, best loss: 0.688905 2025-01-16 01:53:18,116 - INFO - step 14025, loss: 0.805197, best loss: 0.688905 2025-01-16 01:53:18,266 - INFO - step 14026, loss: 1.039683, best loss: 0.688905 2025-01-16 01:53:18,416 - INFO - step 14027, loss: 0.893895, best loss: 0.688905 2025-01-16 01:53:18,567 - INFO - step 14028, loss: 0.902041, best loss: 0.688905 2025-01-16 01:53:18,717 - INFO - step 14029, loss: 0.942872, best loss: 0.688905 2025-01-16 01:53:18,867 - 
INFO - step 14030, loss: 0.856771, best loss: 0.688905 2025-01-16 01:53:19,017 - INFO - step 14031, loss: 0.806556, best loss: 0.688905 2025-01-16 01:53:19,167 - INFO - step 14032, loss: 0.873253, best loss: 0.688905 2025-01-16 01:53:19,317 - INFO - step 14033, loss: 0.884681, best loss: 0.688905 2025-01-16 01:53:19,468 - INFO - step 14034, loss: 0.888852, best loss: 0.688905 2025-01-16 01:53:19,618 - INFO - step 14035, loss: 0.828945, best loss: 0.688905 2025-01-16 01:53:19,768 - INFO - step 14036, loss: 0.955890, best loss: 0.688905 2025-01-16 01:53:19,918 - INFO - step 14037, loss: 1.025201, best loss: 0.688905 2025-01-16 01:53:20,068 - INFO - step 14038, loss: 0.936353, best loss: 0.688905 2025-01-16 01:53:20,218 - INFO - step 14039, loss: 0.978788, best loss: 0.688905 2025-01-16 01:53:20,368 - INFO - step 14040, loss: 1.088382, best loss: 0.688905 2025-01-16 01:53:20,519 - INFO - step 14041, loss: 0.845769, best loss: 0.688905 2025-01-16 01:53:20,669 - INFO - step 14042, loss: 0.965747, best loss: 0.688905 2025-01-16 01:53:20,819 - INFO - step 14043, loss: 0.988613, best loss: 0.688905 2025-01-16 01:53:20,969 - INFO - step 14044, loss: 0.897570, best loss: 0.688905 2025-01-16 01:53:21,119 - INFO - step 14045, loss: 0.839024, best loss: 0.688905 2025-01-16 01:53:21,269 - INFO - step 14046, loss: 0.991383, best loss: 0.688905 2025-01-16 01:53:21,419 - INFO - step 14047, loss: 0.936552, best loss: 0.688905 2025-01-16 01:53:21,569 - INFO - step 14048, loss: 0.724591, best loss: 0.688905 2025-01-16 01:53:21,719 - INFO - step 14049, loss: 1.034969, best loss: 0.688905 2025-01-16 01:53:21,869 - INFO - step 14050, loss: 0.912803, best loss: 0.688905 2025-01-16 01:53:22,020 - INFO - step 14051, loss: 0.909790, best loss: 0.688905 2025-01-16 01:53:22,170 - INFO - step 14052, loss: 0.960823, best loss: 0.688905 2025-01-16 01:53:22,320 - INFO - step 14053, loss: 0.990042, best loss: 0.688905 2025-01-16 01:53:22,470 - INFO - step 14054, loss: 1.008713, best loss: 0.688905 
2025-01-16 01:53:22,620 - INFO - step 14055, loss: 0.962411, best loss: 0.688905 2025-01-16 01:53:22,770 - INFO - step 14056, loss: 0.977916, best loss: 0.688905 2025-01-16 01:53:22,920 - INFO - step 14057, loss: 0.891350, best loss: 0.688905 2025-01-16 01:53:23,070 - INFO - step 14058, loss: 0.847175, best loss: 0.688905 2025-01-16 01:53:23,220 - INFO - step 14059, loss: 0.785695, best loss: 0.688905 2025-01-16 01:53:23,371 - INFO - step 14060, loss: 0.888858, best loss: 0.688905 2025-01-16 01:53:23,521 - INFO - step 14061, loss: 0.867622, best loss: 0.688905 2025-01-16 01:53:23,671 - INFO - step 14062, loss: 0.928007, best loss: 0.688905 2025-01-16 01:53:23,821 - INFO - step 14063, loss: 0.923985, best loss: 0.688905 2025-01-16 01:53:23,971 - INFO - step 14064, loss: 0.899300, best loss: 0.688905 2025-01-16 01:53:24,121 - INFO - step 14065, loss: 0.830979, best loss: 0.688905 2025-01-16 01:53:24,271 - INFO - step 14066, loss: 0.797722, best loss: 0.688905 2025-01-16 01:53:24,422 - INFO - step 14067, loss: 0.846974, best loss: 0.688905 2025-01-16 01:53:24,572 - INFO - step 14068, loss: 0.899613, best loss: 0.688905 2025-01-16 01:53:24,722 - INFO - step 14069, loss: 0.882574, best loss: 0.688905 2025-01-16 01:53:24,872 - INFO - step 14070, loss: 0.807369, best loss: 0.688905 2025-01-16 01:53:25,022 - INFO - step 14071, loss: 0.875680, best loss: 0.688905 2025-01-16 01:53:25,172 - INFO - step 14072, loss: 0.929501, best loss: 0.688905 2025-01-16 01:53:25,322 - INFO - step 14073, loss: 0.939192, best loss: 0.688905 2025-01-16 01:53:25,472 - INFO - step 14074, loss: 0.919305, best loss: 0.688905 2025-01-16 01:53:25,622 - INFO - step 14075, loss: 0.941412, best loss: 0.688905 2025-01-16 01:53:25,773 - INFO - step 14076, loss: 0.980433, best loss: 0.688905 2025-01-16 01:53:25,923 - INFO - step 14077, loss: 0.926922, best loss: 0.688905 2025-01-16 01:53:26,073 - INFO - step 14078, loss: 0.834931, best loss: 0.688905 2025-01-16 01:53:26,223 - INFO - step 14079, loss: 
0.934017, best loss: 0.688905 2025-01-16 01:53:26,373 - INFO - step 14080, loss: 1.004827, best loss: 0.688905 2025-01-16 01:53:26,523 - INFO - step 14081, loss: 0.986977, best loss: 0.688905 2025-01-16 01:53:26,673 - INFO - step 14082, loss: 1.036388, best loss: 0.688905 2025-01-16 01:53:26,823 - INFO - step 14083, loss: 1.034953, best loss: 0.688905 2025-01-16 01:53:26,972 - INFO - step 14084, loss: 1.010781, best loss: 0.688905 2025-01-16 01:53:27,122 - INFO - step 14085, loss: 1.065917, best loss: 0.688905 2025-01-16 01:53:27,272 - INFO - step 14086, loss: 0.943289, best loss: 0.688905 2025-01-16 01:53:27,423 - INFO - step 14087, loss: 0.846146, best loss: 0.688905 2025-01-16 01:53:27,573 - INFO - step 14088, loss: 0.766820, best loss: 0.688905 2025-01-16 01:53:27,723 - INFO - step 14089, loss: 0.961969, best loss: 0.688905 2025-01-16 01:53:27,873 - INFO - step 14090, loss: 0.974929, best loss: 0.688905 2025-01-16 01:53:28,023 - INFO - step 14091, loss: 0.783426, best loss: 0.688905 2025-01-16 01:53:28,173 - INFO - step 14092, loss: 0.797769, best loss: 0.688905 2025-01-16 01:53:28,323 - INFO - step 14093, loss: 0.975330, best loss: 0.688905 2025-01-16 01:53:28,473 - INFO - step 14094, loss: 1.032642, best loss: 0.688905 2025-01-16 01:53:28,623 - INFO - step 14095, loss: 0.881297, best loss: 0.688905 2025-01-16 01:53:28,773 - INFO - step 14096, loss: 0.954463, best loss: 0.688905 2025-01-16 01:53:28,923 - INFO - step 14097, loss: 0.869151, best loss: 0.688905 2025-01-16 01:53:29,074 - INFO - step 14098, loss: 0.698873, best loss: 0.688905 2025-01-16 01:53:29,224 - INFO - step 14099, loss: 0.926184, best loss: 0.688905 2025-01-16 01:53:29,374 - INFO - step 14100, loss: 1.003076, best loss: 0.688905 2025-01-16 01:53:29,524 - INFO - step 14101, loss: 1.013234, best loss: 0.688905 2025-01-16 01:53:29,674 - INFO - step 14102, loss: 0.953132, best loss: 0.688905 2025-01-16 01:53:29,824 - INFO - step 14103, loss: 0.823741, best loss: 0.688905 2025-01-16 01:53:29,974 - 
INFO - step 14104, loss: 1.001171, best loss: 0.688905 2025-01-16 01:53:30,124 - INFO - step 14105, loss: 0.987675, best loss: 0.688905 2025-01-16 01:53:30,274 - INFO - step 14106, loss: 0.870287, best loss: 0.688905 2025-01-16 01:53:30,424 - INFO - step 14107, loss: 0.901727, best loss: 0.688905 2025-01-16 01:53:30,574 - INFO - step 14108, loss: 0.794141, best loss: 0.688905 2025-01-16 01:53:33,854 - INFO - step 14109, loss: 0.685245, best loss: 0.685245 2025-01-16 01:53:34,015 - INFO - step 14110, loss: 0.823704, best loss: 0.685245 2025-01-16 01:53:34,166 - INFO - step 14111, loss: 0.805311, best loss: 0.685245 2025-01-16 01:53:34,317 - INFO - step 14112, loss: 0.883291, best loss: 0.685245 2025-01-16 01:53:34,467 - INFO - step 14113, loss: 0.785178, best loss: 0.685245 2025-01-16 01:53:34,617 - INFO - step 14114, loss: 0.776485, best loss: 0.685245 2025-01-16 01:53:34,767 - INFO - step 14115, loss: 0.743160, best loss: 0.685245 2025-01-16 01:53:34,917 - INFO - step 14116, loss: 0.852557, best loss: 0.685245 2025-01-16 01:53:35,067 - INFO - step 14117, loss: 0.988261, best loss: 0.685245 2025-01-16 01:53:35,217 - INFO - step 14118, loss: 0.874818, best loss: 0.685245 2025-01-16 01:53:35,367 - INFO - step 14119, loss: 1.016879, best loss: 0.685245 2025-01-16 01:53:35,518 - INFO - step 14120, loss: 0.880299, best loss: 0.685245 2025-01-16 01:53:35,668 - INFO - step 14121, loss: 0.838773, best loss: 0.685245 2025-01-16 01:53:35,818 - INFO - step 14122, loss: 0.867149, best loss: 0.685245 2025-01-16 01:53:35,968 - INFO - step 14123, loss: 0.791288, best loss: 0.685245 2025-01-16 01:53:36,118 - INFO - step 14124, loss: 0.821951, best loss: 0.685245 2025-01-16 01:53:36,268 - INFO - step 14125, loss: 0.810803, best loss: 0.685245 2025-01-16 01:53:36,418 - INFO - step 14126, loss: 0.746101, best loss: 0.685245 2025-01-16 01:53:36,568 - INFO - step 14127, loss: 0.728507, best loss: 0.685245 2025-01-16 01:53:36,718 - INFO - step 14128, loss: 0.761356, best loss: 0.685245 
2025-01-16 01:53:36,868 - INFO - step 14129, loss: 0.817546, best loss: 0.685245 2025-01-16 01:53:37,018 - INFO - step 14130, loss: 0.773558, best loss: 0.685245 2025-01-16 01:53:37,168 - INFO - step 14131, loss: 0.807619, best loss: 0.685245 2025-01-16 01:53:37,318 - INFO - step 14132, loss: 0.773468, best loss: 0.685245 2025-01-16 01:53:37,468 - INFO - step 14133, loss: 0.738701, best loss: 0.685245 2025-01-16 01:53:37,618 - INFO - step 14134, loss: 0.772134, best loss: 0.685245 2025-01-16 01:53:37,768 - INFO - step 14135, loss: 0.881952, best loss: 0.685245 2025-01-16 01:53:37,918 - INFO - step 14136, loss: 0.730813, best loss: 0.685245 2025-01-16 01:53:38,068 - INFO - step 14137, loss: 0.768180, best loss: 0.685245 2025-01-16 01:53:38,219 - INFO - step 14138, loss: 0.836215, best loss: 0.685245 2025-01-16 01:53:38,369 - INFO - step 14139, loss: 0.871171, best loss: 0.685245 2025-01-16 01:53:38,519 - INFO - step 14140, loss: 0.857386, best loss: 0.685245 2025-01-16 01:53:38,669 - INFO - step 14141, loss: 0.748990, best loss: 0.685245 2025-01-16 01:53:38,819 - INFO - step 14142, loss: 0.791079, best loss: 0.685245 2025-01-16 01:53:38,969 - INFO - step 14143, loss: 0.782497, best loss: 0.685245 2025-01-16 01:53:39,119 - INFO - step 14144, loss: 0.929939, best loss: 0.685245 2025-01-16 01:53:39,269 - INFO - step 14145, loss: 0.856657, best loss: 0.685245 2025-01-16 01:53:39,420 - INFO - step 14146, loss: 0.885616, best loss: 0.685245 2025-01-16 01:53:39,570 - INFO - step 14147, loss: 0.809594, best loss: 0.685245 2025-01-16 01:53:39,721 - INFO - step 14148, loss: 0.904203, best loss: 0.685245 2025-01-16 01:53:39,871 - INFO - step 14149, loss: 0.889319, best loss: 0.685245 2025-01-16 01:53:40,021 - INFO - step 14150, loss: 0.894608, best loss: 0.685245 2025-01-16 01:53:40,171 - INFO - step 14151, loss: 0.798674, best loss: 0.685245 2025-01-16 01:53:40,321 - INFO - step 14152, loss: 0.853592, best loss: 0.685245 2025-01-16 01:53:40,471 - INFO - step 14153, loss: 
0.814306, best loss: 0.685245 2025-01-16 01:53:40,621 - INFO - step 14154, loss: 0.926374, best loss: 0.685245 2025-01-16 01:53:40,771 - INFO - step 14155, loss: 0.843979, best loss: 0.685245 2025-01-16 01:53:40,921 - INFO - step 14156, loss: 0.828123, best loss: 0.685245 2025-01-16 01:53:41,072 - INFO - step 14157, loss: 0.706528, best loss: 0.685245 2025-01-16 01:53:41,222 - INFO - step 14158, loss: 0.875862, best loss: 0.685245 2025-01-16 01:53:41,372 - INFO - step 14159, loss: 0.882134, best loss: 0.685245 2025-01-16 01:53:41,522 - INFO - step 14160, loss: 0.785977, best loss: 0.685245 2025-01-16 01:53:41,672 - INFO - step 14161, loss: 0.867716, best loss: 0.685245 2025-01-16 01:53:41,822 - INFO - step 14162, loss: 0.784983, best loss: 0.685245 2025-01-16 01:53:41,972 - INFO - step 14163, loss: 0.862567, best loss: 0.685245 2025-01-16 01:53:42,122 - INFO - step 14164, loss: 0.818007, best loss: 0.685245 2025-01-16 01:53:42,272 - INFO - step 14165, loss: 0.752508, best loss: 0.685245 2025-01-16 01:53:42,422 - INFO - step 14166, loss: 0.773763, best loss: 0.685245 2025-01-16 01:53:42,572 - INFO - step 14167, loss: 0.876145, best loss: 0.685245 2025-01-16 01:53:42,722 - INFO - step 14168, loss: 0.771955, best loss: 0.685245 2025-01-16 01:53:42,872 - INFO - step 14169, loss: 0.853395, best loss: 0.685245 2025-01-16 01:53:43,022 - INFO - step 14170, loss: 0.753152, best loss: 0.685245 2025-01-16 01:53:43,172 - INFO - step 14171, loss: 0.795494, best loss: 0.685245 2025-01-16 01:53:43,322 - INFO - step 14172, loss: 1.043705, best loss: 0.685245 2025-01-16 01:53:43,472 - INFO - step 14173, loss: 0.880654, best loss: 0.685245 2025-01-16 01:53:43,622 - INFO - step 14174, loss: 0.824945, best loss: 0.685245 2025-01-16 01:53:43,772 - INFO - step 14175, loss: 0.694440, best loss: 0.685245 2025-01-16 01:53:47,281 - INFO - step 14176, loss: 0.677938, best loss: 0.677938 2025-01-16 01:53:47,432 - INFO - step 14177, loss: 0.704635, best loss: 0.677938 2025-01-16 01:53:47,582 - 
INFO - step 14178, loss: 0.820024, best loss: 0.677938 2025-01-16 01:53:47,732 - INFO - step 14179, loss: 0.825313, best loss: 0.677938 2025-01-16 01:53:47,882 - INFO - step 14180, loss: 0.884935, best loss: 0.677938 2025-01-16 01:53:48,033 - INFO - step 14181, loss: 0.887969, best loss: 0.677938 2025-01-16 01:53:48,183 - INFO - step 14182, loss: 0.772149, best loss: 0.677938 2025-01-16 01:53:48,333 - INFO - step 14183, loss: 0.804663, best loss: 0.677938 2025-01-16 01:53:48,483 - INFO - step 14184, loss: 0.776005, best loss: 0.677938 2025-01-16 01:53:48,633 - INFO - step 14185, loss: 0.873794, best loss: 0.677938 2025-01-16 01:53:48,783 - INFO - step 14186, loss: 0.931409, best loss: 0.677938 2025-01-16 01:53:52,300 - INFO - step 14187, loss: 0.643198, best loss: 0.643198 2025-01-16 01:53:52,451 - INFO - step 14188, loss: 0.802434, best loss: 0.643198 2025-01-16 01:53:52,601 - INFO - step 14189, loss: 0.694912, best loss: 0.643198 2025-01-16 01:53:52,751 - INFO - step 14190, loss: 0.824402, best loss: 0.643198 2025-01-16 01:53:52,901 - INFO - step 14191, loss: 0.823129, best loss: 0.643198 2025-01-16 01:53:53,051 - INFO - step 14192, loss: 0.781924, best loss: 0.643198 2025-01-16 01:53:53,202 - INFO - step 14193, loss: 0.883045, best loss: 0.643198 2025-01-16 01:53:53,352 - INFO - step 14194, loss: 0.801404, best loss: 0.643198 2025-01-16 01:53:53,502 - INFO - step 14195, loss: 0.805276, best loss: 0.643198 2025-01-16 01:53:53,652 - INFO - step 14196, loss: 0.829983, best loss: 0.643198 2025-01-16 01:53:53,802 - INFO - step 14197, loss: 0.819406, best loss: 0.643198 2025-01-16 01:53:53,953 - INFO - step 14198, loss: 0.898616, best loss: 0.643198 2025-01-16 01:53:54,103 - INFO - step 14199, loss: 0.896204, best loss: 0.643198 2025-01-16 01:53:54,253 - INFO - step 14200, loss: 0.770827, best loss: 0.643198 2025-01-16 01:53:54,403 - INFO - step 14201, loss: 0.767984, best loss: 0.643198 2025-01-16 01:53:54,553 - INFO - step 14202, loss: 0.715811, best loss: 0.643198 
2025-01-16 01:53:54,703 - INFO - step 14203, loss: 0.967620, best loss: 0.643198 2025-01-16 01:53:54,854 - INFO - step 14204, loss: 0.939196, best loss: 0.643198 2025-01-16 01:53:55,004 - INFO - step 14205, loss: 0.884960, best loss: 0.643198 2025-01-16 01:53:55,154 - INFO - step 14206, loss: 0.906410, best loss: 0.643198 2025-01-16 01:53:55,304 - INFO - step 14207, loss: 0.852508, best loss: 0.643198 2025-01-16 01:53:55,454 - INFO - step 14208, loss: 0.860859, best loss: 0.643198 2025-01-16 01:53:55,605 - INFO - step 14209, loss: 0.873120, best loss: 0.643198 2025-01-16 01:53:55,755 - INFO - step 14210, loss: 0.784957, best loss: 0.643198 2025-01-16 01:53:55,905 - INFO - step 14211, loss: 0.951754, best loss: 0.643198 2025-01-16 01:53:56,055 - INFO - step 14212, loss: 0.776618, best loss: 0.643198 2025-01-16 01:53:56,206 - INFO - step 14213, loss: 0.834366, best loss: 0.643198 2025-01-16 01:53:56,356 - INFO - step 14214, loss: 0.820715, best loss: 0.643198 2025-01-16 01:53:56,506 - INFO - step 14215, loss: 0.956039, best loss: 0.643198 2025-01-16 01:53:56,656 - INFO - step 14216, loss: 0.809573, best loss: 0.643198 2025-01-16 01:53:56,806 - INFO - step 14217, loss: 0.737284, best loss: 0.643198 2025-01-16 01:53:56,956 - INFO - step 14218, loss: 0.889836, best loss: 0.643198 2025-01-16 01:53:57,106 - INFO - step 14219, loss: 0.926295, best loss: 0.643198 2025-01-16 01:53:57,256 - INFO - step 14220, loss: 0.721868, best loss: 0.643198 2025-01-16 01:53:57,406 - INFO - step 14221, loss: 0.798784, best loss: 0.643198 2025-01-16 01:53:57,556 - INFO - step 14222, loss: 0.851104, best loss: 0.643198 2025-01-16 01:53:57,707 - INFO - step 14223, loss: 0.850482, best loss: 0.643198 2025-01-16 01:53:57,857 - INFO - step 14224, loss: 0.780663, best loss: 0.643198 2025-01-16 01:53:58,007 - INFO - step 14225, loss: 0.744040, best loss: 0.643198 2025-01-16 01:53:58,157 - INFO - step 14226, loss: 0.910824, best loss: 0.643198 2025-01-16 01:53:58,307 - INFO - step 14227, loss: 
0.868471, best loss: 0.643198 2025-01-16 01:53:58,458 - INFO - step 14228, loss: 0.835485, best loss: 0.643198 2025-01-16 01:53:58,608 - INFO - step 14229, loss: 0.737244, best loss: 0.643198 2025-01-16 01:53:58,758 - INFO - step 14230, loss: 0.863557, best loss: 0.643198 2025-01-16 01:53:58,909 - INFO - step 14231, loss: 0.824861, best loss: 0.643198 2025-01-16 01:53:59,059 - INFO - step 14232, loss: 0.881850, best loss: 0.643198 2025-01-16 01:53:59,209 - INFO - step 14233, loss: 0.810750, best loss: 0.643198 2025-01-16 01:53:59,359 - INFO - step 14234, loss: 0.794443, best loss: 0.643198 2025-01-16 01:53:59,510 - INFO - step 14235, loss: 0.724839, best loss: 0.643198 2025-01-16 01:53:59,660 - INFO - step 14236, loss: 0.919489, best loss: 0.643198 2025-01-16 01:53:59,810 - INFO - step 14237, loss: 0.873296, best loss: 0.643198 2025-01-16 01:53:59,960 - INFO - step 14238, loss: 0.853380, best loss: 0.643198 2025-01-16 01:54:00,110 - INFO - step 14239, loss: 0.729106, best loss: 0.643198 2025-01-16 01:54:00,260 - INFO - step 14240, loss: 0.753051, best loss: 0.643198 2025-01-16 01:54:00,410 - INFO - step 14241, loss: 0.755155, best loss: 0.643198 2025-01-16 01:54:00,560 - INFO - step 14242, loss: 0.858491, best loss: 0.643198 2025-01-16 01:54:00,710 - INFO - step 14243, loss: 0.791348, best loss: 0.643198 2025-01-16 01:54:00,861 - INFO - step 14244, loss: 0.781653, best loss: 0.643198 2025-01-16 01:54:01,011 - INFO - step 14245, loss: 0.767127, best loss: 0.643198 2025-01-16 01:54:01,161 - INFO - step 14246, loss: 0.824062, best loss: 0.643198 2025-01-16 01:54:01,311 - INFO - step 14247, loss: 0.779511, best loss: 0.643198 2025-01-16 01:54:01,461 - INFO - step 14248, loss: 0.720870, best loss: 0.643198 2025-01-16 01:54:01,611 - INFO - step 14249, loss: 0.868425, best loss: 0.643198 2025-01-16 01:54:01,762 - INFO - step 14250, loss: 0.861582, best loss: 0.643198 2025-01-16 01:54:01,912 - INFO - step 14251, loss: 0.888923, best loss: 0.643198 2025-01-16 01:54:02,062 - 
INFO - step 14252, loss: 0.940206, best loss: 0.643198 2025-01-16 01:54:02,212 - INFO - step 14253, loss: 0.891151, best loss: 0.643198 2025-01-16 01:54:02,362 - INFO - step 14254, loss: 0.838474, best loss: 0.643198 2025-01-16 01:54:02,512 - INFO - step 14255, loss: 0.696662, best loss: 0.643198 2025-01-16 01:54:02,663 - INFO - step 14256, loss: 0.941303, best loss: 0.643198 2025-01-16 01:54:02,813 - INFO - step 14257, loss: 0.845591, best loss: 0.643198 2025-01-16 01:54:02,963 - INFO - step 14258, loss: 0.740613, best loss: 0.643198 2025-01-16 01:54:03,113 - INFO - step 14259, loss: 0.790707, best loss: 0.643198 2025-01-16 01:54:03,263 - INFO - step 14260, loss: 0.859986, best loss: 0.643198 2025-01-16 01:54:03,413 - INFO - step 14261, loss: 0.798991, best loss: 0.643198 2025-01-16 01:54:03,563 - INFO - step 14262, loss: 0.903720, best loss: 0.643198 2025-01-16 01:54:03,713 - INFO - step 14263, loss: 0.821170, best loss: 0.643198 2025-01-16 01:54:03,864 - INFO - step 14264, loss: 0.906061, best loss: 0.643198 2025-01-16 01:54:04,014 - INFO - step 14265, loss: 0.805354, best loss: 0.643198 2025-01-16 01:54:04,164 - INFO - step 14266, loss: 0.784942, best loss: 0.643198 2025-01-16 01:54:04,314 - INFO - step 14267, loss: 0.802784, best loss: 0.643198 2025-01-16 01:54:04,464 - INFO - step 14268, loss: 0.823402, best loss: 0.643198 2025-01-16 01:54:04,614 - INFO - step 14269, loss: 0.753532, best loss: 0.643198 2025-01-16 01:54:04,764 - INFO - step 14270, loss: 0.843934, best loss: 0.643198 2025-01-16 01:54:04,914 - INFO - step 14271, loss: 0.778324, best loss: 0.643198 2025-01-16 01:54:05,064 - INFO - step 14272, loss: 0.760337, best loss: 0.643198 2025-01-16 01:54:05,214 - INFO - step 14273, loss: 0.804530, best loss: 0.643198 2025-01-16 01:54:05,364 - INFO - step 14274, loss: 0.787259, best loss: 0.643198 2025-01-16 01:54:05,514 - INFO - step 14275, loss: 0.891551, best loss: 0.643198 2025-01-16 01:54:05,664 - INFO - step 14276, loss: 0.681764, best loss: 0.643198 
2025-01-16 01:54:05,814 - INFO - step 14277, loss: 0.773497, best loss: 0.643198 2025-01-16 01:54:05,964 - INFO - step 14278, loss: 0.716091, best loss: 0.643198 2025-01-16 01:54:06,115 - INFO - step 14279, loss: 0.807540, best loss: 0.643198 2025-01-16 01:54:06,265 - INFO - step 14280, loss: 0.684441, best loss: 0.643198 2025-01-16 01:54:06,415 - INFO - step 14281, loss: 0.756456, best loss: 0.643198 2025-01-16 01:54:06,565 - INFO - step 14282, loss: 0.786467, best loss: 0.643198 2025-01-16 01:54:06,715 - INFO - step 14283, loss: 0.840766, best loss: 0.643198 2025-01-16 01:54:06,865 - INFO - step 14284, loss: 0.889343, best loss: 0.643198 2025-01-16 01:54:07,015 - INFO - step 14285, loss: 0.888944, best loss: 0.643198 2025-01-16 01:54:07,165 - INFO - step 14286, loss: 0.848817, best loss: 0.643198 2025-01-16 01:54:07,315 - INFO - step 14287, loss: 0.800590, best loss: 0.643198 2025-01-16 01:54:07,465 - INFO - step 14288, loss: 0.718231, best loss: 0.643198 2025-01-16 01:54:07,616 - INFO - step 14289, loss: 0.811361, best loss: 0.643198 2025-01-16 01:54:07,766 - INFO - step 14290, loss: 0.764684, best loss: 0.643198 2025-01-16 01:54:07,916 - INFO - step 14291, loss: 0.815062, best loss: 0.643198 2025-01-16 01:54:08,066 - INFO - step 14292, loss: 0.719317, best loss: 0.643198 2025-01-16 01:54:08,216 - INFO - step 14293, loss: 0.958474, best loss: 0.643198 2025-01-16 01:54:08,366 - INFO - step 14294, loss: 0.775007, best loss: 0.643198 2025-01-16 01:54:08,517 - INFO - step 14295, loss: 0.724132, best loss: 0.643198 2025-01-16 01:54:08,667 - INFO - step 14296, loss: 0.872114, best loss: 0.643198 2025-01-16 01:54:08,816 - INFO - step 14297, loss: 0.784678, best loss: 0.643198 2025-01-16 01:54:08,967 - INFO - step 14298, loss: 0.751736, best loss: 0.643198 2025-01-16 01:54:09,117 - INFO - step 14299, loss: 0.913634, best loss: 0.643198 2025-01-16 01:54:09,267 - INFO - step 14300, loss: 0.760001, best loss: 0.643198 2025-01-16 01:54:09,417 - INFO - step 14301, loss: 
0.819756, best loss: 0.643198 2025-01-16 01:54:09,567 - INFO - step 14302, loss: 0.797288, best loss: 0.643198 2025-01-16 01:54:09,718 - INFO - step 14303, loss: 0.759893, best loss: 0.643198 2025-01-16 01:54:09,868 - INFO - step 14304, loss: 0.863187, best loss: 0.643198 2025-01-16 01:54:10,018 - INFO - step 14305, loss: 0.767485, best loss: 0.643198 2025-01-16 01:54:10,168 - INFO - step 14306, loss: 0.814993, best loss: 0.643198 2025-01-16 01:54:10,318 - INFO - step 14307, loss: 0.840058, best loss: 0.643198 2025-01-16 01:54:10,468 - INFO - step 14308, loss: 0.734969, best loss: 0.643198 2025-01-16 01:54:10,619 - INFO - step 14309, loss: 0.838307, best loss: 0.643198 2025-01-16 01:54:10,769 - INFO - step 14310, loss: 0.831498, best loss: 0.643198 2025-01-16 01:54:10,919 - INFO - step 14311, loss: 0.849214, best loss: 0.643198 2025-01-16 01:54:11,069 - INFO - step 14312, loss: 0.851291, best loss: 0.643198 2025-01-16 01:54:11,219 - INFO - step 14313, loss: 0.719619, best loss: 0.643198 2025-01-16 01:54:11,369 - INFO - step 14314, loss: 0.754664, best loss: 0.643198 2025-01-16 01:54:11,519 - INFO - step 14315, loss: 0.758656, best loss: 0.643198 2025-01-16 01:54:11,669 - INFO - step 14316, loss: 0.750406, best loss: 0.643198 2025-01-16 01:54:11,819 - INFO - step 14317, loss: 0.915342, best loss: 0.643198 2025-01-16 01:54:11,969 - INFO - step 14318, loss: 0.686815, best loss: 0.643198 2025-01-16 01:54:12,119 - INFO - step 14319, loss: 0.728773, best loss: 0.643198 2025-01-16 01:54:12,269 - INFO - step 14320, loss: 0.735794, best loss: 0.643198 2025-01-16 01:54:12,419 - INFO - step 14321, loss: 0.841450, best loss: 0.643198 2025-01-16 01:54:12,569 - INFO - step 14322, loss: 0.950480, best loss: 0.643198 2025-01-16 01:54:12,719 - INFO - step 14323, loss: 0.698223, best loss: 0.643198 2025-01-16 01:54:12,870 - INFO - step 14324, loss: 0.863634, best loss: 0.643198 2025-01-16 01:54:13,020 - INFO - step 14325, loss: 0.693236, best loss: 0.643198 2025-01-16 01:54:13,170 - 
INFO - step 14326, loss: 0.654169, best loss: 0.643198 2025-01-16 01:54:13,320 - INFO - step 14327, loss: 0.788198, best loss: 0.643198 2025-01-16 01:54:13,470 - INFO - step 14328, loss: 0.768915, best loss: 0.643198 2025-01-16 01:54:13,620 - INFO - step 14329, loss: 0.762750, best loss: 0.643198 2025-01-16 01:54:13,770 - INFO - step 14330, loss: 0.848481, best loss: 0.643198 2025-01-16 01:54:13,920 - INFO - step 14331, loss: 0.746305, best loss: 0.643198 2025-01-16 01:54:14,070 - INFO - step 14332, loss: 0.837454, best loss: 0.643198 2025-01-16 01:54:14,221 - INFO - step 14333, loss: 0.752501, best loss: 0.643198 2025-01-16 01:54:17,685 - INFO - step 14334, loss: 0.631632, best loss: 0.631632 2025-01-16 01:54:17,836 - INFO - step 14335, loss: 0.790763, best loss: 0.631632 2025-01-16 01:54:17,987 - INFO - step 14336, loss: 0.858180, best loss: 0.631632 2025-01-16 01:54:18,137 - INFO - step 14337, loss: 0.768942, best loss: 0.631632 2025-01-16 01:54:18,287 - INFO - step 14338, loss: 0.685220, best loss: 0.631632 2025-01-16 01:54:18,437 - INFO - step 14339, loss: 0.702456, best loss: 0.631632 2025-01-16 01:54:18,587 - INFO - step 14340, loss: 0.763589, best loss: 0.631632 2025-01-16 01:54:18,738 - INFO - step 14341, loss: 0.816042, best loss: 0.631632 2025-01-16 01:54:18,889 - INFO - step 14342, loss: 0.790953, best loss: 0.631632 2025-01-16 01:54:19,039 - INFO - step 14343, loss: 0.788906, best loss: 0.631632 2025-01-16 01:54:19,190 - INFO - step 14344, loss: 0.840630, best loss: 0.631632 2025-01-16 01:54:19,341 - INFO - step 14345, loss: 0.858460, best loss: 0.631632 2025-01-16 01:54:19,493 - INFO - step 14346, loss: 0.862746, best loss: 0.631632 2025-01-16 01:54:19,645 - INFO - step 14347, loss: 0.757869, best loss: 0.631632 2025-01-16 01:54:19,796 - INFO - step 14348, loss: 0.838188, best loss: 0.631632 2025-01-16 01:54:19,946 - INFO - step 14349, loss: 0.827980, best loss: 0.631632 2025-01-16 01:54:20,096 - INFO - step 14350, loss: 0.745682, best loss: 0.631632 
2025-01-16 01:54:20,247 - INFO - step 14351, loss: 0.793857, best loss: 0.631632 2025-01-16 01:54:20,398 - INFO - step 14352, loss: 0.770369, best loss: 0.631632 2025-01-16 01:54:20,548 - INFO - step 14353, loss: 0.646164, best loss: 0.631632 2025-01-16 01:54:20,698 - INFO - step 14354, loss: 0.774807, best loss: 0.631632 2025-01-16 01:54:20,849 - INFO - step 14355, loss: 0.763126, best loss: 0.631632 2025-01-16 01:54:20,999 - INFO - step 14356, loss: 0.940784, best loss: 0.631632 2025-01-16 01:54:21,149 - INFO - step 14357, loss: 0.825316, best loss: 0.631632 2025-01-16 01:54:21,299 - INFO - step 14358, loss: 0.779383, best loss: 0.631632 2025-01-16 01:54:21,449 - INFO - step 14359, loss: 0.839968, best loss: 0.631632 2025-01-16 01:54:21,600 - INFO - step 14360, loss: 0.767383, best loss: 0.631632 2025-01-16 01:54:21,750 - INFO - step 14361, loss: 0.684123, best loss: 0.631632 2025-01-16 01:54:21,900 - INFO - step 14362, loss: 0.791798, best loss: 0.631632 2025-01-16 01:54:22,050 - INFO - step 14363, loss: 0.761284, best loss: 0.631632 2025-01-16 01:54:22,200 - INFO - step 14364, loss: 0.780796, best loss: 0.631632 2025-01-16 01:54:22,350 - INFO - step 14365, loss: 0.766616, best loss: 0.631632 2025-01-16 01:54:22,500 - INFO - step 14366, loss: 0.839950, best loss: 0.631632 2025-01-16 01:54:22,650 - INFO - step 14367, loss: 0.882657, best loss: 0.631632 2025-01-16 01:54:22,800 - INFO - step 14368, loss: 0.823693, best loss: 0.631632 2025-01-16 01:54:22,950 - INFO - step 14369, loss: 0.797160, best loss: 0.631632 2025-01-16 01:54:23,101 - INFO - step 14370, loss: 0.922390, best loss: 0.631632 2025-01-16 01:54:23,251 - INFO - step 14371, loss: 0.808740, best loss: 0.631632 2025-01-16 01:54:23,401 - INFO - step 14372, loss: 0.839038, best loss: 0.631632 2025-01-16 01:54:23,551 - INFO - step 14373, loss: 0.864785, best loss: 0.631632 2025-01-16 01:54:23,701 - INFO - step 14374, loss: 0.789488, best loss: 0.631632 2025-01-16 01:54:23,851 - INFO - step 14375, loss: 
0.734886, best loss: 0.631632 2025-01-16 01:54:24,001 - INFO - step 14376, loss: 0.897944, best loss: 0.631632 2025-01-16 01:54:24,151 - INFO - step 14377, loss: 0.828345, best loss: 0.631632 2025-01-16 01:54:27,664 - INFO - step 14378, loss: 0.624992, best loss: 0.624992 2025-01-16 01:54:27,827 - INFO - step 14379, loss: 0.884345, best loss: 0.624992 2025-01-16 01:54:27,978 - INFO - step 14380, loss: 0.756650, best loss: 0.624992 2025-01-16 01:54:28,129 - INFO - step 14381, loss: 0.811466, best loss: 0.624992 2025-01-16 01:54:28,279 - INFO - step 14382, loss: 0.804398, best loss: 0.624992 2025-01-16 01:54:28,429 - INFO - step 14383, loss: 0.813881, best loss: 0.624992 2025-01-16 01:54:28,578 - INFO - step 14384, loss: 0.879313, best loss: 0.624992 2025-01-16 01:54:28,729 - INFO - step 14385, loss: 0.880907, best loss: 0.624992 2025-01-16 01:54:28,879 - INFO - step 14386, loss: 0.902495, best loss: 0.624992 2025-01-16 01:54:29,029 - INFO - step 14387, loss: 0.770608, best loss: 0.624992 2025-01-16 01:54:29,179 - INFO - step 14388, loss: 0.767345, best loss: 0.624992 2025-01-16 01:54:29,329 - INFO - step 14389, loss: 0.683958, best loss: 0.624992 2025-01-16 01:54:29,480 - INFO - step 14390, loss: 0.752525, best loss: 0.624992 2025-01-16 01:54:29,630 - INFO - step 14391, loss: 0.753134, best loss: 0.624992 2025-01-16 01:54:29,781 - INFO - step 14392, loss: 0.785397, best loss: 0.624992 2025-01-16 01:54:29,931 - INFO - step 14393, loss: 0.819757, best loss: 0.624992 2025-01-16 01:54:30,081 - INFO - step 14394, loss: 0.788988, best loss: 0.624992 2025-01-16 01:54:30,231 - INFO - step 14395, loss: 0.771015, best loss: 0.624992 2025-01-16 01:54:30,381 - INFO - step 14396, loss: 0.747586, best loss: 0.624992 2025-01-16 01:54:30,532 - INFO - step 14397, loss: 0.735946, best loss: 0.624992 2025-01-16 01:54:30,682 - INFO - step 14398, loss: 0.775683, best loss: 0.624992 2025-01-16 01:54:30,832 - INFO - step 14399, loss: 0.739221, best loss: 0.624992 2025-01-16 01:54:30,982 - 
INFO - step 14400, loss: 0.763915, best loss: 0.624992 2025-01-16 01:54:31,132 - INFO - step 14401, loss: 0.823766, best loss: 0.624992 2025-01-16 01:54:31,282 - INFO - step 14402, loss: 0.749130, best loss: 0.624992 2025-01-16 01:54:31,433 - INFO - step 14403, loss: 0.745988, best loss: 0.624992 2025-01-16 01:54:31,583 - INFO - step 14404, loss: 0.826357, best loss: 0.624992 2025-01-16 01:54:31,734 - INFO - step 14405, loss: 0.875049, best loss: 0.624992 2025-01-16 01:54:31,884 - INFO - step 14406, loss: 0.897094, best loss: 0.624992 2025-01-16 01:54:32,034 - INFO - step 14407, loss: 0.828750, best loss: 0.624992 2025-01-16 01:54:32,184 - INFO - step 14408, loss: 0.874121, best loss: 0.624992 2025-01-16 01:54:32,334 - INFO - step 14409, loss: 0.818084, best loss: 0.624992 2025-01-16 01:54:32,484 - INFO - step 14410, loss: 0.883859, best loss: 0.624992 2025-01-16 01:54:32,634 - INFO - step 14411, loss: 0.810839, best loss: 0.624992 2025-01-16 01:54:32,784 - INFO - step 14412, loss: 0.913441, best loss: 0.624992 2025-01-16 01:54:32,934 - INFO - step 14413, loss: 0.960463, best loss: 0.624992 2025-01-16 01:54:33,085 - INFO - step 14414, loss: 0.880113, best loss: 0.624992 2025-01-16 01:54:33,235 - INFO - step 14415, loss: 0.811217, best loss: 0.624992 2025-01-16 01:54:33,385 - INFO - step 14416, loss: 0.833477, best loss: 0.624992 2025-01-16 01:54:33,535 - INFO - step 14417, loss: 0.719665, best loss: 0.624992 2025-01-16 01:54:33,685 - INFO - step 14418, loss: 0.680898, best loss: 0.624992 2025-01-16 01:54:33,835 - INFO - step 14419, loss: 0.821044, best loss: 0.624992 2025-01-16 01:54:33,986 - INFO - step 14420, loss: 0.765842, best loss: 0.624992 2025-01-16 01:54:34,136 - INFO - step 14421, loss: 0.639922, best loss: 0.624992 2025-01-16 01:54:34,286 - INFO - step 14422, loss: 0.729057, best loss: 0.624992 2025-01-16 01:54:34,436 - INFO - step 14423, loss: 0.876848, best loss: 0.624992 2025-01-16 01:54:34,586 - INFO - step 14424, loss: 0.919653, best loss: 0.624992 
2025-01-16 01:54:34,736 - INFO - step 14425, loss: 0.766332, best loss: 0.624992 2025-01-16 01:54:34,886 - INFO - step 14426, loss: 0.859166, best loss: 0.624992 2025-01-16 01:54:35,037 - INFO - step 14427, loss: 0.758522, best loss: 0.624992 2025-01-16 01:54:35,187 - INFO - step 14428, loss: 0.644913, best loss: 0.624992 2025-01-16 01:54:35,337 - INFO - step 14429, loss: 0.787204, best loss: 0.624992 2025-01-16 01:54:35,487 - INFO - step 14430, loss: 0.938269, best loss: 0.624992 2025-01-16 01:54:35,637 - INFO - step 14431, loss: 0.912661, best loss: 0.624992 2025-01-16 01:54:35,787 - INFO - step 14432, loss: 0.894050, best loss: 0.624992 2025-01-16 01:54:35,937 - INFO - step 14433, loss: 0.708287, best loss: 0.624992 2025-01-16 01:54:36,088 - INFO - step 14434, loss: 0.822972, best loss: 0.624992 2025-01-16 01:54:36,238 - INFO - step 14435, loss: 0.846876, best loss: 0.624992 2025-01-16 01:54:36,388 - INFO - step 14436, loss: 0.732848, best loss: 0.624992 2025-01-16 01:54:36,538 - INFO - step 14437, loss: 0.795752, best loss: 0.624992 2025-01-16 01:54:36,688 - INFO - step 14438, loss: 0.673515, best loss: 0.624992 2025-01-16 01:54:40,211 - INFO - step 14439, loss: 0.621424, best loss: 0.621424 2025-01-16 01:54:40,362 - INFO - step 14440, loss: 0.721430, best loss: 0.621424 2025-01-16 01:54:40,512 - INFO - step 14441, loss: 0.722420, best loss: 0.621424 2025-01-16 01:54:40,662 - INFO - step 14442, loss: 0.764064, best loss: 0.621424 2025-01-16 01:54:44,187 - INFO - step 14443, loss: 0.587561, best loss: 0.587561 2025-01-16 01:54:44,338 - INFO - step 14444, loss: 0.676054, best loss: 0.587561 2025-01-16 01:54:44,488 - INFO - step 14445, loss: 0.659651, best loss: 0.587561 2025-01-16 01:54:44,639 - INFO - step 14446, loss: 0.707351, best loss: 0.587561 2025-01-16 01:54:44,789 - INFO - step 14447, loss: 0.875638, best loss: 0.587561 2025-01-16 01:54:44,939 - INFO - step 14448, loss: 0.812180, best loss: 0.587561 2025-01-16 01:54:45,089 - INFO - step 14449, loss: 
0.845898, best loss: 0.587561 2025-01-16 01:54:45,239 - INFO - step 14450, loss: 0.806169, best loss: 0.587561 2025-01-16 01:54:45,389 - INFO - step 14451, loss: 0.801618, best loss: 0.587561 2025-01-16 01:54:45,539 - INFO - step 14452, loss: 0.732915, best loss: 0.587561 2025-01-16 01:54:45,689 - INFO - step 14453, loss: 0.659541, best loss: 0.587561 2025-01-16 01:54:45,840 - INFO - step 14454, loss: 0.681131, best loss: 0.587561 2025-01-16 01:54:45,990 - INFO - step 14455, loss: 0.675926, best loss: 0.587561 2025-01-16 01:54:46,140 - INFO - step 14456, loss: 0.643068, best loss: 0.587561 2025-01-16 01:54:46,290 - INFO - step 14457, loss: 0.632638, best loss: 0.587561 2025-01-16 01:54:46,440 - INFO - step 14458, loss: 0.620374, best loss: 0.587561 2025-01-16 01:54:46,590 - INFO - step 14459, loss: 0.791547, best loss: 0.587561 2025-01-16 01:54:46,740 - INFO - step 14460, loss: 0.733226, best loss: 0.587561 2025-01-16 01:54:46,890 - INFO - step 14461, loss: 0.704981, best loss: 0.587561 2025-01-16 01:54:47,040 - INFO - step 14462, loss: 0.672593, best loss: 0.587561 2025-01-16 01:54:47,190 - INFO - step 14463, loss: 0.723651, best loss: 0.587561 2025-01-16 01:54:47,340 - INFO - step 14464, loss: 0.688946, best loss: 0.587561 2025-01-16 01:54:47,490 - INFO - step 14465, loss: 0.778508, best loss: 0.587561 2025-01-16 01:54:47,641 - INFO - step 14466, loss: 0.732754, best loss: 0.587561 2025-01-16 01:54:47,791 - INFO - step 14467, loss: 0.702827, best loss: 0.587561 2025-01-16 01:54:47,941 - INFO - step 14468, loss: 0.792762, best loss: 0.587561 2025-01-16 01:54:48,091 - INFO - step 14469, loss: 0.773210, best loss: 0.587561 2025-01-16 01:54:48,241 - INFO - step 14470, loss: 0.749725, best loss: 0.587561 2025-01-16 01:54:48,391 - INFO - step 14471, loss: 0.704882, best loss: 0.587561 2025-01-16 01:54:48,541 - INFO - step 14472, loss: 0.836341, best loss: 0.587561 2025-01-16 01:54:48,691 - INFO - step 14473, loss: 0.693104, best loss: 0.587561 2025-01-16 01:54:48,841 - 
INFO - step 14474, loss: 0.857499, best loss: 0.587561 2025-01-16 01:54:48,992 - INFO - step 14475, loss: 0.725075, best loss: 0.587561 2025-01-16 01:54:49,142 - INFO - step 14476, loss: 0.767157, best loss: 0.587561 2025-01-16 01:54:49,292 - INFO - step 14477, loss: 0.713150, best loss: 0.587561 2025-01-16 01:54:49,443 - INFO - step 14478, loss: 0.715648, best loss: 0.587561 2025-01-16 01:54:49,593 - INFO - step 14479, loss: 0.751484, best loss: 0.587561 2025-01-16 01:54:49,743 - INFO - step 14480, loss: 0.737410, best loss: 0.587561 2025-01-16 01:54:49,893 - INFO - step 14481, loss: 0.742221, best loss: 0.587561 2025-01-16 01:54:50,043 - INFO - step 14482, loss: 0.763930, best loss: 0.587561 2025-01-16 01:54:50,194 - INFO - step 14483, loss: 0.753191, best loss: 0.587561 2025-01-16 01:54:50,344 - INFO - step 14484, loss: 0.818576, best loss: 0.587561 2025-01-16 01:54:50,494 - INFO - step 14485, loss: 0.777198, best loss: 0.587561 2025-01-16 01:54:50,644 - INFO - step 14486, loss: 0.766464, best loss: 0.587561 2025-01-16 01:54:50,794 - INFO - step 14487, loss: 0.628476, best loss: 0.587561 2025-01-16 01:54:50,944 - INFO - step 14488, loss: 0.803225, best loss: 0.587561 2025-01-16 01:54:51,095 - INFO - step 14489, loss: 0.790774, best loss: 0.587561 2025-01-16 01:54:51,245 - INFO - step 14490, loss: 0.701193, best loss: 0.587561 2025-01-16 01:54:51,394 - INFO - step 14491, loss: 0.781059, best loss: 0.587561 2025-01-16 01:54:51,544 - INFO - step 14492, loss: 0.687408, best loss: 0.587561 2025-01-16 01:54:51,695 - INFO - step 14493, loss: 0.777539, best loss: 0.587561 2025-01-16 01:54:51,844 - INFO - step 14494, loss: 0.766782, best loss: 0.587561 2025-01-16 01:54:51,995 - INFO - step 14495, loss: 0.635164, best loss: 0.587561 2025-01-16 01:54:52,145 - INFO - step 14496, loss: 0.672975, best loss: 0.587561 2025-01-16 01:54:52,295 - INFO - step 14497, loss: 0.737842, best loss: 0.587561 2025-01-16 01:54:52,446 - INFO - step 14498, loss: 0.709821, best loss: 0.587561 
2025-01-16 01:54:52,596 - INFO - step 14499, loss: 0.775181, best loss: 0.587561 2025-01-16 01:54:52,747 - INFO - step 14500, loss: 0.666937, best loss: 0.587561 2025-01-16 01:54:52,897 - INFO - step 14501, loss: 0.770122, best loss: 0.587561 2025-01-16 01:54:53,047 - INFO - step 14502, loss: 0.877702, best loss: 0.587561 2025-01-16 01:54:53,198 - INFO - step 14503, loss: 0.737927, best loss: 0.587561 2025-01-16 01:54:53,348 - INFO - step 14504, loss: 0.706814, best loss: 0.587561 2025-01-16 01:54:56,612 - INFO - step 14505, loss: 0.569271, best loss: 0.569271 2025-01-16 01:54:56,762 - INFO - step 14506, loss: 0.585021, best loss: 0.569271 2025-01-16 01:55:00,371 - INFO - step 14507, loss: 0.550108, best loss: 0.550108 2025-01-16 01:55:00,522 - INFO - step 14508, loss: 0.739633, best loss: 0.550108 2025-01-16 01:55:00,672 - INFO - step 14509, loss: 0.798003, best loss: 0.550108 2025-01-16 01:55:00,822 - INFO - step 14510, loss: 0.839772, best loss: 0.550108 2025-01-16 01:55:00,972 - INFO - step 14511, loss: 0.794778, best loss: 0.550108 2025-01-16 01:55:01,122 - INFO - step 14512, loss: 0.717300, best loss: 0.550108 2025-01-16 01:55:01,272 - INFO - step 14513, loss: 0.668018, best loss: 0.550108 2025-01-16 01:55:01,422 - INFO - step 14514, loss: 0.658064, best loss: 0.550108 2025-01-16 01:55:01,572 - INFO - step 14515, loss: 0.766158, best loss: 0.550108 2025-01-16 01:55:01,723 - INFO - step 14516, loss: 0.844734, best loss: 0.550108 2025-01-16 01:55:01,873 - INFO - step 14517, loss: 0.595118, best loss: 0.550108 2025-01-16 01:55:02,023 - INFO - step 14518, loss: 0.698674, best loss: 0.550108 2025-01-16 01:55:02,173 - INFO - step 14519, loss: 0.643668, best loss: 0.550108 2025-01-16 01:55:02,323 - INFO - step 14520, loss: 0.736854, best loss: 0.550108 2025-01-16 01:55:02,474 - INFO - step 14521, loss: 0.760808, best loss: 0.550108 2025-01-16 01:55:02,624 - INFO - step 14522, loss: 0.713409, best loss: 0.550108 2025-01-16 01:55:02,774 - INFO - step 14523, loss: 
0.768975, best loss: 0.550108 2025-01-16 01:55:02,924 - INFO - step 14524, loss: 0.778584, best loss: 0.550108 2025-01-16 01:55:03,075 - INFO - step 14525, loss: 0.671891, best loss: 0.550108 2025-01-16 01:55:03,225 - INFO - step 14526, loss: 0.729871, best loss: 0.550108 2025-01-16 01:55:03,375 - INFO - step 14527, loss: 0.725506, best loss: 0.550108 2025-01-16 01:55:03,525 - INFO - step 14528, loss: 0.784707, best loss: 0.550108 2025-01-16 01:55:03,675 - INFO - step 14529, loss: 0.828208, best loss: 0.550108 2025-01-16 01:55:03,826 - INFO - step 14530, loss: 0.680176, best loss: 0.550108 2025-01-16 01:55:03,976 - INFO - step 14531, loss: 0.650098, best loss: 0.550108 2025-01-16 01:55:04,126 - INFO - step 14532, loss: 0.622033, best loss: 0.550108 2025-01-16 01:55:04,276 - INFO - step 14533, loss: 0.774927, best loss: 0.550108 2025-01-16 01:55:04,427 - INFO - step 14534, loss: 0.862518, best loss: 0.550108 2025-01-16 01:55:04,578 - INFO - step 14535, loss: 0.703998, best loss: 0.550108 2025-01-16 01:55:04,728 - INFO - step 14536, loss: 0.845854, best loss: 0.550108 2025-01-16 01:55:04,878 - INFO - step 14537, loss: 0.739336, best loss: 0.550108 2025-01-16 01:55:05,028 - INFO - step 14538, loss: 0.790190, best loss: 0.550108 2025-01-16 01:55:05,178 - INFO - step 14539, loss: 0.736979, best loss: 0.550108 2025-01-16 01:55:05,328 - INFO - step 14540, loss: 0.679001, best loss: 0.550108 2025-01-16 01:55:05,478 - INFO - step 14541, loss: 0.759278, best loss: 0.550108 2025-01-16 01:55:05,628 - INFO - step 14542, loss: 0.710408, best loss: 0.550108 2025-01-16 01:55:05,778 - INFO - step 14543, loss: 0.714729, best loss: 0.550108 2025-01-16 01:55:05,929 - INFO - step 14544, loss: 0.740842, best loss: 0.550108 2025-01-16 01:55:06,079 - INFO - step 14545, loss: 0.935147, best loss: 0.550108 2025-01-16 01:55:06,229 - INFO - step 14546, loss: 0.783496, best loss: 0.550108 2025-01-16 01:55:06,379 - INFO - step 14547, loss: 0.723883, best loss: 0.550108 2025-01-16 01:55:06,530 - 
INFO - step 14548, loss: 0.811177, best loss: 0.550108 2025-01-16 01:55:06,680 - INFO - step 14549, loss: 0.797013, best loss: 0.550108 2025-01-16 01:55:06,830 - INFO - step 14550, loss: 0.678265, best loss: 0.550108 2025-01-16 01:55:06,980 - INFO - step 14551, loss: 0.726604, best loss: 0.550108 2025-01-16 01:55:07,131 - INFO - step 14552, loss: 0.727662, best loss: 0.550108 2025-01-16 01:55:07,281 - INFO - step 14553, loss: 0.805856, best loss: 0.550108 2025-01-16 01:55:07,431 - INFO - step 14554, loss: 0.676587, best loss: 0.550108 2025-01-16 01:55:07,581 - INFO - step 14555, loss: 0.633810, best loss: 0.550108 2025-01-16 01:55:07,731 - INFO - step 14556, loss: 0.814312, best loss: 0.550108 2025-01-16 01:55:07,881 - INFO - step 14557, loss: 0.799361, best loss: 0.550108 2025-01-16 01:55:08,031 - INFO - step 14558, loss: 0.784814, best loss: 0.550108 2025-01-16 01:55:08,181 - INFO - step 14559, loss: 0.695230, best loss: 0.550108 2025-01-16 01:55:08,332 - INFO - step 14560, loss: 0.792822, best loss: 0.550108 2025-01-16 01:55:08,482 - INFO - step 14561, loss: 0.798119, best loss: 0.550108 2025-01-16 01:55:08,632 - INFO - step 14562, loss: 0.839610, best loss: 0.550108 2025-01-16 01:55:08,782 - INFO - step 14563, loss: 0.655476, best loss: 0.550108 2025-01-16 01:55:08,932 - INFO - step 14564, loss: 0.672604, best loss: 0.550108 2025-01-16 01:55:09,083 - INFO - step 14565, loss: 0.635601, best loss: 0.550108 2025-01-16 01:55:09,233 - INFO - step 14566, loss: 0.818649, best loss: 0.550108 2025-01-16 01:55:09,383 - INFO - step 14567, loss: 0.796727, best loss: 0.550108 2025-01-16 01:55:09,533 - INFO - step 14568, loss: 0.692493, best loss: 0.550108 2025-01-16 01:55:09,683 - INFO - step 14569, loss: 0.695470, best loss: 0.550108 2025-01-16 01:55:09,834 - INFO - step 14570, loss: 0.675097, best loss: 0.550108 2025-01-16 01:55:09,984 - INFO - step 14571, loss: 0.675332, best loss: 0.550108 2025-01-16 01:55:10,134 - INFO - step 14572, loss: 0.784262, best loss: 0.550108 
2025-01-16 01:55:10,284 - INFO - step 14573, loss: 0.713349, best loss: 0.550108 2025-01-16 01:55:10,434 - INFO - step 14574, loss: 0.740000, best loss: 0.550108 2025-01-16 01:55:10,585 - INFO - step 14575, loss: 0.700798, best loss: 0.550108 2025-01-16 01:55:10,735 - INFO - step 14576, loss: 0.753679, best loss: 0.550108 2025-01-16 01:55:10,885 - INFO - step 14577, loss: 0.673367, best loss: 0.550108 2025-01-16 01:55:11,035 - INFO - step 14578, loss: 0.665031, best loss: 0.550108 2025-01-16 01:55:11,185 - INFO - step 14579, loss: 0.826526, best loss: 0.550108 2025-01-16 01:55:11,335 - INFO - step 14580, loss: 0.761088, best loss: 0.550108 2025-01-16 01:55:11,485 - INFO - step 14581, loss: 0.896470, best loss: 0.550108 2025-01-16 01:55:11,635 - INFO - step 14582, loss: 0.899311, best loss: 0.550108 2025-01-16 01:55:11,785 - INFO - step 14583, loss: 0.805748, best loss: 0.550108 2025-01-16 01:55:11,935 - INFO - step 14584, loss: 0.798285, best loss: 0.550108 2025-01-16 01:55:12,085 - INFO - step 14585, loss: 0.681676, best loss: 0.550108 2025-01-16 01:55:12,236 - INFO - step 14586, loss: 0.777860, best loss: 0.550108 2025-01-16 01:55:12,386 - INFO - step 14587, loss: 0.790428, best loss: 0.550108 2025-01-16 01:55:12,536 - INFO - step 14588, loss: 0.627068, best loss: 0.550108 2025-01-16 01:55:12,686 - INFO - step 14589, loss: 0.782771, best loss: 0.550108 2025-01-16 01:55:12,836 - INFO - step 14590, loss: 0.788456, best loss: 0.550108 2025-01-16 01:55:12,986 - INFO - step 14591, loss: 0.728974, best loss: 0.550108 2025-01-16 01:55:13,137 - INFO - step 14592, loss: 0.789070, best loss: 0.550108 2025-01-16 01:55:13,287 - INFO - step 14593, loss: 0.780027, best loss: 0.550108 2025-01-16 01:55:13,437 - INFO - step 14594, loss: 0.815642, best loss: 0.550108 2025-01-16 01:55:13,587 - INFO - step 14595, loss: 0.696847, best loss: 0.550108 2025-01-16 01:55:13,737 - INFO - step 14596, loss: 0.732775, best loss: 0.550108 2025-01-16 01:55:13,887 - INFO - step 14597, loss: 
0.760472, best loss: 0.550108 2025-01-16 01:55:14,037 - INFO - step 14598, loss: 0.766889, best loss: 0.550108 2025-01-16 01:55:14,187 - INFO - step 14599, loss: 0.660156, best loss: 0.550108 2025-01-16 01:55:14,338 - INFO - step 14600, loss: 0.841184, best loss: 0.550108 2025-01-16 01:55:14,488 - INFO - step 14601, loss: 0.669079, best loss: 0.550108 2025-01-16 01:55:14,638 - INFO - step 14602, loss: 0.648903, best loss: 0.550108 2025-01-16 01:55:14,788 - INFO - step 14603, loss: 0.667583, best loss: 0.550108 2025-01-16 01:55:14,938 - INFO - step 14604, loss: 0.766153, best loss: 0.550108 2025-01-16 01:55:15,088 - INFO - step 14605, loss: 0.757782, best loss: 0.550108 2025-01-16 01:55:15,239 - INFO - step 14606, loss: 0.617162, best loss: 0.550108 2025-01-16 01:55:15,389 - INFO - step 14607, loss: 0.738283, best loss: 0.550108 2025-01-16 01:55:15,539 - INFO - step 14608, loss: 0.702301, best loss: 0.550108 2025-01-16 01:55:15,690 - INFO - step 14609, loss: 0.700791, best loss: 0.550108 2025-01-16 01:55:15,840 - INFO - step 14610, loss: 0.632339, best loss: 0.550108 2025-01-16 01:55:15,990 - INFO - step 14611, loss: 0.732038, best loss: 0.550108 2025-01-16 01:55:16,140 - INFO - step 14612, loss: 0.649740, best loss: 0.550108 2025-01-16 01:55:16,290 - INFO - step 14613, loss: 0.729840, best loss: 0.550108 2025-01-16 01:55:16,440 - INFO - step 14614, loss: 0.743234, best loss: 0.550108 2025-01-16 01:55:16,590 - INFO - step 14615, loss: 0.785367, best loss: 0.550108 2025-01-16 01:55:16,740 - INFO - step 14616, loss: 0.748808, best loss: 0.550108 2025-01-16 01:55:16,891 - INFO - step 14617, loss: 0.708650, best loss: 0.550108 2025-01-16 01:55:17,041 - INFO - step 14618, loss: 0.693701, best loss: 0.550108 2025-01-16 01:55:17,191 - INFO - step 14619, loss: 0.727849, best loss: 0.550108 2025-01-16 01:55:17,341 - INFO - step 14620, loss: 0.671134, best loss: 0.550108 2025-01-16 01:55:17,492 - INFO - step 14621, loss: 0.731048, best loss: 0.550108 2025-01-16 01:55:17,642 - 
INFO - step 14622, loss: 0.661202, best loss: 0.550108 2025-01-16 01:55:17,792 - INFO - step 14623, loss: 0.868739, best loss: 0.550108 2025-01-16 01:55:17,942 - INFO - step 14624, loss: 0.635284, best loss: 0.550108 2025-01-16 01:55:18,092 - INFO - step 14625, loss: 0.638856, best loss: 0.550108 2025-01-16 01:55:18,243 - INFO - step 14626, loss: 0.835439, best loss: 0.550108 2025-01-16 01:55:18,393 - INFO - step 14627, loss: 0.708816, best loss: 0.550108 2025-01-16 01:55:18,543 - INFO - step 14628, loss: 0.657676, best loss: 0.550108 2025-01-16 01:55:18,694 - INFO - step 14629, loss: 0.763534, best loss: 0.550108 2025-01-16 01:55:18,844 - INFO - step 14630, loss: 0.685804, best loss: 0.550108 2025-01-16 01:55:18,994 - INFO - step 14631, loss: 0.721433, best loss: 0.550108 2025-01-16 01:55:19,144 - INFO - step 14632, loss: 0.633539, best loss: 0.550108 2025-01-16 01:55:19,295 - INFO - step 14633, loss: 0.637245, best loss: 0.550108 2025-01-16 01:55:19,445 - INFO - step 14634, loss: 0.786657, best loss: 0.550108 2025-01-16 01:55:19,595 - INFO - step 14635, loss: 0.695720, best loss: 0.550108 2025-01-16 01:55:19,745 - INFO - step 14636, loss: 0.724727, best loss: 0.550108 2025-01-16 01:55:19,895 - INFO - step 14637, loss: 0.795193, best loss: 0.550108 2025-01-16 01:55:20,045 - INFO - step 14638, loss: 0.660617, best loss: 0.550108 2025-01-16 01:55:20,195 - INFO - step 14639, loss: 0.754271, best loss: 0.550108 2025-01-16 01:55:20,345 - INFO - step 14640, loss: 0.782411, best loss: 0.550108 2025-01-16 01:55:20,496 - INFO - step 14641, loss: 0.661224, best loss: 0.550108 2025-01-16 01:55:20,646 - INFO - step 14642, loss: 0.782474, best loss: 0.550108 2025-01-16 01:55:20,796 - INFO - step 14643, loss: 0.640536, best loss: 0.550108 2025-01-16 01:55:20,946 - INFO - step 14644, loss: 0.697817, best loss: 0.550108 2025-01-16 01:55:21,097 - INFO - step 14645, loss: 0.710605, best loss: 0.550108 2025-01-16 01:55:21,247 - INFO - step 14646, loss: 0.737783, best loss: 0.550108 
2025-01-16 01:55:21,397 - INFO - step 14647, loss: 0.802194, best loss: 0.550108 2025-01-16 01:55:21,547 - INFO - step 14648, loss: 0.613425, best loss: 0.550108 2025-01-16 01:55:21,697 - INFO - step 14649, loss: 0.675485, best loss: 0.550108 2025-01-16 01:55:21,848 - INFO - step 14650, loss: 0.664879, best loss: 0.550108 2025-01-16 01:55:21,998 - INFO - step 14651, loss: 0.737962, best loss: 0.550108 2025-01-16 01:55:22,148 - INFO - step 14652, loss: 0.877878, best loss: 0.550108 2025-01-16 01:55:22,298 - INFO - step 14653, loss: 0.661071, best loss: 0.550108 2025-01-16 01:55:22,449 - INFO - step 14654, loss: 0.861525, best loss: 0.550108 2025-01-16 01:55:22,599 - INFO - step 14655, loss: 0.682651, best loss: 0.550108 2025-01-16 01:55:22,749 - INFO - step 14656, loss: 0.614544, best loss: 0.550108 2025-01-16 01:55:22,899 - INFO - step 14657, loss: 0.787587, best loss: 0.550108 2025-01-16 01:55:23,050 - INFO - step 14658, loss: 0.672773, best loss: 0.550108 2025-01-16 01:55:23,200 - INFO - step 14659, loss: 0.715369, best loss: 0.550108 2025-01-16 01:55:23,350 - INFO - step 14660, loss: 0.780496, best loss: 0.550108 2025-01-16 01:55:23,500 - INFO - step 14661, loss: 0.686378, best loss: 0.550108 2025-01-16 01:55:23,650 - INFO - step 14662, loss: 0.753825, best loss: 0.550108 2025-01-16 01:55:23,800 - INFO - step 14663, loss: 0.670294, best loss: 0.550108 2025-01-16 01:55:23,950 - INFO - step 14664, loss: 0.577765, best loss: 0.550108 2025-01-16 01:55:24,101 - INFO - step 14665, loss: 0.707034, best loss: 0.550108 2025-01-16 01:55:24,251 - INFO - step 14666, loss: 0.838004, best loss: 0.550108 2025-01-16 01:55:24,401 - INFO - step 14667, loss: 0.739433, best loss: 0.550108 2025-01-16 01:55:24,551 - INFO - step 14668, loss: 0.645309, best loss: 0.550108 2025-01-16 01:55:24,701 - INFO - step 14669, loss: 0.659745, best loss: 0.550108 2025-01-16 01:55:24,851 - INFO - step 14670, loss: 0.763289, best loss: 0.550108 2025-01-16 01:55:25,002 - INFO - step 14671, loss: 
0.689772, best loss: 0.550108 2025-01-16 01:55:25,152 - INFO - step 14672, loss: 0.694210, best loss: 0.550108 2025-01-16 01:55:25,302 - INFO - step 14673, loss: 0.753723, best loss: 0.550108 2025-01-16 01:55:25,452 - INFO - step 14674, loss: 0.760536, best loss: 0.550108 2025-01-16 01:55:25,602 - INFO - step 14675, loss: 0.724721, best loss: 0.550108 2025-01-16 01:55:25,753 - INFO - step 14676, loss: 0.741771, best loss: 0.550108 2025-01-16 01:55:25,903 - INFO - step 14677, loss: 0.674085, best loss: 0.550108 2025-01-16 01:55:26,053 - INFO - step 14678, loss: 0.704326, best loss: 0.550108 2025-01-16 01:55:26,203 - INFO - step 14679, loss: 0.717602, best loss: 0.550108 2025-01-16 01:55:26,353 - INFO - step 14680, loss: 0.663058, best loss: 0.550108 2025-01-16 01:55:26,503 - INFO - step 14681, loss: 0.676608, best loss: 0.550108 2025-01-16 01:55:26,653 - INFO - step 14682, loss: 0.692451, best loss: 0.550108 2025-01-16 01:55:26,803 - INFO - step 14683, loss: 0.607402, best loss: 0.550108 2025-01-16 01:55:26,953 - INFO - step 14684, loss: 0.713522, best loss: 0.550108 2025-01-16 01:55:27,103 - INFO - step 14685, loss: 0.693543, best loss: 0.550108 2025-01-16 01:55:27,253 - INFO - step 14686, loss: 0.804298, best loss: 0.550108 2025-01-16 01:55:27,403 - INFO - step 14687, loss: 0.703321, best loss: 0.550108 2025-01-16 01:55:27,554 - INFO - step 14688, loss: 0.690885, best loss: 0.550108 2025-01-16 01:55:27,704 - INFO - step 14689, loss: 0.720144, best loss: 0.550108 2025-01-16 01:55:27,854 - INFO - step 14690, loss: 0.624805, best loss: 0.550108 2025-01-16 01:55:28,004 - INFO - step 14691, loss: 0.640295, best loss: 0.550108 2025-01-16 01:55:28,154 - INFO - step 14692, loss: 0.718026, best loss: 0.550108 2025-01-16 01:55:28,304 - INFO - step 14693, loss: 0.716167, best loss: 0.550108 2025-01-16 01:55:28,454 - INFO - step 14694, loss: 0.711600, best loss: 0.550108 2025-01-16 01:55:28,604 - INFO - step 14695, loss: 0.658065, best loss: 0.550108 2025-01-16 01:55:28,754 - 
INFO - step 14696, loss: 0.784502, best loss: 0.550108 2025-01-16 01:55:28,904 - INFO - step 14697, loss: 0.727410, best loss: 0.550108 2025-01-16 01:55:29,055 - INFO - step 14698, loss: 0.762618, best loss: 0.550108 2025-01-16 01:55:29,205 - INFO - step 14699, loss: 0.743555, best loss: 0.550108 2025-01-16 01:55:29,355 - INFO - step 14700, loss: 0.879369, best loss: 0.550108 2025-01-16 01:55:29,505 - INFO - step 14701, loss: 0.659157, best loss: 0.550108 2025-01-16 01:55:29,655 - INFO - step 14702, loss: 0.728892, best loss: 0.550108 2025-01-16 01:55:29,805 - INFO - step 14703, loss: 0.833062, best loss: 0.550108 2025-01-16 01:55:29,955 - INFO - step 14704, loss: 0.712863, best loss: 0.550108 2025-01-16 01:55:30,105 - INFO - step 14705, loss: 0.673361, best loss: 0.550108 2025-01-16 01:55:30,255 - INFO - step 14706, loss: 0.754050, best loss: 0.550108 2025-01-16 01:55:30,405 - INFO - step 14707, loss: 0.775412, best loss: 0.550108 2025-01-16 01:55:30,555 - INFO - step 14708, loss: 0.557154, best loss: 0.550108 2025-01-16 01:55:30,705 - INFO - step 14709, loss: 0.757908, best loss: 0.550108 2025-01-16 01:55:30,855 - INFO - step 14710, loss: 0.734699, best loss: 0.550108 2025-01-16 01:55:31,005 - INFO - step 14711, loss: 0.696554, best loss: 0.550108 2025-01-16 01:55:31,155 - INFO - step 14712, loss: 0.784141, best loss: 0.550108 2025-01-16 01:55:31,305 - INFO - step 14713, loss: 0.681236, best loss: 0.550108 2025-01-16 01:55:31,455 - INFO - step 14714, loss: 0.687807, best loss: 0.550108 2025-01-16 01:55:31,605 - INFO - step 14715, loss: 0.767869, best loss: 0.550108 2025-01-16 01:55:31,755 - INFO - step 14716, loss: 0.769879, best loss: 0.550108 2025-01-16 01:55:31,905 - INFO - step 14717, loss: 0.696752, best loss: 0.550108 2025-01-16 01:55:32,055 - INFO - step 14718, loss: 0.659980, best loss: 0.550108 2025-01-16 01:55:32,206 - INFO - step 14719, loss: 0.640211, best loss: 0.550108 2025-01-16 01:55:32,356 - INFO - step 14720, loss: 0.700530, best loss: 0.550108 
2025-01-16 01:55:32,506 - INFO - step 14721, loss: 0.717860, best loss: 0.550108 2025-01-16 01:55:32,656 - INFO - step 14722, loss: 0.714244, best loss: 0.550108 2025-01-16 01:55:32,806 - INFO - step 14723, loss: 0.750540, best loss: 0.550108 2025-01-16 01:55:32,956 - INFO - step 14724, loss: 0.730602, best loss: 0.550108 2025-01-16 01:55:33,106 - INFO - step 14725, loss: 0.687089, best loss: 0.550108 2025-01-16 01:55:33,256 - INFO - step 14726, loss: 0.617424, best loss: 0.550108 2025-01-16 01:55:33,407 - INFO - step 14727, loss: 0.637949, best loss: 0.550108 2025-01-16 01:55:33,557 - INFO - step 14728, loss: 0.686825, best loss: 0.550108 2025-01-16 01:55:33,707 - INFO - step 14729, loss: 0.685600, best loss: 0.550108 2025-01-16 01:55:33,857 - INFO - step 14730, loss: 0.699268, best loss: 0.550108 2025-01-16 01:55:34,007 - INFO - step 14731, loss: 0.684576, best loss: 0.550108 2025-01-16 01:55:34,157 - INFO - step 14732, loss: 0.673199, best loss: 0.550108 2025-01-16 01:55:34,307 - INFO - step 14733, loss: 0.667536, best loss: 0.550108 2025-01-16 01:55:34,457 - INFO - step 14734, loss: 0.731646, best loss: 0.550108 2025-01-16 01:55:34,607 - INFO - step 14735, loss: 0.837803, best loss: 0.550108 2025-01-16 01:55:34,757 - INFO - step 14736, loss: 0.779825, best loss: 0.550108 2025-01-16 01:55:34,907 - INFO - step 14737, loss: 0.726840, best loss: 0.550108 2025-01-16 01:55:35,057 - INFO - step 14738, loss: 0.761397, best loss: 0.550108 2025-01-16 01:55:35,207 - INFO - step 14739, loss: 0.764726, best loss: 0.550108 2025-01-16 01:55:35,357 - INFO - step 14740, loss: 0.841610, best loss: 0.550108 2025-01-16 01:55:35,507 - INFO - step 14741, loss: 0.765464, best loss: 0.550108 2025-01-16 01:55:35,657 - INFO - step 14742, loss: 0.823230, best loss: 0.550108 2025-01-16 01:55:35,808 - INFO - step 14743, loss: 0.901215, best loss: 0.550108 2025-01-16 01:55:35,958 - INFO - step 14744, loss: 0.860477, best loss: 0.550108 2025-01-16 01:55:36,108 - INFO - step 14745, loss: 
0.760796, best loss: 0.550108 2025-01-16 01:55:36,258 - INFO - step 14746, loss: 0.798030, best loss: 0.550108 2025-01-16 01:55:36,408 - INFO - step 14747, loss: 0.692121, best loss: 0.550108 2025-01-16 01:55:36,558 - INFO - step 14748, loss: 0.654095, best loss: 0.550108 2025-01-16 01:55:36,708 - INFO - step 14749, loss: 0.812577, best loss: 0.550108 2025-01-16 01:55:36,858 - INFO - step 14750, loss: 0.726497, best loss: 0.550108 2025-01-16 01:55:37,008 - INFO - step 14751, loss: 0.585982, best loss: 0.550108 2025-01-16 01:55:37,158 - INFO - step 14752, loss: 0.634452, best loss: 0.550108 2025-01-16 01:55:37,309 - INFO - step 14753, loss: 0.734908, best loss: 0.550108 2025-01-16 01:55:37,459 - INFO - step 14754, loss: 0.828967, best loss: 0.550108 2025-01-16 01:55:37,609 - INFO - step 14755, loss: 0.728239, best loss: 0.550108 2025-01-16 01:55:37,760 - INFO - step 14756, loss: 0.741139, best loss: 0.550108 2025-01-16 01:55:37,910 - INFO - step 14757, loss: 0.641187, best loss: 0.550108 2025-01-16 01:55:38,060 - INFO - step 14758, loss: 0.611612, best loss: 0.550108 2025-01-16 01:55:38,210 - INFO - step 14759, loss: 0.711287, best loss: 0.550108 2025-01-16 01:55:38,360 - INFO - step 14760, loss: 0.801669, best loss: 0.550108 2025-01-16 01:55:38,510 - INFO - step 14761, loss: 0.824216, best loss: 0.550108 2025-01-16 01:55:38,660 - INFO - step 14762, loss: 0.798053, best loss: 0.550108 2025-01-16 01:55:38,810 - INFO - step 14763, loss: 0.683694, best loss: 0.550108 2025-01-16 01:55:38,960 - INFO - step 14764, loss: 0.793557, best loss: 0.550108 2025-01-16 01:55:39,110 - INFO - step 14765, loss: 0.784070, best loss: 0.550108 2025-01-16 01:55:39,260 - INFO - step 14766, loss: 0.671350, best loss: 0.550108 2025-01-16 01:55:39,410 - INFO - step 14767, loss: 0.713337, best loss: 0.550108 2025-01-16 01:55:39,561 - INFO - step 14768, loss: 0.693104, best loss: 0.550108 2025-01-16 01:55:39,711 - INFO - step 14769, loss: 0.590269, best loss: 0.550108 2025-01-16 01:55:39,861 - 
INFO - step 14770, loss: 0.637612, best loss: 0.550108 2025-01-16 01:55:40,011 - INFO - step 14771, loss: 0.684741, best loss: 0.550108 2025-01-16 01:55:40,161 - INFO - step 14772, loss: 0.667954, best loss: 0.550108 2025-01-16 01:55:40,311 - INFO - step 14773, loss: 0.605423, best loss: 0.550108 2025-01-16 01:55:40,461 - INFO - step 14774, loss: 0.561844, best loss: 0.550108 2025-01-16 01:55:43,728 - INFO - step 14775, loss: 0.530078, best loss: 0.530078 2025-01-16 01:55:43,891 - INFO - step 14776, loss: 0.675117, best loss: 0.530078 2025-01-16 01:55:44,042 - INFO - step 14777, loss: 0.764094, best loss: 0.530078 2025-01-16 01:55:44,192 - INFO - step 14778, loss: 0.631537, best loss: 0.530078 2025-01-16 01:55:44,343 - INFO - step 14779, loss: 0.673697, best loss: 0.530078 2025-01-16 01:55:44,493 - INFO - step 14780, loss: 0.718109, best loss: 0.530078 2025-01-16 01:55:44,643 - INFO - step 14781, loss: 0.618873, best loss: 0.530078 2025-01-16 01:55:44,793 - INFO - step 14782, loss: 0.693319, best loss: 0.530078 2025-01-16 01:55:44,943 - INFO - step 14783, loss: 0.616108, best loss: 0.530078 2025-01-16 01:55:45,093 - INFO - step 14784, loss: 0.574519, best loss: 0.530078 2025-01-16 01:55:45,243 - INFO - step 14785, loss: 0.596808, best loss: 0.530078 2025-01-16 01:55:45,394 - INFO - step 14786, loss: 0.582763, best loss: 0.530078 2025-01-16 01:55:45,544 - INFO - step 14787, loss: 0.554348, best loss: 0.530078 2025-01-16 01:55:45,694 - INFO - step 14788, loss: 0.563755, best loss: 0.530078 2025-01-16 01:55:45,844 - INFO - step 14789, loss: 0.657680, best loss: 0.530078 2025-01-16 01:55:45,994 - INFO - step 14790, loss: 0.646616, best loss: 0.530078 2025-01-16 01:55:46,144 - INFO - step 14791, loss: 0.662253, best loss: 0.530078 2025-01-16 01:55:46,294 - INFO - step 14792, loss: 0.599912, best loss: 0.530078 2025-01-16 01:55:46,444 - INFO - step 14793, loss: 0.580453, best loss: 0.530078 2025-01-16 01:55:46,594 - INFO - step 14794, loss: 0.604355, best loss: 0.530078 
2025-01-16 01:55:46,745 - INFO - step 14795, loss: 0.707722, best loss: 0.530078 2025-01-16 01:55:46,894 - INFO - step 14796, loss: 0.664310, best loss: 0.530078 2025-01-16 01:55:47,045 - INFO - step 14797, loss: 0.622515, best loss: 0.530078 2025-01-16 01:55:47,195 - INFO - step 14798, loss: 0.693840, best loss: 0.530078 2025-01-16 01:55:47,345 - INFO - step 14799, loss: 0.721379, best loss: 0.530078 2025-01-16 01:55:47,496 - INFO - step 14800, loss: 0.670223, best loss: 0.530078 2025-01-16 01:55:47,646 - INFO - step 14801, loss: 0.616230, best loss: 0.530078 2025-01-16 01:55:47,796 - INFO - step 14802, loss: 0.714808, best loss: 0.530078 2025-01-16 01:55:47,947 - INFO - step 14803, loss: 0.601682, best loss: 0.530078 2025-01-16 01:55:48,097 - INFO - step 14804, loss: 0.788817, best loss: 0.530078 2025-01-16 01:55:48,247 - INFO - step 14805, loss: 0.687972, best loss: 0.530078 2025-01-16 01:55:48,397 - INFO - step 14806, loss: 0.673024, best loss: 0.530078 2025-01-16 01:55:48,547 - INFO - step 14807, loss: 0.643311, best loss: 0.530078 2025-01-16 01:55:48,697 - INFO - step 14808, loss: 0.742642, best loss: 0.530078 2025-01-16 01:55:48,847 - INFO - step 14809, loss: 0.653132, best loss: 0.530078 2025-01-16 01:55:48,998 - INFO - step 14810, loss: 0.694752, best loss: 0.530078 2025-01-16 01:55:49,148 - INFO - step 14811, loss: 0.710758, best loss: 0.530078 2025-01-16 01:55:49,298 - INFO - step 14812, loss: 0.671215, best loss: 0.530078 2025-01-16 01:55:49,448 - INFO - step 14813, loss: 0.597907, best loss: 0.530078 2025-01-16 01:55:49,599 - INFO - step 14814, loss: 0.622631, best loss: 0.530078 2025-01-16 01:55:49,749 - INFO - step 14815, loss: 0.653269, best loss: 0.530078 2025-01-16 01:55:49,899 - INFO - step 14816, loss: 0.684240, best loss: 0.530078 2025-01-16 01:55:50,049 - INFO - step 14817, loss: 0.610662, best loss: 0.530078 2025-01-16 01:55:50,200 - INFO - step 14818, loss: 0.727335, best loss: 0.530078 2025-01-16 01:55:50,350 - INFO - step 14819, loss: 
0.652430, best loss: 0.530078 2025-01-16 01:55:50,500 - INFO - step 14820, loss: 0.700753, best loss: 0.530078 2025-01-16 01:55:50,650 - INFO - step 14821, loss: 0.652831, best loss: 0.530078 2025-01-16 01:55:50,801 - INFO - step 14822, loss: 0.629896, best loss: 0.530078 2025-01-16 01:55:50,951 - INFO - step 14823, loss: 0.735322, best loss: 0.530078 2025-01-16 01:55:51,101 - INFO - step 14824, loss: 0.667876, best loss: 0.530078 2025-01-16 01:55:51,251 - INFO - step 14825, loss: 0.613829, best loss: 0.530078 2025-01-16 01:55:51,401 - INFO - step 14826, loss: 0.582766, best loss: 0.530078 2025-01-16 01:55:51,551 - INFO - step 14827, loss: 0.706485, best loss: 0.530078 2025-01-16 01:55:51,701 - INFO - step 14828, loss: 0.635868, best loss: 0.530078 2025-01-16 01:55:51,851 - INFO - step 14829, loss: 0.676488, best loss: 0.530078 2025-01-16 01:55:52,001 - INFO - step 14830, loss: 0.654592, best loss: 0.530078 2025-01-16 01:55:52,151 - INFO - step 14831, loss: 0.646778, best loss: 0.530078 2025-01-16 01:55:52,302 - INFO - step 14832, loss: 0.762381, best loss: 0.530078 2025-01-16 01:55:52,452 - INFO - step 14833, loss: 0.625431, best loss: 0.530078 2025-01-16 01:55:52,602 - INFO - step 14834, loss: 0.656631, best loss: 0.530078 2025-01-16 01:55:52,752 - INFO - step 14835, loss: 0.556329, best loss: 0.530078 2025-01-16 01:55:56,306 - INFO - step 14836, loss: 0.518917, best loss: 0.518917 2025-01-16 01:55:56,456 - INFO - step 14837, loss: 0.572103, best loss: 0.518917 2025-01-16 01:55:56,607 - INFO - step 14838, loss: 0.672578, best loss: 0.518917 2025-01-16 01:55:56,757 - INFO - step 14839, loss: 0.705707, best loss: 0.518917 2025-01-16 01:55:56,907 - INFO - step 14840, loss: 0.712998, best loss: 0.518917 2025-01-16 01:55:57,057 - INFO - step 14841, loss: 0.650371, best loss: 0.518917 2025-01-16 01:55:57,207 - INFO - step 14842, loss: 0.605648, best loss: 0.518917 2025-01-16 01:55:57,357 - INFO - step 14843, loss: 0.522228, best loss: 0.518917 2025-01-16 01:55:57,507 - 
INFO - step 14844, loss: 0.623000, best loss: 0.518917 2025-01-16 01:55:57,658 - INFO - step 14845, loss: 0.716742, best loss: 0.518917 2025-01-16 01:55:57,809 - INFO - step 14846, loss: 0.754340, best loss: 0.518917 2025-01-16 01:56:01,369 - INFO - step 14847, loss: 0.486541, best loss: 0.486541 2025-01-16 01:56:01,520 - INFO - step 14848, loss: 0.609240, best loss: 0.486541 2025-01-16 01:56:01,670 - INFO - step 14849, loss: 0.586643, best loss: 0.486541 2025-01-16 01:56:01,820 - INFO - step 14850, loss: 0.655707, best loss: 0.486541 2025-01-16 01:56:01,970 - INFO - step 14851, loss: 0.678837, best loss: 0.486541 2025-01-16 01:56:02,120 - INFO - step 14852, loss: 0.652080, best loss: 0.486541 2025-01-16 01:56:02,270 - INFO - step 14853, loss: 0.633111, best loss: 0.486541 2025-01-16 01:56:02,420 - INFO - step 14854, loss: 0.660707, best loss: 0.486541 2025-01-16 01:56:02,570 - INFO - step 14855, loss: 0.660792, best loss: 0.486541 2025-01-16 01:56:02,720 - INFO - step 14856, loss: 0.682961, best loss: 0.486541 2025-01-16 01:56:02,870 - INFO - step 14857, loss: 0.686724, best loss: 0.486541 2025-01-16 01:56:03,020 - INFO - step 14858, loss: 0.692692, best loss: 0.486541 2025-01-16 01:56:03,170 - INFO - step 14859, loss: 0.712085, best loss: 0.486541 2025-01-16 01:56:03,320 - INFO - step 14860, loss: 0.636296, best loss: 0.486541 2025-01-16 01:56:03,471 - INFO - step 14861, loss: 0.610892, best loss: 0.486541 2025-01-16 01:56:03,621 - INFO - step 14862, loss: 0.550608, best loss: 0.486541 2025-01-16 01:56:03,771 - INFO - step 14863, loss: 0.609710, best loss: 0.486541 2025-01-16 01:56:03,921 - INFO - step 14864, loss: 0.697507, best loss: 0.486541 2025-01-16 01:56:04,071 - INFO - step 14865, loss: 0.683406, best loss: 0.486541 2025-01-16 01:56:04,221 - INFO - step 14866, loss: 0.734251, best loss: 0.486541 2025-01-16 01:56:04,371 - INFO - step 14867, loss: 0.676814, best loss: 0.486541 2025-01-16 01:56:04,522 - INFO - step 14868, loss: 0.733960, best loss: 0.486541 
2025-01-16 01:56:04,672 - INFO - step 14869, loss: 0.645786, best loss: 0.486541 2025-01-16 01:56:04,822 - INFO - step 14870, loss: 0.650206, best loss: 0.486541 2025-01-16 01:56:04,972 - INFO - step 14871, loss: 0.734871, best loss: 0.486541 2025-01-16 01:56:05,122 - INFO - step 14872, loss: 0.599411, best loss: 0.486541 2025-01-16 01:56:05,272 - INFO - step 14873, loss: 0.638318, best loss: 0.486541 2025-01-16 01:56:05,422 - INFO - step 14874, loss: 0.628161, best loss: 0.486541 2025-01-16 01:56:05,572 - INFO - step 14875, loss: 0.750490, best loss: 0.486541 2025-01-16 01:56:05,723 - INFO - step 14876, loss: 0.643549, best loss: 0.486541 2025-01-16 01:56:05,873 - INFO - step 14877, loss: 0.532965, best loss: 0.486541 2025-01-16 01:56:06,023 - INFO - step 14878, loss: 0.705999, best loss: 0.486541 2025-01-16 01:56:06,173 - INFO - step 14879, loss: 0.731725, best loss: 0.486541 2025-01-16 01:56:06,324 - INFO - step 14880, loss: 0.631882, best loss: 0.486541 2025-01-16 01:56:06,474 - INFO - step 14881, loss: 0.647933, best loss: 0.486541 2025-01-16 01:56:06,624 - INFO - step 14882, loss: 0.648888, best loss: 0.486541 2025-01-16 01:56:06,774 - INFO - step 14883, loss: 0.772219, best loss: 0.486541 2025-01-16 01:56:06,924 - INFO - step 14884, loss: 0.618312, best loss: 0.486541 2025-01-16 01:56:07,074 - INFO - step 14885, loss: 0.513169, best loss: 0.486541 2025-01-16 01:56:07,224 - INFO - step 14886, loss: 0.638593, best loss: 0.486541 2025-01-16 01:56:07,375 - INFO - step 14887, loss: 0.711092, best loss: 0.486541 2025-01-16 01:56:07,525 - INFO - step 14888, loss: 0.698992, best loss: 0.486541 2025-01-16 01:56:07,675 - INFO - step 14889, loss: 0.616534, best loss: 0.486541 2025-01-16 01:56:07,826 - INFO - step 14890, loss: 0.699359, best loss: 0.486541 2025-01-16 01:56:07,976 - INFO - step 14891, loss: 0.660770, best loss: 0.486541 2025-01-16 01:56:08,126 - INFO - step 14892, loss: 0.782426, best loss: 0.486541 2025-01-16 01:56:08,276 - INFO - step 14893, loss: 
0.613303, best loss: 0.486541 2025-01-16 01:56:08,426 - INFO - step 14894, loss: 0.608985, best loss: 0.486541 2025-01-16 01:56:08,576 - INFO - step 14895, loss: 0.532909, best loss: 0.486541 2025-01-16 01:56:08,727 - INFO - step 14896, loss: 0.718546, best loss: 0.486541 2025-01-16 01:56:08,877 - INFO - step 14897, loss: 0.670494, best loss: 0.486541 2025-01-16 01:56:09,027 - INFO - step 14898, loss: 0.669344, best loss: 0.486541 2025-01-16 01:56:09,177 - INFO - step 14899, loss: 0.599842, best loss: 0.486541 2025-01-16 01:56:09,327 - INFO - step 14900, loss: 0.578064, best loss: 0.486541 2025-01-16 01:56:09,477 - INFO - step 14901, loss: 0.675599, best loss: 0.486541 2025-01-16 01:56:09,627 - INFO - step 14902, loss: 0.625496, best loss: 0.486541 2025-01-16 01:56:09,777 - INFO - step 14903, loss: 0.680525, best loss: 0.486541 2025-01-16 01:56:09,927 - INFO - step 14904, loss: 0.639352, best loss: 0.486541 2025-01-16 01:56:10,077 - INFO - step 14905, loss: 0.603136, best loss: 0.486541 2025-01-16 01:56:10,227 - INFO - step 14906, loss: 0.652751, best loss: 0.486541 2025-01-16 01:56:10,378 - INFO - step 14907, loss: 0.585432, best loss: 0.486541 2025-01-16 01:56:10,528 - INFO - step 14908, loss: 0.599454, best loss: 0.486541 2025-01-16 01:56:10,678 - INFO - step 14909, loss: 0.724712, best loss: 0.486541 2025-01-16 01:56:10,828 - INFO - step 14910, loss: 0.686057, best loss: 0.486541 2025-01-16 01:56:10,978 - INFO - step 14911, loss: 0.749588, best loss: 0.486541 2025-01-16 01:56:11,128 - INFO - step 14912, loss: 0.762153, best loss: 0.486541 2025-01-16 01:56:11,278 - INFO - step 14913, loss: 0.761095, best loss: 0.486541 2025-01-16 01:56:11,428 - INFO - step 14914, loss: 0.692337, best loss: 0.486541 2025-01-16 01:56:11,578 - INFO - step 14915, loss: 0.602293, best loss: 0.486541 2025-01-16 01:56:11,728 - INFO - step 14916, loss: 0.709794, best loss: 0.486541 2025-01-16 01:56:11,878 - INFO - step 14917, loss: 0.726932, best loss: 0.486541 2025-01-16 01:56:12,029 - 
INFO - step 14918, loss: 0.656868, best loss: 0.486541 2025-01-16 01:56:12,179 - INFO - step 14919, loss: 0.727554, best loss: 0.486541 2025-01-16 01:56:12,329 - INFO - step 14920, loss: 0.677016, best loss: 0.486541 2025-01-16 01:56:12,479 - INFO - step 14921, loss: 0.634660, best loss: 0.486541 2025-01-16 01:56:12,630 - INFO - step 14922, loss: 0.665018, best loss: 0.486541 2025-01-16 01:56:12,780 - INFO - step 14923, loss: 0.703930, best loss: 0.486541 2025-01-16 01:56:12,930 - INFO - step 14924, loss: 0.717713, best loss: 0.486541 2025-01-16 01:56:13,080 - INFO - step 14925, loss: 0.635294, best loss: 0.486541 2025-01-16 01:56:13,230 - INFO - step 14926, loss: 0.610464, best loss: 0.486541 2025-01-16 01:56:13,380 - INFO - step 14927, loss: 0.682352, best loss: 0.486541 2025-01-16 01:56:13,530 - INFO - step 14928, loss: 0.689334, best loss: 0.486541 2025-01-16 01:56:13,680 - INFO - step 14929, loss: 0.573059, best loss: 0.486541 2025-01-16 01:56:13,830 - INFO - step 14930, loss: 0.621032, best loss: 0.486541 2025-01-16 01:56:13,980 - INFO - step 14931, loss: 0.660602, best loss: 0.486541 2025-01-16 01:56:14,130 - INFO - step 14932, loss: 0.556197, best loss: 0.486541 2025-01-16 01:56:14,280 - INFO - step 14933, loss: 0.576593, best loss: 0.486541 2025-01-16 01:56:14,430 - INFO - step 14934, loss: 0.620368, best loss: 0.486541 2025-01-16 01:56:14,581 - INFO - step 14935, loss: 0.648816, best loss: 0.486541 2025-01-16 01:56:14,731 - INFO - step 14936, loss: 0.518154, best loss: 0.486541 2025-01-16 01:56:14,881 - INFO - step 14937, loss: 0.685504, best loss: 0.486541 2025-01-16 01:56:15,031 - INFO - step 14938, loss: 0.676520, best loss: 0.486541 2025-01-16 01:56:15,181 - INFO - step 14939, loss: 0.627154, best loss: 0.486541 2025-01-16 01:56:15,331 - INFO - step 14940, loss: 0.574942, best loss: 0.486541 2025-01-16 01:56:15,482 - INFO - step 14941, loss: 0.736747, best loss: 0.486541 2025-01-16 01:56:15,632 - INFO - step 14942, loss: 0.624515, best loss: 0.486541 
2025-01-16 01:56:15,782 - INFO - step 14943, loss: 0.620299, best loss: 0.486541 2025-01-16 01:56:15,932 - INFO - step 14944, loss: 0.740224, best loss: 0.486541 2025-01-16 01:56:16,082 - INFO - step 14945, loss: 0.756096, best loss: 0.486541 2025-01-16 01:56:16,232 - INFO - step 14946, loss: 0.676037, best loss: 0.486541 2025-01-16 01:56:16,382 - INFO - step 14947, loss: 0.624251, best loss: 0.486541 2025-01-16 01:56:16,532 - INFO - step 14948, loss: 0.576330, best loss: 0.486541 2025-01-16 01:56:16,682 - INFO - step 14949, loss: 0.689983, best loss: 0.486541 2025-01-16 01:56:16,832 - INFO - step 14950, loss: 0.649949, best loss: 0.486541 2025-01-16 01:56:16,982 - INFO - step 14951, loss: 0.586006, best loss: 0.486541 2025-01-16 01:56:17,132 - INFO - step 14952, loss: 0.597815, best loss: 0.486541 2025-01-16 01:56:17,282 - INFO - step 14953, loss: 0.790303, best loss: 0.486541 2025-01-16 01:56:17,433 - INFO - step 14954, loss: 0.601830, best loss: 0.486541 2025-01-16 01:56:17,583 - INFO - step 14955, loss: 0.554927, best loss: 0.486541 2025-01-16 01:56:17,733 - INFO - step 14956, loss: 0.692027, best loss: 0.486541 2025-01-16 01:56:17,883 - INFO - step 14957, loss: 0.636469, best loss: 0.486541 2025-01-16 01:56:18,033 - INFO - step 14958, loss: 0.596021, best loss: 0.486541 2025-01-16 01:56:18,183 - INFO - step 14959, loss: 0.702896, best loss: 0.486541 2025-01-16 01:56:18,333 - INFO - step 14960, loss: 0.620846, best loss: 0.486541 2025-01-16 01:56:18,483 - INFO - step 14961, loss: 0.614743, best loss: 0.486541 2025-01-16 01:56:18,634 - INFO - step 14962, loss: 0.568412, best loss: 0.486541 2025-01-16 01:56:18,784 - INFO - step 14963, loss: 0.622729, best loss: 0.486541 2025-01-16 01:56:18,934 - INFO - step 14964, loss: 0.724354, best loss: 0.486541 2025-01-16 01:56:19,084 - INFO - step 14965, loss: 0.692241, best loss: 0.486541 2025-01-16 01:56:19,234 - INFO - step 14966, loss: 0.603294, best loss: 0.486541 2025-01-16 01:56:19,385 - INFO - step 14967, loss: 
0.743307, best loss: 0.486541 2025-01-16 01:56:19,535 - INFO - step 14968, loss: 0.593007, best loss: 0.486541 2025-01-16 01:56:19,685 - INFO - step 14969, loss: 0.656493, best loss: 0.486541 2025-01-16 01:56:19,835 - INFO - step 14970, loss: 0.664871, best loss: 0.486541 2025-01-16 01:56:19,985 - INFO - step 14971, loss: 0.631945, best loss: 0.486541 2025-01-16 01:56:20,135 - INFO - step 14972, loss: 0.697493, best loss: 0.486541 2025-01-16 01:56:20,285 - INFO - step 14973, loss: 0.593823, best loss: 0.486541 2025-01-16 01:56:20,436 - INFO - step 14974, loss: 0.621306, best loss: 0.486541 2025-01-16 01:56:20,585 - INFO - step 14975, loss: 0.515426, best loss: 0.486541 2025-01-16 01:56:20,736 - INFO - step 14976, loss: 0.717045, best loss: 0.486541 2025-01-16 01:56:20,886 - INFO - step 14977, loss: 0.710274, best loss: 0.486541 2025-01-16 01:56:21,036 - INFO - step 14978, loss: 0.572124, best loss: 0.486541 2025-01-16 01:56:21,186 - INFO - step 14979, loss: 0.652009, best loss: 0.486541 2025-01-16 01:56:21,336 - INFO - step 14980, loss: 0.612447, best loss: 0.486541 2025-01-16 01:56:21,486 - INFO - step 14981, loss: 0.669196, best loss: 0.486541 2025-01-16 01:56:21,635 - INFO - step 14982, loss: 0.752420, best loss: 0.486541 2025-01-16 01:56:21,786 - INFO - step 14983, loss: 0.573307, best loss: 0.486541 2025-01-16 01:56:21,936 - INFO - step 14984, loss: 0.727158, best loss: 0.486541 2025-01-16 01:56:22,086 - INFO - step 14985, loss: 0.613987, best loss: 0.486541 2025-01-16 01:56:22,236 - INFO - step 14986, loss: 0.549983, best loss: 0.486541 2025-01-16 01:56:22,386 - INFO - step 14987, loss: 0.722534, best loss: 0.486541 2025-01-16 01:56:22,536 - INFO - step 14988, loss: 0.657342, best loss: 0.486541 2025-01-16 01:56:22,686 - INFO - step 14989, loss: 0.635153, best loss: 0.486541 2025-01-16 01:56:22,836 - INFO - step 14990, loss: 0.687784, best loss: 0.486541 2025-01-16 01:56:22,986 - INFO - step 14991, loss: 0.618325, best loss: 0.486541 2025-01-16 01:56:23,137 - 
INFO - step 14992, loss: 0.728076, best loss: 0.486541 2025-01-16 01:56:23,287 - INFO - step 14993, loss: 0.568279, best loss: 0.486541 2025-01-16 01:56:23,437 - INFO - step 14994, loss: 0.528055, best loss: 0.486541 2025-01-16 01:56:23,587 - INFO - step 14995, loss: 0.570527, best loss: 0.486541 2025-01-16 01:56:23,737 - INFO - step 14996, loss: 0.768412, best loss: 0.486541 2025-01-16 01:56:23,887 - INFO - step 14997, loss: 0.681049, best loss: 0.486541 2025-01-16 01:56:24,038 - INFO - step 14998, loss: 0.590040, best loss: 0.486541 2025-01-16 01:56:24,188 - INFO - step 14999, loss: 0.603159, best loss: 0.486541 2025-01-16 01:56:24,338 - INFO - step 15000, loss: 0.648419, best loss: 0.486541 2025-01-16 01:56:24,488 - INFO - step 15001, loss: 0.678880, best loss: 0.486541 2025-01-16 01:56:24,638 - INFO - step 15002, loss: 0.613185, best loss: 0.486541 2025-01-16 01:56:24,789 - INFO - step 15003, loss: 0.635623, best loss: 0.486541 2025-01-16 01:56:24,939 - INFO - step 15004, loss: 0.736040, best loss: 0.486541 2025-01-16 01:56:25,089 - INFO - step 15005, loss: 0.678395, best loss: 0.486541 2025-01-16 01:56:25,239 - INFO - step 15006, loss: 0.749943, best loss: 0.486541 2025-01-16 01:56:25,390 - INFO - step 15007, loss: 0.612909, best loss: 0.486541 2025-01-16 01:56:25,540 - INFO - step 15008, loss: 0.664455, best loss: 0.486541 2025-01-16 01:56:25,690 - INFO - step 15009, loss: 0.608617, best loss: 0.486541 2025-01-16 01:56:25,840 - INFO - step 15010, loss: 0.627096, best loss: 0.486541 2025-01-16 01:56:25,991 - INFO - step 15011, loss: 0.615456, best loss: 0.486541 2025-01-16 01:56:26,141 - INFO - step 15012, loss: 0.620156, best loss: 0.486541 2025-01-16 01:56:26,291 - INFO - step 15013, loss: 0.550388, best loss: 0.486541 2025-01-16 01:56:26,441 - INFO - step 15014, loss: 0.638884, best loss: 0.486541 2025-01-16 01:56:26,591 - INFO - step 15015, loss: 0.576666, best loss: 0.486541 2025-01-16 01:56:26,741 - INFO - step 15016, loss: 0.679690, best loss: 0.486541 
2025-01-16 01:56:26,891 - INFO - step 15017, loss: 0.624486, best loss: 0.486541 2025-01-16 01:56:27,041 - INFO - step 15018, loss: 0.621969, best loss: 0.486541 2025-01-16 01:56:27,191 - INFO - step 15019, loss: 0.660212, best loss: 0.486541 2025-01-16 01:56:27,342 - INFO - step 15020, loss: 0.621290, best loss: 0.486541 2025-01-16 01:56:27,492 - INFO - step 15021, loss: 0.623998, best loss: 0.486541 2025-01-16 01:56:27,642 - INFO - step 15022, loss: 0.636363, best loss: 0.486541 2025-01-16 01:56:27,792 - INFO - step 15023, loss: 0.662857, best loss: 0.486541 2025-01-16 01:56:27,942 - INFO - step 15024, loss: 0.626296, best loss: 0.486541 2025-01-16 01:56:28,092 - INFO - step 15025, loss: 0.565285, best loss: 0.486541 2025-01-16 01:56:28,242 - INFO - step 15026, loss: 0.673810, best loss: 0.486541 2025-01-16 01:56:28,392 - INFO - step 15027, loss: 0.661790, best loss: 0.486541 2025-01-16 01:56:28,542 - INFO - step 15028, loss: 0.649852, best loss: 0.486541 2025-01-16 01:56:28,692 - INFO - step 15029, loss: 0.628837, best loss: 0.486541 2025-01-16 01:56:28,842 - INFO - step 15030, loss: 0.835124, best loss: 0.486541 2025-01-16 01:56:28,992 - INFO - step 15031, loss: 0.636120, best loss: 0.486541 2025-01-16 01:56:29,143 - INFO - step 15032, loss: 0.689065, best loss: 0.486541 2025-01-16 01:56:29,293 - INFO - step 15033, loss: 0.729821, best loss: 0.486541 2025-01-16 01:56:29,443 - INFO - step 15034, loss: 0.624911, best loss: 0.486541 2025-01-16 01:56:29,593 - INFO - step 15035, loss: 0.569980, best loss: 0.486541 2025-01-16 01:56:29,743 - INFO - step 15036, loss: 0.668202, best loss: 0.486541 2025-01-16 01:56:29,893 - INFO - step 15037, loss: 0.703158, best loss: 0.486541 2025-01-16 01:56:30,044 - INFO - step 15038, loss: 0.508732, best loss: 0.486541 2025-01-16 01:56:30,194 - INFO - step 15039, loss: 0.733214, best loss: 0.486541 2025-01-16 01:56:30,344 - INFO - step 15040, loss: 0.650552, best loss: 0.486541 2025-01-16 01:56:30,494 - INFO - step 15041, loss: 
0.660854, best loss: 0.486541 2025-01-16 01:56:30,644 - INFO - step 15042, loss: 0.699015, best loss: 0.486541 2025-01-16 01:56:30,795 - INFO - step 15043, loss: 0.625888, best loss: 0.486541 2025-01-16 01:56:30,944 - INFO - step 15044, loss: 0.646263, best loss: 0.486541 2025-01-16 01:56:31,094 - INFO - step 15045, loss: 0.708774, best loss: 0.486541 2025-01-16 01:56:31,244 - INFO - step 15046, loss: 0.758234, best loss: 0.486541 2025-01-16 01:56:31,394 - INFO - step 15047, loss: 0.610452, best loss: 0.486541 2025-01-16 01:56:31,545 - INFO - step 15048, loss: 0.591204, best loss: 0.486541 2025-01-16 01:56:31,695 - INFO - step 15049, loss: 0.617996, best loss: 0.486541 2025-01-16 01:56:31,845 - INFO - step 15050, loss: 0.599497, best loss: 0.486541 2025-01-16 01:56:31,995 - INFO - step 15051, loss: 0.624554, best loss: 0.486541 2025-01-16 01:56:32,145 - INFO - step 15052, loss: 0.621614, best loss: 0.486541 2025-01-16 01:56:32,295 - INFO - step 15053, loss: 0.662370, best loss: 0.486541 2025-01-16 01:56:32,445 - INFO - step 15054, loss: 0.671948, best loss: 0.486541 2025-01-16 01:56:32,595 - INFO - step 15055, loss: 0.617603, best loss: 0.486541 2025-01-16 01:56:32,745 - INFO - step 15056, loss: 0.583649, best loss: 0.486541 2025-01-16 01:56:32,895 - INFO - step 15057, loss: 0.595174, best loss: 0.486541 2025-01-16 01:56:33,046 - INFO - step 15058, loss: 0.567087, best loss: 0.486541 2025-01-16 01:56:33,196 - INFO - step 15059, loss: 0.644858, best loss: 0.486541 2025-01-16 01:56:33,346 - INFO - step 15060, loss: 0.599034, best loss: 0.486541 2025-01-16 01:56:33,496 - INFO - step 15061, loss: 0.548536, best loss: 0.486541 2025-01-16 01:56:33,646 - INFO - step 15062, loss: 0.545204, best loss: 0.486541 2025-01-16 01:56:33,796 - INFO - step 15063, loss: 0.599319, best loss: 0.486541 2025-01-16 01:56:33,946 - INFO - step 15064, loss: 0.651957, best loss: 0.486541 2025-01-16 01:56:34,097 - INFO - step 15065, loss: 0.707352, best loss: 0.486541 2025-01-16 01:56:34,247 - 
INFO - step 15066, loss: 0.662255, best loss: 0.486541 2025-01-16 01:56:34,397 - INFO - step 15067, loss: 0.663463, best loss: 0.486541 2025-01-16 01:56:34,547 - INFO - step 15068, loss: 0.626756, best loss: 0.486541 2025-01-16 01:56:34,697 - INFO - step 15069, loss: 0.603063, best loss: 0.486541 2025-01-16 01:56:34,847 - INFO - step 15070, loss: 0.679999, best loss: 0.486541 2025-01-16 01:56:34,997 - INFO - step 15071, loss: 0.723667, best loss: 0.486541 2025-01-16 01:56:35,147 - INFO - step 15072, loss: 0.695571, best loss: 0.486541 2025-01-16 01:56:35,297 - INFO - step 15073, loss: 0.740066, best loss: 0.486541 2025-01-16 01:56:35,447 - INFO - step 15074, loss: 0.737084, best loss: 0.486541 2025-01-16 01:56:35,597 - INFO - step 15075, loss: 0.672233, best loss: 0.486541 2025-01-16 01:56:35,748 - INFO - step 15076, loss: 0.722699, best loss: 0.486541 2025-01-16 01:56:35,898 - INFO - step 15077, loss: 0.653562, best loss: 0.486541 2025-01-16 01:56:36,048 - INFO - step 15078, loss: 0.562487, best loss: 0.486541 2025-01-16 01:56:36,198 - INFO - step 15079, loss: 0.742967, best loss: 0.486541 2025-01-16 01:56:36,348 - INFO - step 15080, loss: 0.657639, best loss: 0.486541 2025-01-16 01:56:36,498 - INFO - step 15081, loss: 0.520578, best loss: 0.486541 2025-01-16 01:56:36,648 - INFO - step 15082, loss: 0.612034, best loss: 0.486541 2025-01-16 01:56:36,798 - INFO - step 15083, loss: 0.719177, best loss: 0.486541 2025-01-16 01:56:36,949 - INFO - step 15084, loss: 0.728381, best loss: 0.486541 2025-01-16 01:56:37,099 - INFO - step 15085, loss: 0.699010, best loss: 0.486541 2025-01-16 01:56:37,249 - INFO - step 15086, loss: 0.682098, best loss: 0.486541 2025-01-16 01:56:37,399 - INFO - step 15087, loss: 0.609149, best loss: 0.486541 2025-01-16 01:56:37,549 - INFO - step 15088, loss: 0.529319, best loss: 0.486541 2025-01-16 01:56:37,699 - INFO - step 15089, loss: 0.627483, best loss: 0.486541 2025-01-16 01:56:37,849 - INFO - step 15090, loss: 0.734680, best loss: 0.486541 
2025-01-16 01:56:37,999 - INFO - step 15091, loss: 0.828554, best loss: 0.486541 2025-01-16 01:56:38,149 - INFO - step 15092, loss: 0.669670, best loss: 0.486541 2025-01-16 01:56:38,300 - INFO - step 15093, loss: 0.666516, best loss: 0.486541 2025-01-16 01:56:38,450 - INFO - step 15094, loss: 0.729610, best loss: 0.486541 2025-01-16 01:56:38,600 - INFO - step 15095, loss: 0.670036, best loss: 0.486541 2025-01-16 01:56:38,750 - INFO - step 15096, loss: 0.603243, best loss: 0.486541 2025-01-16 01:56:38,900 - INFO - step 15097, loss: 0.684688, best loss: 0.486541 2025-01-16 01:56:39,051 - INFO - step 15098, loss: 0.606947, best loss: 0.486541 2025-01-16 01:56:39,201 - INFO - step 15099, loss: 0.545072, best loss: 0.486541 2025-01-16 01:56:39,351 - INFO - step 15100, loss: 0.646150, best loss: 0.486541 2025-01-16 01:56:39,501 - INFO - step 15101, loss: 0.661483, best loss: 0.486541 2025-01-16 01:56:39,651 - INFO - step 15102, loss: 0.615060, best loss: 0.486541 2025-01-16 01:56:39,801 - INFO - step 15103, loss: 0.540566, best loss: 0.486541 2025-01-16 01:56:39,951 - INFO - step 15104, loss: 0.536603, best loss: 0.486541 2025-01-16 01:56:40,101 - INFO - step 15105, loss: 0.550867, best loss: 0.486541 2025-01-16 01:56:40,251 - INFO - step 15106, loss: 0.600410, best loss: 0.486541 2025-01-16 01:56:40,402 - INFO - step 15107, loss: 0.700632, best loss: 0.486541 2025-01-16 01:56:40,552 - INFO - step 15108, loss: 0.579268, best loss: 0.486541 2025-01-16 01:56:40,702 - INFO - step 15109, loss: 0.680729, best loss: 0.486541 2025-01-16 01:56:40,852 - INFO - step 15110, loss: 0.673760, best loss: 0.486541 2025-01-16 01:56:41,002 - INFO - step 15111, loss: 0.583555, best loss: 0.486541 2025-01-16 01:56:41,152 - INFO - step 15112, loss: 0.623818, best loss: 0.486541 2025-01-16 01:56:41,302 - INFO - step 15113, loss: 0.545206, best loss: 0.486541 2025-01-16 01:56:41,452 - INFO - step 15114, loss: 0.526455, best loss: 0.486541 2025-01-16 01:56:41,602 - INFO - step 15115, loss: 
0.583980, best loss: 0.486541 2025-01-16 01:56:41,752 - INFO - step 15116, loss: 0.527460, best loss: 0.486541 2025-01-16 01:56:44,993 - INFO - step 15117, loss: 0.482871, best loss: 0.482871 2025-01-16 01:56:45,152 - INFO - step 15118, loss: 0.545587, best loss: 0.482871 2025-01-16 01:56:45,303 - INFO - step 15119, loss: 0.593668, best loss: 0.482871 2025-01-16 01:56:45,453 - INFO - step 15120, loss: 0.619488, best loss: 0.482871 2025-01-16 01:56:45,603 - INFO - step 15121, loss: 0.643302, best loss: 0.482871 2025-01-16 01:56:45,753 - INFO - step 15122, loss: 0.502103, best loss: 0.482871 2025-01-16 01:56:45,904 - INFO - step 15123, loss: 0.524282, best loss: 0.482871 2025-01-16 01:56:46,054 - INFO - step 15124, loss: 0.567654, best loss: 0.482871 2025-01-16 01:56:46,204 - INFO - step 15125, loss: 0.590772, best loss: 0.482871 2025-01-16 01:56:46,354 - INFO - step 15126, loss: 0.566706, best loss: 0.482871 2025-01-16 01:56:46,504 - INFO - step 15127, loss: 0.561169, best loss: 0.482871 2025-01-16 01:56:46,654 - INFO - step 15128, loss: 0.595944, best loss: 0.482871 2025-01-16 01:56:46,804 - INFO - step 15129, loss: 0.591597, best loss: 0.482871 2025-01-16 01:56:46,954 - INFO - step 15130, loss: 0.624119, best loss: 0.482871 2025-01-16 01:56:47,104 - INFO - step 15131, loss: 0.558337, best loss: 0.482871 2025-01-16 01:56:47,255 - INFO - step 15132, loss: 0.641728, best loss: 0.482871 2025-01-16 01:56:47,405 - INFO - step 15133, loss: 0.560260, best loss: 0.482871 2025-01-16 01:56:47,555 - INFO - step 15134, loss: 0.676522, best loss: 0.482871 2025-01-16 01:56:47,705 - INFO - step 15135, loss: 0.628849, best loss: 0.482871 2025-01-16 01:56:47,855 - INFO - step 15136, loss: 0.617754, best loss: 0.482871 2025-01-16 01:56:48,005 - INFO - step 15137, loss: 0.608499, best loss: 0.482871 2025-01-16 01:56:48,156 - INFO - step 15138, loss: 0.652236, best loss: 0.482871 2025-01-16 01:56:48,306 - INFO - step 15139, loss: 0.552027, best loss: 0.482871 2025-01-16 01:56:48,456 - 
INFO - step 15140, loss: 0.671292, best loss: 0.482871 2025-01-16 01:56:48,606 - INFO - step 15141, loss: 0.604671, best loss: 0.482871 2025-01-16 01:56:48,757 - INFO - step 15142, loss: 0.584075, best loss: 0.482871 2025-01-16 01:56:48,907 - INFO - step 15143, loss: 0.594856, best loss: 0.482871 2025-01-16 01:56:49,057 - INFO - step 15144, loss: 0.636875, best loss: 0.482871 2025-01-16 01:56:49,207 - INFO - step 15145, loss: 0.659537, best loss: 0.482871 2025-01-16 01:56:49,357 - INFO - step 15146, loss: 0.643909, best loss: 0.482871 2025-01-16 01:56:49,507 - INFO - step 15147, loss: 0.517951, best loss: 0.482871 2025-01-16 01:56:49,658 - INFO - step 15148, loss: 0.668928, best loss: 0.482871 2025-01-16 01:56:49,808 - INFO - step 15149, loss: 0.606994, best loss: 0.482871 2025-01-16 01:56:49,958 - INFO - step 15150, loss: 0.669558, best loss: 0.482871 2025-01-16 01:56:50,108 - INFO - step 15151, loss: 0.633447, best loss: 0.482871 2025-01-16 01:56:50,258 - INFO - step 15152, loss: 0.554474, best loss: 0.482871 2025-01-16 01:56:50,408 - INFO - step 15153, loss: 0.650689, best loss: 0.482871 2025-01-16 01:56:50,558 - INFO - step 15154, loss: 0.617243, best loss: 0.482871 2025-01-16 01:56:50,708 - INFO - step 15155, loss: 0.580112, best loss: 0.482871 2025-01-16 01:56:50,858 - INFO - step 15156, loss: 0.569680, best loss: 0.482871 2025-01-16 01:56:51,008 - INFO - step 15157, loss: 0.581769, best loss: 0.482871 2025-01-16 01:56:51,158 - INFO - step 15158, loss: 0.613478, best loss: 0.482871 2025-01-16 01:56:51,308 - INFO - step 15159, loss: 0.611164, best loss: 0.482871 2025-01-16 01:56:51,458 - INFO - step 15160, loss: 0.549728, best loss: 0.482871 2025-01-16 01:56:51,609 - INFO - step 15161, loss: 0.612415, best loss: 0.482871 2025-01-16 01:56:51,759 - INFO - step 15162, loss: 0.680040, best loss: 0.482871 2025-01-16 01:56:51,909 - INFO - step 15163, loss: 0.604732, best loss: 0.482871 2025-01-16 01:56:52,059 - INFO - step 15164, loss: 0.706888, best loss: 0.482871 
2025-01-16 01:56:52,209 - INFO - step 15165, loss: 0.602480, best loss: 0.482871 2025-01-16 01:56:52,359 - INFO - step 15166, loss: 0.500926, best loss: 0.482871 2025-01-16 01:56:52,509 - INFO - step 15167, loss: 0.578367, best loss: 0.482871 2025-01-16 01:56:52,659 - INFO - step 15168, loss: 0.606717, best loss: 0.482871 2025-01-16 01:56:52,809 - INFO - step 15169, loss: 0.639827, best loss: 0.482871 2025-01-16 01:56:52,959 - INFO - step 15170, loss: 0.665571, best loss: 0.482871 2025-01-16 01:56:53,109 - INFO - step 15171, loss: 0.627665, best loss: 0.482871 2025-01-16 01:56:53,259 - INFO - step 15172, loss: 0.628514, best loss: 0.482871 2025-01-16 01:56:53,409 - INFO - step 15173, loss: 0.556810, best loss: 0.482871 2025-01-16 01:56:53,560 - INFO - step 15174, loss: 0.550835, best loss: 0.482871 2025-01-16 01:56:53,710 - INFO - step 15175, loss: 0.661929, best loss: 0.482871 2025-01-16 01:56:53,860 - INFO - step 15176, loss: 0.772210, best loss: 0.482871 2025-01-16 01:57:01,772 - INFO - step 15177, loss: 0.446923, best loss: 0.446923 2025-01-16 01:57:01,922 - INFO - step 15178, loss: 0.546006, best loss: 0.446923 2025-01-16 01:57:02,072 - INFO - step 15179, loss: 0.511852, best loss: 0.446923 2025-01-16 01:57:02,222 - INFO - step 15180, loss: 0.536449, best loss: 0.446923 2025-01-16 01:57:02,372 - INFO - step 15181, loss: 0.604721, best loss: 0.446923 2025-01-16 01:57:02,522 - INFO - step 15182, loss: 0.566051, best loss: 0.446923 2025-01-16 01:57:02,672 - INFO - step 15183, loss: 0.567748, best loss: 0.446923 2025-01-16 01:57:02,823 - INFO - step 15184, loss: 0.633158, best loss: 0.446923 2025-01-16 01:57:02,973 - INFO - step 15185, loss: 0.588956, best loss: 0.446923 2025-01-16 01:57:03,123 - INFO - step 15186, loss: 0.622044, best loss: 0.446923 2025-01-16 01:57:03,273 - INFO - step 15187, loss: 0.553529, best loss: 0.446923 2025-01-16 01:57:03,423 - INFO - step 15188, loss: 0.606569, best loss: 0.446923 2025-01-16 01:57:03,574 - INFO - step 15189, loss: 
0.657986, best loss: 0.446923 2025-01-16 01:57:03,724 - INFO - step 15190, loss: 0.594545, best loss: 0.446923 2025-01-16 01:57:03,874 - INFO - step 15191, loss: 0.564680, best loss: 0.446923 2025-01-16 01:57:04,024 - INFO - step 15192, loss: 0.498447, best loss: 0.446923 2025-01-16 01:57:04,175 - INFO - step 15193, loss: 0.588473, best loss: 0.446923 2025-01-16 01:57:04,325 - INFO - step 15194, loss: 0.666104, best loss: 0.446923 2025-01-16 01:57:04,475 - INFO - step 15195, loss: 0.566369, best loss: 0.446923 2025-01-16 01:57:04,626 - INFO - step 15196, loss: 0.634486, best loss: 0.446923 2025-01-16 01:57:04,776 - INFO - step 15197, loss: 0.531571, best loss: 0.446923 2025-01-16 01:57:04,926 - INFO - step 15198, loss: 0.591454, best loss: 0.446923 2025-01-16 01:57:05,076 - INFO - step 15199, loss: 0.602585, best loss: 0.446923 2025-01-16 01:57:05,226 - INFO - step 15200, loss: 0.577108, best loss: 0.446923 2025-01-16 01:57:05,376 - INFO - step 15201, loss: 0.628930, best loss: 0.446923 2025-01-16 01:57:05,526 - INFO - step 15202, loss: 0.567910, best loss: 0.446923 2025-01-16 01:57:05,677 - INFO - step 15203, loss: 0.596973, best loss: 0.446923 2025-01-16 01:57:05,827 - INFO - step 15204, loss: 0.656218, best loss: 0.446923 2025-01-16 01:57:05,977 - INFO - step 15205, loss: 0.695466, best loss: 0.446923 2025-01-16 01:57:06,127 - INFO - step 15206, loss: 0.640327, best loss: 0.446923 2025-01-16 01:57:06,277 - INFO - step 15207, loss: 0.526951, best loss: 0.446923 2025-01-16 01:57:06,427 - INFO - step 15208, loss: 0.551496, best loss: 0.446923 2025-01-16 01:57:06,577 - INFO - step 15209, loss: 0.658263, best loss: 0.446923 2025-01-16 01:57:06,727 - INFO - step 15210, loss: 0.536639, best loss: 0.446923 2025-01-16 01:57:06,877 - INFO - step 15211, loss: 0.593997, best loss: 0.446923 2025-01-16 01:57:07,027 - INFO - step 15212, loss: 0.613577, best loss: 0.446923 2025-01-16 01:57:07,178 - INFO - step 15213, loss: 0.607852, best loss: 0.446923 2025-01-16 01:57:07,328 - 
INFO - step 15214, loss: 0.534860, best loss: 0.446923 2025-01-16 01:57:07,478 - INFO - step 15215, loss: 0.532884, best loss: 0.446923 2025-01-16 01:57:07,628 - INFO - step 15216, loss: 0.639895, best loss: 0.446923 2025-01-16 01:57:07,779 - INFO - step 15217, loss: 0.613364, best loss: 0.446923 2025-01-16 01:57:07,929 - INFO - step 15218, loss: 0.648117, best loss: 0.446923 2025-01-16 01:57:08,079 - INFO - step 15219, loss: 0.530712, best loss: 0.446923 2025-01-16 01:57:08,229 - INFO - step 15220, loss: 0.604776, best loss: 0.446923 2025-01-16 01:57:08,379 - INFO - step 15221, loss: 0.659895, best loss: 0.446923 2025-01-16 01:57:08,529 - INFO - step 15222, loss: 0.640691, best loss: 0.446923 2025-01-16 01:57:08,679 - INFO - step 15223, loss: 0.487637, best loss: 0.446923 2025-01-16 01:57:08,829 - INFO - step 15224, loss: 0.543036, best loss: 0.446923 2025-01-16 01:57:08,979 - INFO - step 15225, loss: 0.524430, best loss: 0.446923 2025-01-16 01:57:09,129 - INFO - step 15226, loss: 0.560455, best loss: 0.446923 2025-01-16 01:57:09,279 - INFO - step 15227, loss: 0.605668, best loss: 0.446923 2025-01-16 01:57:09,430 - INFO - step 15228, loss: 0.566343, best loss: 0.446923 2025-01-16 01:57:09,581 - INFO - step 15229, loss: 0.492205, best loss: 0.446923 2025-01-16 01:57:09,731 - INFO - step 15230, loss: 0.525482, best loss: 0.446923 2025-01-16 01:57:09,881 - INFO - step 15231, loss: 0.567482, best loss: 0.446923 2025-01-16 01:57:10,030 - INFO - step 15232, loss: 0.592033, best loss: 0.446923 2025-01-16 01:57:10,181 - INFO - step 15233, loss: 0.588446, best loss: 0.446923 2025-01-16 01:57:10,331 - INFO - step 15234, loss: 0.563259, best loss: 0.446923 2025-01-16 01:57:10,481 - INFO - step 15235, loss: 0.570901, best loss: 0.446923 2025-01-16 01:57:10,631 - INFO - step 15236, loss: 0.623977, best loss: 0.446923 2025-01-16 01:57:10,781 - INFO - step 15237, loss: 0.606940, best loss: 0.446923 2025-01-16 01:57:10,931 - INFO - step 15238, loss: 0.557833, best loss: 0.446923 
2025-01-16 01:57:11,082 - INFO - step 15239, loss: 0.662396, best loss: 0.446923 2025-01-16 01:57:11,232 - INFO - step 15240, loss: 0.624759, best loss: 0.446923 2025-01-16 01:57:11,382 - INFO - step 15241, loss: 0.684796, best loss: 0.446923 2025-01-16 01:57:11,532 - INFO - step 15242, loss: 0.641306, best loss: 0.446923 2025-01-16 01:57:11,682 - INFO - step 15243, loss: 0.642257, best loss: 0.446923 2025-01-16 01:57:11,832 - INFO - step 15244, loss: 0.666688, best loss: 0.446923 2025-01-16 01:57:11,982 - INFO - step 15245, loss: 0.539034, best loss: 0.446923 2025-01-16 01:57:12,132 - INFO - step 15246, loss: 0.660525, best loss: 0.446923 2025-01-16 01:57:12,282 - INFO - step 15247, loss: 0.716840, best loss: 0.446923 2025-01-16 01:57:12,432 - INFO - step 15248, loss: 0.548124, best loss: 0.446923 2025-01-16 01:57:12,582 - INFO - step 15249, loss: 0.633710, best loss: 0.446923 2025-01-16 01:57:12,732 - INFO - step 15250, loss: 0.671368, best loss: 0.446923 2025-01-16 01:57:12,882 - INFO - step 15251, loss: 0.630342, best loss: 0.446923 2025-01-16 01:57:13,032 - INFO - step 15252, loss: 0.614251, best loss: 0.446923 2025-01-16 01:57:13,182 - INFO - step 15253, loss: 0.675633, best loss: 0.446923 2025-01-16 01:57:13,333 - INFO - step 15254, loss: 0.671370, best loss: 0.446923 2025-01-16 01:57:13,483 - INFO - step 15255, loss: 0.607455, best loss: 0.446923 2025-01-16 01:57:13,634 - INFO - step 15256, loss: 0.590913, best loss: 0.446923 2025-01-16 01:57:13,784 - INFO - step 15257, loss: 0.642802, best loss: 0.446923 2025-01-16 01:57:13,934 - INFO - step 15258, loss: 0.564713, best loss: 0.446923 2025-01-16 01:57:14,084 - INFO - step 15259, loss: 0.508323, best loss: 0.446923 2025-01-16 01:57:14,234 - INFO - step 15260, loss: 0.616834, best loss: 0.446923 2025-01-16 01:57:14,384 - INFO - step 15261, loss: 0.540823, best loss: 0.446923 2025-01-16 01:57:14,534 - INFO - step 15262, loss: 0.540424, best loss: 0.446923 2025-01-16 01:57:14,685 - INFO - step 15263, loss: 
0.531845, best loss: 0.446923 2025-01-16 01:57:14,835 - INFO - step 15264, loss: 0.591445, best loss: 0.446923 2025-01-16 01:57:14,985 - INFO - step 15265, loss: 0.590877, best loss: 0.446923 2025-01-16 01:57:15,135 - INFO - step 15266, loss: 0.547642, best loss: 0.446923 2025-01-16 01:57:15,285 - INFO - step 15267, loss: 0.552511, best loss: 0.446923 2025-01-16 01:57:15,435 - INFO - step 15268, loss: 0.567148, best loss: 0.446923 2025-01-16 01:57:15,585 - INFO - step 15269, loss: 0.597315, best loss: 0.446923 2025-01-16 01:57:15,736 - INFO - step 15270, loss: 0.525892, best loss: 0.446923 2025-01-16 01:57:15,886 - INFO - step 15271, loss: 0.681379, best loss: 0.446923 2025-01-16 01:57:16,036 - INFO - step 15272, loss: 0.594153, best loss: 0.446923 2025-01-16 01:57:16,186 - INFO - step 15273, loss: 0.604415, best loss: 0.446923 2025-01-16 01:57:16,336 - INFO - step 15274, loss: 0.568478, best loss: 0.446923 2025-01-16 01:57:16,486 - INFO - step 15275, loss: 0.628415, best loss: 0.446923 2025-01-16 01:57:16,636 - INFO - step 15276, loss: 0.632328, best loss: 0.446923 2025-01-16 01:57:16,786 - INFO - step 15277, loss: 0.544658, best loss: 0.446923 2025-01-16 01:57:16,936 - INFO - step 15278, loss: 0.573104, best loss: 0.446923 2025-01-16 01:57:17,086 - INFO - step 15279, loss: 0.581000, best loss: 0.446923 2025-01-16 01:57:17,237 - INFO - step 15280, loss: 0.562308, best loss: 0.446923 2025-01-16 01:57:17,387 - INFO - step 15281, loss: 0.568012, best loss: 0.446923 2025-01-16 01:57:17,537 - INFO - step 15282, loss: 0.548707, best loss: 0.446923 2025-01-16 01:57:17,687 - INFO - step 15283, loss: 0.715943, best loss: 0.446923 2025-01-16 01:57:17,838 - INFO - step 15284, loss: 0.523149, best loss: 0.446923 2025-01-16 01:57:17,988 - INFO - step 15285, loss: 0.539518, best loss: 0.446923 2025-01-16 01:57:18,138 - INFO - step 15286, loss: 0.629513, best loss: 0.446923 2025-01-16 01:57:18,288 - INFO - step 15287, loss: 0.531363, best loss: 0.446923 2025-01-16 01:57:18,438 - 
INFO - step 15288, loss: 0.546937, best loss: 0.446923 2025-01-16 01:57:18,588 - INFO - step 15289, loss: 0.657661, best loss: 0.446923 2025-01-16 01:57:18,739 - INFO - step 15290, loss: 0.573452, best loss: 0.446923 2025-01-16 01:57:18,889 - INFO - step 15291, loss: 0.598806, best loss: 0.446923 2025-01-16 01:57:19,039 - INFO - step 15292, loss: 0.563633, best loss: 0.446923 2025-01-16 01:57:19,189 - INFO - step 15293, loss: 0.544780, best loss: 0.446923 2025-01-16 01:57:19,339 - INFO - step 15294, loss: 0.615352, best loss: 0.446923 2025-01-16 01:57:19,489 - INFO - step 15295, loss: 0.566468, best loss: 0.446923 2025-01-16 01:57:19,639 - INFO - step 15296, loss: 0.540170, best loss: 0.446923 2025-01-16 01:57:19,789 - INFO - step 15297, loss: 0.652977, best loss: 0.446923 2025-01-16 01:57:19,939 - INFO - step 15298, loss: 0.574805, best loss: 0.446923 2025-01-16 01:57:20,090 - INFO - step 15299, loss: 0.596889, best loss: 0.446923 2025-01-16 01:57:20,240 - INFO - step 15300, loss: 0.583977, best loss: 0.446923 2025-01-16 01:57:20,390 - INFO - step 15301, loss: 0.581995, best loss: 0.446923 2025-01-16 01:57:20,540 - INFO - step 15302, loss: 0.582160, best loss: 0.446923 2025-01-16 01:57:20,690 - INFO - step 15303, loss: 0.501341, best loss: 0.446923 2025-01-16 01:57:20,840 - INFO - step 15304, loss: 0.587887, best loss: 0.446923 2025-01-16 01:57:20,990 - INFO - step 15305, loss: 0.504295, best loss: 0.446923 2025-01-16 01:57:21,140 - INFO - step 15306, loss: 0.555503, best loss: 0.446923 2025-01-16 01:57:21,291 - INFO - step 15307, loss: 0.647436, best loss: 0.446923 2025-01-16 01:57:21,441 - INFO - step 15308, loss: 0.489459, best loss: 0.446923 2025-01-16 01:57:21,591 - INFO - step 15309, loss: 0.583531, best loss: 0.446923 2025-01-16 01:57:21,741 - INFO - step 15310, loss: 0.588806, best loss: 0.446923 2025-01-16 01:57:21,891 - INFO - step 15311, loss: 0.623603, best loss: 0.446923 2025-01-16 01:57:22,042 - INFO - step 15312, loss: 0.632785, best loss: 0.446923 
2025-01-16 01:57:22,192 - INFO - step 15313, loss: 0.504095, best loss: 0.446923 2025-01-16 01:57:22,342 - INFO - step 15314, loss: 0.586293, best loss: 0.446923 2025-01-16 01:57:22,492 - INFO - step 15315, loss: 0.466253, best loss: 0.446923 2025-01-16 01:57:26,038 - INFO - step 15316, loss: 0.441271, best loss: 0.441271 2025-01-16 01:57:26,190 - INFO - step 15317, loss: 0.591940, best loss: 0.441271 2025-01-16 01:57:26,340 - INFO - step 15318, loss: 0.502787, best loss: 0.441271 2025-01-16 01:57:26,489 - INFO - step 15319, loss: 0.574447, best loss: 0.441271 2025-01-16 01:57:26,640 - INFO - step 15320, loss: 0.629669, best loss: 0.441271 2025-01-16 01:57:26,790 - INFO - step 15321, loss: 0.534312, best loss: 0.441271 2025-01-16 01:57:26,940 - INFO - step 15322, loss: 0.600575, best loss: 0.441271 2025-01-16 01:57:27,090 - INFO - step 15323, loss: 0.475538, best loss: 0.441271 2025-01-16 01:57:27,240 - INFO - step 15324, loss: 0.472005, best loss: 0.441271 2025-01-16 01:57:27,390 - INFO - step 15325, loss: 0.543969, best loss: 0.441271 2025-01-16 01:57:27,541 - INFO - step 15326, loss: 0.605112, best loss: 0.441271 2025-01-16 01:57:27,691 - INFO - step 15327, loss: 0.623182, best loss: 0.441271 2025-01-16 01:57:27,841 - INFO - step 15328, loss: 0.502113, best loss: 0.441271 2025-01-16 01:57:27,991 - INFO - step 15329, loss: 0.557186, best loss: 0.441271 2025-01-16 01:57:28,142 - INFO - step 15330, loss: 0.565601, best loss: 0.441271 2025-01-16 01:57:28,293 - INFO - step 15331, loss: 0.731427, best loss: 0.441271 2025-01-16 01:57:28,443 - INFO - step 15332, loss: 0.629925, best loss: 0.441271 2025-01-16 01:57:28,594 - INFO - step 15333, loss: 0.582557, best loss: 0.441271 2025-01-16 01:57:28,744 - INFO - step 15334, loss: 0.629430, best loss: 0.441271 2025-01-16 01:57:28,894 - INFO - step 15335, loss: 0.624128, best loss: 0.441271 2025-01-16 01:57:29,044 - INFO - step 15336, loss: 0.616350, best loss: 0.441271 2025-01-16 01:57:29,195 - INFO - step 15337, loss: 
0.564283, best loss: 0.441271 2025-01-16 01:57:29,345 - INFO - step 15338, loss: 0.664089, best loss: 0.441271 2025-01-16 01:57:29,495 - INFO - step 15339, loss: 0.707950, best loss: 0.441271 2025-01-16 01:57:29,645 - INFO - step 15340, loss: 0.641955, best loss: 0.441271 2025-01-16 01:57:29,796 - INFO - step 15341, loss: 0.592465, best loss: 0.441271 2025-01-16 01:57:29,946 - INFO - step 15342, loss: 0.563707, best loss: 0.441271 2025-01-16 01:57:30,096 - INFO - step 15343, loss: 0.548251, best loss: 0.441271 2025-01-16 01:57:30,246 - INFO - step 15344, loss: 0.620851, best loss: 0.441271 2025-01-16 01:57:30,397 - INFO - step 15345, loss: 0.551456, best loss: 0.441271 2025-01-16 01:57:30,547 - INFO - step 15346, loss: 0.670004, best loss: 0.441271 2025-01-16 01:57:30,697 - INFO - step 15347, loss: 0.628787, best loss: 0.441271 2025-01-16 01:57:30,847 - INFO - step 15348, loss: 0.601020, best loss: 0.441271 2025-01-16 01:57:30,997 - INFO - step 15349, loss: 0.633680, best loss: 0.441271 2025-01-16 01:57:31,147 - INFO - step 15350, loss: 0.556332, best loss: 0.441271 2025-01-16 01:57:31,297 - INFO - step 15351, loss: 0.507415, best loss: 0.441271 2025-01-16 01:57:31,448 - INFO - step 15352, loss: 0.542010, best loss: 0.441271 2025-01-16 01:57:31,598 - INFO - step 15353, loss: 0.584186, best loss: 0.441271 2025-01-16 01:57:31,748 - INFO - step 15354, loss: 0.630062, best loss: 0.441271 2025-01-16 01:57:31,898 - INFO - step 15355, loss: 0.558289, best loss: 0.441271 2025-01-16 01:57:32,048 - INFO - step 15356, loss: 0.667472, best loss: 0.441271 2025-01-16 01:57:32,199 - INFO - step 15357, loss: 0.688745, best loss: 0.441271 2025-01-16 01:57:32,349 - INFO - step 15358, loss: 0.597707, best loss: 0.441271 2025-01-16 01:57:32,499 - INFO - step 15359, loss: 0.636162, best loss: 0.441271 2025-01-16 01:57:32,650 - INFO - step 15360, loss: 0.634986, best loss: 0.441271 2025-01-16 01:57:32,800 - INFO - step 15361, loss: 0.527140, best loss: 0.441271 2025-01-16 01:57:32,950 - 
INFO - step 15362, loss: 0.556863, best loss: 0.441271 2025-01-16 01:57:33,100 - INFO - step 15363, loss: 0.696853, best loss: 0.441271 2025-01-16 01:57:33,250 - INFO - step 15364, loss: 0.543254, best loss: 0.441271 2025-01-16 01:57:33,400 - INFO - step 15365, loss: 0.599847, best loss: 0.441271 2025-01-16 01:57:33,551 - INFO - step 15366, loss: 0.624338, best loss: 0.441271 2025-01-16 01:57:33,701 - INFO - step 15367, loss: 0.695148, best loss: 0.441271 2025-01-16 01:57:33,851 - INFO - step 15368, loss: 0.560233, best loss: 0.441271 2025-01-16 01:57:34,001 - INFO - step 15369, loss: 0.618312, best loss: 0.441271 2025-01-16 01:57:34,151 - INFO - step 15370, loss: 0.543657, best loss: 0.441271 2025-01-16 01:57:34,301 - INFO - step 15371, loss: 0.616144, best loss: 0.441271 2025-01-16 01:57:34,452 - INFO - step 15372, loss: 0.632969, best loss: 0.441271 2025-01-16 01:57:34,602 - INFO - step 15373, loss: 0.563565, best loss: 0.441271 2025-01-16 01:57:34,752 - INFO - step 15374, loss: 0.639108, best loss: 0.441271 2025-01-16 01:57:34,902 - INFO - step 15375, loss: 0.675672, best loss: 0.441271 2025-01-16 01:57:35,052 - INFO - step 15376, loss: 0.679634, best loss: 0.441271 2025-01-16 01:57:35,202 - INFO - step 15377, loss: 0.629007, best loss: 0.441271 2025-01-16 01:57:35,352 - INFO - step 15378, loss: 0.546092, best loss: 0.441271 2025-01-16 01:57:35,502 - INFO - step 15379, loss: 0.554449, best loss: 0.441271 2025-01-16 01:57:35,652 - INFO - step 15380, loss: 0.584325, best loss: 0.441271 2025-01-16 01:57:35,803 - INFO - step 15381, loss: 0.640155, best loss: 0.441271 2025-01-16 01:57:35,953 - INFO - step 15382, loss: 0.589945, best loss: 0.441271 2025-01-16 01:57:36,103 - INFO - step 15383, loss: 0.639038, best loss: 0.441271 2025-01-16 01:57:36,253 - INFO - step 15384, loss: 0.608262, best loss: 0.441271 2025-01-16 01:57:36,402 - INFO - step 15385, loss: 0.561877, best loss: 0.441271 2025-01-16 01:57:36,552 - INFO - step 15386, loss: 0.530799, best loss: 0.441271 
2025-01-16 01:57:36,703 - INFO - step 15387, loss: 0.530638, best loss: 0.441271 2025-01-16 01:57:36,853 - INFO - step 15388, loss: 0.497062, best loss: 0.441271 2025-01-16 01:57:37,003 - INFO - step 15389, loss: 0.624355, best loss: 0.441271 2025-01-16 01:57:37,153 - INFO - step 15390, loss: 0.546063, best loss: 0.441271 2025-01-16 01:57:37,303 - INFO - step 15391, loss: 0.641036, best loss: 0.441271 2025-01-16 01:57:37,453 - INFO - step 15392, loss: 0.500705, best loss: 0.441271 2025-01-16 01:57:37,603 - INFO - step 15393, loss: 0.600741, best loss: 0.441271 2025-01-16 01:57:37,753 - INFO - step 15394, loss: 0.608534, best loss: 0.441271 2025-01-16 01:57:37,903 - INFO - step 15395, loss: 0.656485, best loss: 0.441271 2025-01-16 01:57:38,053 - INFO - step 15396, loss: 0.629893, best loss: 0.441271 2025-01-16 01:57:38,203 - INFO - step 15397, loss: 0.606724, best loss: 0.441271 2025-01-16 01:57:38,353 - INFO - step 15398, loss: 0.579353, best loss: 0.441271 2025-01-16 01:57:38,503 - INFO - step 15399, loss: 0.625518, best loss: 0.441271 2025-01-16 01:57:38,653 - INFO - step 15400, loss: 0.602556, best loss: 0.441271 2025-01-16 01:57:38,803 - INFO - step 15401, loss: 0.669559, best loss: 0.441271 2025-01-16 01:57:38,953 - INFO - step 15402, loss: 0.631896, best loss: 0.441271 2025-01-16 01:57:39,104 - INFO - step 15403, loss: 0.640841, best loss: 0.441271 2025-01-16 01:57:39,254 - INFO - step 15404, loss: 0.668532, best loss: 0.441271 2025-01-16 01:57:39,404 - INFO - step 15405, loss: 0.581091, best loss: 0.441271 2025-01-16 01:57:39,554 - INFO - step 15406, loss: 0.645042, best loss: 0.441271 2025-01-16 01:57:39,704 - INFO - step 15407, loss: 0.577239, best loss: 0.441271 2025-01-16 01:57:39,855 - INFO - step 15408, loss: 0.570740, best loss: 0.441271 2025-01-16 01:57:40,005 - INFO - step 15409, loss: 0.624348, best loss: 0.441271 2025-01-16 01:57:40,155 - INFO - step 15410, loss: 0.606779, best loss: 0.441271 2025-01-16 01:57:40,305 - INFO - step 15411, loss: 
0.476148, best loss: 0.441271 2025-01-16 01:57:40,455 - INFO - step 15412, loss: 0.530920, best loss: 0.441271 2025-01-16 01:57:40,605 - INFO - step 15413, loss: 0.773228, best loss: 0.441271 2025-01-16 01:57:40,755 - INFO - step 15414, loss: 0.732148, best loss: 0.441271 2025-01-16 01:57:40,906 - INFO - step 15415, loss: 0.626063, best loss: 0.441271 2025-01-16 01:57:41,055 - INFO - step 15416, loss: 0.543263, best loss: 0.441271 2025-01-16 01:57:41,205 - INFO - step 15417, loss: 0.495230, best loss: 0.441271 2025-01-16 01:57:41,356 - INFO - step 15418, loss: 0.516359, best loss: 0.441271 2025-01-16 01:57:41,506 - INFO - step 15419, loss: 0.569032, best loss: 0.441271 2025-01-16 01:57:41,656 - INFO - step 15420, loss: 0.658840, best loss: 0.441271 2025-01-16 01:57:41,806 - INFO - step 15421, loss: 0.662358, best loss: 0.441271 2025-01-16 01:57:41,956 - INFO - step 15422, loss: 0.579947, best loss: 0.441271 2025-01-16 01:57:42,106 - INFO - step 15423, loss: 0.496457, best loss: 0.441271 2025-01-16 01:57:42,256 - INFO - step 15424, loss: 0.622620, best loss: 0.441271 2025-01-16 01:57:42,407 - INFO - step 15425, loss: 0.561333, best loss: 0.441271 2025-01-16 01:57:42,557 - INFO - step 15426, loss: 0.547123, best loss: 0.441271 2025-01-16 01:57:42,707 - INFO - step 15427, loss: 0.588122, best loss: 0.441271 2025-01-16 01:57:42,857 - INFO - step 15428, loss: 0.519488, best loss: 0.441271 2025-01-16 01:57:43,007 - INFO - step 15429, loss: 0.529320, best loss: 0.441271 2025-01-16 01:57:43,157 - INFO - step 15430, loss: 0.565500, best loss: 0.441271 2025-01-16 01:57:43,307 - INFO - step 15431, loss: 0.578336, best loss: 0.441271 2025-01-16 01:57:43,457 - INFO - step 15432, loss: 0.557781, best loss: 0.441271 2025-01-16 01:57:46,954 - INFO - step 15433, loss: 0.440636, best loss: 0.440636 2025-01-16 01:57:47,116 - INFO - step 15434, loss: 0.498750, best loss: 0.440636 2025-01-16 01:57:47,267 - INFO - step 15435, loss: 0.464181, best loss: 0.440636 2025-01-16 01:57:47,418 - 
INFO - step 15436, loss: 0.539593, best loss: 0.440636 2025-01-16 01:57:47,568 - INFO - step 15437, loss: 0.634584, best loss: 0.440636 2025-01-16 01:57:47,718 - INFO - step 15438, loss: 0.516210, best loss: 0.440636 2025-01-16 01:57:47,868 - INFO - step 15439, loss: 0.559855, best loss: 0.440636 2025-01-16 01:57:48,018 - INFO - step 15440, loss: 0.575233, best loss: 0.440636 2025-01-16 01:57:48,168 - INFO - step 15441, loss: 0.548394, best loss: 0.440636 2025-01-16 01:57:48,318 - INFO - step 15442, loss: 0.539537, best loss: 0.440636 2025-01-16 01:57:48,468 - INFO - step 15443, loss: 0.447806, best loss: 0.440636 2025-01-16 01:57:48,618 - INFO - step 15444, loss: 0.472289, best loss: 0.440636 2025-01-16 01:57:48,768 - INFO - step 15445, loss: 0.472216, best loss: 0.440636 2025-01-16 01:57:48,918 - INFO - step 15446, loss: 0.504621, best loss: 0.440636 2025-01-16 01:57:49,068 - INFO - step 15447, loss: 0.457143, best loss: 0.440636 2025-01-16 01:57:49,218 - INFO - step 15448, loss: 0.503736, best loss: 0.440636 2025-01-16 01:57:49,369 - INFO - step 15449, loss: 0.517137, best loss: 0.440636 2025-01-16 01:57:49,519 - INFO - step 15450, loss: 0.496700, best loss: 0.440636 2025-01-16 01:57:49,669 - INFO - step 15451, loss: 0.534055, best loss: 0.440636 2025-01-16 01:57:49,819 - INFO - step 15452, loss: 0.448957, best loss: 0.440636 2025-01-16 01:57:49,969 - INFO - step 15453, loss: 0.465822, best loss: 0.440636 2025-01-16 01:57:50,119 - INFO - step 15454, loss: 0.559307, best loss: 0.440636 2025-01-16 01:57:50,269 - INFO - step 15455, loss: 0.620537, best loss: 0.440636 2025-01-16 01:57:50,420 - INFO - step 15456, loss: 0.572445, best loss: 0.440636 2025-01-16 01:57:50,570 - INFO - step 15457, loss: 0.510073, best loss: 0.440636 2025-01-16 01:57:50,720 - INFO - step 15458, loss: 0.558433, best loss: 0.440636 2025-01-16 01:57:50,870 - INFO - step 15459, loss: 0.589930, best loss: 0.440636 2025-01-16 01:57:51,020 - INFO - step 15460, loss: 0.560772, best loss: 0.440636 
2025-01-16 01:57:51,171 - INFO - step 15461, loss: 0.443140, best loss: 0.440636 2025-01-16 01:57:51,321 - INFO - step 15462, loss: 0.490620, best loss: 0.440636 2025-01-16 01:57:51,471 - INFO - step 15463, loss: 0.475290, best loss: 0.440636 2025-01-16 01:57:51,621 - INFO - step 15464, loss: 0.560344, best loss: 0.440636 2025-01-16 01:57:51,771 - INFO - step 15465, loss: 0.507838, best loss: 0.440636 2025-01-16 01:57:51,921 - INFO - step 15466, loss: 0.559064, best loss: 0.440636 2025-01-16 01:57:52,071 - INFO - step 15467, loss: 0.489050, best loss: 0.440636 2025-01-16 01:57:52,221 - INFO - step 15468, loss: 0.591421, best loss: 0.440636 2025-01-16 01:57:52,371 - INFO - step 15469, loss: 0.542344, best loss: 0.440636 2025-01-16 01:57:52,521 - INFO - step 15470, loss: 0.522668, best loss: 0.440636 2025-01-16 01:57:52,671 - INFO - step 15471, loss: 0.561550, best loss: 0.440636 2025-01-16 01:57:52,821 - INFO - step 15472, loss: 0.511085, best loss: 0.440636 2025-01-16 01:57:52,971 - INFO - step 15473, loss: 0.542576, best loss: 0.440636 2025-01-16 01:57:53,121 - INFO - step 15474, loss: 0.540145, best loss: 0.440636 2025-01-16 01:57:53,271 - INFO - step 15475, loss: 0.556859, best loss: 0.440636 2025-01-16 01:57:53,421 - INFO - step 15476, loss: 0.501327, best loss: 0.440636 2025-01-16 01:57:53,571 - INFO - step 15477, loss: 0.445515, best loss: 0.440636 2025-01-16 01:57:53,721 - INFO - step 15478, loss: 0.552530, best loss: 0.440636 2025-01-16 01:57:53,871 - INFO - step 15479, loss: 0.546992, best loss: 0.440636 2025-01-16 01:57:54,021 - INFO - step 15480, loss: 0.564274, best loss: 0.440636 2025-01-16 01:57:54,171 - INFO - step 15481, loss: 0.563967, best loss: 0.440636 2025-01-16 01:57:54,322 - INFO - step 15482, loss: 0.516580, best loss: 0.440636 2025-01-16 01:57:54,472 - INFO - step 15483, loss: 0.531828, best loss: 0.440636 2025-01-16 01:57:54,622 - INFO - step 15484, loss: 0.514830, best loss: 0.440636 2025-01-16 01:57:54,772 - INFO - step 15485, loss: 
0.522849, best loss: 0.440636 2025-01-16 01:57:54,922 - INFO - step 15486, loss: 0.462674, best loss: 0.440636 2025-01-16 01:57:55,072 - INFO - step 15487, loss: 0.545216, best loss: 0.440636 2025-01-16 01:57:55,222 - INFO - step 15488, loss: 0.598056, best loss: 0.440636 2025-01-16 01:57:55,372 - INFO - step 15489, loss: 0.597489, best loss: 0.440636 2025-01-16 01:57:55,522 - INFO - step 15490, loss: 0.481114, best loss: 0.440636 2025-01-16 01:57:55,672 - INFO - step 15491, loss: 0.543289, best loss: 0.440636 2025-01-16 01:57:55,823 - INFO - step 15492, loss: 0.632200, best loss: 0.440636 2025-01-16 01:57:55,973 - INFO - step 15493, loss: 0.535542, best loss: 0.440636 2025-01-16 01:57:56,123 - INFO - step 15494, loss: 0.539390, best loss: 0.440636 2025-01-16 01:57:56,273 - INFO - step 15495, loss: 0.504726, best loss: 0.440636 2025-01-16 01:57:56,423 - INFO - step 15496, loss: 0.454328, best loss: 0.440636 2025-01-16 01:57:56,573 - INFO - step 15497, loss: 0.502176, best loss: 0.440636 2025-01-16 01:57:56,723 - INFO - step 15498, loss: 0.611464, best loss: 0.440636 2025-01-16 01:57:56,873 - INFO - step 15499, loss: 0.630049, best loss: 0.440636 2025-01-16 01:57:57,023 - INFO - step 15500, loss: 0.584144, best loss: 0.440636 2025-01-16 01:57:57,173 - INFO - step 15501, loss: 0.562366, best loss: 0.440636 2025-01-16 01:57:57,323 - INFO - step 15502, loss: 0.498069, best loss: 0.440636 2025-01-16 01:57:57,474 - INFO - step 15503, loss: 0.520093, best loss: 0.440636 2025-01-16 01:57:57,624 - INFO - step 15504, loss: 0.505872, best loss: 0.440636 2025-01-16 01:57:57,774 - INFO - step 15505, loss: 0.592376, best loss: 0.440636 2025-01-16 01:57:57,924 - INFO - step 15506, loss: 0.585111, best loss: 0.440636 2025-01-16 01:58:01,495 - INFO - step 15507, loss: 0.408834, best loss: 0.408834 2025-01-16 01:58:01,645 - INFO - step 15508, loss: 0.494600, best loss: 0.408834 2025-01-16 01:58:01,795 - INFO - step 15509, loss: 0.479842, best loss: 0.408834 2025-01-16 01:58:01,945 - 
INFO - step 15510, loss: 0.507844, best loss: 0.408834 2025-01-16 01:58:02,095 - INFO - step 15511, loss: 0.584362, best loss: 0.408834 2025-01-16 01:58:02,245 - INFO - step 15512, loss: 0.587123, best loss: 0.408834 2025-01-16 01:58:02,395 - INFO - step 15513, loss: 0.546951, best loss: 0.408834 2025-01-16 01:58:02,545 - INFO - step 15514, loss: 0.532838, best loss: 0.408834 2025-01-16 01:58:02,695 - INFO - step 15515, loss: 0.504048, best loss: 0.408834 2025-01-16 01:58:02,845 - INFO - step 15516, loss: 0.510032, best loss: 0.408834 2025-01-16 01:58:02,995 - INFO - step 15517, loss: 0.477806, best loss: 0.408834 2025-01-16 01:58:03,146 - INFO - step 15518, loss: 0.581613, best loss: 0.408834 2025-01-16 01:58:03,296 - INFO - step 15519, loss: 0.505085, best loss: 0.408834 2025-01-16 01:58:03,446 - INFO - step 15520, loss: 0.498673, best loss: 0.408834 2025-01-16 01:58:03,595 - INFO - step 15521, loss: 0.489988, best loss: 0.408834 2025-01-16 01:58:03,746 - INFO - step 15522, loss: 0.416319, best loss: 0.408834 2025-01-16 01:58:03,896 - INFO - step 15523, loss: 0.539723, best loss: 0.408834 2025-01-16 01:58:04,046 - INFO - step 15524, loss: 0.576937, best loss: 0.408834 2025-01-16 01:58:04,196 - INFO - step 15525, loss: 0.569578, best loss: 0.408834 2025-01-16 01:58:04,346 - INFO - step 15526, loss: 0.558993, best loss: 0.408834 2025-01-16 01:58:04,496 - INFO - step 15527, loss: 0.513816, best loss: 0.408834 2025-01-16 01:58:04,646 - INFO - step 15528, loss: 0.544720, best loss: 0.408834 2025-01-16 01:58:04,797 - INFO - step 15529, loss: 0.515211, best loss: 0.408834 2025-01-16 01:58:04,947 - INFO - step 15530, loss: 0.505997, best loss: 0.408834 2025-01-16 01:58:05,097 - INFO - step 15531, loss: 0.474311, best loss: 0.408834 2025-01-16 01:58:05,247 - INFO - step 15532, loss: 0.477820, best loss: 0.408834 2025-01-16 01:58:05,397 - INFO - step 15533, loss: 0.511573, best loss: 0.408834 2025-01-16 01:58:05,547 - INFO - step 15534, loss: 0.544743, best loss: 0.408834 
2025-01-16 01:58:05,697 - INFO - step 15535, loss: 0.578020, best loss: 0.408834 2025-01-16 01:58:05,847 - INFO - step 15536, loss: 0.531682, best loss: 0.408834 2025-01-16 01:58:05,997 - INFO - step 15537, loss: 0.481188, best loss: 0.408834 2025-01-16 01:58:06,147 - INFO - step 15538, loss: 0.484635, best loss: 0.408834 2025-01-16 01:58:06,297 - INFO - step 15539, loss: 0.581911, best loss: 0.408834 2025-01-16 01:58:06,448 - INFO - step 15540, loss: 0.481366, best loss: 0.408834 2025-01-16 01:58:06,598 - INFO - step 15541, loss: 0.514167, best loss: 0.408834 2025-01-16 01:58:06,748 - INFO - step 15542, loss: 0.582786, best loss: 0.408834 2025-01-16 01:58:06,898 - INFO - step 15543, loss: 0.552233, best loss: 0.408834 2025-01-16 01:58:07,049 - INFO - step 15544, loss: 0.502104, best loss: 0.408834 2025-01-16 01:58:07,199 - INFO - step 15545, loss: 0.439946, best loss: 0.408834 2025-01-16 01:58:07,349 - INFO - step 15546, loss: 0.562504, best loss: 0.408834 2025-01-16 01:58:07,499 - INFO - step 15547, loss: 0.521359, best loss: 0.408834 2025-01-16 01:58:07,649 - INFO - step 15548, loss: 0.496023, best loss: 0.408834 2025-01-16 01:58:07,800 - INFO - step 15549, loss: 0.447299, best loss: 0.408834 2025-01-16 01:58:07,950 - INFO - step 15550, loss: 0.590083, best loss: 0.408834 2025-01-16 01:58:08,100 - INFO - step 15551, loss: 0.567332, best loss: 0.408834 2025-01-16 01:58:08,250 - INFO - step 15552, loss: 0.581293, best loss: 0.408834 2025-01-16 01:58:08,400 - INFO - step 15553, loss: 0.561345, best loss: 0.408834 2025-01-16 01:58:08,550 - INFO - step 15554, loss: 0.476014, best loss: 0.408834 2025-01-16 01:58:08,700 - INFO - step 15555, loss: 0.490115, best loss: 0.408834 2025-01-16 01:58:08,851 - INFO - step 15556, loss: 0.589982, best loss: 0.408834 2025-01-16 01:58:09,001 - INFO - step 15557, loss: 0.542172, best loss: 0.408834 2025-01-16 01:58:09,151 - INFO - step 15558, loss: 0.536717, best loss: 0.408834 2025-01-16 01:58:09,301 - INFO - step 15559, loss: 
0.516000, best loss: 0.408834 2025-01-16 01:58:09,451 - INFO - step 15560, loss: 0.486046, best loss: 0.408834 2025-01-16 01:58:09,602 - INFO - step 15561, loss: 0.518913, best loss: 0.408834 2025-01-16 01:58:09,752 - INFO - step 15562, loss: 0.523785, best loss: 0.408834 2025-01-16 01:58:09,903 - INFO - step 15563, loss: 0.527947, best loss: 0.408834 2025-01-16 01:58:10,053 - INFO - step 15564, loss: 0.516366, best loss: 0.408834 2025-01-16 01:58:10,203 - INFO - step 15565, loss: 0.495213, best loss: 0.408834 2025-01-16 01:58:10,353 - INFO - step 15566, loss: 0.521082, best loss: 0.408834 2025-01-16 01:58:10,503 - INFO - step 15567, loss: 0.592726, best loss: 0.408834 2025-01-16 01:58:10,654 - INFO - step 15568, loss: 0.477686, best loss: 0.408834 2025-01-16 01:58:10,804 - INFO - step 15569, loss: 0.589808, best loss: 0.408834 2025-01-16 01:58:10,954 - INFO - step 15570, loss: 0.558154, best loss: 0.408834 2025-01-16 01:58:11,104 - INFO - step 15571, loss: 0.684740, best loss: 0.408834 2025-01-16 01:58:11,254 - INFO - step 15572, loss: 0.591170, best loss: 0.408834 2025-01-16 01:58:11,404 - INFO - step 15573, loss: 0.612258, best loss: 0.408834 2025-01-16 01:58:11,554 - INFO - step 15574, loss: 0.570657, best loss: 0.408834 2025-01-16 01:58:11,705 - INFO - step 15575, loss: 0.472139, best loss: 0.408834 2025-01-16 01:58:11,855 - INFO - step 15576, loss: 0.557884, best loss: 0.408834 2025-01-16 01:58:12,005 - INFO - step 15577, loss: 0.584238, best loss: 0.408834 2025-01-16 01:58:12,155 - INFO - step 15578, loss: 0.492609, best loss: 0.408834 2025-01-16 01:58:12,305 - INFO - step 15579, loss: 0.584588, best loss: 0.408834 2025-01-16 01:58:12,455 - INFO - step 15580, loss: 0.638866, best loss: 0.408834 2025-01-16 01:58:12,605 - INFO - step 15581, loss: 0.563807, best loss: 0.408834 2025-01-16 01:58:12,755 - INFO - step 15582, loss: 0.604102, best loss: 0.408834 2025-01-16 01:58:12,905 - INFO - step 15583, loss: 0.598920, best loss: 0.408834 2025-01-16 01:58:13,055 - 
INFO - step 15584, loss: 0.545444, best loss: 0.408834 2025-01-16 01:58:13,205 - INFO - step 15585, loss: 0.599409, best loss: 0.408834 2025-01-16 01:58:13,355 - INFO - step 15586, loss: 0.588697, best loss: 0.408834 2025-01-16 01:58:13,505 - INFO - step 15587, loss: 0.661002, best loss: 0.408834 2025-01-16 01:58:13,655 - INFO - step 15588, loss: 0.581571, best loss: 0.408834 2025-01-16 01:58:13,805 - INFO - step 15589, loss: 0.545161, best loss: 0.408834 2025-01-16 01:58:13,955 - INFO - step 15590, loss: 0.523292, best loss: 0.408834 2025-01-16 01:58:14,105 - INFO - step 15591, loss: 0.475670, best loss: 0.408834 2025-01-16 01:58:14,255 - INFO - step 15592, loss: 0.510910, best loss: 0.408834 2025-01-16 01:58:14,405 - INFO - step 15593, loss: 0.501745, best loss: 0.408834 2025-01-16 01:58:14,555 - INFO - step 15594, loss: 0.524945, best loss: 0.408834 2025-01-16 01:58:14,706 - INFO - step 15595, loss: 0.549847, best loss: 0.408834 2025-01-16 01:58:14,856 - INFO - step 15596, loss: 0.548985, best loss: 0.408834 2025-01-16 01:58:15,006 - INFO - step 15597, loss: 0.563534, best loss: 0.408834 2025-01-16 01:58:15,156 - INFO - step 15598, loss: 0.529006, best loss: 0.408834 2025-01-16 01:58:15,306 - INFO - step 15599, loss: 0.542680, best loss: 0.408834 2025-01-16 01:58:15,456 - INFO - step 15600, loss: 0.509177, best loss: 0.408834 2025-01-16 01:58:15,606 - INFO - step 15601, loss: 0.576664, best loss: 0.408834 2025-01-16 01:58:15,757 - INFO - step 15602, loss: 0.520570, best loss: 0.408834 2025-01-16 01:58:15,907 - INFO - step 15603, loss: 0.548163, best loss: 0.408834 2025-01-16 01:58:16,057 - INFO - step 15604, loss: 0.525040, best loss: 0.408834 2025-01-16 01:58:16,207 - INFO - step 15605, loss: 0.569806, best loss: 0.408834 2025-01-16 01:58:16,357 - INFO - step 15606, loss: 0.552794, best loss: 0.408834 2025-01-16 01:58:16,508 - INFO - step 15607, loss: 0.525378, best loss: 0.408834 2025-01-16 01:58:16,658 - INFO - step 15608, loss: 0.505920, best loss: 0.408834 
2025-01-16 01:58:16,808 - INFO - step 15609, loss: 0.533455, best loss: 0.408834 2025-01-16 01:58:16,958 - INFO - step 15610, loss: 0.464035, best loss: 0.408834 2025-01-16 01:58:17,108 - INFO - step 15611, loss: 0.573352, best loss: 0.408834 2025-01-16 01:58:17,258 - INFO - step 15612, loss: 0.469985, best loss: 0.408834 2025-01-16 01:58:17,409 - INFO - step 15613, loss: 0.615857, best loss: 0.408834 2025-01-16 01:58:17,559 - INFO - step 15614, loss: 0.532655, best loss: 0.408834 2025-01-16 01:58:17,709 - INFO - step 15615, loss: 0.469371, best loss: 0.408834 2025-01-16 01:58:17,859 - INFO - step 15616, loss: 0.560204, best loss: 0.408834 2025-01-16 01:58:18,009 - INFO - step 15617, loss: 0.513469, best loss: 0.408834 2025-01-16 01:58:18,160 - INFO - step 15618, loss: 0.556673, best loss: 0.408834 2025-01-16 01:58:18,310 - INFO - step 15619, loss: 0.582327, best loss: 0.408834 2025-01-16 01:58:18,460 - INFO - step 15620, loss: 0.471500, best loss: 0.408834 2025-01-16 01:58:18,610 - INFO - step 15621, loss: 0.526097, best loss: 0.408834 2025-01-16 01:58:18,760 - INFO - step 15622, loss: 0.456728, best loss: 0.408834 2025-01-16 01:58:18,911 - INFO - step 15623, loss: 0.477318, best loss: 0.408834 2025-01-16 01:58:19,061 - INFO - step 15624, loss: 0.563738, best loss: 0.408834 2025-01-16 01:58:19,211 - INFO - step 15625, loss: 0.483051, best loss: 0.408834 2025-01-16 01:58:19,361 - INFO - step 15626, loss: 0.572837, best loss: 0.408834 2025-01-16 01:58:19,512 - INFO - step 15627, loss: 0.496341, best loss: 0.408834 2025-01-16 01:58:19,662 - INFO - step 15628, loss: 0.428049, best loss: 0.408834 2025-01-16 01:58:19,812 - INFO - step 15629, loss: 0.536573, best loss: 0.408834 2025-01-16 01:58:19,962 - INFO - step 15630, loss: 0.487034, best loss: 0.408834 2025-01-16 01:58:20,112 - INFO - step 15631, loss: 0.443950, best loss: 0.408834 2025-01-16 01:58:20,262 - INFO - step 15632, loss: 0.547576, best loss: 0.408834 2025-01-16 01:58:20,413 - INFO - step 15633, loss: 
0.456302, best loss: 0.408834 2025-01-16 01:58:20,563 - INFO - step 15634, loss: 0.467381, best loss: 0.408834 2025-01-16 01:58:20,713 - INFO - step 15635, loss: 0.443939, best loss: 0.408834 2025-01-16 01:58:20,863 - INFO - step 15636, loss: 0.452572, best loss: 0.408834 2025-01-16 01:58:21,013 - INFO - step 15637, loss: 0.514829, best loss: 0.408834 2025-01-16 01:58:21,164 - INFO - step 15638, loss: 0.474744, best loss: 0.408834 2025-01-16 01:58:21,314 - INFO - step 15639, loss: 0.512233, best loss: 0.408834 2025-01-16 01:58:21,464 - INFO - step 15640, loss: 0.467730, best loss: 0.408834 2025-01-16 01:58:21,615 - INFO - step 15641, loss: 0.507231, best loss: 0.408834 2025-01-16 01:58:21,765 - INFO - step 15642, loss: 0.590397, best loss: 0.408834 2025-01-16 01:58:21,915 - INFO - step 15643, loss: 0.496109, best loss: 0.408834 2025-01-16 01:58:22,065 - INFO - step 15644, loss: 0.612327, best loss: 0.408834 2025-01-16 01:58:22,215 - INFO - step 15645, loss: 0.443966, best loss: 0.408834 2025-01-16 01:58:25,780 - INFO - step 15646, loss: 0.394553, best loss: 0.394553 2025-01-16 01:58:25,932 - INFO - step 15647, loss: 0.542240, best loss: 0.394553 2025-01-16 01:58:26,082 - INFO - step 15648, loss: 0.476141, best loss: 0.394553 2025-01-16 01:58:26,232 - INFO - step 15649, loss: 0.491449, best loss: 0.394553 2025-01-16 01:58:26,382 - INFO - step 15650, loss: 0.508274, best loss: 0.394553 2025-01-16 01:58:26,532 - INFO - step 15651, loss: 0.500450, best loss: 0.394553 2025-01-16 01:58:26,683 - INFO - step 15652, loss: 0.533866, best loss: 0.394553 2025-01-16 01:58:26,833 - INFO - step 15653, loss: 0.413810, best loss: 0.394553 2025-01-16 01:58:30,578 - INFO - step 15654, loss: 0.363574, best loss: 0.363574 2025-01-16 01:58:30,729 - INFO - step 15655, loss: 0.454169, best loss: 0.363574 2025-01-16 01:58:30,879 - INFO - step 15656, loss: 0.472921, best loss: 0.363574 2025-01-16 01:58:31,047 - INFO - step 15657, loss: 0.545234, best loss: 0.363574 2025-01-16 01:58:31,197 - 
INFO - step 15658, loss: 0.487881, best loss: 0.363574 2025-01-16 01:58:31,347 - INFO - step 15659, loss: 0.447427, best loss: 0.363574 2025-01-16 01:58:31,498 - INFO - step 15660, loss: 0.500629, best loss: 0.363574 2025-01-16 01:58:31,648 - INFO - step 15661, loss: 0.522621, best loss: 0.363574 2025-01-16 01:58:31,803 - INFO - step 15662, loss: 0.472343, best loss: 0.363574 2025-01-16 01:58:31,953 - INFO - step 15663, loss: 0.488938, best loss: 0.363574 2025-01-16 01:58:32,104 - INFO - step 15664, loss: 0.583048, best loss: 0.363574 2025-01-16 01:58:32,254 - INFO - step 15665, loss: 0.543997, best loss: 0.363574 2025-01-16 01:58:32,404 - INFO - step 15666, loss: 0.598481, best loss: 0.363574 2025-01-16 01:58:32,554 - INFO - step 15667, loss: 0.534815, best loss: 0.363574 2025-01-16 01:58:32,704 - INFO - step 15668, loss: 0.564152, best loss: 0.363574 2025-01-16 01:58:32,854 - INFO - step 15669, loss: 0.556082, best loss: 0.363574 2025-01-16 01:58:33,005 - INFO - step 15670, loss: 0.542060, best loss: 0.363574 2025-01-16 01:58:33,155 - INFO - step 15671, loss: 0.559368, best loss: 0.363574 2025-01-16 01:58:33,305 - INFO - step 15672, loss: 0.497105, best loss: 0.363574 2025-01-16 01:58:33,455 - INFO - step 15673, loss: 0.476484, best loss: 0.363574 2025-01-16 01:58:33,605 - INFO - step 15674, loss: 0.529361, best loss: 0.363574 2025-01-16 01:58:33,755 - INFO - step 15675, loss: 0.476355, best loss: 0.363574 2025-01-16 01:58:33,906 - INFO - step 15676, loss: 0.629434, best loss: 0.363574 2025-01-16 01:58:34,056 - INFO - step 15677, loss: 0.558763, best loss: 0.363574 2025-01-16 01:58:34,206 - INFO - step 15678, loss: 0.572681, best loss: 0.363574 2025-01-16 01:58:34,356 - INFO - step 15679, loss: 0.526977, best loss: 0.363574 2025-01-16 01:58:34,506 - INFO - step 15680, loss: 0.520706, best loss: 0.363574 2025-01-16 01:58:34,657 - INFO - step 15681, loss: 0.449706, best loss: 0.363574 2025-01-16 01:58:34,807 - INFO - step 15682, loss: 0.477271, best loss: 0.363574 
2025-01-16 01:58:34,957 - INFO - step 15683, loss: 0.464335, best loss: 0.363574 2025-01-16 01:58:35,108 - INFO - step 15684, loss: 0.563760, best loss: 0.363574 2025-01-16 01:58:35,258 - INFO - step 15685, loss: 0.527226, best loss: 0.363574 2025-01-16 01:58:35,408 - INFO - step 15686, loss: 0.666674, best loss: 0.363574 2025-01-16 01:58:35,558 - INFO - step 15687, loss: 0.598339, best loss: 0.363574 2025-01-16 01:58:35,709 - INFO - step 15688, loss: 0.540272, best loss: 0.363574 2025-01-16 01:58:35,859 - INFO - step 15689, loss: 0.503909, best loss: 0.363574 2025-01-16 01:58:36,010 - INFO - step 15690, loss: 0.621163, best loss: 0.363574 2025-01-16 01:58:36,160 - INFO - step 15691, loss: 0.456299, best loss: 0.363574 2025-01-16 01:58:36,310 - INFO - step 15692, loss: 0.479019, best loss: 0.363574 2025-01-16 01:58:36,461 - INFO - step 15693, loss: 0.576604, best loss: 0.363574 2025-01-16 01:58:36,612 - INFO - step 15694, loss: 0.500161, best loss: 0.363574 2025-01-16 01:58:36,762 - INFO - step 15695, loss: 0.472578, best loss: 0.363574 2025-01-16 01:58:36,912 - INFO - step 15696, loss: 0.551142, best loss: 0.363574 2025-01-16 01:58:37,062 - INFO - step 15697, loss: 0.553302, best loss: 0.363574 2025-01-16 01:58:37,213 - INFO - step 15698, loss: 0.537395, best loss: 0.363574 2025-01-16 01:58:37,363 - INFO - step 15699, loss: 0.577620, best loss: 0.363574 2025-01-16 01:58:37,514 - INFO - step 15700, loss: 0.522642, best loss: 0.363574 2025-01-16 01:58:37,664 - INFO - step 15701, loss: 0.522572, best loss: 0.363574 2025-01-16 01:58:37,814 - INFO - step 15702, loss: 0.587143, best loss: 0.363574 2025-01-16 01:58:37,964 - INFO - step 15703, loss: 0.518360, best loss: 0.363574 2025-01-16 01:58:38,115 - INFO - step 15704, loss: 0.545385, best loss: 0.363574 2025-01-16 01:58:38,265 - INFO - step 15705, loss: 0.605696, best loss: 0.363574 2025-01-16 01:58:38,415 - INFO - step 15706, loss: 0.570609, best loss: 0.363574 2025-01-16 01:58:38,565 - INFO - step 15707, loss: 
0.519640, best loss: 0.363574 2025-01-16 01:58:38,715 - INFO - step 15708, loss: 0.521544, best loss: 0.363574 2025-01-16 01:58:38,866 - INFO - step 15709, loss: 0.460952, best loss: 0.363574 2025-01-16 01:58:39,016 - INFO - step 15710, loss: 0.542046, best loss: 0.363574 2025-01-16 01:58:39,166 - INFO - step 15711, loss: 0.510822, best loss: 0.363574 2025-01-16 01:58:39,316 - INFO - step 15712, loss: 0.571955, best loss: 0.363574 2025-01-16 01:58:39,466 - INFO - step 15713, loss: 0.570945, best loss: 0.363574 2025-01-16 01:58:39,616 - INFO - step 15714, loss: 0.526898, best loss: 0.363574 2025-01-16 01:58:39,767 - INFO - step 15715, loss: 0.510151, best loss: 0.363574 2025-01-16 01:58:39,917 - INFO - step 15716, loss: 0.524880, best loss: 0.363574 2025-01-16 01:58:40,067 - INFO - step 15717, loss: 0.585954, best loss: 0.363574 2025-01-16 01:58:40,217 - INFO - step 15718, loss: 0.519531, best loss: 0.363574 2025-01-16 01:58:40,367 - INFO - step 15719, loss: 0.537678, best loss: 0.363574 2025-01-16 01:58:40,517 - INFO - step 15720, loss: 0.551977, best loss: 0.363574 2025-01-16 01:58:40,667 - INFO - step 15721, loss: 0.517553, best loss: 0.363574 2025-01-16 01:58:40,817 - INFO - step 15722, loss: 0.443012, best loss: 0.363574 2025-01-16 01:58:40,967 - INFO - step 15723, loss: 0.460741, best loss: 0.363574 2025-01-16 01:58:41,117 - INFO - step 15724, loss: 0.557862, best loss: 0.363574 2025-01-16 01:58:41,267 - INFO - step 15725, loss: 0.517019, best loss: 0.363574 2025-01-16 01:58:41,418 - INFO - step 15726, loss: 0.592397, best loss: 0.363574 2025-01-16 01:58:41,568 - INFO - step 15727, loss: 0.527139, best loss: 0.363574 2025-01-16 01:58:41,718 - INFO - step 15728, loss: 0.514445, best loss: 0.363574 2025-01-16 01:58:41,868 - INFO - step 15729, loss: 0.547592, best loss: 0.363574 2025-01-16 01:58:42,018 - INFO - step 15730, loss: 0.520418, best loss: 0.363574 2025-01-16 01:58:42,168 - INFO - step 15731, loss: 0.556511, best loss: 0.363574 2025-01-16 01:58:42,319 - 
INFO - step 15732, loss: 0.562922, best loss: 0.363574 2025-01-16 01:58:42,469 - INFO - step 15733, loss: 0.549920, best loss: 0.363574 2025-01-16 01:58:42,619 - INFO - step 15734, loss: 0.552893, best loss: 0.363574 2025-01-16 01:58:42,769 - INFO - step 15735, loss: 0.553364, best loss: 0.363574 2025-01-16 01:58:42,919 - INFO - step 15736, loss: 0.532460, best loss: 0.363574 2025-01-16 01:58:43,069 - INFO - step 15737, loss: 0.509764, best loss: 0.363574 2025-01-16 01:58:43,219 - INFO - step 15738, loss: 0.426253, best loss: 0.363574 2025-01-16 01:58:43,369 - INFO - step 15739, loss: 0.560499, best loss: 0.363574 2025-01-16 01:58:43,519 - INFO - step 15740, loss: 0.481049, best loss: 0.363574 2025-01-16 01:58:43,670 - INFO - step 15741, loss: 0.447247, best loss: 0.363574 2025-01-16 01:58:43,820 - INFO - step 15742, loss: 0.447250, best loss: 0.363574 2025-01-16 01:58:43,970 - INFO - step 15743, loss: 0.507543, best loss: 0.363574 2025-01-16 01:58:44,120 - INFO - step 15744, loss: 0.593398, best loss: 0.363574 2025-01-16 01:58:44,270 - INFO - step 15745, loss: 0.499526, best loss: 0.363574 2025-01-16 01:58:44,420 - INFO - step 15746, loss: 0.468174, best loss: 0.363574 2025-01-16 01:58:44,570 - INFO - step 15747, loss: 0.458460, best loss: 0.363574 2025-01-16 01:58:44,720 - INFO - step 15748, loss: 0.420088, best loss: 0.363574 2025-01-16 01:58:44,870 - INFO - step 15749, loss: 0.571056, best loss: 0.363574 2025-01-16 01:58:45,020 - INFO - step 15750, loss: 0.540882, best loss: 0.363574 2025-01-16 01:58:45,170 - INFO - step 15751, loss: 0.611497, best loss: 0.363574 2025-01-16 01:58:45,320 - INFO - step 15752, loss: 0.514047, best loss: 0.363574 2025-01-16 01:58:45,471 - INFO - step 15753, loss: 0.432081, best loss: 0.363574 2025-01-16 01:58:45,621 - INFO - step 15754, loss: 0.558390, best loss: 0.363574 2025-01-16 01:58:45,771 - INFO - step 15755, loss: 0.458245, best loss: 0.363574 2025-01-16 01:58:45,921 - INFO - step 15756, loss: 0.448198, best loss: 0.363574 
2025-01-16 01:58:46,072 - INFO - step 15757, loss: 0.508708, best loss: 0.363574 2025-01-16 01:58:46,222 - INFO - step 15758, loss: 0.441035, best loss: 0.363574 2025-01-16 01:58:46,372 - INFO - step 15759, loss: 0.387270, best loss: 0.363574 2025-01-16 01:58:46,522 - INFO - step 15760, loss: 0.471850, best loss: 0.363574 2025-01-16 01:58:46,672 - INFO - step 15761, loss: 0.540035, best loss: 0.363574 2025-01-16 01:58:46,822 - INFO - step 15762, loss: 0.487669, best loss: 0.363574 2025-01-16 01:58:46,972 - INFO - step 15763, loss: 0.423005, best loss: 0.363574 2025-01-16 01:58:47,123 - INFO - step 15764, loss: 0.475674, best loss: 0.363574 2025-01-16 01:58:47,273 - INFO - step 15765, loss: 0.400498, best loss: 0.363574 2025-01-16 01:58:47,423 - INFO - step 15766, loss: 0.488567, best loss: 0.363574 2025-01-16 01:58:47,573 - INFO - step 15767, loss: 0.494914, best loss: 0.363574 2025-01-16 01:58:47,723 - INFO - step 15768, loss: 0.471430, best loss: 0.363574 2025-01-16 01:58:47,873 - INFO - step 15769, loss: 0.480116, best loss: 0.363574 2025-01-16 01:58:48,023 - INFO - step 15770, loss: 0.512534, best loss: 0.363574 2025-01-16 01:58:48,173 - INFO - step 15771, loss: 0.456902, best loss: 0.363574 2025-01-16 01:58:48,323 - INFO - step 15772, loss: 0.507391, best loss: 0.363574 2025-01-16 01:58:48,473 - INFO - step 15773, loss: 0.404989, best loss: 0.363574 2025-01-16 01:58:48,624 - INFO - step 15774, loss: 0.421948, best loss: 0.363574 2025-01-16 01:58:48,774 - INFO - step 15775, loss: 0.399515, best loss: 0.363574 2025-01-16 01:58:48,924 - INFO - step 15776, loss: 0.448442, best loss: 0.363574 2025-01-16 01:58:49,074 - INFO - step 15777, loss: 0.411280, best loss: 0.363574 2025-01-16 01:58:49,224 - INFO - step 15778, loss: 0.509020, best loss: 0.363574 2025-01-16 01:58:49,374 - INFO - step 15779, loss: 0.469137, best loss: 0.363574 2025-01-16 01:58:49,525 - INFO - step 15780, loss: 0.455748, best loss: 0.363574 2025-01-16 01:58:49,675 - INFO - step 15781, loss: 
0.451697, best loss: 0.363574 2025-01-16 01:58:49,825 - INFO - step 15782, loss: 0.403506, best loss: 0.363574 2025-01-16 01:58:49,975 - INFO - step 15783, loss: 0.444664, best loss: 0.363574 2025-01-16 01:58:50,125 - INFO - step 15784, loss: 0.473255, best loss: 0.363574 2025-01-16 01:58:50,275 - INFO - step 15785, loss: 0.478703, best loss: 0.363574 2025-01-16 01:58:50,426 - INFO - step 15786, loss: 0.421554, best loss: 0.363574 2025-01-16 01:58:50,576 - INFO - step 15787, loss: 0.407104, best loss: 0.363574 2025-01-16 01:58:50,726 - INFO - step 15788, loss: 0.495773, best loss: 0.363574 2025-01-16 01:58:50,876 - INFO - step 15789, loss: 0.458763, best loss: 0.363574 2025-01-16 01:58:51,026 - INFO - step 15790, loss: 0.489877, best loss: 0.363574 2025-01-16 01:58:51,176 - INFO - step 15791, loss: 0.387520, best loss: 0.363574 2025-01-16 01:58:51,326 - INFO - step 15792, loss: 0.525441, best loss: 0.363574 2025-01-16 01:58:51,476 - INFO - step 15793, loss: 0.431954, best loss: 0.363574 2025-01-16 01:58:51,627 - INFO - step 15794, loss: 0.554571, best loss: 0.363574 2025-01-16 01:58:51,777 - INFO - step 15795, loss: 0.465439, best loss: 0.363574 2025-01-16 01:58:51,927 - INFO - step 15796, loss: 0.428165, best loss: 0.363574 2025-01-16 01:58:52,077 - INFO - step 15797, loss: 0.480623, best loss: 0.363574 2025-01-16 01:58:52,227 - INFO - step 15798, loss: 0.492954, best loss: 0.363574 2025-01-16 01:58:52,377 - INFO - step 15799, loss: 0.453131, best loss: 0.363574 2025-01-16 01:58:52,527 - INFO - step 15800, loss: 0.385405, best loss: 0.363574 2025-01-16 01:58:52,677 - INFO - step 15801, loss: 0.459734, best loss: 0.363574 2025-01-16 01:58:52,827 - INFO - step 15802, loss: 0.528057, best loss: 0.363574 2025-01-16 01:58:52,977 - INFO - step 15803, loss: 0.498485, best loss: 0.363574 2025-01-16 01:58:53,127 - INFO - step 15804, loss: 0.479350, best loss: 0.363574 2025-01-16 01:58:53,278 - INFO - step 15805, loss: 0.539725, best loss: 0.363574 2025-01-16 01:58:53,428 - 
INFO - step 15806, loss: 0.488763, best loss: 0.363574 2025-01-16 01:58:53,578 - INFO - step 15807, loss: 0.440879, best loss: 0.363574 2025-01-16 01:58:53,728 - INFO - step 15808, loss: 0.489360, best loss: 0.363574 2025-01-16 01:58:53,878 - INFO - step 15809, loss: 0.484199, best loss: 0.363574 2025-01-16 01:58:54,028 - INFO - step 15810, loss: 0.455527, best loss: 0.363574 2025-01-16 01:58:54,178 - INFO - step 15811, loss: 0.515901, best loss: 0.363574 2025-01-16 01:58:54,329 - INFO - step 15812, loss: 0.482844, best loss: 0.363574 2025-01-16 01:58:54,479 - INFO - step 15813, loss: 0.444933, best loss: 0.363574 2025-01-16 01:58:54,629 - INFO - step 15814, loss: 0.464383, best loss: 0.363574 2025-01-16 01:58:54,779 - INFO - step 15815, loss: 0.454638, best loss: 0.363574 2025-01-16 01:58:54,929 - INFO - step 15816, loss: 0.452117, best loss: 0.363574 2025-01-16 01:58:55,079 - INFO - step 15817, loss: 0.488070, best loss: 0.363574 2025-01-16 01:58:55,229 - INFO - step 15818, loss: 0.531372, best loss: 0.363574 2025-01-16 01:58:55,380 - INFO - step 15819, loss: 0.535213, best loss: 0.363574 2025-01-16 01:58:55,529 - INFO - step 15820, loss: 0.429175, best loss: 0.363574 2025-01-16 01:58:55,680 - INFO - step 15821, loss: 0.491033, best loss: 0.363574 2025-01-16 01:58:55,830 - INFO - step 15822, loss: 0.646989, best loss: 0.363574 2025-01-16 01:58:55,980 - INFO - step 15823, loss: 0.495958, best loss: 0.363574 2025-01-16 01:58:56,130 - INFO - step 15824, loss: 0.462479, best loss: 0.363574 2025-01-16 01:58:56,280 - INFO - step 15825, loss: 0.403601, best loss: 0.363574 2025-01-16 01:58:56,430 - INFO - step 15826, loss: 0.419498, best loss: 0.363574 2025-01-16 01:58:56,580 - INFO - step 15827, loss: 0.490540, best loss: 0.363574 2025-01-16 01:58:56,730 - INFO - step 15828, loss: 0.530491, best loss: 0.363574 2025-01-16 01:58:56,881 - INFO - step 15829, loss: 0.542414, best loss: 0.363574 2025-01-16 01:58:57,031 - INFO - step 15830, loss: 0.517340, best loss: 0.363574 
2025-01-16 01:58:57,181 - INFO - step 15831, loss: 0.500857, best loss: 0.363574 2025-01-16 01:58:57,331 - INFO - step 15832, loss: 0.457490, best loss: 0.363574 2025-01-16 01:58:57,481 - INFO - step 15833, loss: 0.492982, best loss: 0.363574 2025-01-16 01:58:57,631 - INFO - step 15834, loss: 0.435689, best loss: 0.363574 2025-01-16 01:58:57,782 - INFO - step 15835, loss: 0.492564, best loss: 0.363574 2025-01-16 01:58:57,932 - INFO - step 15836, loss: 0.527035, best loss: 0.363574 2025-01-16 01:58:58,082 - INFO - step 15837, loss: 0.404628, best loss: 0.363574 2025-01-16 01:58:58,232 - INFO - step 15838, loss: 0.483969, best loss: 0.363574 2025-01-16 01:58:58,382 - INFO - step 15839, loss: 0.472117, best loss: 0.363574 2025-01-16 01:58:58,532 - INFO - step 15840, loss: 0.480208, best loss: 0.363574 2025-01-16 01:58:58,682 - INFO - step 15841, loss: 0.563110, best loss: 0.363574 2025-01-16 01:58:58,833 - INFO - step 15842, loss: 0.520940, best loss: 0.363574 2025-01-16 01:58:58,983 - INFO - step 15843, loss: 0.518738, best loss: 0.363574 2025-01-16 01:58:59,133 - INFO - step 15844, loss: 0.454664, best loss: 0.363574 2025-01-16 01:58:59,283 - INFO - step 15845, loss: 0.451024, best loss: 0.363574 2025-01-16 01:58:59,433 - INFO - step 15846, loss: 0.465738, best loss: 0.363574 2025-01-16 01:58:59,584 - INFO - step 15847, loss: 0.485078, best loss: 0.363574 2025-01-16 01:58:59,734 - INFO - step 15848, loss: 0.549412, best loss: 0.363574 2025-01-16 01:58:59,884 - INFO - step 15849, loss: 0.526503, best loss: 0.363574 2025-01-16 01:59:00,034 - INFO - step 15850, loss: 0.426245, best loss: 0.363574 2025-01-16 01:59:00,184 - INFO - step 15851, loss: 0.466039, best loss: 0.363574 2025-01-16 01:59:00,334 - INFO - step 15852, loss: 0.388238, best loss: 0.363574 2025-01-16 01:59:00,485 - INFO - step 15853, loss: 0.485982, best loss: 0.363574 2025-01-16 01:59:00,635 - INFO - step 15854, loss: 0.543689, best loss: 0.363574 2025-01-16 01:59:00,785 - INFO - step 15855, loss: 
0.471082, best loss: 0.363574 2025-01-16 01:59:00,935 - INFO - step 15856, loss: 0.535946, best loss: 0.363574 2025-01-16 01:59:01,086 - INFO - step 15857, loss: 0.464577, best loss: 0.363574 2025-01-16 01:59:01,236 - INFO - step 15858, loss: 0.574867, best loss: 0.363574 2025-01-16 01:59:01,386 - INFO - step 15859, loss: 0.469349, best loss: 0.363574 2025-01-16 01:59:01,536 - INFO - step 15860, loss: 0.488239, best loss: 0.363574 2025-01-16 01:59:01,686 - INFO - step 15861, loss: 0.445327, best loss: 0.363574 2025-01-16 01:59:01,836 - INFO - step 15862, loss: 0.436903, best loss: 0.363574 2025-01-16 01:59:01,987 - INFO - step 15863, loss: 0.462044, best loss: 0.363574 2025-01-16 01:59:02,137 - INFO - step 15864, loss: 0.529812, best loss: 0.363574 2025-01-16 01:59:02,287 - INFO - step 15865, loss: 0.564999, best loss: 0.363574 2025-01-16 01:59:02,437 - INFO - step 15866, loss: 0.502241, best loss: 0.363574 2025-01-16 01:59:02,587 - INFO - step 15867, loss: 0.462864, best loss: 0.363574 2025-01-16 01:59:02,737 - INFO - step 15868, loss: 0.453423, best loss: 0.363574 2025-01-16 01:59:02,887 - INFO - step 15869, loss: 0.509130, best loss: 0.363574 2025-01-16 01:59:03,037 - INFO - step 15870, loss: 0.456519, best loss: 0.363574 2025-01-16 01:59:03,187 - INFO - step 15871, loss: 0.465820, best loss: 0.363574 2025-01-16 01:59:03,337 - INFO - step 15872, loss: 0.498241, best loss: 0.363574 2025-01-16 01:59:03,488 - INFO - step 15873, loss: 0.464322, best loss: 0.363574 2025-01-16 01:59:03,637 - INFO - step 15874, loss: 0.408660, best loss: 0.363574 2025-01-16 01:59:03,788 - INFO - step 15875, loss: 0.399214, best loss: 0.363574 2025-01-16 01:59:03,938 - INFO - step 15876, loss: 0.491419, best loss: 0.363574 2025-01-16 01:59:04,088 - INFO - step 15877, loss: 0.473583, best loss: 0.363574 2025-01-16 01:59:04,238 - INFO - step 15878, loss: 0.454497, best loss: 0.363574 2025-01-16 01:59:04,388 - INFO - step 15879, loss: 0.425099, best loss: 0.363574 2025-01-16 01:59:04,538 - 
INFO - step 15880, loss: 0.472991, best loss: 0.363574 2025-01-16 01:59:04,688 - INFO - step 15881, loss: 0.502714, best loss: 0.363574 2025-01-16 01:59:04,838 - INFO - step 15882, loss: 0.572930, best loss: 0.363574 2025-01-16 01:59:04,988 - INFO - step 15883, loss: 0.404839, best loss: 0.363574 2025-01-16 01:59:05,138 - INFO - step 15884, loss: 0.427846, best loss: 0.363574 2025-01-16 01:59:05,289 - INFO - step 15885, loss: 0.393910, best loss: 0.363574 2025-01-16 01:59:05,439 - INFO - step 15886, loss: 0.472283, best loss: 0.363574 2025-01-16 01:59:05,589 - INFO - step 15887, loss: 0.457592, best loss: 0.363574 2025-01-16 01:59:05,739 - INFO - step 15888, loss: 0.480456, best loss: 0.363574 2025-01-16 01:59:05,890 - INFO - step 15889, loss: 0.410162, best loss: 0.363574 2025-01-16 01:59:06,040 - INFO - step 15890, loss: 0.460122, best loss: 0.363574 2025-01-16 01:59:06,190 - INFO - step 15891, loss: 0.453074, best loss: 0.363574 2025-01-16 01:59:06,340 - INFO - step 15892, loss: 0.433701, best loss: 0.363574 2025-01-16 01:59:06,490 - INFO - step 15893, loss: 0.449326, best loss: 0.363574 2025-01-16 01:59:06,640 - INFO - step 15894, loss: 0.409311, best loss: 0.363574 2025-01-16 01:59:06,790 - INFO - step 15895, loss: 0.418769, best loss: 0.363574 2025-01-16 01:59:06,940 - INFO - step 15896, loss: 0.472136, best loss: 0.363574 2025-01-16 01:59:07,090 - INFO - step 15897, loss: 0.487415, best loss: 0.363574 2025-01-16 01:59:07,240 - INFO - step 15898, loss: 0.478440, best loss: 0.363574 2025-01-16 01:59:07,390 - INFO - step 15899, loss: 0.494439, best loss: 0.363574 2025-01-16 01:59:07,540 - INFO - step 15900, loss: 0.492761, best loss: 0.363574 2025-01-16 01:59:07,690 - INFO - step 15901, loss: 0.495292, best loss: 0.363574 2025-01-16 01:59:07,841 - INFO - step 15902, loss: 0.563601, best loss: 0.363574 2025-01-16 01:59:07,991 - INFO - step 15903, loss: 0.516943, best loss: 0.363574 2025-01-16 01:59:08,141 - INFO - step 15904, loss: 0.503769, best loss: 0.363574 
2025-01-16 01:59:08,290 - INFO - step 15905, loss: 0.452664, best loss: 0.363574 2025-01-16 01:59:08,440 - INFO - step 15906, loss: 0.495985, best loss: 0.363574 2025-01-16 01:59:08,591 - INFO - step 15907, loss: 0.605499, best loss: 0.363574 2025-01-16 01:59:08,741 - INFO - step 15908, loss: 0.439195, best loss: 0.363574 2025-01-16 01:59:08,890 - INFO - step 15909, loss: 0.489672, best loss: 0.363574 2025-01-16 01:59:09,041 - INFO - step 15910, loss: 0.495124, best loss: 0.363574 2025-01-16 01:59:09,191 - INFO - step 15911, loss: 0.461633, best loss: 0.363574 2025-01-16 01:59:09,341 - INFO - step 15912, loss: 0.468181, best loss: 0.363574 2025-01-16 01:59:09,491 - INFO - step 15913, loss: 0.535325, best loss: 0.363574 2025-01-16 01:59:09,642 - INFO - step 15914, loss: 0.530009, best loss: 0.363574 2025-01-16 01:59:09,792 - INFO - step 15915, loss: 0.451275, best loss: 0.363574 2025-01-16 01:59:09,942 - INFO - step 15916, loss: 0.479432, best loss: 0.363574 2025-01-16 01:59:10,092 - INFO - step 15917, loss: 0.477122, best loss: 0.363574 2025-01-16 01:59:10,242 - INFO - step 15918, loss: 0.533254, best loss: 0.363574 2025-01-16 01:59:10,392 - INFO - step 15919, loss: 0.487060, best loss: 0.363574 2025-01-16 01:59:10,542 - INFO - step 15920, loss: 0.490013, best loss: 0.363574 2025-01-16 01:59:10,692 - INFO - step 15921, loss: 0.400935, best loss: 0.363574 2025-01-16 01:59:10,843 - INFO - step 15922, loss: 0.481080, best loss: 0.363574 2025-01-16 01:59:10,993 - INFO - step 15923, loss: 0.433808, best loss: 0.363574 2025-01-16 01:59:11,143 - INFO - step 15924, loss: 0.414275, best loss: 0.363574 2025-01-16 01:59:11,293 - INFO - step 15925, loss: 0.453643, best loss: 0.363574 2025-01-16 01:59:11,443 - INFO - step 15926, loss: 0.402369, best loss: 0.363574 2025-01-16 01:59:11,593 - INFO - step 15927, loss: 0.422304, best loss: 0.363574 2025-01-16 01:59:11,743 - INFO - step 15928, loss: 0.464324, best loss: 0.363574 2025-01-16 01:59:11,893 - INFO - step 15929, loss: 
0.474996, best loss: 0.363574 2025-01-16 01:59:12,043 - INFO - step 15930, loss: 0.443442, best loss: 0.363574 2025-01-16 01:59:12,193 - INFO - step 15931, loss: 0.538033, best loss: 0.363574 2025-01-16 01:59:12,344 - INFO - step 15932, loss: 0.456798, best loss: 0.363574 2025-01-16 01:59:12,494 - INFO - step 15933, loss: 0.501947, best loss: 0.363574 2025-01-16 01:59:12,644 - INFO - step 15934, loss: 0.502208, best loss: 0.363574 2025-01-16 01:59:12,794 - INFO - step 15935, loss: 0.506701, best loss: 0.363574 2025-01-16 01:59:12,944 - INFO - step 15936, loss: 0.479878, best loss: 0.363574 2025-01-16 01:59:13,094 - INFO - step 15937, loss: 0.481473, best loss: 0.363574 2025-01-16 01:59:13,244 - INFO - step 15938, loss: 0.433079, best loss: 0.363574 2025-01-16 01:59:13,394 - INFO - step 15939, loss: 0.480783, best loss: 0.363574 2025-01-16 01:59:13,544 - INFO - step 15940, loss: 0.458024, best loss: 0.363574 2025-01-16 01:59:13,694 - INFO - step 15941, loss: 0.451349, best loss: 0.363574 2025-01-16 01:59:13,844 - INFO - step 15942, loss: 0.445038, best loss: 0.363574 2025-01-16 01:59:13,994 - INFO - step 15943, loss: 0.492837, best loss: 0.363574 2025-01-16 01:59:14,144 - INFO - step 15944, loss: 0.475815, best loss: 0.363574 2025-01-16 01:59:14,294 - INFO - step 15945, loss: 0.420003, best loss: 0.363574 2025-01-16 01:59:14,444 - INFO - step 15946, loss: 0.544094, best loss: 0.363574 2025-01-16 01:59:14,594 - INFO - step 15947, loss: 0.430037, best loss: 0.363574 2025-01-16 01:59:14,745 - INFO - step 15948, loss: 0.428472, best loss: 0.363574 2025-01-16 01:59:14,895 - INFO - step 15949, loss: 0.507216, best loss: 0.363574 2025-01-16 01:59:15,045 - INFO - step 15950, loss: 0.414046, best loss: 0.363574 2025-01-16 01:59:15,195 - INFO - step 15951, loss: 0.468648, best loss: 0.363574 2025-01-16 01:59:15,345 - INFO - step 15952, loss: 0.400688, best loss: 0.363574 2025-01-16 01:59:15,495 - INFO - step 15953, loss: 0.380221, best loss: 0.363574 2025-01-16 01:59:15,645 - 
INFO - step 15954, loss: 0.501188, best loss: 0.363574 2025-01-16 01:59:15,795 - INFO - step 15955, loss: 0.424016, best loss: 0.363574 2025-01-16 01:59:15,945 - INFO - step 15956, loss: 0.454581, best loss: 0.363574 2025-01-16 01:59:16,095 - INFO - step 15957, loss: 0.388284, best loss: 0.363574 2025-01-16 01:59:16,245 - INFO - step 15958, loss: 0.423479, best loss: 0.363574 2025-01-16 01:59:16,395 - INFO - step 15959, loss: 0.438396, best loss: 0.363574 2025-01-16 01:59:16,545 - INFO - step 15960, loss: 0.465478, best loss: 0.363574 2025-01-16 01:59:16,696 - INFO - step 15961, loss: 0.426871, best loss: 0.363574 2025-01-16 01:59:16,846 - INFO - step 15962, loss: 0.495980, best loss: 0.363574 2025-01-16 01:59:16,996 - INFO - step 15963, loss: 0.404152, best loss: 0.363574 2025-01-16 01:59:17,146 - INFO - step 15964, loss: 0.448497, best loss: 0.363574 2025-01-16 01:59:17,296 - INFO - step 15965, loss: 0.392933, best loss: 0.363574 2025-01-16 01:59:17,446 - INFO - step 15966, loss: 0.422826, best loss: 0.363574 2025-01-16 01:59:17,596 - INFO - step 15967, loss: 0.490581, best loss: 0.363574 2025-01-16 01:59:17,746 - INFO - step 15968, loss: 0.411968, best loss: 0.363574 2025-01-16 01:59:17,896 - INFO - step 15969, loss: 0.419292, best loss: 0.363574 2025-01-16 01:59:18,046 - INFO - step 15970, loss: 0.435516, best loss: 0.363574 2025-01-16 01:59:18,196 - INFO - step 15971, loss: 0.449184, best loss: 0.363574 2025-01-16 01:59:18,346 - INFO - step 15972, loss: 0.480689, best loss: 0.363574 2025-01-16 01:59:18,496 - INFO - step 15973, loss: 0.363997, best loss: 0.363574 2025-01-16 01:59:18,647 - INFO - step 15974, loss: 0.494096, best loss: 0.363574 2025-01-16 01:59:18,797 - INFO - step 15975, loss: 0.381379, best loss: 0.363574 2025-01-16 01:59:22,333 - INFO - step 15976, loss: 0.350001, best loss: 0.350001 2025-01-16 01:59:22,492 - INFO - step 15977, loss: 0.473189, best loss: 0.350001 2025-01-16 01:59:22,643 - INFO - step 15978, loss: 0.438673, best loss: 0.350001 
2025-01-16 01:59:22,793 - INFO - step 15979, loss: 0.503032, best loss: 0.350001 2025-01-16 01:59:22,943 - INFO - step 15980, loss: 0.460976, best loss: 0.350001 2025-01-16 01:59:23,093 - INFO - step 15981, loss: 0.493812, best loss: 0.350001 2025-01-16 01:59:23,244 - INFO - step 15982, loss: 0.511311, best loss: 0.350001 2025-01-16 01:59:26,833 - INFO - step 15983, loss: 0.340728, best loss: 0.340728 2025-01-16 01:59:26,983 - INFO - step 15984, loss: 0.344605, best loss: 0.340728 2025-01-16 01:59:27,134 - INFO - step 15985, loss: 0.404201, best loss: 0.340728 2025-01-16 01:59:27,284 - INFO - step 15986, loss: 0.510043, best loss: 0.340728 2025-01-16 01:59:27,434 - INFO - step 15987, loss: 0.424973, best loss: 0.340728 2025-01-16 01:59:27,584 - INFO - step 15988, loss: 0.414497, best loss: 0.340728 2025-01-16 01:59:27,734 - INFO - step 15989, loss: 0.411450, best loss: 0.340728 2025-01-16 01:59:27,885 - INFO - step 15990, loss: 0.413516, best loss: 0.340728 2025-01-16 01:59:28,035 - INFO - step 15991, loss: 0.458628, best loss: 0.340728 2025-01-16 01:59:28,185 - INFO - step 15992, loss: 0.402741, best loss: 0.340728 2025-01-16 01:59:28,335 - INFO - step 15993, loss: 0.462587, best loss: 0.340728 2025-01-16 01:59:28,485 - INFO - step 15994, loss: 0.466344, best loss: 0.340728 2025-01-16 01:59:28,636 - INFO - step 15995, loss: 0.490780, best loss: 0.340728 2025-01-16 01:59:28,786 - INFO - step 15996, loss: 0.507578, best loss: 0.340728 2025-01-16 01:59:28,936 - INFO - step 15997, loss: 0.466923, best loss: 0.340728 2025-01-16 01:59:29,086 - INFO - step 15998, loss: 0.454346, best loss: 0.340728 2025-01-16 01:59:29,236 - INFO - step 15999, loss: 0.502345, best loss: 0.340728 2025-01-16 01:59:29,386 - INFO - step 16000, loss: 0.444814, best loss: 0.340728 2025-01-16 01:59:29,536 - INFO - step 16001, loss: 0.469181, best loss: 0.340728 2025-01-16 01:59:29,686 - INFO - step 16002, loss: 0.450296, best loss: 0.340728 2025-01-16 01:59:29,836 - INFO - step 16003, loss: 
0.383771, best loss: 0.340728 2025-01-16 01:59:29,986 - INFO - step 16004, loss: 0.452178, best loss: 0.340728 2025-01-16 01:59:30,136 - INFO - step 16005, loss: 0.473330, best loss: 0.340728 2025-01-16 01:59:30,287 - INFO - step 16006, loss: 0.490401, best loss: 0.340728 2025-01-16 01:59:30,436 - INFO - step 16007, loss: 0.471871, best loss: 0.340728 2025-01-16 01:59:30,586 - INFO - step 16008, loss: 0.529407, best loss: 0.340728 2025-01-16 01:59:30,736 - INFO - step 16009, loss: 0.488637, best loss: 0.340728 2025-01-16 01:59:30,886 - INFO - step 16010, loss: 0.449964, best loss: 0.340728 2025-01-16 01:59:31,037 - INFO - step 16011, loss: 0.480676, best loss: 0.340728 2025-01-16 01:59:31,187 - INFO - step 16012, loss: 0.474726, best loss: 0.340728 2025-01-16 01:59:31,337 - INFO - step 16013, loss: 0.464259, best loss: 0.340728 2025-01-16 01:59:31,487 - INFO - step 16014, loss: 0.496354, best loss: 0.340728 2025-01-16 01:59:31,637 - INFO - step 16015, loss: 0.475616, best loss: 0.340728 2025-01-16 01:59:31,787 - INFO - step 16016, loss: 0.524147, best loss: 0.340728 2025-01-16 01:59:31,938 - INFO - step 16017, loss: 0.495006, best loss: 0.340728 2025-01-16 01:59:32,088 - INFO - step 16018, loss: 0.452758, best loss: 0.340728 2025-01-16 01:59:32,238 - INFO - step 16019, loss: 0.476775, best loss: 0.340728 2025-01-16 01:59:32,388 - INFO - step 16020, loss: 0.504104, best loss: 0.340728 2025-01-16 01:59:32,539 - INFO - step 16021, loss: 0.484568, best loss: 0.340728 2025-01-16 01:59:32,689 - INFO - step 16022, loss: 0.478633, best loss: 0.340728 2025-01-16 01:59:32,839 - INFO - step 16023, loss: 0.551590, best loss: 0.340728 2025-01-16 01:59:32,989 - INFO - step 16024, loss: 0.430073, best loss: 0.340728 2025-01-16 01:59:33,140 - INFO - step 16025, loss: 0.439590, best loss: 0.340728 2025-01-16 01:59:33,290 - INFO - step 16026, loss: 0.510202, best loss: 0.340728 2025-01-16 01:59:33,440 - INFO - step 16027, loss: 0.511484, best loss: 0.340728 2025-01-16 01:59:33,590 - 
INFO - step 16028, loss: 0.619278, best loss: 0.340728 2025-01-16 01:59:33,740 - INFO - step 16029, loss: 0.503238, best loss: 0.340728 2025-01-16 01:59:33,890 - INFO - step 16030, loss: 0.459712, best loss: 0.340728 2025-01-16 01:59:34,041 - INFO - step 16031, loss: 0.499091, best loss: 0.340728 2025-01-16 01:59:34,191 - INFO - step 16032, loss: 0.521412, best loss: 0.340728 2025-01-16 01:59:34,341 - INFO - step 16033, loss: 0.493152, best loss: 0.340728 2025-01-16 01:59:34,491 - INFO - step 16034, loss: 0.567622, best loss: 0.340728 2025-01-16 01:59:34,642 - INFO - step 16035, loss: 0.542216, best loss: 0.340728 2025-01-16 01:59:34,792 - INFO - step 16036, loss: 0.515737, best loss: 0.340728 2025-01-16 01:59:34,942 - INFO - step 16037, loss: 0.512391, best loss: 0.340728 2025-01-16 01:59:35,092 - INFO - step 16038, loss: 0.554604, best loss: 0.340728 2025-01-16 01:59:35,243 - INFO - step 16039, loss: 0.381565, best loss: 0.340728 2025-01-16 01:59:35,393 - INFO - step 16040, loss: 0.539262, best loss: 0.340728 2025-01-16 01:59:35,543 - INFO - step 16041, loss: 0.468151, best loss: 0.340728 2025-01-16 01:59:35,693 - INFO - step 16042, loss: 0.445845, best loss: 0.340728 2025-01-16 01:59:35,843 - INFO - step 16043, loss: 0.499748, best loss: 0.340728 2025-01-16 01:59:35,993 - INFO - step 16044, loss: 0.465082, best loss: 0.340728 2025-01-16 01:59:36,143 - INFO - step 16045, loss: 0.502037, best loss: 0.340728 2025-01-16 01:59:36,294 - INFO - step 16046, loss: 0.460614, best loss: 0.340728 2025-01-16 01:59:36,444 - INFO - step 16047, loss: 0.499798, best loss: 0.340728 2025-01-16 01:59:36,594 - INFO - step 16048, loss: 0.450783, best loss: 0.340728 2025-01-16 01:59:36,744 - INFO - step 16049, loss: 0.466162, best loss: 0.340728 2025-01-16 01:59:36,894 - INFO - step 16050, loss: 0.449698, best loss: 0.340728 2025-01-16 01:59:37,045 - INFO - step 16051, loss: 0.450668, best loss: 0.340728 2025-01-16 01:59:37,195 - INFO - step 16052, loss: 0.375594, best loss: 0.340728 
2025-01-16 01:59:37,345 - INFO - step 16053, loss: 0.416321, best loss: 0.340728 2025-01-16 01:59:37,495 - INFO - step 16054, loss: 0.485923, best loss: 0.340728 2025-01-16 01:59:37,645 - INFO - step 16055, loss: 0.457607, best loss: 0.340728 2025-01-16 01:59:37,796 - INFO - step 16056, loss: 0.451960, best loss: 0.340728 2025-01-16 01:59:37,946 - INFO - step 16057, loss: 0.474936, best loss: 0.340728 2025-01-16 01:59:38,096 - INFO - step 16058, loss: 0.427924, best loss: 0.340728 2025-01-16 01:59:38,246 - INFO - step 16059, loss: 0.403955, best loss: 0.340728 2025-01-16 01:59:38,396 - INFO - step 16060, loss: 0.472354, best loss: 0.340728 2025-01-16 01:59:38,547 - INFO - step 16061, loss: 0.520561, best loss: 0.340728 2025-01-16 01:59:38,698 - INFO - step 16062, loss: 0.489385, best loss: 0.340728 2025-01-16 01:59:38,848 - INFO - step 16063, loss: 0.476564, best loss: 0.340728 2025-01-16 01:59:38,998 - INFO - step 16064, loss: 0.495325, best loss: 0.340728 2025-01-16 01:59:39,148 - INFO - step 16065, loss: 0.394217, best loss: 0.340728 2025-01-16 01:59:39,298 - INFO - step 16066, loss: 0.461349, best loss: 0.340728 2025-01-16 01:59:39,448 - INFO - step 16067, loss: 0.454005, best loss: 0.340728 2025-01-16 01:59:39,599 - INFO - step 16068, loss: 0.416903, best loss: 0.340728 2025-01-16 01:59:39,749 - INFO - step 16069, loss: 0.536424, best loss: 0.340728 2025-01-16 01:59:39,899 - INFO - step 16070, loss: 0.455495, best loss: 0.340728 2025-01-16 01:59:40,049 - INFO - step 16071, loss: 0.416312, best loss: 0.340728 2025-01-16 01:59:40,199 - INFO - step 16072, loss: 0.413142, best loss: 0.340728 2025-01-16 01:59:40,349 - INFO - step 16073, loss: 0.454540, best loss: 0.340728 2025-01-16 01:59:40,499 - INFO - step 16074, loss: 0.556252, best loss: 0.340728 2025-01-16 01:59:40,650 - INFO - step 16075, loss: 0.437047, best loss: 0.340728 2025-01-16 01:59:40,800 - INFO - step 16076, loss: 0.452638, best loss: 0.340728 2025-01-16 01:59:40,950 - INFO - step 16077, loss: 
0.441723, best loss: 0.340728 2025-01-16 01:59:41,100 - INFO - step 16078, loss: 0.393393, best loss: 0.340728 2025-01-16 01:59:41,250 - INFO - step 16079, loss: 0.514383, best loss: 0.340728 2025-01-16 01:59:41,400 - INFO - step 16080, loss: 0.535588, best loss: 0.340728 2025-01-16 01:59:41,550 - INFO - step 16081, loss: 0.594043, best loss: 0.340728 2025-01-16 01:59:41,700 - INFO - step 16082, loss: 0.460960, best loss: 0.340728 2025-01-16 01:59:41,850 - INFO - step 16083, loss: 0.424200, best loss: 0.340728 2025-01-16 01:59:42,000 - INFO - step 16084, loss: 0.597945, best loss: 0.340728 2025-01-16 01:59:42,150 - INFO - step 16085, loss: 0.454394, best loss: 0.340728 2025-01-16 01:59:42,300 - INFO - step 16086, loss: 0.466744, best loss: 0.340728 2025-01-16 01:59:42,450 - INFO - step 16087, loss: 0.448249, best loss: 0.340728 2025-01-16 01:59:42,600 - INFO - step 16088, loss: 0.457526, best loss: 0.340728 2025-01-16 01:59:42,751 - INFO - step 16089, loss: 0.392461, best loss: 0.340728 2025-01-16 01:59:42,900 - INFO - step 16090, loss: 0.470958, best loss: 0.340728 2025-01-16 01:59:43,050 - INFO - step 16091, loss: 0.456391, best loss: 0.340728 2025-01-16 01:59:43,201 - INFO - step 16092, loss: 0.456452, best loss: 0.340728 2025-01-16 01:59:43,351 - INFO - step 16093, loss: 0.425556, best loss: 0.340728 2025-01-16 01:59:43,501 - INFO - step 16094, loss: 0.419031, best loss: 0.340728 2025-01-16 01:59:43,651 - INFO - step 16095, loss: 0.358886, best loss: 0.340728 2025-01-16 01:59:43,801 - INFO - step 16096, loss: 0.436862, best loss: 0.340728 2025-01-16 01:59:43,951 - INFO - step 16097, loss: 0.496534, best loss: 0.340728 2025-01-16 01:59:44,101 - INFO - step 16098, loss: 0.419633, best loss: 0.340728 2025-01-16 01:59:44,251 - INFO - step 16099, loss: 0.459823, best loss: 0.340728 2025-01-16 01:59:44,401 - INFO - step 16100, loss: 0.451829, best loss: 0.340728 2025-01-16 01:59:44,551 - INFO - step 16101, loss: 0.443991, best loss: 0.340728 2025-01-16 01:59:44,701 - 
INFO - step 16102, loss: 0.487756, best loss: 0.340728 2025-01-16 01:59:44,851 - INFO - step 16103, loss: 0.364564, best loss: 0.340728 2025-01-16 01:59:45,001 - INFO - step 16104, loss: 0.392944, best loss: 0.340728 2025-01-16 01:59:45,151 - INFO - step 16105, loss: 0.372986, best loss: 0.340728 2025-01-16 01:59:45,301 - INFO - step 16106, loss: 0.409514, best loss: 0.340728 2025-01-16 01:59:45,451 - INFO - step 16107, loss: 0.409558, best loss: 0.340728 2025-01-16 01:59:45,601 - INFO - step 16108, loss: 0.419869, best loss: 0.340728 2025-01-16 01:59:45,751 - INFO - step 16109, loss: 0.397948, best loss: 0.340728 2025-01-16 01:59:45,901 - INFO - step 16110, loss: 0.434862, best loss: 0.340728 2025-01-16 01:59:46,051 - INFO - step 16111, loss: 0.449858, best loss: 0.340728 2025-01-16 01:59:46,202 - INFO - step 16112, loss: 0.379609, best loss: 0.340728 2025-01-16 01:59:46,352 - INFO - step 16113, loss: 0.422144, best loss: 0.340728 2025-01-16 01:59:46,502 - INFO - step 16114, loss: 0.426923, best loss: 0.340728 2025-01-16 01:59:46,652 - INFO - step 16115, loss: 0.442423, best loss: 0.340728 2025-01-16 01:59:46,802 - INFO - step 16116, loss: 0.342471, best loss: 0.340728 2025-01-16 01:59:46,952 - INFO - step 16117, loss: 0.431224, best loss: 0.340728 2025-01-16 01:59:47,102 - INFO - step 16118, loss: 0.421137, best loss: 0.340728 2025-01-16 01:59:47,252 - INFO - step 16119, loss: 0.383144, best loss: 0.340728 2025-01-16 01:59:47,402 - INFO - step 16120, loss: 0.371745, best loss: 0.340728 2025-01-16 01:59:47,552 - INFO - step 16121, loss: 0.403111, best loss: 0.340728 2025-01-16 01:59:47,702 - INFO - step 16122, loss: 0.470412, best loss: 0.340728 2025-01-16 01:59:47,852 - INFO - step 16123, loss: 0.406169, best loss: 0.340728 2025-01-16 01:59:48,002 - INFO - step 16124, loss: 0.474037, best loss: 0.340728 2025-01-16 01:59:48,153 - INFO - step 16125, loss: 0.442205, best loss: 0.340728 2025-01-16 01:59:48,302 - INFO - step 16126, loss: 0.410307, best loss: 0.340728 
2025-01-16 01:59:48,453 - INFO - step 16127, loss: 0.413431, best loss: 0.340728 2025-01-16 01:59:48,603 - INFO - step 16128, loss: 0.431177, best loss: 0.340728 2025-01-16 01:59:48,753 - INFO - step 16129, loss: 0.419727, best loss: 0.340728 2025-01-16 01:59:48,903 - INFO - step 16130, loss: 0.395750, best loss: 0.340728 2025-01-16 01:59:49,053 - INFO - step 16131, loss: 0.397301, best loss: 0.340728 2025-01-16 01:59:49,204 - INFO - step 16132, loss: 0.441104, best loss: 0.340728 2025-01-16 01:59:49,354 - INFO - step 16133, loss: 0.479077, best loss: 0.340728 2025-01-16 01:59:49,504 - INFO - step 16134, loss: 0.492411, best loss: 0.340728 2025-01-16 01:59:49,654 - INFO - step 16135, loss: 0.464928, best loss: 0.340728 2025-01-16 01:59:49,804 - INFO - step 16136, loss: 0.411995, best loss: 0.340728 2025-01-16 01:59:49,954 - INFO - step 16137, loss: 0.360922, best loss: 0.340728 2025-01-16 01:59:50,104 - INFO - step 16138, loss: 0.422762, best loss: 0.340728 2025-01-16 01:59:50,255 - INFO - step 16139, loss: 0.429200, best loss: 0.340728 2025-01-16 01:59:50,405 - INFO - step 16140, loss: 0.373443, best loss: 0.340728 2025-01-16 01:59:50,555 - INFO - step 16141, loss: 0.443728, best loss: 0.340728 2025-01-16 01:59:50,705 - INFO - step 16142, loss: 0.415103, best loss: 0.340728 2025-01-16 01:59:50,855 - INFO - step 16143, loss: 0.405249, best loss: 0.340728 2025-01-16 01:59:51,005 - INFO - step 16144, loss: 0.388097, best loss: 0.340728 2025-01-16 01:59:51,155 - INFO - step 16145, loss: 0.428067, best loss: 0.340728 2025-01-16 01:59:51,305 - INFO - step 16146, loss: 0.402838, best loss: 0.340728 2025-01-16 01:59:51,455 - INFO - step 16147, loss: 0.469901, best loss: 0.340728 2025-01-16 01:59:51,605 - INFO - step 16148, loss: 0.456406, best loss: 0.340728 2025-01-16 01:59:51,756 - INFO - step 16149, loss: 0.499252, best loss: 0.340728 2025-01-16 01:59:51,906 - INFO - step 16150, loss: 0.405228, best loss: 0.340728 2025-01-16 01:59:52,056 - INFO - step 16151, loss: 
0.473813, best loss: 0.340728 2025-01-16 01:59:52,206 - INFO - step 16152, loss: 0.542048, best loss: 0.340728 2025-01-16 01:59:52,356 - INFO - step 16153, loss: 0.454280, best loss: 0.340728 2025-01-16 01:59:52,506 - INFO - step 16154, loss: 0.440122, best loss: 0.340728 2025-01-16 01:59:52,656 - INFO - step 16155, loss: 0.424611, best loss: 0.340728 2025-01-16 01:59:52,806 - INFO - step 16156, loss: 0.341690, best loss: 0.340728 2025-01-16 01:59:52,957 - INFO - step 16157, loss: 0.442570, best loss: 0.340728 2025-01-16 01:59:53,107 - INFO - step 16158, loss: 0.476691, best loss: 0.340728 2025-01-16 01:59:53,257 - INFO - step 16159, loss: 0.445315, best loss: 0.340728 2025-01-16 01:59:53,407 - INFO - step 16160, loss: 0.552268, best loss: 0.340728 2025-01-16 01:59:53,557 - INFO - step 16161, loss: 0.477710, best loss: 0.340728 2025-01-16 01:59:53,707 - INFO - step 16162, loss: 0.481592, best loss: 0.340728 2025-01-16 01:59:53,857 - INFO - step 16163, loss: 0.506631, best loss: 0.340728 2025-01-16 01:59:54,007 - INFO - step 16164, loss: 0.399111, best loss: 0.340728 2025-01-16 01:59:54,157 - INFO - step 16165, loss: 0.436784, best loss: 0.340728 2025-01-16 01:59:54,308 - INFO - step 16166, loss: 0.443446, best loss: 0.340728 2025-01-16 01:59:54,458 - INFO - step 16167, loss: 0.438177, best loss: 0.340728 2025-01-16 01:59:54,608 - INFO - step 16168, loss: 0.427217, best loss: 0.340728 2025-01-16 01:59:54,758 - INFO - step 16169, loss: 0.398795, best loss: 0.340728 2025-01-16 01:59:54,908 - INFO - step 16170, loss: 0.443786, best loss: 0.340728 2025-01-16 01:59:55,058 - INFO - step 16171, loss: 0.477222, best loss: 0.340728 2025-01-16 01:59:55,208 - INFO - step 16172, loss: 0.494967, best loss: 0.340728 2025-01-16 01:59:55,358 - INFO - step 16173, loss: 0.557503, best loss: 0.340728 2025-01-16 01:59:55,508 - INFO - step 16174, loss: 0.560928, best loss: 0.340728 2025-01-16 01:59:55,658 - INFO - step 16175, loss: 0.513935, best loss: 0.340728 2025-01-16 01:59:55,808 - 
INFO - step 16176, loss: 0.455819, best loss: 0.340728 2025-01-16 01:59:55,958 - INFO - step 16177, loss: 0.478912, best loss: 0.340728 2025-01-16 01:59:56,108 - INFO - step 16178, loss: 0.485197, best loss: 0.340728 2025-01-16 01:59:56,258 - INFO - step 16179, loss: 0.441276, best loss: 0.340728 2025-01-16 01:59:56,408 - INFO - step 16180, loss: 0.425193, best loss: 0.340728 2025-01-16 01:59:56,558 - INFO - step 16181, loss: 0.482752, best loss: 0.340728 2025-01-16 01:59:56,708 - INFO - step 16182, loss: 0.403917, best loss: 0.340728 2025-01-16 01:59:56,858 - INFO - step 16183, loss: 0.496105, best loss: 0.340728 2025-01-16 01:59:57,009 - INFO - step 16184, loss: 0.520445, best loss: 0.340728 2025-01-16 01:59:57,159 - INFO - step 16185, loss: 0.437562, best loss: 0.340728 2025-01-16 01:59:57,309 - INFO - step 16186, loss: 0.471975, best loss: 0.340728 2025-01-16 01:59:57,459 - INFO - step 16187, loss: 0.435506, best loss: 0.340728 2025-01-16 01:59:57,609 - INFO - step 16188, loss: 0.473952, best loss: 0.340728 2025-01-16 01:59:57,759 - INFO - step 16189, loss: 0.399177, best loss: 0.340728 2025-01-16 01:59:57,909 - INFO - step 16190, loss: 0.392391, best loss: 0.340728 2025-01-16 01:59:58,059 - INFO - step 16191, loss: 0.389304, best loss: 0.340728 2025-01-16 01:59:58,209 - INFO - step 16192, loss: 0.365120, best loss: 0.340728 2025-01-16 01:59:58,359 - INFO - step 16193, loss: 0.421131, best loss: 0.340728 2025-01-16 01:59:58,509 - INFO - step 16194, loss: 0.405318, best loss: 0.340728 2025-01-16 01:59:58,659 - INFO - step 16195, loss: 0.446613, best loss: 0.340728 2025-01-16 01:59:58,809 - INFO - step 16196, loss: 0.442873, best loss: 0.340728 2025-01-16 01:59:58,959 - INFO - step 16197, loss: 0.341638, best loss: 0.340728 2025-01-16 01:59:59,109 - INFO - step 16198, loss: 0.456216, best loss: 0.340728 2025-01-16 01:59:59,259 - INFO - step 16199, loss: 0.496323, best loss: 0.340728 2025-01-16 01:59:59,410 - INFO - step 16200, loss: 0.411726, best loss: 0.340728 
2025-01-16 01:59:59,560 - INFO - step 16201, loss: 0.406315, best loss: 0.340728 2025-01-16 01:59:59,710 - INFO - step 16202, loss: 0.439144, best loss: 0.340728 2025-01-16 01:59:59,860 - INFO - step 16203, loss: 0.407769, best loss: 0.340728 2025-01-16 02:00:00,010 - INFO - step 16204, loss: 0.400356, best loss: 0.340728 2025-01-16 02:00:00,160 - INFO - step 16205, loss: 0.426700, best loss: 0.340728 2025-01-16 02:00:00,311 - INFO - step 16206, loss: 0.452344, best loss: 0.340728 2025-01-16 02:00:00,461 - INFO - step 16207, loss: 0.518902, best loss: 0.340728 2025-01-16 02:00:00,611 - INFO - step 16208, loss: 0.476363, best loss: 0.340728 2025-01-16 02:00:00,761 - INFO - step 16209, loss: 0.365864, best loss: 0.340728 2025-01-16 02:00:00,911 - INFO - step 16210, loss: 0.426776, best loss: 0.340728 2025-01-16 02:00:01,061 - INFO - step 16211, loss: 0.486575, best loss: 0.340728 2025-01-16 02:00:01,211 - INFO - step 16212, loss: 0.527141, best loss: 0.340728 2025-01-16 02:00:01,361 - INFO - step 16213, loss: 0.391505, best loss: 0.340728 2025-01-16 02:00:01,512 - INFO - step 16214, loss: 0.381502, best loss: 0.340728 2025-01-16 02:00:01,662 - INFO - step 16215, loss: 0.403045, best loss: 0.340728 2025-01-16 02:00:01,812 - INFO - step 16216, loss: 0.407255, best loss: 0.340728 2025-01-16 02:00:01,962 - INFO - step 16217, loss: 0.454799, best loss: 0.340728 2025-01-16 02:00:02,112 - INFO - step 16218, loss: 0.422398, best loss: 0.340728 2025-01-16 02:00:02,262 - INFO - step 16219, loss: 0.353845, best loss: 0.340728 2025-01-16 02:00:02,412 - INFO - step 16220, loss: 0.395577, best loss: 0.340728 2025-01-16 02:00:02,562 - INFO - step 16221, loss: 0.422530, best loss: 0.340728 2025-01-16 02:00:02,712 - INFO - step 16222, loss: 0.430542, best loss: 0.340728 2025-01-16 02:00:02,862 - INFO - step 16223, loss: 0.416443, best loss: 0.340728 2025-01-16 02:00:03,012 - INFO - step 16224, loss: 0.435443, best loss: 0.340728 2025-01-16 02:00:03,162 - INFO - step 16225, loss: 
0.464017, best loss: 0.340728 2025-01-16 02:00:03,312 - INFO - step 16226, loss: 0.436477, best loss: 0.340728 2025-01-16 02:00:03,462 - INFO - step 16227, loss: 0.395637, best loss: 0.340728 2025-01-16 02:00:03,612 - INFO - step 16228, loss: 0.375504, best loss: 0.340728 2025-01-16 02:00:03,762 - INFO - step 16229, loss: 0.403706, best loss: 0.340728 2025-01-16 02:00:03,912 - INFO - step 16230, loss: 0.462430, best loss: 0.340728 2025-01-16 02:00:04,062 - INFO - step 16231, loss: 0.480839, best loss: 0.340728 2025-01-16 02:00:04,213 - INFO - step 16232, loss: 0.505514, best loss: 0.340728 2025-01-16 02:00:04,363 - INFO - step 16233, loss: 0.447433, best loss: 0.340728 2025-01-16 02:00:04,513 - INFO - step 16234, loss: 0.492770, best loss: 0.340728 2025-01-16 02:00:04,663 - INFO - step 16235, loss: 0.401568, best loss: 0.340728 2025-01-16 02:00:04,813 - INFO - step 16236, loss: 0.452585, best loss: 0.340728 2025-01-16 02:00:04,963 - INFO - step 16237, loss: 0.450477, best loss: 0.340728 2025-01-16 02:00:05,113 - INFO - step 16238, loss: 0.447306, best loss: 0.340728 2025-01-16 02:00:05,263 - INFO - step 16239, loss: 0.450884, best loss: 0.340728 2025-01-16 02:00:05,413 - INFO - step 16240, loss: 0.443179, best loss: 0.340728 2025-01-16 02:00:05,563 - INFO - step 16241, loss: 0.448134, best loss: 0.340728 2025-01-16 02:00:05,713 - INFO - step 16242, loss: 0.455157, best loss: 0.340728 2025-01-16 02:00:05,863 - INFO - step 16243, loss: 0.425632, best loss: 0.340728 2025-01-16 02:00:06,013 - INFO - step 16244, loss: 0.481450, best loss: 0.340728 2025-01-16 02:00:06,163 - INFO - step 16245, loss: 0.442193, best loss: 0.340728 2025-01-16 02:00:06,313 - INFO - step 16246, loss: 0.422691, best loss: 0.340728 2025-01-16 02:00:06,464 - INFO - step 16247, loss: 0.419456, best loss: 0.340728 2025-01-16 02:00:06,614 - INFO - step 16248, loss: 0.443352, best loss: 0.340728 2025-01-16 02:00:06,764 - INFO - step 16249, loss: 0.426873, best loss: 0.340728 2025-01-16 02:00:06,914 - 
INFO - step 16250, loss: 0.390030, best loss: 0.340728 2025-01-16 02:00:07,064 - INFO - step 16251, loss: 0.365077, best loss: 0.340728 2025-01-16 02:00:07,214 - INFO - step 16252, loss: 0.369103, best loss: 0.340728 2025-01-16 02:00:07,364 - INFO - step 16253, loss: 0.371509, best loss: 0.340728 2025-01-16 02:00:07,514 - INFO - step 16254, loss: 0.391533, best loss: 0.340728 2025-01-16 02:00:07,665 - INFO - step 16255, loss: 0.421615, best loss: 0.340728 2025-01-16 02:00:11,179 - INFO - step 16256, loss: 0.339763, best loss: 0.339763 2025-01-16 02:00:11,341 - INFO - step 16257, loss: 0.423634, best loss: 0.339763 2025-01-16 02:00:11,498 - INFO - step 16258, loss: 0.388477, best loss: 0.339763 2025-01-16 02:00:11,649 - INFO - step 16259, loss: 0.420915, best loss: 0.339763 2025-01-16 02:00:11,799 - INFO - step 16260, loss: 0.376014, best loss: 0.339763 2025-01-16 02:00:11,949 - INFO - step 16261, loss: 0.445038, best loss: 0.339763 2025-01-16 02:00:12,099 - INFO - step 16262, loss: 0.390050, best loss: 0.339763 2025-01-16 02:00:12,249 - INFO - step 16263, loss: 0.443042, best loss: 0.339763 2025-01-16 02:00:12,399 - INFO - step 16264, loss: 0.440940, best loss: 0.339763 2025-01-16 02:00:12,550 - INFO - step 16265, loss: 0.447029, best loss: 0.339763 2025-01-16 02:00:12,700 - INFO - step 16266, loss: 0.409308, best loss: 0.339763 2025-01-16 02:00:12,850 - INFO - step 16267, loss: 0.391665, best loss: 0.339763 2025-01-16 02:00:21,789 - INFO - step 16268, loss: 0.334924, best loss: 0.334924 2025-01-16 02:00:21,939 - INFO - step 16269, loss: 0.399656, best loss: 0.334924 2025-01-16 02:00:22,090 - INFO - step 16270, loss: 0.468070, best loss: 0.334924 2025-01-16 02:00:22,240 - INFO - step 16271, loss: 0.408759, best loss: 0.334924 2025-01-16 02:00:22,391 - INFO - step 16272, loss: 0.406487, best loss: 0.334924 2025-01-16 02:00:22,541 - INFO - step 16273, loss: 0.448365, best loss: 0.334924 2025-01-16 02:00:22,691 - INFO - step 16274, loss: 0.417624, best loss: 0.334924 
2025-01-16 02:00:22,841 - INFO - step 16275, loss: 0.417905, best loss: 0.334924 2025-01-16 02:00:22,991 - INFO - step 16276, loss: 0.476321, best loss: 0.334924 2025-01-16 02:00:23,141 - INFO - step 16277, loss: 0.382013, best loss: 0.334924 2025-01-16 02:00:23,291 - INFO - step 16278, loss: 0.377650, best loss: 0.334924 2025-01-16 02:00:23,441 - INFO - step 16279, loss: 0.501113, best loss: 0.334924 2025-01-16 02:00:23,592 - INFO - step 16280, loss: 0.372020, best loss: 0.334924 2025-01-16 02:00:23,742 - INFO - step 16281, loss: 0.395260, best loss: 0.334924 2025-01-16 02:00:23,892 - INFO - step 16282, loss: 0.339009, best loss: 0.334924 2025-01-16 02:00:24,042 - INFO - step 16283, loss: 0.389779, best loss: 0.334924 2025-01-16 02:00:24,192 - INFO - step 16284, loss: 0.472265, best loss: 0.334924 2025-01-16 02:00:24,342 - INFO - step 16285, loss: 0.411977, best loss: 0.334924 2025-01-16 02:00:24,492 - INFO - step 16286, loss: 0.421297, best loss: 0.334924 2025-01-16 02:00:24,642 - INFO - step 16287, loss: 0.402105, best loss: 0.334924 2025-01-16 02:00:24,792 - INFO - step 16288, loss: 0.356729, best loss: 0.334924 2025-01-16 02:00:24,943 - INFO - step 16289, loss: 0.390152, best loss: 0.334924 2025-01-16 02:00:25,093 - INFO - step 16290, loss: 0.414229, best loss: 0.334924 2025-01-16 02:00:25,243 - INFO - step 16291, loss: 0.368001, best loss: 0.334924 2025-01-16 02:00:25,393 - INFO - step 16292, loss: 0.430086, best loss: 0.334924 2025-01-16 02:00:25,543 - INFO - step 16293, loss: 0.368206, best loss: 0.334924 2025-01-16 02:00:25,693 - INFO - step 16294, loss: 0.399540, best loss: 0.334924 2025-01-16 02:00:28,820 - INFO - step 16295, loss: 0.312855, best loss: 0.312855 2025-01-16 02:00:28,970 - INFO - step 16296, loss: 0.358696, best loss: 0.312855 2025-01-16 02:00:29,120 - INFO - step 16297, loss: 0.440961, best loss: 0.312855 2025-01-16 02:00:29,270 - INFO - step 16298, loss: 0.346875, best loss: 0.312855 2025-01-16 02:00:29,421 - INFO - step 16299, loss: 
0.401711, best loss: 0.312855 2025-01-16 02:00:29,571 - INFO - step 16300, loss: 0.358265, best loss: 0.312855 2025-01-16 02:00:29,721 - INFO - step 16301, loss: 0.359583, best loss: 0.312855 2025-01-16 02:00:29,871 - INFO - step 16302, loss: 0.371050, best loss: 0.312855 2025-01-16 02:00:30,021 - INFO - step 16303, loss: 0.374879, best loss: 0.312855 2025-01-16 02:00:30,172 - INFO - step 16304, loss: 0.502104, best loss: 0.312855 2025-01-16 02:00:30,322 - INFO - step 16305, loss: 0.376691, best loss: 0.312855 2025-01-16 02:00:30,472 - INFO - step 16306, loss: 0.333973, best loss: 0.312855 2025-01-16 02:00:30,622 - INFO - step 16307, loss: 0.398945, best loss: 0.312855 2025-01-16 02:00:30,772 - INFO - step 16308, loss: 0.395164, best loss: 0.312855 2025-01-16 02:00:30,922 - INFO - step 16309, loss: 0.409629, best loss: 0.312855 2025-01-16 02:00:31,073 - INFO - step 16310, loss: 0.409414, best loss: 0.312855 2025-01-16 02:00:31,223 - INFO - step 16311, loss: 0.435937, best loss: 0.312855 2025-01-16 02:00:31,373 - INFO - step 16312, loss: 0.404386, best loss: 0.312855 2025-01-16 02:00:34,902 - INFO - step 16313, loss: 0.307451, best loss: 0.307451 2025-01-16 02:00:35,052 - INFO - step 16314, loss: 0.313720, best loss: 0.307451 2025-01-16 02:00:35,203 - INFO - step 16315, loss: 0.406661, best loss: 0.307451 2025-01-16 02:00:35,353 - INFO - step 16316, loss: 0.400692, best loss: 0.307451 2025-01-16 02:00:35,503 - INFO - step 16317, loss: 0.377557, best loss: 0.307451 2025-01-16 02:00:35,653 - INFO - step 16318, loss: 0.353114, best loss: 0.307451 2025-01-16 02:00:35,803 - INFO - step 16319, loss: 0.384961, best loss: 0.307451 2025-01-16 02:00:35,954 - INFO - step 16320, loss: 0.402982, best loss: 0.307451 2025-01-16 02:00:36,104 - INFO - step 16321, loss: 0.424346, best loss: 0.307451 2025-01-16 02:00:36,254 - INFO - step 16322, loss: 0.326367, best loss: 0.307451 2025-01-16 02:00:36,404 - INFO - step 16323, loss: 0.390940, best loss: 0.307451 2025-01-16 02:00:36,554 - 
INFO - step 16324, loss: 0.475192, best loss: 0.307451 2025-01-16 02:00:36,705 - INFO - step 16325, loss: 0.414535, best loss: 0.307451 2025-01-16 02:00:36,855 - INFO - step 16326, loss: 0.422854, best loss: 0.307451 2025-01-16 02:00:37,005 - INFO - step 16327, loss: 0.397138, best loss: 0.307451 2025-01-16 02:00:37,155 - INFO - step 16328, loss: 0.393486, best loss: 0.307451 2025-01-16 02:00:37,305 - INFO - step 16329, loss: 0.419313, best loss: 0.307451 2025-01-16 02:00:37,455 - INFO - step 16330, loss: 0.356102, best loss: 0.307451 2025-01-16 02:00:37,605 - INFO - step 16331, loss: 0.377577, best loss: 0.307451 2025-01-16 02:00:37,755 - INFO - step 16332, loss: 0.406066, best loss: 0.307451 2025-01-16 02:00:37,905 - INFO - step 16333, loss: 0.345836, best loss: 0.307451 2025-01-16 02:00:38,056 - INFO - step 16334, loss: 0.393893, best loss: 0.307451 2025-01-16 02:00:38,206 - INFO - step 16335, loss: 0.453917, best loss: 0.307451 2025-01-16 02:00:38,356 - INFO - step 16336, loss: 0.438694, best loss: 0.307451 2025-01-16 02:00:38,506 - INFO - step 16337, loss: 0.426932, best loss: 0.307451 2025-01-16 02:00:38,656 - INFO - step 16338, loss: 0.402164, best loss: 0.307451 2025-01-16 02:00:38,806 - INFO - step 16339, loss: 0.423461, best loss: 0.307451 2025-01-16 02:00:38,956 - INFO - step 16340, loss: 0.373817, best loss: 0.307451 2025-01-16 02:00:39,106 - INFO - step 16341, loss: 0.370916, best loss: 0.307451 2025-01-16 02:00:39,256 - INFO - step 16342, loss: 0.357635, best loss: 0.307451 2025-01-16 02:00:39,407 - INFO - step 16343, loss: 0.477073, best loss: 0.307451 2025-01-16 02:00:39,558 - INFO - step 16344, loss: 0.454374, best loss: 0.307451 2025-01-16 02:00:39,708 - INFO - step 16345, loss: 0.420609, best loss: 0.307451 2025-01-16 02:00:39,858 - INFO - step 16346, loss: 0.461221, best loss: 0.307451 2025-01-16 02:00:40,008 - INFO - step 16347, loss: 0.504938, best loss: 0.307451 2025-01-16 02:00:40,158 - INFO - step 16348, loss: 0.419575, best loss: 0.307451 
2025-01-16 02:00:40,308 - INFO - step 16349, loss: 0.439192, best loss: 0.307451 2025-01-16 02:00:40,458 - INFO - step 16350, loss: 0.447654, best loss: 0.307451 2025-01-16 02:00:40,608 - INFO - step 16351, loss: 0.363262, best loss: 0.307451 2025-01-16 02:00:40,758 - INFO - step 16352, loss: 0.438609, best loss: 0.307451 2025-01-16 02:00:40,909 - INFO - step 16353, loss: 0.455375, best loss: 0.307451 2025-01-16 02:00:41,059 - INFO - step 16354, loss: 0.385701, best loss: 0.307451 2025-01-16 02:00:41,209 - INFO - step 16355, loss: 0.401409, best loss: 0.307451 2025-01-16 02:00:41,359 - INFO - step 16356, loss: 0.440996, best loss: 0.307451 2025-01-16 02:00:41,509 - INFO - step 16357, loss: 0.489815, best loss: 0.307451 2025-01-16 02:00:41,659 - INFO - step 16358, loss: 0.429236, best loss: 0.307451 2025-01-16 02:00:41,810 - INFO - step 16359, loss: 0.452182, best loss: 0.307451 2025-01-16 02:00:41,960 - INFO - step 16360, loss: 0.437216, best loss: 0.307451 2025-01-16 02:00:42,110 - INFO - step 16361, loss: 0.430640, best loss: 0.307451 2025-01-16 02:00:42,260 - INFO - step 16362, loss: 0.442603, best loss: 0.307451 2025-01-16 02:00:42,411 - INFO - step 16363, loss: 0.457100, best loss: 0.307451 2025-01-16 02:00:42,561 - INFO - step 16364, loss: 0.441209, best loss: 0.307451 2025-01-16 02:00:42,711 - INFO - step 16365, loss: 0.438323, best loss: 0.307451 2025-01-16 02:00:42,861 - INFO - step 16366, loss: 0.514035, best loss: 0.307451 2025-01-16 02:00:43,011 - INFO - step 16367, loss: 0.481490, best loss: 0.307451 2025-01-16 02:00:43,161 - INFO - step 16368, loss: 0.484923, best loss: 0.307451 2025-01-16 02:00:43,312 - INFO - step 16369, loss: 0.362880, best loss: 0.307451 2025-01-16 02:00:43,462 - INFO - step 16370, loss: 0.436827, best loss: 0.307451 2025-01-16 02:00:43,612 - INFO - step 16371, loss: 0.434295, best loss: 0.307451 2025-01-16 02:00:43,762 - INFO - step 16372, loss: 0.414742, best loss: 0.307451 2025-01-16 02:00:43,912 - INFO - step 16373, loss: 
0.440526, best loss: 0.307451 2025-01-16 02:00:44,062 - INFO - step 16374, loss: 0.434320, best loss: 0.307451 2025-01-16 02:00:44,212 - INFO - step 16375, loss: 0.398707, best loss: 0.307451 2025-01-16 02:00:44,362 - INFO - step 16376, loss: 0.400610, best loss: 0.307451 2025-01-16 02:00:44,512 - INFO - step 16377, loss: 0.413898, best loss: 0.307451 2025-01-16 02:00:44,663 - INFO - step 16378, loss: 0.428038, best loss: 0.307451 2025-01-16 02:00:44,813 - INFO - step 16379, loss: 0.427002, best loss: 0.307451 2025-01-16 02:00:44,963 - INFO - step 16380, loss: 0.418312, best loss: 0.307451 2025-01-16 02:00:45,113 - INFO - step 16381, loss: 0.382278, best loss: 0.307451 2025-01-16 02:00:45,263 - INFO - step 16382, loss: 0.389722, best loss: 0.307451 2025-01-16 02:00:45,414 - INFO - step 16383, loss: 0.384018, best loss: 0.307451 2025-01-16 02:00:45,564 - INFO - step 16384, loss: 0.454323, best loss: 0.307451 2025-01-16 02:00:45,714 - INFO - step 16385, loss: 0.443364, best loss: 0.307451 2025-01-16 02:00:45,864 - INFO - step 16386, loss: 0.390868, best loss: 0.307451 2025-01-16 02:00:46,014 - INFO - step 16387, loss: 0.380017, best loss: 0.307451 2025-01-16 02:00:46,164 - INFO - step 16388, loss: 0.413441, best loss: 0.307451 2025-01-16 02:00:46,314 - INFO - step 16389, loss: 0.360884, best loss: 0.307451 2025-01-16 02:00:46,464 - INFO - step 16390, loss: 0.493437, best loss: 0.307451 2025-01-16 02:00:46,614 - INFO - step 16391, loss: 0.443293, best loss: 0.307451 2025-01-16 02:00:46,765 - INFO - step 16392, loss: 0.452728, best loss: 0.307451 2025-01-16 02:00:46,915 - INFO - step 16393, loss: 0.460321, best loss: 0.307451 2025-01-16 02:00:47,065 - INFO - step 16394, loss: 0.503588, best loss: 0.307451 2025-01-16 02:00:47,215 - INFO - step 16395, loss: 0.383309, best loss: 0.307451 2025-01-16 02:00:47,365 - INFO - step 16396, loss: 0.421897, best loss: 0.307451 2025-01-16 02:00:47,515 - INFO - step 16397, loss: 0.400425, best loss: 0.307451 2025-01-16 02:00:47,665 - 
INFO - step 16398, loss: 0.331314, best loss: 0.307451 2025-01-16 02:00:47,815 - INFO - step 16399, loss: 0.485798, best loss: 0.307451 2025-01-16 02:00:47,966 - INFO - step 16400, loss: 0.419817, best loss: 0.307451 2025-01-16 02:00:48,116 - INFO - step 16401, loss: 0.356953, best loss: 0.307451 2025-01-16 02:00:48,266 - INFO - step 16402, loss: 0.386284, best loss: 0.307451 2025-01-16 02:00:48,416 - INFO - step 16403, loss: 0.511933, best loss: 0.307451 2025-01-16 02:00:48,566 - INFO - step 16404, loss: 0.432116, best loss: 0.307451 2025-01-16 02:00:48,716 - INFO - step 16405, loss: 0.425784, best loss: 0.307451 2025-01-16 02:00:48,866 - INFO - step 16406, loss: 0.417629, best loss: 0.307451 2025-01-16 02:00:49,017 - INFO - step 16407, loss: 0.431262, best loss: 0.307451 2025-01-16 02:00:49,167 - INFO - step 16408, loss: 0.330235, best loss: 0.307451 2025-01-16 02:00:49,317 - INFO - step 16409, loss: 0.424537, best loss: 0.307451 2025-01-16 02:00:49,467 - INFO - step 16410, loss: 0.569615, best loss: 0.307451 2025-01-16 02:00:49,617 - INFO - step 16411, loss: 0.492182, best loss: 0.307451 2025-01-16 02:00:49,767 - INFO - step 16412, loss: 0.435283, best loss: 0.307451 2025-01-16 02:00:49,917 - INFO - step 16413, loss: 0.410947, best loss: 0.307451 2025-01-16 02:00:50,067 - INFO - step 16414, loss: 0.486664, best loss: 0.307451 2025-01-16 02:00:50,217 - INFO - step 16415, loss: 0.383416, best loss: 0.307451 2025-01-16 02:00:50,367 - INFO - step 16416, loss: 0.415205, best loss: 0.307451 2025-01-16 02:00:50,517 - INFO - step 16417, loss: 0.450156, best loss: 0.307451 2025-01-16 02:00:50,667 - INFO - step 16418, loss: 0.367002, best loss: 0.307451 2025-01-16 02:00:50,817 - INFO - step 16419, loss: 0.382322, best loss: 0.307451 2025-01-16 02:00:50,968 - INFO - step 16420, loss: 0.425763, best loss: 0.307451 2025-01-16 02:00:51,118 - INFO - step 16421, loss: 0.497299, best loss: 0.307451 2025-01-16 02:00:51,267 - INFO - step 16422, loss: 0.385693, best loss: 0.307451 
2025-01-16 02:00:51,418 - INFO - step 16423, loss: 0.338046, best loss: 0.307451 2025-01-16 02:00:51,567 - INFO - step 16424, loss: 0.336671, best loss: 0.307451 2025-01-16 02:00:51,718 - INFO - step 16425, loss: 0.319949, best loss: 0.307451 2025-01-16 02:00:51,867 - INFO - step 16426, loss: 0.351245, best loss: 0.307451 2025-01-16 02:00:52,017 - INFO - step 16427, loss: 0.417730, best loss: 0.307451 2025-01-16 02:00:52,167 - INFO - step 16428, loss: 0.376684, best loss: 0.307451 2025-01-16 02:00:52,318 - INFO - step 16429, loss: 0.397060, best loss: 0.307451 2025-01-16 02:00:52,469 - INFO - step 16430, loss: 0.416020, best loss: 0.307451 2025-01-16 02:00:52,619 - INFO - step 16431, loss: 0.363284, best loss: 0.307451 2025-01-16 02:00:52,770 - INFO - step 16432, loss: 0.386680, best loss: 0.307451 2025-01-16 02:00:52,920 - INFO - step 16433, loss: 0.341725, best loss: 0.307451 2025-01-16 02:00:53,070 - INFO - step 16434, loss: 0.359004, best loss: 0.307451 2025-01-16 02:00:53,220 - INFO - step 16435, loss: 0.386236, best loss: 0.307451 2025-01-16 02:00:53,370 - INFO - step 16436, loss: 0.371300, best loss: 0.307451 2025-01-16 02:00:53,520 - INFO - step 16437, loss: 0.401070, best loss: 0.307451 2025-01-16 02:00:53,670 - INFO - step 16438, loss: 0.350417, best loss: 0.307451 2025-01-16 02:00:53,821 - INFO - step 16439, loss: 0.367923, best loss: 0.307451 2025-01-16 02:00:53,971 - INFO - step 16440, loss: 0.389197, best loss: 0.307451 2025-01-16 02:00:54,121 - INFO - step 16441, loss: 0.376042, best loss: 0.307451 2025-01-16 02:00:54,271 - INFO - step 16442, loss: 0.353636, best loss: 0.307451 2025-01-16 02:00:54,421 - INFO - step 16443, loss: 0.344176, best loss: 0.307451 2025-01-16 02:00:54,571 - INFO - step 16444, loss: 0.413910, best loss: 0.307451 2025-01-16 02:00:54,721 - INFO - step 16445, loss: 0.421847, best loss: 0.307451 2025-01-16 02:00:54,871 - INFO - step 16446, loss: 0.361141, best loss: 0.307451 2025-01-16 02:00:55,022 - INFO - step 16447, loss: 
0.389576, best loss: 0.307451 2025-01-16 02:00:55,172 - INFO - step 16448, loss: 0.410066, best loss: 0.307451 2025-01-16 02:00:55,322 - INFO - step 16449, loss: 0.358571, best loss: 0.307451 2025-01-16 02:00:55,472 - INFO - step 16450, loss: 0.355954, best loss: 0.307451 2025-01-16 02:00:59,021 - INFO - step 16451, loss: 0.297128, best loss: 0.297128 2025-01-16 02:00:59,183 - INFO - step 16452, loss: 0.368511, best loss: 0.297128 2025-01-16 02:00:59,339 - INFO - step 16453, loss: 0.381000, best loss: 0.297128 2025-01-16 02:00:59,490 - INFO - step 16454, loss: 0.483018, best loss: 0.297128 2025-01-16 02:00:59,640 - INFO - step 16455, loss: 0.389062, best loss: 0.297128 2025-01-16 02:00:59,790 - INFO - step 16456, loss: 0.412559, best loss: 0.297128 2025-01-16 02:00:59,940 - INFO - step 16457, loss: 0.402531, best loss: 0.297128 2025-01-16 02:01:00,090 - INFO - step 16458, loss: 0.361228, best loss: 0.297128 2025-01-16 02:01:00,241 - INFO - step 16459, loss: 0.409381, best loss: 0.297128 2025-01-16 02:01:00,391 - INFO - step 16460, loss: 0.359601, best loss: 0.297128 2025-01-16 02:01:00,541 - INFO - step 16461, loss: 0.341897, best loss: 0.297128 2025-01-16 02:01:00,691 - INFO - step 16462, loss: 0.353762, best loss: 0.297128 2025-01-16 02:01:00,841 - INFO - step 16463, loss: 0.374316, best loss: 0.297128 2025-01-16 02:01:00,992 - INFO - step 16464, loss: 0.368882, best loss: 0.297128 2025-01-16 02:01:01,142 - INFO - step 16465, loss: 0.396200, best loss: 0.297128 2025-01-16 02:01:01,292 - INFO - step 16466, loss: 0.391680, best loss: 0.297128 2025-01-16 02:01:01,443 - INFO - step 16467, loss: 0.350723, best loss: 0.297128 2025-01-16 02:01:01,593 - INFO - step 16468, loss: 0.435554, best loss: 0.297128 2025-01-16 02:01:01,743 - INFO - step 16469, loss: 0.399735, best loss: 0.297128 2025-01-16 02:01:01,893 - INFO - step 16470, loss: 0.425127, best loss: 0.297128 2025-01-16 02:01:02,043 - INFO - step 16471, loss: 0.368293, best loss: 0.297128 2025-01-16 02:01:02,193 - 
INFO - step 16472, loss: 0.340680, best loss: 0.297128 2025-01-16 02:01:02,343 - INFO - step 16473, loss: 0.394721, best loss: 0.297128 2025-01-16 02:01:02,493 - INFO - step 16474, loss: 0.371923, best loss: 0.297128 2025-01-16 02:01:02,643 - INFO - step 16475, loss: 0.389185, best loss: 0.297128 2025-01-16 02:01:02,793 - INFO - step 16476, loss: 0.324268, best loss: 0.297128 2025-01-16 02:01:02,944 - INFO - step 16477, loss: 0.399413, best loss: 0.297128 2025-01-16 02:01:03,094 - INFO - step 16478, loss: 0.384176, best loss: 0.297128 2025-01-16 02:01:03,244 - INFO - step 16479, loss: 0.392928, best loss: 0.297128 2025-01-16 02:01:03,394 - INFO - step 16480, loss: 0.375323, best loss: 0.297128 2025-01-16 02:01:03,544 - INFO - step 16481, loss: 0.378231, best loss: 0.297128 2025-01-16 02:01:03,694 - INFO - step 16482, loss: 0.469656, best loss: 0.297128 2025-01-16 02:01:03,844 - INFO - step 16483, loss: 0.412473, best loss: 0.297128 2025-01-16 02:01:03,994 - INFO - step 16484, loss: 0.430911, best loss: 0.297128 2025-01-16 02:01:04,144 - INFO - step 16485, loss: 0.406080, best loss: 0.297128 2025-01-16 02:01:04,294 - INFO - step 16486, loss: 0.325451, best loss: 0.297128 2025-01-16 02:01:04,444 - INFO - step 16487, loss: 0.310122, best loss: 0.297128 2025-01-16 02:01:04,593 - INFO - step 16488, loss: 0.391890, best loss: 0.297128 2025-01-16 02:01:04,743 - INFO - step 16489, loss: 0.401327, best loss: 0.297128 2025-01-16 02:01:04,893 - INFO - step 16490, loss: 0.396039, best loss: 0.297128 2025-01-16 02:01:05,043 - INFO - step 16491, loss: 0.428056, best loss: 0.297128 2025-01-16 02:01:05,193 - INFO - step 16492, loss: 0.427779, best loss: 0.297128 2025-01-16 02:01:05,342 - INFO - step 16493, loss: 0.441091, best loss: 0.297128 2025-01-16 02:01:05,492 - INFO - step 16494, loss: 0.375585, best loss: 0.297128 2025-01-16 02:01:05,642 - INFO - step 16495, loss: 0.447925, best loss: 0.297128 2025-01-16 02:01:05,792 - INFO - step 16496, loss: 0.437292, best loss: 0.297128 
2025-01-16 02:01:05,942 - INFO - step 16497, loss: 0.398034, best loss: 0.297128 2025-01-16 02:01:06,092 - INFO - step 16498, loss: 0.518895, best loss: 0.297128 2025-01-16 02:01:06,242 - INFO - step 16499, loss: 0.357375, best loss: 0.297128 2025-01-16 02:01:06,392 - INFO - step 16500, loss: 0.364146, best loss: 0.297128 2025-01-16 02:01:06,542 - INFO - step 16501, loss: 0.420179, best loss: 0.297128 2025-01-16 02:01:06,692 - INFO - step 16502, loss: 0.391076, best loss: 0.297128 2025-01-16 02:01:06,842 - INFO - step 16503, loss: 0.529586, best loss: 0.297128 2025-01-16 02:01:06,992 - INFO - step 16504, loss: 0.422763, best loss: 0.297128 2025-01-16 02:01:07,143 - INFO - step 16505, loss: 0.439089, best loss: 0.297128 2025-01-16 02:01:07,294 - INFO - step 16506, loss: 0.379394, best loss: 0.297128 2025-01-16 02:01:07,444 - INFO - step 16507, loss: 0.434244, best loss: 0.297128 2025-01-16 02:01:07,594 - INFO - step 16508, loss: 0.415985, best loss: 0.297128 2025-01-16 02:01:07,744 - INFO - step 16509, loss: 0.499429, best loss: 0.297128 2025-01-16 02:01:07,894 - INFO - step 16510, loss: 0.411982, best loss: 0.297128 2025-01-16 02:01:08,044 - INFO - step 16511, loss: 0.441569, best loss: 0.297128 2025-01-16 02:01:08,194 - INFO - step 16512, loss: 0.356269, best loss: 0.297128 2025-01-16 02:01:08,345 - INFO - step 16513, loss: 0.481291, best loss: 0.297128 2025-01-16 02:01:08,495 - INFO - step 16514, loss: 0.441748, best loss: 0.297128 2025-01-16 02:01:08,645 - INFO - step 16515, loss: 0.393442, best loss: 0.297128 2025-01-16 02:01:08,795 - INFO - step 16516, loss: 0.428538, best loss: 0.297128 2025-01-16 02:01:08,946 - INFO - step 16517, loss: 0.410518, best loss: 0.297128 2025-01-16 02:01:09,096 - INFO - step 16518, loss: 0.394615, best loss: 0.297128 2025-01-16 02:01:09,246 - INFO - step 16519, loss: 0.363982, best loss: 0.297128 2025-01-16 02:01:09,396 - INFO - step 16520, loss: 0.394976, best loss: 0.297128 2025-01-16 02:01:09,546 - INFO - step 16521, loss: 
0.416113, best loss: 0.297128 2025-01-16 02:01:09,696 - INFO - step 16522, loss: 0.363740, best loss: 0.297128 2025-01-16 02:01:09,846 - INFO - step 16523, loss: 0.389004, best loss: 0.297128 2025-01-16 02:01:09,997 - INFO - step 16524, loss: 0.450209, best loss: 0.297128 2025-01-16 02:01:10,147 - INFO - step 16525, loss: 0.481397, best loss: 0.297128 2025-01-16 02:01:10,297 - INFO - step 16526, loss: 0.456647, best loss: 0.297128 2025-01-16 02:01:10,447 - INFO - step 16527, loss: 0.389742, best loss: 0.297128 2025-01-16 02:01:10,597 - INFO - step 16528, loss: 0.386448, best loss: 0.297128 2025-01-16 02:01:10,747 - INFO - step 16529, loss: 0.443032, best loss: 0.297128 2025-01-16 02:01:10,897 - INFO - step 16530, loss: 0.347276, best loss: 0.297128 2025-01-16 02:01:11,048 - INFO - step 16531, loss: 0.387583, best loss: 0.297128 2025-01-16 02:01:11,198 - INFO - step 16532, loss: 0.380613, best loss: 0.297128 2025-01-16 02:01:11,348 - INFO - step 16533, loss: 0.376496, best loss: 0.297128 2025-01-16 02:01:11,498 - INFO - step 16534, loss: 0.409485, best loss: 0.297128 2025-01-16 02:01:11,648 - INFO - step 16535, loss: 0.410529, best loss: 0.297128 2025-01-16 02:01:11,799 - INFO - step 16536, loss: 0.426928, best loss: 0.297128 2025-01-16 02:01:11,949 - INFO - step 16537, loss: 0.437816, best loss: 0.297128 2025-01-16 02:01:12,099 - INFO - step 16538, loss: 0.385444, best loss: 0.297128 2025-01-16 02:01:12,250 - INFO - step 16539, loss: 0.355678, best loss: 0.297128 2025-01-16 02:01:12,400 - INFO - step 16540, loss: 0.434369, best loss: 0.297128 2025-01-16 02:01:12,550 - INFO - step 16541, loss: 0.403041, best loss: 0.297128 2025-01-16 02:01:12,701 - INFO - step 16542, loss: 0.459283, best loss: 0.297128 2025-01-16 02:01:12,851 - INFO - step 16543, loss: 0.382434, best loss: 0.297128 2025-01-16 02:01:13,001 - INFO - step 16544, loss: 0.418098, best loss: 0.297128 2025-01-16 02:01:13,151 - INFO - step 16545, loss: 0.357734, best loss: 0.297128 2025-01-16 02:01:13,301 - 
INFO - step 16546, loss: 0.379633, best loss: 0.297128 2025-01-16 02:01:13,451 - INFO - step 16547, loss: 0.399989, best loss: 0.297128 2025-01-16 02:01:13,601 - INFO - step 16548, loss: 0.351395, best loss: 0.297128 2025-01-16 02:01:13,752 - INFO - step 16549, loss: 0.344352, best loss: 0.297128 2025-01-16 02:01:13,902 - INFO - step 16550, loss: 0.363695, best loss: 0.297128 2025-01-16 02:01:14,052 - INFO - step 16551, loss: 0.340938, best loss: 0.297128 2025-01-16 02:01:14,202 - INFO - step 16552, loss: 0.346868, best loss: 0.297128 2025-01-16 02:01:14,352 - INFO - step 16553, loss: 0.388425, best loss: 0.297128 2025-01-16 02:01:14,502 - INFO - step 16554, loss: 0.322157, best loss: 0.297128 2025-01-16 02:01:14,653 - INFO - step 16555, loss: 0.320860, best loss: 0.297128 2025-01-16 02:01:14,803 - INFO - step 16556, loss: 0.403892, best loss: 0.297128 2025-01-16 02:01:14,953 - INFO - step 16557, loss: 0.359379, best loss: 0.297128 2025-01-16 02:01:15,103 - INFO - step 16558, loss: 0.339916, best loss: 0.297128 2025-01-16 02:01:15,254 - INFO - step 16559, loss: 0.344924, best loss: 0.297128 2025-01-16 02:01:15,404 - INFO - step 16560, loss: 0.406215, best loss: 0.297128 2025-01-16 02:01:15,554 - INFO - step 16561, loss: 0.400624, best loss: 0.297128 2025-01-16 02:01:15,704 - INFO - step 16562, loss: 0.418672, best loss: 0.297128 2025-01-16 02:01:15,854 - INFO - step 16563, loss: 0.325808, best loss: 0.297128 2025-01-16 02:01:16,004 - INFO - step 16564, loss: 0.389860, best loss: 0.297128 2025-01-16 02:01:16,154 - INFO - step 16565, loss: 0.363022, best loss: 0.297128 2025-01-16 02:01:16,304 - INFO - step 16566, loss: 0.403377, best loss: 0.297128 2025-01-16 02:01:16,455 - INFO - step 16567, loss: 0.432890, best loss: 0.297128 2025-01-16 02:01:16,605 - INFO - step 16568, loss: 0.375482, best loss: 0.297128 2025-01-16 02:01:16,755 - INFO - step 16569, loss: 0.382113, best loss: 0.297128 2025-01-16 02:01:16,905 - INFO - step 16570, loss: 0.419936, best loss: 0.297128 
2025-01-16 02:01:17,055 - INFO - step 16571, loss: 0.403328, best loss: 0.297128 2025-01-16 02:01:17,206 - INFO - step 16572, loss: 0.342725, best loss: 0.297128 2025-01-16 02:01:17,355 - INFO - step 16573, loss: 0.358373, best loss: 0.297128 2025-01-16 02:01:17,506 - INFO - step 16574, loss: 0.453831, best loss: 0.297128 2025-01-16 02:01:17,657 - INFO - step 16575, loss: 0.355545, best loss: 0.297128 2025-01-16 02:01:17,807 - INFO - step 16576, loss: 0.355975, best loss: 0.297128 2025-01-16 02:01:17,957 - INFO - step 16577, loss: 0.340108, best loss: 0.297128 2025-01-16 02:01:18,107 - INFO - step 16578, loss: 0.427492, best loss: 0.297128 2025-01-16 02:01:18,257 - INFO - step 16579, loss: 0.367289, best loss: 0.297128 2025-01-16 02:01:18,407 - INFO - step 16580, loss: 0.394550, best loss: 0.297128 2025-01-16 02:01:18,557 - INFO - step 16581, loss: 0.350635, best loss: 0.297128 2025-01-16 02:01:18,707 - INFO - step 16582, loss: 0.353134, best loss: 0.297128 2025-01-16 02:01:18,857 - INFO - step 16583, loss: 0.352539, best loss: 0.297128 2025-01-16 02:01:19,008 - INFO - step 16584, loss: 0.349990, best loss: 0.297128 2025-01-16 02:01:19,158 - INFO - step 16585, loss: 0.382632, best loss: 0.297128 2025-01-16 02:01:19,308 - INFO - step 16586, loss: 0.298093, best loss: 0.297128 2025-01-16 02:01:19,458 - INFO - step 16587, loss: 0.342236, best loss: 0.297128 2025-01-16 02:01:19,609 - INFO - step 16588, loss: 0.388696, best loss: 0.297128 2025-01-16 02:01:19,759 - INFO - step 16589, loss: 0.334513, best loss: 0.297128 2025-01-16 02:01:19,909 - INFO - step 16590, loss: 0.341513, best loss: 0.297128 2025-01-16 02:01:20,059 - INFO - step 16591, loss: 0.306263, best loss: 0.297128 2025-01-16 02:01:20,210 - INFO - step 16592, loss: 0.303659, best loss: 0.297128 2025-01-16 02:01:20,360 - INFO - step 16593, loss: 0.377814, best loss: 0.297128 2025-01-16 02:01:20,510 - INFO - step 16594, loss: 0.409219, best loss: 0.297128 2025-01-16 02:01:20,661 - INFO - step 16595, loss: 
0.458033, best loss: 0.297128 2025-01-16 02:01:20,811 - INFO - step 16596, loss: 0.372245, best loss: 0.297128 2025-01-16 02:01:20,961 - INFO - step 16597, loss: 0.336669, best loss: 0.297128 2025-01-16 02:01:21,111 - INFO - step 16598, loss: 0.310185, best loss: 0.297128 2025-01-16 02:01:21,261 - INFO - step 16599, loss: 0.359584, best loss: 0.297128 2025-01-16 02:01:21,412 - INFO - step 16600, loss: 0.438139, best loss: 0.297128 2025-01-16 02:01:21,562 - INFO - step 16601, loss: 0.396096, best loss: 0.297128 2025-01-16 02:01:21,712 - INFO - step 16602, loss: 0.340859, best loss: 0.297128 2025-01-16 02:01:21,863 - INFO - step 16603, loss: 0.377426, best loss: 0.297128 2025-01-16 02:01:22,013 - INFO - step 16604, loss: 0.398290, best loss: 0.297128 2025-01-16 02:01:22,163 - INFO - step 16605, loss: 0.339037, best loss: 0.297128 2025-01-16 02:01:22,313 - INFO - step 16606, loss: 0.416570, best loss: 0.297128 2025-01-16 02:01:22,463 - INFO - step 16607, loss: 0.365077, best loss: 0.297128 2025-01-16 02:01:22,614 - INFO - step 16608, loss: 0.338623, best loss: 0.297128 2025-01-16 02:01:22,764 - INFO - step 16609, loss: 0.444052, best loss: 0.297128 2025-01-16 02:01:22,914 - INFO - step 16610, loss: 0.362575, best loss: 0.297128 2025-01-16 02:01:23,064 - INFO - step 16611, loss: 0.350190, best loss: 0.297128 2025-01-16 02:01:23,214 - INFO - step 16612, loss: 0.312719, best loss: 0.297128 2025-01-16 02:01:23,365 - INFO - step 16613, loss: 0.327305, best loss: 0.297128 2025-01-16 02:01:23,515 - INFO - step 16614, loss: 0.402529, best loss: 0.297128 2025-01-16 02:01:23,665 - INFO - step 16615, loss: 0.388471, best loss: 0.297128 2025-01-16 02:01:23,815 - INFO - step 16616, loss: 0.333183, best loss: 0.297128 2025-01-16 02:01:23,965 - INFO - step 16617, loss: 0.384954, best loss: 0.297128 2025-01-16 02:01:24,115 - INFO - step 16618, loss: 0.336088, best loss: 0.297128 2025-01-16 02:01:24,266 - INFO - step 16619, loss: 0.361019, best loss: 0.297128 2025-01-16 02:01:24,416 - 
INFO - step 16620, loss: 0.411786, best loss: 0.297128 2025-01-16 02:01:24,566 - INFO - step 16621, loss: 0.310460, best loss: 0.297128 2025-01-16 02:01:24,716 - INFO - step 16622, loss: 0.351756, best loss: 0.297128 2025-01-16 02:01:24,866 - INFO - step 16623, loss: 0.358588, best loss: 0.297128 2025-01-16 02:01:25,016 - INFO - step 16624, loss: 0.387951, best loss: 0.297128 2025-01-16 02:01:25,166 - INFO - step 16625, loss: 0.313934, best loss: 0.297128 2025-01-16 02:01:25,316 - INFO - step 16626, loss: 0.338280, best loss: 0.297128 2025-01-16 02:01:25,466 - INFO - step 16627, loss: 0.383341, best loss: 0.297128 2025-01-16 02:01:25,617 - INFO - step 16628, loss: 0.336674, best loss: 0.297128 2025-01-16 02:01:25,767 - INFO - step 16629, loss: 0.410646, best loss: 0.297128 2025-01-16 02:01:29,345 - INFO - step 16630, loss: 0.286429, best loss: 0.286429 2025-01-16 02:01:29,496 - INFO - step 16631, loss: 0.420853, best loss: 0.286429 2025-01-16 02:01:29,646 - INFO - step 16632, loss: 0.395693, best loss: 0.286429 2025-01-16 02:01:29,796 - INFO - step 16633, loss: 0.330261, best loss: 0.286429 2025-01-16 02:01:29,946 - INFO - step 16634, loss: 0.445856, best loss: 0.286429 2025-01-16 02:01:33,531 - INFO - step 16635, loss: 0.282605, best loss: 0.282605 2025-01-16 02:01:37,131 - INFO - step 16636, loss: 0.280468, best loss: 0.280468 2025-01-16 02:01:37,281 - INFO - step 16637, loss: 0.374861, best loss: 0.280468 2025-01-16 02:01:37,431 - INFO - step 16638, loss: 0.336945, best loss: 0.280468 2025-01-16 02:01:37,581 - INFO - step 16639, loss: 0.400576, best loss: 0.280468 2025-01-16 02:01:37,732 - INFO - step 16640, loss: 0.390167, best loss: 0.280468 2025-01-16 02:01:37,882 - INFO - step 16641, loss: 0.379876, best loss: 0.280468 2025-01-16 02:01:38,032 - INFO - step 16642, loss: 0.413752, best loss: 0.280468 2025-01-16 02:01:38,182 - INFO - step 16643, loss: 0.378162, best loss: 0.280468 2025-01-16 02:01:38,332 - INFO - step 16644, loss: 0.305326, best loss: 0.280468 
2025-01-16 02:01:38,483 - INFO - step 16645, loss: 0.347330, best loss: 0.280468 2025-01-16 02:01:38,633 - INFO - step 16646, loss: 0.393087, best loss: 0.280468 2025-01-16 02:01:38,783 - INFO - step 16647, loss: 0.302585, best loss: 0.280468 2025-01-16 02:01:38,933 - INFO - step 16648, loss: 0.333028, best loss: 0.280468 2025-01-16 02:01:39,084 - INFO - step 16649, loss: 0.335193, best loss: 0.280468 2025-01-16 02:01:39,234 - INFO - step 16650, loss: 0.366623, best loss: 0.280468 2025-01-16 02:01:39,384 - INFO - step 16651, loss: 0.360445, best loss: 0.280468 2025-01-16 02:01:39,535 - INFO - step 16652, loss: 0.393151, best loss: 0.280468 2025-01-16 02:01:39,685 - INFO - step 16653, loss: 0.347880, best loss: 0.280468 2025-01-16 02:01:39,835 - INFO - step 16654, loss: 0.413678, best loss: 0.280468 2025-01-16 02:01:39,986 - INFO - step 16655, loss: 0.404835, best loss: 0.280468 2025-01-16 02:01:40,136 - INFO - step 16656, loss: 0.396407, best loss: 0.280468 2025-01-16 02:01:40,286 - INFO - step 16657, loss: 0.363796, best loss: 0.280468 2025-01-16 02:01:40,436 - INFO - step 16658, loss: 0.409157, best loss: 0.280468 2025-01-16 02:01:40,586 - INFO - step 16659, loss: 0.411550, best loss: 0.280468 2025-01-16 02:01:40,736 - INFO - step 16660, loss: 0.365498, best loss: 0.280468 2025-01-16 02:01:40,886 - INFO - step 16661, loss: 0.305317, best loss: 0.280468 2025-01-16 02:01:41,036 - INFO - step 16662, loss: 0.421307, best loss: 0.280468 2025-01-16 02:01:41,186 - INFO - step 16663, loss: 0.341760, best loss: 0.280468 2025-01-16 02:01:41,336 - INFO - step 16664, loss: 0.365337, best loss: 0.280468 2025-01-16 02:01:41,486 - INFO - step 16665, loss: 0.378308, best loss: 0.280468 2025-01-16 02:01:41,636 - INFO - step 16666, loss: 0.410474, best loss: 0.280468 2025-01-16 02:01:41,786 - INFO - step 16667, loss: 0.433601, best loss: 0.280468 2025-01-16 02:01:41,936 - INFO - step 16668, loss: 0.387222, best loss: 0.280468 2025-01-16 02:01:42,086 - INFO - step 16669, loss: 
0.374753, best loss: 0.280468 2025-01-16 02:01:42,237 - INFO - step 16670, loss: 0.348332, best loss: 0.280468 2025-01-16 02:01:42,387 - INFO - step 16671, loss: 0.398925, best loss: 0.280468 2025-01-16 02:01:42,537 - INFO - step 16672, loss: 0.376667, best loss: 0.280468 2025-01-16 02:01:42,687 - INFO - step 16673, loss: 0.357443, best loss: 0.280468 2025-01-16 02:01:42,837 - INFO - step 16674, loss: 0.385666, best loss: 0.280468 2025-01-16 02:01:42,987 - INFO - step 16675, loss: 0.369861, best loss: 0.280468 2025-01-16 02:01:43,137 - INFO - step 16676, loss: 0.435736, best loss: 0.280468 2025-01-16 02:01:43,287 - INFO - step 16677, loss: 0.364889, best loss: 0.280468 2025-01-16 02:01:43,437 - INFO - step 16678, loss: 0.357163, best loss: 0.280468 2025-01-16 02:01:43,587 - INFO - step 16679, loss: 0.372157, best loss: 0.280468 2025-01-16 02:01:43,737 - INFO - step 16680, loss: 0.429086, best loss: 0.280468 2025-01-16 02:01:43,888 - INFO - step 16681, loss: 0.400047, best loss: 0.280468 2025-01-16 02:01:44,038 - INFO - step 16682, loss: 0.406420, best loss: 0.280468 2025-01-16 02:01:44,188 - INFO - step 16683, loss: 0.399692, best loss: 0.280468 2025-01-16 02:01:44,339 - INFO - step 16684, loss: 0.354168, best loss: 0.280468 2025-01-16 02:01:44,489 - INFO - step 16685, loss: 0.380286, best loss: 0.280468 2025-01-16 02:01:44,639 - INFO - step 16686, loss: 0.382960, best loss: 0.280468 2025-01-16 02:01:44,789 - INFO - step 16687, loss: 0.432762, best loss: 0.280468 2025-01-16 02:01:44,939 - INFO - step 16688, loss: 0.317326, best loss: 0.280468 2025-01-16 02:01:45,090 - INFO - step 16689, loss: 0.450895, best loss: 0.280468 2025-01-16 02:01:45,240 - INFO - step 16690, loss: 0.358830, best loss: 0.280468 2025-01-16 02:01:45,390 - INFO - step 16691, loss: 0.444341, best loss: 0.280468 2025-01-16 02:01:45,540 - INFO - step 16692, loss: 0.406846, best loss: 0.280468 2025-01-16 02:01:45,691 - INFO - step 16693, loss: 0.358075, best loss: 0.280468 2025-01-16 02:01:45,841 - 
INFO - step 16694, loss: 0.402734, best loss: 0.280468 2025-01-16 02:01:45,991 - INFO - step 16695, loss: 0.381985, best loss: 0.280468 2025-01-16 02:01:46,142 - INFO - step 16696, loss: 0.415500, best loss: 0.280468 2025-01-16 02:01:46,292 - INFO - step 16697, loss: 0.429182, best loss: 0.280468 2025-01-16 02:01:46,442 - INFO - step 16698, loss: 0.429368, best loss: 0.280468 2025-01-16 02:01:46,592 - INFO - step 16699, loss: 0.338472, best loss: 0.280468 2025-01-16 02:01:46,743 - INFO - step 16700, loss: 0.402591, best loss: 0.280468 2025-01-16 02:01:46,893 - INFO - step 16701, loss: 0.373926, best loss: 0.280468 2025-01-16 02:01:47,043 - INFO - step 16702, loss: 0.370199, best loss: 0.280468 2025-01-16 02:01:47,194 - INFO - step 16703, loss: 0.397911, best loss: 0.280468 2025-01-16 02:01:47,344 - INFO - step 16704, loss: 0.335412, best loss: 0.280468 2025-01-16 02:01:47,494 - INFO - step 16705, loss: 0.351970, best loss: 0.280468 2025-01-16 02:01:47,644 - INFO - step 16706, loss: 0.339321, best loss: 0.280468 2025-01-16 02:01:47,794 - INFO - step 16707, loss: 0.354432, best loss: 0.280468 2025-01-16 02:01:47,945 - INFO - step 16708, loss: 0.348154, best loss: 0.280468 2025-01-16 02:01:48,095 - INFO - step 16709, loss: 0.429519, best loss: 0.280468 2025-01-16 02:01:48,245 - INFO - step 16710, loss: 0.358156, best loss: 0.280468 2025-01-16 02:01:48,395 - INFO - step 16711, loss: 0.354434, best loss: 0.280468 2025-01-16 02:01:48,545 - INFO - step 16712, loss: 0.373720, best loss: 0.280468 2025-01-16 02:01:48,696 - INFO - step 16713, loss: 0.370564, best loss: 0.280468 2025-01-16 02:01:48,846 - INFO - step 16714, loss: 0.401194, best loss: 0.280468 2025-01-16 02:01:48,996 - INFO - step 16715, loss: 0.407276, best loss: 0.280468 2025-01-16 02:01:49,146 - INFO - step 16716, loss: 0.425287, best loss: 0.280468 2025-01-16 02:01:49,296 - INFO - step 16717, loss: 0.343183, best loss: 0.280468 2025-01-16 02:01:49,447 - INFO - step 16718, loss: 0.374475, best loss: 0.280468 
2025-01-16 02:01:49,598 - INFO - step 16719, loss: 0.381005, best loss: 0.280468 2025-01-16 02:01:49,748 - INFO - step 16720, loss: 0.398770, best loss: 0.280468 2025-01-16 02:01:49,898 - INFO - step 16721, loss: 0.377551, best loss: 0.280468 2025-01-16 02:01:50,048 - INFO - step 16722, loss: 0.451753, best loss: 0.280468 2025-01-16 02:01:50,198 - INFO - step 16723, loss: 0.429683, best loss: 0.280468 2025-01-16 02:01:50,348 - INFO - step 16724, loss: 0.383434, best loss: 0.280468 2025-01-16 02:01:50,498 - INFO - step 16725, loss: 0.341929, best loss: 0.280468 2025-01-16 02:01:50,648 - INFO - step 16726, loss: 0.370418, best loss: 0.280468 2025-01-16 02:01:50,798 - INFO - step 16727, loss: 0.364881, best loss: 0.280468 2025-01-16 02:01:50,948 - INFO - step 16728, loss: 0.347423, best loss: 0.280468 2025-01-16 02:01:51,098 - INFO - step 16729, loss: 0.382484, best loss: 0.280468 2025-01-16 02:01:51,248 - INFO - step 16730, loss: 0.392357, best loss: 0.280468 2025-01-16 02:01:51,399 - INFO - step 16731, loss: 0.310679, best loss: 0.280468 2025-01-16 02:01:51,549 - INFO - step 16732, loss: 0.355241, best loss: 0.280468 2025-01-16 02:01:51,699 - INFO - step 16733, loss: 0.425403, best loss: 0.280468 2025-01-16 02:01:51,849 - INFO - step 16734, loss: 0.399514, best loss: 0.280468 2025-01-16 02:01:51,999 - INFO - step 16735, loss: 0.370868, best loss: 0.280468 2025-01-16 02:01:52,149 - INFO - step 16736, loss: 0.406954, best loss: 0.280468 2025-01-16 02:01:52,299 - INFO - step 16737, loss: 0.365538, best loss: 0.280468 2025-01-16 02:01:52,449 - INFO - step 16738, loss: 0.309540, best loss: 0.280468 2025-01-16 02:01:52,600 - INFO - step 16739, loss: 0.375162, best loss: 0.280468 2025-01-16 02:01:52,749 - INFO - step 16740, loss: 0.421080, best loss: 0.280468 2025-01-16 02:01:52,899 - INFO - step 16741, loss: 0.399745, best loss: 0.280468 2025-01-16 02:01:53,049 - INFO - step 16742, loss: 0.386945, best loss: 0.280468 2025-01-16 02:01:53,199 - INFO - step 16743, loss: 
0.352913, best loss: 0.280468 2025-01-16 02:01:53,350 - INFO - step 16744, loss: 0.417012, best loss: 0.280468 2025-01-16 02:01:53,500 - INFO - step 16745, loss: 0.333058, best loss: 0.280468 2025-01-16 02:01:53,650 - INFO - step 16746, loss: 0.427030, best loss: 0.280468 2025-01-16 02:01:53,800 - INFO - step 16747, loss: 0.374276, best loss: 0.280468 2025-01-16 02:01:53,950 - INFO - step 16748, loss: 0.349980, best loss: 0.280468 2025-01-16 02:01:54,101 - INFO - step 16749, loss: 0.345746, best loss: 0.280468 2025-01-16 02:01:54,251 - INFO - step 16750, loss: 0.372678, best loss: 0.280468 2025-01-16 02:01:54,401 - INFO - step 16751, loss: 0.379823, best loss: 0.280468 2025-01-16 02:01:54,551 - INFO - step 16752, loss: 0.314690, best loss: 0.280468 2025-01-16 02:01:54,702 - INFO - step 16753, loss: 0.418223, best loss: 0.280468 2025-01-16 02:01:54,852 - INFO - step 16754, loss: 0.357373, best loss: 0.280468 2025-01-16 02:01:55,002 - INFO - step 16755, loss: 0.328516, best loss: 0.280468 2025-01-16 02:01:55,152 - INFO - step 16756, loss: 0.349470, best loss: 0.280468 2025-01-16 02:01:55,302 - INFO - step 16757, loss: 0.404448, best loss: 0.280468 2025-01-16 02:01:55,452 - INFO - step 16758, loss: 0.380318, best loss: 0.280468 2025-01-16 02:01:55,602 - INFO - step 16759, loss: 0.422667, best loss: 0.280468 2025-01-16 02:01:55,752 - INFO - step 16760, loss: 0.363648, best loss: 0.280468 2025-01-16 02:01:55,902 - INFO - step 16761, loss: 0.312289, best loss: 0.280468 2025-01-16 02:01:56,052 - INFO - step 16762, loss: 0.375735, best loss: 0.280468 2025-01-16 02:01:56,203 - INFO - step 16763, loss: 0.336082, best loss: 0.280468 2025-01-16 02:01:56,352 - INFO - step 16764, loss: 0.342158, best loss: 0.280468 2025-01-16 02:01:56,503 - INFO - step 16765, loss: 0.383901, best loss: 0.280468 2025-01-16 02:01:56,652 - INFO - step 16766, loss: 0.413289, best loss: 0.280468 2025-01-16 02:01:56,802 - INFO - step 16767, loss: 0.304533, best loss: 0.280468 2025-01-16 02:01:56,952 - 
INFO - step 16768, loss: 0.347236, best loss: 0.280468 2025-01-16 02:01:57,102 - INFO - step 16769, loss: 0.352412, best loss: 0.280468 2025-01-16 02:01:57,252 - INFO - step 16770, loss: 0.357197, best loss: 0.280468 2025-01-16 02:01:57,402 - INFO - step 16771, loss: 0.342101, best loss: 0.280468 2025-01-16 02:01:57,553 - INFO - step 16772, loss: 0.303245, best loss: 0.280468 2025-01-16 02:01:57,703 - INFO - step 16773, loss: 0.305837, best loss: 0.280468 2025-01-16 02:01:57,852 - INFO - step 16774, loss: 0.362939, best loss: 0.280468 2025-01-16 02:01:58,002 - INFO - step 16775, loss: 0.389090, best loss: 0.280468 2025-01-16 02:01:58,153 - INFO - step 16776, loss: 0.306644, best loss: 0.280468 2025-01-16 02:01:58,303 - INFO - step 16777, loss: 0.381183, best loss: 0.280468 2025-01-16 02:01:58,453 - INFO - step 16778, loss: 0.355179, best loss: 0.280468 2025-01-16 02:01:58,603 - INFO - step 16779, loss: 0.369442, best loss: 0.280468 2025-01-16 02:01:58,753 - INFO - step 16780, loss: 0.351412, best loss: 0.280468 2025-01-16 02:01:58,903 - INFO - step 16781, loss: 0.293737, best loss: 0.280468 2025-01-16 02:01:59,053 - INFO - step 16782, loss: 0.382853, best loss: 0.280468 2025-01-16 02:01:59,203 - INFO - step 16783, loss: 0.345790, best loss: 0.280468 2025-01-16 02:01:59,353 - INFO - step 16784, loss: 0.378195, best loss: 0.280468 2025-01-16 02:01:59,503 - INFO - step 16785, loss: 0.379134, best loss: 0.280468 2025-01-16 02:01:59,653 - INFO - step 16786, loss: 0.315814, best loss: 0.280468 2025-01-16 02:01:59,803 - INFO - step 16787, loss: 0.296856, best loss: 0.280468 2025-01-16 02:01:59,953 - INFO - step 16788, loss: 0.341323, best loss: 0.280468 2025-01-16 02:02:00,103 - INFO - step 16789, loss: 0.331966, best loss: 0.280468 2025-01-16 02:02:00,253 - INFO - step 16790, loss: 0.384178, best loss: 0.280468 2025-01-16 02:02:00,403 - INFO - step 16791, loss: 0.350590, best loss: 0.280468 2025-01-16 02:02:00,553 - INFO - step 16792, loss: 0.357025, best loss: 0.280468 
2025-01-16 02:02:00,703 - INFO - step 16793, loss: 0.353733, best loss: 0.280468 2025-01-16 02:02:00,853 - INFO - step 16794, loss: 0.376674, best loss: 0.280468 2025-01-16 02:02:01,003 - INFO - step 16795, loss: 0.355420, best loss: 0.280468 2025-01-16 02:02:01,153 - INFO - step 16796, loss: 0.345540, best loss: 0.280468 2025-01-16 02:02:01,304 - INFO - step 16797, loss: 0.288116, best loss: 0.280468 2025-01-16 02:02:01,454 - INFO - step 16798, loss: 0.336793, best loss: 0.280468 2025-01-16 02:02:01,604 - INFO - step 16799, loss: 0.334771, best loss: 0.280468 2025-01-16 02:02:01,753 - INFO - step 16800, loss: 0.383356, best loss: 0.280468 2025-01-16 02:02:01,903 - INFO - step 16801, loss: 0.373006, best loss: 0.280468 2025-01-16 02:02:02,054 - INFO - step 16802, loss: 0.329806, best loss: 0.280468 2025-01-16 02:02:02,204 - INFO - step 16803, loss: 0.341706, best loss: 0.280468 2025-01-16 02:02:02,354 - INFO - step 16804, loss: 0.355061, best loss: 0.280468 2025-01-16 02:02:02,504 - INFO - step 16805, loss: 0.449820, best loss: 0.280468 2025-01-16 02:02:02,654 - INFO - step 16806, loss: 0.284087, best loss: 0.280468 2025-01-16 02:02:02,803 - INFO - step 16807, loss: 0.315121, best loss: 0.280468 2025-01-16 02:02:02,953 - INFO - step 16808, loss: 0.341292, best loss: 0.280468 2025-01-16 02:02:03,103 - INFO - step 16809, loss: 0.416093, best loss: 0.280468 2025-01-16 02:02:03,254 - INFO - step 16810, loss: 0.392117, best loss: 0.280468 2025-01-16 02:02:03,404 - INFO - step 16811, loss: 0.376783, best loss: 0.280468 2025-01-16 02:02:03,554 - INFO - step 16812, loss: 0.377687, best loss: 0.280468 2025-01-16 02:02:03,704 - INFO - step 16813, loss: 0.346973, best loss: 0.280468 2025-01-16 02:02:03,854 - INFO - step 16814, loss: 0.368625, best loss: 0.280468 2025-01-16 02:02:04,003 - INFO - step 16815, loss: 0.336883, best loss: 0.280468 2025-01-16 02:02:04,154 - INFO - step 16816, loss: 0.355401, best loss: 0.280468 2025-01-16 02:02:04,304 - INFO - step 16817, loss: 
0.318278, best loss: 0.280468 2025-01-16 02:02:04,454 - INFO - step 16818, loss: 0.411042, best loss: 0.280468 2025-01-16 02:02:04,604 - INFO - step 16819, loss: 0.387234, best loss: 0.280468 2025-01-16 02:02:04,754 - INFO - step 16820, loss: 0.438806, best loss: 0.280468 2025-01-16 02:02:04,904 - INFO - step 16821, loss: 0.430110, best loss: 0.280468 2025-01-16 02:02:05,054 - INFO - step 16822, loss: 0.331056, best loss: 0.280468 2025-01-16 02:02:05,204 - INFO - step 16823, loss: 0.368204, best loss: 0.280468 2025-01-16 02:02:05,354 - INFO - step 16824, loss: 0.313270, best loss: 0.280468 2025-01-16 02:02:05,504 - INFO - step 16825, loss: 0.412614, best loss: 0.280468 2025-01-16 02:02:05,654 - INFO - step 16826, loss: 0.359447, best loss: 0.280468 2025-01-16 02:02:05,804 - INFO - step 16827, loss: 0.305140, best loss: 0.280468 2025-01-16 02:02:05,954 - INFO - step 16828, loss: 0.297786, best loss: 0.280468 2025-01-16 02:02:06,104 - INFO - step 16829, loss: 0.402493, best loss: 0.280468 2025-01-16 02:02:06,254 - INFO - step 16830, loss: 0.365583, best loss: 0.280468 2025-01-16 02:02:06,404 - INFO - step 16831, loss: 0.422274, best loss: 0.280468 2025-01-16 02:02:06,554 - INFO - step 16832, loss: 0.366735, best loss: 0.280468 2025-01-16 02:02:06,704 - INFO - step 16833, loss: 0.421843, best loss: 0.280468 2025-01-16 02:02:06,854 - INFO - step 16834, loss: 0.370494, best loss: 0.280468 2025-01-16 02:02:07,004 - INFO - step 16835, loss: 0.292640, best loss: 0.280468 2025-01-16 02:02:07,154 - INFO - step 16836, loss: 0.343467, best loss: 0.280468 2025-01-16 02:02:07,305 - INFO - step 16837, loss: 0.315294, best loss: 0.280468 2025-01-16 02:02:07,455 - INFO - step 16838, loss: 0.406152, best loss: 0.280468 2025-01-16 02:02:07,605 - INFO - step 16839, loss: 0.407231, best loss: 0.280468 2025-01-16 02:02:07,755 - INFO - step 16840, loss: 0.365984, best loss: 0.280468 2025-01-16 02:02:07,905 - INFO - step 16841, loss: 0.374456, best loss: 0.280468 2025-01-16 02:02:08,055 - 
INFO - step 16842, loss: 0.344694, best loss: 0.280468 2025-01-16 02:02:08,206 - INFO - step 16843, loss: 0.372258, best loss: 0.280468 2025-01-16 02:02:08,355 - INFO - step 16844, loss: 0.463499, best loss: 0.280468 2025-01-16 02:02:08,506 - INFO - step 16845, loss: 0.389638, best loss: 0.280468 2025-01-16 02:02:08,656 - INFO - step 16846, loss: 0.369978, best loss: 0.280468 2025-01-16 02:02:08,806 - INFO - step 16847, loss: 0.378133, best loss: 0.280468 2025-01-16 02:02:08,956 - INFO - step 16848, loss: 0.373721, best loss: 0.280468 2025-01-16 02:02:09,106 - INFO - step 16849, loss: 0.354755, best loss: 0.280468 2025-01-16 02:02:09,256 - INFO - step 16850, loss: 0.348935, best loss: 0.280468 2025-01-16 02:02:09,406 - INFO - step 16851, loss: 0.374302, best loss: 0.280468 2025-01-16 02:02:09,556 - INFO - step 16852, loss: 0.368890, best loss: 0.280468 2025-01-16 02:02:09,707 - INFO - step 16853, loss: 0.368786, best loss: 0.280468 2025-01-16 02:02:09,857 - INFO - step 16854, loss: 0.356279, best loss: 0.280468 2025-01-16 02:02:10,007 - INFO - step 16855, loss: 0.382040, best loss: 0.280468 2025-01-16 02:02:10,158 - INFO - step 16856, loss: 0.415633, best loss: 0.280468 2025-01-16 02:02:10,308 - INFO - step 16857, loss: 0.366936, best loss: 0.280468 2025-01-16 02:02:10,458 - INFO - step 16858, loss: 0.337114, best loss: 0.280468 2025-01-16 02:02:10,609 - INFO - step 16859, loss: 0.375721, best loss: 0.280468 2025-01-16 02:02:10,759 - INFO - step 16860, loss: 0.381190, best loss: 0.280468 2025-01-16 02:02:10,909 - INFO - step 16861, loss: 0.338904, best loss: 0.280468 2025-01-16 02:02:11,059 - INFO - step 16862, loss: 0.349930, best loss: 0.280468 2025-01-16 02:02:11,209 - INFO - step 16863, loss: 0.378242, best loss: 0.280468 2025-01-16 02:02:11,359 - INFO - step 16864, loss: 0.374488, best loss: 0.280468 2025-01-16 02:02:11,509 - INFO - step 16865, loss: 0.390502, best loss: 0.280468 2025-01-16 02:02:11,659 - INFO - step 16866, loss: 0.377987, best loss: 0.280468 
2025-01-16 02:02:11,809 - INFO - step 16867, loss: 0.370838, best loss: 0.280468 2025-01-16 02:02:11,960 - INFO - step 16868, loss: 0.366307, best loss: 0.280468 2025-01-16 02:02:12,110 - INFO - step 16869, loss: 0.320810, best loss: 0.280468 2025-01-16 02:02:12,260 - INFO - step 16870, loss: 0.344430, best loss: 0.280468 2025-01-16 02:02:12,410 - INFO - step 16871, loss: 0.325083, best loss: 0.280468 2025-01-16 02:02:12,560 - INFO - step 16872, loss: 0.422048, best loss: 0.280468 2025-01-16 02:02:12,710 - INFO - step 16873, loss: 0.351756, best loss: 0.280468 2025-01-16 02:02:12,860 - INFO - step 16874, loss: 0.353284, best loss: 0.280468 2025-01-16 02:02:13,010 - INFO - step 16875, loss: 0.420270, best loss: 0.280468 2025-01-16 02:02:13,160 - INFO - step 16876, loss: 0.369312, best loss: 0.280468 2025-01-16 02:02:13,310 - INFO - step 16877, loss: 0.339503, best loss: 0.280468 2025-01-16 02:02:13,460 - INFO - step 16878, loss: 0.346607, best loss: 0.280468 2025-01-16 02:02:13,610 - INFO - step 16879, loss: 0.329375, best loss: 0.280468 2025-01-16 02:02:13,760 - INFO - step 16880, loss: 0.348950, best loss: 0.280468 2025-01-16 02:02:13,910 - INFO - step 16881, loss: 0.398990, best loss: 0.280468 2025-01-16 02:02:14,060 - INFO - step 16882, loss: 0.364915, best loss: 0.280468 2025-01-16 02:02:14,210 - INFO - step 16883, loss: 0.386352, best loss: 0.280468 2025-01-16 02:02:14,360 - INFO - step 16884, loss: 0.304602, best loss: 0.280468 2025-01-16 02:02:14,510 - INFO - step 16885, loss: 0.334974, best loss: 0.280468 2025-01-16 02:02:14,660 - INFO - step 16886, loss: 0.316976, best loss: 0.280468 2025-01-16 02:02:14,810 - INFO - step 16887, loss: 0.342516, best loss: 0.280468 2025-01-16 02:02:14,961 - INFO - step 16888, loss: 0.342713, best loss: 0.280468 2025-01-16 02:02:15,111 - INFO - step 16889, loss: 0.351209, best loss: 0.280468 2025-01-16 02:02:15,261 - INFO - step 16890, loss: 0.375109, best loss: 0.280468 2025-01-16 02:02:15,411 - INFO - step 16891, loss: 
0.395183, best loss: 0.280468 2025-01-16 02:02:15,561 - INFO - step 16892, loss: 0.403497, best loss: 0.280468 2025-01-16 02:02:15,710 - INFO - step 16893, loss: 0.372950, best loss: 0.280468 2025-01-16 02:02:15,861 - INFO - step 16894, loss: 0.365796, best loss: 0.280468 2025-01-16 02:02:16,010 - INFO - step 16895, loss: 0.332518, best loss: 0.280468 2025-01-16 02:02:16,161 - INFO - step 16896, loss: 0.435483, best loss: 0.280468 2025-01-16 02:02:16,311 - INFO - step 16897, loss: 0.344318, best loss: 0.280468 2025-01-16 02:02:16,461 - INFO - step 16898, loss: 0.368112, best loss: 0.280468 2025-01-16 02:02:16,611 - INFO - step 16899, loss: 0.372226, best loss: 0.280468 2025-01-16 02:02:16,761 - INFO - step 16900, loss: 0.376376, best loss: 0.280468 2025-01-16 02:02:16,911 - INFO - step 16901, loss: 0.358154, best loss: 0.280468 2025-01-16 02:02:17,061 - INFO - step 16902, loss: 0.309503, best loss: 0.280468 2025-01-16 02:02:17,211 - INFO - step 16903, loss: 0.321104, best loss: 0.280468 2025-01-16 02:02:17,361 - INFO - step 16904, loss: 0.412056, best loss: 0.280468 2025-01-16 02:02:17,511 - INFO - step 16905, loss: 0.348037, best loss: 0.280468 2025-01-16 02:02:17,662 - INFO - step 16906, loss: 0.316835, best loss: 0.280468 2025-01-16 02:02:17,812 - INFO - step 16907, loss: 0.379222, best loss: 0.280468 2025-01-16 02:02:17,962 - INFO - step 16908, loss: 0.385354, best loss: 0.280468 2025-01-16 02:02:18,112 - INFO - step 16909, loss: 0.328398, best loss: 0.280468 2025-01-16 02:02:18,262 - INFO - step 16910, loss: 0.315930, best loss: 0.280468 2025-01-16 02:02:18,412 - INFO - step 16911, loss: 0.281297, best loss: 0.280468 2025-01-16 02:02:18,562 - INFO - step 16912, loss: 0.346186, best loss: 0.280468 2025-01-16 02:02:18,712 - INFO - step 16913, loss: 0.358245, best loss: 0.280468 2025-01-16 02:02:18,862 - INFO - step 16914, loss: 0.315306, best loss: 0.280468 2025-01-16 02:02:19,012 - INFO - step 16915, loss: 0.331731, best loss: 0.280468 2025-01-16 02:02:19,162 - 
INFO - step 16916, loss: 0.309333, best loss: 0.280468 2025-01-16 02:02:19,312 - INFO - step 16917, loss: 0.347706, best loss: 0.280468 2025-01-16 02:02:19,463 - INFO - step 16918, loss: 0.315775, best loss: 0.280468 2025-01-16 02:02:19,614 - INFO - step 16919, loss: 0.363651, best loss: 0.280468 2025-01-16 02:02:23,130 - INFO - step 16920, loss: 0.270260, best loss: 0.270260 2025-01-16 02:02:23,293 - INFO - step 16921, loss: 0.338967, best loss: 0.270260 2025-01-16 02:02:23,449 - INFO - step 16922, loss: 0.330762, best loss: 0.270260 2025-01-16 02:02:23,599 - INFO - step 16923, loss: 0.323585, best loss: 0.270260 2025-01-16 02:02:23,749 - INFO - step 16924, loss: 0.367345, best loss: 0.270260 2025-01-16 02:02:23,899 - INFO - step 16925, loss: 0.426117, best loss: 0.270260 2025-01-16 02:02:24,049 - INFO - step 16926, loss: 0.319353, best loss: 0.270260 2025-01-16 02:02:24,199 - INFO - step 16927, loss: 0.319592, best loss: 0.270260 2025-01-16 02:02:24,349 - INFO - step 16928, loss: 0.278291, best loss: 0.270260 2025-01-16 02:02:24,499 - INFO - step 16929, loss: 0.311542, best loss: 0.270260 2025-01-16 02:02:24,649 - INFO - step 16930, loss: 0.353633, best loss: 0.270260 2025-01-16 02:02:24,799 - INFO - step 16931, loss: 0.306690, best loss: 0.270260 2025-01-16 02:02:32,108 - INFO - step 16932, loss: 0.267861, best loss: 0.267861 2025-01-16 02:02:32,258 - INFO - step 16933, loss: 0.324400, best loss: 0.267861 2025-01-16 02:02:32,408 - INFO - step 16934, loss: 0.302741, best loss: 0.267861 2025-01-16 02:02:32,558 - INFO - step 16935, loss: 0.277565, best loss: 0.267861 2025-01-16 02:02:32,709 - INFO - step 16936, loss: 0.421661, best loss: 0.267861 2025-01-16 02:02:32,859 - INFO - step 16937, loss: 0.312823, best loss: 0.267861 2025-01-16 02:02:33,010 - INFO - step 16938, loss: 0.325683, best loss: 0.267861 2025-01-16 02:02:33,160 - INFO - step 16939, loss: 0.353297, best loss: 0.267861 2025-01-16 02:02:33,310 - INFO - step 16940, loss: 0.299246, best loss: 0.267861 
2025-01-16 02:02:33,460 - INFO - step 16941, loss: 0.322314, best loss: 0.267861 2025-01-16 02:02:33,610 - INFO - step 16942, loss: 0.305265, best loss: 0.267861 2025-01-16 02:02:33,761 - INFO - step 16943, loss: 0.285445, best loss: 0.267861 2025-01-16 02:02:33,911 - INFO - step 16944, loss: 0.368529, best loss: 0.267861 2025-01-16 02:02:34,061 - INFO - step 16945, loss: 0.305028, best loss: 0.267861 2025-01-16 02:02:34,211 - INFO - step 16946, loss: 0.366485, best loss: 0.267861 2025-01-16 02:02:34,362 - INFO - step 16947, loss: 0.332419, best loss: 0.267861 2025-01-16 02:02:34,512 - INFO - step 16948, loss: 0.297029, best loss: 0.267861 2025-01-16 02:02:34,662 - INFO - step 16949, loss: 0.333489, best loss: 0.267861 2025-01-16 02:02:34,812 - INFO - step 16950, loss: 0.372975, best loss: 0.267861 2025-01-16 02:02:34,962 - INFO - step 16951, loss: 0.307788, best loss: 0.267861 2025-01-16 02:02:35,112 - INFO - step 16952, loss: 0.337322, best loss: 0.267861 2025-01-16 02:02:35,262 - INFO - step 16953, loss: 0.282271, best loss: 0.267861 2025-01-16 02:02:35,412 - INFO - step 16954, loss: 0.320268, best loss: 0.267861 2025-01-16 02:02:35,562 - INFO - step 16955, loss: 0.278202, best loss: 0.267861 2025-01-16 02:02:35,712 - INFO - step 16956, loss: 0.322907, best loss: 0.267861 2025-01-16 02:02:35,862 - INFO - step 16957, loss: 0.332721, best loss: 0.267861 2025-01-16 02:02:36,013 - INFO - step 16958, loss: 0.317985, best loss: 0.267861 2025-01-16 02:02:36,163 - INFO - step 16959, loss: 0.363688, best loss: 0.267861 2025-01-16 02:02:36,312 - INFO - step 16960, loss: 0.307581, best loss: 0.267861 2025-01-16 02:02:36,462 - INFO - step 16961, loss: 0.342381, best loss: 0.267861 2025-01-16 02:02:36,612 - INFO - step 16962, loss: 0.364915, best loss: 0.267861 2025-01-16 02:02:36,762 - INFO - step 16963, loss: 0.315048, best loss: 0.267861 2025-01-16 02:02:36,913 - INFO - step 16964, loss: 0.362179, best loss: 0.267861 2025-01-16 02:02:37,062 - INFO - step 16965, loss: 
0.303185, best loss: 0.267861 2025-01-16 02:02:40,816 - INFO - step 16966, loss: 0.259557, best loss: 0.259557 2025-01-16 02:02:40,967 - INFO - step 16967, loss: 0.342357, best loss: 0.259557 2025-01-16 02:02:41,117 - INFO - step 16968, loss: 0.369890, best loss: 0.259557 2025-01-16 02:02:41,268 - INFO - step 16969, loss: 0.350220, best loss: 0.259557 2025-01-16 02:02:41,418 - INFO - step 16970, loss: 0.290801, best loss: 0.259557 2025-01-16 02:02:41,568 - INFO - step 16971, loss: 0.348350, best loss: 0.259557 2025-01-16 02:02:41,718 - INFO - step 16972, loss: 0.364279, best loss: 0.259557 2025-01-16 02:02:41,868 - INFO - step 16973, loss: 0.338760, best loss: 0.259557 2025-01-16 02:02:42,018 - INFO - step 16974, loss: 0.303075, best loss: 0.259557 2025-01-16 02:02:42,168 - INFO - step 16975, loss: 0.331646, best loss: 0.259557 2025-01-16 02:02:42,318 - INFO - step 16976, loss: 0.350272, best loss: 0.259557 2025-01-16 02:02:42,468 - INFO - step 16977, loss: 0.311507, best loss: 0.259557 2025-01-16 02:02:42,618 - INFO - step 16978, loss: 0.335722, best loss: 0.259557 2025-01-16 02:02:42,769 - INFO - step 16979, loss: 0.310163, best loss: 0.259557 2025-01-16 02:02:42,919 - INFO - step 16980, loss: 0.325777, best loss: 0.259557 2025-01-16 02:02:43,069 - INFO - step 16981, loss: 0.355382, best loss: 0.259557 2025-01-16 02:02:43,219 - INFO - step 16982, loss: 0.280918, best loss: 0.259557 2025-01-16 02:02:43,369 - INFO - step 16983, loss: 0.323798, best loss: 0.259557 2025-01-16 02:02:43,519 - INFO - step 16984, loss: 0.352916, best loss: 0.259557 2025-01-16 02:02:43,669 - INFO - step 16985, loss: 0.302599, best loss: 0.259557 2025-01-16 02:02:43,819 - INFO - step 16986, loss: 0.332235, best loss: 0.259557 2025-01-16 02:02:43,969 - INFO - step 16987, loss: 0.316775, best loss: 0.259557 2025-01-16 02:02:44,119 - INFO - step 16988, loss: 0.307650, best loss: 0.259557 2025-01-16 02:02:44,270 - INFO - step 16989, loss: 0.415211, best loss: 0.259557 2025-01-16 02:02:44,420 - 
INFO - step 16990, loss: 0.281570, best loss: 0.259557 2025-01-16 02:02:44,570 - INFO - step 16991, loss: 0.278342, best loss: 0.259557 2025-01-16 02:02:44,720 - INFO - step 16992, loss: 0.320182, best loss: 0.259557 2025-01-16 02:02:44,871 - INFO - step 16993, loss: 0.293010, best loss: 0.259557 2025-01-16 02:02:45,021 - INFO - step 16994, loss: 0.292993, best loss: 0.259557 2025-01-16 02:02:45,171 - INFO - step 16995, loss: 0.316646, best loss: 0.259557 2025-01-16 02:02:45,321 - INFO - step 16996, loss: 0.389387, best loss: 0.259557 2025-01-16 02:02:45,472 - INFO - step 16997, loss: 0.343861, best loss: 0.259557 2025-01-16 02:02:45,622 - INFO - step 16998, loss: 0.317435, best loss: 0.259557 2025-01-16 02:02:45,772 - INFO - step 16999, loss: 0.289567, best loss: 0.259557 2025-01-16 02:02:45,922 - INFO - step 17000, loss: 0.300064, best loss: 0.259557 2025-01-16 02:02:46,073 - INFO - step 17001, loss: 0.282610, best loss: 0.259557 2025-01-16 02:02:46,223 - INFO - step 17002, loss: 0.314082, best loss: 0.259557 2025-01-16 02:02:46,373 - INFO - step 17003, loss: 0.306055, best loss: 0.259557 2025-01-16 02:02:46,523 - INFO - step 17004, loss: 0.332634, best loss: 0.259557 2025-01-16 02:02:46,673 - INFO - step 17005, loss: 0.283903, best loss: 0.259557 2025-01-16 02:02:46,823 - INFO - step 17006, loss: 0.374908, best loss: 0.259557 2025-01-16 02:02:46,973 - INFO - step 17007, loss: 0.297582, best loss: 0.259557 2025-01-16 02:02:47,123 - INFO - step 17008, loss: 0.311980, best loss: 0.259557 2025-01-16 02:02:47,273 - INFO - step 17009, loss: 0.369351, best loss: 0.259557 2025-01-16 02:02:47,424 - INFO - step 17010, loss: 0.408089, best loss: 0.259557 2025-01-16 02:02:47,574 - INFO - step 17011, loss: 0.384280, best loss: 0.259557 2025-01-16 02:02:47,724 - INFO - step 17012, loss: 0.340546, best loss: 0.259557 2025-01-16 02:02:47,874 - INFO - step 17013, loss: 0.399140, best loss: 0.259557 2025-01-16 02:02:48,024 - INFO - step 17014, loss: 0.305860, best loss: 0.259557 
2025-01-16 02:02:48,175 - INFO - step 17015, loss: 0.343253, best loss: 0.259557 2025-01-16 02:02:48,325 - INFO - step 17016, loss: 0.355829, best loss: 0.259557 2025-01-16 02:02:48,476 - INFO - step 17017, loss: 0.382725, best loss: 0.259557 2025-01-16 02:02:48,626 - INFO - step 17018, loss: 0.293994, best loss: 0.259557 2025-01-16 02:02:48,776 - INFO - step 17019, loss: 0.383874, best loss: 0.259557 2025-01-16 02:02:48,926 - INFO - step 17020, loss: 0.348177, best loss: 0.259557 2025-01-16 02:02:49,076 - INFO - step 17021, loss: 0.353852, best loss: 0.259557 2025-01-16 02:02:49,226 - INFO - step 17022, loss: 0.371716, best loss: 0.259557 2025-01-16 02:02:49,376 - INFO - step 17023, loss: 0.300617, best loss: 0.259557 2025-01-16 02:02:49,527 - INFO - step 17024, loss: 0.375263, best loss: 0.259557 2025-01-16 02:02:49,677 - INFO - step 17025, loss: 0.384209, best loss: 0.259557 2025-01-16 02:02:49,827 - INFO - step 17026, loss: 0.361066, best loss: 0.259557 2025-01-16 02:02:49,977 - INFO - step 17027, loss: 0.360827, best loss: 0.259557 2025-01-16 02:02:50,127 - INFO - step 17028, loss: 0.299493, best loss: 0.259557 2025-01-16 02:02:50,277 - INFO - step 17029, loss: 0.285685, best loss: 0.259557 2025-01-16 02:02:50,427 - INFO - step 17030, loss: 0.319530, best loss: 0.259557 2025-01-16 02:02:50,578 - INFO - step 17031, loss: 0.337379, best loss: 0.259557 2025-01-16 02:02:50,728 - INFO - step 17032, loss: 0.354711, best loss: 0.259557 2025-01-16 02:02:50,878 - INFO - step 17033, loss: 0.349979, best loss: 0.259557 2025-01-16 02:02:51,028 - INFO - step 17034, loss: 0.328218, best loss: 0.259557 2025-01-16 02:02:51,178 - INFO - step 17035, loss: 0.320288, best loss: 0.259557 2025-01-16 02:02:51,328 - INFO - step 17036, loss: 0.330020, best loss: 0.259557 2025-01-16 02:02:51,478 - INFO - step 17037, loss: 0.326088, best loss: 0.259557 2025-01-16 02:02:51,628 - INFO - step 17038, loss: 0.371246, best loss: 0.259557 2025-01-16 02:02:51,778 - INFO - step 17039, loss: 
0.356512, best loss: 0.259557 2025-01-16 02:02:51,928 - INFO - step 17040, loss: 0.273831, best loss: 0.259557 2025-01-16 02:02:52,078 - INFO - step 17041, loss: 0.272657, best loss: 0.259557 2025-01-16 02:02:52,228 - INFO - step 17042, loss: 0.266396, best loss: 0.259557 2025-01-16 02:02:52,378 - INFO - step 17043, loss: 0.311696, best loss: 0.259557 2025-01-16 02:02:52,528 - INFO - step 17044, loss: 0.328908, best loss: 0.259557 2025-01-16 02:02:52,678 - INFO - step 17045, loss: 0.369906, best loss: 0.259557 2025-01-16 02:02:52,828 - INFO - step 17046, loss: 0.398895, best loss: 0.259557 2025-01-16 02:02:52,978 - INFO - step 17047, loss: 0.382040, best loss: 0.259557 2025-01-16 02:02:53,129 - INFO - step 17048, loss: 0.300222, best loss: 0.259557 2025-01-16 02:02:53,278 - INFO - step 17049, loss: 0.316997, best loss: 0.259557 2025-01-16 02:02:53,429 - INFO - step 17050, loss: 0.386147, best loss: 0.259557 2025-01-16 02:02:53,579 - INFO - step 17051, loss: 0.396126, best loss: 0.259557 2025-01-16 02:02:53,729 - INFO - step 17052, loss: 0.455953, best loss: 0.259557 2025-01-16 02:02:53,879 - INFO - step 17053, loss: 0.375634, best loss: 0.259557 2025-01-16 02:02:54,030 - INFO - step 17054, loss: 0.412509, best loss: 0.259557 2025-01-16 02:02:54,180 - INFO - step 17055, loss: 0.332929, best loss: 0.259557 2025-01-16 02:02:54,330 - INFO - step 17056, loss: 0.359118, best loss: 0.259557 2025-01-16 02:02:54,480 - INFO - step 17057, loss: 0.363272, best loss: 0.259557 2025-01-16 02:02:54,630 - INFO - step 17058, loss: 0.271997, best loss: 0.259557 2025-01-16 02:02:54,780 - INFO - step 17059, loss: 0.326131, best loss: 0.259557 2025-01-16 02:02:54,930 - INFO - step 17060, loss: 0.356975, best loss: 0.259557 2025-01-16 02:02:55,080 - INFO - step 17061, loss: 0.267038, best loss: 0.259557 2025-01-16 02:02:55,230 - INFO - step 17062, loss: 0.299647, best loss: 0.259557 2025-01-16 02:02:55,381 - INFO - step 17063, loss: 0.348044, best loss: 0.259557 2025-01-16 02:02:55,531 - 
INFO - step 17064, loss: 0.392037, best loss: 0.259557 2025-01-16 02:02:55,681 - INFO - step 17065, loss: 0.363173, best loss: 0.259557 2025-01-16 02:02:55,831 - INFO - step 17066, loss: 0.332003, best loss: 0.259557 2025-01-16 02:02:55,981 - INFO - step 17067, loss: 0.285024, best loss: 0.259557 2025-01-16 02:02:56,131 - INFO - step 17068, loss: 0.312240, best loss: 0.259557 2025-01-16 02:02:56,281 - INFO - step 17069, loss: 0.333644, best loss: 0.259557 2025-01-16 02:02:56,432 - INFO - step 17070, loss: 0.363093, best loss: 0.259557 2025-01-16 02:02:56,583 - INFO - step 17071, loss: 0.354026, best loss: 0.259557 2025-01-16 02:02:56,733 - INFO - step 17072, loss: 0.405745, best loss: 0.259557 2025-01-16 02:02:56,883 - INFO - step 17073, loss: 0.389162, best loss: 0.259557 2025-01-16 02:02:57,033 - INFO - step 17074, loss: 0.347749, best loss: 0.259557 2025-01-16 02:02:57,183 - INFO - step 17075, loss: 0.300485, best loss: 0.259557 2025-01-16 02:02:57,333 - INFO - step 17076, loss: 0.322182, best loss: 0.259557 2025-01-16 02:02:57,483 - INFO - step 17077, loss: 0.300232, best loss: 0.259557 2025-01-16 02:02:57,633 - INFO - step 17078, loss: 0.333043, best loss: 0.259557 2025-01-16 02:03:01,161 - INFO - step 17079, loss: 0.254389, best loss: 0.254389 2025-01-16 02:03:01,311 - INFO - step 17080, loss: 0.329139, best loss: 0.254389 2025-01-16 02:03:01,461 - INFO - step 17081, loss: 0.334362, best loss: 0.254389 2025-01-16 02:03:01,611 - INFO - step 17082, loss: 0.305437, best loss: 0.254389 2025-01-16 02:03:01,761 - INFO - step 17083, loss: 0.313267, best loss: 0.254389 2025-01-16 02:03:01,911 - INFO - step 17084, loss: 0.341327, best loss: 0.254389 2025-01-16 02:03:02,062 - INFO - step 17085, loss: 0.320849, best loss: 0.254389 2025-01-16 02:03:02,212 - INFO - step 17086, loss: 0.330516, best loss: 0.254389 2025-01-16 02:03:02,362 - INFO - step 17087, loss: 0.395198, best loss: 0.254389 2025-01-16 02:03:02,512 - INFO - step 17088, loss: 0.329355, best loss: 0.254389 
2025-01-16 02:03:02,662 - INFO - step 17089, loss: 0.383649, best loss: 0.254389 2025-01-16 02:03:02,812 - INFO - step 17090, loss: 0.430790, best loss: 0.254389 2025-01-16 02:03:02,962 - INFO - step 17091, loss: 0.338404, best loss: 0.254389 2025-01-16 02:03:03,112 - INFO - step 17092, loss: 0.362738, best loss: 0.254389 2025-01-16 02:03:03,262 - INFO - step 17093, loss: 0.309614, best loss: 0.254389 2025-01-16 02:03:03,412 - INFO - step 17094, loss: 0.337315, best loss: 0.254389 2025-01-16 02:03:03,562 - INFO - step 17095, loss: 0.345626, best loss: 0.254389 2025-01-16 02:03:03,712 - INFO - step 17096, loss: 0.382643, best loss: 0.254389 2025-01-16 02:03:03,862 - INFO - step 17097, loss: 0.276784, best loss: 0.254389 2025-01-16 02:03:04,012 - INFO - step 17098, loss: 0.316708, best loss: 0.254389 2025-01-16 02:03:04,162 - INFO - step 17099, loss: 0.333532, best loss: 0.254389 2025-01-16 02:03:04,312 - INFO - step 17100, loss: 0.394031, best loss: 0.254389 2025-01-16 02:03:04,463 - INFO - step 17101, loss: 0.324583, best loss: 0.254389 2025-01-16 02:03:04,613 - INFO - step 17102, loss: 0.318759, best loss: 0.254389 2025-01-16 02:03:04,763 - INFO - step 17103, loss: 0.319262, best loss: 0.254389 2025-01-16 02:03:04,913 - INFO - step 17104, loss: 0.318958, best loss: 0.254389 2025-01-16 02:03:05,063 - INFO - step 17105, loss: 0.377460, best loss: 0.254389 2025-01-16 02:03:05,214 - INFO - step 17106, loss: 0.323920, best loss: 0.254389 2025-01-16 02:03:05,365 - INFO - step 17107, loss: 0.375415, best loss: 0.254389 2025-01-16 02:03:05,515 - INFO - step 17108, loss: 0.340710, best loss: 0.254389 2025-01-16 02:03:05,665 - INFO - step 17109, loss: 0.338839, best loss: 0.254389 2025-01-16 02:03:05,816 - INFO - step 17110, loss: 0.326451, best loss: 0.254389 2025-01-16 02:03:05,967 - INFO - step 17111, loss: 0.260072, best loss: 0.254389 2025-01-16 02:03:06,118 - INFO - step 17112, loss: 0.358708, best loss: 0.254389 2025-01-16 02:03:06,268 - INFO - step 17113, loss: 
0.304105, best loss: 0.254389 2025-01-16 02:03:06,418 - INFO - step 17114, loss: 0.332287, best loss: 0.254389 2025-01-16 02:03:06,568 - INFO - step 17115, loss: 0.322629, best loss: 0.254389 2025-01-16 02:03:06,719 - INFO - step 17116, loss: 0.343275, best loss: 0.254389 2025-01-16 02:03:06,869 - INFO - step 17117, loss: 0.305884, best loss: 0.254389 2025-01-16 02:03:07,019 - INFO - step 17118, loss: 0.320965, best loss: 0.254389 2025-01-16 02:03:07,170 - INFO - step 17119, loss: 0.328742, best loss: 0.254389 2025-01-16 02:03:07,320 - INFO - step 17120, loss: 0.298856, best loss: 0.254389 2025-01-16 02:03:07,470 - INFO - step 17121, loss: 0.288542, best loss: 0.254389 2025-01-16 02:03:07,620 - INFO - step 17122, loss: 0.296878, best loss: 0.254389 2025-01-16 02:03:07,770 - INFO - step 17123, loss: 0.299231, best loss: 0.254389 2025-01-16 02:03:07,920 - INFO - step 17124, loss: 0.334916, best loss: 0.254389 2025-01-16 02:03:08,070 - INFO - step 17125, loss: 0.329701, best loss: 0.254389 2025-01-16 02:03:08,221 - INFO - step 17126, loss: 0.307556, best loss: 0.254389 2025-01-16 02:03:08,371 - INFO - step 17127, loss: 0.297472, best loss: 0.254389 2025-01-16 02:03:08,521 - INFO - step 17128, loss: 0.402663, best loss: 0.254389 2025-01-16 02:03:08,671 - INFO - step 17129, loss: 0.345017, best loss: 0.254389 2025-01-16 02:03:08,821 - INFO - step 17130, loss: 0.397842, best loss: 0.254389 2025-01-16 02:03:08,971 - INFO - step 17131, loss: 0.304504, best loss: 0.254389 2025-01-16 02:03:09,121 - INFO - step 17132, loss: 0.313917, best loss: 0.254389 2025-01-16 02:03:09,271 - INFO - step 17133, loss: 0.301445, best loss: 0.254389 2025-01-16 02:03:09,422 - INFO - step 17134, loss: 0.264405, best loss: 0.254389 2025-01-16 02:03:09,572 - INFO - step 17135, loss: 0.386369, best loss: 0.254389 2025-01-16 02:03:09,722 - INFO - step 17136, loss: 0.291490, best loss: 0.254389 2025-01-16 02:03:09,873 - INFO - step 17137, loss: 0.329535, best loss: 0.254389 2025-01-16 02:03:10,023 - 
INFO - step 17138, loss: 0.323219, best loss: 0.254389 2025-01-16 02:03:10,173 - INFO - step 17139, loss: 0.350354, best loss: 0.254389 2025-01-16 02:03:10,323 - INFO - step 17140, loss: 0.337899, best loss: 0.254389 2025-01-16 02:03:10,473 - INFO - step 17141, loss: 0.350467, best loss: 0.254389 2025-01-16 02:03:10,623 - INFO - step 17142, loss: 0.382329, best loss: 0.254389 2025-01-16 02:03:10,773 - INFO - step 17143, loss: 0.335189, best loss: 0.254389 2025-01-16 02:03:10,923 - INFO - step 17144, loss: 0.316904, best loss: 0.254389 2025-01-16 02:03:11,074 - INFO - step 17145, loss: 0.334592, best loss: 0.254389 2025-01-16 02:03:11,224 - INFO - step 17146, loss: 0.302639, best loss: 0.254389 2025-01-16 02:03:11,374 - INFO - step 17147, loss: 0.291635, best loss: 0.254389 2025-01-16 02:03:11,524 - INFO - step 17148, loss: 0.287700, best loss: 0.254389 2025-01-16 02:03:11,674 - INFO - step 17149, loss: 0.351757, best loss: 0.254389 2025-01-16 02:03:11,824 - INFO - step 17150, loss: 0.327458, best loss: 0.254389 2025-01-16 02:03:11,974 - INFO - step 17151, loss: 0.318084, best loss: 0.254389 2025-01-16 02:03:12,124 - INFO - step 17152, loss: 0.317274, best loss: 0.254389 2025-01-16 02:03:12,275 - INFO - step 17153, loss: 0.309661, best loss: 0.254389 2025-01-16 02:03:12,425 - INFO - step 17154, loss: 0.313937, best loss: 0.254389 2025-01-16 02:03:12,575 - INFO - step 17155, loss: 0.383626, best loss: 0.254389 2025-01-16 02:03:12,725 - INFO - step 17156, loss: 0.382855, best loss: 0.254389 2025-01-16 02:03:12,876 - INFO - step 17157, loss: 0.299140, best loss: 0.254389 2025-01-16 02:03:13,026 - INFO - step 17158, loss: 0.321208, best loss: 0.254389 2025-01-16 02:03:13,176 - INFO - step 17159, loss: 0.279044, best loss: 0.254389 2025-01-16 02:03:13,326 - INFO - step 17160, loss: 0.312266, best loss: 0.254389 2025-01-16 02:03:13,477 - INFO - step 17161, loss: 0.319796, best loss: 0.254389 2025-01-16 02:03:13,627 - INFO - step 17162, loss: 0.359478, best loss: 0.254389 
2025-01-16 02:03:13,777 - INFO - step 17163, loss: 0.314674, best loss: 0.254389 2025-01-16 02:03:13,927 - INFO - step 17164, loss: 0.342339, best loss: 0.254389 2025-01-16 02:03:14,077 - INFO - step 17165, loss: 0.295623, best loss: 0.254389 2025-01-16 02:03:14,227 - INFO - step 17166, loss: 0.334526, best loss: 0.254389 2025-01-16 02:03:14,377 - INFO - step 17167, loss: 0.311645, best loss: 0.254389 2025-01-16 02:03:14,527 - INFO - step 17168, loss: 0.376858, best loss: 0.254389 2025-01-16 02:03:14,677 - INFO - step 17169, loss: 0.358515, best loss: 0.254389 2025-01-16 02:03:14,827 - INFO - step 17170, loss: 0.372686, best loss: 0.254389 2025-01-16 02:03:14,977 - INFO - step 17171, loss: 0.315622, best loss: 0.254389 2025-01-16 02:03:18,552 - INFO - step 17172, loss: 0.247561, best loss: 0.247561 2025-01-16 02:03:18,715 - INFO - step 17173, loss: 0.331191, best loss: 0.247561 2025-01-16 02:03:18,866 - INFO - step 17174, loss: 0.346405, best loss: 0.247561 2025-01-16 02:03:19,017 - INFO - step 17175, loss: 0.303213, best loss: 0.247561 2025-01-16 02:03:19,167 - INFO - step 17176, loss: 0.325716, best loss: 0.247561 2025-01-16 02:03:19,317 - INFO - step 17177, loss: 0.334294, best loss: 0.247561 2025-01-16 02:03:19,467 - INFO - step 17178, loss: 0.378864, best loss: 0.247561 2025-01-16 02:03:19,617 - INFO - step 17179, loss: 0.315242, best loss: 0.247561 2025-01-16 02:03:19,767 - INFO - step 17180, loss: 0.305194, best loss: 0.247561 2025-01-16 02:03:19,918 - INFO - step 17181, loss: 0.376336, best loss: 0.247561 2025-01-16 02:03:20,068 - INFO - step 17182, loss: 0.330075, best loss: 0.247561 2025-01-16 02:03:20,218 - INFO - step 17183, loss: 0.311209, best loss: 0.247561 2025-01-16 02:03:20,368 - INFO - step 17184, loss: 0.382651, best loss: 0.247561 2025-01-16 02:03:20,518 - INFO - step 17185, loss: 0.304735, best loss: 0.247561 2025-01-16 02:03:20,669 - INFO - step 17186, loss: 0.306859, best loss: 0.247561 2025-01-16 02:03:20,819 - INFO - step 17187, loss: 
0.323107, best loss: 0.247561 2025-01-16 02:03:20,969 - INFO - step 17188, loss: 0.328284, best loss: 0.247561 2025-01-16 02:03:21,119 - INFO - step 17189, loss: 0.331060, best loss: 0.247561 2025-01-16 02:03:21,269 - INFO - step 17190, loss: 0.312172, best loss: 0.247561 2025-01-16 02:03:21,419 - INFO - step 17191, loss: 0.337211, best loss: 0.247561 2025-01-16 02:03:21,569 - INFO - step 17192, loss: 0.340696, best loss: 0.247561 2025-01-16 02:03:21,719 - INFO - step 17193, loss: 0.345133, best loss: 0.247561 2025-01-16 02:03:21,870 - INFO - step 17194, loss: 0.315557, best loss: 0.247561 2025-01-16 02:03:22,020 - INFO - step 17195, loss: 0.358960, best loss: 0.247561 2025-01-16 02:03:22,170 - INFO - step 17196, loss: 0.387688, best loss: 0.247561 2025-01-16 02:03:22,320 - INFO - step 17197, loss: 0.345714, best loss: 0.247561 2025-01-16 02:03:22,470 - INFO - step 17198, loss: 0.396768, best loss: 0.247561 2025-01-16 02:03:22,620 - INFO - step 17199, loss: 0.318586, best loss: 0.247561 2025-01-16 02:03:22,770 - INFO - step 17200, loss: 0.377758, best loss: 0.247561 2025-01-16 02:03:22,920 - INFO - step 17201, loss: 0.383254, best loss: 0.247561 2025-01-16 02:03:23,070 - INFO - step 17202, loss: 0.375310, best loss: 0.247561 2025-01-16 02:03:23,221 - INFO - step 17203, loss: 0.314293, best loss: 0.247561 2025-01-16 02:03:23,371 - INFO - step 17204, loss: 0.318481, best loss: 0.247561 2025-01-16 02:03:23,521 - INFO - step 17205, loss: 0.361924, best loss: 0.247561 2025-01-16 02:03:23,671 - INFO - step 17206, loss: 0.357547, best loss: 0.247561 2025-01-16 02:03:23,821 - INFO - step 17207, loss: 0.308680, best loss: 0.247561 2025-01-16 02:03:23,971 - INFO - step 17208, loss: 0.342031, best loss: 0.247561 2025-01-16 02:03:24,121 - INFO - step 17209, loss: 0.309970, best loss: 0.247561 2025-01-16 02:03:24,272 - INFO - step 17210, loss: 0.307063, best loss: 0.247561 2025-01-16 02:03:24,422 - INFO - step 17211, loss: 0.286063, best loss: 0.247561 2025-01-16 02:03:24,572 - 
INFO - step 17212, loss: 0.327522, best loss: 0.247561 2025-01-16 02:03:24,722 - INFO - step 17213, loss: 0.369769, best loss: 0.247561 2025-01-16 02:03:24,872 - INFO - step 17214, loss: 0.315238, best loss: 0.247561 2025-01-16 02:03:25,022 - INFO - step 17215, loss: 0.320696, best loss: 0.247561 2025-01-16 02:03:25,172 - INFO - step 17216, loss: 0.341229, best loss: 0.247561 2025-01-16 02:03:25,322 - INFO - step 17217, loss: 0.286426, best loss: 0.247561 2025-01-16 02:03:25,473 - INFO - step 17218, loss: 0.337749, best loss: 0.247561 2025-01-16 02:03:25,623 - INFO - step 17219, loss: 0.354896, best loss: 0.247561 2025-01-16 02:03:25,773 - INFO - step 17220, loss: 0.298083, best loss: 0.247561 2025-01-16 02:03:25,923 - INFO - step 17221, loss: 0.401102, best loss: 0.247561 2025-01-16 02:03:26,073 - INFO - step 17222, loss: 0.346469, best loss: 0.247561 2025-01-16 02:03:26,224 - INFO - step 17223, loss: 0.316693, best loss: 0.247561 2025-01-16 02:03:26,374 - INFO - step 17224, loss: 0.331613, best loss: 0.247561 2025-01-16 02:03:26,524 - INFO - step 17225, loss: 0.322182, best loss: 0.247561 2025-01-16 02:03:26,674 - INFO - step 17226, loss: 0.375054, best loss: 0.247561 2025-01-16 02:03:26,824 - INFO - step 17227, loss: 0.352005, best loss: 0.247561 2025-01-16 02:03:26,974 - INFO - step 17228, loss: 0.299662, best loss: 0.247561 2025-01-16 02:03:27,124 - INFO - step 17229, loss: 0.378222, best loss: 0.247561 2025-01-16 02:03:27,274 - INFO - step 17230, loss: 0.311220, best loss: 0.247561 2025-01-16 02:03:27,425 - INFO - step 17231, loss: 0.335653, best loss: 0.247561 2025-01-16 02:03:27,575 - INFO - step 17232, loss: 0.344089, best loss: 0.247561 2025-01-16 02:03:27,725 - INFO - step 17233, loss: 0.323526, best loss: 0.247561 2025-01-16 02:03:27,875 - INFO - step 17234, loss: 0.316234, best loss: 0.247561 2025-01-16 02:03:28,025 - INFO - step 17235, loss: 0.308540, best loss: 0.247561 2025-01-16 02:03:28,175 - INFO - step 17236, loss: 0.289982, best loss: 0.247561 
2025-01-16 02:03:28,325 - INFO - step 17237, loss: 0.364646, best loss: 0.247561 2025-01-16 02:03:28,475 - INFO - step 17238, loss: 0.412227, best loss: 0.247561 2025-01-16 02:03:28,625 - INFO - step 17239, loss: 0.321093, best loss: 0.247561 2025-01-16 02:03:28,775 - INFO - step 17240, loss: 0.320807, best loss: 0.247561 2025-01-16 02:03:28,926 - INFO - step 17241, loss: 0.297856, best loss: 0.247561 2025-01-16 02:03:29,076 - INFO - step 17242, loss: 0.322438, best loss: 0.247561 2025-01-16 02:03:29,226 - INFO - step 17243, loss: 0.312225, best loss: 0.247561 2025-01-16 02:03:29,376 - INFO - step 17244, loss: 0.302025, best loss: 0.247561 2025-01-16 02:03:29,526 - INFO - step 17245, loss: 0.300620, best loss: 0.247561 2025-01-16 02:03:29,676 - INFO - step 17246, loss: 0.254576, best loss: 0.247561 2025-01-16 02:03:29,826 - INFO - step 17247, loss: 0.311431, best loss: 0.247561 2025-01-16 02:03:29,976 - INFO - step 17248, loss: 0.258793, best loss: 0.247561 2025-01-16 02:03:30,127 - INFO - step 17249, loss: 0.301411, best loss: 0.247561 2025-01-16 02:03:30,277 - INFO - step 17250, loss: 0.272735, best loss: 0.247561 2025-01-16 02:03:30,427 - INFO - step 17251, loss: 0.280194, best loss: 0.247561 2025-01-16 02:03:30,577 - INFO - step 17252, loss: 0.266792, best loss: 0.247561 2025-01-16 02:03:30,727 - INFO - step 17253, loss: 0.323680, best loss: 0.247561 2025-01-16 02:03:30,877 - INFO - step 17254, loss: 0.328798, best loss: 0.247561 2025-01-16 02:03:31,028 - INFO - step 17255, loss: 0.335161, best loss: 0.247561 2025-01-16 02:03:31,178 - INFO - step 17256, loss: 0.305745, best loss: 0.247561 2025-01-16 02:03:31,328 - INFO - step 17257, loss: 0.273684, best loss: 0.247561 2025-01-16 02:03:31,478 - INFO - step 17258, loss: 0.285294, best loss: 0.247561 2025-01-16 02:03:31,629 - INFO - step 17259, loss: 0.274514, best loss: 0.247561 2025-01-16 02:03:31,779 - INFO - step 17260, loss: 0.314943, best loss: 0.247561 2025-01-16 02:03:31,929 - INFO - step 17261, loss: 
0.256594, best loss: 0.247561 2025-01-16 02:03:32,080 - INFO - step 17262, loss: 0.296872, best loss: 0.247561 2025-01-16 02:03:32,230 - INFO - step 17263, loss: 0.333868, best loss: 0.247561 2025-01-16 02:03:32,380 - INFO - step 17264, loss: 0.313691, best loss: 0.247561 2025-01-16 02:03:32,530 - INFO - step 17265, loss: 0.343337, best loss: 0.247561 2025-01-16 02:03:32,680 - INFO - step 17266, loss: 0.384675, best loss: 0.247561 2025-01-16 02:03:32,830 - INFO - step 17267, loss: 0.280287, best loss: 0.247561 2025-01-16 02:03:32,980 - INFO - step 17268, loss: 0.347534, best loss: 0.247561 2025-01-16 02:03:33,130 - INFO - step 17269, loss: 0.408483, best loss: 0.247561 2025-01-16 02:03:33,280 - INFO - step 17270, loss: 0.302576, best loss: 0.247561 2025-01-16 02:03:33,431 - INFO - step 17271, loss: 0.331945, best loss: 0.247561 2025-01-16 02:03:33,581 - INFO - step 17272, loss: 0.271699, best loss: 0.247561 2025-01-16 02:03:33,731 - INFO - step 17273, loss: 0.271082, best loss: 0.247561 2025-01-16 02:03:33,881 - INFO - step 17274, loss: 0.373726, best loss: 0.247561 2025-01-16 02:03:34,031 - INFO - step 17275, loss: 0.322015, best loss: 0.247561 2025-01-16 02:03:34,182 - INFO - step 17276, loss: 0.321002, best loss: 0.247561 2025-01-16 02:03:34,332 - INFO - step 17277, loss: 0.347666, best loss: 0.247561 2025-01-16 02:03:34,482 - INFO - step 17278, loss: 0.259731, best loss: 0.247561 2025-01-16 02:03:34,632 - INFO - step 17279, loss: 0.299586, best loss: 0.247561 2025-01-16 02:03:34,782 - INFO - step 17280, loss: 0.292202, best loss: 0.247561 2025-01-16 02:03:34,933 - INFO - step 17281, loss: 0.278628, best loss: 0.247561 2025-01-16 02:03:35,083 - INFO - step 17282, loss: 0.313769, best loss: 0.247561 2025-01-16 02:03:35,233 - INFO - step 17283, loss: 0.288700, best loss: 0.247561 2025-01-16 02:03:35,383 - INFO - step 17284, loss: 0.344249, best loss: 0.247561 2025-01-16 02:03:35,533 - INFO - step 17285, loss: 0.252362, best loss: 0.247561 2025-01-16 02:03:35,684 - 
INFO - step 17286, loss: 0.251181, best loss: 0.247561 2025-01-16 02:03:35,834 - INFO - step 17287, loss: 0.300080, best loss: 0.247561 2025-01-16 02:03:35,984 - INFO - step 17288, loss: 0.306553, best loss: 0.247561 2025-01-16 02:03:36,134 - INFO - step 17289, loss: 0.322932, best loss: 0.247561 2025-01-16 02:03:36,284 - INFO - step 17290, loss: 0.278487, best loss: 0.247561 2025-01-16 02:03:36,434 - INFO - step 17291, loss: 0.324218, best loss: 0.247561 2025-01-16 02:03:36,584 - INFO - step 17292, loss: 0.343206, best loss: 0.247561 2025-01-16 02:03:36,734 - INFO - step 17293, loss: 0.270300, best loss: 0.247561 2025-01-16 02:03:36,885 - INFO - step 17294, loss: 0.335477, best loss: 0.247561 2025-01-16 02:03:37,035 - INFO - step 17295, loss: 0.310827, best loss: 0.247561 2025-01-16 02:03:40,488 - INFO - step 17296, loss: 0.213462, best loss: 0.213462 2025-01-16 02:03:40,638 - INFO - step 17297, loss: 0.327485, best loss: 0.213462 2025-01-16 02:03:40,788 - INFO - step 17298, loss: 0.349791, best loss: 0.213462 2025-01-16 02:03:40,938 - INFO - step 17299, loss: 0.318941, best loss: 0.213462 2025-01-16 02:03:41,089 - INFO - step 17300, loss: 0.318488, best loss: 0.213462 2025-01-16 02:03:41,239 - INFO - step 17301, loss: 0.263552, best loss: 0.213462 2025-01-16 02:03:41,389 - INFO - step 17302, loss: 0.352881, best loss: 0.213462 2025-01-16 02:03:41,539 - INFO - step 17303, loss: 0.250942, best loss: 0.213462 2025-01-16 02:03:41,689 - INFO - step 17304, loss: 0.331783, best loss: 0.213462 2025-01-16 02:03:41,839 - INFO - step 17305, loss: 0.290144, best loss: 0.213462 2025-01-16 02:03:41,990 - INFO - step 17306, loss: 0.340217, best loss: 0.213462 2025-01-16 02:03:42,140 - INFO - step 17307, loss: 0.314616, best loss: 0.213462 2025-01-16 02:03:42,291 - INFO - step 17308, loss: 0.322491, best loss: 0.213462 2025-01-16 02:03:42,441 - INFO - step 17309, loss: 0.311012, best loss: 0.213462 2025-01-16 02:03:42,591 - INFO - step 17310, loss: 0.297445, best loss: 0.213462 
2025-01-16 02:03:42,741 - INFO - step 17311, loss: 0.288792, best loss: 0.213462 2025-01-16 02:03:42,891 - INFO - step 17312, loss: 0.333504, best loss: 0.213462 2025-01-16 02:03:43,042 - INFO - step 17313, loss: 0.277997, best loss: 0.213462 2025-01-16 02:03:43,192 - INFO - step 17314, loss: 0.329359, best loss: 0.213462 2025-01-16 02:03:43,342 - INFO - step 17315, loss: 0.327898, best loss: 0.213462 2025-01-16 02:03:43,492 - INFO - step 17316, loss: 0.339120, best loss: 0.213462 2025-01-16 02:03:43,642 - INFO - step 17317, loss: 0.279220, best loss: 0.213462 2025-01-16 02:03:43,792 - INFO - step 17318, loss: 0.324251, best loss: 0.213462 2025-01-16 02:03:43,943 - INFO - step 17319, loss: 0.296412, best loss: 0.213462 2025-01-16 02:03:44,093 - INFO - step 17320, loss: 0.309405, best loss: 0.213462 2025-01-16 02:03:44,243 - INFO - step 17321, loss: 0.288096, best loss: 0.213462 2025-01-16 02:03:44,393 - INFO - step 17322, loss: 0.276733, best loss: 0.213462 2025-01-16 02:03:44,543 - INFO - step 17323, loss: 0.283787, best loss: 0.213462 2025-01-16 02:03:44,694 - INFO - step 17324, loss: 0.231539, best loss: 0.213462 2025-01-16 02:03:44,844 - INFO - step 17325, loss: 0.250512, best loss: 0.213462 2025-01-16 02:03:44,994 - INFO - step 17326, loss: 0.341469, best loss: 0.213462 2025-01-16 02:03:45,144 - INFO - step 17327, loss: 0.298885, best loss: 0.213462 2025-01-16 02:03:45,294 - INFO - step 17328, loss: 0.236649, best loss: 0.213462 2025-01-16 02:03:45,444 - INFO - step 17329, loss: 0.300264, best loss: 0.213462 2025-01-16 02:03:45,594 - INFO - step 17330, loss: 0.267323, best loss: 0.213462 2025-01-16 02:03:45,745 - INFO - step 17331, loss: 0.275642, best loss: 0.213462 2025-01-16 02:03:45,895 - INFO - step 17332, loss: 0.287436, best loss: 0.213462 2025-01-16 02:03:46,045 - INFO - step 17333, loss: 0.308744, best loss: 0.213462 2025-01-16 02:03:46,195 - INFO - step 17334, loss: 0.281014, best loss: 0.213462 2025-01-16 02:03:46,345 - INFO - step 17335, loss: 
0.287172, best loss: 0.213462 2025-01-16 02:03:46,496 - INFO - step 17336, loss: 0.322043, best loss: 0.213462 2025-01-16 02:03:46,646 - INFO - step 17337, loss: 0.267475, best loss: 0.213462 2025-01-16 02:03:46,796 - INFO - step 17338, loss: 0.299731, best loss: 0.213462 2025-01-16 02:03:46,946 - INFO - step 17339, loss: 0.278213, best loss: 0.213462 2025-01-16 02:03:47,096 - INFO - step 17340, loss: 0.336554, best loss: 0.213462 2025-01-16 02:03:47,246 - INFO - step 17341, loss: 0.295364, best loss: 0.213462 2025-01-16 02:03:47,396 - INFO - step 17342, loss: 0.287424, best loss: 0.213462 2025-01-16 02:03:47,546 - INFO - step 17343, loss: 0.391527, best loss: 0.213462 2025-01-16 02:03:47,696 - INFO - step 17344, loss: 0.320440, best loss: 0.213462 2025-01-16 02:03:47,846 - INFO - step 17345, loss: 0.303417, best loss: 0.213462 2025-01-16 02:03:47,997 - INFO - step 17346, loss: 0.314782, best loss: 0.213462 2025-01-16 02:03:48,146 - INFO - step 17347, loss: 0.374085, best loss: 0.213462 2025-01-16 02:03:48,297 - INFO - step 17348, loss: 0.302691, best loss: 0.213462 2025-01-16 02:03:48,446 - INFO - step 17349, loss: 0.342816, best loss: 0.213462 2025-01-16 02:03:48,597 - INFO - step 17350, loss: 0.286616, best loss: 0.213462 2025-01-16 02:03:48,746 - INFO - step 17351, loss: 0.337850, best loss: 0.213462 2025-01-16 02:03:48,897 - INFO - step 17352, loss: 0.337378, best loss: 0.213462 2025-01-16 02:03:49,047 - INFO - step 17353, loss: 0.350067, best loss: 0.213462 2025-01-16 02:03:49,197 - INFO - step 17354, loss: 0.393311, best loss: 0.213462 2025-01-16 02:03:49,347 - INFO - step 17355, loss: 0.384295, best loss: 0.213462 2025-01-16 02:03:49,497 - INFO - step 17356, loss: 0.349757, best loss: 0.213462 2025-01-16 02:03:49,648 - INFO - step 17357, loss: 0.347735, best loss: 0.213462 2025-01-16 02:03:49,798 - INFO - step 17358, loss: 0.336119, best loss: 0.213462 2025-01-16 02:03:49,948 - INFO - step 17359, loss: 0.264326, best loss: 0.213462 2025-01-16 02:03:50,098 - 
INFO - step 17360, loss: 0.290626, best loss: 0.213462 2025-01-16 02:03:50,248 - INFO - step 17361, loss: 0.348583, best loss: 0.213462 2025-01-16 02:03:50,398 - INFO - step 17362, loss: 0.406880, best loss: 0.213462 2025-01-16 02:03:50,548 - INFO - step 17363, loss: 0.328212, best loss: 0.213462 2025-01-16 02:03:50,698 - INFO - step 17364, loss: 0.306809, best loss: 0.213462 2025-01-16 02:03:50,848 - INFO - step 17365, loss: 0.323722, best loss: 0.213462 2025-01-16 02:03:50,998 - INFO - step 17366, loss: 0.312979, best loss: 0.213462 2025-01-16 02:03:51,148 - INFO - step 17367, loss: 0.284746, best loss: 0.213462 2025-01-16 02:03:51,298 - INFO - step 17368, loss: 0.266328, best loss: 0.213462 2025-01-16 02:03:51,448 - INFO - step 17369, loss: 0.349871, best loss: 0.213462 2025-01-16 02:03:51,598 - INFO - step 17370, loss: 0.292277, best loss: 0.213462 2025-01-16 02:03:51,749 - INFO - step 17371, loss: 0.299259, best loss: 0.213462 2025-01-16 02:03:51,899 - INFO - step 17372, loss: 0.246597, best loss: 0.213462 2025-01-16 02:03:52,049 - INFO - step 17373, loss: 0.317840, best loss: 0.213462 2025-01-16 02:03:52,199 - INFO - step 17374, loss: 0.325830, best loss: 0.213462 2025-01-16 02:03:52,349 - INFO - step 17375, loss: 0.328375, best loss: 0.213462 2025-01-16 02:03:52,499 - INFO - step 17376, loss: 0.391340, best loss: 0.213462 2025-01-16 02:03:52,649 - INFO - step 17377, loss: 0.302670, best loss: 0.213462 2025-01-16 02:03:52,799 - INFO - step 17378, loss: 0.335983, best loss: 0.213462 2025-01-16 02:03:52,949 - INFO - step 17379, loss: 0.323423, best loss: 0.213462 2025-01-16 02:03:53,099 - INFO - step 17380, loss: 0.322682, best loss: 0.213462 2025-01-16 02:03:53,249 - INFO - step 17381, loss: 0.354579, best loss: 0.213462 2025-01-16 02:03:53,399 - INFO - step 17382, loss: 0.334004, best loss: 0.213462 2025-01-16 02:03:53,549 - INFO - step 17383, loss: 0.393979, best loss: 0.213462 2025-01-16 02:03:53,699 - INFO - step 17384, loss: 0.386424, best loss: 0.213462 
2025-01-16 02:03:53,850 - INFO - step 17385, loss: 0.309665, best loss: 0.213462 2025-01-16 02:03:54,000 - INFO - step 17386, loss: 0.343777, best loss: 0.213462 2025-01-16 02:03:54,150 - INFO - step 17387, loss: 0.326006, best loss: 0.213462 2025-01-16 02:03:54,300 - INFO - step 17388, loss: 0.257142, best loss: 0.213462 2025-01-16 02:03:54,450 - INFO - step 17389, loss: 0.372422, best loss: 0.213462 2025-01-16 02:03:54,600 - INFO - step 17390, loss: 0.305611, best loss: 0.213462 2025-01-16 02:03:54,750 - INFO - step 17391, loss: 0.287965, best loss: 0.213462 2025-01-16 02:03:54,900 - INFO - step 17392, loss: 0.327109, best loss: 0.213462 2025-01-16 02:03:55,051 - INFO - step 17393, loss: 0.337036, best loss: 0.213462 2025-01-16 02:03:55,201 - INFO - step 17394, loss: 0.345600, best loss: 0.213462 2025-01-16 02:03:55,351 - INFO - step 17395, loss: 0.274197, best loss: 0.213462 2025-01-16 02:03:55,501 - INFO - step 17396, loss: 0.319494, best loss: 0.213462 2025-01-16 02:03:55,651 - INFO - step 17397, loss: 0.242716, best loss: 0.213462 2025-01-16 02:03:55,801 - INFO - step 17398, loss: 0.338154, best loss: 0.213462 2025-01-16 02:03:55,951 - INFO - step 17399, loss: 0.327439, best loss: 0.213462 2025-01-16 02:03:56,101 - INFO - step 17400, loss: 0.391550, best loss: 0.213462 2025-01-16 02:03:56,251 - INFO - step 17401, loss: 0.328138, best loss: 0.213462 2025-01-16 02:03:56,401 - INFO - step 17402, loss: 0.283962, best loss: 0.213462 2025-01-16 02:03:56,551 - INFO - step 17403, loss: 0.335582, best loss: 0.213462 2025-01-16 02:03:56,701 - INFO - step 17404, loss: 0.322870, best loss: 0.213462 2025-01-16 02:03:56,851 - INFO - step 17405, loss: 0.321538, best loss: 0.213462 2025-01-16 02:03:57,001 - INFO - step 17406, loss: 0.272702, best loss: 0.213462 2025-01-16 02:03:57,151 - INFO - step 17407, loss: 0.323327, best loss: 0.213462 2025-01-16 02:03:57,301 - INFO - step 17408, loss: 0.276679, best loss: 0.213462 2025-01-16 02:03:57,451 - INFO - step 17409, loss: 
0.215738, best loss: 0.213462 2025-01-16 02:03:57,601 - INFO - step 17410, loss: 0.303781, best loss: 0.213462 2025-01-16 02:03:57,751 - INFO - step 17411, loss: 0.327040, best loss: 0.213462 2025-01-16 02:03:57,902 - INFO - step 17412, loss: 0.298026, best loss: 0.213462 2025-01-16 02:03:58,052 - INFO - step 17413, loss: 0.338713, best loss: 0.213462 2025-01-16 02:03:58,202 - INFO - step 17414, loss: 0.316464, best loss: 0.213462 2025-01-16 02:03:58,352 - INFO - step 17415, loss: 0.255094, best loss: 0.213462 2025-01-16 02:03:58,502 - INFO - step 17416, loss: 0.272199, best loss: 0.213462 2025-01-16 02:03:58,652 - INFO - step 17417, loss: 0.305682, best loss: 0.213462 2025-01-16 02:03:58,802 - INFO - step 17418, loss: 0.263369, best loss: 0.213462 2025-01-16 02:03:58,952 - INFO - step 17419, loss: 0.328531, best loss: 0.213462 2025-01-16 02:03:59,102 - INFO - step 17420, loss: 0.291075, best loss: 0.213462 2025-01-16 02:03:59,253 - INFO - step 17421, loss: 0.338108, best loss: 0.213462 2025-01-16 02:03:59,403 - INFO - step 17422, loss: 0.306118, best loss: 0.213462 2025-01-16 02:03:59,553 - INFO - step 17423, loss: 0.279478, best loss: 0.213462 2025-01-16 02:03:59,703 - INFO - step 17424, loss: 0.259727, best loss: 0.213462 2025-01-16 02:03:59,853 - INFO - step 17425, loss: 0.319676, best loss: 0.213462 2025-01-16 02:04:00,003 - INFO - step 17426, loss: 0.358919, best loss: 0.213462 2025-01-16 02:04:00,153 - INFO - step 17427, loss: 0.300602, best loss: 0.213462 2025-01-16 02:04:00,303 - INFO - step 17428, loss: 0.272304, best loss: 0.213462 2025-01-16 02:04:00,453 - INFO - step 17429, loss: 0.271445, best loss: 0.213462 2025-01-16 02:04:00,603 - INFO - step 17430, loss: 0.311027, best loss: 0.213462 2025-01-16 02:04:00,753 - INFO - step 17431, loss: 0.287687, best loss: 0.213462 2025-01-16 02:04:00,903 - INFO - step 17432, loss: 0.293076, best loss: 0.213462 2025-01-16 02:04:01,053 - INFO - step 17433, loss: 0.262793, best loss: 0.213462 2025-01-16 02:04:01,203 - 
INFO - step 17434, loss: 0.304009, best loss: 0.213462 2025-01-16 02:04:01,353 - INFO - step 17435, loss: 0.307998, best loss: 0.213462 2025-01-16 02:04:01,503 - INFO - step 17436, loss: 0.275051, best loss: 0.213462 2025-01-16 02:04:01,653 - INFO - step 17437, loss: 0.305780, best loss: 0.213462 2025-01-16 02:04:01,804 - INFO - step 17438, loss: 0.282064, best loss: 0.213462 2025-01-16 02:04:01,954 - INFO - step 17439, loss: 0.354045, best loss: 0.213462 2025-01-16 02:04:02,104 - INFO - step 17440, loss: 0.269214, best loss: 0.213462 2025-01-16 02:04:02,254 - INFO - step 17441, loss: 0.327744, best loss: 0.213462 2025-01-16 02:04:02,404 - INFO - step 17442, loss: 0.326765, best loss: 0.213462 2025-01-16 02:04:02,554 - INFO - step 17443, loss: 0.309589, best loss: 0.213462 2025-01-16 02:04:02,704 - INFO - step 17444, loss: 0.288040, best loss: 0.213462 2025-01-16 02:04:02,854 - INFO - step 17445, loss: 0.296488, best loss: 0.213462 2025-01-16 02:04:03,004 - INFO - step 17446, loss: 0.273252, best loss: 0.213462 2025-01-16 02:04:03,154 - INFO - step 17447, loss: 0.263772, best loss: 0.213462 2025-01-16 02:04:03,305 - INFO - step 17448, loss: 0.263439, best loss: 0.213462 2025-01-16 02:04:03,455 - INFO - step 17449, loss: 0.271644, best loss: 0.213462 2025-01-16 02:04:03,605 - INFO - step 17450, loss: 0.330913, best loss: 0.213462 2025-01-16 02:04:03,755 - INFO - step 17451, loss: 0.252240, best loss: 0.213462 2025-01-16 02:04:03,905 - INFO - step 17452, loss: 0.253014, best loss: 0.213462 2025-01-16 02:04:04,055 - INFO - step 17453, loss: 0.251222, best loss: 0.213462 2025-01-16 02:04:04,205 - INFO - step 17454, loss: 0.268626, best loss: 0.213462 2025-01-16 02:04:04,355 - INFO - step 17455, loss: 0.303697, best loss: 0.213462 2025-01-16 02:04:04,505 - INFO - step 17456, loss: 0.285382, best loss: 0.213462 2025-01-16 02:04:04,656 - INFO - step 17457, loss: 0.277782, best loss: 0.213462 2025-01-16 02:04:04,806 - INFO - step 17458, loss: 0.309524, best loss: 0.213462 
2025-01-16 02:04:04,956 - INFO - step 17459, loss: 0.303310, best loss: 0.213462 2025-01-16 02:04:05,106 - INFO - step 17460, loss: 0.354878, best loss: 0.213462 2025-01-16 02:04:05,256 - INFO - step 17461, loss: 0.298150, best loss: 0.213462 2025-01-16 02:04:05,406 - INFO - step 17462, loss: 0.250092, best loss: 0.213462 2025-01-16 02:04:05,556 - INFO - step 17463, loss: 0.298520, best loss: 0.213462 2025-01-16 02:04:05,706 - INFO - step 17464, loss: 0.264876, best loss: 0.213462 2025-01-16 02:04:05,856 - INFO - step 17465, loss: 0.282023, best loss: 0.213462 2025-01-16 02:04:06,006 - INFO - step 17466, loss: 0.280010, best loss: 0.213462 2025-01-16 02:04:06,157 - INFO - step 17467, loss: 0.296130, best loss: 0.213462 2025-01-16 02:04:06,307 - INFO - step 17468, loss: 0.309802, best loss: 0.213462 2025-01-16 02:04:06,457 - INFO - step 17469, loss: 0.322221, best loss: 0.213462 2025-01-16 02:04:06,607 - INFO - step 17470, loss: 0.261206, best loss: 0.213462 2025-01-16 02:04:06,757 - INFO - step 17471, loss: 0.298976, best loss: 0.213462 2025-01-16 02:04:06,907 - INFO - step 17472, loss: 0.291202, best loss: 0.213462 2025-01-16 02:04:07,057 - INFO - step 17473, loss: 0.311227, best loss: 0.213462 2025-01-16 02:04:07,207 - INFO - step 17474, loss: 0.280522, best loss: 0.213462 2025-01-16 02:04:07,357 - INFO - step 17475, loss: 0.313498, best loss: 0.213462 2025-01-16 02:04:07,507 - INFO - step 17476, loss: 0.276048, best loss: 0.213462 2025-01-16 02:04:07,657 - INFO - step 17477, loss: 0.292886, best loss: 0.213462 2025-01-16 02:04:07,807 - INFO - step 17478, loss: 0.302979, best loss: 0.213462 2025-01-16 02:04:07,957 - INFO - step 17479, loss: 0.343015, best loss: 0.213462 2025-01-16 02:04:08,107 - INFO - step 17480, loss: 0.316113, best loss: 0.213462 2025-01-16 02:04:08,258 - INFO - step 17481, loss: 0.292645, best loss: 0.213462 2025-01-16 02:04:08,408 - INFO - step 17482, loss: 0.288052, best loss: 0.213462 2025-01-16 02:04:08,558 - INFO - step 17483, loss: 
0.278089, best loss: 0.213462 2025-01-16 02:04:08,708 - INFO - step 17484, loss: 0.248489, best loss: 0.213462 2025-01-16 02:04:08,858 - INFO - step 17485, loss: 0.270406, best loss: 0.213462 2025-01-16 02:04:09,008 - INFO - step 17486, loss: 0.305884, best loss: 0.213462 2025-01-16 02:04:09,158 - INFO - step 17487, loss: 0.252063, best loss: 0.213462 2025-01-16 02:04:09,308 - INFO - step 17488, loss: 0.286803, best loss: 0.213462 2025-01-16 02:04:09,458 - INFO - step 17489, loss: 0.230950, best loss: 0.213462 2025-01-16 02:04:09,608 - INFO - step 17490, loss: 0.336339, best loss: 0.213462 2025-01-16 02:04:09,758 - INFO - step 17491, loss: 0.289249, best loss: 0.213462 2025-01-16 02:04:09,909 - INFO - step 17492, loss: 0.287531, best loss: 0.213462 2025-01-16 02:04:10,058 - INFO - step 17493, loss: 0.256593, best loss: 0.213462 2025-01-16 02:04:10,209 - INFO - step 17494, loss: 0.290382, best loss: 0.213462 2025-01-16 02:04:10,359 - INFO - step 17495, loss: 0.305339, best loss: 0.213462 2025-01-16 02:04:10,509 - INFO - step 17496, loss: 0.291349, best loss: 0.213462 2025-01-16 02:04:10,659 - INFO - step 17497, loss: 0.310255, best loss: 0.213462 2025-01-16 02:04:10,809 - INFO - step 17498, loss: 0.381963, best loss: 0.213462 2025-01-16 02:04:10,958 - INFO - step 17499, loss: 0.318579, best loss: 0.213462 2025-01-16 02:04:11,109 - INFO - step 17500, loss: 0.335761, best loss: 0.213462 2025-01-16 02:04:11,259 - INFO - step 17501, loss: 0.316503, best loss: 0.213462 2025-01-16 02:04:11,409 - INFO - step 17502, loss: 0.249290, best loss: 0.213462 2025-01-16 02:04:11,559 - INFO - step 17503, loss: 0.322994, best loss: 0.213462 2025-01-16 02:04:11,709 - INFO - step 17504, loss: 0.324542, best loss: 0.213462 2025-01-16 02:04:11,859 - INFO - step 17505, loss: 0.248465, best loss: 0.213462 2025-01-16 02:04:12,009 - INFO - step 17506, loss: 0.364308, best loss: 0.213462 2025-01-16 02:04:12,159 - INFO - step 17507, loss: 0.323493, best loss: 0.213462 2025-01-16 02:04:12,309 - 
INFO - step 17508, loss: 0.283496, best loss: 0.213462 2025-01-16 02:04:12,459 - INFO - step 17509, loss: 0.295494, best loss: 0.213462 2025-01-16 02:04:12,609 - INFO - step 17510, loss: 0.322032, best loss: 0.213462 2025-01-16 02:04:12,759 - INFO - step 17511, loss: 0.272994, best loss: 0.213462 2025-01-16 02:04:12,909 - INFO - step 17512, loss: 0.339736, best loss: 0.213462 2025-01-16 02:04:13,059 - INFO - step 17513, loss: 0.319329, best loss: 0.213462 2025-01-16 02:04:13,210 - INFO - step 17514, loss: 0.315888, best loss: 0.213462 2025-01-16 02:04:13,360 - INFO - step 17515, loss: 0.424095, best loss: 0.213462 2025-01-16 02:04:13,510 - INFO - step 17516, loss: 0.332333, best loss: 0.213462 2025-01-16 02:04:13,660 - INFO - step 17517, loss: 0.293179, best loss: 0.213462 2025-01-16 02:04:13,810 - INFO - step 17518, loss: 0.349632, best loss: 0.213462 2025-01-16 02:04:13,960 - INFO - step 17519, loss: 0.336205, best loss: 0.213462 2025-01-16 02:04:14,110 - INFO - step 17520, loss: 0.328495, best loss: 0.213462 2025-01-16 02:04:14,260 - INFO - step 17521, loss: 0.340755, best loss: 0.213462 2025-01-16 02:04:14,410 - INFO - step 17522, loss: 0.311732, best loss: 0.213462 2025-01-16 02:04:14,560 - INFO - step 17523, loss: 0.323957, best loss: 0.213462 2025-01-16 02:04:14,710 - INFO - step 17524, loss: 0.353917, best loss: 0.213462 2025-01-16 02:04:14,860 - INFO - step 17525, loss: 0.291299, best loss: 0.213462 2025-01-16 02:04:15,010 - INFO - step 17526, loss: 0.376351, best loss: 0.213462 2025-01-16 02:04:15,160 - INFO - step 17527, loss: 0.345853, best loss: 0.213462 2025-01-16 02:04:15,310 - INFO - step 17528, loss: 0.346267, best loss: 0.213462 2025-01-16 02:04:15,461 - INFO - step 17529, loss: 0.327452, best loss: 0.213462 2025-01-16 02:04:15,611 - INFO - step 17530, loss: 0.355318, best loss: 0.213462 2025-01-16 02:04:15,761 - INFO - step 17531, loss: 0.306521, best loss: 0.213462 2025-01-16 02:04:15,912 - INFO - step 17532, loss: 0.333719, best loss: 0.213462 
2025-01-16 02:04:16,062 - INFO - step 17533, loss: 0.333751, best loss: 0.213462 2025-01-16 02:04:16,212 - INFO - step 17534, loss: 0.305800, best loss: 0.213462 2025-01-16 02:04:16,363 - INFO - step 17535, loss: 0.267283, best loss: 0.213462 2025-01-16 02:04:16,513 - INFO - step 17536, loss: 0.385708, best loss: 0.213462 2025-01-16 02:04:16,663 - INFO - step 17537, loss: 0.319217, best loss: 0.213462 2025-01-16 02:04:16,813 - INFO - step 17538, loss: 0.307299, best loss: 0.213462 2025-01-16 02:04:16,963 - INFO - step 17539, loss: 0.316301, best loss: 0.213462 2025-01-16 02:04:17,113 - INFO - step 17540, loss: 0.326616, best loss: 0.213462 2025-01-16 02:04:17,263 - INFO - step 17541, loss: 0.297020, best loss: 0.213462 2025-01-16 02:04:17,413 - INFO - step 17542, loss: 0.314244, best loss: 0.213462 2025-01-16 02:04:17,563 - INFO - step 17543, loss: 0.355939, best loss: 0.213462 2025-01-16 02:04:17,714 - INFO - step 17544, loss: 0.270336, best loss: 0.213462 2025-01-16 02:04:17,864 - INFO - step 17545, loss: 0.282188, best loss: 0.213462 2025-01-16 02:04:18,014 - INFO - step 17546, loss: 0.297142, best loss: 0.213462 2025-01-16 02:04:18,164 - INFO - step 17547, loss: 0.298708, best loss: 0.213462 2025-01-16 02:04:18,314 - INFO - step 17548, loss: 0.288293, best loss: 0.213462 2025-01-16 02:04:18,464 - INFO - step 17549, loss: 0.344849, best loss: 0.213462 2025-01-16 02:04:18,614 - INFO - step 17550, loss: 0.296981, best loss: 0.213462 2025-01-16 02:04:18,764 - INFO - step 17551, loss: 0.324179, best loss: 0.213462 2025-01-16 02:04:18,914 - INFO - step 17552, loss: 0.370543, best loss: 0.213462 2025-01-16 02:04:19,064 - INFO - step 17553, loss: 0.263569, best loss: 0.213462 2025-01-16 02:04:19,214 - INFO - step 17554, loss: 0.415442, best loss: 0.213462 2025-01-16 02:04:19,364 - INFO - step 17555, loss: 0.248489, best loss: 0.213462 2025-01-16 02:04:19,514 - INFO - step 17556, loss: 0.357624, best loss: 0.213462 2025-01-16 02:04:19,664 - INFO - step 17557, loss: 
0.293529, best loss: 0.213462 2025-01-16 02:04:19,814 - INFO - step 17558, loss: 0.310140, best loss: 0.213462 2025-01-16 02:04:19,964 - INFO - step 17559, loss: 0.322398, best loss: 0.213462 2025-01-16 02:04:20,115 - INFO - step 17560, loss: 0.256831, best loss: 0.213462 2025-01-16 02:04:20,265 - INFO - step 17561, loss: 0.335567, best loss: 0.213462 2025-01-16 02:04:20,415 - INFO - step 17562, loss: 0.322827, best loss: 0.213462 2025-01-16 02:04:20,565 - INFO - step 17563, loss: 0.318777, best loss: 0.213462 2025-01-16 02:04:20,715 - INFO - step 17564, loss: 0.288104, best loss: 0.213462 2025-01-16 02:04:20,865 - INFO - step 17565, loss: 0.306717, best loss: 0.213462 2025-01-16 02:04:21,016 - INFO - step 17566, loss: 0.283882, best loss: 0.213462 2025-01-16 02:04:21,166 - INFO - step 17567, loss: 0.315869, best loss: 0.213462 2025-01-16 02:04:21,316 - INFO - step 17568, loss: 0.273791, best loss: 0.213462 2025-01-16 02:04:21,466 - INFO - step 17569, loss: 0.283899, best loss: 0.213462 2025-01-16 02:04:21,616 - INFO - step 17570, loss: 0.327895, best loss: 0.213462 2025-01-16 02:04:21,766 - INFO - step 17571, loss: 0.338144, best loss: 0.213462 2025-01-16 02:04:21,916 - INFO - step 17572, loss: 0.232424, best loss: 0.213462 2025-01-16 02:04:22,066 - INFO - step 17573, loss: 0.262778, best loss: 0.213462 2025-01-16 02:04:22,217 - INFO - step 17574, loss: 0.299536, best loss: 0.213462 2025-01-16 02:04:22,367 - INFO - step 17575, loss: 0.314726, best loss: 0.213462 2025-01-16 02:04:22,517 - INFO - step 17576, loss: 0.236277, best loss: 0.213462 2025-01-16 02:04:22,667 - INFO - step 17577, loss: 0.296514, best loss: 0.213462 2025-01-16 02:04:22,817 - INFO - step 17578, loss: 0.273657, best loss: 0.213462 2025-01-16 02:04:22,967 - INFO - step 17579, loss: 0.238576, best loss: 0.213462 2025-01-16 02:04:23,117 - INFO - step 17580, loss: 0.259711, best loss: 0.213462 2025-01-16 02:04:23,267 - INFO - step 17581, loss: 0.314120, best loss: 0.213462 2025-01-16 02:04:23,418 - 
INFO - step 17582, loss: 0.297301, best loss: 0.213462 2025-01-16 02:04:23,568 - INFO - step 17583, loss: 0.287294, best loss: 0.213462 2025-01-16 02:04:23,718 - INFO - step 17584, loss: 0.243715, best loss: 0.213462 2025-01-16 02:04:23,868 - INFO - step 17585, loss: 0.330037, best loss: 0.213462 2025-01-16 02:04:24,018 - INFO - step 17586, loss: 0.326398, best loss: 0.213462 2025-01-16 02:04:24,168 - INFO - step 17587, loss: 0.272648, best loss: 0.213462 2025-01-16 02:04:24,318 - INFO - step 17588, loss: 0.273264, best loss: 0.213462 2025-01-16 02:04:24,468 - INFO - step 17589, loss: 0.286649, best loss: 0.213462 2025-01-16 02:04:24,618 - INFO - step 17590, loss: 0.311475, best loss: 0.213462 2025-01-16 02:04:24,769 - INFO - step 17591, loss: 0.317074, best loss: 0.213462 2025-01-16 02:04:24,919 - INFO - step 17592, loss: 0.272731, best loss: 0.213462 2025-01-16 02:04:25,069 - INFO - step 17593, loss: 0.330835, best loss: 0.213462 2025-01-16 02:04:25,219 - INFO - step 17594, loss: 0.269876, best loss: 0.213462 2025-01-16 02:04:25,369 - INFO - step 17595, loss: 0.288232, best loss: 0.213462 2025-01-16 02:04:25,519 - INFO - step 17596, loss: 0.326248, best loss: 0.213462 2025-01-16 02:04:25,669 - INFO - step 17597, loss: 0.299198, best loss: 0.213462 2025-01-16 02:04:25,819 - INFO - step 17598, loss: 0.258342, best loss: 0.213462 2025-01-16 02:04:25,969 - INFO - step 17599, loss: 0.297062, best loss: 0.213462 2025-01-16 02:04:26,119 - INFO - step 17600, loss: 0.283057, best loss: 0.213462 2025-01-16 02:04:26,269 - INFO - step 17601, loss: 0.278990, best loss: 0.213462 2025-01-16 02:04:26,419 - INFO - step 17602, loss: 0.256245, best loss: 0.213462 2025-01-16 02:04:26,569 - INFO - step 17603, loss: 0.268951, best loss: 0.213462 2025-01-16 02:04:26,719 - INFO - step 17604, loss: 0.315348, best loss: 0.213462 2025-01-16 02:04:26,870 - INFO - step 17605, loss: 0.305869, best loss: 0.213462 2025-01-16 02:04:27,020 - INFO - step 17606, loss: 0.285933, best loss: 0.213462 
2025-01-16 02:04:27,170 - INFO - step 17607, loss: 0.288679, best loss: 0.213462 2025-01-16 02:04:27,320 - INFO - step 17608, loss: 0.240044, best loss: 0.213462 2025-01-16 02:04:27,470 - INFO - step 17609, loss: 0.277841, best loss: 0.213462 2025-01-16 02:04:27,620 - INFO - step 17610, loss: 0.276629, best loss: 0.213462 2025-01-16 02:04:27,770 - INFO - step 17611, loss: 0.282963, best loss: 0.213462 2025-01-16 02:04:27,920 - INFO - step 17612, loss: 0.312682, best loss: 0.213462 2025-01-16 02:04:28,070 - INFO - step 17613, loss: 0.270613, best loss: 0.213462 2025-01-16 02:04:28,220 - INFO - step 17614, loss: 0.275947, best loss: 0.213462 2025-01-16 02:04:28,370 - INFO - step 17615, loss: 0.235364, best loss: 0.213462 2025-01-16 02:04:28,520 - INFO - step 17616, loss: 0.282238, best loss: 0.213462 2025-01-16 02:04:28,671 - INFO - step 17617, loss: 0.320878, best loss: 0.213462 2025-01-16 02:04:28,821 - INFO - step 17618, loss: 0.296269, best loss: 0.213462 2025-01-16 02:04:28,971 - INFO - step 17619, loss: 0.289011, best loss: 0.213462 2025-01-16 02:04:29,121 - INFO - step 17620, loss: 0.262584, best loss: 0.213462 2025-01-16 02:04:29,271 - INFO - step 17621, loss: 0.283386, best loss: 0.213462 2025-01-16 02:04:29,421 - INFO - step 17622, loss: 0.276064, best loss: 0.213462 2025-01-16 02:04:29,571 - INFO - step 17623, loss: 0.369305, best loss: 0.213462 2025-01-16 02:04:29,721 - INFO - step 17624, loss: 0.294352, best loss: 0.213462 2025-01-16 02:04:29,871 - INFO - step 17625, loss: 0.280721, best loss: 0.213462 2025-01-16 02:04:33,405 - INFO - step 17626, loss: 0.203190, best loss: 0.203190 2025-01-16 02:04:33,568 - INFO - step 17627, loss: 0.301401, best loss: 0.203190 2025-01-16 02:04:33,722 - INFO - step 17628, loss: 0.263863, best loss: 0.203190 2025-01-16 02:04:33,873 - INFO - step 17629, loss: 0.296696, best loss: 0.203190 2025-01-16 02:04:34,023 - INFO - step 17630, loss: 0.289113, best loss: 0.203190 2025-01-16 02:04:34,173 - INFO - step 17631, loss: 
0.263186, best loss: 0.203190 2025-01-16 02:04:34,322 - INFO - step 17632, loss: 0.306059, best loss: 0.203190 2025-01-16 02:04:34,473 - INFO - step 17633, loss: 0.263611, best loss: 0.203190 2025-01-16 02:04:34,623 - INFO - step 17634, loss: 0.303282, best loss: 0.203190 2025-01-16 02:04:34,773 - INFO - step 17635, loss: 0.299213, best loss: 0.203190 2025-01-16 02:04:34,923 - INFO - step 17636, loss: 0.335100, best loss: 0.203190 2025-01-16 02:04:35,073 - INFO - step 17637, loss: 0.305200, best loss: 0.203190 2025-01-16 02:04:35,223 - INFO - step 17638, loss: 0.249927, best loss: 0.203190 2025-01-16 02:04:35,374 - INFO - step 17639, loss: 0.291066, best loss: 0.203190 2025-01-16 02:04:35,524 - INFO - step 17640, loss: 0.330722, best loss: 0.203190 2025-01-16 02:04:35,674 - INFO - step 17641, loss: 0.259175, best loss: 0.203190 2025-01-16 02:04:35,824 - INFO - step 17642, loss: 0.310562, best loss: 0.203190 2025-01-16 02:04:35,974 - INFO - step 17643, loss: 0.314080, best loss: 0.203190 2025-01-16 02:04:36,124 - INFO - step 17644, loss: 0.350171, best loss: 0.203190 2025-01-16 02:04:36,274 - INFO - step 17645, loss: 0.278853, best loss: 0.203190 2025-01-16 02:04:36,424 - INFO - step 17646, loss: 0.296787, best loss: 0.203190 2025-01-16 02:04:36,574 - INFO - step 17647, loss: 0.299281, best loss: 0.203190 2025-01-16 02:04:36,724 - INFO - step 17648, loss: 0.348282, best loss: 0.203190 2025-01-16 02:04:36,874 - INFO - step 17649, loss: 0.294818, best loss: 0.203190 2025-01-16 02:04:37,024 - INFO - step 17650, loss: 0.301616, best loss: 0.203190 2025-01-16 02:04:37,174 - INFO - step 17651, loss: 0.258068, best loss: 0.203190 2025-01-16 02:04:37,324 - INFO - step 17652, loss: 0.297112, best loss: 0.203190 2025-01-16 02:04:37,475 - INFO - step 17653, loss: 0.288294, best loss: 0.203190 2025-01-16 02:04:37,625 - INFO - step 17654, loss: 0.258331, best loss: 0.203190 2025-01-16 02:04:37,775 - INFO - step 17655, loss: 0.252735, best loss: 0.203190 2025-01-16 02:04:37,925 - 
INFO - step 17656, loss: 0.304272, best loss: 0.203190 2025-01-16 02:04:38,075 - INFO - step 17657, loss: 0.296867, best loss: 0.203190 2025-01-16 02:04:38,225 - INFO - step 17658, loss: 0.267549, best loss: 0.203190 2025-01-16 02:04:38,375 - INFO - step 17659, loss: 0.271792, best loss: 0.203190 2025-01-16 02:04:38,525 - INFO - step 17660, loss: 0.245318, best loss: 0.203190 2025-01-16 02:04:38,675 - INFO - step 17661, loss: 0.231623, best loss: 0.203190 2025-01-16 02:04:38,825 - INFO - step 17662, loss: 0.313569, best loss: 0.203190 2025-01-16 02:04:38,975 - INFO - step 17663, loss: 0.303307, best loss: 0.203190 2025-01-16 02:04:39,125 - INFO - step 17664, loss: 0.275351, best loss: 0.203190 2025-01-16 02:04:39,275 - INFO - step 17665, loss: 0.288176, best loss: 0.203190 2025-01-16 02:04:39,425 - INFO - step 17666, loss: 0.274223, best loss: 0.203190 2025-01-16 02:04:39,576 - INFO - step 17667, loss: 0.302935, best loss: 0.203190 2025-01-16 02:04:39,726 - INFO - step 17668, loss: 0.279076, best loss: 0.203190 2025-01-16 02:04:39,916 - INFO - step 17669, loss: 0.258374, best loss: 0.203190 2025-01-16 02:04:40,066 - INFO - step 17670, loss: 0.329497, best loss: 0.203190 2025-01-16 02:04:40,216 - INFO - step 17671, loss: 0.275297, best loss: 0.203190 2025-01-16 02:04:40,366 - INFO - step 17672, loss: 0.259087, best loss: 0.203190 2025-01-16 02:04:40,516 - INFO - step 17673, loss: 0.380934, best loss: 0.203190 2025-01-16 02:04:40,667 - INFO - step 17674, loss: 0.295754, best loss: 0.203190 2025-01-16 02:04:40,817 - INFO - step 17675, loss: 0.269759, best loss: 0.203190 2025-01-16 02:04:40,967 - INFO - step 17676, loss: 0.252967, best loss: 0.203190 2025-01-16 02:04:41,117 - INFO - step 17677, loss: 0.326688, best loss: 0.203190 2025-01-16 02:04:41,267 - INFO - step 17678, loss: 0.238999, best loss: 0.203190 2025-01-16 02:04:41,418 - INFO - step 17679, loss: 0.284435, best loss: 0.203190 2025-01-16 02:04:41,568 - INFO - step 17680, loss: 0.274111, best loss: 0.203190 
2025-01-16 02:04:41,718 - INFO - step 17681, loss: 0.293284, best loss: 0.203190 2025-01-16 02:04:41,868 - INFO - step 17682, loss: 0.282232, best loss: 0.203190 2025-01-16 02:04:42,018 - INFO - step 17683, loss: 0.311509, best loss: 0.203190 2025-01-16 02:04:42,168 - INFO - step 17684, loss: 0.287487, best loss: 0.203190 2025-01-16 02:04:42,318 - INFO - step 17685, loss: 0.350442, best loss: 0.203190 2025-01-16 02:04:42,468 - INFO - step 17686, loss: 0.269123, best loss: 0.203190 2025-01-16 02:04:42,618 - INFO - step 17687, loss: 0.268745, best loss: 0.203190 2025-01-16 02:04:42,768 - INFO - step 17688, loss: 0.269820, best loss: 0.203190 2025-01-16 02:04:42,918 - INFO - step 17689, loss: 0.223893, best loss: 0.203190 2025-01-16 02:04:43,068 - INFO - step 17690, loss: 0.323426, best loss: 0.203190 2025-01-16 02:04:43,218 - INFO - step 17691, loss: 0.269351, best loss: 0.203190 2025-01-16 02:04:43,368 - INFO - step 17692, loss: 0.303594, best loss: 0.203190 2025-01-16 02:04:43,518 - INFO - step 17693, loss: 0.286395, best loss: 0.203190 2025-01-16 02:04:43,669 - INFO - step 17694, loss: 0.325263, best loss: 0.203190 2025-01-16 02:04:43,819 - INFO - step 17695, loss: 0.269247, best loss: 0.203190 2025-01-16 02:04:43,969 - INFO - step 17696, loss: 0.286872, best loss: 0.203190 2025-01-16 02:04:44,119 - INFO - step 17697, loss: 0.262954, best loss: 0.203190 2025-01-16 02:04:44,269 - INFO - step 17698, loss: 0.243428, best loss: 0.203190 2025-01-16 02:04:44,419 - INFO - step 17699, loss: 0.274890, best loss: 0.203190 2025-01-16 02:04:44,569 - INFO - step 17700, loss: 0.283415, best loss: 0.203190 2025-01-16 02:04:44,719 - INFO - step 17701, loss: 0.256331, best loss: 0.203190 2025-01-16 02:04:44,869 - INFO - step 17702, loss: 0.254545, best loss: 0.203190 2025-01-16 02:04:45,019 - INFO - step 17703, loss: 0.297954, best loss: 0.203190 2025-01-16 02:04:45,169 - INFO - step 17704, loss: 0.333563, best loss: 0.203190 2025-01-16 02:04:45,319 - INFO - step 17705, loss: 
0.264919, best loss: 0.203190 2025-01-16 02:04:45,469 - INFO - step 17706, loss: 0.268587, best loss: 0.203190 2025-01-16 02:04:45,619 - INFO - step 17707, loss: 0.279401, best loss: 0.203190 2025-01-16 02:04:45,769 - INFO - step 17708, loss: 0.301736, best loss: 0.203190 2025-01-16 02:04:45,919 - INFO - step 17709, loss: 0.281506, best loss: 0.203190 2025-01-16 02:04:46,069 - INFO - step 17710, loss: 0.292311, best loss: 0.203190 2025-01-16 02:04:46,219 - INFO - step 17711, loss: 0.309299, best loss: 0.203190 2025-01-16 02:04:46,369 - INFO - step 17712, loss: 0.297172, best loss: 0.203190 2025-01-16 02:04:46,519 - INFO - step 17713, loss: 0.288492, best loss: 0.203190 2025-01-16 02:04:46,669 - INFO - step 17714, loss: 0.340359, best loss: 0.203190 2025-01-16 02:04:46,820 - INFO - step 17715, loss: 0.264135, best loss: 0.203190 2025-01-16 02:04:46,969 - INFO - step 17716, loss: 0.301400, best loss: 0.203190 2025-01-16 02:04:47,120 - INFO - step 17717, loss: 0.269902, best loss: 0.203190 2025-01-16 02:04:47,270 - INFO - step 17718, loss: 0.274454, best loss: 0.203190 2025-01-16 02:04:47,420 - INFO - step 17719, loss: 0.355830, best loss: 0.203190 2025-01-16 02:04:47,571 - INFO - step 17720, loss: 0.284161, best loss: 0.203190 2025-01-16 02:04:47,721 - INFO - step 17721, loss: 0.270729, best loss: 0.203190 2025-01-16 02:04:47,871 - INFO - step 17722, loss: 0.303691, best loss: 0.203190 2025-01-16 02:04:48,021 - INFO - step 17723, loss: 0.292675, best loss: 0.203190 2025-01-16 02:04:48,171 - INFO - step 17724, loss: 0.331707, best loss: 0.203190 2025-01-16 02:04:48,321 - INFO - step 17725, loss: 0.277783, best loss: 0.203190 2025-01-16 02:04:48,471 - INFO - step 17726, loss: 0.303273, best loss: 0.203190 2025-01-16 02:04:48,621 - INFO - step 17727, loss: 0.229891, best loss: 0.203190 2025-01-16 02:04:48,771 - INFO - step 17728, loss: 0.290357, best loss: 0.203190 2025-01-16 02:04:48,921 - INFO - step 17729, loss: 0.277244, best loss: 0.203190 2025-01-16 02:04:49,071 - 
INFO - step 17730, loss: 0.353478, best loss: 0.203190 2025-01-16 02:04:49,221 - INFO - step 17731, loss: 0.273637, best loss: 0.203190 2025-01-16 02:04:49,371 - INFO - step 17732, loss: 0.282772, best loss: 0.203190 2025-01-16 02:04:49,522 - INFO - step 17733, loss: 0.315510, best loss: 0.203190 2025-01-16 02:04:49,672 - INFO - step 17734, loss: 0.321668, best loss: 0.203190 2025-01-16 02:04:49,822 - INFO - step 17735, loss: 0.316784, best loss: 0.203190 2025-01-16 02:04:49,972 - INFO - step 17736, loss: 0.265626, best loss: 0.203190 2025-01-16 02:04:50,122 - INFO - step 17737, loss: 0.286227, best loss: 0.203190 2025-01-16 02:04:50,272 - INFO - step 17738, loss: 0.241683, best loss: 0.203190 2025-01-16 02:04:50,422 - INFO - step 17739, loss: 0.255243, best loss: 0.203190 2025-01-16 02:04:50,572 - INFO - step 17740, loss: 0.269861, best loss: 0.203190 2025-01-16 02:04:50,723 - INFO - step 17741, loss: 0.274394, best loss: 0.203190 2025-01-16 02:04:50,873 - INFO - step 17742, loss: 0.250847, best loss: 0.203190 2025-01-16 02:04:51,023 - INFO - step 17743, loss: 0.262038, best loss: 0.203190 2025-01-16 02:04:51,173 - INFO - step 17744, loss: 0.268898, best loss: 0.203190 2025-01-16 02:04:51,323 - INFO - step 17745, loss: 0.283166, best loss: 0.203190 2025-01-16 02:04:51,473 - INFO - step 17746, loss: 0.271879, best loss: 0.203190 2025-01-16 02:04:51,623 - INFO - step 17747, loss: 0.315065, best loss: 0.203190 2025-01-16 02:04:51,774 - INFO - step 17748, loss: 0.323235, best loss: 0.203190 2025-01-16 02:04:51,924 - INFO - step 17749, loss: 0.332610, best loss: 0.203190 2025-01-16 02:04:52,074 - INFO - step 17750, loss: 0.294014, best loss: 0.203190 2025-01-16 02:04:52,224 - INFO - step 17751, loss: 0.236296, best loss: 0.203190 2025-01-16 02:04:52,374 - INFO - step 17752, loss: 0.278243, best loss: 0.203190 2025-01-16 02:04:52,524 - INFO - step 17753, loss: 0.226711, best loss: 0.203190 2025-01-16 02:04:52,674 - INFO - step 17754, loss: 0.231357, best loss: 0.203190 
2025-01-16 02:04:52,824 - INFO - step 17755, loss: 0.292956, best loss: 0.203190 2025-01-16 02:04:52,974 - INFO - step 17756, loss: 0.257676, best loss: 0.203190 2025-01-16 02:04:53,125 - INFO - step 17757, loss: 0.284629, best loss: 0.203190 2025-01-16 02:04:53,275 - INFO - step 17758, loss: 0.245637, best loss: 0.203190 2025-01-16 02:04:53,425 - INFO - step 17759, loss: 0.283001, best loss: 0.203190 2025-01-16 02:04:53,575 - INFO - step 17760, loss: 0.337343, best loss: 0.203190 2025-01-16 02:04:53,725 - INFO - step 17761, loss: 0.275778, best loss: 0.203190 2025-01-16 02:04:53,875 - INFO - step 17762, loss: 0.244316, best loss: 0.203190 2025-01-16 02:04:54,025 - INFO - step 17763, loss: 0.267176, best loss: 0.203190 2025-01-16 02:04:54,175 - INFO - step 17764, loss: 0.254589, best loss: 0.203190 2025-01-16 02:04:54,326 - INFO - step 17765, loss: 0.242217, best loss: 0.203190 2025-01-16 02:04:54,476 - INFO - step 17766, loss: 0.273762, best loss: 0.203190 2025-01-16 02:04:54,626 - INFO - step 17767, loss: 0.244404, best loss: 0.203190 2025-01-16 02:04:54,776 - INFO - step 17768, loss: 0.310740, best loss: 0.203190 2025-01-16 02:04:54,926 - INFO - step 17769, loss: 0.265384, best loss: 0.203190 2025-01-16 02:04:55,076 - INFO - step 17770, loss: 0.331570, best loss: 0.203190 2025-01-16 02:04:55,226 - INFO - step 17771, loss: 0.256526, best loss: 0.203190 2025-01-16 02:04:55,376 - INFO - step 17772, loss: 0.262795, best loss: 0.203190 2025-01-16 02:04:55,526 - INFO - step 17773, loss: 0.333426, best loss: 0.203190 2025-01-16 02:04:55,676 - INFO - step 17774, loss: 0.280062, best loss: 0.203190 2025-01-16 02:04:55,827 - INFO - step 17775, loss: 0.288039, best loss: 0.203190 2025-01-16 02:04:55,977 - INFO - step 17776, loss: 0.270338, best loss: 0.203190 2025-01-16 02:04:56,127 - INFO - step 17777, loss: 0.291489, best loss: 0.203190 2025-01-16 02:04:56,277 - INFO - step 17778, loss: 0.275993, best loss: 0.203190 2025-01-16 02:04:56,427 - INFO - step 17779, loss: 
0.273900, best loss: 0.203190 2025-01-16 02:04:56,577 - INFO - step 17780, loss: 0.256969, best loss: 0.203190 2025-01-16 02:04:56,727 - INFO - step 17781, loss: 0.270350, best loss: 0.203190 2025-01-16 02:04:56,877 - INFO - step 17782, loss: 0.234838, best loss: 0.203190 2025-01-16 02:04:57,028 - INFO - step 17783, loss: 0.257720, best loss: 0.203190 2025-01-16 02:04:57,178 - INFO - step 17784, loss: 0.245633, best loss: 0.203190 2025-01-16 02:04:57,328 - INFO - step 17785, loss: 0.273249, best loss: 0.203190 2025-01-16 02:04:57,478 - INFO - step 17786, loss: 0.274125, best loss: 0.203190 2025-01-16 02:04:57,628 - INFO - step 17787, loss: 0.205166, best loss: 0.203190 2025-01-16 02:04:57,778 - INFO - step 17788, loss: 0.261497, best loss: 0.203190 2025-01-16 02:04:57,928 - INFO - step 17789, loss: 0.269665, best loss: 0.203190 2025-01-16 02:04:58,078 - INFO - step 17790, loss: 0.247416, best loss: 0.203190 2025-01-16 02:04:58,228 - INFO - step 17791, loss: 0.311401, best loss: 0.203190 2025-01-16 02:04:58,378 - INFO - step 17792, loss: 0.250691, best loss: 0.203190 2025-01-16 02:04:58,528 - INFO - step 17793, loss: 0.314804, best loss: 0.203190 2025-01-16 02:04:58,678 - INFO - step 17794, loss: 0.286300, best loss: 0.203190 2025-01-16 02:04:58,828 - INFO - step 17795, loss: 0.343202, best loss: 0.203190 2025-01-16 02:04:58,978 - INFO - step 17796, loss: 0.289922, best loss: 0.203190 2025-01-16 02:04:59,128 - INFO - step 17797, loss: 0.275575, best loss: 0.203190 2025-01-16 02:04:59,278 - INFO - step 17798, loss: 0.229550, best loss: 0.203190 2025-01-16 02:04:59,428 - INFO - step 17799, loss: 0.253666, best loss: 0.203190 2025-01-16 02:04:59,579 - INFO - step 17800, loss: 0.207673, best loss: 0.203190 2025-01-16 02:04:59,729 - INFO - step 17801, loss: 0.310088, best loss: 0.203190 2025-01-16 02:04:59,879 - INFO - step 17802, loss: 0.285330, best loss: 0.203190 2025-01-16 02:05:00,029 - INFO - step 17803, loss: 0.257912, best loss: 0.203190 2025-01-16 02:05:00,179 - 
INFO - step 17804, loss: 0.226528, best loss: 0.203190 2025-01-16 02:05:00,329 - INFO - step 17805, loss: 0.330796, best loss: 0.203190 2025-01-16 02:05:00,479 - INFO - step 17806, loss: 0.222502, best loss: 0.203190 2025-01-16 02:05:00,629 - INFO - step 17807, loss: 0.229687, best loss: 0.203190 2025-01-16 02:05:00,779 - INFO - step 17808, loss: 0.290237, best loss: 0.203190 2025-01-16 02:05:00,929 - INFO - step 17809, loss: 0.281865, best loss: 0.203190 2025-01-16 02:05:01,079 - INFO - step 17810, loss: 0.254222, best loss: 0.203190 2025-01-16 02:05:01,229 - INFO - step 17811, loss: 0.312629, best loss: 0.203190 2025-01-16 02:05:01,379 - INFO - step 17812, loss: 0.272573, best loss: 0.203190 2025-01-16 02:05:01,529 - INFO - step 17813, loss: 0.276929, best loss: 0.203190 2025-01-16 02:05:01,679 - INFO - step 17814, loss: 0.238943, best loss: 0.203190 2025-01-16 02:05:01,829 - INFO - step 17815, loss: 0.249950, best loss: 0.203190 2025-01-16 02:05:01,979 - INFO - step 17816, loss: 0.330873, best loss: 0.203190 2025-01-16 02:05:02,130 - INFO - step 17817, loss: 0.206595, best loss: 0.203190 2025-01-16 02:05:02,280 - INFO - step 17818, loss: 0.256081, best loss: 0.203190 2025-01-16 02:05:05,775 - INFO - step 17819, loss: 0.186648, best loss: 0.186648 2025-01-16 02:05:05,925 - INFO - step 17820, loss: 0.352171, best loss: 0.186648 2025-01-16 02:05:06,076 - INFO - step 17821, loss: 0.301842, best loss: 0.186648 2025-01-16 02:05:06,226 - INFO - step 17822, loss: 0.265220, best loss: 0.186648 2025-01-16 02:05:06,376 - INFO - step 17823, loss: 0.283825, best loss: 0.186648 2025-01-16 02:05:06,526 - INFO - step 17824, loss: 0.259349, best loss: 0.186648 2025-01-16 02:05:06,676 - INFO - step 17825, loss: 0.300313, best loss: 0.186648 2025-01-16 02:05:06,826 - INFO - step 17826, loss: 0.296326, best loss: 0.186648 2025-01-16 02:05:06,976 - INFO - step 17827, loss: 0.259515, best loss: 0.186648 2025-01-16 02:05:07,126 - INFO - step 17828, loss: 0.280285, best loss: 0.186648 
2025-01-16 02:05:07,276 - INFO - step 17829, loss: 0.243508, best loss: 0.186648 2025-01-16 02:05:07,426 - INFO - step 17830, loss: 0.290588, best loss: 0.186648 2025-01-16 02:05:07,576 - INFO - step 17831, loss: 0.230948, best loss: 0.186648 2025-01-16 02:05:07,726 - INFO - step 17832, loss: 0.240900, best loss: 0.186648 2025-01-16 02:05:07,876 - INFO - step 17833, loss: 0.195448, best loss: 0.186648 2025-01-16 02:05:08,026 - INFO - step 17834, loss: 0.284206, best loss: 0.186648 2025-01-16 02:05:08,176 - INFO - step 17835, loss: 0.286783, best loss: 0.186648 2025-01-16 02:05:08,326 - INFO - step 17836, loss: 0.295777, best loss: 0.186648 2025-01-16 02:05:08,476 - INFO - step 17837, loss: 0.296537, best loss: 0.186648 2025-01-16 02:05:08,626 - INFO - step 17838, loss: 0.299612, best loss: 0.186648 2025-01-16 02:05:08,776 - INFO - step 17839, loss: 0.245909, best loss: 0.186648 2025-01-16 02:05:08,927 - INFO - step 17840, loss: 0.261140, best loss: 0.186648 2025-01-16 02:05:09,077 - INFO - step 17841, loss: 0.317849, best loss: 0.186648 2025-01-16 02:05:09,227 - INFO - step 17842, loss: 0.265406, best loss: 0.186648 2025-01-16 02:05:09,377 - INFO - step 17843, loss: 0.250895, best loss: 0.186648 2025-01-16 02:05:09,527 - INFO - step 17844, loss: 0.269792, best loss: 0.186648 2025-01-16 02:05:09,678 - INFO - step 17845, loss: 0.316175, best loss: 0.186648 2025-01-16 02:05:09,828 - INFO - step 17846, loss: 0.332239, best loss: 0.186648 2025-01-16 02:05:09,978 - INFO - step 17847, loss: 0.259644, best loss: 0.186648 2025-01-16 02:05:10,128 - INFO - step 17848, loss: 0.272189, best loss: 0.186648 2025-01-16 02:05:10,278 - INFO - step 17849, loss: 0.242737, best loss: 0.186648 2025-01-16 02:05:10,428 - INFO - step 17850, loss: 0.229384, best loss: 0.186648 2025-01-16 02:05:10,578 - INFO - step 17851, loss: 0.242241, best loss: 0.186648 2025-01-16 02:05:10,728 - INFO - step 17852, loss: 0.299363, best loss: 0.186648 2025-01-16 02:05:10,878 - INFO - step 17853, loss: 
0.296287, best loss: 0.186648 2025-01-16 02:05:11,028 - INFO - step 17854, loss: 0.299056, best loss: 0.186648 2025-01-16 02:05:11,178 - INFO - step 17855, loss: 0.262905, best loss: 0.186648 2025-01-16 02:05:11,329 - INFO - step 17856, loss: 0.369763, best loss: 0.186648 2025-01-16 02:05:11,479 - INFO - step 17857, loss: 0.324574, best loss: 0.186648 2025-01-16 02:05:11,629 - INFO - step 17858, loss: 0.322258, best loss: 0.186648 2025-01-16 02:05:11,779 - INFO - step 17859, loss: 0.329805, best loss: 0.186648 2025-01-16 02:05:11,929 - INFO - step 17860, loss: 0.294598, best loss: 0.186648 2025-01-16 02:05:12,079 - INFO - step 17861, loss: 0.342949, best loss: 0.186648 2025-01-16 02:05:12,230 - INFO - step 17862, loss: 0.301499, best loss: 0.186648 2025-01-16 02:05:12,380 - INFO - step 17863, loss: 0.297355, best loss: 0.186648 2025-01-16 02:05:12,530 - INFO - step 17864, loss: 0.267706, best loss: 0.186648 2025-01-16 02:05:12,680 - INFO - step 17865, loss: 0.232372, best loss: 0.186648 2025-01-16 02:05:12,830 - INFO - step 17866, loss: 0.324485, best loss: 0.186648 2025-01-16 02:05:12,980 - INFO - step 17867, loss: 0.357291, best loss: 0.186648 2025-01-16 02:05:13,131 - INFO - step 17868, loss: 0.299255, best loss: 0.186648 2025-01-16 02:05:13,281 - INFO - step 17869, loss: 0.345947, best loss: 0.186648 2025-01-16 02:05:13,431 - INFO - step 17870, loss: 0.327528, best loss: 0.186648 2025-01-16 02:05:13,581 - INFO - step 17871, loss: 0.267541, best loss: 0.186648 2025-01-16 02:05:13,731 - INFO - step 17872, loss: 0.294136, best loss: 0.186648 2025-01-16 02:05:13,881 - INFO - step 17873, loss: 0.307532, best loss: 0.186648 2025-01-16 02:05:14,031 - INFO - step 17874, loss: 0.317767, best loss: 0.186648 2025-01-16 02:05:14,181 - INFO - step 17875, loss: 0.266967, best loss: 0.186648 2025-01-16 02:05:14,331 - INFO - step 17876, loss: 0.319827, best loss: 0.186648 2025-01-16 02:05:14,481 - INFO - step 17877, loss: 0.278536, best loss: 0.186648 2025-01-16 02:05:14,631 - 
INFO - step 17878, loss: 0.302610, best loss: 0.186648 2025-01-16 02:05:14,781 - INFO - step 17879, loss: 0.354110, best loss: 0.186648 2025-01-16 02:05:14,931 - INFO - step 17880, loss: 0.290123, best loss: 0.186648 2025-01-16 02:05:15,081 - INFO - step 17881, loss: 0.384606, best loss: 0.186648 2025-01-16 02:05:15,231 - INFO - step 17882, loss: 0.290657, best loss: 0.186648 2025-01-16 02:05:15,381 - INFO - step 17883, loss: 0.303029, best loss: 0.186648 2025-01-16 02:05:15,531 - INFO - step 17884, loss: 0.304766, best loss: 0.186648 2025-01-16 02:05:15,682 - INFO - step 17885, loss: 0.275398, best loss: 0.186648 2025-01-16 02:05:15,832 - INFO - step 17886, loss: 0.280502, best loss: 0.186648 2025-01-16 02:05:15,982 - INFO - step 17887, loss: 0.279276, best loss: 0.186648 2025-01-16 02:05:16,132 - INFO - step 17888, loss: 0.286551, best loss: 0.186648 2025-01-16 02:05:16,282 - INFO - step 17889, loss: 0.304895, best loss: 0.186648 2025-01-16 02:05:16,432 - INFO - step 17890, loss: 0.270666, best loss: 0.186648 2025-01-16 02:05:16,582 - INFO - step 17891, loss: 0.293014, best loss: 0.186648 2025-01-16 02:05:16,732 - INFO - step 17892, loss: 0.335962, best loss: 0.186648 2025-01-16 02:05:16,882 - INFO - step 17893, loss: 0.297811, best loss: 0.186648 2025-01-16 02:05:17,032 - INFO - step 17894, loss: 0.274803, best loss: 0.186648 2025-01-16 02:05:17,183 - INFO - step 17895, loss: 0.286775, best loss: 0.186648 2025-01-16 02:05:17,333 - INFO - step 17896, loss: 0.252417, best loss: 0.186648 2025-01-16 02:05:17,483 - INFO - step 17897, loss: 0.316251, best loss: 0.186648 2025-01-16 02:05:17,633 - INFO - step 17898, loss: 0.272022, best loss: 0.186648 2025-01-16 02:05:17,783 - INFO - step 17899, loss: 0.317365, best loss: 0.186648 2025-01-16 02:05:17,933 - INFO - step 17900, loss: 0.286249, best loss: 0.186648 2025-01-16 02:05:18,083 - INFO - step 17901, loss: 0.307638, best loss: 0.186648 2025-01-16 02:05:18,233 - INFO - step 17902, loss: 0.284997, best loss: 0.186648 
2025-01-16 02:05:18,383 - INFO - step 17903, loss: 0.268305, best loss: 0.186648 2025-01-16 02:05:18,534 - INFO - step 17904, loss: 0.247984, best loss: 0.186648 2025-01-16 02:05:18,684 - INFO - step 17905, loss: 0.268067, best loss: 0.186648 2025-01-16 02:05:18,834 - INFO - step 17906, loss: 0.253925, best loss: 0.186648 2025-01-16 02:05:18,984 - INFO - step 17907, loss: 0.236802, best loss: 0.186648 2025-01-16 02:05:19,134 - INFO - step 17908, loss: 0.272853, best loss: 0.186648 2025-01-16 02:05:19,284 - INFO - step 17909, loss: 0.229309, best loss: 0.186648 2025-01-16 02:05:19,434 - INFO - step 17910, loss: 0.265064, best loss: 0.186648 2025-01-16 02:05:19,585 - INFO - step 17911, loss: 0.308324, best loss: 0.186648 2025-01-16 02:05:19,735 - INFO - step 17912, loss: 0.292486, best loss: 0.186648 2025-01-16 02:05:19,885 - INFO - step 17913, loss: 0.262676, best loss: 0.186648 2025-01-16 02:05:20,035 - INFO - step 17914, loss: 0.320376, best loss: 0.186648 2025-01-16 02:05:20,185 - INFO - step 17915, loss: 0.331107, best loss: 0.186648 2025-01-16 02:05:20,335 - INFO - step 17916, loss: 0.288930, best loss: 0.186648 2025-01-16 02:05:20,485 - INFO - step 17917, loss: 0.252630, best loss: 0.186648 2025-01-16 02:05:20,636 - INFO - step 17918, loss: 0.240772, best loss: 0.186648 2025-01-16 02:05:20,786 - INFO - step 17919, loss: 0.252965, best loss: 0.186648 2025-01-16 02:05:20,936 - INFO - step 17920, loss: 0.270904, best loss: 0.186648 2025-01-16 02:05:21,086 - INFO - step 17921, loss: 0.283214, best loss: 0.186648 2025-01-16 02:05:21,236 - INFO - step 17922, loss: 0.262963, best loss: 0.186648 2025-01-16 02:05:21,386 - INFO - step 17923, loss: 0.302344, best loss: 0.186648 2025-01-16 02:05:21,536 - INFO - step 17924, loss: 0.248372, best loss: 0.186648 2025-01-16 02:05:21,686 - INFO - step 17925, loss: 0.297659, best loss: 0.186648 2025-01-16 02:05:21,836 - INFO - step 17926, loss: 0.322891, best loss: 0.186648 2025-01-16 02:05:21,986 - INFO - step 17927, loss: 
0.252641, best loss: 0.186648 2025-01-16 02:05:22,136 - INFO - step 17928, loss: 0.279336, best loss: 0.186648 2025-01-16 02:05:22,286 - INFO - step 17929, loss: 0.282105, best loss: 0.186648 2025-01-16 02:05:22,436 - INFO - step 17930, loss: 0.267145, best loss: 0.186648 2025-01-16 02:05:22,587 - INFO - step 17931, loss: 0.215096, best loss: 0.186648 2025-01-16 02:05:22,737 - INFO - step 17932, loss: 0.231770, best loss: 0.186648 2025-01-16 02:05:22,887 - INFO - step 17933, loss: 0.249338, best loss: 0.186648 2025-01-16 02:05:23,037 - INFO - step 17934, loss: 0.300080, best loss: 0.186648 2025-01-16 02:05:23,187 - INFO - step 17935, loss: 0.265839, best loss: 0.186648 2025-01-16 02:05:23,337 - INFO - step 17936, loss: 0.267453, best loss: 0.186648 2025-01-16 02:05:23,487 - INFO - step 17937, loss: 0.280500, best loss: 0.186648 2025-01-16 02:05:23,638 - INFO - step 17938, loss: 0.257902, best loss: 0.186648 2025-01-16 02:05:23,787 - INFO - step 17939, loss: 0.247718, best loss: 0.186648 2025-01-16 02:05:23,938 - INFO - step 17940, loss: 0.254829, best loss: 0.186648 2025-01-16 02:05:24,088 - INFO - step 17941, loss: 0.270763, best loss: 0.186648 2025-01-16 02:05:24,238 - INFO - step 17942, loss: 0.273238, best loss: 0.186648 2025-01-16 02:05:24,388 - INFO - step 17943, loss: 0.241778, best loss: 0.186648 2025-01-16 02:05:24,538 - INFO - step 17944, loss: 0.266014, best loss: 0.186648 2025-01-16 02:05:24,688 - INFO - step 17945, loss: 0.258825, best loss: 0.186648 2025-01-16 02:05:24,838 - INFO - step 17946, loss: 0.254066, best loss: 0.186648 2025-01-16 02:05:24,989 - INFO - step 17947, loss: 0.294083, best loss: 0.186648 2025-01-16 02:05:25,139 - INFO - step 17948, loss: 0.246071, best loss: 0.186648 2025-01-16 02:05:25,289 - INFO - step 17949, loss: 0.254933, best loss: 0.186648 2025-01-16 02:05:25,439 - INFO - step 17950, loss: 0.277004, best loss: 0.186648 2025-01-16 02:05:25,589 - INFO - step 17951, loss: 0.244908, best loss: 0.186648 2025-01-16 02:05:25,739 - 
INFO - step 17952, loss: 0.262002, best loss: 0.186648 2025-01-16 02:05:25,889 - INFO - step 17953, loss: 0.249408, best loss: 0.186648 2025-01-16 02:05:26,039 - INFO - step 17954, loss: 0.285235, best loss: 0.186648 2025-01-16 02:05:26,189 - INFO - step 17955, loss: 0.245735, best loss: 0.186648 2025-01-16 02:05:26,340 - INFO - step 17956, loss: 0.235422, best loss: 0.186648 2025-01-16 02:05:26,490 - INFO - step 17957, loss: 0.272282, best loss: 0.186648 2025-01-16 02:05:26,640 - INFO - step 17958, loss: 0.267722, best loss: 0.186648 2025-01-16 02:05:26,790 - INFO - step 17959, loss: 0.285379, best loss: 0.186648 2025-01-16 02:05:26,940 - INFO - step 17960, loss: 0.253301, best loss: 0.186648 2025-01-16 02:05:27,090 - INFO - step 17961, loss: 0.227429, best loss: 0.186648 2025-01-16 02:05:27,240 - INFO - step 17962, loss: 0.234078, best loss: 0.186648 2025-01-16 02:05:27,390 - INFO - step 17963, loss: 0.234390, best loss: 0.186648 2025-01-16 02:05:27,540 - INFO - step 17964, loss: 0.238559, best loss: 0.186648 2025-01-16 02:05:27,691 - INFO - step 17965, loss: 0.272008, best loss: 0.186648 2025-01-16 02:05:27,841 - INFO - step 17966, loss: 0.338371, best loss: 0.186648 2025-01-16 02:05:27,991 - INFO - step 17967, loss: 0.282953, best loss: 0.186648 2025-01-16 02:05:28,141 - INFO - step 17968, loss: 0.241874, best loss: 0.186648 2025-01-16 02:05:28,291 - INFO - step 17969, loss: 0.278517, best loss: 0.186648 2025-01-16 02:05:28,441 - INFO - step 17970, loss: 0.275697, best loss: 0.186648 2025-01-16 02:05:28,591 - INFO - step 17971, loss: 0.254658, best loss: 0.186648 2025-01-16 02:05:28,741 - INFO - step 17972, loss: 0.274866, best loss: 0.186648 2025-01-16 02:05:28,891 - INFO - step 17973, loss: 0.264583, best loss: 0.186648 2025-01-16 02:05:29,042 - INFO - step 17974, loss: 0.255467, best loss: 0.186648 2025-01-16 02:05:29,192 - INFO - step 17975, loss: 0.269684, best loss: 0.186648 2025-01-16 02:05:29,342 - INFO - step 17976, loss: 0.313170, best loss: 0.186648 
2025-01-16 02:05:29,492 - INFO - step 17977, loss: 0.307805, best loss: 0.186648 2025-01-16 02:05:29,643 - INFO - step 17978, loss: 0.302609, best loss: 0.186648 2025-01-16 02:05:29,793 - INFO - step 17979, loss: 0.281239, best loss: 0.186648 2025-01-16 02:05:29,943 - INFO - step 17980, loss: 0.264648, best loss: 0.186648 2025-01-16 02:05:30,093 - INFO - step 17981, loss: 0.248837, best loss: 0.186648 2025-01-16 02:05:30,243 - INFO - step 17982, loss: 0.285558, best loss: 0.186648 2025-01-16 02:05:30,393 - INFO - step 17983, loss: 0.281976, best loss: 0.186648 2025-01-16 02:05:30,543 - INFO - step 17984, loss: 0.260537, best loss: 0.186648 2025-01-16 02:05:30,693 - INFO - step 17985, loss: 0.287284, best loss: 0.186648 2025-01-16 02:05:30,843 - INFO - step 17986, loss: 0.294563, best loss: 0.186648 2025-01-16 02:05:30,994 - INFO - step 17987, loss: 0.312519, best loss: 0.186648 2025-01-16 02:05:31,144 - INFO - step 17988, loss: 0.332376, best loss: 0.186648 2025-01-16 02:05:31,294 - INFO - step 17989, loss: 0.292454, best loss: 0.186648 2025-01-16 02:05:31,444 - INFO - step 17990, loss: 0.273679, best loss: 0.186648 2025-01-16 02:05:31,594 - INFO - step 17991, loss: 0.225072, best loss: 0.186648 2025-01-16 02:05:31,745 - INFO - step 17992, loss: 0.248570, best loss: 0.186648 2025-01-16 02:05:31,895 - INFO - step 17993, loss: 0.259464, best loss: 0.186648 2025-01-16 02:05:32,045 - INFO - step 17994, loss: 0.260407, best loss: 0.186648 2025-01-16 02:05:32,195 - INFO - step 17995, loss: 0.231335, best loss: 0.186648 2025-01-16 02:05:32,345 - INFO - step 17996, loss: 0.262673, best loss: 0.186648 2025-01-16 02:05:32,496 - INFO - step 17997, loss: 0.281190, best loss: 0.186648 2025-01-16 02:05:32,646 - INFO - step 17998, loss: 0.238516, best loss: 0.186648 2025-01-16 02:05:32,796 - INFO - step 17999, loss: 0.254529, best loss: 0.186648 2025-01-16 02:05:32,946 - INFO - step 18000, loss: 0.322126, best loss: 0.186648 2025-01-16 02:05:33,096 - INFO - step 18001, loss: 
0.274011, best loss: 0.186648 2025-01-16 02:05:33,246 - INFO - step 18002, loss: 0.244867, best loss: 0.186648 2025-01-16 02:05:33,396 - INFO - step 18003, loss: 0.347300, best loss: 0.186648 2025-01-16 02:05:33,546 - INFO - step 18004, loss: 0.269528, best loss: 0.186648 2025-01-16 02:05:33,696 - INFO - step 18005, loss: 0.262542, best loss: 0.186648 2025-01-16 02:05:33,847 - INFO - step 18006, loss: 0.247293, best loss: 0.186648 2025-01-16 02:05:33,997 - INFO - step 18007, loss: 0.288432, best loss: 0.186648 2025-01-16 02:05:34,147 - INFO - step 18008, loss: 0.240476, best loss: 0.186648 2025-01-16 02:05:34,297 - INFO - step 18009, loss: 0.255982, best loss: 0.186648 2025-01-16 02:05:34,447 - INFO - step 18010, loss: 0.247559, best loss: 0.186648 2025-01-16 02:05:34,597 - INFO - step 18011, loss: 0.241959, best loss: 0.186648 2025-01-16 02:05:34,748 - INFO - step 18012, loss: 0.267053, best loss: 0.186648 2025-01-16 02:05:34,898 - INFO - step 18013, loss: 0.303663, best loss: 0.186648 2025-01-16 02:05:35,048 - INFO - step 18014, loss: 0.274275, best loss: 0.186648 2025-01-16 02:05:35,198 - INFO - step 18015, loss: 0.240307, best loss: 0.186648 2025-01-16 02:05:35,348 - INFO - step 18016, loss: 0.252369, best loss: 0.186648 2025-01-16 02:05:35,498 - INFO - step 18017, loss: 0.278006, best loss: 0.186648 2025-01-16 02:05:35,649 - INFO - step 18018, loss: 0.302231, best loss: 0.186648 2025-01-16 02:05:35,799 - INFO - step 18019, loss: 0.220912, best loss: 0.186648 2025-01-16 02:05:35,949 - INFO - step 18020, loss: 0.242158, best loss: 0.186648 2025-01-16 02:05:36,099 - INFO - step 18021, loss: 0.261763, best loss: 0.186648 2025-01-16 02:05:36,249 - INFO - step 18022, loss: 0.248546, best loss: 0.186648 2025-01-16 02:05:36,400 - INFO - step 18023, loss: 0.278395, best loss: 0.186648 2025-01-16 02:05:36,550 - INFO - step 18024, loss: 0.277472, best loss: 0.186648 2025-01-16 02:05:36,700 - INFO - step 18025, loss: 0.294446, best loss: 0.186648 2025-01-16 02:05:36,850 - 
INFO - step 18026, loss: 0.229733, best loss: 0.186648 2025-01-16 02:05:37,000 - INFO - step 18027, loss: 0.235788, best loss: 0.186648 2025-01-16 02:05:37,150 - INFO - step 18028, loss: 0.267155, best loss: 0.186648 2025-01-16 02:05:37,301 - INFO - step 18029, loss: 0.301341, best loss: 0.186648 2025-01-16 02:05:37,451 - INFO - step 18030, loss: 0.253949, best loss: 0.186648 2025-01-16 02:05:37,601 - INFO - step 18031, loss: 0.279924, best loss: 0.186648 2025-01-16 02:05:37,751 - INFO - step 18032, loss: 0.256802, best loss: 0.186648 2025-01-16 02:05:37,901 - INFO - step 18033, loss: 0.261616, best loss: 0.186648 2025-01-16 02:05:38,051 - INFO - step 18034, loss: 0.262265, best loss: 0.186648 2025-01-16 02:05:38,201 - INFO - step 18035, loss: 0.301232, best loss: 0.186648 2025-01-16 02:05:38,352 - INFO - step 18036, loss: 0.281612, best loss: 0.186648 2025-01-16 02:05:38,502 - INFO - step 18037, loss: 0.276209, best loss: 0.186648 2025-01-16 02:05:38,652 - INFO - step 18038, loss: 0.281389, best loss: 0.186648 2025-01-16 02:05:38,803 - INFO - step 18039, loss: 0.263096, best loss: 0.186648 2025-01-16 02:05:38,953 - INFO - step 18040, loss: 0.298310, best loss: 0.186648 2025-01-16 02:05:39,103 - INFO - step 18041, loss: 0.271497, best loss: 0.186648 2025-01-16 02:05:39,253 - INFO - step 18042, loss: 0.315492, best loss: 0.186648 2025-01-16 02:05:39,403 - INFO - step 18043, loss: 0.297088, best loss: 0.186648 2025-01-16 02:05:39,553 - INFO - step 18044, loss: 0.352427, best loss: 0.186648 2025-01-16 02:05:39,703 - INFO - step 18045, loss: 0.275032, best loss: 0.186648 2025-01-16 02:05:39,853 - INFO - step 18046, loss: 0.289803, best loss: 0.186648 2025-01-16 02:05:40,003 - INFO - step 18047, loss: 0.268521, best loss: 0.186648 2025-01-16 02:05:40,153 - INFO - step 18048, loss: 0.243559, best loss: 0.186648 2025-01-16 02:05:40,304 - INFO - step 18049, loss: 0.260823, best loss: 0.186648 2025-01-16 02:05:40,454 - INFO - step 18050, loss: 0.250261, best loss: 0.186648 
2025-01-16 02:05:40,604 - INFO - step 18051, loss: 0.247216, best loss: 0.186648 2025-01-16 02:05:40,754 - INFO - step 18052, loss: 0.289352, best loss: 0.186648 2025-01-16 02:05:40,904 - INFO - step 18053, loss: 0.272724, best loss: 0.186648 2025-01-16 02:05:41,054 - INFO - step 18054, loss: 0.280875, best loss: 0.186648 2025-01-16 02:05:41,204 - INFO - step 18055, loss: 0.267188, best loss: 0.186648 2025-01-16 02:05:41,354 - INFO - step 18056, loss: 0.253678, best loss: 0.186648 2025-01-16 02:05:41,504 - INFO - step 18057, loss: 0.265825, best loss: 0.186648 2025-01-16 02:05:41,654 - INFO - step 18058, loss: 0.255534, best loss: 0.186648 2025-01-16 02:05:41,804 - INFO - step 18059, loss: 0.299209, best loss: 0.186648 2025-01-16 02:05:41,954 - INFO - step 18060, loss: 0.284217, best loss: 0.186648 2025-01-16 02:05:42,104 - INFO - step 18061, loss: 0.282204, best loss: 0.186648 2025-01-16 02:05:42,254 - INFO - step 18062, loss: 0.288828, best loss: 0.186648 2025-01-16 02:05:42,404 - INFO - step 18063, loss: 0.274147, best loss: 0.186648 2025-01-16 02:05:42,554 - INFO - step 18064, loss: 0.319230, best loss: 0.186648 2025-01-16 02:05:42,704 - INFO - step 18065, loss: 0.302577, best loss: 0.186648 2025-01-16 02:05:42,854 - INFO - step 18066, loss: 0.244943, best loss: 0.186648 2025-01-16 02:05:43,004 - INFO - step 18067, loss: 0.277792, best loss: 0.186648 2025-01-16 02:05:43,154 - INFO - step 18068, loss: 0.261754, best loss: 0.186648 2025-01-16 02:05:43,304 - INFO - step 18069, loss: 0.283388, best loss: 0.186648 2025-01-16 02:05:43,454 - INFO - step 18070, loss: 0.252235, best loss: 0.186648 2025-01-16 02:05:43,604 - INFO - step 18071, loss: 0.305715, best loss: 0.186648 2025-01-16 02:05:43,754 - INFO - step 18072, loss: 0.288771, best loss: 0.186648 2025-01-16 02:05:43,904 - INFO - step 18073, loss: 0.266447, best loss: 0.186648 2025-01-16 02:05:44,054 - INFO - step 18074, loss: 0.289171, best loss: 0.186648 2025-01-16 02:05:44,204 - INFO - step 18075, loss: 
0.263379, best loss: 0.186648 2025-01-16 02:05:44,354 - INFO - step 18076, loss: 0.253605, best loss: 0.186648 2025-01-16 02:05:44,505 - INFO - step 18077, loss: 0.337178, best loss: 0.186648 2025-01-16 02:05:44,655 - INFO - step 18078, loss: 0.258690, best loss: 0.186648 2025-01-16 02:05:44,805 - INFO - step 18079, loss: 0.216783, best loss: 0.186648 2025-01-16 02:05:44,955 - INFO - step 18080, loss: 0.295057, best loss: 0.186648 2025-01-16 02:05:45,105 - INFO - step 18081, loss: 0.265654, best loss: 0.186648 2025-01-16 02:05:45,255 - INFO - step 18082, loss: 0.301633, best loss: 0.186648 2025-01-16 02:05:45,405 - INFO - step 18083, loss: 0.250331, best loss: 0.186648 2025-01-16 02:05:45,555 - INFO - step 18084, loss: 0.212398, best loss: 0.186648 2025-01-16 02:05:45,705 - INFO - step 18085, loss: 0.237140, best loss: 0.186648 2025-01-16 02:05:45,855 - INFO - step 18086, loss: 0.240806, best loss: 0.186648 2025-01-16 02:05:46,005 - INFO - step 18087, loss: 0.212231, best loss: 0.186648 2025-01-16 02:05:46,156 - INFO - step 18088, loss: 0.255445, best loss: 0.186648 2025-01-16 02:05:46,306 - INFO - step 18089, loss: 0.239113, best loss: 0.186648 2025-01-16 02:05:46,456 - INFO - step 18090, loss: 0.286676, best loss: 0.186648 2025-01-16 02:05:46,606 - INFO - step 18091, loss: 0.248538, best loss: 0.186648 2025-01-16 02:05:46,756 - INFO - step 18092, loss: 0.224607, best loss: 0.186648 2025-01-16 02:05:46,906 - INFO - step 18093, loss: 0.253104, best loss: 0.186648 2025-01-16 02:05:47,056 - INFO - step 18094, loss: 0.264826, best loss: 0.186648 2025-01-16 02:05:47,206 - INFO - step 18095, loss: 0.322758, best loss: 0.186648 2025-01-16 02:05:47,356 - INFO - step 18096, loss: 0.217285, best loss: 0.186648 2025-01-16 02:05:47,506 - INFO - step 18097, loss: 0.275577, best loss: 0.186648 2025-01-16 02:05:47,656 - INFO - step 18098, loss: 0.248882, best loss: 0.186648 2025-01-16 02:05:47,806 - INFO - step 18099, loss: 0.262436, best loss: 0.186648 2025-01-16 02:05:47,956 - 
INFO - step 18100, loss: 0.281215, best loss: 0.186648 2025-01-16 02:05:48,106 - INFO - step 18101, loss: 0.238425, best loss: 0.186648 2025-01-16 02:05:48,256 - INFO - step 18102, loss: 0.259672, best loss: 0.186648 2025-01-16 02:05:48,406 - INFO - step 18103, loss: 0.297878, best loss: 0.186648 2025-01-16 02:05:48,556 - INFO - step 18104, loss: 0.292367, best loss: 0.186648 2025-01-16 02:05:48,706 - INFO - step 18105, loss: 0.286324, best loss: 0.186648 2025-01-16 02:05:48,857 - INFO - step 18106, loss: 0.310223, best loss: 0.186648 2025-01-16 02:05:49,007 - INFO - step 18107, loss: 0.289324, best loss: 0.186648 2025-01-16 02:05:49,157 - INFO - step 18108, loss: 0.277053, best loss: 0.186648 2025-01-16 02:05:49,307 - INFO - step 18109, loss: 0.318654, best loss: 0.186648 2025-01-16 02:05:49,457 - INFO - step 18110, loss: 0.287166, best loss: 0.186648 2025-01-16 02:05:49,608 - INFO - step 18111, loss: 0.286218, best loss: 0.186648 2025-01-16 02:05:49,758 - INFO - step 18112, loss: 0.262535, best loss: 0.186648 2025-01-16 02:05:49,908 - INFO - step 18113, loss: 0.237418, best loss: 0.186648 2025-01-16 02:05:50,058 - INFO - step 18114, loss: 0.237747, best loss: 0.186648 2025-01-16 02:05:50,209 - INFO - step 18115, loss: 0.304214, best loss: 0.186648 2025-01-16 02:05:50,359 - INFO - step 18116, loss: 0.312302, best loss: 0.186648 2025-01-16 02:05:50,509 - INFO - step 18117, loss: 0.215469, best loss: 0.186648 2025-01-16 02:05:50,659 - INFO - step 18118, loss: 0.247900, best loss: 0.186648 2025-01-16 02:05:50,810 - INFO - step 18119, loss: 0.297402, best loss: 0.186648 2025-01-16 02:05:50,960 - INFO - step 18120, loss: 0.257930, best loss: 0.186648 2025-01-16 02:05:51,110 - INFO - step 18121, loss: 0.291225, best loss: 0.186648 2025-01-16 02:05:51,260 - INFO - step 18122, loss: 0.244722, best loss: 0.186648 2025-01-16 02:05:51,410 - INFO - step 18123, loss: 0.312164, best loss: 0.186648 2025-01-16 02:05:51,560 - INFO - step 18124, loss: 0.319776, best loss: 0.186648 
2025-01-16 02:05:51,710 - INFO - step 18125, loss: 0.347847, best loss: 0.186648 2025-01-16 02:05:51,860 - INFO - step 18126, loss: 0.237802, best loss: 0.186648 2025-01-16 02:05:52,010 - INFO - step 18127, loss: 0.232529, best loss: 0.186648 2025-01-16 02:05:52,160 - INFO - step 18128, loss: 0.227054, best loss: 0.186648 2025-01-16 02:05:52,310 - INFO - step 18129, loss: 0.271334, best loss: 0.186648 2025-01-16 02:05:52,460 - INFO - step 18130, loss: 0.240449, best loss: 0.186648 2025-01-16 02:05:52,610 - INFO - step 18131, loss: 0.266314, best loss: 0.186648 2025-01-16 02:05:52,760 - INFO - step 18132, loss: 0.277299, best loss: 0.186648 2025-01-16 02:05:52,910 - INFO - step 18133, loss: 0.256224, best loss: 0.186648 2025-01-16 02:05:53,060 - INFO - step 18134, loss: 0.268627, best loss: 0.186648 2025-01-16 02:05:53,210 - INFO - step 18135, loss: 0.243764, best loss: 0.186648 2025-01-16 02:05:53,361 - INFO - step 18136, loss: 0.200250, best loss: 0.186648 2025-01-16 02:05:53,511 - INFO - step 18137, loss: 0.233717, best loss: 0.186648 2025-01-16 02:05:53,661 - INFO - step 18138, loss: 0.245933, best loss: 0.186648 2025-01-16 02:05:53,811 - INFO - step 18139, loss: 0.207446, best loss: 0.186648 2025-01-16 02:05:53,961 - INFO - step 18140, loss: 0.332009, best loss: 0.186648 2025-01-16 02:05:54,111 - INFO - step 18141, loss: 0.260420, best loss: 0.186648 2025-01-16 02:05:54,261 - INFO - step 18142, loss: 0.249232, best loss: 0.186648 2025-01-16 02:05:54,411 - INFO - step 18143, loss: 0.269508, best loss: 0.186648 2025-01-16 02:05:54,561 - INFO - step 18144, loss: 0.195760, best loss: 0.186648 2025-01-16 02:05:54,711 - INFO - step 18145, loss: 0.263310, best loss: 0.186648 2025-01-16 02:05:54,862 - INFO - step 18146, loss: 0.318186, best loss: 0.186648 2025-01-16 02:05:55,012 - INFO - step 18147, loss: 0.192641, best loss: 0.186648 2025-01-16 02:05:55,162 - INFO - step 18148, loss: 0.263173, best loss: 0.186648 2025-01-16 02:05:58,678 - INFO - step 18149, loss: 
0.181952, best loss: 0.181952 2025-01-16 02:05:58,841 - INFO - step 18150, loss: 0.262514, best loss: 0.181952 2025-01-16 02:05:58,994 - INFO - step 18151, loss: 0.268790, best loss: 0.181952 2025-01-16 02:05:59,144 - INFO - step 18152, loss: 0.281453, best loss: 0.181952 2025-01-16 02:05:59,294 - INFO - step 18153, loss: 0.279727, best loss: 0.181952 2025-01-16 02:05:59,445 - INFO - step 18154, loss: 0.259592, best loss: 0.181952 2025-01-16 02:05:59,595 - INFO - step 18155, loss: 0.257390, best loss: 0.181952 2025-01-16 02:05:59,745 - INFO - step 18156, loss: 0.288219, best loss: 0.181952 2025-01-16 02:05:59,896 - INFO - step 18157, loss: 0.283529, best loss: 0.181952 2025-01-16 02:06:00,047 - INFO - step 18158, loss: 0.278137, best loss: 0.181952 2025-01-16 02:06:00,197 - INFO - step 18159, loss: 0.260862, best loss: 0.181952 2025-01-16 02:06:00,347 - INFO - step 18160, loss: 0.249079, best loss: 0.181952 2025-01-16 02:06:00,497 - INFO - step 18161, loss: 0.239038, best loss: 0.181952 2025-01-16 02:06:00,647 - INFO - step 18162, loss: 0.218765, best loss: 0.181952 2025-01-16 02:06:00,797 - INFO - step 18163, loss: 0.249490, best loss: 0.181952 2025-01-16 02:06:00,947 - INFO - step 18164, loss: 0.260908, best loss: 0.181952 2025-01-16 02:06:01,097 - INFO - step 18165, loss: 0.236405, best loss: 0.181952 2025-01-16 02:06:01,248 - INFO - step 18166, loss: 0.296657, best loss: 0.181952 2025-01-16 02:06:01,398 - INFO - step 18167, loss: 0.226688, best loss: 0.181952 2025-01-16 02:06:01,548 - INFO - step 18168, loss: 0.240095, best loss: 0.181952 2025-01-16 02:06:01,698 - INFO - step 18169, loss: 0.272053, best loss: 0.181952 2025-01-16 02:06:01,848 - INFO - step 18170, loss: 0.224547, best loss: 0.181952 2025-01-16 02:06:01,999 - INFO - step 18171, loss: 0.280075, best loss: 0.181952 2025-01-16 02:06:02,149 - INFO - step 18172, loss: 0.202700, best loss: 0.181952 2025-01-16 02:06:02,299 - INFO - step 18173, loss: 0.266342, best loss: 0.181952 2025-01-16 02:06:02,449 - 
INFO - step 18174, loss: 0.253110, best loss: 0.181952 2025-01-16 02:06:02,599 - INFO - step 18175, loss: 0.278744, best loss: 0.181952 2025-01-16 02:06:02,749 - INFO - step 18176, loss: 0.221183, best loss: 0.181952 2025-01-16 02:06:02,900 - INFO - step 18177, loss: 0.231453, best loss: 0.181952 2025-01-16 02:06:03,050 - INFO - step 18178, loss: 0.283031, best loss: 0.181952 2025-01-16 02:06:03,200 - INFO - step 18179, loss: 0.266315, best loss: 0.181952 2025-01-16 02:06:03,350 - INFO - step 18180, loss: 0.308853, best loss: 0.181952 2025-01-16 02:06:03,500 - INFO - step 18181, loss: 0.232636, best loss: 0.181952 2025-01-16 02:06:03,650 - INFO - step 18182, loss: 0.244421, best loss: 0.181952 2025-01-16 02:06:03,800 - INFO - step 18183, loss: 0.291799, best loss: 0.181952 2025-01-16 02:06:03,950 - INFO - step 18184, loss: 0.295724, best loss: 0.181952 2025-01-16 02:06:04,101 - INFO - step 18185, loss: 0.214511, best loss: 0.181952 2025-01-16 02:06:04,251 - INFO - step 18186, loss: 0.217178, best loss: 0.181952 2025-01-16 02:06:04,400 - INFO - step 18187, loss: 0.320153, best loss: 0.181952 2025-01-16 02:06:04,551 - INFO - step 18188, loss: 0.243884, best loss: 0.181952 2025-01-16 02:06:04,701 - INFO - step 18189, loss: 0.227565, best loss: 0.181952 2025-01-16 02:06:04,851 - INFO - step 18190, loss: 0.270292, best loss: 0.181952 2025-01-16 02:06:05,001 - INFO - step 18191, loss: 0.249104, best loss: 0.181952 2025-01-16 02:06:05,151 - INFO - step 18192, loss: 0.301166, best loss: 0.181952 2025-01-16 02:06:05,301 - INFO - step 18193, loss: 0.295826, best loss: 0.181952 2025-01-16 02:06:05,451 - INFO - step 18194, loss: 0.262402, best loss: 0.181952 2025-01-16 02:06:05,601 - INFO - step 18195, loss: 0.258618, best loss: 0.181952 2025-01-16 02:06:05,751 - INFO - step 18196, loss: 0.281200, best loss: 0.181952 2025-01-16 02:06:05,902 - INFO - step 18197, loss: 0.255271, best loss: 0.181952 2025-01-16 02:06:06,053 - INFO - step 18198, loss: 0.254168, best loss: 0.181952 
2025-01-16 02:06:06,203 - INFO - step 18199, loss: 0.333777, best loss: 0.181952 2025-01-16 02:06:06,353 - INFO - step 18200, loss: 0.278264, best loss: 0.181952 2025-01-16 02:06:06,503 - INFO - step 18201, loss: 0.244652, best loss: 0.181952 2025-01-16 02:06:06,653 - INFO - step 18202, loss: 0.246129, best loss: 0.181952 2025-01-16 02:06:06,803 - INFO - step 18203, loss: 0.341270, best loss: 0.181952 2025-01-16 02:06:06,953 - INFO - step 18204, loss: 0.251754, best loss: 0.181952 2025-01-16 02:06:07,103 - INFO - step 18205, loss: 0.240742, best loss: 0.181952 2025-01-16 02:06:07,253 - INFO - step 18206, loss: 0.221032, best loss: 0.181952 2025-01-16 02:06:07,403 - INFO - step 18207, loss: 0.255164, best loss: 0.181952 2025-01-16 02:06:07,553 - INFO - step 18208, loss: 0.248381, best loss: 0.181952 2025-01-16 02:06:07,703 - INFO - step 18209, loss: 0.309300, best loss: 0.181952 2025-01-16 02:06:07,853 - INFO - step 18210, loss: 0.306658, best loss: 0.181952 2025-01-16 02:06:08,003 - INFO - step 18211, loss: 0.354395, best loss: 0.181952 2025-01-16 02:06:08,153 - INFO - step 18212, loss: 0.348693, best loss: 0.181952 2025-01-16 02:06:08,303 - INFO - step 18213, loss: 0.291597, best loss: 0.181952 2025-01-16 02:06:08,453 - INFO - step 18214, loss: 0.323404, best loss: 0.181952 2025-01-16 02:06:08,604 - INFO - step 18215, loss: 0.267803, best loss: 0.181952 2025-01-16 02:06:08,754 - INFO - step 18216, loss: 0.248076, best loss: 0.181952 2025-01-16 02:06:08,904 - INFO - step 18217, loss: 0.259183, best loss: 0.181952 2025-01-16 02:06:09,054 - INFO - step 18218, loss: 0.284361, best loss: 0.181952 2025-01-16 02:06:09,204 - INFO - step 18219, loss: 0.267597, best loss: 0.181952 2025-01-16 02:06:09,354 - INFO - step 18220, loss: 0.260727, best loss: 0.181952 2025-01-16 02:06:09,505 - INFO - step 18221, loss: 0.286993, best loss: 0.181952 2025-01-16 02:06:09,655 - INFO - step 18222, loss: 0.301553, best loss: 0.181952 2025-01-16 02:06:09,805 - INFO - step 18223, loss: 
0.264415, best loss: 0.181952 2025-01-16 02:06:09,955 - INFO - step 18224, loss: 0.292606, best loss: 0.181952 2025-01-16 02:06:10,105 - INFO - step 18225, loss: 0.313252, best loss: 0.181952 2025-01-16 02:06:10,255 - INFO - step 18226, loss: 0.307856, best loss: 0.181952 2025-01-16 02:06:10,405 - INFO - step 18227, loss: 0.334778, best loss: 0.181952 2025-01-16 02:06:10,555 - INFO - step 18228, loss: 0.277477, best loss: 0.181952 2025-01-16 02:06:10,705 - INFO - step 18229, loss: 0.291214, best loss: 0.181952 2025-01-16 02:06:10,855 - INFO - step 18230, loss: 0.314468, best loss: 0.181952 2025-01-16 02:06:11,005 - INFO - step 18231, loss: 0.343437, best loss: 0.181952 2025-01-16 02:06:11,155 - INFO - step 18232, loss: 0.250804, best loss: 0.181952 2025-01-16 02:06:11,305 - INFO - step 18233, loss: 0.210881, best loss: 0.181952 2025-01-16 02:06:11,455 - INFO - step 18234, loss: 0.278012, best loss: 0.181952 2025-01-16 02:06:11,605 - INFO - step 18235, loss: 0.298923, best loss: 0.181952 2025-01-16 02:06:11,756 - INFO - step 18236, loss: 0.276485, best loss: 0.181952 2025-01-16 02:06:11,906 - INFO - step 18237, loss: 0.233402, best loss: 0.181952 2025-01-16 02:06:12,056 - INFO - step 18238, loss: 0.286043, best loss: 0.181952 2025-01-16 02:06:12,206 - INFO - step 18239, loss: 0.266034, best loss: 0.181952 2025-01-16 02:06:12,356 - INFO - step 18240, loss: 0.213987, best loss: 0.181952 2025-01-16 02:06:12,506 - INFO - step 18241, loss: 0.273256, best loss: 0.181952 2025-01-16 02:06:12,656 - INFO - step 18242, loss: 0.274172, best loss: 0.181952 2025-01-16 02:06:12,806 - INFO - step 18243, loss: 0.321597, best loss: 0.181952 2025-01-16 02:06:12,956 - INFO - step 18244, loss: 0.226361, best loss: 0.181952 2025-01-16 02:06:13,106 - INFO - step 18245, loss: 0.333627, best loss: 0.181952 2025-01-16 02:06:13,257 - INFO - step 18246, loss: 0.294254, best loss: 0.181952 2025-01-16 02:06:13,407 - INFO - step 18247, loss: 0.237401, best loss: 0.181952 2025-01-16 02:06:13,557 - 
INFO - step 18248, loss: 0.276967, best loss: 0.181952 2025-01-16 02:06:13,707 - INFO - step 18249, loss: 0.245856, best loss: 0.181952 2025-01-16 02:06:13,857 - INFO - step 18250, loss: 0.247959, best loss: 0.181952 2025-01-16 02:06:14,007 - INFO - step 18251, loss: 0.233339, best loss: 0.181952 2025-01-16 02:06:14,157 - INFO - step 18252, loss: 0.271488, best loss: 0.181952 2025-01-16 02:06:14,307 - INFO - step 18253, loss: 0.230248, best loss: 0.181952 2025-01-16 02:06:14,457 - INFO - step 18254, loss: 0.284379, best loss: 0.181952 2025-01-16 02:06:14,607 - INFO - step 18255, loss: 0.278217, best loss: 0.181952 2025-01-16 02:06:14,757 - INFO - step 18256, loss: 0.283649, best loss: 0.181952 2025-01-16 02:06:14,907 - INFO - step 18257, loss: 0.281771, best loss: 0.181952 2025-01-16 02:06:15,057 - INFO - step 18258, loss: 0.239443, best loss: 0.181952 2025-01-16 02:06:15,207 - INFO - step 18259, loss: 0.278897, best loss: 0.181952 2025-01-16 02:06:15,358 - INFO - step 18260, loss: 0.216298, best loss: 0.181952 2025-01-16 02:06:15,508 - INFO - step 18261, loss: 0.256612, best loss: 0.181952 2025-01-16 02:06:15,658 - INFO - step 18262, loss: 0.231978, best loss: 0.181952 2025-01-16 02:06:15,808 - INFO - step 18263, loss: 0.249664, best loss: 0.181952 2025-01-16 02:06:15,958 - INFO - step 18264, loss: 0.265879, best loss: 0.181952 2025-01-16 02:06:16,109 - INFO - step 18265, loss: 0.249373, best loss: 0.181952 2025-01-16 02:06:16,259 - INFO - step 18266, loss: 0.274574, best loss: 0.181952 2025-01-16 02:06:16,409 - INFO - step 18267, loss: 0.282632, best loss: 0.181952 2025-01-16 02:06:16,559 - INFO - step 18268, loss: 0.204223, best loss: 0.181952 2025-01-16 02:06:16,709 - INFO - step 18269, loss: 0.264405, best loss: 0.181952 2025-01-16 02:06:16,859 - INFO - step 18270, loss: 0.250633, best loss: 0.181952 2025-01-16 02:06:17,009 - INFO - step 18271, loss: 0.235127, best loss: 0.181952 2025-01-16 02:06:17,159 - INFO - step 18272, loss: 0.281381, best loss: 0.181952 
2025-01-16 02:06:17,309 - INFO - step 18273, loss: 0.194879, best loss: 0.181952 2025-01-16 02:06:17,459 - INFO - step 18274, loss: 0.257975, best loss: 0.181952 2025-01-16 02:06:17,609 - INFO - step 18275, loss: 0.257396, best loss: 0.181952 2025-01-16 02:06:17,759 - INFO - step 18276, loss: 0.278035, best loss: 0.181952 2025-01-16 02:06:17,910 - INFO - step 18277, loss: 0.271861, best loss: 0.181952 2025-01-16 02:06:18,059 - INFO - step 18278, loss: 0.235860, best loss: 0.181952 2025-01-16 02:06:18,210 - INFO - step 18279, loss: 0.249745, best loss: 0.181952 2025-01-16 02:06:18,360 - INFO - step 18280, loss: 0.241907, best loss: 0.181952 2025-01-16 02:06:18,510 - INFO - step 18281, loss: 0.264863, best loss: 0.181952 2025-01-16 02:06:18,660 - INFO - step 18282, loss: 0.275987, best loss: 0.181952 2025-01-16 02:06:18,810 - INFO - step 18283, loss: 0.211680, best loss: 0.181952 2025-01-16 02:06:18,960 - INFO - step 18284, loss: 0.238122, best loss: 0.181952 2025-01-16 02:06:19,110 - INFO - step 18285, loss: 0.233927, best loss: 0.181952 2025-01-16 02:06:19,260 - INFO - step 18286, loss: 0.209789, best loss: 0.181952 2025-01-16 02:06:19,410 - INFO - step 18287, loss: 0.274888, best loss: 0.181952 2025-01-16 02:06:19,560 - INFO - step 18288, loss: 0.259372, best loss: 0.181952 2025-01-16 02:06:19,710 - INFO - step 18289, loss: 0.246202, best loss: 0.181952 2025-01-16 02:06:19,860 - INFO - step 18290, loss: 0.282610, best loss: 0.181952 2025-01-16 02:06:20,011 - INFO - step 18291, loss: 0.213994, best loss: 0.181952 2025-01-16 02:06:20,161 - INFO - step 18292, loss: 0.278295, best loss: 0.181952 2025-01-16 02:06:20,311 - INFO - step 18293, loss: 0.240038, best loss: 0.181952 2025-01-16 02:06:20,461 - INFO - step 18294, loss: 0.190841, best loss: 0.181952 2025-01-16 02:06:20,611 - INFO - step 18295, loss: 0.277198, best loss: 0.181952 2025-01-16 02:06:20,761 - INFO - step 18296, loss: 0.297548, best loss: 0.181952 2025-01-16 02:06:20,911 - INFO - step 18297, loss: 
0.248916, best loss: 0.181952 2025-01-16 02:06:21,061 - INFO - step 18298, loss: 0.226080, best loss: 0.181952 2025-01-16 02:06:21,212 - INFO - step 18299, loss: 0.227860, best loss: 0.181952 2025-01-16 02:06:21,362 - INFO - step 18300, loss: 0.275141, best loss: 0.181952 2025-01-16 02:06:21,512 - INFO - step 18301, loss: 0.288166, best loss: 0.181952 2025-01-16 02:06:21,662 - INFO - step 18302, loss: 0.255668, best loss: 0.181952 2025-01-16 02:06:21,812 - INFO - step 18303, loss: 0.267180, best loss: 0.181952 2025-01-16 02:06:21,963 - INFO - step 18304, loss: 0.265543, best loss: 0.181952 2025-01-16 02:06:22,113 - INFO - step 18305, loss: 0.315909, best loss: 0.181952 2025-01-16 02:06:22,263 - INFO - step 18306, loss: 0.244506, best loss: 0.181952 2025-01-16 02:06:22,413 - INFO - step 18307, loss: 0.304445, best loss: 0.181952 2025-01-16 02:06:22,563 - INFO - step 18308, loss: 0.295009, best loss: 0.181952 2025-01-16 02:06:22,713 - INFO - step 18309, loss: 0.236353, best loss: 0.181952 2025-01-16 02:06:22,863 - INFO - step 18310, loss: 0.268641, best loss: 0.181952 2025-01-16 02:06:23,013 - INFO - step 18311, loss: 0.296328, best loss: 0.181952 2025-01-16 02:06:23,163 - INFO - step 18312, loss: 0.320804, best loss: 0.181952 2025-01-16 02:06:23,313 - INFO - step 18313, loss: 0.260338, best loss: 0.181952 2025-01-16 02:06:23,463 - INFO - step 18314, loss: 0.314426, best loss: 0.181952 2025-01-16 02:06:23,613 - INFO - step 18315, loss: 0.292264, best loss: 0.181952 2025-01-16 02:06:23,763 - INFO - step 18316, loss: 0.321424, best loss: 0.181952 2025-01-16 02:06:23,914 - INFO - step 18317, loss: 0.328095, best loss: 0.181952 2025-01-16 02:06:24,064 - INFO - step 18318, loss: 0.273316, best loss: 0.181952 2025-01-16 02:06:24,214 - INFO - step 18319, loss: 0.279182, best loss: 0.181952 2025-01-16 02:06:24,364 - INFO - step 18320, loss: 0.284244, best loss: 0.181952 2025-01-16 02:06:24,514 - INFO - step 18321, loss: 0.258027, best loss: 0.181952 2025-01-16 02:06:24,664 - 
INFO - step 18322, loss: 0.250125, best loss: 0.181952 2025-01-16 02:06:24,814 - INFO - step 18323, loss: 0.244201, best loss: 0.181952 2025-01-16 02:06:24,965 - INFO - step 18324, loss: 0.275184, best loss: 0.181952 2025-01-16 02:06:25,115 - INFO - step 18325, loss: 0.251524, best loss: 0.181952 2025-01-16 02:06:25,265 - INFO - step 18326, loss: 0.261394, best loss: 0.181952 2025-01-16 02:06:25,415 - INFO - step 18327, loss: 0.298082, best loss: 0.181952 2025-01-16 02:06:25,565 - INFO - step 18328, loss: 0.275992, best loss: 0.181952 2025-01-16 02:06:25,715 - INFO - step 18329, loss: 0.242940, best loss: 0.181952 2025-01-16 02:06:25,866 - INFO - step 18330, loss: 0.295650, best loss: 0.181952 2025-01-16 02:06:26,016 - INFO - step 18331, loss: 0.287674, best loss: 0.181952 2025-01-16 02:06:26,166 - INFO - step 18332, loss: 0.283396, best loss: 0.181952 2025-01-16 02:06:26,316 - INFO - step 18333, loss: 0.338508, best loss: 0.181952 2025-01-16 02:06:26,466 - INFO - step 18334, loss: 0.265623, best loss: 0.181952 2025-01-16 02:06:26,616 - INFO - step 18335, loss: 0.282227, best loss: 0.181952 2025-01-16 02:06:26,766 - INFO - step 18336, loss: 0.282718, best loss: 0.181952 2025-01-16 02:06:26,916 - INFO - step 18337, loss: 0.336698, best loss: 0.181952 2025-01-16 02:06:27,066 - INFO - step 18338, loss: 0.245674, best loss: 0.181952 2025-01-16 02:06:27,216 - INFO - step 18339, loss: 0.274043, best loss: 0.181952 2025-01-16 02:06:27,366 - INFO - step 18340, loss: 0.236157, best loss: 0.181952 2025-01-16 02:06:27,516 - INFO - step 18341, loss: 0.226022, best loss: 0.181952 2025-01-16 02:06:27,666 - INFO - step 18342, loss: 0.264760, best loss: 0.181952 2025-01-16 02:06:27,816 - INFO - step 18343, loss: 0.288706, best loss: 0.181952 2025-01-16 02:06:27,967 - INFO - step 18344, loss: 0.271270, best loss: 0.181952 2025-01-16 02:06:28,117 - INFO - step 18345, loss: 0.235291, best loss: 0.181952 2025-01-16 02:06:28,267 - INFO - step 18346, loss: 0.258003, best loss: 0.181952 
2025-01-16 02:06:28,417 - INFO - step 18347, loss: 0.262320, best loss: 0.181952 2025-01-16 02:06:28,568 - INFO - step 18348, loss: 0.237251, best loss: 0.181952 2025-01-16 02:06:28,718 - INFO - step 18349, loss: 0.232017, best loss: 0.181952 2025-01-16 02:06:28,868 - INFO - step 18350, loss: 0.237277, best loss: 0.181952 2025-01-16 02:06:29,018 - INFO - step 18351, loss: 0.297462, best loss: 0.181952 2025-01-16 02:06:29,168 - INFO - step 18352, loss: 0.316961, best loss: 0.181952 2025-01-16 02:06:29,319 - INFO - step 18353, loss: 0.234665, best loss: 0.181952 2025-01-16 02:06:29,469 - INFO - step 18354, loss: 0.259493, best loss: 0.181952 2025-01-16 02:06:29,619 - INFO - step 18355, loss: 0.237933, best loss: 0.181952 2025-01-16 02:06:29,769 - INFO - step 18356, loss: 0.206887, best loss: 0.181952 2025-01-16 02:06:29,919 - INFO - step 18357, loss: 0.205895, best loss: 0.181952 2025-01-16 02:06:30,069 - INFO - step 18358, loss: 0.239727, best loss: 0.181952 2025-01-16 02:06:30,219 - INFO - step 18359, loss: 0.249417, best loss: 0.181952 2025-01-16 02:06:30,369 - INFO - step 18360, loss: 0.275980, best loss: 0.181952 2025-01-16 02:06:30,519 - INFO - step 18361, loss: 0.256892, best loss: 0.181952 2025-01-16 02:06:30,670 - INFO - step 18362, loss: 0.207008, best loss: 0.181952 2025-01-16 02:06:30,820 - INFO - step 18363, loss: 0.254004, best loss: 0.181952 2025-01-16 02:06:30,970 - INFO - step 18364, loss: 0.238209, best loss: 0.181952 2025-01-16 02:06:31,120 - INFO - step 18365, loss: 0.284354, best loss: 0.181952 2025-01-16 02:06:31,270 - INFO - step 18366, loss: 0.257978, best loss: 0.181952 2025-01-16 02:06:31,420 - INFO - step 18367, loss: 0.248003, best loss: 0.181952 2025-01-16 02:06:31,570 - INFO - step 18368, loss: 0.230833, best loss: 0.181952 2025-01-16 02:06:31,720 - INFO - step 18369, loss: 0.314677, best loss: 0.181952 2025-01-16 02:06:31,870 - INFO - step 18370, loss: 0.273830, best loss: 0.181952 2025-01-16 02:06:32,020 - INFO - step 18371, loss: 
0.318369, best loss: 0.181952 2025-01-16 02:06:32,171 - INFO - step 18372, loss: 0.324962, best loss: 0.181952 2025-01-16 02:06:32,321 - INFO - step 18373, loss: 0.253764, best loss: 0.181952 2025-01-16 02:06:32,471 - INFO - step 18374, loss: 0.316532, best loss: 0.181952 2025-01-16 02:06:32,621 - INFO - step 18375, loss: 0.269651, best loss: 0.181952 2025-01-16 02:06:32,771 - INFO - step 18376, loss: 0.267653, best loss: 0.181952 2025-01-16 02:06:32,921 - INFO - step 18377, loss: 0.255880, best loss: 0.181952 2025-01-16 02:06:33,072 - INFO - step 18378, loss: 0.249342, best loss: 0.181952 2025-01-16 02:06:33,222 - INFO - step 18379, loss: 0.289344, best loss: 0.181952 2025-01-16 02:06:33,372 - INFO - step 18380, loss: 0.301952, best loss: 0.181952 2025-01-16 02:06:33,522 - INFO - step 18381, loss: 0.267227, best loss: 0.181952 2025-01-16 02:06:33,672 - INFO - step 18382, loss: 0.232445, best loss: 0.181952 2025-01-16 02:06:33,822 - INFO - step 18383, loss: 0.266542, best loss: 0.181952 2025-01-16 02:06:33,972 - INFO - step 18384, loss: 0.276331, best loss: 0.181952 2025-01-16 02:06:34,122 - INFO - step 18385, loss: 0.253175, best loss: 0.181952 2025-01-16 02:06:34,273 - INFO - step 18386, loss: 0.269740, best loss: 0.181952 2025-01-16 02:06:34,423 - INFO - step 18387, loss: 0.259269, best loss: 0.181952 2025-01-16 02:06:34,573 - INFO - step 18388, loss: 0.264170, best loss: 0.181952 2025-01-16 02:06:34,723 - INFO - step 18389, loss: 0.269007, best loss: 0.181952 2025-01-16 02:06:34,873 - INFO - step 18390, loss: 0.349027, best loss: 0.181952 2025-01-16 02:06:35,023 - INFO - step 18391, loss: 0.326875, best loss: 0.181952 2025-01-16 02:06:35,173 - INFO - step 18392, loss: 0.281355, best loss: 0.181952 2025-01-16 02:06:35,324 - INFO - step 18393, loss: 0.275991, best loss: 0.181952 2025-01-16 02:06:35,474 - INFO - step 18394, loss: 0.322904, best loss: 0.181952 2025-01-16 02:06:35,624 - INFO - step 18395, loss: 0.265316, best loss: 0.181952 2025-01-16 02:06:35,774 - 
INFO - step 18396, loss: 0.286832, best loss: 0.181952 2025-01-16 02:06:35,924 - INFO - step 18397, loss: 0.239294, best loss: 0.181952 2025-01-16 02:06:36,074 - INFO - step 18398, loss: 0.280844, best loss: 0.181952 2025-01-16 02:06:36,224 - INFO - step 18399, loss: 0.298642, best loss: 0.181952 2025-01-16 02:06:36,374 - INFO - step 18400, loss: 0.256338, best loss: 0.181952 2025-01-16 02:06:36,524 - INFO - step 18401, loss: 0.298800, best loss: 0.181952 2025-01-16 02:06:36,674 - INFO - step 18402, loss: 0.251750, best loss: 0.181952 2025-01-16 02:06:36,824 - INFO - step 18403, loss: 0.262611, best loss: 0.181952 2025-01-16 02:06:36,974 - INFO - step 18404, loss: 0.267101, best loss: 0.181952 2025-01-16 02:06:37,124 - INFO - step 18405, loss: 0.226965, best loss: 0.181952 2025-01-16 02:06:37,274 - INFO - step 18406, loss: 0.268168, best loss: 0.181952 2025-01-16 02:06:37,424 - INFO - step 18407, loss: 0.306197, best loss: 0.181952 2025-01-16 02:06:37,575 - INFO - step 18408, loss: 0.292420, best loss: 0.181952 2025-01-16 02:06:37,725 - INFO - step 18409, loss: 0.242363, best loss: 0.181952 2025-01-16 02:06:37,875 - INFO - step 18410, loss: 0.253142, best loss: 0.181952 2025-01-16 02:06:38,025 - INFO - step 18411, loss: 0.236197, best loss: 0.181952 2025-01-16 02:06:38,175 - INFO - step 18412, loss: 0.264316, best loss: 0.181952 2025-01-16 02:06:38,325 - INFO - step 18413, loss: 0.220355, best loss: 0.181952 2025-01-16 02:06:38,475 - INFO - step 18414, loss: 0.236391, best loss: 0.181952 2025-01-16 02:06:38,625 - INFO - step 18415, loss: 0.233360, best loss: 0.181952 2025-01-16 02:06:38,775 - INFO - step 18416, loss: 0.223959, best loss: 0.181952 2025-01-16 02:06:38,926 - INFO - step 18417, loss: 0.227606, best loss: 0.181952 2025-01-16 02:06:39,075 - INFO - step 18418, loss: 0.268819, best loss: 0.181952 2025-01-16 02:06:39,225 - INFO - step 18419, loss: 0.268015, best loss: 0.181952 2025-01-16 02:06:39,375 - INFO - step 18420, loss: 0.260528, best loss: 0.181952 
2025-01-16 02:06:39,526 - INFO - step 18421, loss: 0.227571, best loss: 0.181952 2025-01-16 02:06:39,676 - INFO - step 18422, loss: 0.238022, best loss: 0.181952 2025-01-16 02:06:39,826 - INFO - step 18423, loss: 0.223896, best loss: 0.181952 2025-01-16 02:06:39,976 - INFO - step 18424, loss: 0.274762, best loss: 0.181952 2025-01-16 02:06:40,126 - INFO - step 18425, loss: 0.236442, best loss: 0.181952 2025-01-16 02:06:40,276 - INFO - step 18426, loss: 0.254836, best loss: 0.181952 2025-01-16 02:06:40,426 - INFO - step 18427, loss: 0.298632, best loss: 0.181952 2025-01-16 02:06:40,576 - INFO - step 18428, loss: 0.250098, best loss: 0.181952 2025-01-16 02:06:40,726 - INFO - step 18429, loss: 0.284371, best loss: 0.181952 2025-01-16 02:06:40,876 - INFO - step 18430, loss: 0.283806, best loss: 0.181952 2025-01-16 02:06:41,027 - INFO - step 18431, loss: 0.260681, best loss: 0.181952 2025-01-16 02:06:41,176 - INFO - step 18432, loss: 0.263466, best loss: 0.181952 2025-01-16 02:06:41,326 - INFO - step 18433, loss: 0.262466, best loss: 0.181952 2025-01-16 02:06:41,476 - INFO - step 18434, loss: 0.267874, best loss: 0.181952 2025-01-16 02:06:41,626 - INFO - step 18435, loss: 0.238024, best loss: 0.181952 2025-01-16 02:06:41,776 - INFO - step 18436, loss: 0.272771, best loss: 0.181952 2025-01-16 02:06:41,927 - INFO - step 18437, loss: 0.236280, best loss: 0.181952 2025-01-16 02:06:42,077 - INFO - step 18438, loss: 0.232024, best loss: 0.181952 2025-01-16 02:06:42,227 - INFO - step 18439, loss: 0.271365, best loss: 0.181952 2025-01-16 02:06:42,377 - INFO - step 18440, loss: 0.303914, best loss: 0.181952 2025-01-16 02:06:42,527 - INFO - step 18441, loss: 0.212903, best loss: 0.181952 2025-01-16 02:06:42,677 - INFO - step 18442, loss: 0.272649, best loss: 0.181952 2025-01-16 02:06:42,827 - INFO - step 18443, loss: 0.256191, best loss: 0.181952 2025-01-16 02:06:42,977 - INFO - step 18444, loss: 0.262203, best loss: 0.181952 2025-01-16 02:06:43,127 - INFO - step 18445, loss: 
0.248117, best loss: 0.181952 2025-01-16 02:06:43,277 - INFO - step 18446, loss: 0.223078, best loss: 0.181952 2025-01-16 02:06:43,428 - INFO - step 18447, loss: 0.285396, best loss: 0.181952 2025-01-16 02:06:43,578 - INFO - step 18448, loss: 0.292583, best loss: 0.181952 2025-01-16 02:06:43,728 - INFO - step 18449, loss: 0.234977, best loss: 0.181952 2025-01-16 02:06:43,878 - INFO - step 18450, loss: 0.243724, best loss: 0.181952 2025-01-16 02:06:44,028 - INFO - step 18451, loss: 0.242143, best loss: 0.181952 2025-01-16 02:06:44,178 - INFO - step 18452, loss: 0.270750, best loss: 0.181952 2025-01-16 02:06:44,328 - INFO - step 18453, loss: 0.245848, best loss: 0.181952 2025-01-16 02:06:44,478 - INFO - step 18454, loss: 0.250222, best loss: 0.181952 2025-01-16 02:06:44,628 - INFO - step 18455, loss: 0.321575, best loss: 0.181952 2025-01-16 02:06:44,778 - INFO - step 18456, loss: 0.233675, best loss: 0.181952 2025-01-16 02:06:44,928 - INFO - step 18457, loss: 0.225249, best loss: 0.181952 2025-01-16 02:06:45,078 - INFO - step 18458, loss: 0.256324, best loss: 0.181952 2025-01-16 02:06:45,228 - INFO - step 18459, loss: 0.312691, best loss: 0.181952 2025-01-16 02:06:45,378 - INFO - step 18460, loss: 0.224025, best loss: 0.181952 2025-01-16 02:06:45,528 - INFO - step 18461, loss: 0.242614, best loss: 0.181952 2025-01-16 02:06:45,678 - INFO - step 18462, loss: 0.323310, best loss: 0.181952 2025-01-16 02:06:45,828 - INFO - step 18463, loss: 0.223611, best loss: 0.181952 2025-01-16 02:06:45,979 - INFO - step 18464, loss: 0.221241, best loss: 0.181952 2025-01-16 02:06:46,129 - INFO - step 18465, loss: 0.224424, best loss: 0.181952 2025-01-16 02:06:46,279 - INFO - step 18466, loss: 0.240727, best loss: 0.181952 2025-01-16 02:06:46,430 - INFO - step 18467, loss: 0.224084, best loss: 0.181952 2025-01-16 02:06:46,580 - INFO - step 18468, loss: 0.264259, best loss: 0.181952 2025-01-16 02:06:46,730 - INFO - step 18469, loss: 0.237089, best loss: 0.181952 2025-01-16 02:06:46,880 - 
INFO - step 18470, loss: 0.300382, best loss: 0.181952 2025-01-16 02:06:47,030 - INFO - step 18471, loss: 0.266615, best loss: 0.181952 2025-01-16 02:06:47,180 - INFO - step 18472, loss: 0.260447, best loss: 0.181952 2025-01-16 02:06:47,330 - INFO - step 18473, loss: 0.283696, best loss: 0.181952 2025-01-16 02:06:47,480 - INFO - step 18474, loss: 0.239034, best loss: 0.181952 2025-01-16 02:06:47,630 - INFO - step 18475, loss: 0.272843, best loss: 0.181952 2025-01-16 02:06:47,781 - INFO - step 18476, loss: 0.344325, best loss: 0.181952 2025-01-16 02:06:47,931 - INFO - step 18477, loss: 0.239878, best loss: 0.181952 2025-01-16 02:06:48,081 - INFO - step 18478, loss: 0.337733, best loss: 0.181952 2025-01-16 02:06:48,231 - INFO - step 18479, loss: 0.247337, best loss: 0.181952 2025-01-16 02:06:48,381 - INFO - step 18480, loss: 0.290106, best loss: 0.181952 2025-01-16 02:06:48,531 - INFO - step 18481, loss: 0.292257, best loss: 0.181952 2025-01-16 02:06:48,681 - INFO - step 18482, loss: 0.236283, best loss: 0.181952 2025-01-16 02:06:48,831 - INFO - step 18483, loss: 0.263656, best loss: 0.181952 2025-01-16 02:06:48,981 - INFO - step 18484, loss: 0.235345, best loss: 0.181952 2025-01-16 02:06:49,131 - INFO - step 18485, loss: 0.291953, best loss: 0.181952 2025-01-16 02:06:49,281 - INFO - step 18486, loss: 0.304459, best loss: 0.181952 2025-01-16 02:06:49,431 - INFO - step 18487, loss: 0.227831, best loss: 0.181952 2025-01-16 02:06:49,581 - INFO - step 18488, loss: 0.263126, best loss: 0.181952 2025-01-16 02:06:49,731 - INFO - step 18489, loss: 0.283377, best loss: 0.181952 2025-01-16 02:06:49,881 - INFO - step 18490, loss: 0.221713, best loss: 0.181952 2025-01-16 02:06:50,031 - INFO - step 18491, loss: 0.242272, best loss: 0.181952 2025-01-16 02:06:50,181 - INFO - step 18492, loss: 0.213691, best loss: 0.181952 2025-01-16 02:06:50,331 - INFO - step 18493, loss: 0.259902, best loss: 0.181952 2025-01-16 02:06:50,481 - INFO - step 18494, loss: 0.261055, best loss: 0.181952 
2025-01-16 02:06:50,631 - INFO - step 18495, loss: 0.232001, best loss: 0.181952 2025-01-16 02:06:50,781 - INFO - step 18496, loss: 0.292708, best loss: 0.181952 2025-01-16 02:06:50,931 - INFO - step 18497, loss: 0.249170, best loss: 0.181952 2025-01-16 02:06:51,081 - INFO - step 18498, loss: 0.231077, best loss: 0.181952 2025-01-16 02:06:51,231 - INFO - step 18499, loss: 0.239381, best loss: 0.181952 2025-01-16 02:06:51,381 - INFO - step 18500, loss: 0.244159, best loss: 0.181952 2025-01-16 02:06:51,531 - INFO - step 18501, loss: 0.256262, best loss: 0.181952 2025-01-16 02:06:51,681 - INFO - step 18502, loss: 0.210861, best loss: 0.181952 2025-01-16 02:06:51,831 - INFO - step 18503, loss: 0.270807, best loss: 0.181952 2025-01-16 02:06:51,981 - INFO - step 18504, loss: 0.270378, best loss: 0.181952 2025-01-16 02:06:52,131 - INFO - step 18505, loss: 0.263131, best loss: 0.181952 2025-01-16 02:06:52,281 - INFO - step 18506, loss: 0.290354, best loss: 0.181952 2025-01-16 02:06:52,431 - INFO - step 18507, loss: 0.217339, best loss: 0.181952 2025-01-16 02:06:52,582 - INFO - step 18508, loss: 0.222199, best loss: 0.181952 2025-01-16 02:06:52,732 - INFO - step 18509, loss: 0.255935, best loss: 0.181952 2025-01-16 02:06:52,882 - INFO - step 18510, loss: 0.266892, best loss: 0.181952 2025-01-16 02:06:53,033 - INFO - step 18511, loss: 0.254569, best loss: 0.181952 2025-01-16 02:06:53,183 - INFO - step 18512, loss: 0.216916, best loss: 0.181952 2025-01-16 02:06:53,333 - INFO - step 18513, loss: 0.258339, best loss: 0.181952 2025-01-16 02:06:53,483 - INFO - step 18514, loss: 0.290112, best loss: 0.181952 2025-01-16 02:06:53,633 - INFO - step 18515, loss: 0.217456, best loss: 0.181952 2025-01-16 02:06:53,783 - INFO - step 18516, loss: 0.275032, best loss: 0.181952 2025-01-16 02:06:53,934 - INFO - step 18517, loss: 0.323855, best loss: 0.181952 2025-01-16 02:06:54,084 - INFO - step 18518, loss: 0.315260, best loss: 0.181952 2025-01-16 02:06:54,234 - INFO - step 18519, loss: 
0.215958, best loss: 0.181952 2025-01-16 02:06:54,384 - INFO - step 18520, loss: 0.299584, best loss: 0.181952 2025-01-16 02:06:54,534 - INFO - step 18521, loss: 0.240939, best loss: 0.181952 2025-01-16 02:06:54,684 - INFO - step 18522, loss: 0.260708, best loss: 0.181952 2025-01-16 02:06:54,834 - INFO - step 18523, loss: 0.208540, best loss: 0.181952 2025-01-16 02:06:54,984 - INFO - step 18524, loss: 0.230734, best loss: 0.181952 2025-01-16 02:06:55,134 - INFO - step 18525, loss: 0.253855, best loss: 0.181952 2025-01-16 02:06:55,284 - INFO - step 18526, loss: 0.257414, best loss: 0.181952 2025-01-16 02:06:55,434 - INFO - step 18527, loss: 0.231946, best loss: 0.181952 2025-01-16 02:06:55,584 - INFO - step 18528, loss: 0.290791, best loss: 0.181952 2025-01-16 02:06:55,734 - INFO - step 18529, loss: 0.306350, best loss: 0.181952 2025-01-16 02:06:55,884 - INFO - step 18530, loss: 0.301203, best loss: 0.181952 2025-01-16 02:06:56,034 - INFO - step 18531, loss: 0.258987, best loss: 0.181952 2025-01-16 02:06:56,185 - INFO - step 18532, loss: 0.288284, best loss: 0.181952 2025-01-16 02:06:56,335 - INFO - step 18533, loss: 0.308896, best loss: 0.181952 2025-01-16 02:06:56,485 - INFO - step 18534, loss: 0.222873, best loss: 0.181952 2025-01-16 02:06:56,635 - INFO - step 18535, loss: 0.241973, best loss: 0.181952 2025-01-16 02:06:56,785 - INFO - step 18536, loss: 0.271623, best loss: 0.181952 2025-01-16 02:06:56,935 - INFO - step 18537, loss: 0.204287, best loss: 0.181952 2025-01-16 02:06:57,085 - INFO - step 18538, loss: 0.265290, best loss: 0.181952 2025-01-16 02:06:57,235 - INFO - step 18539, loss: 0.305985, best loss: 0.181952 2025-01-16 02:06:57,385 - INFO - step 18540, loss: 0.279281, best loss: 0.181952 2025-01-16 02:06:57,535 - INFO - step 18541, loss: 0.261855, best loss: 0.181952 2025-01-16 02:06:57,685 - INFO - step 18542, loss: 0.313697, best loss: 0.181952 2025-01-16 02:06:57,835 - INFO - step 18543, loss: 0.327605, best loss: 0.181952 2025-01-16 02:06:57,985 - 
INFO - step 18544, loss: 0.287585, best loss: 0.181952 2025-01-16 02:06:58,135 - INFO - step 18545, loss: 0.260480, best loss: 0.181952 2025-01-16 02:06:58,285 - INFO - step 18546, loss: 0.258679, best loss: 0.181952 2025-01-16 02:06:58,435 - INFO - step 18547, loss: 0.264362, best loss: 0.181952 2025-01-16 02:06:58,585 - INFO - step 18548, loss: 0.234858, best loss: 0.181952 2025-01-16 02:06:58,735 - INFO - step 18549, loss: 0.276041, best loss: 0.181952 2025-01-16 02:06:58,885 - INFO - step 18550, loss: 0.272053, best loss: 0.181952 2025-01-16 02:06:59,036 - INFO - step 18551, loss: 0.238303, best loss: 0.181952 2025-01-16 02:06:59,186 - INFO - step 18552, loss: 0.244500, best loss: 0.181952 2025-01-16 02:06:59,336 - INFO - step 18553, loss: 0.272467, best loss: 0.181952 2025-01-16 02:06:59,486 - INFO - step 18554, loss: 0.276131, best loss: 0.181952 2025-01-16 02:06:59,637 - INFO - step 18555, loss: 0.256415, best loss: 0.181952 2025-01-16 02:06:59,787 - INFO - step 18556, loss: 0.245231, best loss: 0.181952 2025-01-16 02:06:59,937 - INFO - step 18557, loss: 0.261164, best loss: 0.181952 2025-01-16 02:07:00,087 - INFO - step 18558, loss: 0.263603, best loss: 0.181952 2025-01-16 02:07:00,238 - INFO - step 18559, loss: 0.279125, best loss: 0.181952 2025-01-16 02:07:00,388 - INFO - step 18560, loss: 0.309065, best loss: 0.181952 2025-01-16 02:07:00,538 - INFO - step 18561, loss: 0.244515, best loss: 0.181952 2025-01-16 02:07:00,688 - INFO - step 18562, loss: 0.249406, best loss: 0.181952 2025-01-16 02:07:00,838 - INFO - step 18563, loss: 0.205631, best loss: 0.181952 2025-01-16 02:07:00,988 - INFO - step 18564, loss: 0.244642, best loss: 0.181952 2025-01-16 02:07:01,138 - INFO - step 18565, loss: 0.255899, best loss: 0.181952 2025-01-16 02:07:01,288 - INFO - step 18566, loss: 0.223505, best loss: 0.181952 2025-01-16 02:07:01,438 - INFO - step 18567, loss: 0.255031, best loss: 0.181952 2025-01-16 02:07:01,589 - INFO - step 18568, loss: 0.235039, best loss: 0.181952 
2025-01-16 02:07:01,739 - INFO - step 18569, loss: 0.274485, best loss: 0.181952 2025-01-16 02:07:01,889 - INFO - step 18570, loss: 0.235854, best loss: 0.181952 2025-01-16 02:07:02,039 - INFO - step 18571, loss: 0.255797, best loss: 0.181952 2025-01-16 02:07:02,189 - INFO - step 18572, loss: 0.208596, best loss: 0.181952 2025-01-16 02:07:02,339 - INFO - step 18573, loss: 0.251225, best loss: 0.181952 2025-01-16 02:07:02,489 - INFO - step 18574, loss: 0.246112, best loss: 0.181952 2025-01-16 02:07:02,640 - INFO - step 18575, loss: 0.262187, best loss: 0.181952 2025-01-16 02:07:02,790 - INFO - step 18576, loss: 0.233770, best loss: 0.181952 2025-01-16 02:07:02,940 - INFO - step 18577, loss: 0.247972, best loss: 0.181952 2025-01-16 02:07:03,090 - INFO - step 18578, loss: 0.226518, best loss: 0.181952 2025-01-16 02:07:03,240 - INFO - step 18579, loss: 0.245332, best loss: 0.181952 2025-01-16 02:07:03,390 - INFO - step 18580, loss: 0.242557, best loss: 0.181952 2025-01-16 02:07:03,540 - INFO - step 18581, loss: 0.230183, best loss: 0.181952 2025-01-16 02:07:03,690 - INFO - step 18582, loss: 0.260529, best loss: 0.181952 2025-01-16 02:07:03,841 - INFO - step 18583, loss: 0.205429, best loss: 0.181952 2025-01-16 02:07:03,991 - INFO - step 18584, loss: 0.263268, best loss: 0.181952 2025-01-16 02:07:04,141 - INFO - step 18585, loss: 0.225886, best loss: 0.181952 2025-01-16 02:07:04,291 - INFO - step 18586, loss: 0.260423, best loss: 0.181952 2025-01-16 02:07:04,441 - INFO - step 18587, loss: 0.223894, best loss: 0.181952 2025-01-16 02:07:04,591 - INFO - step 18588, loss: 0.219927, best loss: 0.181952 2025-01-16 02:07:04,741 - INFO - step 18589, loss: 0.269112, best loss: 0.181952 2025-01-16 02:07:04,891 - INFO - step 18590, loss: 0.215853, best loss: 0.181952 2025-01-16 02:07:05,041 - INFO - step 18591, loss: 0.234490, best loss: 0.181952 2025-01-16 02:07:05,192 - INFO - step 18592, loss: 0.254401, best loss: 0.181952 2025-01-16 02:07:05,342 - INFO - step 18593, loss: 
0.221478, best loss: 0.181952 2025-01-16 02:07:05,492 - INFO - step 18594, loss: 0.234576, best loss: 0.181952 2025-01-16 02:07:05,642 - INFO - step 18595, loss: 0.210441, best loss: 0.181952 2025-01-16 02:07:05,792 - INFO - step 18596, loss: 0.266265, best loss: 0.181952 2025-01-16 02:07:05,942 - INFO - step 18597, loss: 0.220105, best loss: 0.181952 2025-01-16 02:07:06,092 - INFO - step 18598, loss: 0.231902, best loss: 0.181952 2025-01-16 02:07:06,242 - INFO - step 18599, loss: 0.298157, best loss: 0.181952 2025-01-16 02:07:06,393 - INFO - step 18600, loss: 0.225365, best loss: 0.181952 2025-01-16 02:07:06,543 - INFO - step 18601, loss: 0.231037, best loss: 0.181952 2025-01-16 02:07:06,693 - INFO - step 18602, loss: 0.216471, best loss: 0.181952 2025-01-16 02:07:06,843 - INFO - step 18603, loss: 0.206938, best loss: 0.181952 2025-01-16 02:07:06,993 - INFO - step 18604, loss: 0.273241, best loss: 0.181952 2025-01-16 02:07:07,143 - INFO - step 18605, loss: 0.242201, best loss: 0.181952 2025-01-16 02:07:07,293 - INFO - step 18606, loss: 0.292055, best loss: 0.181952 2025-01-16 02:07:07,443 - INFO - step 18607, loss: 0.335022, best loss: 0.181952 2025-01-16 02:07:07,593 - INFO - step 18608, loss: 0.253073, best loss: 0.181952 2025-01-16 02:07:07,744 - INFO - step 18609, loss: 0.271022, best loss: 0.181952 2025-01-16 02:07:07,894 - INFO - step 18610, loss: 0.274295, best loss: 0.181952 2025-01-16 02:07:08,044 - INFO - step 18611, loss: 0.245063, best loss: 0.181952 2025-01-16 02:07:08,194 - INFO - step 18612, loss: 0.236432, best loss: 0.181952 2025-01-16 02:07:08,344 - INFO - step 18613, loss: 0.229381, best loss: 0.181952 2025-01-16 02:07:08,494 - INFO - step 18614, loss: 0.272689, best loss: 0.181952 2025-01-16 02:07:08,644 - INFO - step 18615, loss: 0.214841, best loss: 0.181952 2025-01-16 02:07:08,794 - INFO - step 18616, loss: 0.261538, best loss: 0.181952 2025-01-16 02:07:08,945 - INFO - step 18617, loss: 0.228986, best loss: 0.181952 2025-01-16 02:07:09,095 - 
INFO - step 18618, loss: 0.220184, best loss: 0.181952 2025-01-16 02:07:09,245 - INFO - step 18619, loss: 0.214113, best loss: 0.181952 2025-01-16 02:07:09,395 - INFO - step 18620, loss: 0.321567, best loss: 0.181952 2025-01-16 02:07:09,546 - INFO - step 18621, loss: 0.218375, best loss: 0.181952 2025-01-16 02:07:09,696 - INFO - step 18622, loss: 0.271127, best loss: 0.181952 2025-01-16 02:07:09,846 - INFO - step 18623, loss: 0.188707, best loss: 0.181952 2025-01-16 02:07:09,996 - INFO - step 18624, loss: 0.185799, best loss: 0.181952 2025-01-16 02:07:10,146 - INFO - step 18625, loss: 0.236862, best loss: 0.181952 2025-01-16 02:07:10,296 - INFO - step 18626, loss: 0.234557, best loss: 0.181952 2025-01-16 02:07:10,446 - INFO - step 18627, loss: 0.235899, best loss: 0.181952 2025-01-16 02:07:10,597 - INFO - step 18628, loss: 0.235635, best loss: 0.181952 2025-01-16 02:07:10,747 - INFO - step 18629, loss: 0.207051, best loss: 0.181952 2025-01-16 02:07:10,897 - INFO - step 18630, loss: 0.281367, best loss: 0.181952 2025-01-16 02:07:11,047 - INFO - step 18631, loss: 0.276227, best loss: 0.181952 2025-01-16 02:07:11,197 - INFO - step 18632, loss: 0.231475, best loss: 0.181952 2025-01-16 02:07:11,347 - INFO - step 18633, loss: 0.228669, best loss: 0.181952 2025-01-16 02:07:11,497 - INFO - step 18634, loss: 0.252659, best loss: 0.181952 2025-01-16 02:07:11,647 - INFO - step 18635, loss: 0.254997, best loss: 0.181952 2025-01-16 02:07:11,797 - INFO - step 18636, loss: 0.249301, best loss: 0.181952 2025-01-16 02:07:11,947 - INFO - step 18637, loss: 0.222179, best loss: 0.181952 2025-01-16 02:07:12,097 - INFO - step 18638, loss: 0.318208, best loss: 0.181952 2025-01-16 02:07:12,247 - INFO - step 18639, loss: 0.250657, best loss: 0.181952 2025-01-16 02:07:12,398 - INFO - step 18640, loss: 0.258081, best loss: 0.181952 2025-01-16 02:07:12,548 - INFO - step 18641, loss: 0.240569, best loss: 0.181952 2025-01-16 02:07:12,698 - INFO - step 18642, loss: 0.296479, best loss: 0.181952 
2025-01-16 02:07:12,848 - INFO - step 18643, loss: 0.228748, best loss: 0.181952 2025-01-16 02:07:12,998 - INFO - step 18644, loss: 0.227951, best loss: 0.181952 2025-01-16 02:07:13,148 - INFO - step 18645, loss: 0.247470, best loss: 0.181952 2025-01-16 02:07:13,298 - INFO - step 18646, loss: 0.269919, best loss: 0.181952 2025-01-16 02:07:13,448 - INFO - step 18647, loss: 0.296356, best loss: 0.181952 2025-01-16 02:07:13,598 - INFO - step 18648, loss: 0.268025, best loss: 0.181952 2025-01-16 02:07:13,748 - INFO - step 18649, loss: 0.259791, best loss: 0.181952 2025-01-16 02:07:13,898 - INFO - step 18650, loss: 0.250702, best loss: 0.181952 2025-01-16 02:07:14,048 - INFO - step 18651, loss: 0.243039, best loss: 0.181952 2025-01-16 02:07:14,198 - INFO - step 18652, loss: 0.276397, best loss: 0.181952 2025-01-16 02:07:14,349 - INFO - step 18653, loss: 0.291549, best loss: 0.181952 2025-01-16 02:07:14,499 - INFO - step 18654, loss: 0.261100, best loss: 0.181952 2025-01-16 02:07:14,649 - INFO - step 18655, loss: 0.289508, best loss: 0.181952 2025-01-16 02:07:14,799 - INFO - step 18656, loss: 0.313597, best loss: 0.181952 2025-01-16 02:07:14,949 - INFO - step 18657, loss: 0.345115, best loss: 0.181952 2025-01-16 02:07:15,099 - INFO - step 18658, loss: 0.250401, best loss: 0.181952 2025-01-16 02:07:15,250 - INFO - step 18659, loss: 0.258887, best loss: 0.181952 2025-01-16 02:07:15,400 - INFO - step 18660, loss: 0.283906, best loss: 0.181952 2025-01-16 02:07:15,550 - INFO - step 18661, loss: 0.228975, best loss: 0.181952 2025-01-16 02:07:15,700 - INFO - step 18662, loss: 0.232229, best loss: 0.181952 2025-01-16 02:07:15,850 - INFO - step 18663, loss: 0.285423, best loss: 0.181952 2025-01-16 02:07:16,000 - INFO - step 18664, loss: 0.236203, best loss: 0.181952 2025-01-16 02:07:16,151 - INFO - step 18665, loss: 0.224955, best loss: 0.181952 2025-01-16 02:07:16,301 - INFO - step 18666, loss: 0.274760, best loss: 0.181952 2025-01-16 02:07:16,451 - INFO - step 18667, loss: 
0.251933, best loss: 0.181952 2025-01-16 02:07:16,601 - INFO - step 18668, loss: 0.303107, best loss: 0.181952 2025-01-16 02:07:16,751 - INFO - step 18669, loss: 0.274335, best loss: 0.181952 2025-01-16 02:07:16,901 - INFO - step 18670, loss: 0.227574, best loss: 0.181952 2025-01-16 02:07:17,052 - INFO - step 18671, loss: 0.270435, best loss: 0.181952 2025-01-16 02:07:17,201 - INFO - step 18672, loss: 0.258294, best loss: 0.181952 2025-01-16 02:07:17,351 - INFO - step 18673, loss: 0.245382, best loss: 0.181952 2025-01-16 02:07:17,502 - INFO - step 18674, loss: 0.245503, best loss: 0.181952 2025-01-16 02:07:17,651 - INFO - step 18675, loss: 0.284232, best loss: 0.181952 2025-01-16 02:07:17,801 - INFO - step 18676, loss: 0.222711, best loss: 0.181952 2025-01-16 02:07:17,951 - INFO - step 18677, loss: 0.278259, best loss: 0.181952 2025-01-16 02:07:18,102 - INFO - step 18678, loss: 0.269867, best loss: 0.181952 2025-01-16 02:07:18,252 - INFO - step 18679, loss: 0.247580, best loss: 0.181952 2025-01-16 02:07:18,402 - INFO - step 18680, loss: 0.234710, best loss: 0.181952 2025-01-16 02:07:18,552 - INFO - step 18681, loss: 0.219745, best loss: 0.181952 2025-01-16 02:07:18,702 - INFO - step 18682, loss: 0.264341, best loss: 0.181952 2025-01-16 02:07:18,853 - INFO - step 18683, loss: 0.292374, best loss: 0.181952 2025-01-16 02:07:19,003 - INFO - step 18684, loss: 0.224859, best loss: 0.181952 2025-01-16 02:07:19,153 - INFO - step 18685, loss: 0.260442, best loss: 0.181952 2025-01-16 02:07:19,303 - INFO - step 18686, loss: 0.240174, best loss: 0.181952 2025-01-16 02:07:19,453 - INFO - step 18687, loss: 0.256444, best loss: 0.181952 2025-01-16 02:07:19,604 - INFO - step 18688, loss: 0.246088, best loss: 0.181952 2025-01-16 02:07:19,754 - INFO - step 18689, loss: 0.214357, best loss: 0.181952 2025-01-16 02:07:19,904 - INFO - step 18690, loss: 0.211650, best loss: 0.181952 2025-01-16 02:07:20,054 - INFO - step 18691, loss: 0.225405, best loss: 0.181952 2025-01-16 02:07:20,204 - 
INFO - step 18692, loss: 0.208157, best loss: 0.181952 2025-01-16 02:07:20,354 - INFO - step 18693, loss: 0.260733, best loss: 0.181952 2025-01-16 02:07:20,504 - INFO - step 18694, loss: 0.234994, best loss: 0.181952 2025-01-16 02:07:20,654 - INFO - step 18695, loss: 0.223755, best loss: 0.181952 2025-01-16 02:07:20,804 - INFO - step 18696, loss: 0.236870, best loss: 0.181952 2025-01-16 02:07:20,954 - INFO - step 18697, loss: 0.232898, best loss: 0.181952 2025-01-16 02:07:21,104 - INFO - step 18698, loss: 0.231481, best loss: 0.181952 2025-01-16 02:07:21,254 - INFO - step 18699, loss: 0.268245, best loss: 0.181952 2025-01-16 02:07:21,404 - INFO - step 18700, loss: 0.280752, best loss: 0.181952 2025-01-16 02:07:21,554 - INFO - step 18701, loss: 0.255661, best loss: 0.181952 2025-01-16 02:07:21,704 - INFO - step 18702, loss: 0.257374, best loss: 0.181952 2025-01-16 02:07:21,854 - INFO - step 18703, loss: 0.270562, best loss: 0.181952 2025-01-16 02:07:22,004 - INFO - step 18704, loss: 0.311442, best loss: 0.181952 2025-01-16 02:07:22,154 - INFO - step 18705, loss: 0.209039, best loss: 0.181952 2025-01-16 02:07:22,304 - INFO - step 18706, loss: 0.225201, best loss: 0.181952 2025-01-16 02:07:22,454 - INFO - step 18707, loss: 0.222248, best loss: 0.181952 2025-01-16 02:07:22,604 - INFO - step 18708, loss: 0.242121, best loss: 0.181952 2025-01-16 02:07:22,754 - INFO - step 18709, loss: 0.289326, best loss: 0.181952 2025-01-16 02:07:22,904 - INFO - step 18710, loss: 0.261193, best loss: 0.181952 2025-01-16 02:07:23,053 - INFO - step 18711, loss: 0.213524, best loss: 0.181952 2025-01-16 02:07:23,203 - INFO - step 18712, loss: 0.200017, best loss: 0.181952 2025-01-16 02:07:23,354 - INFO - step 18713, loss: 0.206989, best loss: 0.181952 2025-01-16 02:07:23,504 - INFO - step 18714, loss: 0.218515, best loss: 0.181952 2025-01-16 02:07:23,654 - INFO - step 18715, loss: 0.233082, best loss: 0.181952 2025-01-16 02:07:23,804 - INFO - step 18716, loss: 0.244120, best loss: 0.181952 
2025-01-16 02:07:23,954 - INFO - step 18717, loss: 0.260960, best loss: 0.181952 2025-01-16 02:07:24,105 - INFO - step 18718, loss: 0.245330, best loss: 0.181952 2025-01-16 02:07:24,255 - INFO - step 18719, loss: 0.265331, best loss: 0.181952 2025-01-16 02:07:24,405 - INFO - step 18720, loss: 0.288728, best loss: 0.181952 2025-01-16 02:07:24,556 - INFO - step 18721, loss: 0.338581, best loss: 0.181952 2025-01-16 02:07:24,706 - INFO - step 18722, loss: 0.232333, best loss: 0.181952 2025-01-16 02:07:24,856 - INFO - step 18723, loss: 0.189812, best loss: 0.181952 2025-01-16 02:07:25,006 - INFO - step 18724, loss: 0.243139, best loss: 0.181952 2025-01-16 02:07:25,156 - INFO - step 18725, loss: 0.189511, best loss: 0.181952 2025-01-16 02:07:25,306 - INFO - step 18726, loss: 0.260705, best loss: 0.181952 2025-01-16 02:07:25,456 - INFO - step 18727, loss: 0.254489, best loss: 0.181952 2025-01-16 02:07:25,606 - INFO - step 18728, loss: 0.228657, best loss: 0.181952 2025-01-16 02:07:25,757 - INFO - step 18729, loss: 0.240485, best loss: 0.181952 2025-01-16 02:07:25,907 - INFO - step 18730, loss: 0.231621, best loss: 0.181952 2025-01-16 02:07:26,057 - INFO - step 18731, loss: 0.259328, best loss: 0.181952 2025-01-16 02:07:26,207 - INFO - step 18732, loss: 0.221638, best loss: 0.181952 2025-01-16 02:07:26,357 - INFO - step 18733, loss: 0.274908, best loss: 0.181952 2025-01-16 02:07:26,507 - INFO - step 18734, loss: 0.205471, best loss: 0.181952 2025-01-16 02:07:26,657 - INFO - step 18735, loss: 0.200348, best loss: 0.181952 2025-01-16 02:07:26,807 - INFO - step 18736, loss: 0.232326, best loss: 0.181952 2025-01-16 02:07:26,957 - INFO - step 18737, loss: 0.273556, best loss: 0.181952 2025-01-16 02:07:27,108 - INFO - step 18738, loss: 0.331953, best loss: 0.181952 2025-01-16 02:07:27,258 - INFO - step 18739, loss: 0.290197, best loss: 0.181952 2025-01-16 02:07:27,408 - INFO - step 18740, loss: 0.235303, best loss: 0.181952 2025-01-16 02:07:27,558 - INFO - step 18741, loss: 
0.275597, best loss: 0.181952 2025-01-16 02:07:27,708 - INFO - step 18742, loss: 0.262167, best loss: 0.181952 2025-01-16 02:07:27,858 - INFO - step 18743, loss: 0.262504, best loss: 0.181952 2025-01-16 02:07:28,008 - INFO - step 18744, loss: 0.280363, best loss: 0.181952 2025-01-16 02:07:28,158 - INFO - step 18745, loss: 0.206528, best loss: 0.181952 2025-01-16 02:07:28,308 - INFO - step 18746, loss: 0.225499, best loss: 0.181952 2025-01-16 02:07:28,458 - INFO - step 18747, loss: 0.258773, best loss: 0.181952 2025-01-16 02:07:28,608 - INFO - step 18748, loss: 0.241548, best loss: 0.181952 2025-01-16 02:07:28,758 - INFO - step 18749, loss: 0.254930, best loss: 0.181952 2025-01-16 02:07:28,908 - INFO - step 18750, loss: 0.275699, best loss: 0.181952 2025-01-16 02:07:29,058 - INFO - step 18751, loss: 0.228123, best loss: 0.181952 2025-01-16 02:07:29,208 - INFO - step 18752, loss: 0.207509, best loss: 0.181952 2025-01-16 02:07:29,358 - INFO - step 18753, loss: 0.224268, best loss: 0.181952 2025-01-16 02:07:29,508 - INFO - step 18754, loss: 0.224181, best loss: 0.181952 2025-01-16 02:07:29,659 - INFO - step 18755, loss: 0.204003, best loss: 0.181952 2025-01-16 02:07:29,808 - INFO - step 18756, loss: 0.232552, best loss: 0.181952 2025-01-16 02:07:29,958 - INFO - step 18757, loss: 0.272142, best loss: 0.181952 2025-01-16 02:07:30,108 - INFO - step 18758, loss: 0.217557, best loss: 0.181952 2025-01-16 02:07:30,258 - INFO - step 18759, loss: 0.227254, best loss: 0.181952 2025-01-16 02:07:30,408 - INFO - step 18760, loss: 0.272014, best loss: 0.181952 2025-01-16 02:07:30,558 - INFO - step 18761, loss: 0.224670, best loss: 0.181952 2025-01-16 02:07:30,709 - INFO - step 18762, loss: 0.225867, best loss: 0.181952 2025-01-16 02:07:30,859 - INFO - step 18763, loss: 0.290151, best loss: 0.181952 2025-01-16 02:07:31,009 - INFO - step 18764, loss: 0.254362, best loss: 0.181952 2025-01-16 02:07:31,159 - INFO - step 18765, loss: 0.223302, best loss: 0.181952 2025-01-16 02:07:31,309 - 
INFO - step 18766, loss: 0.214340, best loss: 0.181952 2025-01-16 02:07:31,460 - INFO - step 18767, loss: 0.213830, best loss: 0.181952 2025-01-16 02:07:31,610 - INFO - step 18768, loss: 0.221483, best loss: 0.181952 2025-01-16 02:07:31,760 - INFO - step 18769, loss: 0.241495, best loss: 0.181952 2025-01-16 02:07:31,910 - INFO - step 18770, loss: 0.265466, best loss: 0.181952 2025-01-16 02:07:32,060 - INFO - step 18771, loss: 0.225073, best loss: 0.181952 2025-01-16 02:07:32,210 - INFO - step 18772, loss: 0.240797, best loss: 0.181952 2025-01-16 02:07:32,360 - INFO - step 18773, loss: 0.225564, best loss: 0.181952 2025-01-16 02:07:32,510 - INFO - step 18774, loss: 0.265485, best loss: 0.181952 2025-01-16 02:07:32,660 - INFO - step 18775, loss: 0.219933, best loss: 0.181952 2025-01-16 02:07:32,810 - INFO - step 18776, loss: 0.276518, best loss: 0.181952 2025-01-16 02:07:32,960 - INFO - step 18777, loss: 0.286098, best loss: 0.181952 2025-01-16 02:07:33,110 - INFO - step 18778, loss: 0.217387, best loss: 0.181952 2025-01-16 02:07:33,260 - INFO - step 18779, loss: 0.225188, best loss: 0.181952 2025-01-16 02:07:33,411 - INFO - step 18780, loss: 0.234128, best loss: 0.181952 2025-01-16 02:07:33,561 - INFO - step 18781, loss: 0.255895, best loss: 0.181952 2025-01-16 02:07:33,711 - INFO - step 18782, loss: 0.278048, best loss: 0.181952 2025-01-16 02:07:33,861 - INFO - step 18783, loss: 0.265378, best loss: 0.181952 2025-01-16 02:07:34,011 - INFO - step 18784, loss: 0.255461, best loss: 0.181952 2025-01-16 02:07:34,161 - INFO - step 18785, loss: 0.199259, best loss: 0.181952 2025-01-16 02:07:34,311 - INFO - step 18786, loss: 0.264009, best loss: 0.181952 2025-01-16 02:07:34,461 - INFO - step 18787, loss: 0.268798, best loss: 0.181952 2025-01-16 02:07:34,611 - INFO - step 18788, loss: 0.251517, best loss: 0.181952 2025-01-16 02:07:34,761 - INFO - step 18789, loss: 0.234763, best loss: 0.181952 2025-01-16 02:07:34,911 - INFO - step 18790, loss: 0.216040, best loss: 0.181952 
2025-01-16 02:07:35,061 - INFO - step 18791, loss: 0.209824, best loss: 0.181952 2025-01-16 02:07:35,212 - INFO - step 18792, loss: 0.299331, best loss: 0.181952 2025-01-16 02:07:35,362 - INFO - step 18793, loss: 0.258698, best loss: 0.181952 2025-01-16 02:07:35,512 - INFO - step 18794, loss: 0.301592, best loss: 0.181952 2025-01-16 02:07:35,662 - INFO - step 18795, loss: 0.251117, best loss: 0.181952 2025-01-16 02:07:39,164 - INFO - step 18796, loss: 0.164299, best loss: 0.164299 2025-01-16 02:07:39,326 - INFO - step 18797, loss: 0.219603, best loss: 0.164299 2025-01-16 02:07:39,478 - INFO - step 18798, loss: 0.273377, best loss: 0.164299 2025-01-16 02:07:39,629 - INFO - step 18799, loss: 0.227513, best loss: 0.164299 2025-01-16 02:07:39,779 - INFO - step 18800, loss: 0.264123, best loss: 0.164299 2025-01-16 02:07:39,929 - INFO - step 18801, loss: 0.252832, best loss: 0.164299 2025-01-16 02:07:40,079 - INFO - step 18802, loss: 0.242559, best loss: 0.164299 2025-01-16 02:07:40,229 - INFO - step 18803, loss: 0.295402, best loss: 0.164299 2025-01-16 02:07:40,380 - INFO - step 18804, loss: 0.239448, best loss: 0.164299 2025-01-16 02:07:40,530 - INFO - step 18805, loss: 0.237458, best loss: 0.164299 2025-01-16 02:07:40,680 - INFO - step 18806, loss: 0.332027, best loss: 0.164299 2025-01-16 02:07:40,830 - INFO - step 18807, loss: 0.283732, best loss: 0.164299 2025-01-16 02:07:40,981 - INFO - step 18808, loss: 0.313763, best loss: 0.164299 2025-01-16 02:07:41,131 - INFO - step 18809, loss: 0.188499, best loss: 0.164299 2025-01-16 02:07:41,281 - INFO - step 18810, loss: 0.234531, best loss: 0.164299 2025-01-16 02:07:41,431 - INFO - step 18811, loss: 0.266882, best loss: 0.164299 2025-01-16 02:07:41,581 - INFO - step 18812, loss: 0.272716, best loss: 0.164299 2025-01-16 02:07:41,731 - INFO - step 18813, loss: 0.249310, best loss: 0.164299 2025-01-16 02:07:41,881 - INFO - step 18814, loss: 0.252964, best loss: 0.164299 2025-01-16 02:07:42,032 - INFO - step 18815, loss: 
0.342390, best loss: 0.164299 2025-01-16 02:07:42,182 - INFO - step 18816, loss: 0.243720, best loss: 0.164299 2025-01-16 02:07:42,332 - INFO - step 18817, loss: 0.235368, best loss: 0.164299 2025-01-16 02:07:42,482 - INFO - step 18818, loss: 0.265554, best loss: 0.164299 2025-01-16 02:07:42,632 - INFO - step 18819, loss: 0.241601, best loss: 0.164299 2025-01-16 02:07:42,782 - INFO - step 18820, loss: 0.243490, best loss: 0.164299 2025-01-16 02:07:42,932 - INFO - step 18821, loss: 0.259541, best loss: 0.164299 2025-01-16 02:07:43,082 - INFO - step 18822, loss: 0.180038, best loss: 0.164299 2025-01-16 02:07:43,232 - INFO - step 18823, loss: 0.250580, best loss: 0.164299 2025-01-16 02:07:43,382 - INFO - step 18824, loss: 0.313252, best loss: 0.164299 2025-01-16 02:07:43,532 - INFO - step 18825, loss: 0.269068, best loss: 0.164299 2025-01-16 02:07:43,682 - INFO - step 18826, loss: 0.320659, best loss: 0.164299 2025-01-16 02:07:43,832 - INFO - step 18827, loss: 0.177543, best loss: 0.164299 2025-01-16 02:07:43,982 - INFO - step 18828, loss: 0.257156, best loss: 0.164299 2025-01-16 02:07:44,132 - INFO - step 18829, loss: 0.236261, best loss: 0.164299 2025-01-16 02:07:44,282 - INFO - step 18830, loss: 0.244171, best loss: 0.164299 2025-01-16 02:07:44,432 - INFO - step 18831, loss: 0.271793, best loss: 0.164299 2025-01-16 02:07:44,582 - INFO - step 18832, loss: 0.197093, best loss: 0.164299 2025-01-16 02:07:44,732 - INFO - step 18833, loss: 0.268688, best loss: 0.164299 2025-01-16 02:07:44,882 - INFO - step 18834, loss: 0.229351, best loss: 0.164299 2025-01-16 02:07:45,032 - INFO - step 18835, loss: 0.276674, best loss: 0.164299 2025-01-16 02:07:45,182 - INFO - step 18836, loss: 0.243672, best loss: 0.164299 2025-01-16 02:07:45,332 - INFO - step 18837, loss: 0.195466, best loss: 0.164299 2025-01-16 02:07:45,482 - INFO - step 18838, loss: 0.218783, best loss: 0.164299 2025-01-16 02:07:45,632 - INFO - step 18839, loss: 0.253605, best loss: 0.164299 2025-01-16 02:07:45,782 - 
INFO - step 18840, loss: 0.237793, best loss: 0.164299 2025-01-16 02:07:45,932 - INFO - step 18841, loss: 0.239066, best loss: 0.164299 2025-01-16 02:07:46,082 - INFO - step 18842, loss: 0.256744, best loss: 0.164299 2025-01-16 02:07:46,232 - INFO - step 18843, loss: 0.267986, best loss: 0.164299 2025-01-16 02:07:46,382 - INFO - step 18844, loss: 0.236038, best loss: 0.164299 2025-01-16 02:07:46,532 - INFO - step 18845, loss: 0.229127, best loss: 0.164299 2025-01-16 02:07:46,682 - INFO - step 18846, loss: 0.238919, best loss: 0.164299 2025-01-16 02:07:46,832 - INFO - step 18847, loss: 0.247816, best loss: 0.164299 2025-01-16 02:07:46,982 - INFO - step 18848, loss: 0.246338, best loss: 0.164299 2025-01-16 02:07:47,132 - INFO - step 18849, loss: 0.173223, best loss: 0.164299 2025-01-16 02:07:47,282 - INFO - step 18850, loss: 0.239546, best loss: 0.164299 2025-01-16 02:07:47,432 - INFO - step 18851, loss: 0.239504, best loss: 0.164299 2025-01-16 02:07:47,582 - INFO - step 18852, loss: 0.248126, best loss: 0.164299 2025-01-16 02:07:47,732 - INFO - step 18853, loss: 0.172587, best loss: 0.164299 2025-01-16 02:07:47,882 - INFO - step 18854, loss: 0.227108, best loss: 0.164299 2025-01-16 02:07:48,032 - INFO - step 18855, loss: 0.228300, best loss: 0.164299 2025-01-16 02:07:48,182 - INFO - step 18856, loss: 0.287591, best loss: 0.164299 2025-01-16 02:07:48,332 - INFO - step 18857, loss: 0.248275, best loss: 0.164299 2025-01-16 02:07:48,482 - INFO - step 18858, loss: 0.236177, best loss: 0.164299 2025-01-16 02:07:48,632 - INFO - step 18859, loss: 0.234993, best loss: 0.164299 2025-01-16 02:07:48,782 - INFO - step 18860, loss: 0.259376, best loss: 0.164299 2025-01-16 02:07:48,932 - INFO - step 18861, loss: 0.193174, best loss: 0.164299 2025-01-16 02:07:49,082 - INFO - step 18862, loss: 0.223137, best loss: 0.164299 2025-01-16 02:07:49,232 - INFO - step 18863, loss: 0.257303, best loss: 0.164299 2025-01-16 02:07:49,382 - INFO - step 18864, loss: 0.258644, best loss: 0.164299 
2025-01-16 02:07:49,532 - INFO - step 18865, loss: 0.209111, best loss: 0.164299 2025-01-16 02:07:49,683 - INFO - step 18866, loss: 0.285824, best loss: 0.164299 2025-01-16 02:07:49,833 - INFO - step 18867, loss: 0.239086, best loss: 0.164299 2025-01-16 02:07:49,983 - INFO - step 18868, loss: 0.265946, best loss: 0.164299 2025-01-16 02:07:50,133 - INFO - step 18869, loss: 0.282275, best loss: 0.164299 2025-01-16 02:07:50,284 - INFO - step 18870, loss: 0.197368, best loss: 0.164299 2025-01-16 02:07:50,434 - INFO - step 18871, loss: 0.286788, best loss: 0.164299 2025-01-16 02:07:50,584 - INFO - step 18872, loss: 0.254866, best loss: 0.164299 2025-01-16 02:07:50,734 - INFO - step 18873, loss: 0.274036, best loss: 0.164299 2025-01-16 02:07:50,884 - INFO - step 18874, loss: 0.289645, best loss: 0.164299 2025-01-16 02:07:51,034 - INFO - step 18875, loss: 0.236060, best loss: 0.164299 2025-01-16 02:07:51,184 - INFO - step 18876, loss: 0.240380, best loss: 0.164299 2025-01-16 02:07:51,334 - INFO - step 18877, loss: 0.229119, best loss: 0.164299 2025-01-16 02:07:51,484 - INFO - step 18878, loss: 0.221504, best loss: 0.164299 2025-01-16 02:07:51,635 - INFO - step 18879, loss: 0.269632, best loss: 0.164299 2025-01-16 02:07:51,785 - INFO - step 18880, loss: 0.308406, best loss: 0.164299 2025-01-16 02:07:51,935 - INFO - step 18881, loss: 0.243166, best loss: 0.164299 2025-01-16 02:07:52,085 - INFO - step 18882, loss: 0.256628, best loss: 0.164299 2025-01-16 02:07:52,235 - INFO - step 18883, loss: 0.189636, best loss: 0.164299 2025-01-16 02:07:52,385 - INFO - step 18884, loss: 0.246457, best loss: 0.164299 2025-01-16 02:07:52,535 - INFO - step 18885, loss: 0.236877, best loss: 0.164299 2025-01-16 02:07:52,685 - INFO - step 18886, loss: 0.166199, best loss: 0.164299 2025-01-16 02:07:52,835 - INFO - step 18887, loss: 0.221425, best loss: 0.164299 2025-01-16 02:07:52,986 - INFO - step 18888, loss: 0.245579, best loss: 0.164299 2025-01-16 02:07:53,136 - INFO - step 18889, loss: 
0.248636, best loss: 0.164299 2025-01-16 02:07:53,286 - INFO - step 18890, loss: 0.270370, best loss: 0.164299 2025-01-16 02:07:53,436 - INFO - step 18891, loss: 0.226102, best loss: 0.164299 2025-01-16 02:07:53,586 - INFO - step 18892, loss: 0.237007, best loss: 0.164299 2025-01-16 02:07:53,736 - INFO - step 18893, loss: 0.221998, best loss: 0.164299 2025-01-16 02:07:53,886 - INFO - step 18894, loss: 0.248453, best loss: 0.164299 2025-01-16 02:07:54,036 - INFO - step 18895, loss: 0.224309, best loss: 0.164299 2025-01-16 02:07:54,187 - INFO - step 18896, loss: 0.193199, best loss: 0.164299 2025-01-16 02:07:54,337 - INFO - step 18897, loss: 0.225148, best loss: 0.164299 2025-01-16 02:07:54,487 - INFO - step 18898, loss: 0.217772, best loss: 0.164299 2025-01-16 02:07:54,638 - INFO - step 18899, loss: 0.232714, best loss: 0.164299 2025-01-16 02:07:54,788 - INFO - step 18900, loss: 0.191712, best loss: 0.164299 2025-01-16 02:07:54,938 - INFO - step 18901, loss: 0.279509, best loss: 0.164299 2025-01-16 02:07:55,088 - INFO - step 18902, loss: 0.213428, best loss: 0.164299 2025-01-16 02:07:55,238 - INFO - step 18903, loss: 0.262814, best loss: 0.164299 2025-01-16 02:07:55,389 - INFO - step 18904, loss: 0.237863, best loss: 0.164299 2025-01-16 02:07:55,539 - INFO - step 18905, loss: 0.265365, best loss: 0.164299 2025-01-16 02:07:55,689 - INFO - step 18906, loss: 0.255074, best loss: 0.164299 2025-01-16 02:07:55,839 - INFO - step 18907, loss: 0.241792, best loss: 0.164299 2025-01-16 02:07:55,989 - INFO - step 18908, loss: 0.229194, best loss: 0.164299 2025-01-16 02:07:56,140 - INFO - step 18909, loss: 0.258657, best loss: 0.164299 2025-01-16 02:07:56,290 - INFO - step 18910, loss: 0.258973, best loss: 0.164299 2025-01-16 02:07:56,440 - INFO - step 18911, loss: 0.229119, best loss: 0.164299 2025-01-16 02:07:56,590 - INFO - step 18912, loss: 0.204154, best loss: 0.164299 2025-01-16 02:07:56,740 - INFO - step 18913, loss: 0.233480, best loss: 0.164299 2025-01-16 02:07:56,890 - 
INFO - step 18914, loss: 0.255368, best loss: 0.164299 2025-01-16 02:07:57,040 - INFO - step 18915, loss: 0.216833, best loss: 0.164299 2025-01-16 02:07:57,190 - INFO - step 18916, loss: 0.264740, best loss: 0.164299 2025-01-16 02:07:57,341 - INFO - step 18917, loss: 0.237043, best loss: 0.164299 2025-01-16 02:07:57,491 - INFO - step 18918, loss: 0.189715, best loss: 0.164299 2025-01-16 02:07:57,641 - INFO - step 18919, loss: 0.247606, best loss: 0.164299 2025-01-16 02:07:57,791 - INFO - step 18920, loss: 0.229860, best loss: 0.164299 2025-01-16 02:07:57,941 - INFO - step 18921, loss: 0.260824, best loss: 0.164299 2025-01-16 02:07:58,091 - INFO - step 18922, loss: 0.240761, best loss: 0.164299 2025-01-16 02:07:58,241 - INFO - step 18923, loss: 0.190075, best loss: 0.164299 2025-01-16 02:07:58,391 - INFO - step 18924, loss: 0.246209, best loss: 0.164299 2025-01-16 02:07:58,541 - INFO - step 18925, loss: 0.208058, best loss: 0.164299 2025-01-16 02:07:58,691 - INFO - step 18926, loss: 0.220461, best loss: 0.164299 2025-01-16 02:07:58,841 - INFO - step 18927, loss: 0.214525, best loss: 0.164299 2025-01-16 02:07:58,991 - INFO - step 18928, loss: 0.198691, best loss: 0.164299 2025-01-16 02:07:59,142 - INFO - step 18929, loss: 0.249968, best loss: 0.164299 2025-01-16 02:07:59,291 - INFO - step 18930, loss: 0.248970, best loss: 0.164299 2025-01-16 02:07:59,442 - INFO - step 18931, loss: 0.196979, best loss: 0.164299 2025-01-16 02:07:59,592 - INFO - step 18932, loss: 0.215461, best loss: 0.164299 2025-01-16 02:07:59,742 - INFO - step 18933, loss: 0.234929, best loss: 0.164299 2025-01-16 02:07:59,892 - INFO - step 18934, loss: 0.198920, best loss: 0.164299 2025-01-16 02:08:00,042 - INFO - step 18935, loss: 0.212289, best loss: 0.164299 2025-01-16 02:08:00,192 - INFO - step 18936, loss: 0.228148, best loss: 0.164299 2025-01-16 02:08:00,342 - INFO - step 18937, loss: 0.266391, best loss: 0.164299 2025-01-16 02:08:00,492 - INFO - step 18938, loss: 0.213925, best loss: 0.164299 
2025-01-16 02:08:00,642 - INFO - step 18939, loss: 0.255783, best loss: 0.164299 2025-01-16 02:08:00,792 - INFO - step 18940, loss: 0.216140, best loss: 0.164299 2025-01-16 02:08:00,942 - INFO - step 18941, loss: 0.208693, best loss: 0.164299 2025-01-16 02:08:01,092 - INFO - step 18942, loss: 0.219112, best loss: 0.164299 2025-01-16 02:08:01,242 - INFO - step 18943, loss: 0.174964, best loss: 0.164299 2025-01-16 02:08:01,392 - INFO - step 18944, loss: 0.246500, best loss: 0.164299 2025-01-16 02:08:01,542 - INFO - step 18945, loss: 0.192875, best loss: 0.164299 2025-01-16 02:08:01,692 - INFO - step 18946, loss: 0.191318, best loss: 0.164299 2025-01-16 02:08:01,842 - INFO - step 18947, loss: 0.215452, best loss: 0.164299 2025-01-16 02:08:01,992 - INFO - step 18948, loss: 0.209772, best loss: 0.164299 2025-01-16 02:08:02,143 - INFO - step 18949, loss: 0.220187, best loss: 0.164299 2025-01-16 02:08:02,293 - INFO - step 18950, loss: 0.255975, best loss: 0.164299 2025-01-16 02:08:02,443 - INFO - step 18951, loss: 0.203170, best loss: 0.164299 2025-01-16 02:08:02,593 - INFO - step 18952, loss: 0.209280, best loss: 0.164299 2025-01-16 02:08:02,743 - INFO - step 18953, loss: 0.178882, best loss: 0.164299 2025-01-16 02:08:02,893 - INFO - step 18954, loss: 0.167122, best loss: 0.164299 2025-01-16 02:08:03,043 - INFO - step 18955, loss: 0.258313, best loss: 0.164299 2025-01-16 02:08:03,193 - INFO - step 18956, loss: 0.301398, best loss: 0.164299 2025-01-16 02:08:03,343 - INFO - step 18957, loss: 0.257450, best loss: 0.164299 2025-01-16 02:08:03,493 - INFO - step 18958, loss: 0.226937, best loss: 0.164299 2025-01-16 02:08:03,643 - INFO - step 18959, loss: 0.264575, best loss: 0.164299 2025-01-16 02:08:03,793 - INFO - step 18960, loss: 0.197991, best loss: 0.164299 2025-01-16 02:08:03,943 - INFO - step 18961, loss: 0.256392, best loss: 0.164299 2025-01-16 02:08:04,093 - INFO - step 18962, loss: 0.228656, best loss: 0.164299 2025-01-16 02:08:04,243 - INFO - step 18963, loss: 
0.221919, best loss: 0.164299 2025-01-16 02:08:04,393 - INFO - step 18964, loss: 0.225250, best loss: 0.164299 2025-01-16 02:08:04,543 - INFO - step 18965, loss: 0.225027, best loss: 0.164299 2025-01-16 02:08:04,693 - INFO - step 18966, loss: 0.254073, best loss: 0.164299 2025-01-16 02:08:04,843 - INFO - step 18967, loss: 0.194468, best loss: 0.164299 2025-01-16 02:08:04,994 - INFO - step 18968, loss: 0.231582, best loss: 0.164299 2025-01-16 02:08:05,144 - INFO - step 18969, loss: 0.220152, best loss: 0.164299 2025-01-16 02:08:05,294 - INFO - step 18970, loss: 0.271000, best loss: 0.164299 2025-01-16 02:08:05,444 - INFO - step 18971, loss: 0.243730, best loss: 0.164299 2025-01-16 02:08:05,594 - INFO - step 18972, loss: 0.230247, best loss: 0.164299 2025-01-16 02:08:05,744 - INFO - step 18973, loss: 0.261342, best loss: 0.164299 2025-01-16 02:08:05,894 - INFO - step 18974, loss: 0.251477, best loss: 0.164299 2025-01-16 02:08:06,044 - INFO - step 18975, loss: 0.207201, best loss: 0.164299 2025-01-16 02:08:06,194 - INFO - step 18976, loss: 0.246554, best loss: 0.164299 2025-01-16 02:08:06,344 - INFO - step 18977, loss: 0.276329, best loss: 0.164299 2025-01-16 02:08:06,495 - INFO - step 18978, loss: 0.241043, best loss: 0.164299 2025-01-16 02:08:06,645 - INFO - step 18979, loss: 0.212857, best loss: 0.164299 2025-01-16 02:08:06,795 - INFO - step 18980, loss: 0.209977, best loss: 0.164299 2025-01-16 02:08:06,945 - INFO - step 18981, loss: 0.209827, best loss: 0.164299 2025-01-16 02:08:07,095 - INFO - step 18982, loss: 0.228054, best loss: 0.164299 2025-01-16 02:08:07,245 - INFO - step 18983, loss: 0.235260, best loss: 0.164299 2025-01-16 02:08:07,395 - INFO - step 18984, loss: 0.203942, best loss: 0.164299 2025-01-16 02:08:07,545 - INFO - step 18985, loss: 0.205104, best loss: 0.164299 2025-01-16 02:08:07,696 - INFO - step 18986, loss: 0.266373, best loss: 0.164299 2025-01-16 02:08:07,845 - INFO - step 18987, loss: 0.266472, best loss: 0.164299 2025-01-16 02:08:07,995 - 
INFO - step 18988, loss: 0.202070, best loss: 0.164299 2025-01-16 02:08:08,146 - INFO - step 18989, loss: 0.254350, best loss: 0.164299 2025-01-16 02:08:08,296 - INFO - step 18990, loss: 0.224306, best loss: 0.164299 2025-01-16 02:08:08,446 - INFO - step 18991, loss: 0.248020, best loss: 0.164299 2025-01-16 02:08:08,596 - INFO - step 18992, loss: 0.243583, best loss: 0.164299 2025-01-16 02:08:08,746 - INFO - step 18993, loss: 0.279119, best loss: 0.164299 2025-01-16 02:08:08,896 - INFO - step 18994, loss: 0.215058, best loss: 0.164299 2025-01-16 02:08:09,046 - INFO - step 18995, loss: 0.187685, best loss: 0.164299 2025-01-16 02:08:09,196 - INFO - step 18996, loss: 0.241193, best loss: 0.164299 2025-01-16 02:08:09,346 - INFO - step 18997, loss: 0.222726, best loss: 0.164299 2025-01-16 02:08:09,496 - INFO - step 18998, loss: 0.205937, best loss: 0.164299 2025-01-16 02:08:09,647 - INFO - step 18999, loss: 0.226139, best loss: 0.164299 2025-01-16 02:08:09,797 - INFO - step 19000, loss: 0.233033, best loss: 0.164299 2025-01-16 02:08:09,947 - INFO - step 19001, loss: 0.265433, best loss: 0.164299 2025-01-16 02:08:10,097 - INFO - step 19002, loss: 0.252849, best loss: 0.164299 2025-01-16 02:08:10,247 - INFO - step 19003, loss: 0.255836, best loss: 0.164299 2025-01-16 02:08:10,397 - INFO - step 19004, loss: 0.279836, best loss: 0.164299 2025-01-16 02:08:10,547 - INFO - step 19005, loss: 0.250729, best loss: 0.164299 2025-01-16 02:08:10,697 - INFO - step 19006, loss: 0.232401, best loss: 0.164299 2025-01-16 02:08:10,847 - INFO - step 19007, loss: 0.209303, best loss: 0.164299 2025-01-16 02:08:10,997 - INFO - step 19008, loss: 0.215638, best loss: 0.164299 2025-01-16 02:08:11,147 - INFO - step 19009, loss: 0.224130, best loss: 0.164299 2025-01-16 02:08:11,298 - INFO - step 19010, loss: 0.190317, best loss: 0.164299 2025-01-16 02:08:11,448 - INFO - step 19011, loss: 0.261770, best loss: 0.164299 2025-01-16 02:08:11,598 - INFO - step 19012, loss: 0.248827, best loss: 0.164299 
2025-01-16 02:08:11,748 - INFO - step 19013, loss: 0.271076, best loss: 0.164299 2025-01-16 02:08:11,898 - INFO - step 19014, loss: 0.210685, best loss: 0.164299 2025-01-16 02:08:12,048 - INFO - step 19015, loss: 0.208626, best loss: 0.164299 2025-01-16 02:08:12,198 - INFO - step 19016, loss: 0.235171, best loss: 0.164299 2025-01-16 02:08:12,348 - INFO - step 19017, loss: 0.190152, best loss: 0.164299 2025-01-16 02:08:12,498 - INFO - step 19018, loss: 0.250523, best loss: 0.164299 2025-01-16 02:08:12,649 - INFO - step 19019, loss: 0.208717, best loss: 0.164299 2025-01-16 02:08:12,799 - INFO - step 19020, loss: 0.202312, best loss: 0.164299 2025-01-16 02:08:12,949 - INFO - step 19021, loss: 0.241934, best loss: 0.164299 2025-01-16 02:08:13,099 - INFO - step 19022, loss: 0.193422, best loss: 0.164299 2025-01-16 02:08:13,249 - INFO - step 19023, loss: 0.218486, best loss: 0.164299 2025-01-16 02:08:13,399 - INFO - step 19024, loss: 0.217026, best loss: 0.164299 2025-01-16 02:08:13,549 - INFO - step 19025, loss: 0.240788, best loss: 0.164299 2025-01-16 02:08:13,700 - INFO - step 19026, loss: 0.229370, best loss: 0.164299 2025-01-16 02:08:13,850 - INFO - step 19027, loss: 0.233067, best loss: 0.164299 2025-01-16 02:08:14,000 - INFO - step 19028, loss: 0.230596, best loss: 0.164299 2025-01-16 02:08:14,150 - INFO - step 19029, loss: 0.213576, best loss: 0.164299 2025-01-16 02:08:14,300 - INFO - step 19030, loss: 0.260026, best loss: 0.164299 2025-01-16 02:08:14,450 - INFO - step 19031, loss: 0.241020, best loss: 0.164299 2025-01-16 02:08:14,600 - INFO - step 19032, loss: 0.265033, best loss: 0.164299 2025-01-16 02:08:14,750 - INFO - step 19033, loss: 0.223033, best loss: 0.164299 2025-01-16 02:08:14,900 - INFO - step 19034, loss: 0.282009, best loss: 0.164299 2025-01-16 02:08:15,050 - INFO - step 19035, loss: 0.186615, best loss: 0.164299 2025-01-16 02:08:15,200 - INFO - step 19036, loss: 0.255440, best loss: 0.164299 2025-01-16 02:08:15,350 - INFO - step 19037, loss: 
0.234979, best loss: 0.164299 2025-01-16 02:08:15,500 - INFO - step 19038, loss: 0.220221, best loss: 0.164299 2025-01-16 02:08:15,651 - INFO - step 19039, loss: 0.280389, best loss: 0.164299 2025-01-16 02:08:15,801 - INFO - step 19040, loss: 0.270128, best loss: 0.164299 2025-01-16 02:08:15,951 - INFO - step 19041, loss: 0.206691, best loss: 0.164299 2025-01-16 02:08:16,101 - INFO - step 19042, loss: 0.232470, best loss: 0.164299 2025-01-16 02:08:16,251 - INFO - step 19043, loss: 0.273199, best loss: 0.164299 2025-01-16 02:08:16,401 - INFO - step 19044, loss: 0.249302, best loss: 0.164299 2025-01-16 02:08:16,551 - INFO - step 19045, loss: 0.234171, best loss: 0.164299 2025-01-16 02:08:16,701 - INFO - step 19046, loss: 0.225741, best loss: 0.164299 2025-01-16 02:08:16,851 - INFO - step 19047, loss: 0.231918, best loss: 0.164299 2025-01-16 02:08:17,001 - INFO - step 19048, loss: 0.235080, best loss: 0.164299 2025-01-16 02:08:17,151 - INFO - step 19049, loss: 0.270849, best loss: 0.164299 2025-01-16 02:08:17,302 - INFO - step 19050, loss: 0.261705, best loss: 0.164299 2025-01-16 02:08:17,452 - INFO - step 19051, loss: 0.321154, best loss: 0.164299 2025-01-16 02:08:17,602 - INFO - step 19052, loss: 0.238934, best loss: 0.164299 2025-01-16 02:08:17,752 - INFO - step 19053, loss: 0.221799, best loss: 0.164299 2025-01-16 02:08:17,902 - INFO - step 19054, loss: 0.268006, best loss: 0.164299 2025-01-16 02:08:18,052 - INFO - step 19055, loss: 0.211592, best loss: 0.164299 2025-01-16 02:08:18,203 - INFO - step 19056, loss: 0.281680, best loss: 0.164299 2025-01-16 02:08:18,353 - INFO - step 19057, loss: 0.223783, best loss: 0.164299 2025-01-16 02:08:18,503 - INFO - step 19058, loss: 0.199270, best loss: 0.164299 2025-01-16 02:08:18,653 - INFO - step 19059, loss: 0.192143, best loss: 0.164299 2025-01-16 02:08:18,803 - INFO - step 19060, loss: 0.250505, best loss: 0.164299 2025-01-16 02:08:18,953 - INFO - step 19061, loss: 0.211425, best loss: 0.164299 2025-01-16 02:08:19,102 - 
INFO - step 19062, loss: 0.223894, best loss: 0.164299 2025-01-16 02:08:19,253 - INFO - step 19063, loss: 0.244423, best loss: 0.164299 2025-01-16 02:08:19,403 - INFO - step 19064, loss: 0.242474, best loss: 0.164299 2025-01-16 02:08:19,553 - INFO - step 19065, loss: 0.214790, best loss: 0.164299 2025-01-16 02:08:19,703 - INFO - step 19066, loss: 0.220131, best loss: 0.164299 2025-01-16 02:08:19,853 - INFO - step 19067, loss: 0.237773, best loss: 0.164299 2025-01-16 02:08:20,003 - INFO - step 19068, loss: 0.269880, best loss: 0.164299 2025-01-16 02:08:20,153 - INFO - step 19069, loss: 0.282138, best loss: 0.164299 2025-01-16 02:08:20,303 - INFO - step 19070, loss: 0.288473, best loss: 0.164299 2025-01-16 02:08:20,454 - INFO - step 19071, loss: 0.243697, best loss: 0.164299 2025-01-16 02:08:20,604 - INFO - step 19072, loss: 0.251666, best loss: 0.164299 2025-01-16 02:08:20,754 - INFO - step 19073, loss: 0.219340, best loss: 0.164299 2025-01-16 02:08:20,904 - INFO - step 19074, loss: 0.204508, best loss: 0.164299 2025-01-16 02:08:21,054 - INFO - step 19075, loss: 0.261981, best loss: 0.164299 2025-01-16 02:08:21,204 - INFO - step 19076, loss: 0.242119, best loss: 0.164299 2025-01-16 02:08:21,354 - INFO - step 19077, loss: 0.231502, best loss: 0.164299 2025-01-16 02:08:21,505 - INFO - step 19078, loss: 0.237940, best loss: 0.164299 2025-01-16 02:08:21,655 - INFO - step 19079, loss: 0.197043, best loss: 0.164299 2025-01-16 02:08:21,805 - INFO - step 19080, loss: 0.237252, best loss: 0.164299 2025-01-16 02:08:21,955 - INFO - step 19081, loss: 0.231287, best loss: 0.164299 2025-01-16 02:08:22,105 - INFO - step 19082, loss: 0.254139, best loss: 0.164299 2025-01-16 02:08:22,255 - INFO - step 19083, loss: 0.216615, best loss: 0.164299 2025-01-16 02:08:22,405 - INFO - step 19084, loss: 0.186894, best loss: 0.164299 2025-01-16 02:08:22,555 - INFO - step 19085, loss: 0.243115, best loss: 0.164299 2025-01-16 02:08:22,705 - INFO - step 19086, loss: 0.190399, best loss: 0.164299 
2025-01-16 02:08:22,855 - INFO - step 19087, loss: 0.238422, best loss: 0.164299 2025-01-16 02:08:23,006 - INFO - step 19088, loss: 0.193238, best loss: 0.164299 2025-01-16 02:08:23,156 - INFO - step 19089, loss: 0.217242, best loss: 0.164299 2025-01-16 02:08:23,306 - INFO - step 19090, loss: 0.217380, best loss: 0.164299 2025-01-16 02:08:23,456 - INFO - step 19091, loss: 0.207264, best loss: 0.164299 2025-01-16 02:08:23,606 - INFO - step 19092, loss: 0.225106, best loss: 0.164299 2025-01-16 02:08:23,756 - INFO - step 19093, loss: 0.220341, best loss: 0.164299 2025-01-16 02:08:23,907 - INFO - step 19094, loss: 0.270664, best loss: 0.164299 2025-01-16 02:08:24,057 - INFO - step 19095, loss: 0.237977, best loss: 0.164299 2025-01-16 02:08:24,207 - INFO - step 19096, loss: 0.262159, best loss: 0.164299 2025-01-16 02:08:24,357 - INFO - step 19097, loss: 0.228216, best loss: 0.164299 2025-01-16 02:08:24,507 - INFO - step 19098, loss: 0.247731, best loss: 0.164299 2025-01-16 02:08:24,657 - INFO - step 19099, loss: 0.231909, best loss: 0.164299 2025-01-16 02:08:24,807 - INFO - step 19100, loss: 0.238838, best loss: 0.164299 2025-01-16 02:08:24,957 - INFO - step 19101, loss: 0.271301, best loss: 0.164299 2025-01-16 02:08:25,107 - INFO - step 19102, loss: 0.219867, best loss: 0.164299 2025-01-16 02:08:25,257 - INFO - step 19103, loss: 0.221342, best loss: 0.164299 2025-01-16 02:08:25,407 - INFO - step 19104, loss: 0.226033, best loss: 0.164299 2025-01-16 02:08:25,558 - INFO - step 19105, loss: 0.253561, best loss: 0.164299 2025-01-16 02:08:25,708 - INFO - step 19106, loss: 0.240500, best loss: 0.164299 2025-01-16 02:08:25,858 - INFO - step 19107, loss: 0.244201, best loss: 0.164299 2025-01-16 02:08:26,008 - INFO - step 19108, loss: 0.212546, best loss: 0.164299 2025-01-16 02:08:26,158 - INFO - step 19109, loss: 0.239388, best loss: 0.164299 2025-01-16 02:08:26,309 - INFO - step 19110, loss: 0.246767, best loss: 0.164299 2025-01-16 02:08:26,459 - INFO - step 19111, loss: 
0.293695, best loss: 0.164299 2025-01-16 02:08:26,609 - INFO - step 19112, loss: 0.266438, best loss: 0.164299 2025-01-16 02:08:26,760 - INFO - step 19113, loss: 0.225552, best loss: 0.164299 2025-01-16 02:08:26,910 - INFO - step 19114, loss: 0.381143, best loss: 0.164299 2025-01-16 02:08:27,060 - INFO - step 19115, loss: 0.203690, best loss: 0.164299 2025-01-16 02:08:27,210 - INFO - step 19116, loss: 0.195275, best loss: 0.164299 2025-01-16 02:08:27,361 - INFO - step 19117, loss: 0.261778, best loss: 0.164299 2025-01-16 02:08:27,511 - INFO - step 19118, loss: 0.190735, best loss: 0.164299 2025-01-16 02:08:27,661 - INFO - step 19119, loss: 0.251437, best loss: 0.164299 2025-01-16 02:08:27,811 - INFO - step 19120, loss: 0.246380, best loss: 0.164299 2025-01-16 02:08:27,962 - INFO - step 19121, loss: 0.237292, best loss: 0.164299 2025-01-16 02:08:28,112 - INFO - step 19122, loss: 0.303694, best loss: 0.164299 2025-01-16 02:08:28,262 - INFO - step 19123, loss: 0.255707, best loss: 0.164299 2025-01-16 02:08:28,413 - INFO - step 19124, loss: 0.244390, best loss: 0.164299 2025-01-16 02:08:28,563 - INFO - step 19125, loss: 0.205847, best loss: 0.164299 2025-01-16 02:08:32,118 - INFO - step 19126, loss: 0.142890, best loss: 0.142890 2025-01-16 02:08:32,280 - INFO - step 19127, loss: 0.236016, best loss: 0.142890 2025-01-16 02:08:32,435 - INFO - step 19128, loss: 0.294172, best loss: 0.142890 2025-01-16 02:08:32,586 - INFO - step 19129, loss: 0.269273, best loss: 0.142890 2025-01-16 02:08:32,736 - INFO - step 19130, loss: 0.259853, best loss: 0.142890 2025-01-16 02:08:32,886 - INFO - step 19131, loss: 0.236739, best loss: 0.142890 2025-01-16 02:08:33,036 - INFO - step 19132, loss: 0.195634, best loss: 0.142890 2025-01-16 02:08:33,187 - INFO - step 19133, loss: 0.234585, best loss: 0.142890 2025-01-16 02:08:33,337 - INFO - step 19134, loss: 0.235638, best loss: 0.142890 2025-01-16 02:08:33,487 - INFO - step 19135, loss: 0.229050, best loss: 0.142890 2025-01-16 02:08:33,638 - 
INFO - step 19136, loss: 0.262423, best loss: 0.142890 2025-01-16 02:08:33,788 - INFO - step 19137, loss: 0.404539, best loss: 0.142890 2025-01-16 02:08:33,938 - INFO - step 19138, loss: 0.250557, best loss: 0.142890 2025-01-16 02:08:34,088 - INFO - step 19139, loss: 0.239929, best loss: 0.142890 2025-01-16 02:08:34,238 - INFO - step 19140, loss: 0.220804, best loss: 0.142890 2025-01-16 02:08:34,388 - INFO - step 19141, loss: 0.264648, best loss: 0.142890 2025-01-16 02:08:34,538 - INFO - step 19142, loss: 0.281800, best loss: 0.142890 2025-01-16 02:08:34,689 - INFO - step 19143, loss: 0.240355, best loss: 0.142890 2025-01-16 02:08:34,840 - INFO - step 19144, loss: 0.258424, best loss: 0.142890 2025-01-16 02:08:34,990 - INFO - step 19145, loss: 0.332109, best loss: 0.142890 2025-01-16 02:08:35,140 - INFO - step 19146, loss: 0.192876, best loss: 0.142890 2025-01-16 02:08:35,291 - INFO - step 19147, loss: 0.234933, best loss: 0.142890 2025-01-16 02:08:35,441 - INFO - step 19148, loss: 0.276691, best loss: 0.142890 2025-01-16 02:08:35,591 - INFO - step 19149, loss: 0.273740, best loss: 0.142890 2025-01-16 02:08:35,741 - INFO - step 19150, loss: 0.227323, best loss: 0.142890 2025-01-16 02:08:35,892 - INFO - step 19151, loss: 0.256970, best loss: 0.142890 2025-01-16 02:08:36,042 - INFO - step 19152, loss: 0.209412, best loss: 0.142890 2025-01-16 02:08:36,192 - INFO - step 19153, loss: 0.202547, best loss: 0.142890 2025-01-16 02:08:36,342 - INFO - step 19154, loss: 0.218154, best loss: 0.142890 2025-01-16 02:08:36,492 - INFO - step 19155, loss: 0.263670, best loss: 0.142890 2025-01-16 02:08:36,642 - INFO - step 19156, loss: 0.288595, best loss: 0.142890 2025-01-16 02:08:36,792 - INFO - step 19157, loss: 0.190261, best loss: 0.142890 2025-01-16 02:08:36,942 - INFO - step 19158, loss: 0.242138, best loss: 0.142890 2025-01-16 02:08:37,093 - INFO - step 19159, loss: 0.228150, best loss: 0.142890 2025-01-16 02:08:37,243 - INFO - step 19160, loss: 0.225075, best loss: 0.142890 
2025-01-16 02:08:37,393 - INFO - step 19161, loss: 0.221955, best loss: 0.142890 2025-01-16 02:08:37,543 - INFO - step 19162, loss: 0.302613, best loss: 0.142890 2025-01-16 02:08:37,694 - INFO - step 19163, loss: 0.204768, best loss: 0.142890 2025-01-16 02:08:37,844 - INFO - step 19164, loss: 0.232646, best loss: 0.142890 2025-01-16 02:08:37,994 - INFO - step 19165, loss: 0.267699, best loss: 0.142890 2025-01-16 02:08:38,144 - INFO - step 19166, loss: 0.248430, best loss: 0.142890 2025-01-16 02:08:38,294 - INFO - step 19167, loss: 0.192649, best loss: 0.142890 2025-01-16 02:08:38,444 - INFO - step 19168, loss: 0.234668, best loss: 0.142890 2025-01-16 02:08:38,594 - INFO - step 19169, loss: 0.275915, best loss: 0.142890 2025-01-16 02:08:38,745 - INFO - step 19170, loss: 0.223244, best loss: 0.142890 2025-01-16 02:08:38,895 - INFO - step 19171, loss: 0.223084, best loss: 0.142890 2025-01-16 02:08:39,045 - INFO - step 19172, loss: 0.262017, best loss: 0.142890 2025-01-16 02:08:39,195 - INFO - step 19173, loss: 0.234374, best loss: 0.142890 2025-01-16 02:08:39,345 - INFO - step 19174, loss: 0.206569, best loss: 0.142890 2025-01-16 02:08:39,496 - INFO - step 19175, loss: 0.242189, best loss: 0.142890 2025-01-16 02:08:39,646 - INFO - step 19176, loss: 0.230264, best loss: 0.142890 2025-01-16 02:08:39,796 - INFO - step 19177, loss: 0.272052, best loss: 0.142890 2025-01-16 02:08:39,946 - INFO - step 19178, loss: 0.309646, best loss: 0.142890 2025-01-16 02:08:40,097 - INFO - step 19179, loss: 0.210658, best loss: 0.142890 2025-01-16 02:08:40,247 - INFO - step 19180, loss: 0.218925, best loss: 0.142890 2025-01-16 02:08:40,397 - INFO - step 19181, loss: 0.216535, best loss: 0.142890 2025-01-16 02:08:40,547 - INFO - step 19182, loss: 0.259713, best loss: 0.142890 2025-01-16 02:08:40,697 - INFO - step 19183, loss: 0.195761, best loss: 0.142890 2025-01-16 02:08:40,847 - INFO - step 19184, loss: 0.206947, best loss: 0.142890 2025-01-16 02:08:40,998 - INFO - step 19185, loss: 
0.182623, best loss: 0.142890 2025-01-16 02:08:41,148 - INFO - step 19186, loss: 0.265543, best loss: 0.142890 2025-01-16 02:08:41,298 - INFO - step 19187, loss: 0.209006, best loss: 0.142890 2025-01-16 02:08:41,448 - INFO - step 19188, loss: 0.220085, best loss: 0.142890 2025-01-16 02:08:41,598 - INFO - step 19189, loss: 0.178215, best loss: 0.142890 2025-01-16 02:08:41,749 - INFO - step 19190, loss: 0.203672, best loss: 0.142890 2025-01-16 02:08:41,899 - INFO - step 19191, loss: 0.195864, best loss: 0.142890 2025-01-16 02:08:42,049 - INFO - step 19192, loss: 0.183324, best loss: 0.142890 2025-01-16 02:08:42,199 - INFO - step 19193, loss: 0.194118, best loss: 0.142890 2025-01-16 02:08:42,349 - INFO - step 19194, loss: 0.206084, best loss: 0.142890 2025-01-16 02:08:42,500 - INFO - step 19195, loss: 0.154889, best loss: 0.142890 2025-01-16 02:08:42,650 - INFO - step 19196, loss: 0.250059, best loss: 0.142890 2025-01-16 02:08:42,800 - INFO - step 19197, loss: 0.217819, best loss: 0.142890 2025-01-16 02:08:42,950 - INFO - step 19198, loss: 0.224007, best loss: 0.142890 2025-01-16 02:08:43,100 - INFO - step 19199, loss: 0.271018, best loss: 0.142890 2025-01-16 02:08:43,251 - INFO - step 19200, loss: 0.219131, best loss: 0.142890 2025-01-16 02:08:43,401 - INFO - step 19201, loss: 0.224655, best loss: 0.142890 2025-01-16 02:08:43,551 - INFO - step 19202, loss: 0.233109, best loss: 0.142890 2025-01-16 02:08:43,701 - INFO - step 19203, loss: 0.230503, best loss: 0.142890 2025-01-16 02:08:43,852 - INFO - step 19204, loss: 0.236672, best loss: 0.142890 2025-01-16 02:08:44,002 - INFO - step 19205, loss: 0.243690, best loss: 0.142890 2025-01-16 02:08:44,152 - INFO - step 19206, loss: 0.211691, best loss: 0.142890 2025-01-16 02:08:44,302 - INFO - step 19207, loss: 0.232995, best loss: 0.142890 2025-01-16 02:08:44,452 - INFO - step 19208, loss: 0.193077, best loss: 0.142890 2025-01-16 02:08:44,602 - INFO - step 19209, loss: 0.240526, best loss: 0.142890 2025-01-16 02:08:44,753 - 
INFO - step 19210, loss: 0.228787, best loss: 0.142890 2025-01-16 02:08:44,903 - INFO - step 19211, loss: 0.239190, best loss: 0.142890 2025-01-16 02:08:45,053 - INFO - step 19212, loss: 0.237369, best loss: 0.142890 2025-01-16 02:08:45,203 - INFO - step 19213, loss: 0.211769, best loss: 0.142890 2025-01-16 02:08:45,353 - INFO - step 19214, loss: 0.264623, best loss: 0.142890 2025-01-16 02:08:45,503 - INFO - step 19215, loss: 0.233034, best loss: 0.142890 2025-01-16 02:08:45,653 - INFO - step 19216, loss: 0.177758, best loss: 0.142890 2025-01-16 02:08:45,803 - INFO - step 19217, loss: 0.262554, best loss: 0.142890 2025-01-16 02:08:45,953 - INFO - step 19218, loss: 0.220025, best loss: 0.142890 2025-01-16 02:08:46,103 - INFO - step 19219, loss: 0.197489, best loss: 0.142890 2025-01-16 02:08:46,254 - INFO - step 19220, loss: 0.230899, best loss: 0.142890 2025-01-16 02:08:46,404 - INFO - step 19221, loss: 0.201994, best loss: 0.142890 2025-01-16 02:08:46,554 - INFO - step 19222, loss: 0.220038, best loss: 0.142890 2025-01-16 02:08:46,704 - INFO - step 19223, loss: 0.200678, best loss: 0.142890 2025-01-16 02:08:46,854 - INFO - step 19224, loss: 0.245490, best loss: 0.142890 2025-01-16 02:08:47,004 - INFO - step 19225, loss: 0.268163, best loss: 0.142890 2025-01-16 02:08:47,154 - INFO - step 19226, loss: 0.182066, best loss: 0.142890 2025-01-16 02:08:47,305 - INFO - step 19227, loss: 0.220846, best loss: 0.142890 2025-01-16 02:08:47,455 - INFO - step 19228, loss: 0.204223, best loss: 0.142890 2025-01-16 02:08:47,605 - INFO - step 19229, loss: 0.248827, best loss: 0.142890 2025-01-16 02:08:47,755 - INFO - step 19230, loss: 0.216790, best loss: 0.142890 2025-01-16 02:08:47,905 - INFO - step 19231, loss: 0.282258, best loss: 0.142890 2025-01-16 02:08:48,056 - INFO - step 19232, loss: 0.204899, best loss: 0.142890 2025-01-16 02:08:48,206 - INFO - step 19233, loss: 0.188857, best loss: 0.142890 2025-01-16 02:08:48,356 - INFO - step 19234, loss: 0.202619, best loss: 0.142890 
2025-01-16 02:08:48,506 - INFO - step 19235, loss: 0.258000, best loss: 0.142890 2025-01-16 02:08:48,656 - INFO - step 19236, loss: 0.253756, best loss: 0.142890 2025-01-16 02:08:48,807 - INFO - step 19237, loss: 0.242161, best loss: 0.142890 2025-01-16 02:08:48,957 - INFO - step 19238, loss: 0.199702, best loss: 0.142890 2025-01-16 02:08:49,107 - INFO - step 19239, loss: 0.233210, best loss: 0.142890 2025-01-16 02:08:49,258 - INFO - step 19240, loss: 0.265292, best loss: 0.142890 2025-01-16 02:08:49,408 - INFO - step 19241, loss: 0.259216, best loss: 0.142890 2025-01-16 02:08:49,558 - INFO - step 19242, loss: 0.237786, best loss: 0.142890 2025-01-16 02:08:49,709 - INFO - step 19243, loss: 0.271948, best loss: 0.142890 2025-01-16 02:08:49,859 - INFO - step 19244, loss: 0.227048, best loss: 0.142890 2025-01-16 02:08:50,009 - INFO - step 19245, loss: 0.200750, best loss: 0.142890 2025-01-16 02:08:50,159 - INFO - step 19246, loss: 0.283971, best loss: 0.142890 2025-01-16 02:08:50,309 - INFO - step 19247, loss: 0.213565, best loss: 0.142890 2025-01-16 02:08:50,459 - INFO - step 19248, loss: 0.213818, best loss: 0.142890 2025-01-16 02:08:50,609 - INFO - step 19249, loss: 0.216595, best loss: 0.142890 2025-01-16 02:08:50,759 - INFO - step 19250, loss: 0.245611, best loss: 0.142890 2025-01-16 02:08:50,909 - INFO - step 19251, loss: 0.249936, best loss: 0.142890 2025-01-16 02:08:51,060 - INFO - step 19252, loss: 0.232873, best loss: 0.142890 2025-01-16 02:08:51,210 - INFO - step 19253, loss: 0.171671, best loss: 0.142890 2025-01-16 02:08:51,360 - INFO - step 19254, loss: 0.243846, best loss: 0.142890 2025-01-16 02:08:51,510 - INFO - step 19255, loss: 0.239834, best loss: 0.142890 2025-01-16 02:08:51,660 - INFO - step 19256, loss: 0.226568, best loss: 0.142890 2025-01-16 02:08:51,810 - INFO - step 19257, loss: 0.210831, best loss: 0.142890 2025-01-16 02:08:51,961 - INFO - step 19258, loss: 0.195269, best loss: 0.142890 2025-01-16 02:08:52,111 - INFO - step 19259, loss: 
0.225000, best loss: 0.142890 2025-01-16 02:08:52,261 - INFO - step 19260, loss: 0.216700, best loss: 0.142890 2025-01-16 02:08:52,412 - INFO - step 19261, loss: 0.237302, best loss: 0.142890 2025-01-16 02:08:52,562 - INFO - step 19262, loss: 0.232820, best loss: 0.142890 2025-01-16 02:08:52,712 - INFO - step 19263, loss: 0.216688, best loss: 0.142890 2025-01-16 02:08:52,862 - INFO - step 19264, loss: 0.198552, best loss: 0.142890 2025-01-16 02:08:53,012 - INFO - step 19265, loss: 0.233448, best loss: 0.142890 2025-01-16 02:08:53,162 - INFO - step 19266, loss: 0.203488, best loss: 0.142890 2025-01-16 02:08:53,312 - INFO - step 19267, loss: 0.249237, best loss: 0.142890 2025-01-16 02:08:53,462 - INFO - step 19268, loss: 0.188156, best loss: 0.142890 2025-01-16 02:08:53,612 - INFO - step 19269, loss: 0.245102, best loss: 0.142890 2025-01-16 02:08:53,762 - INFO - step 19270, loss: 0.249517, best loss: 0.142890 2025-01-16 02:08:53,912 - INFO - step 19271, loss: 0.208259, best loss: 0.142890 2025-01-16 02:08:54,062 - INFO - step 19272, loss: 0.238687, best loss: 0.142890 2025-01-16 02:08:54,212 - INFO - step 19273, loss: 0.203144, best loss: 0.142890 2025-01-16 02:08:54,362 - INFO - step 19274, loss: 0.262234, best loss: 0.142890 2025-01-16 02:08:54,512 - INFO - step 19275, loss: 0.188069, best loss: 0.142890 2025-01-16 02:08:54,662 - INFO - step 19276, loss: 0.159590, best loss: 0.142890 2025-01-16 02:08:54,812 - INFO - step 19277, loss: 0.235116, best loss: 0.142890 2025-01-16 02:08:54,962 - INFO - step 19278, loss: 0.206797, best loss: 0.142890 2025-01-16 02:08:55,112 - INFO - step 19279, loss: 0.220103, best loss: 0.142890 2025-01-16 02:08:55,262 - INFO - step 19280, loss: 0.214361, best loss: 0.142890 2025-01-16 02:08:55,413 - INFO - step 19281, loss: 0.229033, best loss: 0.142890 2025-01-16 02:08:55,562 - INFO - step 19282, loss: 0.206499, best loss: 0.142890 2025-01-16 02:08:55,712 - INFO - step 19283, loss: 0.190066, best loss: 0.142890 2025-01-16 02:08:55,862 - 
INFO - step 19284, loss: 0.144819, best loss: 0.142890 2025-01-16 02:08:56,013 - INFO - step 19285, loss: 0.223488, best loss: 0.142890 2025-01-16 02:08:56,162 - INFO - step 19286, loss: 0.220283, best loss: 0.142890 2025-01-16 02:08:56,312 - INFO - step 19287, loss: 0.232152, best loss: 0.142890 2025-01-16 02:08:56,463 - INFO - step 19288, loss: 0.224704, best loss: 0.142890 2025-01-16 02:08:56,613 - INFO - step 19289, loss: 0.184777, best loss: 0.142890 2025-01-16 02:08:56,762 - INFO - step 19290, loss: 0.190426, best loss: 0.142890 2025-01-16 02:08:56,912 - INFO - step 19291, loss: 0.188584, best loss: 0.142890 2025-01-16 02:08:57,063 - INFO - step 19292, loss: 0.178962, best loss: 0.142890 2025-01-16 02:08:57,213 - INFO - step 19293, loss: 0.179634, best loss: 0.142890 2025-01-16 02:08:57,363 - INFO - step 19294, loss: 0.235934, best loss: 0.142890 2025-01-16 02:08:57,513 - INFO - step 19295, loss: 0.199431, best loss: 0.142890 2025-01-16 02:08:57,662 - INFO - step 19296, loss: 0.277615, best loss: 0.142890 2025-01-16 02:08:57,812 - INFO - step 19297, loss: 0.211044, best loss: 0.142890 2025-01-16 02:08:57,962 - INFO - step 19298, loss: 0.238463, best loss: 0.142890 2025-01-16 02:08:58,112 - INFO - step 19299, loss: 0.239172, best loss: 0.142890 2025-01-16 02:08:58,262 - INFO - step 19300, loss: 0.187543, best loss: 0.142890 2025-01-16 02:08:58,412 - INFO - step 19301, loss: 0.188181, best loss: 0.142890 2025-01-16 02:08:58,562 - INFO - step 19302, loss: 0.216018, best loss: 0.142890 2025-01-16 02:08:58,712 - INFO - step 19303, loss: 0.205340, best loss: 0.142890 2025-01-16 02:08:58,862 - INFO - step 19304, loss: 0.187459, best loss: 0.142890 2025-01-16 02:08:59,012 - INFO - step 19305, loss: 0.194098, best loss: 0.142890 2025-01-16 02:08:59,162 - INFO - step 19306, loss: 0.238103, best loss: 0.142890 2025-01-16 02:08:59,312 - INFO - step 19307, loss: 0.239041, best loss: 0.142890 2025-01-16 02:08:59,462 - INFO - step 19308, loss: 0.227894, best loss: 0.142890 
2025-01-16 02:08:59,613 - INFO - step 19309, loss: 0.233049, best loss: 0.142890 2025-01-16 02:08:59,763 - INFO - step 19310, loss: 0.241028, best loss: 0.142890 2025-01-16 02:08:59,913 - INFO - step 19311, loss: 0.219499, best loss: 0.142890 2025-01-16 02:09:00,063 - INFO - step 19312, loss: 0.262274, best loss: 0.142890 2025-01-16 02:09:00,213 - INFO - step 19313, loss: 0.277743, best loss: 0.142890 2025-01-16 02:09:00,363 - INFO - step 19314, loss: 0.235596, best loss: 0.142890 2025-01-16 02:09:00,513 - INFO - step 19315, loss: 0.195950, best loss: 0.142890 2025-01-16 02:09:00,663 - INFO - step 19316, loss: 0.233604, best loss: 0.142890 2025-01-16 02:09:00,814 - INFO - step 19317, loss: 0.245119, best loss: 0.142890 2025-01-16 02:09:00,964 - INFO - step 19318, loss: 0.245257, best loss: 0.142890 2025-01-16 02:09:01,114 - INFO - step 19319, loss: 0.228886, best loss: 0.142890 2025-01-16 02:09:01,264 - INFO - step 19320, loss: 0.199503, best loss: 0.142890 2025-01-16 02:09:01,414 - INFO - step 19321, loss: 0.193062, best loss: 0.142890 2025-01-16 02:09:01,564 - INFO - step 19322, loss: 0.219537, best loss: 0.142890 2025-01-16 02:09:01,714 - INFO - step 19323, loss: 0.265685, best loss: 0.142890 2025-01-16 02:09:01,865 - INFO - step 19324, loss: 0.209764, best loss: 0.142890 2025-01-16 02:09:02,015 - INFO - step 19325, loss: 0.189591, best loss: 0.142890 2025-01-16 02:09:02,165 - INFO - step 19326, loss: 0.222055, best loss: 0.142890 2025-01-16 02:09:02,315 - INFO - step 19327, loss: 0.268802, best loss: 0.142890 2025-01-16 02:09:02,465 - INFO - step 19328, loss: 0.226490, best loss: 0.142890 2025-01-16 02:09:02,615 - INFO - step 19329, loss: 0.211046, best loss: 0.142890 2025-01-16 02:09:02,765 - INFO - step 19330, loss: 0.255017, best loss: 0.142890 2025-01-16 02:09:02,915 - INFO - step 19331, loss: 0.251215, best loss: 0.142890 2025-01-16 02:09:03,065 - INFO - step 19332, loss: 0.207354, best loss: 0.142890 2025-01-16 02:09:03,215 - INFO - step 19333, loss: 
0.194568, best loss: 0.142890 2025-01-16 02:09:03,365 - INFO - step 19334, loss: 0.218956, best loss: 0.142890 2025-01-16 02:09:03,515 - INFO - step 19335, loss: 0.217080, best loss: 0.142890 2025-01-16 02:09:03,665 - INFO - step 19336, loss: 0.250802, best loss: 0.142890 2025-01-16 02:09:03,815 - INFO - step 19337, loss: 0.250091, best loss: 0.142890 2025-01-16 02:09:03,965 - INFO - step 19338, loss: 0.255231, best loss: 0.142890 2025-01-16 02:09:04,115 - INFO - step 19339, loss: 0.202573, best loss: 0.142890 2025-01-16 02:09:04,265 - INFO - step 19340, loss: 0.225328, best loss: 0.142890 2025-01-16 02:09:04,415 - INFO - step 19341, loss: 0.226590, best loss: 0.142890 2025-01-16 02:09:04,565 - INFO - step 19342, loss: 0.247560, best loss: 0.142890 2025-01-16 02:09:04,715 - INFO - step 19343, loss: 0.241797, best loss: 0.142890 2025-01-16 02:09:04,865 - INFO - step 19344, loss: 0.200178, best loss: 0.142890 2025-01-16 02:09:05,015 - INFO - step 19345, loss: 0.209862, best loss: 0.142890 2025-01-16 02:09:05,165 - INFO - step 19346, loss: 0.199987, best loss: 0.142890 2025-01-16 02:09:05,315 - INFO - step 19347, loss: 0.233919, best loss: 0.142890 2025-01-16 02:09:05,466 - INFO - step 19348, loss: 0.226073, best loss: 0.142890 2025-01-16 02:09:05,616 - INFO - step 19349, loss: 0.218097, best loss: 0.142890 2025-01-16 02:09:05,766 - INFO - step 19350, loss: 0.202171, best loss: 0.142890 2025-01-16 02:09:05,916 - INFO - step 19351, loss: 0.215207, best loss: 0.142890 2025-01-16 02:09:06,066 - INFO - step 19352, loss: 0.148976, best loss: 0.142890 2025-01-16 02:09:06,217 - INFO - step 19353, loss: 0.226172, best loss: 0.142890 2025-01-16 02:09:06,367 - INFO - step 19354, loss: 0.209953, best loss: 0.142890 2025-01-16 02:09:06,517 - INFO - step 19355, loss: 0.267321, best loss: 0.142890 2025-01-16 02:09:06,667 - INFO - step 19356, loss: 0.227718, best loss: 0.142890 2025-01-16 02:09:06,817 - INFO - step 19357, loss: 0.212989, best loss: 0.142890 2025-01-16 02:09:06,967 - 
INFO - step 19358, loss: 0.230015, best loss: 0.142890 2025-01-16 02:09:07,117 - INFO - step 19359, loss: 0.287018, best loss: 0.142890 2025-01-16 02:09:07,267 - INFO - step 19360, loss: 0.235871, best loss: 0.142890 2025-01-16 02:09:07,417 - INFO - step 19361, loss: 0.268854, best loss: 0.142890 2025-01-16 02:09:07,567 - INFO - step 19362, loss: 0.210502, best loss: 0.142890 2025-01-16 02:09:07,717 - INFO - step 19363, loss: 0.268358, best loss: 0.142890 2025-01-16 02:09:07,868 - INFO - step 19364, loss: 0.218503, best loss: 0.142890 2025-01-16 02:09:08,018 - INFO - step 19365, loss: 0.209655, best loss: 0.142890 2025-01-16 02:09:08,168 - INFO - step 19366, loss: 0.210507, best loss: 0.142890 2025-01-16 02:09:08,318 - INFO - step 19367, loss: 0.202066, best loss: 0.142890 2025-01-16 02:09:08,468 - INFO - step 19368, loss: 0.221436, best loss: 0.142890 2025-01-16 02:09:08,618 - INFO - step 19369, loss: 0.241566, best loss: 0.142890 2025-01-16 02:09:08,768 - INFO - step 19370, loss: 0.234381, best loss: 0.142890 2025-01-16 02:09:08,918 - INFO - step 19371, loss: 0.186997, best loss: 0.142890 2025-01-16 02:09:09,068 - INFO - step 19372, loss: 0.195604, best loss: 0.142890 2025-01-16 02:09:09,218 - INFO - step 19373, loss: 0.229744, best loss: 0.142890 2025-01-16 02:09:09,369 - INFO - step 19374, loss: 0.242763, best loss: 0.142890 2025-01-16 02:09:09,519 - INFO - step 19375, loss: 0.226145, best loss: 0.142890 2025-01-16 02:09:09,669 - INFO - step 19376, loss: 0.215258, best loss: 0.142890 2025-01-16 02:09:09,819 - INFO - step 19377, loss: 0.182843, best loss: 0.142890 2025-01-16 02:09:09,969 - INFO - step 19378, loss: 0.163129, best loss: 0.142890 2025-01-16 02:09:10,119 - INFO - step 19379, loss: 0.179159, best loss: 0.142890 2025-01-16 02:09:10,269 - INFO - step 19380, loss: 0.275727, best loss: 0.142890 2025-01-16 02:09:10,419 - INFO - step 19381, loss: 0.256574, best loss: 0.142890 2025-01-16 02:09:10,569 - INFO - step 19382, loss: 0.219785, best loss: 0.142890 
2025-01-16 02:09:10,719 - INFO - step 19383, loss: 0.194782, best loss: 0.142890 2025-01-16 02:09:10,870 - INFO - step 19384, loss: 0.218492, best loss: 0.142890 2025-01-16 02:09:11,020 - INFO - step 19385, loss: 0.178619, best loss: 0.142890 2025-01-16 02:09:11,170 - INFO - step 19386, loss: 0.215269, best loss: 0.142890 2025-01-16 02:09:11,320 - INFO - step 19387, loss: 0.239668, best loss: 0.142890 2025-01-16 02:09:11,470 - INFO - step 19388, loss: 0.183422, best loss: 0.142890 2025-01-16 02:09:11,620 - INFO - step 19389, loss: 0.176857, best loss: 0.142890 2025-01-16 02:09:11,770 - INFO - step 19390, loss: 0.204055, best loss: 0.142890 2025-01-16 02:09:11,920 - INFO - step 19391, loss: 0.207321, best loss: 0.142890 2025-01-16 02:09:12,071 - INFO - step 19392, loss: 0.189569, best loss: 0.142890 2025-01-16 02:09:12,220 - INFO - step 19393, loss: 0.239151, best loss: 0.142890 2025-01-16 02:09:12,370 - INFO - step 19394, loss: 0.228089, best loss: 0.142890 2025-01-16 02:09:12,521 - INFO - step 19395, loss: 0.179745, best loss: 0.142890 2025-01-16 02:09:12,671 - INFO - step 19396, loss: 0.183467, best loss: 0.142890 2025-01-16 02:09:12,820 - INFO - step 19397, loss: 0.218231, best loss: 0.142890 2025-01-16 02:09:12,971 - INFO - step 19398, loss: 0.222441, best loss: 0.142890 2025-01-16 02:09:13,120 - INFO - step 19399, loss: 0.211549, best loss: 0.142890 2025-01-16 02:09:13,270 - INFO - step 19400, loss: 0.203636, best loss: 0.142890 2025-01-16 02:09:13,421 - INFO - step 19401, loss: 0.247731, best loss: 0.142890 2025-01-16 02:09:13,571 - INFO - step 19402, loss: 0.252949, best loss: 0.142890 2025-01-16 02:09:13,721 - INFO - step 19403, loss: 0.163550, best loss: 0.142890 2025-01-16 02:09:13,871 - INFO - step 19404, loss: 0.171512, best loss: 0.142890 2025-01-16 02:09:14,021 - INFO - step 19405, loss: 0.219342, best loss: 0.142890 2025-01-16 02:09:14,172 - INFO - step 19406, loss: 0.225443, best loss: 0.142890 2025-01-16 02:09:14,322 - INFO - step 19407, loss: 
0.178502, best loss: 0.142890 2025-01-16 02:09:14,472 - INFO - step 19408, loss: 0.190328, best loss: 0.142890 2025-01-16 02:09:14,622 - INFO - step 19409, loss: 0.221195, best loss: 0.142890 2025-01-16 02:09:14,772 - INFO - step 19410, loss: 0.208686, best loss: 0.142890 2025-01-16 02:09:14,922 - INFO - step 19411, loss: 0.188388, best loss: 0.142890 2025-01-16 02:09:15,072 - INFO - step 19412, loss: 0.218629, best loss: 0.142890 2025-01-16 02:09:15,222 - INFO - step 19413, loss: 0.199817, best loss: 0.142890 2025-01-16 02:09:15,372 - INFO - step 19414, loss: 0.191212, best loss: 0.142890 2025-01-16 02:09:15,522 - INFO - step 19415, loss: 0.179137, best loss: 0.142890 2025-01-16 02:09:15,672 - INFO - step 19416, loss: 0.199762, best loss: 0.142890 2025-01-16 02:09:15,822 - INFO - step 19417, loss: 0.162389, best loss: 0.142890 2025-01-16 02:09:15,973 - INFO - step 19418, loss: 0.205185, best loss: 0.142890 2025-01-16 02:09:16,123 - INFO - step 19419, loss: 0.226149, best loss: 0.142890 2025-01-16 02:09:16,273 - INFO - step 19420, loss: 0.187195, best loss: 0.142890 2025-01-16 02:09:16,423 - INFO - step 19421, loss: 0.175514, best loss: 0.142890 2025-01-16 02:09:16,573 - INFO - step 19422, loss: 0.209485, best loss: 0.142890 2025-01-16 02:09:16,723 - INFO - step 19423, loss: 0.149682, best loss: 0.142890 2025-01-16 02:09:16,874 - INFO - step 19424, loss: 0.206218, best loss: 0.142890 2025-01-16 02:09:17,024 - INFO - step 19425, loss: 0.162370, best loss: 0.142890 2025-01-16 02:09:17,174 - INFO - step 19426, loss: 0.221911, best loss: 0.142890 2025-01-16 02:09:17,324 - INFO - step 19427, loss: 0.171544, best loss: 0.142890 2025-01-16 02:09:17,474 - INFO - step 19428, loss: 0.201100, best loss: 0.142890 2025-01-16 02:09:17,625 - INFO - step 19429, loss: 0.238837, best loss: 0.142890 2025-01-16 02:09:17,774 - INFO - step 19430, loss: 0.186145, best loss: 0.142890 2025-01-16 02:09:17,925 - INFO - step 19431, loss: 0.271143, best loss: 0.142890 2025-01-16 02:09:18,075 - 
INFO - step 19432, loss: 0.201059, best loss: 0.142890 2025-01-16 02:09:18,225 - INFO - step 19433, loss: 0.184283, best loss: 0.142890 2025-01-16 02:09:18,375 - INFO - step 19434, loss: 0.179920, best loss: 0.142890 2025-01-16 02:09:18,525 - INFO - step 19435, loss: 0.214153, best loss: 0.142890 2025-01-16 02:09:18,675 - INFO - step 19436, loss: 0.191858, best loss: 0.142890 2025-01-16 02:09:18,825 - INFO - step 19437, loss: 0.256059, best loss: 0.142890 2025-01-16 02:09:18,975 - INFO - step 19438, loss: 0.199535, best loss: 0.142890 2025-01-16 02:09:19,125 - INFO - step 19439, loss: 0.229840, best loss: 0.142890 2025-01-16 02:09:19,275 - INFO - step 19440, loss: 0.229607, best loss: 0.142890 2025-01-16 02:09:19,426 - INFO - step 19441, loss: 0.212277, best loss: 0.142890 2025-01-16 02:09:19,576 - INFO - step 19442, loss: 0.214697, best loss: 0.142890 2025-01-16 02:09:19,726 - INFO - step 19443, loss: 0.267184, best loss: 0.142890 2025-01-16 02:09:19,876 - INFO - step 19444, loss: 0.231339, best loss: 0.142890 2025-01-16 02:09:20,026 - INFO - step 19445, loss: 0.217689, best loss: 0.142890 2025-01-16 02:09:20,176 - INFO - step 19446, loss: 0.177763, best loss: 0.142890 2025-01-16 02:09:20,326 - INFO - step 19447, loss: 0.230794, best loss: 0.142890 2025-01-16 02:09:20,476 - INFO - step 19448, loss: 0.229568, best loss: 0.142890 2025-01-16 02:09:20,626 - INFO - step 19449, loss: 0.250757, best loss: 0.142890 2025-01-16 02:09:20,776 - INFO - step 19450, loss: 0.202503, best loss: 0.142890 2025-01-16 02:09:20,926 - INFO - step 19451, loss: 0.206015, best loss: 0.142890 2025-01-16 02:09:21,076 - INFO - step 19452, loss: 0.246367, best loss: 0.142890 2025-01-16 02:09:21,226 - INFO - step 19453, loss: 0.206183, best loss: 0.142890 2025-01-16 02:09:21,377 - INFO - step 19454, loss: 0.206867, best loss: 0.142890 2025-01-16 02:09:21,527 - INFO - step 19455, loss: 0.169037, best loss: 0.142890 2025-01-16 02:09:21,677 - INFO - step 19456, loss: 0.157518, best loss: 0.142890 
2025-01-16 02:09:21,827 - INFO - step 19457, loss: 0.172250, best loss: 0.142890 2025-01-16 02:09:21,977 - INFO - step 19458, loss: 0.198455, best loss: 0.142890 2025-01-16 02:09:22,127 - INFO - step 19459, loss: 0.228432, best loss: 0.142890 2025-01-16 02:09:22,277 - INFO - step 19460, loss: 0.235615, best loss: 0.142890 2025-01-16 02:09:22,428 - INFO - step 19461, loss: 0.233375, best loss: 0.142890 2025-01-16 02:09:22,578 - INFO - step 19462, loss: 0.204961, best loss: 0.142890 2025-01-16 02:09:22,728 - INFO - step 19463, loss: 0.244862, best loss: 0.142890 2025-01-16 02:09:22,878 - INFO - step 19464, loss: 0.173427, best loss: 0.142890 2025-01-16 02:09:23,028 - INFO - step 19465, loss: 0.234548, best loss: 0.142890 2025-01-16 02:09:23,177 - INFO - step 19466, loss: 0.218089, best loss: 0.142890 2025-01-16 02:09:23,327 - INFO - step 19467, loss: 0.303318, best loss: 0.142890 2025-01-16 02:09:23,477 - INFO - step 19468, loss: 0.261176, best loss: 0.142890 2025-01-16 02:09:23,628 - INFO - step 19469, loss: 0.170112, best loss: 0.142890 2025-01-16 02:09:23,777 - INFO - step 19470, loss: 0.251205, best loss: 0.142890 2025-01-16 02:09:23,927 - INFO - step 19471, loss: 0.260029, best loss: 0.142890 2025-01-16 02:09:24,077 - INFO - step 19472, loss: 0.255505, best loss: 0.142890 2025-01-16 02:09:24,227 - INFO - step 19473, loss: 0.246082, best loss: 0.142890 2025-01-16 02:09:24,378 - INFO - step 19474, loss: 0.242552, best loss: 0.142890 2025-01-16 02:09:24,528 - INFO - step 19475, loss: 0.259519, best loss: 0.142890 2025-01-16 02:09:24,678 - INFO - step 19476, loss: 0.204920, best loss: 0.142890 2025-01-16 02:09:24,828 - INFO - step 19477, loss: 0.268360, best loss: 0.142890 2025-01-16 02:09:24,978 - INFO - step 19478, loss: 0.256139, best loss: 0.142890 2025-01-16 02:09:25,128 - INFO - step 19479, loss: 0.209789, best loss: 0.142890 2025-01-16 02:09:25,278 - INFO - step 19480, loss: 0.204076, best loss: 0.142890 2025-01-16 02:09:25,428 - INFO - step 19481, loss: 
0.183562, best loss: 0.142890 2025-01-16 02:09:25,578 - INFO - step 19482, loss: 0.196915, best loss: 0.142890 2025-01-16 02:09:25,728 - INFO - step 19483, loss: 0.244236, best loss: 0.142890 2025-01-16 02:09:25,879 - INFO - step 19484, loss: 0.204353, best loss: 0.142890 2025-01-16 02:09:26,029 - INFO - step 19485, loss: 0.238673, best loss: 0.142890 2025-01-16 02:09:26,179 - INFO - step 19486, loss: 0.216464, best loss: 0.142890 2025-01-16 02:09:26,329 - INFO - step 19487, loss: 0.178688, best loss: 0.142890 2025-01-16 02:09:26,479 - INFO - step 19488, loss: 0.233817, best loss: 0.142890 2025-01-16 02:09:26,629 - INFO - step 19489, loss: 0.199074, best loss: 0.142890 2025-01-16 02:09:26,779 - INFO - step 19490, loss: 0.214498, best loss: 0.142890 2025-01-16 02:09:26,930 - INFO - step 19491, loss: 0.205009, best loss: 0.142890 2025-01-16 02:09:27,080 - INFO - step 19492, loss: 0.203284, best loss: 0.142890 2025-01-16 02:09:27,230 - INFO - step 19493, loss: 0.210710, best loss: 0.142890 2025-01-16 02:09:27,380 - INFO - step 19494, loss: 0.206901, best loss: 0.142890 2025-01-16 02:09:27,530 - INFO - step 19495, loss: 0.251399, best loss: 0.142890 2025-01-16 02:09:27,680 - INFO - step 19496, loss: 0.187712, best loss: 0.142890 2025-01-16 02:09:27,830 - INFO - step 19497, loss: 0.191585, best loss: 0.142890 2025-01-16 02:09:27,980 - INFO - step 19498, loss: 0.163674, best loss: 0.142890 2025-01-16 02:09:28,130 - INFO - step 19499, loss: 0.235571, best loss: 0.142890 2025-01-16 02:09:28,280 - INFO - step 19500, loss: 0.206152, best loss: 0.142890 2025-01-16 02:09:28,430 - INFO - step 19501, loss: 0.190790, best loss: 0.142890 2025-01-16 02:09:28,580 - INFO - step 19502, loss: 0.230980, best loss: 0.142890 2025-01-16 02:09:28,730 - INFO - step 19503, loss: 0.184308, best loss: 0.142890 2025-01-16 02:09:28,880 - INFO - step 19504, loss: 0.194666, best loss: 0.142890 2025-01-16 02:09:29,031 - INFO - step 19505, loss: 0.201293, best loss: 0.142890 2025-01-16 02:09:29,181 - 
INFO - step 19506, loss: 0.249269, best loss: 0.142890 2025-01-16 02:09:29,331 - INFO - step 19507, loss: 0.251467, best loss: 0.142890 2025-01-16 02:09:29,481 - INFO - step 19508, loss: 0.242372, best loss: 0.142890 2025-01-16 02:09:29,632 - INFO - step 19509, loss: 0.219681, best loss: 0.142890 2025-01-16 02:09:29,782 - INFO - step 19510, loss: 0.229707, best loss: 0.142890 2025-01-16 02:09:29,933 - INFO - step 19511, loss: 0.229197, best loss: 0.142890 2025-01-16 02:09:30,083 - INFO - step 19512, loss: 0.253264, best loss: 0.142890 2025-01-16 02:09:30,233 - INFO - step 19513, loss: 0.219387, best loss: 0.142890 2025-01-16 02:09:30,382 - INFO - step 19514, loss: 0.199172, best loss: 0.142890 2025-01-16 02:09:30,532 - INFO - step 19515, loss: 0.196366, best loss: 0.142890 2025-01-16 02:09:30,683 - INFO - step 19516, loss: 0.236121, best loss: 0.142890 2025-01-16 02:09:30,833 - INFO - step 19517, loss: 0.209801, best loss: 0.142890 2025-01-16 02:09:30,983 - INFO - step 19518, loss: 0.182049, best loss: 0.142890 2025-01-16 02:09:31,133 - INFO - step 19519, loss: 0.194591, best loss: 0.142890 2025-01-16 02:09:31,283 - INFO - step 19520, loss: 0.195920, best loss: 0.142890 2025-01-16 02:09:31,433 - INFO - step 19521, loss: 0.202740, best loss: 0.142890 2025-01-16 02:09:31,583 - INFO - step 19522, loss: 0.165625, best loss: 0.142890 2025-01-16 02:09:31,733 - INFO - step 19523, loss: 0.217116, best loss: 0.142890 2025-01-16 02:09:31,883 - INFO - step 19524, loss: 0.173348, best loss: 0.142890 2025-01-16 02:09:32,033 - INFO - step 19525, loss: 0.188235, best loss: 0.142890 2025-01-16 02:09:32,184 - INFO - step 19526, loss: 0.211772, best loss: 0.142890 2025-01-16 02:09:32,334 - INFO - step 19527, loss: 0.217988, best loss: 0.142890 2025-01-16 02:09:32,484 - INFO - step 19528, loss: 0.245010, best loss: 0.142890 2025-01-16 02:09:32,634 - INFO - step 19529, loss: 0.245298, best loss: 0.142890 2025-01-16 02:09:32,784 - INFO - step 19530, loss: 0.219038, best loss: 0.142890 
2025-01-16 02:09:32,934 - INFO - step 19531, loss: 0.201208, best loss: 0.142890 2025-01-16 02:09:33,084 - INFO - step 19532, loss: 0.216632, best loss: 0.142890 2025-01-16 02:09:33,234 - INFO - step 19533, loss: 0.193196, best loss: 0.142890 2025-01-16 02:09:33,384 - INFO - step 19534, loss: 0.248158, best loss: 0.142890 2025-01-16 02:09:33,535 - INFO - step 19535, loss: 0.213427, best loss: 0.142890 2025-01-16 02:09:33,685 - INFO - step 19536, loss: 0.193274, best loss: 0.142890 2025-01-16 02:09:33,835 - INFO - step 19537, loss: 0.219942, best loss: 0.142890 2025-01-16 02:09:33,986 - INFO - step 19538, loss: 0.185586, best loss: 0.142890 2025-01-16 02:09:34,136 - INFO - step 19539, loss: 0.216251, best loss: 0.142890 2025-01-16 02:09:34,286 - INFO - step 19540, loss: 0.216875, best loss: 0.142890 2025-01-16 02:09:34,436 - INFO - step 19541, loss: 0.206826, best loss: 0.142890 2025-01-16 02:09:34,586 - INFO - step 19542, loss: 0.195815, best loss: 0.142890 2025-01-16 02:09:34,736 - INFO - step 19543, loss: 0.202161, best loss: 0.142890 2025-01-16 02:09:34,886 - INFO - step 19544, loss: 0.191448, best loss: 0.142890 2025-01-16 02:09:35,036 - INFO - step 19545, loss: 0.197263, best loss: 0.142890 2025-01-16 02:09:35,186 - INFO - step 19546, loss: 0.165375, best loss: 0.142890 2025-01-16 02:09:35,336 - INFO - step 19547, loss: 0.187240, best loss: 0.142890 2025-01-16 02:09:35,486 - INFO - step 19548, loss: 0.188817, best loss: 0.142890 2025-01-16 02:09:35,636 - INFO - step 19549, loss: 0.163145, best loss: 0.142890 2025-01-16 02:09:35,786 - INFO - step 19550, loss: 0.206011, best loss: 0.142890 2025-01-16 02:09:35,936 - INFO - step 19551, loss: 0.183769, best loss: 0.142890 2025-01-16 02:09:36,087 - INFO - step 19552, loss: 0.161822, best loss: 0.142890 2025-01-16 02:09:36,237 - INFO - step 19553, loss: 0.168039, best loss: 0.142890 2025-01-16 02:09:36,387 - INFO - step 19554, loss: 0.176350, best loss: 0.142890 2025-01-16 02:09:36,537 - INFO - step 19555, loss: 
0.204256, best loss: 0.142890 2025-01-16 02:09:36,687 - INFO - step 19556, loss: 0.189204, best loss: 0.142890 2025-01-16 02:09:36,837 - INFO - step 19557, loss: 0.219018, best loss: 0.142890 2025-01-16 02:09:36,987 - INFO - step 19558, loss: 0.216347, best loss: 0.142890 2025-01-16 02:09:37,137 - INFO - step 19559, loss: 0.199439, best loss: 0.142890 2025-01-16 02:09:37,287 - INFO - step 19560, loss: 0.172446, best loss: 0.142890 2025-01-16 02:09:37,438 - INFO - step 19561, loss: 0.270024, best loss: 0.142890 2025-01-16 02:09:37,588 - INFO - step 19562, loss: 0.164377, best loss: 0.142890 2025-01-16 02:09:37,738 - INFO - step 19563, loss: 0.203848, best loss: 0.142890 2025-01-16 02:09:37,888 - INFO - step 19564, loss: 0.210120, best loss: 0.142890 2025-01-16 02:09:38,039 - INFO - step 19565, loss: 0.260130, best loss: 0.142890 2025-01-16 02:09:38,189 - INFO - step 19566, loss: 0.193497, best loss: 0.142890 2025-01-16 02:09:38,339 - INFO - step 19567, loss: 0.201606, best loss: 0.142890 2025-01-16 02:09:38,489 - INFO - step 19568, loss: 0.201115, best loss: 0.142890 2025-01-16 02:09:38,638 - INFO - step 19569, loss: 0.177599, best loss: 0.142890 2025-01-16 02:09:38,788 - INFO - step 19570, loss: 0.162273, best loss: 0.142890 2025-01-16 02:09:38,937 - INFO - step 19571, loss: 0.171797, best loss: 0.142890 2025-01-16 02:09:39,086 - INFO - step 19572, loss: 0.244841, best loss: 0.142890 2025-01-16 02:09:39,236 - INFO - step 19573, loss: 0.233135, best loss: 0.142890 2025-01-16 02:09:39,385 - INFO - step 19574, loss: 0.226899, best loss: 0.142890 2025-01-16 02:09:39,535 - INFO - step 19575, loss: 0.226779, best loss: 0.142890 2025-01-16 02:09:39,685 - INFO - step 19576, loss: 0.203155, best loss: 0.142890 2025-01-16 02:09:39,835 - INFO - step 19577, loss: 0.188263, best loss: 0.142890 2025-01-16 02:09:39,984 - INFO - step 19578, loss: 0.173526, best loss: 0.142890 2025-01-16 02:09:40,134 - INFO - step 19579, loss: 0.205842, best loss: 0.142890 2025-01-16 02:09:40,284 - 
INFO - step 19580, loss: 0.227838, best loss: 0.142890 2025-01-16 02:09:40,433 - INFO - step 19581, loss: 0.229580, best loss: 0.142890 2025-01-16 02:09:40,584 - INFO - step 19582, loss: 0.217263, best loss: 0.142890 2025-01-16 02:09:40,734 - INFO - step 19583, loss: 0.177510, best loss: 0.142890 2025-01-16 02:09:40,884 - INFO - step 19584, loss: 0.248652, best loss: 0.142890 2025-01-16 02:09:41,034 - INFO - step 19585, loss: 0.243754, best loss: 0.142890 2025-01-16 02:09:41,184 - INFO - step 19586, loss: 0.223948, best loss: 0.142890 2025-01-16 02:09:41,334 - INFO - step 19587, loss: 0.174706, best loss: 0.142890 2025-01-16 02:09:41,484 - INFO - step 19588, loss: 0.243823, best loss: 0.142890 2025-01-16 02:09:41,634 - INFO - step 19589, loss: 0.229727, best loss: 0.142890 2025-01-16 02:09:41,784 - INFO - step 19590, loss: 0.236869, best loss: 0.142890 2025-01-16 02:09:41,934 - INFO - step 19591, loss: 0.206025, best loss: 0.142890 2025-01-16 02:09:42,084 - INFO - step 19592, loss: 0.225491, best loss: 0.142890 2025-01-16 02:09:42,234 - INFO - step 19593, loss: 0.268219, best loss: 0.142890 2025-01-16 02:09:42,384 - INFO - step 19594, loss: 0.194630, best loss: 0.142890 2025-01-16 02:09:42,534 - INFO - step 19595, loss: 0.240831, best loss: 0.142890 2025-01-16 02:09:42,685 - INFO - step 19596, loss: 0.281179, best loss: 0.142890 2025-01-16 02:09:42,835 - INFO - step 19597, loss: 0.248216, best loss: 0.142890 2025-01-16 02:09:42,984 - INFO - step 19598, loss: 0.200113, best loss: 0.142890 2025-01-16 02:09:43,134 - INFO - step 19599, loss: 0.245436, best loss: 0.142890 2025-01-16 02:09:43,284 - INFO - step 19600, loss: 0.260012, best loss: 0.142890 2025-01-16 02:09:43,435 - INFO - step 19601, loss: 0.205069, best loss: 0.142890 2025-01-16 02:09:43,585 - INFO - step 19602, loss: 0.316522, best loss: 0.142890 2025-01-16 02:09:43,735 - INFO - step 19603, loss: 0.169643, best loss: 0.142890 2025-01-16 02:09:43,885 - INFO - step 19604, loss: 0.237041, best loss: 0.142890 
2025-01-16 02:09:44,035 - INFO - step 19605, loss: 0.196177, best loss: 0.142890 2025-01-16 02:09:44,185 - INFO - step 19606, loss: 0.187098, best loss: 0.142890 2025-01-16 02:09:44,335 - INFO - step 19607, loss: 0.220097, best loss: 0.142890 2025-01-16 02:09:44,485 - INFO - step 19608, loss: 0.206406, best loss: 0.142890 2025-01-16 02:09:44,635 - INFO - step 19609, loss: 0.207746, best loss: 0.142890 2025-01-16 02:09:44,785 - INFO - step 19610, loss: 0.249426, best loss: 0.142890 2025-01-16 02:09:44,935 - INFO - step 19611, loss: 0.236552, best loss: 0.142890 2025-01-16 02:09:45,085 - INFO - step 19612, loss: 0.250464, best loss: 0.142890 2025-01-16 02:09:45,235 - INFO - step 19613, loss: 0.189160, best loss: 0.142890 2025-01-16 02:09:45,385 - INFO - step 19614, loss: 0.171852, best loss: 0.142890 2025-01-16 02:09:45,535 - INFO - step 19615, loss: 0.229434, best loss: 0.142890 2025-01-16 02:09:45,685 - INFO - step 19616, loss: 0.251495, best loss: 0.142890 2025-01-16 02:09:45,835 - INFO - step 19617, loss: 0.250705, best loss: 0.142890 2025-01-16 02:09:45,985 - INFO - step 19618, loss: 0.246914, best loss: 0.142890 2025-01-16 02:09:46,135 - INFO - step 19619, loss: 0.218892, best loss: 0.142890 2025-01-16 02:09:46,286 - INFO - step 19620, loss: 0.215373, best loss: 0.142890 2025-01-16 02:09:46,436 - INFO - step 19621, loss: 0.173980, best loss: 0.142890 2025-01-16 02:09:46,586 - INFO - step 19622, loss: 0.197305, best loss: 0.142890 2025-01-16 02:09:46,736 - INFO - step 19623, loss: 0.203217, best loss: 0.142890 2025-01-16 02:09:46,886 - INFO - step 19624, loss: 0.262500, best loss: 0.142890 2025-01-16 02:09:47,036 - INFO - step 19625, loss: 0.210975, best loss: 0.142890 2025-01-16 02:09:47,185 - INFO - step 19626, loss: 0.272928, best loss: 0.142890 2025-01-16 02:09:47,335 - INFO - step 19627, loss: 0.226085, best loss: 0.142890 2025-01-16 02:09:47,486 - INFO - step 19628, loss: 0.248898, best loss: 0.142890 2025-01-16 02:09:47,636 - INFO - step 19629, loss: 
0.198058, best loss: 0.142890 2025-01-16 02:09:47,786 - INFO - step 19630, loss: 0.219512, best loss: 0.142890 2025-01-16 02:09:47,936 - INFO - step 19631, loss: 0.189797, best loss: 0.142890 2025-01-16 02:09:48,086 - INFO - step 19632, loss: 0.231223, best loss: 0.142890 2025-01-16 02:09:48,236 - INFO - step 19633, loss: 0.190784, best loss: 0.142890 2025-01-16 02:09:48,387 - INFO - step 19634, loss: 0.217092, best loss: 0.142890 2025-01-16 02:09:48,537 - INFO - step 19635, loss: 0.185537, best loss: 0.142890 2025-01-16 02:09:48,687 - INFO - step 19636, loss: 0.207462, best loss: 0.142890 2025-01-16 02:09:48,837 - INFO - step 19637, loss: 0.150327, best loss: 0.142890 2025-01-16 02:09:48,987 - INFO - step 19638, loss: 0.212577, best loss: 0.142890 2025-01-16 02:09:49,137 - INFO - step 19639, loss: 0.224121, best loss: 0.142890 2025-01-16 02:09:49,287 - INFO - step 19640, loss: 0.225376, best loss: 0.142890 2025-01-16 02:09:49,437 - INFO - step 19641, loss: 0.227300, best loss: 0.142890 2025-01-16 02:09:49,587 - INFO - step 19642, loss: 0.197656, best loss: 0.142890 2025-01-16 02:09:49,737 - INFO - step 19643, loss: 0.265062, best loss: 0.142890 2025-01-16 02:09:49,887 - INFO - step 19644, loss: 0.211302, best loss: 0.142890 2025-01-16 02:09:50,037 - INFO - step 19645, loss: 0.189862, best loss: 0.142890 2025-01-16 02:09:50,187 - INFO - step 19646, loss: 0.221958, best loss: 0.142890 2025-01-16 02:09:50,337 - INFO - step 19647, loss: 0.240448, best loss: 0.142890 2025-01-16 02:09:50,487 - INFO - step 19648, loss: 0.181055, best loss: 0.142890 2025-01-16 02:09:50,638 - INFO - step 19649, loss: 0.217107, best loss: 0.142890 2025-01-16 02:09:50,788 - INFO - step 19650, loss: 0.179828, best loss: 0.142890 2025-01-16 02:09:50,938 - INFO - step 19651, loss: 0.184996, best loss: 0.142890 2025-01-16 02:09:51,088 - INFO - step 19652, loss: 0.177949, best loss: 0.142890 2025-01-16 02:09:51,238 - INFO - step 19653, loss: 0.240469, best loss: 0.142890 2025-01-16 02:09:51,388 - 
INFO - step 19654, loss: 0.232746, best loss: 0.142890 2025-01-16 02:09:51,538 - INFO - step 19655, loss: 0.183899, best loss: 0.142890 2025-01-16 02:09:51,688 - INFO - step 19656, loss: 0.177966, best loss: 0.142890 2025-01-16 02:09:51,838 - INFO - step 19657, loss: 0.214095, best loss: 0.142890 2025-01-16 02:09:51,988 - INFO - step 19658, loss: 0.233718, best loss: 0.142890 2025-01-16 02:09:52,138 - INFO - step 19659, loss: 0.219724, best loss: 0.142890 2025-01-16 02:09:52,288 - INFO - step 19660, loss: 0.169335, best loss: 0.142890 2025-01-16 02:09:52,438 - INFO - step 19661, loss: 0.256257, best loss: 0.142890 2025-01-16 02:09:52,588 - INFO - step 19662, loss: 0.216529, best loss: 0.142890 2025-01-16 02:09:52,738 - INFO - step 19663, loss: 0.173189, best loss: 0.142890 2025-01-16 02:09:52,888 - INFO - step 19664, loss: 0.211878, best loss: 0.142890 2025-01-16 02:09:53,038 - INFO - step 19665, loss: 0.186495, best loss: 0.142890 2025-01-16 02:09:53,188 - INFO - step 19666, loss: 0.221955, best loss: 0.142890 2025-01-16 02:09:53,338 - INFO - step 19667, loss: 0.170061, best loss: 0.142890 2025-01-16 02:09:53,488 - INFO - step 19668, loss: 0.244437, best loss: 0.142890 2025-01-16 02:09:53,638 - INFO - step 19669, loss: 0.178514, best loss: 0.142890 2025-01-16 02:09:53,788 - INFO - step 19670, loss: 0.181818, best loss: 0.142890 2025-01-16 02:09:53,938 - INFO - step 19671, loss: 0.192481, best loss: 0.142890 2025-01-16 02:09:54,088 - INFO - step 19672, loss: 0.269721, best loss: 0.142890 2025-01-16 02:09:54,238 - INFO - step 19673, loss: 0.213062, best loss: 0.142890 2025-01-16 02:09:54,388 - INFO - step 19674, loss: 0.189992, best loss: 0.142890 2025-01-16 02:09:54,538 - INFO - step 19675, loss: 0.197930, best loss: 0.142890 2025-01-16 02:09:54,688 - INFO - step 19676, loss: 0.219931, best loss: 0.142890 2025-01-16 02:09:54,838 - INFO - step 19677, loss: 0.196082, best loss: 0.142890 2025-01-16 02:09:54,988 - INFO - step 19678, loss: 0.192720, best loss: 0.142890 
2025-01-16 02:09:55,138 - INFO - step 19679, loss: 0.235969, best loss: 0.142890 2025-01-16 02:09:55,289 - INFO - step 19680, loss: 0.196837, best loss: 0.142890 2025-01-16 02:09:55,439 - INFO - step 19681, loss: 0.145396, best loss: 0.142890 2025-01-16 02:09:55,589 - INFO - step 19682, loss: 0.187030, best loss: 0.142890 2025-01-16 02:09:55,739 - INFO - step 19683, loss: 0.179305, best loss: 0.142890 2025-01-16 02:09:55,889 - INFO - step 19684, loss: 0.188665, best loss: 0.142890 2025-01-16 02:09:56,039 - INFO - step 19685, loss: 0.209184, best loss: 0.142890 2025-01-16 02:09:56,188 - INFO - step 19686, loss: 0.210380, best loss: 0.142890 2025-01-16 02:09:56,339 - INFO - step 19687, loss: 0.209976, best loss: 0.142890 2025-01-16 02:09:56,489 - INFO - step 19688, loss: 0.165795, best loss: 0.142890 2025-01-16 02:09:56,639 - INFO - step 19689, loss: 0.197356, best loss: 0.142890 2025-01-16 02:09:56,789 - INFO - step 19690, loss: 0.227735, best loss: 0.142890 2025-01-16 02:09:56,939 - INFO - step 19691, loss: 0.242245, best loss: 0.142890 2025-01-16 02:09:57,089 - INFO - step 19692, loss: 0.212286, best loss: 0.142890 2025-01-16 02:09:57,239 - INFO - step 19693, loss: 0.228319, best loss: 0.142890 2025-01-16 02:09:57,389 - INFO - step 19694, loss: 0.234342, best loss: 0.142890 2025-01-16 02:09:57,539 - INFO - step 19695, loss: 0.177875, best loss: 0.142890 2025-01-16 02:09:57,689 - INFO - step 19696, loss: 0.219081, best loss: 0.142890 2025-01-16 02:09:57,839 - INFO - step 19697, loss: 0.187305, best loss: 0.142890 2025-01-16 02:09:57,989 - INFO - step 19698, loss: 0.177559, best loss: 0.142890 2025-01-16 02:09:58,139 - INFO - step 19699, loss: 0.203221, best loss: 0.142890 2025-01-16 02:09:58,290 - INFO - step 19700, loss: 0.196784, best loss: 0.142890 2025-01-16 02:09:58,440 - INFO - step 19701, loss: 0.196841, best loss: 0.142890 2025-01-16 02:09:58,590 - INFO - step 19702, loss: 0.200824, best loss: 0.142890 2025-01-16 02:09:58,740 - INFO - step 19703, loss: 
0.221586, best loss: 0.142890 2025-01-16 02:09:58,890 - INFO - step 19704, loss: 0.270398, best loss: 0.142890 2025-01-16 02:09:59,040 - INFO - step 19705, loss: 0.215918, best loss: 0.142890 2025-01-16 02:09:59,190 - INFO - step 19706, loss: 0.218334, best loss: 0.142890 2025-01-16 02:09:59,340 - INFO - step 19707, loss: 0.240668, best loss: 0.142890 2025-01-16 02:09:59,490 - INFO - step 19708, loss: 0.220395, best loss: 0.142890 2025-01-16 02:09:59,640 - INFO - step 19709, loss: 0.216167, best loss: 0.142890 2025-01-16 02:09:59,791 - INFO - step 19710, loss: 0.209529, best loss: 0.142890 2025-01-16 02:09:59,941 - INFO - step 19711, loss: 0.237789, best loss: 0.142890 2025-01-16 02:10:00,091 - INFO - step 19712, loss: 0.205832, best loss: 0.142890 2025-01-16 02:10:00,241 - INFO - step 19713, loss: 0.220809, best loss: 0.142890 2025-01-16 02:10:00,391 - INFO - step 19714, loss: 0.211613, best loss: 0.142890 2025-01-16 02:10:00,541 - INFO - step 19715, loss: 0.212360, best loss: 0.142890 2025-01-16 02:10:00,691 - INFO - step 19716, loss: 0.213979, best loss: 0.142890 2025-01-16 02:10:00,842 - INFO - step 19717, loss: 0.178432, best loss: 0.142890 2025-01-16 02:10:00,992 - INFO - step 19718, loss: 0.166957, best loss: 0.142890 2025-01-16 02:10:01,142 - INFO - step 19719, loss: 0.230106, best loss: 0.142890 2025-01-16 02:10:01,292 - INFO - step 19720, loss: 0.216732, best loss: 0.142890 2025-01-16 02:10:01,442 - INFO - step 19721, loss: 0.186325, best loss: 0.142890 2025-01-16 02:10:01,592 - INFO - step 19722, loss: 0.145852, best loss: 0.142890 2025-01-16 02:10:01,742 - INFO - step 19723, loss: 0.206918, best loss: 0.142890 2025-01-16 02:10:01,892 - INFO - step 19724, loss: 0.178497, best loss: 0.142890 2025-01-16 02:10:02,042 - INFO - step 19725, loss: 0.152138, best loss: 0.142890 2025-01-16 02:10:02,192 - INFO - step 19726, loss: 0.170420, best loss: 0.142890 2025-01-16 02:10:02,342 - INFO - step 19727, loss: 0.218829, best loss: 0.142890 2025-01-16 02:10:02,492 - 
INFO - step 19728, loss: 0.169090, best loss: 0.142890 2025-01-16 02:10:02,642 - INFO - step 19729, loss: 0.234366, best loss: 0.142890 2025-01-16 02:10:02,792 - INFO - step 19730, loss: 0.218739, best loss: 0.142890 2025-01-16 02:10:02,943 - INFO - step 19731, loss: 0.223506, best loss: 0.142890 2025-01-16 02:10:03,093 - INFO - step 19732, loss: 0.216237, best loss: 0.142890 2025-01-16 02:10:03,243 - INFO - step 19733, loss: 0.217769, best loss: 0.142890 2025-01-16 02:10:03,393 - INFO - step 19734, loss: 0.164285, best loss: 0.142890 2025-01-16 02:10:03,543 - INFO - step 19735, loss: 0.178549, best loss: 0.142890 2025-01-16 02:10:03,693 - INFO - step 19736, loss: 0.221566, best loss: 0.142890 2025-01-16 02:10:03,842 - INFO - step 19737, loss: 0.200748, best loss: 0.142890 2025-01-16 02:10:03,993 - INFO - step 19738, loss: 0.204585, best loss: 0.142890 2025-01-16 02:10:04,143 - INFO - step 19739, loss: 0.212898, best loss: 0.142890 2025-01-16 02:10:04,293 - INFO - step 19740, loss: 0.197409, best loss: 0.142890 2025-01-16 02:10:04,443 - INFO - step 19741, loss: 0.147741, best loss: 0.142890 2025-01-16 02:10:04,593 - INFO - step 19742, loss: 0.226318, best loss: 0.142890 2025-01-16 02:10:04,743 - INFO - step 19743, loss: 0.213091, best loss: 0.142890 2025-01-16 02:10:04,893 - INFO - step 19744, loss: 0.167311, best loss: 0.142890 2025-01-16 02:10:05,043 - INFO - step 19745, loss: 0.170086, best loss: 0.142890 2025-01-16 02:10:05,193 - INFO - step 19746, loss: 0.169790, best loss: 0.142890 2025-01-16 02:10:05,343 - INFO - step 19747, loss: 0.208254, best loss: 0.142890 2025-01-16 02:10:05,493 - INFO - step 19748, loss: 0.188339, best loss: 0.142890 2025-01-16 02:10:05,644 - INFO - step 19749, loss: 0.186504, best loss: 0.142890 2025-01-16 02:10:05,794 - INFO - step 19750, loss: 0.232482, best loss: 0.142890 2025-01-16 02:10:05,944 - INFO - step 19751, loss: 0.187070, best loss: 0.142890 2025-01-16 02:10:06,094 - INFO - step 19752, loss: 0.205110, best loss: 0.142890 
2025-01-16 02:10:06,244 - INFO - step 19753, loss: 0.188427, best loss: 0.142890 2025-01-16 02:10:06,394 - INFO - step 19754, loss: 0.215629, best loss: 0.142890 2025-01-16 02:10:06,544 - INFO - step 19755, loss: 0.232273, best loss: 0.142890 2025-01-16 02:10:06,694 - INFO - step 19756, loss: 0.217004, best loss: 0.142890 2025-01-16 02:10:06,844 - INFO - step 19757, loss: 0.202463, best loss: 0.142890 2025-01-16 02:10:06,994 - INFO - step 19758, loss: 0.194176, best loss: 0.142890 2025-01-16 02:10:07,144 - INFO - step 19759, loss: 0.200866, best loss: 0.142890 2025-01-16 02:10:07,294 - INFO - step 19760, loss: 0.243299, best loss: 0.142890 2025-01-16 02:10:07,444 - INFO - step 19761, loss: 0.233585, best loss: 0.142890 2025-01-16 02:10:07,594 - INFO - step 19762, loss: 0.214224, best loss: 0.142890 2025-01-16 02:10:07,745 - INFO - step 19763, loss: 0.200747, best loss: 0.142890 2025-01-16 02:10:07,895 - INFO - step 19764, loss: 0.173822, best loss: 0.142890 2025-01-16 02:10:08,045 - INFO - step 19765, loss: 0.195314, best loss: 0.142890 2025-01-16 02:10:08,195 - INFO - step 19766, loss: 0.165219, best loss: 0.142890 2025-01-16 02:10:08,345 - INFO - step 19767, loss: 0.185874, best loss: 0.142890 2025-01-16 02:10:08,495 - INFO - step 19768, loss: 0.166830, best loss: 0.142890 2025-01-16 02:10:08,645 - INFO - step 19769, loss: 0.165716, best loss: 0.142890 2025-01-16 02:10:08,795 - INFO - step 19770, loss: 0.209647, best loss: 0.142890 2025-01-16 02:10:08,946 - INFO - step 19771, loss: 0.197576, best loss: 0.142890 2025-01-16 02:10:09,095 - INFO - step 19772, loss: 0.173694, best loss: 0.142890 2025-01-16 02:10:09,245 - INFO - step 19773, loss: 0.224803, best loss: 0.142890 2025-01-16 02:10:09,396 - INFO - step 19774, loss: 0.262031, best loss: 0.142890 2025-01-16 02:10:09,545 - INFO - step 19775, loss: 0.240959, best loss: 0.142890 2025-01-16 02:10:09,696 - INFO - step 19776, loss: 0.158492, best loss: 0.142890 2025-01-16 02:10:09,845 - INFO - step 19777, loss: 
0.184288, best loss: 0.142890 2025-01-16 02:10:09,995 - INFO - step 19778, loss: 0.209328, best loss: 0.142890 2025-01-16 02:10:10,146 - INFO - step 19779, loss: 0.229181, best loss: 0.142890 2025-01-16 02:10:10,296 - INFO - step 19780, loss: 0.171947, best loss: 0.142890 2025-01-16 02:10:10,446 - INFO - step 19781, loss: 0.196243, best loss: 0.142890 2025-01-16 02:10:10,596 - INFO - step 19782, loss: 0.281042, best loss: 0.142890 2025-01-16 02:10:10,746 - INFO - step 19783, loss: 0.244717, best loss: 0.142890 2025-01-16 02:10:10,896 - INFO - step 19784, loss: 0.269566, best loss: 0.142890 2025-01-16 02:10:11,046 - INFO - step 19785, loss: 0.203039, best loss: 0.142890 2025-01-16 02:10:11,196 - INFO - step 19786, loss: 0.213418, best loss: 0.142890 2025-01-16 02:10:11,346 - INFO - step 19787, loss: 0.199486, best loss: 0.142890 2025-01-16 02:10:11,496 - INFO - step 19788, loss: 0.207353, best loss: 0.142890 2025-01-16 02:10:11,646 - INFO - step 19789, loss: 0.211193, best loss: 0.142890 2025-01-16 02:10:11,796 - INFO - step 19790, loss: 0.216275, best loss: 0.142890 2025-01-16 02:10:11,946 - INFO - step 19791, loss: 0.223896, best loss: 0.142890 2025-01-16 02:10:12,096 - INFO - step 19792, loss: 0.236922, best loss: 0.142890 2025-01-16 02:10:12,246 - INFO - step 19793, loss: 0.268813, best loss: 0.142890 2025-01-16 02:10:12,396 - INFO - step 19794, loss: 0.180191, best loss: 0.142890 2025-01-16 02:10:12,547 - INFO - step 19795, loss: 0.169539, best loss: 0.142890 2025-01-16 02:10:12,696 - INFO - step 19796, loss: 0.215389, best loss: 0.142890 2025-01-16 02:10:12,846 - INFO - step 19797, loss: 0.314861, best loss: 0.142890 2025-01-16 02:10:12,997 - INFO - step 19798, loss: 0.192630, best loss: 0.142890 2025-01-16 02:10:13,147 - INFO - step 19799, loss: 0.174906, best loss: 0.142890 2025-01-16 02:10:13,297 - INFO - step 19800, loss: 0.226604, best loss: 0.142890 2025-01-16 02:10:13,447 - INFO - step 19801, loss: 0.202519, best loss: 0.142890 2025-01-16 02:10:13,597 - 
INFO - step 19802, loss: 0.210430, best loss: 0.142890 2025-01-16 02:10:13,747 - INFO - step 19803, loss: 0.194738, best loss: 0.142890 2025-01-16 02:10:13,897 - INFO - step 19804, loss: 0.204828, best loss: 0.142890 2025-01-16 02:10:14,048 - INFO - step 19805, loss: 0.201998, best loss: 0.142890 2025-01-16 02:10:14,198 - INFO - step 19806, loss: 0.226611, best loss: 0.142890 2025-01-16 02:10:14,348 - INFO - step 19807, loss: 0.239663, best loss: 0.142890 2025-01-16 02:10:14,498 - INFO - step 19808, loss: 0.201203, best loss: 0.142890 2025-01-16 02:10:14,648 - INFO - step 19809, loss: 0.223510, best loss: 0.142890 2025-01-16 02:10:14,799 - INFO - step 19810, loss: 0.157881, best loss: 0.142890 2025-01-16 02:10:14,949 - INFO - step 19811, loss: 0.192019, best loss: 0.142890 2025-01-16 02:10:15,099 - INFO - step 19812, loss: 0.158631, best loss: 0.142890 2025-01-16 02:10:15,249 - INFO - step 19813, loss: 0.233077, best loss: 0.142890 2025-01-16 02:10:15,399 - INFO - step 19814, loss: 0.232993, best loss: 0.142890 2025-01-16 02:10:15,549 - INFO - step 19815, loss: 0.210691, best loss: 0.142890 2025-01-16 02:10:15,699 - INFO - step 19816, loss: 0.228815, best loss: 0.142890 2025-01-16 02:10:15,849 - INFO - step 19817, loss: 0.192125, best loss: 0.142890 2025-01-16 02:10:16,000 - INFO - step 19818, loss: 0.234065, best loss: 0.142890 2025-01-16 02:10:16,150 - INFO - step 19819, loss: 0.216034, best loss: 0.142890 2025-01-16 02:10:16,300 - INFO - step 19820, loss: 0.182274, best loss: 0.142890 2025-01-16 02:10:16,450 - INFO - step 19821, loss: 0.186309, best loss: 0.142890 2025-01-16 02:10:16,600 - INFO - step 19822, loss: 0.201029, best loss: 0.142890 2025-01-16 02:10:16,750 - INFO - step 19823, loss: 0.197975, best loss: 0.142890 2025-01-16 02:10:16,900 - INFO - step 19824, loss: 0.238797, best loss: 0.142890 2025-01-16 02:10:17,050 - INFO - step 19825, loss: 0.236895, best loss: 0.142890 2025-01-16 02:10:17,201 - INFO - step 19826, loss: 0.212310, best loss: 0.142890 
2025-01-16 02:10:17,351 - INFO - step 19827, loss: 0.207003, best loss: 0.142890 2025-01-16 02:10:17,501 - INFO - step 19828, loss: 0.185415, best loss: 0.142890 2025-01-16 02:10:17,651 - INFO - step 19829, loss: 0.196383, best loss: 0.142890 2025-01-16 02:10:17,801 - INFO - step 19830, loss: 0.163556, best loss: 0.142890 2025-01-16 02:10:17,951 - INFO - step 19831, loss: 0.196494, best loss: 0.142890 2025-01-16 02:10:18,101 - INFO - step 19832, loss: 0.256161, best loss: 0.142890 2025-01-16 02:10:18,251 - INFO - step 19833, loss: 0.187765, best loss: 0.142890 2025-01-16 02:10:18,401 - INFO - step 19834, loss: 0.180655, best loss: 0.142890 2025-01-16 02:10:18,551 - INFO - step 19835, loss: 0.169255, best loss: 0.142890 2025-01-16 02:10:18,701 - INFO - step 19836, loss: 0.209293, best loss: 0.142890 2025-01-16 02:10:18,851 - INFO - step 19837, loss: 0.277753, best loss: 0.142890 2025-01-16 02:10:19,001 - INFO - step 19838, loss: 0.231751, best loss: 0.142890 2025-01-16 02:10:19,151 - INFO - step 19839, loss: 0.253419, best loss: 0.142890 2025-01-16 02:10:19,301 - INFO - step 19840, loss: 0.198041, best loss: 0.142890 2025-01-16 02:10:19,452 - INFO - step 19841, loss: 0.233487, best loss: 0.142890 2025-01-16 02:10:19,602 - INFO - step 19842, loss: 0.222521, best loss: 0.142890 2025-01-16 02:10:19,752 - INFO - step 19843, loss: 0.225778, best loss: 0.142890 2025-01-16 02:10:19,902 - INFO - step 19844, loss: 0.225196, best loss: 0.142890 2025-01-16 02:10:20,052 - INFO - step 19845, loss: 0.227646, best loss: 0.142890 2025-01-16 02:10:20,202 - INFO - step 19846, loss: 0.208747, best loss: 0.142890 2025-01-16 02:10:20,352 - INFO - step 19847, loss: 0.218610, best loss: 0.142890 2025-01-16 02:10:20,503 - INFO - step 19848, loss: 0.250446, best loss: 0.142890 2025-01-16 02:10:20,653 - INFO - step 19849, loss: 0.232666, best loss: 0.142890 2025-01-16 02:10:20,803 - INFO - step 19850, loss: 0.208234, best loss: 0.142890 2025-01-16 02:10:20,953 - INFO - step 19851, loss: 
0.209345, best loss: 0.142890 2025-01-16 02:10:21,103 - INFO - step 19852, loss: 0.191408, best loss: 0.142890 2025-01-16 02:10:21,254 - INFO - step 19853, loss: 0.243259, best loss: 0.142890 2025-01-16 02:10:21,404 - INFO - step 19854, loss: 0.156581, best loss: 0.142890 2025-01-16 02:10:21,554 - INFO - step 19855, loss: 0.229100, best loss: 0.142890 2025-01-16 02:10:21,704 - INFO - step 19856, loss: 0.162343, best loss: 0.142890 2025-01-16 02:10:21,854 - INFO - step 19857, loss: 0.178334, best loss: 0.142890 2025-01-16 02:10:22,004 - INFO - step 19858, loss: 0.204931, best loss: 0.142890 2025-01-16 02:10:22,155 - INFO - step 19859, loss: 0.222044, best loss: 0.142890 2025-01-16 02:10:22,305 - INFO - step 19860, loss: 0.219284, best loss: 0.142890 2025-01-16 02:10:22,455 - INFO - step 19861, loss: 0.207770, best loss: 0.142890 2025-01-16 02:10:22,605 - INFO - step 19862, loss: 0.231590, best loss: 0.142890 2025-01-16 02:10:22,756 - INFO - step 19863, loss: 0.217773, best loss: 0.142890 2025-01-16 02:10:22,906 - INFO - step 19864, loss: 0.214063, best loss: 0.142890 2025-01-16 02:10:23,056 - INFO - step 19865, loss: 0.204944, best loss: 0.142890 2025-01-16 02:10:23,206 - INFO - step 19866, loss: 0.196107, best loss: 0.142890 2025-01-16 02:10:23,355 - INFO - step 19867, loss: 0.209984, best loss: 0.142890 2025-01-16 02:10:23,505 - INFO - step 19868, loss: 0.217035, best loss: 0.142890 2025-01-16 02:10:23,655 - INFO - step 19869, loss: 0.174274, best loss: 0.142890 2025-01-16 02:10:23,805 - INFO - step 19870, loss: 0.232743, best loss: 0.142890 2025-01-16 02:10:23,955 - INFO - step 19871, loss: 0.226557, best loss: 0.142890 2025-01-16 02:10:24,105 - INFO - step 19872, loss: 0.178771, best loss: 0.142890 2025-01-16 02:10:24,255 - INFO - step 19873, loss: 0.245829, best loss: 0.142890 2025-01-16 02:10:24,405 - INFO - step 19874, loss: 0.167302, best loss: 0.142890 2025-01-16 02:10:24,555 - INFO - step 19875, loss: 0.208449, best loss: 0.142890 2025-01-16 02:10:24,705 - 
INFO - step 19876, loss: 0.209767, best loss: 0.142890 2025-01-16 02:10:24,855 - INFO - step 19877, loss: 0.215853, best loss: 0.142890 2025-01-16 02:10:25,005 - INFO - step 19878, loss: 0.226236, best loss: 0.142890 2025-01-16 02:10:25,155 - INFO - step 19879, loss: 0.270087, best loss: 0.142890 2025-01-16 02:10:25,305 - INFO - step 19880, loss: 0.203121, best loss: 0.142890 2025-01-16 02:10:25,455 - INFO - step 19881, loss: 0.164426, best loss: 0.142890 2025-01-16 02:10:25,605 - INFO - step 19882, loss: 0.205889, best loss: 0.142890 2025-01-16 02:10:25,755 - INFO - step 19883, loss: 0.205600, best loss: 0.142890 2025-01-16 02:10:25,905 - INFO - step 19884, loss: 0.184676, best loss: 0.142890 2025-01-16 02:10:26,055 - INFO - step 19885, loss: 0.180163, best loss: 0.142890 2025-01-16 02:10:26,205 - INFO - step 19886, loss: 0.181772, best loss: 0.142890 2025-01-16 02:10:26,355 - INFO - step 19887, loss: 0.172157, best loss: 0.142890 2025-01-16 02:10:26,506 - INFO - step 19888, loss: 0.215006, best loss: 0.142890 2025-01-16 02:10:26,656 - INFO - step 19889, loss: 0.180163, best loss: 0.142890 2025-01-16 02:10:26,806 - INFO - step 19890, loss: 0.143765, best loss: 0.142890 2025-01-16 02:10:26,956 - INFO - step 19891, loss: 0.307811, best loss: 0.142890 2025-01-16 02:10:27,106 - INFO - step 19892, loss: 0.189915, best loss: 0.142890 2025-01-16 02:10:27,256 - INFO - step 19893, loss: 0.181684, best loss: 0.142890 2025-01-16 02:10:27,406 - INFO - step 19894, loss: 0.233339, best loss: 0.142890 2025-01-16 02:10:27,556 - INFO - step 19895, loss: 0.274011, best loss: 0.142890 2025-01-16 02:10:27,706 - INFO - step 19896, loss: 0.241467, best loss: 0.142890 2025-01-16 02:10:27,856 - INFO - step 19897, loss: 0.187771, best loss: 0.142890 2025-01-16 02:10:28,005 - INFO - step 19898, loss: 0.204479, best loss: 0.142890 2025-01-16 02:10:28,155 - INFO - step 19899, loss: 0.225339, best loss: 0.142890 2025-01-16 02:10:28,305 - INFO - step 19900, loss: 0.185387, best loss: 0.142890 
2025-01-16 02:10:28,456 - INFO - step 19901, loss: 0.155424, best loss: 0.142890 2025-01-16 02:10:28,606 - INFO - step 19902, loss: 0.199657, best loss: 0.142890 2025-01-16 02:10:28,756 - INFO - step 19903, loss: 0.192425, best loss: 0.142890 2025-01-16 02:10:28,906 - INFO - step 19904, loss: 0.185791, best loss: 0.142890 2025-01-16 02:10:29,056 - INFO - step 19905, loss: 0.191719, best loss: 0.142890 2025-01-16 02:10:29,206 - INFO - step 19906, loss: 0.197097, best loss: 0.142890 2025-01-16 02:10:29,356 - INFO - step 19907, loss: 0.184015, best loss: 0.142890 2025-01-16 02:10:29,506 - INFO - step 19908, loss: 0.184155, best loss: 0.142890 2025-01-16 02:10:29,656 - INFO - step 19909, loss: 0.262153, best loss: 0.142890 2025-01-16 02:10:29,806 - INFO - step 19910, loss: 0.210943, best loss: 0.142890 2025-01-16 02:10:29,957 - INFO - step 19911, loss: 0.211678, best loss: 0.142890 2025-01-16 02:10:30,107 - INFO - step 19912, loss: 0.225400, best loss: 0.142890 2025-01-16 02:10:30,257 - INFO - step 19913, loss: 0.213862, best loss: 0.142890 2025-01-16 02:10:30,407 - INFO - step 19914, loss: 0.234210, best loss: 0.142890 2025-01-16 02:10:30,557 - INFO - step 19915, loss: 0.252009, best loss: 0.142890 2025-01-16 02:10:30,707 - INFO - step 19916, loss: 0.208773, best loss: 0.142890 2025-01-16 02:10:30,857 - INFO - step 19917, loss: 0.196223, best loss: 0.142890 2025-01-16 02:10:31,007 - INFO - step 19918, loss: 0.209560, best loss: 0.142890 2025-01-16 02:10:31,157 - INFO - step 19919, loss: 0.197458, best loss: 0.142890 2025-01-16 02:10:31,307 - INFO - step 19920, loss: 0.246384, best loss: 0.142890 2025-01-16 02:10:31,457 - INFO - step 19921, loss: 0.184722, best loss: 0.142890 2025-01-16 02:10:31,607 - INFO - step 19922, loss: 0.185247, best loss: 0.142890 2025-01-16 02:10:31,757 - INFO - step 19923, loss: 0.224344, best loss: 0.142890 2025-01-16 02:10:31,907 - INFO - step 19924, loss: 0.207692, best loss: 0.142890 2025-01-16 02:10:32,057 - INFO - step 19925, loss: 
0.185525, best loss: 0.142890 2025-01-16 02:10:32,207 - INFO - step 19926, loss: 0.201721, best loss: 0.142890 2025-01-16 02:10:32,357 - INFO - step 19927, loss: 0.217842, best loss: 0.142890 2025-01-16 02:10:32,507 - INFO - step 19928, loss: 0.208206, best loss: 0.142890 2025-01-16 02:10:32,657 - INFO - step 19929, loss: 0.275089, best loss: 0.142890 2025-01-16 02:10:32,807 - INFO - step 19930, loss: 0.181554, best loss: 0.142890 2025-01-16 02:10:32,957 - INFO - step 19931, loss: 0.228091, best loss: 0.142890 2025-01-16 02:10:33,107 - INFO - step 19932, loss: 0.214252, best loss: 0.142890 2025-01-16 02:10:33,257 - INFO - step 19933, loss: 0.186715, best loss: 0.142890 2025-01-16 02:10:33,407 - INFO - step 19934, loss: 0.259019, best loss: 0.142890 2025-01-16 02:10:33,557 - INFO - step 19935, loss: 0.148336, best loss: 0.142890 2025-01-16 02:10:33,707 - INFO - step 19936, loss: 0.205924, best loss: 0.142890 2025-01-16 02:10:33,857 - INFO - step 19937, loss: 0.226818, best loss: 0.142890 2025-01-16 02:10:34,007 - INFO - step 19938, loss: 0.226974, best loss: 0.142890 2025-01-16 02:10:34,157 - INFO - step 19939, loss: 0.178829, best loss: 0.142890 2025-01-16 02:10:34,307 - INFO - step 19940, loss: 0.208465, best loss: 0.142890 2025-01-16 02:10:34,457 - INFO - step 19941, loss: 0.231415, best loss: 0.142890 2025-01-16 02:10:34,608 - INFO - step 19942, loss: 0.207356, best loss: 0.142890 2025-01-16 02:10:34,758 - INFO - step 19943, loss: 0.213437, best loss: 0.142890 2025-01-16 02:10:34,908 - INFO - step 19944, loss: 0.162412, best loss: 0.142890 2025-01-16 02:10:35,058 - INFO - step 19945, loss: 0.208079, best loss: 0.142890 2025-01-16 02:10:35,208 - INFO - step 19946, loss: 0.217482, best loss: 0.142890 2025-01-16 02:10:35,358 - INFO - step 19947, loss: 0.194838, best loss: 0.142890 2025-01-16 02:10:35,508 - INFO - step 19948, loss: 0.184406, best loss: 0.142890 2025-01-16 02:10:35,658 - INFO - step 19949, loss: 0.218473, best loss: 0.142890 2025-01-16 02:10:35,809 - 
INFO - step 19950, loss: 0.187315, best loss: 0.142890 2025-01-16 02:10:35,959 - INFO - step 19951, loss: 0.238122, best loss: 0.142890 2025-01-16 02:10:36,109 - INFO - step 19952, loss: 0.202290, best loss: 0.142890 2025-01-16 02:10:36,259 - INFO - step 19953, loss: 0.226387, best loss: 0.142890 2025-01-16 02:10:36,409 - INFO - step 19954, loss: 0.242640, best loss: 0.142890 2025-01-16 02:10:36,559 - INFO - step 19955, loss: 0.191205, best loss: 0.142890 2025-01-16 02:10:36,709 - INFO - step 19956, loss: 0.198460, best loss: 0.142890 2025-01-16 02:10:36,859 - INFO - step 19957, loss: 0.195522, best loss: 0.142890 2025-01-16 02:10:37,010 - INFO - step 19958, loss: 0.180668, best loss: 0.142890 2025-01-16 02:10:37,160 - INFO - step 19959, loss: 0.235116, best loss: 0.142890 2025-01-16 02:10:37,310 - INFO - step 19960, loss: 0.231126, best loss: 0.142890 2025-01-16 02:10:37,460 - INFO - step 19961, loss: 0.223121, best loss: 0.142890 2025-01-16 02:10:37,610 - INFO - step 19962, loss: 0.204250, best loss: 0.142890 2025-01-16 02:10:37,760 - INFO - step 19963, loss: 0.215036, best loss: 0.142890 2025-01-16 02:10:37,910 - INFO - step 19964, loss: 0.180475, best loss: 0.142890 2025-01-16 02:10:38,060 - INFO - step 19965, loss: 0.186320, best loss: 0.142890 2025-01-16 02:10:38,210 - INFO - step 19966, loss: 0.258894, best loss: 0.142890 2025-01-16 02:10:38,360 - INFO - step 19967, loss: 0.179105, best loss: 0.142890 2025-01-16 02:10:38,510 - INFO - step 19968, loss: 0.198634, best loss: 0.142890 2025-01-16 02:10:38,660 - INFO - step 19969, loss: 0.247308, best loss: 0.142890 2025-01-16 02:10:38,810 - INFO - step 19970, loss: 0.167133, best loss: 0.142890 2025-01-16 02:10:38,961 - INFO - step 19971, loss: 0.203322, best loss: 0.142890 2025-01-16 02:10:39,110 - INFO - step 19972, loss: 0.217567, best loss: 0.142890 2025-01-16 02:10:39,260 - INFO - step 19973, loss: 0.154515, best loss: 0.142890 2025-01-16 02:10:39,410 - INFO - step 19974, loss: 0.213217, best loss: 0.142890 
2025-01-16 02:10:39,561 - INFO - step 19975, loss: 0.166833, best loss: 0.142890 2025-01-16 02:10:39,711 - INFO - step 19976, loss: 0.248979, best loss: 0.142890 2025-01-16 02:10:39,861 - INFO - step 19977, loss: 0.177802, best loss: 0.142890 2025-01-16 02:10:40,011 - INFO - step 19978, loss: 0.191219, best loss: 0.142890 2025-01-16 02:10:40,161 - INFO - step 19979, loss: 0.179849, best loss: 0.142890 2025-01-16 02:10:40,311 - INFO - step 19980, loss: 0.192365, best loss: 0.142890 2025-01-16 02:10:40,461 - INFO - step 19981, loss: 0.167022, best loss: 0.142890 2025-01-16 02:10:40,611 - INFO - step 19982, loss: 0.168864, best loss: 0.142890 2025-01-16 02:10:40,761 - INFO - step 19983, loss: 0.233617, best loss: 0.142890 2025-01-16 02:10:40,911 - INFO - step 19984, loss: 0.198582, best loss: 0.142890 2025-01-16 02:10:41,061 - INFO - step 19985, loss: 0.199365, best loss: 0.142890 2025-01-16 02:10:41,211 - INFO - step 19986, loss: 0.228263, best loss: 0.142890 2025-01-16 02:10:41,361 - INFO - step 19987, loss: 0.226976, best loss: 0.142890 2025-01-16 02:10:41,511 - INFO - step 19988, loss: 0.196920, best loss: 0.142890 2025-01-16 02:10:41,661 - INFO - step 19989, loss: 0.207607, best loss: 0.142890 2025-01-16 02:10:41,811 - INFO - step 19990, loss: 0.265704, best loss: 0.142890 2025-01-16 02:10:41,961 - INFO - step 19991, loss: 0.214828, best loss: 0.142890 2025-01-16 02:10:42,111 - INFO - step 19992, loss: 0.213980, best loss: 0.142890 2025-01-16 02:10:42,261 - INFO - step 19993, loss: 0.214794, best loss: 0.142890 2025-01-16 02:10:42,411 - INFO - step 19994, loss: 0.218346, best loss: 0.142890 2025-01-16 02:10:42,561 - INFO - step 19995, loss: 0.184816, best loss: 0.142890 2025-01-16 02:10:42,711 - INFO - step 19996, loss: 0.204120, best loss: 0.142890 2025-01-16 02:10:42,861 - INFO - step 19997, loss: 0.172326, best loss: 0.142890 2025-01-16 02:10:43,011 - INFO - step 19998, loss: 0.221341, best loss: 0.142890 2025-01-16 02:10:43,161 - INFO - step 19999, loss: 
0.183639, best loss: 0.142890 2025-01-16 02:10:43,311 - INFO - step 20000, loss: 0.179492, best loss: 0.142890 2025-01-16 02:10:43,461 - INFO - step 20001, loss: 0.202468, best loss: 0.142890 2025-01-16 02:10:43,611 - INFO - step 20002, loss: 0.215495, best loss: 0.142890 2025-01-16 02:10:43,761 - INFO - step 20003, loss: 0.263164, best loss: 0.142890 2025-01-16 02:10:43,911 - INFO - step 20004, loss: 0.201289, best loss: 0.142890 2025-01-16 02:10:44,061 - INFO - step 20005, loss: 0.188962, best loss: 0.142890 2025-01-16 02:10:44,211 - INFO - step 20006, loss: 0.192807, best loss: 0.142890 2025-01-16 02:10:44,361 - INFO - step 20007, loss: 0.169703, best loss: 0.142890 2025-01-16 02:10:44,511 - INFO - step 20008, loss: 0.272791, best loss: 0.142890 2025-01-16 02:10:44,662 - INFO - step 20009, loss: 0.221020, best loss: 0.142890 2025-01-16 02:10:44,812 - INFO - step 20010, loss: 0.171535, best loss: 0.142890 2025-01-16 02:10:44,962 - INFO - step 20011, loss: 0.184962, best loss: 0.142890 2025-01-16 02:10:45,112 - INFO - step 20012, loss: 0.200960, best loss: 0.142890 2025-01-16 02:10:45,262 - INFO - step 20013, loss: 0.190025, best loss: 0.142890 2025-01-16 02:10:45,412 - INFO - step 20014, loss: 0.204860, best loss: 0.142890 2025-01-16 02:10:45,562 - INFO - step 20015, loss: 0.201423, best loss: 0.142890 2025-01-16 02:10:45,712 - INFO - step 20016, loss: 0.206260, best loss: 0.142890 2025-01-16 02:10:45,862 - INFO - step 20017, loss: 0.210319, best loss: 0.142890 2025-01-16 02:10:46,012 - INFO - step 20018, loss: 0.174196, best loss: 0.142890 2025-01-16 02:10:46,162 - INFO - step 20019, loss: 0.210185, best loss: 0.142890 2025-01-16 02:10:46,312 - INFO - step 20020, loss: 0.211444, best loss: 0.142890 2025-01-16 02:10:46,462 - INFO - step 20021, loss: 0.275833, best loss: 0.142890 2025-01-16 02:10:46,613 - INFO - step 20022, loss: 0.241314, best loss: 0.142890 2025-01-16 02:10:46,763 - INFO - step 20023, loss: 0.200855, best loss: 0.142890 2025-01-16 02:10:46,913 - 
INFO - step 20024, loss: 0.210245, best loss: 0.142890 2025-01-16 02:10:47,063 - INFO - step 20025, loss: 0.220460, best loss: 0.142890 2025-01-16 02:10:47,213 - INFO - step 20026, loss: 0.192432, best loss: 0.142890 2025-01-16 02:10:47,363 - INFO - step 20027, loss: 0.176015, best loss: 0.142890 2025-01-16 02:10:47,513 - INFO - step 20028, loss: 0.183185, best loss: 0.142890 2025-01-16 02:10:47,663 - INFO - step 20029, loss: 0.211457, best loss: 0.142890 2025-01-16 02:10:47,813 - INFO - step 20030, loss: 0.189402, best loss: 0.142890 2025-01-16 02:10:47,963 - INFO - step 20031, loss: 0.162280, best loss: 0.142890 2025-01-16 02:10:48,114 - INFO - step 20032, loss: 0.178216, best loss: 0.142890 2025-01-16 02:10:48,264 - INFO - step 20033, loss: 0.210536, best loss: 0.142890 2025-01-16 02:10:48,414 - INFO - step 20034, loss: 0.207589, best loss: 0.142890 2025-01-16 02:10:48,564 - INFO - step 20035, loss: 0.215845, best loss: 0.142890 2025-01-16 02:10:48,714 - INFO - step 20036, loss: 0.199090, best loss: 0.142890 2025-01-16 02:10:48,864 - INFO - step 20037, loss: 0.222163, best loss: 0.142890 2025-01-16 02:10:49,014 - INFO - step 20038, loss: 0.191385, best loss: 0.142890 2025-01-16 02:10:49,164 - INFO - step 20039, loss: 0.198811, best loss: 0.142890 2025-01-16 02:10:49,314 - INFO - step 20040, loss: 0.192831, best loss: 0.142890 2025-01-16 02:10:49,465 - INFO - step 20041, loss: 0.203165, best loss: 0.142890 2025-01-16 02:10:49,615 - INFO - step 20042, loss: 0.157623, best loss: 0.142890 2025-01-16 02:10:49,765 - INFO - step 20043, loss: 0.227637, best loss: 0.142890 2025-01-16 02:10:49,915 - INFO - step 20044, loss: 0.267878, best loss: 0.142890 2025-01-16 02:10:50,065 - INFO - step 20045, loss: 0.201030, best loss: 0.142890 2025-01-16 02:10:50,215 - INFO - step 20046, loss: 0.212771, best loss: 0.142890 2025-01-16 02:10:50,365 - INFO - step 20047, loss: 0.194271, best loss: 0.142890 2025-01-16 02:10:50,515 - INFO - step 20048, loss: 0.200267, best loss: 0.142890 
2025-01-16 02:10:50,665 - INFO - step 20049, loss: 0.220317, best loss: 0.142890 2025-01-16 02:10:50,815 - INFO - step 20050, loss: 0.253806, best loss: 0.142890 2025-01-16 02:10:50,965 - INFO - step 20051, loss: 0.191685, best loss: 0.142890 2025-01-16 02:10:51,115 - INFO - step 20052, loss: 0.162294, best loss: 0.142890 2025-01-16 02:10:51,265 - INFO - step 20053, loss: 0.172809, best loss: 0.142890 2025-01-16 02:10:51,415 - INFO - step 20054, loss: 0.180442, best loss: 0.142890 2025-01-16 02:10:51,565 - INFO - step 20055, loss: 0.214266, best loss: 0.142890 2025-01-16 02:10:51,715 - INFO - step 20056, loss: 0.215745, best loss: 0.142890 2025-01-16 02:10:51,865 - INFO - step 20057, loss: 0.195362, best loss: 0.142890 2025-01-16 02:10:52,015 - INFO - step 20058, loss: 0.154760, best loss: 0.142890 2025-01-16 02:10:52,165 - INFO - step 20059, loss: 0.249336, best loss: 0.142890 2025-01-16 02:10:52,315 - INFO - step 20060, loss: 0.182035, best loss: 0.142890 2025-01-16 02:10:52,465 - INFO - step 20061, loss: 0.162195, best loss: 0.142890 2025-01-16 02:10:52,615 - INFO - step 20062, loss: 0.256937, best loss: 0.142890 2025-01-16 02:10:52,765 - INFO - step 20063, loss: 0.173983, best loss: 0.142890 2025-01-16 02:10:52,915 - INFO - step 20064, loss: 0.168270, best loss: 0.142890 2025-01-16 02:10:53,066 - INFO - step 20065, loss: 0.195497, best loss: 0.142890 2025-01-16 02:10:53,216 - INFO - step 20066, loss: 0.175103, best loss: 0.142890 2025-01-16 02:10:53,366 - INFO - step 20067, loss: 0.188016, best loss: 0.142890 2025-01-16 02:10:53,516 - INFO - step 20068, loss: 0.201069, best loss: 0.142890 2025-01-16 02:10:53,666 - INFO - step 20069, loss: 0.203014, best loss: 0.142890 2025-01-16 02:10:53,816 - INFO - step 20070, loss: 0.190165, best loss: 0.142890 2025-01-16 02:10:53,966 - INFO - step 20071, loss: 0.171344, best loss: 0.142890 2025-01-16 02:10:54,116 - INFO - step 20072, loss: 0.185787, best loss: 0.142890 2025-01-16 02:10:54,266 - INFO - step 20073, loss: 
0.205360, best loss: 0.142890 2025-01-16 02:10:54,416 - INFO - step 20074, loss: 0.173678, best loss: 0.142890 2025-01-16 02:10:54,566 - INFO - step 20075, loss: 0.192421, best loss: 0.142890 2025-01-16 02:10:58,100 - INFO - step 20076, loss: 0.138859, best loss: 0.138859 2025-01-16 02:10:58,263 - INFO - step 20077, loss: 0.190276, best loss: 0.138859 2025-01-16 02:10:58,415 - INFO - step 20078, loss: 0.200015, best loss: 0.138859 2025-01-16 02:10:58,566 - INFO - step 20079, loss: 0.193403, best loss: 0.138859 2025-01-16 02:10:58,716 - INFO - step 20080, loss: 0.211114, best loss: 0.138859 2025-01-16 02:10:58,866 - INFO - step 20081, loss: 0.147083, best loss: 0.138859 2025-01-16 02:10:59,016 - INFO - step 20082, loss: 0.201610, best loss: 0.138859 2025-01-16 02:10:59,166 - INFO - step 20083, loss: 0.170632, best loss: 0.138859 2025-01-16 02:10:59,316 - INFO - step 20084, loss: 0.198132, best loss: 0.138859 2025-01-16 02:10:59,466 - INFO - step 20085, loss: 0.180354, best loss: 0.138859 2025-01-16 02:10:59,617 - INFO - step 20086, loss: 0.200169, best loss: 0.138859 2025-01-16 02:10:59,767 - INFO - step 20087, loss: 0.187454, best loss: 0.138859 2025-01-16 02:10:59,917 - INFO - step 20088, loss: 0.199850, best loss: 0.138859 2025-01-16 02:11:00,067 - INFO - step 20089, loss: 0.187007, best loss: 0.138859 2025-01-16 02:11:00,217 - INFO - step 20090, loss: 0.239076, best loss: 0.138859 2025-01-16 02:11:00,367 - INFO - step 20091, loss: 0.178732, best loss: 0.138859 2025-01-16 02:11:00,517 - INFO - step 20092, loss: 0.214108, best loss: 0.138859 2025-01-16 02:11:00,667 - INFO - step 20093, loss: 0.245960, best loss: 0.138859 2025-01-16 02:11:00,817 - INFO - step 20094, loss: 0.200721, best loss: 0.138859 2025-01-16 02:11:00,967 - INFO - step 20095, loss: 0.218035, best loss: 0.138859 2025-01-16 02:11:01,116 - INFO - step 20096, loss: 0.195401, best loss: 0.138859 2025-01-16 02:11:01,266 - INFO - step 20097, loss: 0.197473, best loss: 0.138859 2025-01-16 02:11:01,417 - 
INFO - step 20098, loss: 0.177120, best loss: 0.138859 2025-01-16 02:11:01,567 - INFO - step 20099, loss: 0.167709, best loss: 0.138859 2025-01-16 02:11:01,717 - INFO - step 20100, loss: 0.184570, best loss: 0.138859 2025-01-16 02:11:01,867 - INFO - step 20101, loss: 0.228250, best loss: 0.138859 2025-01-16 02:11:02,017 - INFO - step 20102, loss: 0.204467, best loss: 0.138859 2025-01-16 02:11:02,167 - INFO - step 20103, loss: 0.191260, best loss: 0.138859 2025-01-16 02:11:02,317 - INFO - step 20104, loss: 0.230895, best loss: 0.138859 2025-01-16 02:11:02,467 - INFO - step 20105, loss: 0.194726, best loss: 0.138859 2025-01-16 02:11:02,617 - INFO - step 20106, loss: 0.141389, best loss: 0.138859 2025-01-16 02:11:02,767 - INFO - step 20107, loss: 0.196809, best loss: 0.138859 2025-01-16 02:11:02,917 - INFO - step 20108, loss: 0.243342, best loss: 0.138859 2025-01-16 02:11:03,067 - INFO - step 20109, loss: 0.160857, best loss: 0.138859 2025-01-16 02:11:03,223 - INFO - step 20110, loss: 0.186317, best loss: 0.138859 2025-01-16 02:11:03,373 - INFO - step 20111, loss: 0.163013, best loss: 0.138859 2025-01-16 02:11:03,523 - INFO - step 20112, loss: 0.293172, best loss: 0.138859 2025-01-16 02:11:03,673 - INFO - step 20113, loss: 0.197579, best loss: 0.138859 2025-01-16 02:11:03,823 - INFO - step 20114, loss: 0.274823, best loss: 0.138859 2025-01-16 02:11:03,974 - INFO - step 20115, loss: 0.201845, best loss: 0.138859 2025-01-16 02:11:04,124 - INFO - step 20116, loss: 0.176968, best loss: 0.138859 2025-01-16 02:11:04,274 - INFO - step 20117, loss: 0.185302, best loss: 0.138859 2025-01-16 02:11:04,424 - INFO - step 20118, loss: 0.209191, best loss: 0.138859 2025-01-16 02:11:04,573 - INFO - step 20119, loss: 0.230606, best loss: 0.138859 2025-01-16 02:11:04,723 - INFO - step 20120, loss: 0.191589, best loss: 0.138859 2025-01-16 02:11:04,874 - INFO - step 20121, loss: 0.201414, best loss: 0.138859 2025-01-16 02:11:05,024 - INFO - step 20122, loss: 0.201441, best loss: 0.138859 
2025-01-16 02:11:05,174 - INFO - step 20123, loss: 0.372708, best loss: 0.138859 2025-01-16 02:11:05,324 - INFO - step 20124, loss: 0.183625, best loss: 0.138859 2025-01-16 02:11:05,474 - INFO - step 20125, loss: 0.238434, best loss: 0.138859 2025-01-16 02:11:05,624 - INFO - step 20126, loss: 0.238002, best loss: 0.138859 2025-01-16 02:11:05,774 - INFO - step 20127, loss: 0.212167, best loss: 0.138859 2025-01-16 02:11:05,924 - INFO - step 20128, loss: 0.175502, best loss: 0.138859 2025-01-16 02:11:06,074 - INFO - step 20129, loss: 0.181086, best loss: 0.138859 2025-01-16 02:11:06,224 - INFO - step 20130, loss: 0.190643, best loss: 0.138859 2025-01-16 02:11:06,374 - INFO - step 20131, loss: 0.265206, best loss: 0.138859 2025-01-16 02:11:06,525 - INFO - step 20132, loss: 0.193391, best loss: 0.138859 2025-01-16 02:11:06,675 - INFO - step 20133, loss: 0.169834, best loss: 0.138859 2025-01-16 02:11:06,825 - INFO - step 20134, loss: 0.212746, best loss: 0.138859 2025-01-16 02:11:06,975 - INFO - step 20135, loss: 0.219545, best loss: 0.138859 2025-01-16 02:11:07,125 - INFO - step 20136, loss: 0.234825, best loss: 0.138859 2025-01-16 02:11:07,275 - INFO - step 20137, loss: 0.222634, best loss: 0.138859 2025-01-16 02:11:07,425 - INFO - step 20138, loss: 0.183319, best loss: 0.138859 2025-01-16 02:11:07,575 - INFO - step 20139, loss: 0.174582, best loss: 0.138859 2025-01-16 02:11:07,725 - INFO - step 20140, loss: 0.225214, best loss: 0.138859 2025-01-16 02:11:07,875 - INFO - step 20141, loss: 0.193534, best loss: 0.138859 2025-01-16 02:11:08,025 - INFO - step 20142, loss: 0.147144, best loss: 0.138859 2025-01-16 02:11:08,175 - INFO - step 20143, loss: 0.201232, best loss: 0.138859 2025-01-16 02:11:08,325 - INFO - step 20144, loss: 0.250832, best loss: 0.138859 2025-01-16 02:11:08,475 - INFO - step 20145, loss: 0.212416, best loss: 0.138859 2025-01-16 02:11:08,625 - INFO - step 20146, loss: 0.201153, best loss: 0.138859 2025-01-16 02:11:08,775 - INFO - step 20147, loss: 
0.175423, best loss: 0.138859 2025-01-16 02:11:08,925 - INFO - step 20148, loss: 0.188824, best loss: 0.138859 2025-01-16 02:11:09,075 - INFO - step 20149, loss: 0.193207, best loss: 0.138859 2025-01-16 02:11:09,225 - INFO - step 20150, loss: 0.166303, best loss: 0.138859 2025-01-16 02:11:09,375 - INFO - step 20151, loss: 0.164867, best loss: 0.138859 2025-01-16 02:11:09,525 - INFO - step 20152, loss: 0.193141, best loss: 0.138859 2025-01-16 02:11:09,675 - INFO - step 20153, loss: 0.200603, best loss: 0.138859 2025-01-16 02:11:09,825 - INFO - step 20154, loss: 0.216452, best loss: 0.138859 2025-01-16 02:11:09,975 - INFO - step 20155, loss: 0.200283, best loss: 0.138859 2025-01-16 02:11:10,125 - INFO - step 20156, loss: 0.202685, best loss: 0.138859 2025-01-16 02:11:10,275 - INFO - step 20157, loss: 0.189219, best loss: 0.138859 2025-01-16 02:11:10,425 - INFO - step 20158, loss: 0.182277, best loss: 0.138859 2025-01-16 02:11:10,576 - INFO - step 20159, loss: 0.195431, best loss: 0.138859 2025-01-16 02:11:10,726 - INFO - step 20160, loss: 0.180726, best loss: 0.138859 2025-01-16 02:11:10,876 - INFO - step 20161, loss: 0.232353, best loss: 0.138859 2025-01-16 02:11:11,026 - INFO - step 20162, loss: 0.213477, best loss: 0.138859 2025-01-16 02:11:11,176 - INFO - step 20163, loss: 0.187696, best loss: 0.138859 2025-01-16 02:11:11,326 - INFO - step 20164, loss: 0.205501, best loss: 0.138859 2025-01-16 02:11:11,476 - INFO - step 20165, loss: 0.174732, best loss: 0.138859 2025-01-16 02:11:11,627 - INFO - step 20166, loss: 0.185187, best loss: 0.138859 2025-01-16 02:11:11,777 - INFO - step 20167, loss: 0.223864, best loss: 0.138859 2025-01-16 02:11:11,927 - INFO - step 20168, loss: 0.183710, best loss: 0.138859 2025-01-16 02:11:12,077 - INFO - step 20169, loss: 0.214399, best loss: 0.138859 2025-01-16 02:11:12,227 - INFO - step 20170, loss: 0.207955, best loss: 0.138859 2025-01-16 02:11:12,377 - INFO - step 20171, loss: 0.182425, best loss: 0.138859 2025-01-16 02:11:12,527 - 
INFO - step 20172, loss: 0.186289, best loss: 0.138859 2025-01-16 02:11:12,677 - INFO - step 20173, loss: 0.180370, best loss: 0.138859 2025-01-16 02:11:12,828 - INFO - step 20174, loss: 0.184766, best loss: 0.138859 2025-01-16 02:11:12,978 - INFO - step 20175, loss: 0.233055, best loss: 0.138859 2025-01-16 02:11:13,128 - INFO - step 20176, loss: 0.190401, best loss: 0.138859 2025-01-16 02:11:13,278 - INFO - step 20177, loss: 0.229928, best loss: 0.138859 2025-01-16 02:11:13,428 - INFO - step 20178, loss: 0.149517, best loss: 0.138859 2025-01-16 02:11:13,578 - INFO - step 20179, loss: 0.228198, best loss: 0.138859 2025-01-16 02:11:13,728 - INFO - step 20180, loss: 0.231892, best loss: 0.138859 2025-01-16 02:11:13,879 - INFO - step 20181, loss: 0.185398, best loss: 0.138859 2025-01-16 02:11:14,029 - INFO - step 20182, loss: 0.185412, best loss: 0.138859 2025-01-16 02:11:14,179 - INFO - step 20183, loss: 0.198151, best loss: 0.138859 2025-01-16 02:11:14,329 - INFO - step 20184, loss: 0.180400, best loss: 0.138859 2025-01-16 02:11:14,479 - INFO - step 20185, loss: 0.164142, best loss: 0.138859 2025-01-16 02:11:14,630 - INFO - step 20186, loss: 0.221171, best loss: 0.138859 2025-01-16 02:11:14,780 - INFO - step 20187, loss: 0.213307, best loss: 0.138859 2025-01-16 02:11:14,930 - INFO - step 20188, loss: 0.213069, best loss: 0.138859 2025-01-16 02:11:15,080 - INFO - step 20189, loss: 0.188102, best loss: 0.138859 2025-01-16 02:11:15,230 - INFO - step 20190, loss: 0.202851, best loss: 0.138859 2025-01-16 02:11:15,381 - INFO - step 20191, loss: 0.224921, best loss: 0.138859 2025-01-16 02:11:15,531 - INFO - step 20192, loss: 0.273156, best loss: 0.138859 2025-01-16 02:11:15,681 - INFO - step 20193, loss: 0.206817, best loss: 0.138859 2025-01-16 02:11:15,831 - INFO - step 20194, loss: 0.246072, best loss: 0.138859 2025-01-16 02:11:15,981 - INFO - step 20195, loss: 0.172282, best loss: 0.138859 2025-01-16 02:11:16,131 - INFO - step 20196, loss: 0.190314, best loss: 0.138859 
2025-01-16 02:11:16,281 - INFO - step 20197, loss: 0.272017, best loss: 0.138859 2025-01-16 02:11:16,431 - INFO - step 20198, loss: 0.199525, best loss: 0.138859 2025-01-16 02:11:16,581 - INFO - step 20199, loss: 0.218602, best loss: 0.138859 2025-01-16 02:11:16,732 - INFO - step 20200, loss: 0.233891, best loss: 0.138859 2025-01-16 02:11:16,882 - INFO - step 20201, loss: 0.193320, best loss: 0.138859 2025-01-16 02:11:17,032 - INFO - step 20202, loss: 0.200364, best loss: 0.138859 2025-01-16 02:11:17,182 - INFO - step 20203, loss: 0.201224, best loss: 0.138859 2025-01-16 02:11:17,332 - INFO - step 20204, loss: 0.156834, best loss: 0.138859 2025-01-16 02:11:17,482 - INFO - step 20205, loss: 0.192917, best loss: 0.138859 2025-01-16 02:11:17,632 - INFO - step 20206, loss: 0.186440, best loss: 0.138859 2025-01-16 02:11:17,783 - INFO - step 20207, loss: 0.224149, best loss: 0.138859 2025-01-16 02:11:17,933 - INFO - step 20208, loss: 0.255696, best loss: 0.138859 2025-01-16 02:11:18,083 - INFO - step 20209, loss: 0.285189, best loss: 0.138859 2025-01-16 02:11:18,233 - INFO - step 20210, loss: 0.200428, best loss: 0.138859 2025-01-16 02:11:18,383 - INFO - step 20211, loss: 0.191586, best loss: 0.138859 2025-01-16 02:11:18,533 - INFO - step 20212, loss: 0.183200, best loss: 0.138859 2025-01-16 02:11:18,684 - INFO - step 20213, loss: 0.192601, best loss: 0.138859 2025-01-16 02:11:18,834 - INFO - step 20214, loss: 0.200699, best loss: 0.138859 2025-01-16 02:11:18,984 - INFO - step 20215, loss: 0.227378, best loss: 0.138859 2025-01-16 02:11:19,134 - INFO - step 20216, loss: 0.180378, best loss: 0.138859 2025-01-16 02:11:19,284 - INFO - step 20217, loss: 0.195321, best loss: 0.138859 2025-01-16 02:11:19,435 - INFO - step 20218, loss: 0.171016, best loss: 0.138859 2025-01-16 02:11:19,585 - INFO - step 20219, loss: 0.200766, best loss: 0.138859 2025-01-16 02:11:19,735 - INFO - step 20220, loss: 0.187791, best loss: 0.138859 2025-01-16 02:11:19,885 - INFO - step 20221, loss: 
0.236499, best loss: 0.138859 2025-01-16 02:11:20,036 - INFO - step 20222, loss: 0.185743, best loss: 0.138859 2025-01-16 02:11:20,186 - INFO - step 20223, loss: 0.211369, best loss: 0.138859 2025-01-16 02:11:20,336 - INFO - step 20224, loss: 0.234645, best loss: 0.138859 2025-01-16 02:11:20,486 - INFO - step 20225, loss: 0.237001, best loss: 0.138859 2025-01-16 02:11:20,636 - INFO - step 20226, loss: 0.183373, best loss: 0.138859 2025-01-16 02:11:20,786 - INFO - step 20227, loss: 0.183182, best loss: 0.138859 2025-01-16 02:11:20,936 - INFO - step 20228, loss: 0.160077, best loss: 0.138859 2025-01-16 02:11:21,086 - INFO - step 20229, loss: 0.224238, best loss: 0.138859 2025-01-16 02:11:21,237 - INFO - step 20230, loss: 0.205330, best loss: 0.138859 2025-01-16 02:11:21,387 - INFO - step 20231, loss: 0.168802, best loss: 0.138859 2025-01-16 02:11:21,537 - INFO - step 20232, loss: 0.216326, best loss: 0.138859 2025-01-16 02:11:21,687 - INFO - step 20233, loss: 0.233022, best loss: 0.138859 2025-01-16 02:11:21,837 - INFO - step 20234, loss: 0.223506, best loss: 0.138859 2025-01-16 02:11:21,987 - INFO - step 20235, loss: 0.200563, best loss: 0.138859 2025-01-16 02:11:22,137 - INFO - step 20236, loss: 0.233848, best loss: 0.138859 2025-01-16 02:11:22,287 - INFO - step 20237, loss: 0.164034, best loss: 0.138859 2025-01-16 02:11:22,437 - INFO - step 20238, loss: 0.196405, best loss: 0.138859 2025-01-16 02:11:22,588 - INFO - step 20239, loss: 0.220856, best loss: 0.138859 2025-01-16 02:11:22,738 - INFO - step 20240, loss: 0.163194, best loss: 0.138859 2025-01-16 02:11:22,888 - INFO - step 20241, loss: 0.228954, best loss: 0.138859 2025-01-16 02:11:23,038 - INFO - step 20242, loss: 0.202899, best loss: 0.138859 2025-01-16 02:11:23,188 - INFO - step 20243, loss: 0.194778, best loss: 0.138859 2025-01-16 02:11:23,338 - INFO - step 20244, loss: 0.195285, best loss: 0.138859 2025-01-16 02:11:23,489 - INFO - step 20245, loss: 0.251206, best loss: 0.138859 2025-01-16 02:11:23,639 - 
INFO - step 20246, loss: 0.242336, best loss: 0.138859 2025-01-16 02:11:23,789 - INFO - step 20247, loss: 0.213510, best loss: 0.138859 2025-01-16 02:11:23,939 - INFO - step 20248, loss: 0.189613, best loss: 0.138859 2025-01-16 02:11:24,089 - INFO - step 20249, loss: 0.207424, best loss: 0.138859 2025-01-16 02:11:24,240 - INFO - step 20250, loss: 0.189429, best loss: 0.138859 2025-01-16 02:11:24,390 - INFO - step 20251, loss: 0.186418, best loss: 0.138859 2025-01-16 02:11:24,540 - INFO - step 20252, loss: 0.174555, best loss: 0.138859 2025-01-16 02:11:24,690 - INFO - step 20253, loss: 0.198906, best loss: 0.138859 2025-01-16 02:11:24,840 - INFO - step 20254, loss: 0.192264, best loss: 0.138859 2025-01-16 02:11:24,990 - INFO - step 20255, loss: 0.165110, best loss: 0.138859 2025-01-16 02:11:25,140 - INFO - step 20256, loss: 0.183288, best loss: 0.138859 2025-01-16 02:11:25,290 - INFO - step 20257, loss: 0.227059, best loss: 0.138859 2025-01-16 02:11:25,440 - INFO - step 20258, loss: 0.161168, best loss: 0.138859 2025-01-16 02:11:25,591 - INFO - step 20259, loss: 0.223031, best loss: 0.138859 2025-01-16 02:11:25,741 - INFO - step 20260, loss: 0.211156, best loss: 0.138859 2025-01-16 02:11:25,891 - INFO - step 20261, loss: 0.193938, best loss: 0.138859 2025-01-16 02:11:26,041 - INFO - step 20262, loss: 0.239898, best loss: 0.138859 2025-01-16 02:11:26,191 - INFO - step 20263, loss: 0.192659, best loss: 0.138859 2025-01-16 02:11:26,341 - INFO - step 20264, loss: 0.203867, best loss: 0.138859 2025-01-16 02:11:26,491 - INFO - step 20265, loss: 0.165952, best loss: 0.138859 2025-01-16 02:11:26,641 - INFO - step 20266, loss: 0.186415, best loss: 0.138859 2025-01-16 02:11:26,791 - INFO - step 20267, loss: 0.200143, best loss: 0.138859 2025-01-16 02:11:26,941 - INFO - step 20268, loss: 0.179723, best loss: 0.138859 2025-01-16 02:11:27,092 - INFO - step 20269, loss: 0.190868, best loss: 0.138859 2025-01-16 02:11:27,242 - INFO - step 20270, loss: 0.220058, best loss: 0.138859 
2025-01-16 02:11:27,392 - INFO - step 20271, loss: 0.205365, best loss: 0.138859 2025-01-16 02:11:27,542 - INFO - step 20272, loss: 0.246233, best loss: 0.138859 2025-01-16 02:11:27,692 - INFO - step 20273, loss: 0.174221, best loss: 0.138859 2025-01-16 02:11:27,842 - INFO - step 20274, loss: 0.152222, best loss: 0.138859 2025-01-16 02:11:27,992 - INFO - step 20275, loss: 0.246603, best loss: 0.138859 2025-01-16 02:11:28,142 - INFO - step 20276, loss: 0.235266, best loss: 0.138859 2025-01-16 02:11:28,293 - INFO - step 20277, loss: 0.237986, best loss: 0.138859 2025-01-16 02:11:28,443 - INFO - step 20278, loss: 0.171247, best loss: 0.138859 2025-01-16 02:11:28,593 - INFO - step 20279, loss: 0.161513, best loss: 0.138859 2025-01-16 02:11:28,743 - INFO - step 20280, loss: 0.194364, best loss: 0.138859 2025-01-16 02:11:28,893 - INFO - step 20281, loss: 0.202903, best loss: 0.138859 2025-01-16 02:11:29,043 - INFO - step 20282, loss: 0.184004, best loss: 0.138859 2025-01-16 02:11:29,193 - INFO - step 20283, loss: 0.208695, best loss: 0.138859 2025-01-16 02:11:29,343 - INFO - step 20284, loss: 0.185607, best loss: 0.138859 2025-01-16 02:11:29,494 - INFO - step 20285, loss: 0.208097, best loss: 0.138859 2025-01-16 02:11:29,644 - INFO - step 20286, loss: 0.211530, best loss: 0.138859 2025-01-16 02:11:29,794 - INFO - step 20287, loss: 0.178002, best loss: 0.138859 2025-01-16 02:11:29,944 - INFO - step 20288, loss: 0.220484, best loss: 0.138859 2025-01-16 02:11:30,094 - INFO - step 20289, loss: 0.205903, best loss: 0.138859 2025-01-16 02:11:30,244 - INFO - step 20290, loss: 0.216681, best loss: 0.138859 2025-01-16 02:11:30,394 - INFO - step 20291, loss: 0.182023, best loss: 0.138859 2025-01-16 02:11:30,544 - INFO - step 20292, loss: 0.188636, best loss: 0.138859 2025-01-16 02:11:30,695 - INFO - step 20293, loss: 0.214351, best loss: 0.138859 2025-01-16 02:11:30,845 - INFO - step 20294, loss: 0.193936, best loss: 0.138859 2025-01-16 02:11:30,996 - INFO - step 20295, loss: 
0.190274, best loss: 0.138859 2025-01-16 02:11:31,146 - INFO - step 20296, loss: 0.211912, best loss: 0.138859 2025-01-16 02:11:31,296 - INFO - step 20297, loss: 0.171711, best loss: 0.138859 2025-01-16 02:11:31,446 - INFO - step 20298, loss: 0.180704, best loss: 0.138859 2025-01-16 02:11:31,596 - INFO - step 20299, loss: 0.249298, best loss: 0.138859 2025-01-16 02:11:31,746 - INFO - step 20300, loss: 0.168989, best loss: 0.138859 2025-01-16 02:11:31,896 - INFO - step 20301, loss: 0.195860, best loss: 0.138859 2025-01-16 02:11:32,046 - INFO - step 20302, loss: 0.196919, best loss: 0.138859 2025-01-16 02:11:32,196 - INFO - step 20303, loss: 0.218454, best loss: 0.138859 2025-01-16 02:11:32,346 - INFO - step 20304, loss: 0.192475, best loss: 0.138859 2025-01-16 02:11:32,496 - INFO - step 20305, loss: 0.198657, best loss: 0.138859 2025-01-16 02:11:32,647 - INFO - step 20306, loss: 0.218624, best loss: 0.138859 2025-01-16 02:11:32,797 - INFO - step 20307, loss: 0.220840, best loss: 0.138859 2025-01-16 02:11:32,947 - INFO - step 20308, loss: 0.213888, best loss: 0.138859 2025-01-16 02:11:33,097 - INFO - step 20309, loss: 0.166542, best loss: 0.138859 2025-01-16 02:11:33,247 - INFO - step 20310, loss: 0.236301, best loss: 0.138859 2025-01-16 02:11:36,833 - INFO - step 20311, loss: 0.130650, best loss: 0.130650 2025-01-16 02:11:36,984 - INFO - step 20312, loss: 0.169527, best loss: 0.130650 2025-01-16 02:11:37,134 - INFO - step 20313, loss: 0.221070, best loss: 0.130650 2025-01-16 02:11:37,284 - INFO - step 20314, loss: 0.177191, best loss: 0.130650 2025-01-16 02:11:37,434 - INFO - step 20315, loss: 0.200290, best loss: 0.130650 2025-01-16 02:11:37,584 - INFO - step 20316, loss: 0.228752, best loss: 0.130650 2025-01-16 02:11:37,734 - INFO - step 20317, loss: 0.192945, best loss: 0.130650 2025-01-16 02:11:37,885 - INFO - step 20318, loss: 0.161830, best loss: 0.130650 2025-01-16 02:11:38,035 - INFO - step 20319, loss: 0.220836, best loss: 0.130650 2025-01-16 02:11:38,185 - 
INFO - step 20320, loss: 0.199434, best loss: 0.130650 2025-01-16 02:11:38,336 - INFO - step 20321, loss: 0.202344, best loss: 0.130650 2025-01-16 02:11:38,486 - INFO - step 20322, loss: 0.164988, best loss: 0.130650 2025-01-16 02:11:38,636 - INFO - step 20323, loss: 0.187640, best loss: 0.130650 2025-01-16 02:11:38,786 - INFO - step 20324, loss: 0.232526, best loss: 0.130650 2025-01-16 02:11:38,936 - INFO - step 20325, loss: 0.197499, best loss: 0.130650 2025-01-16 02:11:39,086 - INFO - step 20326, loss: 0.250600, best loss: 0.130650 2025-01-16 02:11:39,237 - INFO - step 20327, loss: 0.164802, best loss: 0.130650 2025-01-16 02:11:39,387 - INFO - step 20328, loss: 0.198715, best loss: 0.130650 2025-01-16 02:11:39,537 - INFO - step 20329, loss: 0.152355, best loss: 0.130650 2025-01-16 02:11:39,687 - INFO - step 20330, loss: 0.157219, best loss: 0.130650 2025-01-16 02:11:39,837 - INFO - step 20331, loss: 0.169260, best loss: 0.130650 2025-01-16 02:11:39,988 - INFO - step 20332, loss: 0.190660, best loss: 0.130650 2025-01-16 02:11:40,138 - INFO - step 20333, loss: 0.193481, best loss: 0.130650 2025-01-16 02:11:40,288 - INFO - step 20334, loss: 0.185471, best loss: 0.130650 2025-01-16 02:11:40,438 - INFO - step 20335, loss: 0.215937, best loss: 0.130650 2025-01-16 02:11:40,588 - INFO - step 20336, loss: 0.221108, best loss: 0.130650 2025-01-16 02:11:40,739 - INFO - step 20337, loss: 0.231674, best loss: 0.130650 2025-01-16 02:11:40,889 - INFO - step 20338, loss: 0.234409, best loss: 0.130650 2025-01-16 02:11:41,039 - INFO - step 20339, loss: 0.232889, best loss: 0.130650 2025-01-16 02:11:41,189 - INFO - step 20340, loss: 0.167414, best loss: 0.130650 2025-01-16 02:11:41,339 - INFO - step 20341, loss: 0.200281, best loss: 0.130650 2025-01-16 02:11:41,490 - INFO - step 20342, loss: 0.160813, best loss: 0.130650 2025-01-16 02:11:41,640 - INFO - step 20343, loss: 0.198771, best loss: 0.130650 2025-01-16 02:11:41,790 - INFO - step 20344, loss: 0.160437, best loss: 0.130650 
2025-01-16 02:11:41,940 - INFO - step 20345, loss: 0.193441, best loss: 0.130650 2025-01-16 02:11:42,090 - INFO - step 20346, loss: 0.174019, best loss: 0.130650 2025-01-16 02:11:42,240 - INFO - step 20347, loss: 0.201374, best loss: 0.130650 2025-01-16 02:11:42,390 - INFO - step 20348, loss: 0.177620, best loss: 0.130650 2025-01-16 02:11:42,540 - INFO - step 20349, loss: 0.165330, best loss: 0.130650 2025-01-16 02:11:42,690 - INFO - step 20350, loss: 0.211927, best loss: 0.130650 2025-01-16 02:11:42,841 - INFO - step 20351, loss: 0.227628, best loss: 0.130650 2025-01-16 02:11:42,991 - INFO - step 20352, loss: 0.161032, best loss: 0.130650 2025-01-16 02:11:43,141 - INFO - step 20353, loss: 0.268208, best loss: 0.130650 2025-01-16 02:11:43,291 - INFO - step 20354, loss: 0.250965, best loss: 0.130650 2025-01-16 02:11:43,441 - INFO - step 20355, loss: 0.210144, best loss: 0.130650 2025-01-16 02:11:43,591 - INFO - step 20356, loss: 0.236896, best loss: 0.130650 2025-01-16 02:11:43,741 - INFO - step 20357, loss: 0.220245, best loss: 0.130650 2025-01-16 02:11:43,892 - INFO - step 20358, loss: 0.163759, best loss: 0.130650 2025-01-16 02:11:44,042 - INFO - step 20359, loss: 0.205312, best loss: 0.130650 2025-01-16 02:11:44,192 - INFO - step 20360, loss: 0.211037, best loss: 0.130650 2025-01-16 02:11:44,342 - INFO - step 20361, loss: 0.161773, best loss: 0.130650 2025-01-16 02:11:44,492 - INFO - step 20362, loss: 0.185862, best loss: 0.130650 2025-01-16 02:11:44,643 - INFO - step 20363, loss: 0.212565, best loss: 0.130650 2025-01-16 02:11:44,793 - INFO - step 20364, loss: 0.233868, best loss: 0.130650 2025-01-16 02:11:44,943 - INFO - step 20365, loss: 0.234504, best loss: 0.130650 2025-01-16 02:11:45,093 - INFO - step 20366, loss: 0.201124, best loss: 0.130650 2025-01-16 02:11:45,243 - INFO - step 20367, loss: 0.180111, best loss: 0.130650 2025-01-16 02:11:45,393 - INFO - step 20368, loss: 0.205679, best loss: 0.130650 2025-01-16 02:11:45,543 - INFO - step 20369, loss: 
0.222967, best loss: 0.130650 2025-01-16 02:11:45,693 - INFO - step 20370, loss: 0.215817, best loss: 0.130650 2025-01-16 02:11:45,843 - INFO - step 20371, loss: 0.199790, best loss: 0.130650 2025-01-16 02:11:45,993 - INFO - step 20372, loss: 0.206953, best loss: 0.130650 2025-01-16 02:11:46,143 - INFO - step 20373, loss: 0.200055, best loss: 0.130650 2025-01-16 02:11:46,293 - INFO - step 20374, loss: 0.214615, best loss: 0.130650 2025-01-16 02:11:46,443 - INFO - step 20375, loss: 0.208125, best loss: 0.130650 2025-01-16 02:11:46,594 - INFO - step 20376, loss: 0.172627, best loss: 0.130650 2025-01-16 02:11:46,744 - INFO - step 20377, loss: 0.230960, best loss: 0.130650 2025-01-16 02:11:46,894 - INFO - step 20378, loss: 0.221990, best loss: 0.130650 2025-01-16 02:11:47,044 - INFO - step 20379, loss: 0.188545, best loss: 0.130650 2025-01-16 02:11:47,194 - INFO - step 20380, loss: 0.262864, best loss: 0.130650 2025-01-16 02:11:47,344 - INFO - step 20381, loss: 0.218474, best loss: 0.130650 2025-01-16 02:11:47,494 - INFO - step 20382, loss: 0.177132, best loss: 0.130650 2025-01-16 02:11:47,644 - INFO - step 20383, loss: 0.190178, best loss: 0.130650 2025-01-16 02:11:47,794 - INFO - step 20384, loss: 0.195114, best loss: 0.130650 2025-01-16 02:11:47,944 - INFO - step 20385, loss: 0.147516, best loss: 0.130650 2025-01-16 02:11:48,094 - INFO - step 20386, loss: 0.185279, best loss: 0.130650 2025-01-16 02:11:48,244 - INFO - step 20387, loss: 0.229711, best loss: 0.130650 2025-01-16 02:11:48,394 - INFO - step 20388, loss: 0.174249, best loss: 0.130650 2025-01-16 02:11:48,544 - INFO - step 20389, loss: 0.208080, best loss: 0.130650 2025-01-16 02:11:48,694 - INFO - step 20390, loss: 0.183960, best loss: 0.130650 2025-01-16 02:11:48,844 - INFO - step 20391, loss: 0.241077, best loss: 0.130650 2025-01-16 02:11:48,994 - INFO - step 20392, loss: 0.174016, best loss: 0.130650 2025-01-16 02:11:49,145 - INFO - step 20393, loss: 0.155357, best loss: 0.130650 2025-01-16 02:11:49,295 - 
INFO - step 20394, loss: 0.188351, best loss: 0.130650 2025-01-16 02:11:49,445 - INFO - step 20395, loss: 0.172619, best loss: 0.130650 2025-01-16 02:11:49,596 - INFO - step 20396, loss: 0.196114, best loss: 0.130650 2025-01-16 02:11:49,746 - INFO - step 20397, loss: 0.171233, best loss: 0.130650 2025-01-16 02:11:49,896 - INFO - step 20398, loss: 0.176424, best loss: 0.130650 2025-01-16 02:11:50,046 - INFO - step 20399, loss: 0.211416, best loss: 0.130650 2025-01-16 02:11:50,196 - INFO - step 20400, loss: 0.221430, best loss: 0.130650 2025-01-16 02:11:50,346 - INFO - step 20401, loss: 0.159188, best loss: 0.130650 2025-01-16 02:11:50,496 - INFO - step 20402, loss: 0.288143, best loss: 0.130650 2025-01-16 02:11:50,646 - INFO - step 20403, loss: 0.190469, best loss: 0.130650 2025-01-16 02:11:50,796 - INFO - step 20404, loss: 0.147549, best loss: 0.130650 2025-01-16 02:11:50,947 - INFO - step 20405, loss: 0.196263, best loss: 0.130650 2025-01-16 02:11:51,097 - INFO - step 20406, loss: 0.133113, best loss: 0.130650 2025-01-16 02:11:51,247 - INFO - step 20407, loss: 0.145544, best loss: 0.130650 2025-01-16 02:11:51,397 - INFO - step 20408, loss: 0.148524, best loss: 0.130650 2025-01-16 02:11:51,547 - INFO - step 20409, loss: 0.181224, best loss: 0.130650 2025-01-16 02:11:51,697 - INFO - step 20410, loss: 0.158971, best loss: 0.130650 2025-01-16 02:11:51,847 - INFO - step 20411, loss: 0.170954, best loss: 0.130650 2025-01-16 02:11:51,997 - INFO - step 20412, loss: 0.192119, best loss: 0.130650 2025-01-16 02:11:52,147 - INFO - step 20413, loss: 0.160518, best loss: 0.130650 2025-01-16 02:11:52,297 - INFO - step 20414, loss: 0.222833, best loss: 0.130650 2025-01-16 02:11:52,447 - INFO - step 20415, loss: 0.183936, best loss: 0.130650 2025-01-16 02:11:52,597 - INFO - step 20416, loss: 0.187609, best loss: 0.130650 2025-01-16 02:11:52,747 - INFO - step 20417, loss: 0.199434, best loss: 0.130650 2025-01-16 02:11:52,897 - INFO - step 20418, loss: 0.164795, best loss: 0.130650 
2025-01-16 02:11:53,048 - INFO - step 20419, loss: 0.166943, best loss: 0.130650 2025-01-16 02:11:53,197 - INFO - step 20420, loss: 0.162624, best loss: 0.130650 2025-01-16 02:11:53,348 - INFO - step 20421, loss: 0.162948, best loss: 0.130650 2025-01-16 02:11:53,498 - INFO - step 20422, loss: 0.188414, best loss: 0.130650 2025-01-16 02:11:53,648 - INFO - step 20423, loss: 0.204724, best loss: 0.130650 2025-01-16 02:11:53,798 - INFO - step 20424, loss: 0.186818, best loss: 0.130650 2025-01-16 02:11:53,948 - INFO - step 20425, loss: 0.188882, best loss: 0.130650 2025-01-16 02:11:54,098 - INFO - step 20426, loss: 0.174883, best loss: 0.130650 2025-01-16 02:11:54,248 - INFO - step 20427, loss: 0.168061, best loss: 0.130650 2025-01-16 02:11:54,398 - INFO - step 20428, loss: 0.154117, best loss: 0.130650 2025-01-16 02:11:54,548 - INFO - step 20429, loss: 0.218216, best loss: 0.130650 2025-01-16 02:11:54,698 - INFO - step 20430, loss: 0.182622, best loss: 0.130650 2025-01-16 02:11:54,848 - INFO - step 20431, loss: 0.172054, best loss: 0.130650 2025-01-16 02:11:54,998 - INFO - step 20432, loss: 0.178729, best loss: 0.130650 2025-01-16 02:11:55,148 - INFO - step 20433, loss: 0.211498, best loss: 0.130650 2025-01-16 02:11:55,298 - INFO - step 20434, loss: 0.237073, best loss: 0.130650 2025-01-16 02:11:55,449 - INFO - step 20435, loss: 0.185943, best loss: 0.130650 2025-01-16 02:11:55,599 - INFO - step 20436, loss: 0.140649, best loss: 0.130650 2025-01-16 02:11:55,749 - INFO - step 20437, loss: 0.242648, best loss: 0.130650 2025-01-16 02:11:55,899 - INFO - step 20438, loss: 0.179290, best loss: 0.130650 2025-01-16 02:11:56,049 - INFO - step 20439, loss: 0.202787, best loss: 0.130650 2025-01-16 02:11:56,199 - INFO - step 20440, loss: 0.143569, best loss: 0.130650 2025-01-16 02:11:56,349 - INFO - step 20441, loss: 0.169483, best loss: 0.130650 2025-01-16 02:11:56,499 - INFO - step 20442, loss: 0.217504, best loss: 0.130650 2025-01-16 02:11:56,649 - INFO - step 20443, loss: 
0.186912, best loss: 0.130650 2025-01-16 02:11:56,800 - INFO - step 20444, loss: 0.231808, best loss: 0.130650 2025-01-16 02:11:56,950 - INFO - step 20445, loss: 0.171904, best loss: 0.130650 2025-01-16 02:11:57,100 - INFO - step 20446, loss: 0.156041, best loss: 0.130650 2025-01-16 02:11:57,250 - INFO - step 20447, loss: 0.148898, best loss: 0.130650 2025-01-16 02:11:57,400 - INFO - step 20448, loss: 0.147851, best loss: 0.130650 2025-01-16 02:11:57,550 - INFO - step 20449, loss: 0.250060, best loss: 0.130650 2025-01-16 02:11:57,701 - INFO - step 20450, loss: 0.182234, best loss: 0.130650 2025-01-16 02:11:57,851 - INFO - step 20451, loss: 0.176631, best loss: 0.130650 2025-01-16 02:11:58,001 - INFO - step 20452, loss: 0.193446, best loss: 0.130650 2025-01-16 02:11:58,151 - INFO - step 20453, loss: 0.180857, best loss: 0.130650 2025-01-16 02:11:58,301 - INFO - step 20454, loss: 0.159736, best loss: 0.130650 2025-01-16 02:11:58,452 - INFO - step 20455, loss: 0.188448, best loss: 0.130650 2025-01-16 02:11:58,602 - INFO - step 20456, loss: 0.220035, best loss: 0.130650 2025-01-16 02:11:58,752 - INFO - step 20457, loss: 0.168495, best loss: 0.130650 2025-01-16 02:11:58,902 - INFO - step 20458, loss: 0.133839, best loss: 0.130650 2025-01-16 02:11:59,052 - INFO - step 20459, loss: 0.157460, best loss: 0.130650 2025-01-16 02:11:59,202 - INFO - step 20460, loss: 0.190212, best loss: 0.130650 2025-01-16 02:11:59,353 - INFO - step 20461, loss: 0.210159, best loss: 0.130650 2025-01-16 02:11:59,503 - INFO - step 20462, loss: 0.180504, best loss: 0.130650 2025-01-16 02:11:59,653 - INFO - step 20463, loss: 0.167979, best loss: 0.130650 2025-01-16 02:11:59,803 - INFO - step 20464, loss: 0.240971, best loss: 0.130650 2025-01-16 02:11:59,953 - INFO - step 20465, loss: 0.200825, best loss: 0.130650 2025-01-16 02:12:00,103 - INFO - step 20466, loss: 0.206381, best loss: 0.130650 2025-01-16 02:12:00,253 - INFO - step 20467, loss: 0.168769, best loss: 0.130650 2025-01-16 02:12:00,404 - 
INFO - step 20468, loss: 0.196682, best loss: 0.130650 2025-01-16 02:12:00,554 - INFO - step 20469, loss: 0.161952, best loss: 0.130650 2025-01-16 02:12:00,704 - INFO - step 20470, loss: 0.168088, best loss: 0.130650 2025-01-16 02:12:00,854 - INFO - step 20471, loss: 0.235448, best loss: 0.130650 2025-01-16 02:12:01,004 - INFO - step 20472, loss: 0.187278, best loss: 0.130650 2025-01-16 02:12:01,154 - INFO - step 20473, loss: 0.158993, best loss: 0.130650 2025-01-16 02:12:01,304 - INFO - step 20474, loss: 0.207812, best loss: 0.130650 2025-01-16 02:12:01,454 - INFO - step 20475, loss: 0.196684, best loss: 0.130650 2025-01-16 02:12:01,604 - INFO - step 20476, loss: 0.177029, best loss: 0.130650 2025-01-16 02:12:01,754 - INFO - step 20477, loss: 0.197066, best loss: 0.130650 2025-01-16 02:12:01,904 - INFO - step 20478, loss: 0.229476, best loss: 0.130650 2025-01-16 02:12:02,054 - INFO - step 20479, loss: 0.187818, best loss: 0.130650 2025-01-16 02:12:02,204 - INFO - step 20480, loss: 0.154023, best loss: 0.130650 2025-01-16 02:12:02,354 - INFO - step 20481, loss: 0.213413, best loss: 0.130650 2025-01-16 02:12:02,504 - INFO - step 20482, loss: 0.155286, best loss: 0.130650 2025-01-16 02:12:02,654 - INFO - step 20483, loss: 0.156904, best loss: 0.130650 2025-01-16 02:12:02,804 - INFO - step 20484, loss: 0.207377, best loss: 0.130650 2025-01-16 02:12:02,954 - INFO - step 20485, loss: 0.218501, best loss: 0.130650 2025-01-16 02:12:03,104 - INFO - step 20486, loss: 0.170097, best loss: 0.130650 2025-01-16 02:12:03,254 - INFO - step 20487, loss: 0.167210, best loss: 0.130650 2025-01-16 02:12:03,404 - INFO - step 20488, loss: 0.207226, best loss: 0.130650 2025-01-16 02:12:03,554 - INFO - step 20489, loss: 0.198991, best loss: 0.130650 2025-01-16 02:12:03,704 - INFO - step 20490, loss: 0.227735, best loss: 0.130650 2025-01-16 02:12:03,854 - INFO - step 20491, loss: 0.186967, best loss: 0.130650 2025-01-16 02:12:04,004 - INFO - step 20492, loss: 0.209159, best loss: 0.130650 
2025-01-16 02:12:04,154 - INFO - step 20493, loss: 0.147078, best loss: 0.130650 2025-01-16 02:12:04,304 - INFO - step 20494, loss: 0.193962, best loss: 0.130650 2025-01-16 02:12:04,454 - INFO - step 20495, loss: 0.145288, best loss: 0.130650 2025-01-16 02:12:04,604 - INFO - step 20496, loss: 0.198168, best loss: 0.130650 2025-01-16 02:12:04,755 - INFO - step 20497, loss: 0.218411, best loss: 0.130650 2025-01-16 02:12:04,905 - INFO - step 20498, loss: 0.193613, best loss: 0.130650 2025-01-16 02:12:05,055 - INFO - step 20499, loss: 0.190442, best loss: 0.130650 2025-01-16 02:12:05,206 - INFO - step 20500, loss: 0.217901, best loss: 0.130650 2025-01-16 02:12:05,356 - INFO - step 20501, loss: 0.178887, best loss: 0.130650 2025-01-16 02:12:05,506 - INFO - step 20502, loss: 0.208952, best loss: 0.130650 2025-01-16 02:12:05,656 - INFO - step 20503, loss: 0.184694, best loss: 0.130650 2025-01-16 02:12:05,806 - INFO - step 20504, loss: 0.213820, best loss: 0.130650 2025-01-16 02:12:05,956 - INFO - step 20505, loss: 0.199309, best loss: 0.130650 2025-01-16 02:12:06,106 - INFO - step 20506, loss: 0.207829, best loss: 0.130650 2025-01-16 02:12:06,256 - INFO - step 20507, loss: 0.249567, best loss: 0.130650 2025-01-16 02:12:06,406 - INFO - step 20508, loss: 0.226127, best loss: 0.130650 2025-01-16 02:12:06,556 - INFO - step 20509, loss: 0.186420, best loss: 0.130650 2025-01-16 02:12:06,707 - INFO - step 20510, loss: 0.180216, best loss: 0.130650 2025-01-16 02:12:06,857 - INFO - step 20511, loss: 0.182322, best loss: 0.130650 2025-01-16 02:12:07,007 - INFO - step 20512, loss: 0.190062, best loss: 0.130650 2025-01-16 02:12:07,157 - INFO - step 20513, loss: 0.172299, best loss: 0.130650 2025-01-16 02:12:07,307 - INFO - step 20514, loss: 0.182479, best loss: 0.130650 2025-01-16 02:12:07,457 - INFO - step 20515, loss: 0.228718, best loss: 0.130650 2025-01-16 02:12:07,607 - INFO - step 20516, loss: 0.234108, best loss: 0.130650 2025-01-16 02:12:07,757 - INFO - step 20517, loss: 
0.200717, best loss: 0.130650 2025-01-16 02:12:07,907 - INFO - step 20518, loss: 0.187096, best loss: 0.130650 2025-01-16 02:12:08,057 - INFO - step 20519, loss: 0.194421, best loss: 0.130650 2025-01-16 02:12:08,208 - INFO - step 20520, loss: 0.172513, best loss: 0.130650 2025-01-16 02:12:08,358 - INFO - step 20521, loss: 0.255487, best loss: 0.130650 2025-01-16 02:12:08,508 - INFO - step 20522, loss: 0.211689, best loss: 0.130650 2025-01-16 02:12:08,658 - INFO - step 20523, loss: 0.159184, best loss: 0.130650 2025-01-16 02:12:08,808 - INFO - step 20524, loss: 0.205185, best loss: 0.130650 2025-01-16 02:12:08,958 - INFO - step 20525, loss: 0.168252, best loss: 0.130650 2025-01-16 02:12:09,108 - INFO - step 20526, loss: 0.215513, best loss: 0.130650 2025-01-16 02:12:09,258 - INFO - step 20527, loss: 0.224763, best loss: 0.130650 2025-01-16 02:12:09,408 - INFO - step 20528, loss: 0.193379, best loss: 0.130650 2025-01-16 02:12:09,559 - INFO - step 20529, loss: 0.224133, best loss: 0.130650 2025-01-16 02:12:09,709 - INFO - step 20530, loss: 0.246220, best loss: 0.130650 2025-01-16 02:12:09,859 - INFO - step 20531, loss: 0.183409, best loss: 0.130650 2025-01-16 02:12:10,009 - INFO - step 20532, loss: 0.204113, best loss: 0.130650 2025-01-16 02:12:10,159 - INFO - step 20533, loss: 0.179799, best loss: 0.130650 2025-01-16 02:12:10,309 - INFO - step 20534, loss: 0.189838, best loss: 0.130650 2025-01-16 02:12:10,459 - INFO - step 20535, loss: 0.175826, best loss: 0.130650 2025-01-16 02:12:10,610 - INFO - step 20536, loss: 0.182148, best loss: 0.130650 2025-01-16 02:12:10,760 - INFO - step 20537, loss: 0.226804, best loss: 0.130650 2025-01-16 02:12:10,910 - INFO - step 20538, loss: 0.185659, best loss: 0.130650 2025-01-16 02:12:11,060 - INFO - step 20539, loss: 0.244099, best loss: 0.130650 2025-01-16 02:12:11,210 - INFO - step 20540, loss: 0.221168, best loss: 0.130650 2025-01-16 02:12:11,360 - INFO - step 20541, loss: 0.181593, best loss: 0.130650 2025-01-16 02:12:11,510 - 
INFO - step 20542, loss: 0.173017, best loss: 0.130650 2025-01-16 02:12:11,660 - INFO - step 20543, loss: 0.186819, best loss: 0.130650 2025-01-16 02:12:11,810 - INFO - step 20544, loss: 0.198456, best loss: 0.130650 2025-01-16 02:12:11,961 - INFO - step 20545, loss: 0.180029, best loss: 0.130650 2025-01-16 02:12:12,111 - INFO - step 20546, loss: 0.174101, best loss: 0.130650 2025-01-16 02:12:12,261 - INFO - step 20547, loss: 0.141859, best loss: 0.130650 2025-01-16 02:12:12,411 - INFO - step 20548, loss: 0.205725, best loss: 0.130650 2025-01-16 02:12:12,561 - INFO - step 20549, loss: 0.211265, best loss: 0.130650 2025-01-16 02:12:12,711 - INFO - step 20550, loss: 0.173504, best loss: 0.130650 2025-01-16 02:12:12,861 - INFO - step 20551, loss: 0.262967, best loss: 0.130650 2025-01-16 02:12:13,011 - INFO - step 20552, loss: 0.186061, best loss: 0.130650 2025-01-16 02:12:13,161 - INFO - step 20553, loss: 0.217381, best loss: 0.130650 2025-01-16 02:12:13,311 - INFO - step 20554, loss: 0.173957, best loss: 0.130650 2025-01-16 02:12:13,461 - INFO - step 20555, loss: 0.230361, best loss: 0.130650 2025-01-16 02:12:13,611 - INFO - step 20556, loss: 0.194143, best loss: 0.130650 2025-01-16 02:12:13,761 - INFO - step 20557, loss: 0.159895, best loss: 0.130650 2025-01-16 02:12:13,911 - INFO - step 20558, loss: 0.178888, best loss: 0.130650 2025-01-16 02:12:14,062 - INFO - step 20559, loss: 0.217124, best loss: 0.130650 2025-01-16 02:12:14,212 - INFO - step 20560, loss: 0.215424, best loss: 0.130650 2025-01-16 02:12:14,362 - INFO - step 20561, loss: 0.187351, best loss: 0.130650 2025-01-16 02:12:14,512 - INFO - step 20562, loss: 0.177823, best loss: 0.130650 2025-01-16 02:12:14,662 - INFO - step 20563, loss: 0.213552, best loss: 0.130650 2025-01-16 02:12:14,812 - INFO - step 20564, loss: 0.215918, best loss: 0.130650 2025-01-16 02:12:14,962 - INFO - step 20565, loss: 0.166870, best loss: 0.130650 2025-01-16 02:12:15,112 - INFO - step 20566, loss: 0.188495, best loss: 0.130650 
2025-01-16 02:12:15,262 - INFO - step 20567, loss: 0.175362, best loss: 0.130650 2025-01-16 02:12:15,413 - INFO - step 20568, loss: 0.136246, best loss: 0.130650 2025-01-16 02:12:15,563 - INFO - step 20569, loss: 0.211690, best loss: 0.130650 2025-01-16 02:12:15,713 - INFO - step 20570, loss: 0.187125, best loss: 0.130650 2025-01-16 02:12:15,863 - INFO - step 20571, loss: 0.180245, best loss: 0.130650 2025-01-16 02:12:16,013 - INFO - step 20572, loss: 0.214969, best loss: 0.130650 2025-01-16 02:12:16,163 - INFO - step 20573, loss: 0.188705, best loss: 0.130650 2025-01-16 02:12:16,313 - INFO - step 20574, loss: 0.169280, best loss: 0.130650 2025-01-16 02:12:16,463 - INFO - step 20575, loss: 0.202987, best loss: 0.130650 2025-01-16 02:12:16,613 - INFO - step 20576, loss: 0.183968, best loss: 0.130650 2025-01-16 02:12:16,764 - INFO - step 20577, loss: 0.181699, best loss: 0.130650 2025-01-16 02:12:16,914 - INFO - step 20578, loss: 0.196002, best loss: 0.130650 2025-01-16 02:12:17,064 - INFO - step 20579, loss: 0.140035, best loss: 0.130650 2025-01-16 02:12:17,214 - INFO - step 20580, loss: 0.181301, best loss: 0.130650 2025-01-16 02:12:17,364 - INFO - step 20581, loss: 0.178327, best loss: 0.130650 2025-01-16 02:12:17,514 - INFO - step 20582, loss: 0.189537, best loss: 0.130650 2025-01-16 02:12:17,665 - INFO - step 20583, loss: 0.240026, best loss: 0.130650 2025-01-16 02:12:17,815 - INFO - step 20584, loss: 0.206138, best loss: 0.130650 2025-01-16 02:12:17,965 - INFO - step 20585, loss: 0.184338, best loss: 0.130650 2025-01-16 02:12:18,115 - INFO - step 20586, loss: 0.201112, best loss: 0.130650 2025-01-16 02:12:18,265 - INFO - step 20587, loss: 0.210937, best loss: 0.130650 2025-01-16 02:12:18,415 - INFO - step 20588, loss: 0.145884, best loss: 0.130650 2025-01-16 02:12:18,565 - INFO - step 20589, loss: 0.203926, best loss: 0.130650 2025-01-16 02:12:18,715 - INFO - step 20590, loss: 0.176246, best loss: 0.130650 2025-01-16 02:12:18,865 - INFO - step 20591, loss: 
0.193149, best loss: 0.130650 2025-01-16 02:12:19,015 - INFO - step 20592, loss: 0.222566, best loss: 0.130650 2025-01-16 02:12:19,165 - INFO - step 20593, loss: 0.209282, best loss: 0.130650 2025-01-16 02:12:19,315 - INFO - step 20594, loss: 0.189242, best loss: 0.130650 2025-01-16 02:12:19,465 - INFO - step 20595, loss: 0.178286, best loss: 0.130650 2025-01-16 02:12:19,615 - INFO - step 20596, loss: 0.203609, best loss: 0.130650 2025-01-16 02:12:19,766 - INFO - step 20597, loss: 0.169153, best loss: 0.130650 2025-01-16 02:12:19,916 - INFO - step 20598, loss: 0.229104, best loss: 0.130650 2025-01-16 02:12:20,066 - INFO - step 20599, loss: 0.181526, best loss: 0.130650 2025-01-16 02:12:20,216 - INFO - step 20600, loss: 0.225179, best loss: 0.130650 2025-01-16 02:12:20,366 - INFO - step 20601, loss: 0.155835, best loss: 0.130650 2025-01-16 02:12:20,516 - INFO - step 20602, loss: 0.206942, best loss: 0.130650 2025-01-16 02:12:20,666 - INFO - step 20603, loss: 0.208064, best loss: 0.130650 2025-01-16 02:12:20,816 - INFO - step 20604, loss: 0.144119, best loss: 0.130650 2025-01-16 02:12:20,966 - INFO - step 20605, loss: 0.210006, best loss: 0.130650 2025-01-16 02:12:21,116 - INFO - step 20606, loss: 0.200488, best loss: 0.130650 2025-01-16 02:12:21,266 - INFO - step 20607, loss: 0.176197, best loss: 0.130650 2025-01-16 02:12:21,417 - INFO - step 20608, loss: 0.184186, best loss: 0.130650 2025-01-16 02:12:21,567 - INFO - step 20609, loss: 0.169737, best loss: 0.130650 2025-01-16 02:12:21,717 - INFO - step 20610, loss: 0.172795, best loss: 0.130650 2025-01-16 02:12:21,867 - INFO - step 20611, loss: 0.188822, best loss: 0.130650 2025-01-16 02:12:22,017 - INFO - step 20612, loss: 0.171734, best loss: 0.130650 2025-01-16 02:12:22,167 - INFO - step 20613, loss: 0.169529, best loss: 0.130650 2025-01-16 02:12:22,317 - INFO - step 20614, loss: 0.207294, best loss: 0.130650 2025-01-16 02:12:22,468 - INFO - step 20615, loss: 0.193103, best loss: 0.130650 2025-01-16 02:12:22,618 - 
INFO - step 20616, loss: 0.212364, best loss: 0.130650 2025-01-16 02:12:22,768 - INFO - step 20617, loss: 0.231013, best loss: 0.130650 2025-01-16 02:12:22,918 - INFO - step 20618, loss: 0.206771, best loss: 0.130650 2025-01-16 02:12:23,068 - INFO - step 20619, loss: 0.215317, best loss: 0.130650 2025-01-16 02:12:23,218 - INFO - step 20620, loss: 0.205471, best loss: 0.130650 2025-01-16 02:12:23,368 - INFO - step 20621, loss: 0.152641, best loss: 0.130650 2025-01-16 02:12:23,518 - INFO - step 20622, loss: 0.186450, best loss: 0.130650 2025-01-16 02:12:23,668 - INFO - step 20623, loss: 0.157174, best loss: 0.130650 2025-01-16 02:12:23,818 - INFO - step 20624, loss: 0.158638, best loss: 0.130650 2025-01-16 02:12:23,968 - INFO - step 20625, loss: 0.202391, best loss: 0.130650 2025-01-16 02:12:24,118 - INFO - step 20626, loss: 0.186854, best loss: 0.130650 2025-01-16 02:12:24,268 - INFO - step 20627, loss: 0.190506, best loss: 0.130650 2025-01-16 02:12:24,418 - INFO - step 20628, loss: 0.186738, best loss: 0.130650 2025-01-16 02:12:24,569 - INFO - step 20629, loss: 0.177430, best loss: 0.130650 2025-01-16 02:12:24,719 - INFO - step 20630, loss: 0.215411, best loss: 0.130650 2025-01-16 02:12:24,869 - INFO - step 20631, loss: 0.140277, best loss: 0.130650 2025-01-16 02:12:25,019 - INFO - step 20632, loss: 0.157254, best loss: 0.130650 2025-01-16 02:12:25,169 - INFO - step 20633, loss: 0.190732, best loss: 0.130650 2025-01-16 02:12:25,319 - INFO - step 20634, loss: 0.203927, best loss: 0.130650 2025-01-16 02:12:25,469 - INFO - step 20635, loss: 0.195566, best loss: 0.130650 2025-01-16 02:12:25,619 - INFO - step 20636, loss: 0.202810, best loss: 0.130650 2025-01-16 02:12:25,769 - INFO - step 20637, loss: 0.245874, best loss: 0.130650 2025-01-16 02:12:25,919 - INFO - step 20638, loss: 0.186559, best loss: 0.130650 2025-01-16 02:12:26,070 - INFO - step 20639, loss: 0.174671, best loss: 0.130650 2025-01-16 02:12:26,220 - INFO - step 20640, loss: 0.162450, best loss: 0.130650 
2025-01-16 02:12:26,370 - INFO - step 20641, loss: 0.150370, best loss: 0.130650 2025-01-16 02:12:26,520 - INFO - step 20642, loss: 0.219310, best loss: 0.130650 2025-01-16 02:12:26,670 - INFO - step 20643, loss: 0.228880, best loss: 0.130650 2025-01-16 02:12:26,820 - INFO - step 20644, loss: 0.142624, best loss: 0.130650 2025-01-16 02:12:26,970 - INFO - step 20645, loss: 0.155249, best loss: 0.130650 2025-01-16 02:12:27,120 - INFO - step 20646, loss: 0.229030, best loss: 0.130650 2025-01-16 02:12:27,270 - INFO - step 20647, loss: 0.202266, best loss: 0.130650 2025-01-16 02:12:27,420 - INFO - step 20648, loss: 0.152558, best loss: 0.130650 2025-01-16 02:12:27,570 - INFO - step 20649, loss: 0.189612, best loss: 0.130650 2025-01-16 02:12:27,720 - INFO - step 20650, loss: 0.209451, best loss: 0.130650 2025-01-16 02:12:27,871 - INFO - step 20651, loss: 0.206965, best loss: 0.130650 2025-01-16 02:12:28,021 - INFO - step 20652, loss: 0.181630, best loss: 0.130650 2025-01-16 02:12:28,171 - INFO - step 20653, loss: 0.180409, best loss: 0.130650 2025-01-16 02:12:28,321 - INFO - step 20654, loss: 0.198035, best loss: 0.130650 2025-01-16 02:12:28,471 - INFO - step 20655, loss: 0.162598, best loss: 0.130650 2025-01-16 02:12:28,621 - INFO - step 20656, loss: 0.176772, best loss: 0.130650 2025-01-16 02:12:28,771 - INFO - step 20657, loss: 0.179244, best loss: 0.130650 2025-01-16 02:12:28,921 - INFO - step 20658, loss: 0.166797, best loss: 0.130650 2025-01-16 02:12:29,072 - INFO - step 20659, loss: 0.209876, best loss: 0.130650 2025-01-16 02:12:29,222 - INFO - step 20660, loss: 0.202965, best loss: 0.130650 2025-01-16 02:12:29,372 - INFO - step 20661, loss: 0.190999, best loss: 0.130650 2025-01-16 02:12:29,522 - INFO - step 20662, loss: 0.201672, best loss: 0.130650 2025-01-16 02:12:29,672 - INFO - step 20663, loss: 0.209571, best loss: 0.130650 2025-01-16 02:12:29,822 - INFO - step 20664, loss: 0.158688, best loss: 0.130650 2025-01-16 02:12:29,972 - INFO - step 20665, loss: 
0.154311, best loss: 0.130650 2025-01-16 02:12:30,122 - INFO - step 20666, loss: 0.191162, best loss: 0.130650 2025-01-16 02:12:30,272 - INFO - step 20667, loss: 0.190534, best loss: 0.130650 2025-01-16 02:12:30,422 - INFO - step 20668, loss: 0.173622, best loss: 0.130650 2025-01-16 02:12:30,572 - INFO - step 20669, loss: 0.209541, best loss: 0.130650 2025-01-16 02:12:30,722 - INFO - step 20670, loss: 0.199422, best loss: 0.130650 2025-01-16 02:12:30,872 - INFO - step 20671, loss: 0.212491, best loss: 0.130650 2025-01-16 02:12:31,022 - INFO - step 20672, loss: 0.149464, best loss: 0.130650 2025-01-16 02:12:31,173 - INFO - step 20673, loss: 0.206128, best loss: 0.130650 2025-01-16 02:12:31,323 - INFO - step 20674, loss: 0.203629, best loss: 0.130650 2025-01-16 02:12:31,473 - INFO - step 20675, loss: 0.163815, best loss: 0.130650 2025-01-16 02:12:31,623 - INFO - step 20676, loss: 0.205571, best loss: 0.130650 2025-01-16 02:12:31,773 - INFO - step 20677, loss: 0.217599, best loss: 0.130650 2025-01-16 02:12:31,923 - INFO - step 20678, loss: 0.191785, best loss: 0.130650 2025-01-16 02:12:32,072 - INFO - step 20679, loss: 0.202324, best loss: 0.130650 2025-01-16 02:12:32,223 - INFO - step 20680, loss: 0.193228, best loss: 0.130650 2025-01-16 02:12:32,373 - INFO - step 20681, loss: 0.227507, best loss: 0.130650 2025-01-16 02:12:32,523 - INFO - step 20682, loss: 0.145595, best loss: 0.130650 2025-01-16 02:12:32,673 - INFO - step 20683, loss: 0.202242, best loss: 0.130650 2025-01-16 02:12:32,823 - INFO - step 20684, loss: 0.211622, best loss: 0.130650 2025-01-16 02:12:32,974 - INFO - step 20685, loss: 0.185154, best loss: 0.130650 2025-01-16 02:12:33,124 - INFO - step 20686, loss: 0.178727, best loss: 0.130650 2025-01-16 02:12:33,274 - INFO - step 20687, loss: 0.186744, best loss: 0.130650 2025-01-16 02:12:33,424 - INFO - step 20688, loss: 0.164792, best loss: 0.130650 2025-01-16 02:12:33,574 - INFO - step 20689, loss: 0.169506, best loss: 0.130650 2025-01-16 02:12:33,724 - 
INFO - step 20690, loss: 0.167006, best loss: 0.130650 2025-01-16 02:12:33,875 - INFO - step 20691, loss: 0.216648, best loss: 0.130650 2025-01-16 02:12:34,025 - INFO - step 20692, loss: 0.157939, best loss: 0.130650 2025-01-16 02:12:34,175 - INFO - step 20693, loss: 0.171447, best loss: 0.130650 2025-01-16 02:12:34,325 - INFO - step 20694, loss: 0.204902, best loss: 0.130650 2025-01-16 02:12:34,475 - INFO - step 20695, loss: 0.186719, best loss: 0.130650 2025-01-16 02:12:34,625 - INFO - step 20696, loss: 0.161565, best loss: 0.130650 2025-01-16 02:12:34,775 - INFO - step 20697, loss: 0.140208, best loss: 0.130650 2025-01-16 02:12:34,925 - INFO - step 20698, loss: 0.159916, best loss: 0.130650 2025-01-16 02:12:35,076 - INFO - step 20699, loss: 0.189086, best loss: 0.130650 2025-01-16 02:12:35,226 - INFO - step 20700, loss: 0.185524, best loss: 0.130650 2025-01-16 02:12:35,376 - INFO - step 20701, loss: 0.201188, best loss: 0.130650 2025-01-16 02:12:35,526 - INFO - step 20702, loss: 0.181635, best loss: 0.130650 2025-01-16 02:12:35,676 - INFO - step 20703, loss: 0.187334, best loss: 0.130650 2025-01-16 02:12:35,826 - INFO - step 20704, loss: 0.211329, best loss: 0.130650 2025-01-16 02:12:35,976 - INFO - step 20705, loss: 0.192428, best loss: 0.130650 2025-01-16 02:12:36,126 - INFO - step 20706, loss: 0.163226, best loss: 0.130650 2025-01-16 02:12:36,276 - INFO - step 20707, loss: 0.193848, best loss: 0.130650 2025-01-16 02:12:36,426 - INFO - step 20708, loss: 0.180086, best loss: 0.130650 2025-01-16 02:12:36,576 - INFO - step 20709, loss: 0.180258, best loss: 0.130650 2025-01-16 02:12:36,727 - INFO - step 20710, loss: 0.205781, best loss: 0.130650 2025-01-16 02:12:36,877 - INFO - step 20711, loss: 0.220437, best loss: 0.130650 2025-01-16 02:12:37,026 - INFO - step 20712, loss: 0.190371, best loss: 0.130650 2025-01-16 02:12:37,177 - INFO - step 20713, loss: 0.180392, best loss: 0.130650 2025-01-16 02:12:37,327 - INFO - step 20714, loss: 0.173980, best loss: 0.130650 
2025-01-16 02:12:37,477 - INFO - step 20715, loss: 0.141043, best loss: 0.130650 2025-01-16 02:12:37,627 - INFO - step 20716, loss: 0.145100, best loss: 0.130650 2025-01-16 02:12:37,777 - INFO - step 20717, loss: 0.239940, best loss: 0.130650 2025-01-16 02:12:37,927 - INFO - step 20718, loss: 0.162724, best loss: 0.130650 2025-01-16 02:12:38,077 - INFO - step 20719, loss: 0.177922, best loss: 0.130650 2025-01-16 02:12:38,227 - INFO - step 20720, loss: 0.161277, best loss: 0.130650 2025-01-16 02:12:38,377 - INFO - step 20721, loss: 0.132980, best loss: 0.130650 2025-01-16 02:12:38,527 - INFO - step 20722, loss: 0.169898, best loss: 0.130650 2025-01-16 02:12:38,677 - INFO - step 20723, loss: 0.143259, best loss: 0.130650 2025-01-16 02:12:38,827 - INFO - step 20724, loss: 0.166309, best loss: 0.130650 2025-01-16 02:12:38,977 - INFO - step 20725, loss: 0.171195, best loss: 0.130650 2025-01-16 02:12:39,127 - INFO - step 20726, loss: 0.172840, best loss: 0.130650 2025-01-16 02:12:39,277 - INFO - step 20727, loss: 0.185100, best loss: 0.130650 2025-01-16 02:12:39,427 - INFO - step 20728, loss: 0.168977, best loss: 0.130650 2025-01-16 02:12:39,578 - INFO - step 20729, loss: 0.180915, best loss: 0.130650 2025-01-16 02:12:39,728 - INFO - step 20730, loss: 0.178642, best loss: 0.130650 2025-01-16 02:12:39,878 - INFO - step 20731, loss: 0.197183, best loss: 0.130650 2025-01-16 02:12:40,028 - INFO - step 20732, loss: 0.201426, best loss: 0.130650 2025-01-16 02:12:40,178 - INFO - step 20733, loss: 0.213499, best loss: 0.130650 2025-01-16 02:12:40,328 - INFO - step 20734, loss: 0.159612, best loss: 0.130650 2025-01-16 02:12:40,478 - INFO - step 20735, loss: 0.176825, best loss: 0.130650 2025-01-16 02:12:40,627 - INFO - step 20736, loss: 0.148711, best loss: 0.130650 2025-01-16 02:12:40,778 - INFO - step 20737, loss: 0.173980, best loss: 0.130650 2025-01-16 02:12:40,928 - INFO - step 20738, loss: 0.150681, best loss: 0.130650 2025-01-16 02:12:41,078 - INFO - step 20739, loss: 
0.149553, best loss: 0.130650 2025-01-16 02:12:41,228 - INFO - step 20740, loss: 0.151270, best loss: 0.130650 2025-01-16 02:12:41,378 - INFO - step 20741, loss: 0.133717, best loss: 0.130650 2025-01-16 02:12:41,528 - INFO - step 20742, loss: 0.191749, best loss: 0.130650 2025-01-16 02:12:41,678 - INFO - step 20743, loss: 0.131632, best loss: 0.130650 2025-01-16 02:12:41,828 - INFO - step 20744, loss: 0.174637, best loss: 0.130650 2025-01-16 02:12:41,978 - INFO - step 20745, loss: 0.151422, best loss: 0.130650 2025-01-16 02:12:42,128 - INFO - step 20746, loss: 0.195339, best loss: 0.130650 2025-01-16 02:12:42,278 - INFO - step 20747, loss: 0.163272, best loss: 0.130650 2025-01-16 02:12:42,428 - INFO - step 20748, loss: 0.173349, best loss: 0.130650 2025-01-16 02:12:42,578 - INFO - step 20749, loss: 0.166319, best loss: 0.130650 2025-01-16 02:12:42,729 - INFO - step 20750, loss: 0.140543, best loss: 0.130650 2025-01-16 02:12:42,879 - INFO - step 20751, loss: 0.153930, best loss: 0.130650 2025-01-16 02:12:43,029 - INFO - step 20752, loss: 0.168720, best loss: 0.130650 2025-01-16 02:12:43,179 - INFO - step 20753, loss: 0.178659, best loss: 0.130650 2025-01-16 02:12:43,328 - INFO - step 20754, loss: 0.149767, best loss: 0.130650 2025-01-16 02:12:43,478 - INFO - step 20755, loss: 0.177702, best loss: 0.130650 2025-01-16 02:12:43,628 - INFO - step 20756, loss: 0.180500, best loss: 0.130650 2025-01-16 02:12:43,778 - INFO - step 20757, loss: 0.190008, best loss: 0.130650 2025-01-16 02:12:43,928 - INFO - step 20758, loss: 0.156785, best loss: 0.130650 2025-01-16 02:12:44,078 - INFO - step 20759, loss: 0.161506, best loss: 0.130650 2025-01-16 02:12:44,228 - INFO - step 20760, loss: 0.166797, best loss: 0.130650 2025-01-16 02:12:44,378 - INFO - step 20761, loss: 0.162548, best loss: 0.130650 2025-01-16 02:12:44,528 - INFO - step 20762, loss: 0.150753, best loss: 0.130650 2025-01-16 02:12:44,678 - INFO - step 20763, loss: 0.140388, best loss: 0.130650 2025-01-16 02:12:44,828 - 
INFO - step 20764, loss: 0.180056, best loss: 0.130650 2025-01-16 02:12:44,978 - INFO - step 20765, loss: 0.188100, best loss: 0.130650 2025-01-16 02:12:45,128 - INFO - step 20766, loss: 0.131772, best loss: 0.130650 2025-01-16 02:12:45,278 - INFO - step 20767, loss: 0.172383, best loss: 0.130650 2025-01-16 02:12:45,428 - INFO - step 20768, loss: 0.153084, best loss: 0.130650 2025-01-16 02:12:45,578 - INFO - step 20769, loss: 0.151663, best loss: 0.130650 2025-01-16 02:12:45,728 - INFO - step 20770, loss: 0.162445, best loss: 0.130650 2025-01-16 02:12:45,878 - INFO - step 20771, loss: 0.149348, best loss: 0.130650 2025-01-16 02:12:46,029 - INFO - step 20772, loss: 0.199470, best loss: 0.130650 2025-01-16 02:12:46,179 - INFO - step 20773, loss: 0.143799, best loss: 0.130650 2025-01-16 02:12:46,329 - INFO - step 20774, loss: 0.189036, best loss: 0.130650 2025-01-16 02:12:46,479 - INFO - step 20775, loss: 0.178779, best loss: 0.130650 2025-01-16 02:12:50,006 - INFO - step 20776, loss: 0.126631, best loss: 0.126631 2025-01-16 02:12:50,167 - INFO - step 20777, loss: 0.139658, best loss: 0.126631 2025-01-16 02:12:50,318 - INFO - step 20778, loss: 0.162241, best loss: 0.126631 2025-01-16 02:12:50,469 - INFO - step 20779, loss: 0.179531, best loss: 0.126631 2025-01-16 02:12:50,619 - INFO - step 20780, loss: 0.176380, best loss: 0.126631 2025-01-16 02:12:50,769 - INFO - step 20781, loss: 0.163770, best loss: 0.126631 2025-01-16 02:12:50,919 - INFO - step 20782, loss: 0.168076, best loss: 0.126631 2025-01-16 02:12:51,069 - INFO - step 20783, loss: 0.139674, best loss: 0.126631 2025-01-16 02:12:51,219 - INFO - step 20784, loss: 0.152316, best loss: 0.126631 2025-01-16 02:12:51,369 - INFO - step 20785, loss: 0.168549, best loss: 0.126631 2025-01-16 02:12:51,519 - INFO - step 20786, loss: 0.191017, best loss: 0.126631 2025-01-16 02:12:55,196 - INFO - step 20787, loss: 0.106677, best loss: 0.106677 2025-01-16 02:12:55,346 - INFO - step 20788, loss: 0.156200, best loss: 0.106677 
2025-01-16 02:12:55,496 - INFO - step 20789, loss: 0.152734, best loss: 0.106677 2025-01-16 02:12:55,647 - INFO - step 20790, loss: 0.163932, best loss: 0.106677 2025-01-16 02:12:55,797 - INFO - step 20791, loss: 0.173114, best loss: 0.106677 2025-01-16 02:12:55,947 - INFO - step 20792, loss: 0.147310, best loss: 0.106677 2025-01-16 02:12:56,097 - INFO - step 20793, loss: 0.155134, best loss: 0.106677 2025-01-16 02:12:56,247 - INFO - step 20794, loss: 0.182395, best loss: 0.106677 2025-01-16 02:12:56,396 - INFO - step 20795, loss: 0.182259, best loss: 0.106677 2025-01-16 02:12:56,546 - INFO - step 20796, loss: 0.174413, best loss: 0.106677 2025-01-16 02:12:56,696 - INFO - step 20797, loss: 0.211051, best loss: 0.106677 2025-01-16 02:12:56,846 - INFO - step 20798, loss: 0.156193, best loss: 0.106677 2025-01-16 02:12:56,997 - INFO - step 20799, loss: 0.143108, best loss: 0.106677 2025-01-16 02:12:57,147 - INFO - step 20800, loss: 0.181095, best loss: 0.106677 2025-01-16 02:12:57,297 - INFO - step 20801, loss: 0.164179, best loss: 0.106677 2025-01-16 02:12:57,447 - INFO - step 20802, loss: 0.144549, best loss: 0.106677 2025-01-16 02:12:57,597 - INFO - step 20803, loss: 0.151124, best loss: 0.106677 2025-01-16 02:12:57,747 - INFO - step 20804, loss: 0.157663, best loss: 0.106677 2025-01-16 02:12:57,897 - INFO - step 20805, loss: 0.166033, best loss: 0.106677 2025-01-16 02:12:58,047 - INFO - step 20806, loss: 0.176815, best loss: 0.106677 2025-01-16 02:12:58,197 - INFO - step 20807, loss: 0.206941, best loss: 0.106677 2025-01-16 02:12:58,348 - INFO - step 20808, loss: 0.185505, best loss: 0.106677 2025-01-16 02:12:58,498 - INFO - step 20809, loss: 0.174056, best loss: 0.106677 2025-01-16 02:12:58,648 - INFO - step 20810, loss: 0.153024, best loss: 0.106677 2025-01-16 02:12:58,798 - INFO - step 20811, loss: 0.197553, best loss: 0.106677 2025-01-16 02:12:58,948 - INFO - step 20812, loss: 0.180380, best loss: 0.106677 2025-01-16 02:12:59,098 - INFO - step 20813, loss: 
0.158274, best loss: 0.106677 2025-01-16 02:12:59,248 - INFO - step 20814, loss: 0.147514, best loss: 0.106677 2025-01-16 02:12:59,398 - INFO - step 20815, loss: 0.177733, best loss: 0.106677 2025-01-16 02:12:59,549 - INFO - step 20816, loss: 0.159460, best loss: 0.106677 2025-01-16 02:12:59,699 - INFO - step 20817, loss: 0.155102, best loss: 0.106677 2025-01-16 02:12:59,849 - INFO - step 20818, loss: 0.136976, best loss: 0.106677 2025-01-16 02:12:59,999 - INFO - step 20819, loss: 0.148157, best loss: 0.106677 2025-01-16 02:13:00,149 - INFO - step 20820, loss: 0.147244, best loss: 0.106677 2025-01-16 02:13:00,299 - INFO - step 20821, loss: 0.178236, best loss: 0.106677 2025-01-16 02:13:00,449 - INFO - step 20822, loss: 0.148231, best loss: 0.106677 2025-01-16 02:13:00,600 - INFO - step 20823, loss: 0.171493, best loss: 0.106677 2025-01-16 02:13:00,750 - INFO - step 20824, loss: 0.151271, best loss: 0.106677 2025-01-16 02:13:00,900 - INFO - step 20825, loss: 0.190443, best loss: 0.106677 2025-01-16 02:13:01,050 - INFO - step 20826, loss: 0.162976, best loss: 0.106677 2025-01-16 02:13:01,200 - INFO - step 20827, loss: 0.198047, best loss: 0.106677 2025-01-16 02:13:01,350 - INFO - step 20828, loss: 0.204475, best loss: 0.106677 2025-01-16 02:13:01,500 - INFO - step 20829, loss: 0.196477, best loss: 0.106677 2025-01-16 02:13:01,650 - INFO - step 20830, loss: 0.218363, best loss: 0.106677 2025-01-16 02:13:01,801 - INFO - step 20831, loss: 0.181421, best loss: 0.106677 2025-01-16 02:13:01,951 - INFO - step 20832, loss: 0.228938, best loss: 0.106677 2025-01-16 02:13:02,101 - INFO - step 20833, loss: 0.147972, best loss: 0.106677 2025-01-16 02:13:02,251 - INFO - step 20834, loss: 0.157727, best loss: 0.106677 2025-01-16 02:13:02,401 - INFO - step 20835, loss: 0.143031, best loss: 0.106677 2025-01-16 02:13:02,551 - INFO - step 20836, loss: 0.198659, best loss: 0.106677 2025-01-16 02:13:02,701 - INFO - step 20837, loss: 0.170062, best loss: 0.106677 2025-01-16 02:13:02,851 - 
INFO - step 20838, loss: 0.210326, best loss: 0.106677 2025-01-16 02:13:03,001 - INFO - step 20839, loss: 0.162591, best loss: 0.106677 2025-01-16 02:13:03,151 - INFO - step 20840, loss: 0.157383, best loss: 0.106677 2025-01-16 02:13:03,302 - INFO - step 20841, loss: 0.182823, best loss: 0.106677 2025-01-16 02:13:03,453 - INFO - step 20842, loss: 0.171169, best loss: 0.106677 2025-01-16 02:13:03,603 - INFO - step 20843, loss: 0.181640, best loss: 0.106677 2025-01-16 02:13:03,753 - INFO - step 20844, loss: 0.174344, best loss: 0.106677 2025-01-16 02:13:03,903 - INFO - step 20845, loss: 0.194103, best loss: 0.106677 2025-01-16 02:13:04,053 - INFO - step 20846, loss: 0.180300, best loss: 0.106677 2025-01-16 02:13:04,204 - INFO - step 20847, loss: 0.181603, best loss: 0.106677 2025-01-16 02:13:04,353 - INFO - step 20848, loss: 0.188232, best loss: 0.106677 2025-01-16 02:13:04,503 - INFO - step 20849, loss: 0.193404, best loss: 0.106677 2025-01-16 02:13:04,654 - INFO - step 20850, loss: 0.169485, best loss: 0.106677 2025-01-16 02:13:04,803 - INFO - step 20851, loss: 0.182567, best loss: 0.106677 2025-01-16 02:13:04,953 - INFO - step 20852, loss: 0.187522, best loss: 0.106677 2025-01-16 02:13:05,103 - INFO - step 20853, loss: 0.158078, best loss: 0.106677 2025-01-16 02:13:05,253 - INFO - step 20854, loss: 0.166280, best loss: 0.106677 2025-01-16 02:13:05,403 - INFO - step 20855, loss: 0.153514, best loss: 0.106677 2025-01-16 02:13:05,554 - INFO - step 20856, loss: 0.202027, best loss: 0.106677 2025-01-16 02:13:05,704 - INFO - step 20857, loss: 0.173690, best loss: 0.106677 2025-01-16 02:13:05,854 - INFO - step 20858, loss: 0.171972, best loss: 0.106677 2025-01-16 02:13:06,004 - INFO - step 20859, loss: 0.171169, best loss: 0.106677 2025-01-16 02:13:06,154 - INFO - step 20860, loss: 0.210456, best loss: 0.106677 2025-01-16 02:13:06,304 - INFO - step 20861, loss: 0.194031, best loss: 0.106677 2025-01-16 02:13:06,454 - INFO - step 20862, loss: 0.187016, best loss: 0.106677 
2025-01-16 02:13:06,604 - INFO - step 20863, loss: 0.210476, best loss: 0.106677 2025-01-16 02:13:06,754 - INFO - step 20864, loss: 0.180178, best loss: 0.106677 2025-01-16 02:13:06,904 - INFO - step 20865, loss: 0.193841, best loss: 0.106677 2025-01-16 02:13:07,055 - INFO - step 20866, loss: 0.186373, best loss: 0.106677 2025-01-16 02:13:07,205 - INFO - step 20867, loss: 0.198007, best loss: 0.106677 2025-01-16 02:13:07,355 - INFO - step 20868, loss: 0.183760, best loss: 0.106677 2025-01-16 02:13:07,505 - INFO - step 20869, loss: 0.190896, best loss: 0.106677 2025-01-16 02:13:07,655 - INFO - step 20870, loss: 0.173806, best loss: 0.106677 2025-01-16 02:13:07,805 - INFO - step 20871, loss: 0.183431, best loss: 0.106677 2025-01-16 02:13:07,955 - INFO - step 20872, loss: 0.136053, best loss: 0.106677 2025-01-16 02:13:08,105 - INFO - step 20873, loss: 0.203253, best loss: 0.106677 2025-01-16 02:13:08,255 - INFO - step 20874, loss: 0.183695, best loss: 0.106677 2025-01-16 02:13:08,405 - INFO - step 20875, loss: 0.178690, best loss: 0.106677 2025-01-16 02:13:08,555 - INFO - step 20876, loss: 0.167392, best loss: 0.106677 2025-01-16 02:13:08,705 - INFO - step 20877, loss: 0.180892, best loss: 0.106677 2025-01-16 02:13:08,855 - INFO - step 20878, loss: 0.210520, best loss: 0.106677 2025-01-16 02:13:09,005 - INFO - step 20879, loss: 0.182506, best loss: 0.106677 2025-01-16 02:13:09,156 - INFO - step 20880, loss: 0.161304, best loss: 0.106677 2025-01-16 02:13:09,306 - INFO - step 20881, loss: 0.275562, best loss: 0.106677 2025-01-16 02:13:09,456 - INFO - step 20882, loss: 0.172073, best loss: 0.106677 2025-01-16 02:13:09,606 - INFO - step 20883, loss: 0.173175, best loss: 0.106677 2025-01-16 02:13:09,756 - INFO - step 20884, loss: 0.183782, best loss: 0.106677 2025-01-16 02:13:09,906 - INFO - step 20885, loss: 0.226047, best loss: 0.106677 2025-01-16 02:13:10,056 - INFO - step 20886, loss: 0.182816, best loss: 0.106677 2025-01-16 02:13:10,206 - INFO - step 20887, loss: 
0.179090, best loss: 0.106677 2025-01-16 02:13:10,357 - INFO - step 20888, loss: 0.167792, best loss: 0.106677 2025-01-16 02:13:10,507 - INFO - step 20889, loss: 0.180169, best loss: 0.106677 2025-01-16 02:13:10,657 - INFO - step 20890, loss: 0.169691, best loss: 0.106677 2025-01-16 02:13:10,807 - INFO - step 20891, loss: 0.225423, best loss: 0.106677 2025-01-16 02:13:10,957 - INFO - step 20892, loss: 0.234188, best loss: 0.106677 2025-01-16 02:13:11,107 - INFO - step 20893, loss: 0.180094, best loss: 0.106677 2025-01-16 02:13:11,257 - INFO - step 20894, loss: 0.193463, best loss: 0.106677 2025-01-16 02:13:11,407 - INFO - step 20895, loss: 0.211316, best loss: 0.106677 2025-01-16 02:13:11,557 - INFO - step 20896, loss: 0.209221, best loss: 0.106677 2025-01-16 02:13:11,707 - INFO - step 20897, loss: 0.145902, best loss: 0.106677 2025-01-16 02:13:11,857 - INFO - step 20898, loss: 0.155080, best loss: 0.106677 2025-01-16 02:13:12,007 - INFO - step 20899, loss: 0.187700, best loss: 0.106677 2025-01-16 02:13:12,157 - INFO - step 20900, loss: 0.147160, best loss: 0.106677 2025-01-16 02:13:12,307 - INFO - step 20901, loss: 0.185748, best loss: 0.106677 2025-01-16 02:13:12,458 - INFO - step 20902, loss: 0.249746, best loss: 0.106677 2025-01-16 02:13:12,608 - INFO - step 20903, loss: 0.180707, best loss: 0.106677 2025-01-16 02:13:12,757 - INFO - step 20904, loss: 0.198763, best loss: 0.106677 2025-01-16 02:13:12,907 - INFO - step 20905, loss: 0.217140, best loss: 0.106677 2025-01-16 02:13:13,057 - INFO - step 20906, loss: 0.190598, best loss: 0.106677 2025-01-16 02:13:13,208 - INFO - step 20907, loss: 0.195580, best loss: 0.106677 2025-01-16 02:13:13,358 - INFO - step 20908, loss: 0.204411, best loss: 0.106677 2025-01-16 02:13:13,508 - INFO - step 20909, loss: 0.163342, best loss: 0.106677 2025-01-16 02:13:13,658 - INFO - step 20910, loss: 0.218280, best loss: 0.106677 2025-01-16 02:13:13,808 - INFO - step 20911, loss: 0.185865, best loss: 0.106677 2025-01-16 02:13:13,958 - 
INFO - step 20912, loss: 0.208593, best loss: 0.106677 2025-01-16 02:13:14,108 - INFO - step 20913, loss: 0.184593, best loss: 0.106677 2025-01-16 02:13:14,258 - INFO - step 20914, loss: 0.212840, best loss: 0.106677 2025-01-16 02:13:14,408 - INFO - step 20915, loss: 0.187787, best loss: 0.106677 2025-01-16 02:13:14,558 - INFO - step 20916, loss: 0.176953, best loss: 0.106677 2025-01-16 02:13:14,709 - INFO - step 20917, loss: 0.184084, best loss: 0.106677 2025-01-16 02:13:14,859 - INFO - step 20918, loss: 0.164850, best loss: 0.106677 2025-01-16 02:13:15,009 - INFO - step 20919, loss: 0.172849, best loss: 0.106677 2025-01-16 02:13:15,159 - INFO - step 20920, loss: 0.206019, best loss: 0.106677 2025-01-16 02:13:15,309 - INFO - step 20921, loss: 0.246745, best loss: 0.106677 2025-01-16 02:13:15,459 - INFO - step 20922, loss: 0.230670, best loss: 0.106677 2025-01-16 02:13:15,609 - INFO - step 20923, loss: 0.204625, best loss: 0.106677 2025-01-16 02:13:15,759 - INFO - step 20924, loss: 0.181139, best loss: 0.106677 2025-01-16 02:13:15,909 - INFO - step 20925, loss: 0.156480, best loss: 0.106677 2025-01-16 02:13:16,059 - INFO - step 20926, loss: 0.203373, best loss: 0.106677 2025-01-16 02:13:16,209 - INFO - step 20927, loss: 0.161129, best loss: 0.106677 2025-01-16 02:13:16,359 - INFO - step 20928, loss: 0.187708, best loss: 0.106677 2025-01-16 02:13:16,509 - INFO - step 20929, loss: 0.182280, best loss: 0.106677 2025-01-16 02:13:16,659 - INFO - step 20930, loss: 0.187351, best loss: 0.106677 2025-01-16 02:13:16,810 - INFO - step 20931, loss: 0.200652, best loss: 0.106677 2025-01-16 02:13:16,960 - INFO - step 20932, loss: 0.156795, best loss: 0.106677 2025-01-16 02:13:17,110 - INFO - step 20933, loss: 0.162432, best loss: 0.106677 2025-01-16 02:13:17,260 - INFO - step 20934, loss: 0.180108, best loss: 0.106677 2025-01-16 02:13:17,410 - INFO - step 20935, loss: 0.159300, best loss: 0.106677 2025-01-16 02:13:17,560 - INFO - step 20936, loss: 0.220739, best loss: 0.106677 
2025-01-16 02:13:17,711 - INFO - step 20937, loss: 0.164773, best loss: 0.106677 2025-01-16 02:13:17,861 - INFO - step 20938, loss: 0.207542, best loss: 0.106677 2025-01-16 02:13:18,011 - INFO - step 20939, loss: 0.187086, best loss: 0.106677 2025-01-16 02:13:18,161 - INFO - step 20940, loss: 0.172822, best loss: 0.106677 2025-01-16 02:13:18,311 - INFO - step 20941, loss: 0.183444, best loss: 0.106677 2025-01-16 02:13:18,461 - INFO - step 20942, loss: 0.187283, best loss: 0.106677 2025-01-16 02:13:18,611 - INFO - step 20943, loss: 0.156588, best loss: 0.106677 2025-01-16 02:13:18,761 - INFO - step 20944, loss: 0.196579, best loss: 0.106677 2025-01-16 02:13:18,911 - INFO - step 20945, loss: 0.227425, best loss: 0.106677 2025-01-16 02:13:19,061 - INFO - step 20946, loss: 0.217944, best loss: 0.106677 2025-01-16 02:13:19,211 - INFO - step 20947, loss: 0.184498, best loss: 0.106677 2025-01-16 02:13:19,361 - INFO - step 20948, loss: 0.148297, best loss: 0.106677 2025-01-16 02:13:19,511 - INFO - step 20949, loss: 0.199318, best loss: 0.106677 2025-01-16 02:13:19,661 - INFO - step 20950, loss: 0.149038, best loss: 0.106677 2025-01-16 02:13:19,811 - INFO - step 20951, loss: 0.157217, best loss: 0.106677 2025-01-16 02:13:19,961 - INFO - step 20952, loss: 0.194715, best loss: 0.106677 2025-01-16 02:13:20,111 - INFO - step 20953, loss: 0.147292, best loss: 0.106677 2025-01-16 02:13:20,262 - INFO - step 20954, loss: 0.176141, best loss: 0.106677 2025-01-16 02:13:20,412 - INFO - step 20955, loss: 0.166010, best loss: 0.106677 2025-01-16 02:13:20,562 - INFO - step 20956, loss: 0.208789, best loss: 0.106677 2025-01-16 02:13:20,712 - INFO - step 20957, loss: 0.185000, best loss: 0.106677 2025-01-16 02:13:20,862 - INFO - step 20958, loss: 0.178285, best loss: 0.106677 2025-01-16 02:13:21,012 - INFO - step 20959, loss: 0.178203, best loss: 0.106677 2025-01-16 02:13:21,162 - INFO - step 20960, loss: 0.169593, best loss: 0.106677 2025-01-16 02:13:21,312 - INFO - step 20961, loss: 
0.192716, best loss: 0.106677 2025-01-16 02:13:21,462 - INFO - step 20962, loss: 0.166110, best loss: 0.106677 2025-01-16 02:13:21,612 - INFO - step 20963, loss: 0.202552, best loss: 0.106677 2025-01-16 02:13:21,762 - INFO - step 20964, loss: 0.176121, best loss: 0.106677 2025-01-16 02:13:21,912 - INFO - step 20965, loss: 0.146155, best loss: 0.106677 2025-01-16 02:13:22,062 - INFO - step 20966, loss: 0.235416, best loss: 0.106677 2025-01-16 02:13:22,213 - INFO - step 20967, loss: 0.158073, best loss: 0.106677 2025-01-16 02:13:22,363 - INFO - step 20968, loss: 0.165296, best loss: 0.106677 2025-01-16 02:13:22,513 - INFO - step 20969, loss: 0.165038, best loss: 0.106677 2025-01-16 02:13:22,663 - INFO - step 20970, loss: 0.182259, best loss: 0.106677 2025-01-16 02:13:22,813 - INFO - step 20971, loss: 0.225741, best loss: 0.106677 2025-01-16 02:13:22,963 - INFO - step 20972, loss: 0.193658, best loss: 0.106677 2025-01-16 02:13:23,113 - INFO - step 20973, loss: 0.165465, best loss: 0.106677 2025-01-16 02:13:23,263 - INFO - step 20974, loss: 0.152900, best loss: 0.106677 2025-01-16 02:13:23,413 - INFO - step 20975, loss: 0.154374, best loss: 0.106677 2025-01-16 02:13:23,563 - INFO - step 20976, loss: 0.247980, best loss: 0.106677 2025-01-16 02:13:23,714 - INFO - step 20977, loss: 0.177658, best loss: 0.106677 2025-01-16 02:13:23,864 - INFO - step 20978, loss: 0.166589, best loss: 0.106677 2025-01-16 02:13:24,014 - INFO - step 20979, loss: 0.182244, best loss: 0.106677 2025-01-16 02:13:24,164 - INFO - step 20980, loss: 0.213987, best loss: 0.106677 2025-01-16 02:13:24,314 - INFO - step 20981, loss: 0.197179, best loss: 0.106677 2025-01-16 02:13:24,464 - INFO - step 20982, loss: 0.164563, best loss: 0.106677 2025-01-16 02:13:24,614 - INFO - step 20983, loss: 0.200005, best loss: 0.106677 2025-01-16 02:13:24,764 - INFO - step 20984, loss: 0.234786, best loss: 0.106677 2025-01-16 02:13:24,914 - INFO - step 20985, loss: 0.161771, best loss: 0.106677 2025-01-16 02:13:25,065 - 
INFO - step 20986, loss: 0.179150, best loss: 0.106677 2025-01-16 02:13:25,215 - INFO - step 20987, loss: 0.151217, best loss: 0.106677 2025-01-16 02:13:25,365 - INFO - step 20988, loss: 0.177907, best loss: 0.106677 2025-01-16 02:13:25,515 - INFO - step 20989, loss: 0.220350, best loss: 0.106677 2025-01-16 02:13:25,665 - INFO - step 20990, loss: 0.158847, best loss: 0.106677 2025-01-16 02:13:25,815 - INFO - step 20991, loss: 0.187439, best loss: 0.106677 2025-01-16 02:13:25,965 - INFO - step 20992, loss: 0.191292, best loss: 0.106677 2025-01-16 02:13:26,115 - INFO - step 20993, loss: 0.165098, best loss: 0.106677 2025-01-16 02:13:26,265 - INFO - step 20994, loss: 0.163492, best loss: 0.106677 2025-01-16 02:13:26,416 - INFO - step 20995, loss: 0.175005, best loss: 0.106677 2025-01-16 02:13:26,566 - INFO - step 20996, loss: 0.142963, best loss: 0.106677 2025-01-16 02:13:26,716 - INFO - step 20997, loss: 0.143170, best loss: 0.106677 2025-01-16 02:13:26,866 - INFO - step 20998, loss: 0.165002, best loss: 0.106677 2025-01-16 02:13:27,016 - INFO - step 20999, loss: 0.190987, best loss: 0.106677 2025-01-16 02:13:27,166 - INFO - step 21000, loss: 0.152520, best loss: 0.106677 2025-01-16 02:13:27,316 - INFO - step 21001, loss: 0.203699, best loss: 0.106677 2025-01-16 02:13:27,466 - INFO - step 21002, loss: 0.156778, best loss: 0.106677 2025-01-16 02:13:27,616 - INFO - step 21003, loss: 0.180547, best loss: 0.106677 2025-01-16 02:13:27,766 - INFO - step 21004, loss: 0.194613, best loss: 0.106677 2025-01-16 02:13:27,916 - INFO - step 21005, loss: 0.203736, best loss: 0.106677 2025-01-16 02:13:28,067 - INFO - step 21006, loss: 0.215585, best loss: 0.106677 2025-01-16 02:13:28,217 - INFO - step 21007, loss: 0.169974, best loss: 0.106677 2025-01-16 02:13:28,367 - INFO - step 21008, loss: 0.171740, best loss: 0.106677 2025-01-16 02:13:28,517 - INFO - step 21009, loss: 0.172090, best loss: 0.106677 2025-01-16 02:13:28,667 - INFO - step 21010, loss: 0.165050, best loss: 0.106677 
2025-01-16 02:13:28,817 - INFO - step 21011, loss: 0.182793, best loss: 0.106677 2025-01-16 02:13:28,967 - INFO - step 21012, loss: 0.187832, best loss: 0.106677 2025-01-16 02:13:29,118 - INFO - step 21013, loss: 0.216649, best loss: 0.106677 2025-01-16 02:13:29,268 - INFO - step 21014, loss: 0.185596, best loss: 0.106677 2025-01-16 02:13:29,418 - INFO - step 21015, loss: 0.171236, best loss: 0.106677 2025-01-16 02:13:29,568 - INFO - step 21016, loss: 0.180693, best loss: 0.106677 2025-01-16 02:13:29,718 - INFO - step 21017, loss: 0.185905, best loss: 0.106677 2025-01-16 02:13:29,868 - INFO - step 21018, loss: 0.152202, best loss: 0.106677 2025-01-16 02:13:30,018 - INFO - step 21019, loss: 0.234499, best loss: 0.106677 2025-01-16 02:13:30,168 - INFO - step 21020, loss: 0.166332, best loss: 0.106677 2025-01-16 02:13:30,319 - INFO - step 21021, loss: 0.198700, best loss: 0.106677 2025-01-16 02:13:30,469 - INFO - step 21022, loss: 0.198748, best loss: 0.106677 2025-01-16 02:13:30,619 - INFO - step 21023, loss: 0.201476, best loss: 0.106677 2025-01-16 02:13:30,769 - INFO - step 21024, loss: 0.203192, best loss: 0.106677 2025-01-16 02:13:30,919 - INFO - step 21025, loss: 0.206251, best loss: 0.106677 2025-01-16 02:13:31,069 - INFO - step 21026, loss: 0.218056, best loss: 0.106677 2025-01-16 02:13:31,219 - INFO - step 21027, loss: 0.160004, best loss: 0.106677 2025-01-16 02:13:31,369 - INFO - step 21028, loss: 0.188157, best loss: 0.106677 2025-01-16 02:13:31,519 - INFO - step 21029, loss: 0.227255, best loss: 0.106677 2025-01-16 02:13:31,669 - INFO - step 21030, loss: 0.217969, best loss: 0.106677 2025-01-16 02:13:31,819 - INFO - step 21031, loss: 0.210978, best loss: 0.106677 2025-01-16 02:13:31,969 - INFO - step 21032, loss: 0.211800, best loss: 0.106677 2025-01-16 02:13:32,119 - INFO - step 21033, loss: 0.160252, best loss: 0.106677 2025-01-16 02:13:32,269 - INFO - step 21034, loss: 0.246408, best loss: 0.106677 2025-01-16 02:13:32,419 - INFO - step 21035, loss: 
0.153042, best loss: 0.106677 2025-01-16 02:13:32,569 - INFO - step 21036, loss: 0.193740, best loss: 0.106677 2025-01-16 02:13:32,719 - INFO - step 21037, loss: 0.197751, best loss: 0.106677 2025-01-16 02:13:32,869 - INFO - step 21038, loss: 0.172588, best loss: 0.106677 2025-01-16 02:13:33,019 - INFO - step 21039, loss: 0.149762, best loss: 0.106677 2025-01-16 02:13:33,169 - INFO - step 21040, loss: 0.157204, best loss: 0.106677 2025-01-16 02:13:33,319 - INFO - step 21041, loss: 0.211754, best loss: 0.106677 2025-01-16 02:13:33,469 - INFO - step 21042, loss: 0.165250, best loss: 0.106677 2025-01-16 02:13:33,619 - INFO - step 21043, loss: 0.168169, best loss: 0.106677 2025-01-16 02:13:33,769 - INFO - step 21044, loss: 0.140914, best loss: 0.106677 2025-01-16 02:13:33,920 - INFO - step 21045, loss: 0.139450, best loss: 0.106677 2025-01-16 02:13:34,070 - INFO - step 21046, loss: 0.171293, best loss: 0.106677 2025-01-16 02:13:34,220 - INFO - step 21047, loss: 0.199920, best loss: 0.106677 2025-01-16 02:13:34,370 - INFO - step 21048, loss: 0.193965, best loss: 0.106677 2025-01-16 02:13:34,520 - INFO - step 21049, loss: 0.178335, best loss: 0.106677 2025-01-16 02:13:34,670 - INFO - step 21050, loss: 0.148016, best loss: 0.106677 2025-01-16 02:13:34,820 - INFO - step 21051, loss: 0.186495, best loss: 0.106677 2025-01-16 02:13:34,970 - INFO - step 21052, loss: 0.196337, best loss: 0.106677 2025-01-16 02:13:35,120 - INFO - step 21053, loss: 0.152194, best loss: 0.106677 2025-01-16 02:13:35,270 - INFO - step 21054, loss: 0.178414, best loss: 0.106677 2025-01-16 02:13:35,420 - INFO - step 21055, loss: 0.200921, best loss: 0.106677 2025-01-16 02:13:35,571 - INFO - step 21056, loss: 0.160635, best loss: 0.106677 2025-01-16 02:13:35,721 - INFO - step 21057, loss: 0.148370, best loss: 0.106677 2025-01-16 02:13:35,871 - INFO - step 21058, loss: 0.160186, best loss: 0.106677 2025-01-16 02:13:36,021 - INFO - step 21059, loss: 0.171496, best loss: 0.106677 2025-01-16 02:13:36,171 - 
INFO - step 21060, loss: 0.162472, best loss: 0.106677 2025-01-16 02:13:36,321 - INFO - step 21061, loss: 0.143765, best loss: 0.106677 2025-01-16 02:13:36,471 - INFO - step 21062, loss: 0.195116, best loss: 0.106677 2025-01-16 02:13:36,621 - INFO - step 21063, loss: 0.179777, best loss: 0.106677 2025-01-16 02:13:36,771 - INFO - step 21064, loss: 0.158377, best loss: 0.106677 2025-01-16 02:13:36,921 - INFO - step 21065, loss: 0.208264, best loss: 0.106677 2025-01-16 02:13:37,072 - INFO - step 21066, loss: 0.180753, best loss: 0.106677 2025-01-16 02:13:37,222 - INFO - step 21067, loss: 0.158149, best loss: 0.106677 2025-01-16 02:13:37,372 - INFO - step 21068, loss: 0.181044, best loss: 0.106677 2025-01-16 02:13:37,522 - INFO - step 21069, loss: 0.184268, best loss: 0.106677 2025-01-16 02:13:37,672 - INFO - step 21070, loss: 0.177098, best loss: 0.106677 2025-01-16 02:13:37,822 - INFO - step 21071, loss: 0.138310, best loss: 0.106677 2025-01-16 02:13:37,972 - INFO - step 21072, loss: 0.161550, best loss: 0.106677 2025-01-16 02:13:38,122 - INFO - step 21073, loss: 0.220331, best loss: 0.106677 2025-01-16 02:13:38,272 - INFO - step 21074, loss: 0.178966, best loss: 0.106677 2025-01-16 02:13:38,423 - INFO - step 21075, loss: 0.146425, best loss: 0.106677 2025-01-16 02:13:38,573 - INFO - step 21076, loss: 0.213396, best loss: 0.106677 2025-01-16 02:13:38,723 - INFO - step 21077, loss: 0.183363, best loss: 0.106677 2025-01-16 02:13:38,873 - INFO - step 21078, loss: 0.197434, best loss: 0.106677 2025-01-16 02:13:39,023 - INFO - step 21079, loss: 0.158329, best loss: 0.106677 2025-01-16 02:13:39,172 - INFO - step 21080, loss: 0.173754, best loss: 0.106677 2025-01-16 02:13:39,322 - INFO - step 21081, loss: 0.177551, best loss: 0.106677 2025-01-16 02:13:39,473 - INFO - step 21082, loss: 0.157320, best loss: 0.106677 2025-01-16 02:13:39,623 - INFO - step 21083, loss: 0.178179, best loss: 0.106677 2025-01-16 02:13:39,774 - INFO - step 21084, loss: 0.177707, best loss: 0.106677 
2025-01-16 02:13:39,923 - INFO - step 21085, loss: 0.187783, best loss: 0.106677 2025-01-16 02:13:40,073 - INFO - step 21086, loss: 0.157168, best loss: 0.106677 2025-01-16 02:13:40,223 - INFO - step 21087, loss: 0.161206, best loss: 0.106677 2025-01-16 02:13:40,374 - INFO - step 21088, loss: 0.172650, best loss: 0.106677 2025-01-16 02:13:40,524 - INFO - step 21089, loss: 0.147331, best loss: 0.106677 2025-01-16 02:13:40,674 - INFO - step 21090, loss: 0.171053, best loss: 0.106677 2025-01-16 02:13:40,824 - INFO - step 21091, loss: 0.163418, best loss: 0.106677 2025-01-16 02:13:40,975 - INFO - step 21092, loss: 0.159902, best loss: 0.106677 2025-01-16 02:13:41,125 - INFO - step 21093, loss: 0.135838, best loss: 0.106677 2025-01-16 02:13:41,275 - INFO - step 21094, loss: 0.169277, best loss: 0.106677 2025-01-16 02:13:41,425 - INFO - step 21095, loss: 0.179167, best loss: 0.106677 2025-01-16 02:13:41,575 - INFO - step 21096, loss: 0.140210, best loss: 0.106677 2025-01-16 02:13:41,725 - INFO - step 21097, loss: 0.128500, best loss: 0.106677 2025-01-16 02:13:41,875 - INFO - step 21098, loss: 0.201921, best loss: 0.106677 2025-01-16 02:13:42,025 - INFO - step 21099, loss: 0.187764, best loss: 0.106677 2025-01-16 02:13:42,175 - INFO - step 21100, loss: 0.141811, best loss: 0.106677 2025-01-16 02:13:42,325 - INFO - step 21101, loss: 0.195221, best loss: 0.106677 2025-01-16 02:13:42,475 - INFO - step 21102, loss: 0.211060, best loss: 0.106677 2025-01-16 02:13:42,625 - INFO - step 21103, loss: 0.128131, best loss: 0.106677 2025-01-16 02:13:42,775 - INFO - step 21104, loss: 0.140213, best loss: 0.106677 2025-01-16 02:13:42,925 - INFO - step 21105, loss: 0.172155, best loss: 0.106677 2025-01-16 02:13:43,075 - INFO - step 21106, loss: 0.113095, best loss: 0.106677 2025-01-16 02:13:43,225 - INFO - step 21107, loss: 0.147989, best loss: 0.106677 2025-01-16 02:13:43,375 - INFO - step 21108, loss: 0.165741, best loss: 0.106677 2025-01-16 02:13:43,525 - INFO - step 21109, loss: 
0.173037, best loss: 0.106677 2025-01-16 02:13:43,676 - INFO - step 21110, loss: 0.186391, best loss: 0.106677 2025-01-16 02:13:43,826 - INFO - step 21111, loss: 0.161455, best loss: 0.106677 2025-01-16 02:13:43,976 - INFO - step 21112, loss: 0.142408, best loss: 0.106677 2025-01-16 02:13:44,126 - INFO - step 21113, loss: 0.153405, best loss: 0.106677 2025-01-16 02:13:44,276 - INFO - step 21114, loss: 0.153324, best loss: 0.106677 2025-01-16 02:13:44,426 - INFO - step 21115, loss: 0.173300, best loss: 0.106677 2025-01-16 02:13:44,576 - INFO - step 21116, loss: 0.219911, best loss: 0.106677 2025-01-16 02:13:44,726 - INFO - step 21117, loss: 0.118708, best loss: 0.106677 2025-01-16 02:13:44,876 - INFO - step 21118, loss: 0.150543, best loss: 0.106677 2025-01-16 02:13:45,026 - INFO - step 21119, loss: 0.181010, best loss: 0.106677 2025-01-16 02:13:45,176 - INFO - step 21120, loss: 0.156798, best loss: 0.106677 2025-01-16 02:13:45,326 - INFO - step 21121, loss: 0.212404, best loss: 0.106677 2025-01-16 02:13:45,476 - INFO - step 21122, loss: 0.169801, best loss: 0.106677 2025-01-16 02:13:45,626 - INFO - step 21123, loss: 0.169202, best loss: 0.106677 2025-01-16 02:13:45,776 - INFO - step 21124, loss: 0.181584, best loss: 0.106677 2025-01-16 02:13:45,926 - INFO - step 21125, loss: 0.181790, best loss: 0.106677 2025-01-16 02:13:46,076 - INFO - step 21126, loss: 0.166514, best loss: 0.106677 2025-01-16 02:13:46,226 - INFO - step 21127, loss: 0.193076, best loss: 0.106677 2025-01-16 02:13:46,376 - INFO - step 21128, loss: 0.211621, best loss: 0.106677 2025-01-16 02:13:46,526 - INFO - step 21129, loss: 0.148724, best loss: 0.106677 2025-01-16 02:13:46,676 - INFO - step 21130, loss: 0.153630, best loss: 0.106677 2025-01-16 02:13:46,827 - INFO - step 21131, loss: 0.150239, best loss: 0.106677 2025-01-16 02:13:46,977 - INFO - step 21132, loss: 0.167317, best loss: 0.106677 2025-01-16 02:13:47,127 - INFO - step 21133, loss: 0.161168, best loss: 0.106677 2025-01-16 02:13:47,277 - 
INFO - step 21134, loss: 0.155470, best loss: 0.106677 2025-01-16 02:13:47,427 - INFO - step 21135, loss: 0.152530, best loss: 0.106677 2025-01-16 02:13:47,577 - INFO - step 21136, loss: 0.198128, best loss: 0.106677 2025-01-16 02:13:47,727 - INFO - step 21137, loss: 0.136274, best loss: 0.106677 2025-01-16 02:13:47,877 - INFO - step 21138, loss: 0.217623, best loss: 0.106677 2025-01-16 02:13:48,027 - INFO - step 21139, loss: 0.176271, best loss: 0.106677 2025-01-16 02:13:48,177 - INFO - step 21140, loss: 0.136215, best loss: 0.106677 2025-01-16 02:13:48,327 - INFO - step 21141, loss: 0.152155, best loss: 0.106677 2025-01-16 02:13:48,478 - INFO - step 21142, loss: 0.117561, best loss: 0.106677 2025-01-16 02:13:48,628 - INFO - step 21143, loss: 0.202021, best loss: 0.106677 2025-01-16 02:13:48,778 - INFO - step 21144, loss: 0.169879, best loss: 0.106677 2025-01-16 02:13:48,928 - INFO - step 21145, loss: 0.195540, best loss: 0.106677 2025-01-16 02:13:49,079 - INFO - step 21146, loss: 0.160722, best loss: 0.106677 2025-01-16 02:13:49,229 - INFO - step 21147, loss: 0.150438, best loss: 0.106677 2025-01-16 02:13:49,379 - INFO - step 21148, loss: 0.148503, best loss: 0.106677 2025-01-16 02:13:49,529 - INFO - step 21149, loss: 0.174614, best loss: 0.106677 2025-01-16 02:13:49,679 - INFO - step 21150, loss: 0.193878, best loss: 0.106677 2025-01-16 02:13:49,830 - INFO - step 21151, loss: 0.155193, best loss: 0.106677 2025-01-16 02:13:49,980 - INFO - step 21152, loss: 0.202798, best loss: 0.106677 2025-01-16 02:13:50,130 - INFO - step 21153, loss: 0.177640, best loss: 0.106677 2025-01-16 02:13:50,280 - INFO - step 21154, loss: 0.196384, best loss: 0.106677 2025-01-16 02:13:50,430 - INFO - step 21155, loss: 0.175348, best loss: 0.106677 2025-01-16 02:13:50,580 - INFO - step 21156, loss: 0.162263, best loss: 0.106677 2025-01-16 02:13:50,730 - INFO - step 21157, loss: 0.172870, best loss: 0.106677 2025-01-16 02:13:50,880 - INFO - step 21158, loss: 0.197269, best loss: 0.106677 
2025-01-16 02:13:51,030 - INFO - step 21159, loss: 0.147346, best loss: 0.106677 2025-01-16 02:13:51,180 - INFO - step 21160, loss: 0.222979, best loss: 0.106677 2025-01-16 02:13:51,330 - INFO - step 21161, loss: 0.194073, best loss: 0.106677 2025-01-16 02:13:51,480 - INFO - step 21162, loss: 0.240211, best loss: 0.106677 2025-01-16 02:13:51,630 - INFO - step 21163, loss: 0.140035, best loss: 0.106677 2025-01-16 02:13:51,780 - INFO - step 21164, loss: 0.155422, best loss: 0.106677 2025-01-16 02:13:51,930 - INFO - step 21165, loss: 0.169724, best loss: 0.106677 2025-01-16 02:13:52,080 - INFO - step 21166, loss: 0.141228, best loss: 0.106677 2025-01-16 02:13:52,230 - INFO - step 21167, loss: 0.194715, best loss: 0.106677 2025-01-16 02:13:52,380 - INFO - step 21168, loss: 0.176982, best loss: 0.106677 2025-01-16 02:13:52,530 - INFO - step 21169, loss: 0.156784, best loss: 0.106677 2025-01-16 02:13:52,681 - INFO - step 21170, loss: 0.177061, best loss: 0.106677 2025-01-16 02:13:52,831 - INFO - step 21171, loss: 0.183673, best loss: 0.106677 2025-01-16 02:13:52,981 - INFO - step 21172, loss: 0.142947, best loss: 0.106677 2025-01-16 02:13:53,131 - INFO - step 21173, loss: 0.159977, best loss: 0.106677 2025-01-16 02:13:53,281 - INFO - step 21174, loss: 0.153117, best loss: 0.106677 2025-01-16 02:13:53,431 - INFO - step 21175, loss: 0.157409, best loss: 0.106677 2025-01-16 02:13:53,581 - INFO - step 21176, loss: 0.163526, best loss: 0.106677 2025-01-16 02:13:53,731 - INFO - step 21177, loss: 0.213108, best loss: 0.106677 2025-01-16 02:13:53,881 - INFO - step 21178, loss: 0.181910, best loss: 0.106677 2025-01-16 02:13:54,031 - INFO - step 21179, loss: 0.181449, best loss: 0.106677 2025-01-16 02:13:54,181 - INFO - step 21180, loss: 0.143148, best loss: 0.106677 2025-01-16 02:13:54,331 - INFO - step 21181, loss: 0.186897, best loss: 0.106677 2025-01-16 02:13:54,481 - INFO - step 21182, loss: 0.177521, best loss: 0.106677 2025-01-16 02:13:54,631 - INFO - step 21183, loss: 
0.141424, best loss: 0.106677 2025-01-16 02:13:54,782 - INFO - step 21184, loss: 0.175024, best loss: 0.106677 2025-01-16 02:13:54,932 - INFO - step 21185, loss: 0.161478, best loss: 0.106677 2025-01-16 02:13:55,082 - INFO - step 21186, loss: 0.136009, best loss: 0.106677 2025-01-16 02:13:55,232 - INFO - step 21187, loss: 0.172547, best loss: 0.106677 2025-01-16 02:13:55,382 - INFO - step 21188, loss: 0.190911, best loss: 0.106677 2025-01-16 02:13:55,532 - INFO - step 21189, loss: 0.168711, best loss: 0.106677 2025-01-16 02:13:55,682 - INFO - step 21190, loss: 0.162069, best loss: 0.106677 2025-01-16 02:13:55,832 - INFO - step 21191, loss: 0.152250, best loss: 0.106677 2025-01-16 02:13:55,982 - INFO - step 21192, loss: 0.168805, best loss: 0.106677 2025-01-16 02:13:56,132 - INFO - step 21193, loss: 0.159842, best loss: 0.106677 2025-01-16 02:13:56,282 - INFO - step 21194, loss: 0.164099, best loss: 0.106677 2025-01-16 02:13:56,433 - INFO - step 21195, loss: 0.198389, best loss: 0.106677 2025-01-16 02:13:56,583 - INFO - step 21196, loss: 0.196723, best loss: 0.106677 2025-01-16 02:13:56,733 - INFO - step 21197, loss: 0.175619, best loss: 0.106677 2025-01-16 02:13:56,883 - INFO - step 21198, loss: 0.166692, best loss: 0.106677 2025-01-16 02:13:57,033 - INFO - step 21199, loss: 0.162267, best loss: 0.106677 2025-01-16 02:13:57,183 - INFO - step 21200, loss: 0.163107, best loss: 0.106677 2025-01-16 02:13:57,333 - INFO - step 21201, loss: 0.137336, best loss: 0.106677 2025-01-16 02:13:57,484 - INFO - step 21202, loss: 0.193080, best loss: 0.106677 2025-01-16 02:13:57,634 - INFO - step 21203, loss: 0.147654, best loss: 0.106677 2025-01-16 02:13:57,784 - INFO - step 21204, loss: 0.145493, best loss: 0.106677 2025-01-16 02:13:57,934 - INFO - step 21205, loss: 0.131965, best loss: 0.106677 2025-01-16 02:13:58,084 - INFO - step 21206, loss: 0.149703, best loss: 0.106677 2025-01-16 02:13:58,234 - INFO - step 21207, loss: 0.192130, best loss: 0.106677 2025-01-16 02:13:58,384 - 
INFO - step 21208, loss: 0.152967, best loss: 0.106677 2025-01-16 02:13:58,534 - INFO - step 21209, loss: 0.171078, best loss: 0.106677 2025-01-16 02:13:58,684 - INFO - step 21210, loss: 0.180106, best loss: 0.106677 2025-01-16 02:13:58,834 - INFO - step 21211, loss: 0.196485, best loss: 0.106677 2025-01-16 02:13:58,984 - INFO - step 21212, loss: 0.165788, best loss: 0.106677 2025-01-16 02:13:59,134 - INFO - step 21213, loss: 0.159914, best loss: 0.106677 2025-01-16 02:13:59,284 - INFO - step 21214, loss: 0.183179, best loss: 0.106677 2025-01-16 02:13:59,435 - INFO - step 21215, loss: 0.201336, best loss: 0.106677 2025-01-16 02:13:59,585 - INFO - step 21216, loss: 0.223349, best loss: 0.106677 2025-01-16 02:13:59,735 - INFO - step 21217, loss: 0.146608, best loss: 0.106677 2025-01-16 02:13:59,885 - INFO - step 21218, loss: 0.157170, best loss: 0.106677 2025-01-16 02:14:00,035 - INFO - step 21219, loss: 0.196192, best loss: 0.106677 2025-01-16 02:14:00,185 - INFO - step 21220, loss: 0.175329, best loss: 0.106677 2025-01-16 02:14:00,335 - INFO - step 21221, loss: 0.160003, best loss: 0.106677 2025-01-16 02:14:00,485 - INFO - step 21222, loss: 0.152661, best loss: 0.106677 2025-01-16 02:14:00,635 - INFO - step 21223, loss: 0.199077, best loss: 0.106677 2025-01-16 02:14:00,785 - INFO - step 21224, loss: 0.156248, best loss: 0.106677 2025-01-16 02:14:00,935 - INFO - step 21225, loss: 0.179701, best loss: 0.106677 2025-01-16 02:14:01,085 - INFO - step 21226, loss: 0.193390, best loss: 0.106677 2025-01-16 02:14:01,235 - INFO - step 21227, loss: 0.159104, best loss: 0.106677 2025-01-16 02:14:01,385 - INFO - step 21228, loss: 0.144622, best loss: 0.106677 2025-01-16 02:14:01,535 - INFO - step 21229, loss: 0.164249, best loss: 0.106677 2025-01-16 02:14:01,686 - INFO - step 21230, loss: 0.156967, best loss: 0.106677 2025-01-16 02:14:01,836 - INFO - step 21231, loss: 0.182549, best loss: 0.106677 2025-01-16 02:14:01,986 - INFO - step 21232, loss: 0.176949, best loss: 0.106677 
2025-01-16 02:14:02,136 - INFO - step 21233, loss: 0.168883, best loss: 0.106677 2025-01-16 02:14:02,286 - INFO - step 21234, loss: 0.180251, best loss: 0.106677 2025-01-16 02:14:02,436 - INFO - step 21235, loss: 0.172624, best loss: 0.106677 2025-01-16 02:14:02,586 - INFO - step 21236, loss: 0.181015, best loss: 0.106677 2025-01-16 02:14:02,736 - INFO - step 21237, loss: 0.159534, best loss: 0.106677 2025-01-16 02:14:02,886 - INFO - step 21238, loss: 0.187082, best loss: 0.106677 2025-01-16 02:14:03,036 - INFO - step 21239, loss: 0.156073, best loss: 0.106677 2025-01-16 02:14:03,186 - INFO - step 21240, loss: 0.183759, best loss: 0.106677 2025-01-16 02:14:03,336 - INFO - step 21241, loss: 0.177418, best loss: 0.106677 2025-01-16 02:14:03,486 - INFO - step 21242, loss: 0.180787, best loss: 0.106677 2025-01-16 02:14:03,637 - INFO - step 21243, loss: 0.187407, best loss: 0.106677 2025-01-16 02:14:03,787 - INFO - step 21244, loss: 0.188256, best loss: 0.106677 2025-01-16 02:14:03,937 - INFO - step 21245, loss: 0.143834, best loss: 0.106677 2025-01-16 02:14:04,087 - INFO - step 21246, loss: 0.153853, best loss: 0.106677 2025-01-16 02:14:04,237 - INFO - step 21247, loss: 0.181496, best loss: 0.106677 2025-01-16 02:14:04,387 - INFO - step 21248, loss: 0.148548, best loss: 0.106677 2025-01-16 02:14:04,537 - INFO - step 21249, loss: 0.284372, best loss: 0.106677 2025-01-16 02:14:04,687 - INFO - step 21250, loss: 0.165349, best loss: 0.106677 2025-01-16 02:14:04,837 - INFO - step 21251, loss: 0.191316, best loss: 0.106677 2025-01-16 02:14:04,987 - INFO - step 21252, loss: 0.227736, best loss: 0.106677 2025-01-16 02:14:05,137 - INFO - step 21253, loss: 0.189744, best loss: 0.106677 2025-01-16 02:14:05,288 - INFO - step 21254, loss: 0.194229, best loss: 0.106677 2025-01-16 02:14:05,438 - INFO - step 21255, loss: 0.139804, best loss: 0.106677 2025-01-16 02:14:05,588 - INFO - step 21256, loss: 0.177945, best loss: 0.106677 2025-01-16 02:14:05,738 - INFO - step 21257, loss: 
0.197290, best loss: 0.106677 2025-01-16 02:14:05,888 - INFO - step 21258, loss: 0.188413, best loss: 0.106677 2025-01-16 02:14:06,038 - INFO - step 21259, loss: 0.171733, best loss: 0.106677 2025-01-16 02:14:06,188 - INFO - step 21260, loss: 0.164945, best loss: 0.106677 2025-01-16 02:14:06,338 - INFO - step 21261, loss: 0.176848, best loss: 0.106677 2025-01-16 02:14:06,488 - INFO - step 21262, loss: 0.206827, best loss: 0.106677 2025-01-16 02:14:06,639 - INFO - step 21263, loss: 0.153993, best loss: 0.106677 2025-01-16 02:14:06,789 - INFO - step 21264, loss: 0.170998, best loss: 0.106677 2025-01-16 02:14:06,939 - INFO - step 21265, loss: 0.155902, best loss: 0.106677 2025-01-16 02:14:07,089 - INFO - step 21266, loss: 0.173131, best loss: 0.106677 2025-01-16 02:14:07,240 - INFO - step 21267, loss: 0.178831, best loss: 0.106677 2025-01-16 02:14:07,390 - INFO - step 21268, loss: 0.205620, best loss: 0.106677 2025-01-16 02:14:07,540 - INFO - step 21269, loss: 0.157132, best loss: 0.106677 2025-01-16 02:14:07,690 - INFO - step 21270, loss: 0.152681, best loss: 0.106677 2025-01-16 02:14:07,840 - INFO - step 21271, loss: 0.199326, best loss: 0.106677 2025-01-16 02:14:07,990 - INFO - step 21272, loss: 0.262533, best loss: 0.106677 2025-01-16 02:14:08,140 - INFO - step 21273, loss: 0.146493, best loss: 0.106677 2025-01-16 02:14:08,291 - INFO - step 21274, loss: 0.189488, best loss: 0.106677 2025-01-16 02:14:08,441 - INFO - step 21275, loss: 0.176952, best loss: 0.106677 2025-01-16 02:14:08,591 - INFO - step 21276, loss: 0.202621, best loss: 0.106677 2025-01-16 02:14:08,741 - INFO - step 21277, loss: 0.181575, best loss: 0.106677 2025-01-16 02:14:08,891 - INFO - step 21278, loss: 0.145682, best loss: 0.106677 2025-01-16 02:14:09,041 - INFO - step 21279, loss: 0.177170, best loss: 0.106677 2025-01-16 02:14:09,191 - INFO - step 21280, loss: 0.165224, best loss: 0.106677 2025-01-16 02:14:09,341 - INFO - step 21281, loss: 0.158500, best loss: 0.106677 2025-01-16 02:14:09,491 - 
INFO - step 21282, loss: 0.241069, best loss: 0.106677 2025-01-16 02:14:09,642 - INFO - step 21283, loss: 0.113274, best loss: 0.106677 2025-01-16 02:14:09,792 - INFO - step 21284, loss: 0.169575, best loss: 0.106677 2025-01-16 02:14:09,942 - INFO - step 21285, loss: 0.183590, best loss: 0.106677 2025-01-16 02:14:10,092 - INFO - step 21286, loss: 0.216089, best loss: 0.106677 2025-01-16 02:14:10,242 - INFO - step 21287, loss: 0.182797, best loss: 0.106677 2025-01-16 02:14:10,393 - INFO - step 21288, loss: 0.175862, best loss: 0.106677 2025-01-16 02:14:10,542 - INFO - step 21289, loss: 0.190586, best loss: 0.106677 2025-01-16 02:14:10,693 - INFO - step 21290, loss: 0.164534, best loss: 0.106677 2025-01-16 02:14:10,843 - INFO - step 21291, loss: 0.151798, best loss: 0.106677 2025-01-16 02:14:10,993 - INFO - step 21292, loss: 0.158159, best loss: 0.106677 2025-01-16 02:14:11,143 - INFO - step 21293, loss: 0.220224, best loss: 0.106677 2025-01-16 02:14:11,293 - INFO - step 21294, loss: 0.206652, best loss: 0.106677 2025-01-16 02:14:11,443 - INFO - step 21295, loss: 0.174279, best loss: 0.106677 2025-01-16 02:14:11,593 - INFO - step 21296, loss: 0.248450, best loss: 0.106677 2025-01-16 02:14:11,743 - INFO - step 21297, loss: 0.158860, best loss: 0.106677 2025-01-16 02:14:11,893 - INFO - step 21298, loss: 0.154391, best loss: 0.106677 2025-01-16 02:14:12,043 - INFO - step 21299, loss: 0.175485, best loss: 0.106677 2025-01-16 02:14:12,193 - INFO - step 21300, loss: 0.180272, best loss: 0.106677 2025-01-16 02:14:12,344 - INFO - step 21301, loss: 0.197477, best loss: 0.106677 2025-01-16 02:14:12,494 - INFO - step 21302, loss: 0.167007, best loss: 0.106677 2025-01-16 02:14:12,644 - INFO - step 21303, loss: 0.172914, best loss: 0.106677 2025-01-16 02:14:12,794 - INFO - step 21304, loss: 0.140544, best loss: 0.106677 2025-01-16 02:14:12,944 - INFO - step 21305, loss: 0.185214, best loss: 0.106677 2025-01-16 02:14:13,094 - INFO - step 21306, loss: 0.237762, best loss: 0.106677 
2025-01-16 02:14:13,244 - INFO - step 21307, loss: 0.189030, best loss: 0.106677 2025-01-16 02:14:13,394 - INFO - step 21308, loss: 0.139729, best loss: 0.106677 2025-01-16 02:14:13,545 - INFO - step 21309, loss: 0.143688, best loss: 0.106677 2025-01-16 02:14:13,695 - INFO - step 21310, loss: 0.168332, best loss: 0.106677 2025-01-16 02:14:13,845 - INFO - step 21311, loss: 0.176573, best loss: 0.106677 2025-01-16 02:14:13,995 - INFO - step 21312, loss: 0.181400, best loss: 0.106677 2025-01-16 02:14:14,145 - INFO - step 21313, loss: 0.170761, best loss: 0.106677 2025-01-16 02:14:14,295 - INFO - step 21314, loss: 0.170637, best loss: 0.106677 2025-01-16 02:14:14,445 - INFO - step 21315, loss: 0.200010, best loss: 0.106677 2025-01-16 02:14:14,595 - INFO - step 21316, loss: 0.175241, best loss: 0.106677 2025-01-16 02:14:14,746 - INFO - step 21317, loss: 0.213321, best loss: 0.106677 2025-01-16 02:14:14,896 - INFO - step 21318, loss: 0.194539, best loss: 0.106677 2025-01-16 02:14:15,046 - INFO - step 21319, loss: 0.155187, best loss: 0.106677 2025-01-16 02:14:15,196 - INFO - step 21320, loss: 0.156669, best loss: 0.106677 2025-01-16 02:14:15,346 - INFO - step 21321, loss: 0.176678, best loss: 0.106677 2025-01-16 02:14:15,496 - INFO - step 21322, loss: 0.183713, best loss: 0.106677 2025-01-16 02:14:15,647 - INFO - step 21323, loss: 0.163425, best loss: 0.106677 2025-01-16 02:14:15,797 - INFO - step 21324, loss: 0.156336, best loss: 0.106677 2025-01-16 02:14:15,947 - INFO - step 21325, loss: 0.197653, best loss: 0.106677 2025-01-16 02:14:16,097 - INFO - step 21326, loss: 0.208932, best loss: 0.106677 2025-01-16 02:14:16,247 - INFO - step 21327, loss: 0.145203, best loss: 0.106677 2025-01-16 02:14:16,397 - INFO - step 21328, loss: 0.139666, best loss: 0.106677 2025-01-16 02:14:16,547 - INFO - step 21329, loss: 0.153728, best loss: 0.106677 2025-01-16 02:14:16,697 - INFO - step 21330, loss: 0.165021, best loss: 0.106677 2025-01-16 02:14:16,847 - INFO - step 21331, loss: 
0.173814, best loss: 0.106677 2025-01-16 02:14:16,997 - INFO - step 21332, loss: 0.157101, best loss: 0.106677 2025-01-16 02:14:17,147 - INFO - step 21333, loss: 0.148957, best loss: 0.106677 2025-01-16 02:14:17,297 - INFO - step 21334, loss: 0.174016, best loss: 0.106677 2025-01-16 02:14:17,447 - INFO - step 21335, loss: 0.229403, best loss: 0.106677 2025-01-16 02:14:17,597 - INFO - step 21336, loss: 0.170539, best loss: 0.106677 2025-01-16 02:14:17,747 - INFO - step 21337, loss: 0.153384, best loss: 0.106677 2025-01-16 02:14:17,897 - INFO - step 21338, loss: 0.175047, best loss: 0.106677 2025-01-16 02:14:18,047 - INFO - step 21339, loss: 0.167780, best loss: 0.106677 2025-01-16 02:14:18,197 - INFO - step 21340, loss: 0.218207, best loss: 0.106677 2025-01-16 02:14:18,347 - INFO - step 21341, loss: 0.137809, best loss: 0.106677 2025-01-16 02:14:18,497 - INFO - step 21342, loss: 0.219491, best loss: 0.106677 2025-01-16 02:14:18,648 - INFO - step 21343, loss: 0.164898, best loss: 0.106677 2025-01-16 02:14:18,798 - INFO - step 21344, loss: 0.189687, best loss: 0.106677 2025-01-16 02:14:18,947 - INFO - step 21345, loss: 0.126743, best loss: 0.106677 2025-01-16 02:14:19,098 - INFO - step 21346, loss: 0.161931, best loss: 0.106677 2025-01-16 02:14:19,248 - INFO - step 21347, loss: 0.174569, best loss: 0.106677 2025-01-16 02:14:19,398 - INFO - step 21348, loss: 0.195945, best loss: 0.106677 2025-01-16 02:14:19,548 - INFO - step 21349, loss: 0.160793, best loss: 0.106677 2025-01-16 02:14:19,698 - INFO - step 21350, loss: 0.210326, best loss: 0.106677 2025-01-16 02:14:19,848 - INFO - step 21351, loss: 0.228360, best loss: 0.106677 2025-01-16 02:14:19,998 - INFO - step 21352, loss: 0.203694, best loss: 0.106677 2025-01-16 02:14:20,148 - INFO - step 21353, loss: 0.153513, best loss: 0.106677 2025-01-16 02:14:20,299 - INFO - step 21354, loss: 0.272453, best loss: 0.106677 2025-01-16 02:14:20,449 - INFO - step 21355, loss: 0.204981, best loss: 0.106677 2025-01-16 02:14:20,599 - 
INFO - step 21356, loss: 0.192190, best loss: 0.106677
2025-01-16 02:14:20,749 - INFO - step 21357, loss: 0.158186, best loss: 0.106677
2025-01-16 02:14:20,900 - INFO - step 21358, loss: 0.167134, best loss: 0.106677
2025-01-16 02:14:21,050 - INFO - step 21359, loss: 0.193269, best loss: 0.106677
2025-01-16 02:14:21,200 - INFO - step 21360, loss: 0.215860, best loss: 0.106677
2025-01-16 02:14:21,350 - INFO - step 21361, loss: 0.223997, best loss: 0.106677
2025-01-16 02:14:21,500 - INFO - step 21362, loss: 0.176953, best loss: 0.106677
2025-01-16 02:14:21,650 - INFO - step 21363, loss: 0.171953, best loss: 0.106677
2025-01-16 02:14:21,800 - INFO - step 21364, loss: 0.206284, best loss: 0.106677
2025-01-16 02:14:21,950 - INFO - step 21365, loss: 0.157394, best loss: 0.106677
2025-01-16 02:14:22,100 - INFO - step 21366, loss: 0.173875, best loss: 0.106677
2025-01-16 02:14:22,250 - INFO - step 21367, loss: 0.183835, best loss: 0.106677
2025-01-16 02:14:22,400 - INFO - step 21368, loss: 0.166660, best loss: 0.106677
2025-01-16 02:14:22,550 - INFO - step 21369, loss: 0.146221, best loss: 0.106677
2025-01-16 02:14:22,700 - INFO - step 21370, loss: 0.185027, best loss: 0.106677
2025-01-16 02:14:22,850 - INFO - step 21371, loss: 0.157108, best loss: 0.106677
2025-01-16 02:14:23,000 - INFO - step 21372, loss: 0.173609, best loss: 0.106677
2025-01-16 02:14:23,151 - INFO - step 21373, loss: 0.145460, best loss: 0.106677
2025-01-16 02:14:23,301 - INFO - step 21374, loss: 0.183235, best loss: 0.106677
2025-01-16 02:14:23,451 - INFO - step 21375, loss: 0.168392, best loss: 0.106677
2025-01-16 02:14:23,601 - INFO - step 21376, loss: 0.174562, best loss: 0.106677
2025-01-16 02:14:23,751 - INFO - step 21377, loss: 0.178052, best loss: 0.106677
2025-01-16 02:14:23,901 - INFO - step 21378, loss: 0.209569, best loss: 0.106677
2025-01-16 02:14:24,051 - INFO - step 21379, loss: 0.252141, best loss: 0.106677
2025-01-16 02:14:24,201 - INFO - step 21380, loss: 0.164784, best loss: 0.106677
2025-01-16 02:14:24,351 - INFO - step 21381, loss: 0.141566, best loss: 0.106677
2025-01-16 02:14:24,501 - INFO - step 21382, loss: 0.169379, best loss: 0.106677
2025-01-16 02:14:24,651 - INFO - step 21383, loss: 0.135990, best loss: 0.106677
2025-01-16 02:14:24,801 - INFO - step 21384, loss: 0.168319, best loss: 0.106677
2025-01-16 02:14:24,951 - INFO - step 21385, loss: 0.146769, best loss: 0.106677
2025-01-16 02:14:25,101 - INFO - step 21386, loss: 0.183582, best loss: 0.106677
2025-01-16 02:14:25,252 - INFO - step 21387, loss: 0.189418, best loss: 0.106677
2025-01-16 02:14:25,402 - INFO - step 21388, loss: 0.153531, best loss: 0.106677
2025-01-16 02:14:25,552 - INFO - step 21389, loss: 0.165586, best loss: 0.106677
2025-01-16 02:14:25,702 - INFO - step 21390, loss: 0.166279, best loss: 0.106677
2025-01-16 02:14:25,852 - INFO - step 21391, loss: 0.165404, best loss: 0.106677
2025-01-16 02:14:26,002 - INFO - step 21392, loss: 0.233022, best loss: 0.106677
2025-01-16 02:14:26,152 - INFO - step 21393, loss: 0.163160, best loss: 0.106677
2025-01-16 02:14:26,302 - INFO - step 21394, loss: 0.134709, best loss: 0.106677
2025-01-16 02:14:26,452 - INFO - step 21395, loss: 0.200584, best loss: 0.106677
2025-01-16 02:14:26,603 - INFO - step 21396, loss: 0.130879, best loss: 0.106677
2025-01-16 02:14:26,753 - INFO - step 21397, loss: 0.169693, best loss: 0.106677
2025-01-16 02:14:26,903 - INFO - step 21398, loss: 0.158965, best loss: 0.106677
2025-01-16 02:14:27,053 - INFO - step 21399, loss: 0.137109, best loss: 0.106677
2025-01-16 02:14:27,203 - INFO - step 21400, loss: 0.154726, best loss: 0.106677
2025-01-16 02:14:27,353 - INFO - step 21401, loss: 0.182001, best loss: 0.106677
2025-01-16 02:14:27,504 - INFO - step 21402, loss: 0.166909, best loss: 0.106677
2025-01-16 02:14:27,654 - INFO - step 21403, loss: 0.172648, best loss: 0.106677
2025-01-16 02:14:27,804 - INFO - step 21404, loss: 0.194920, best loss: 0.106677
2025-01-16 02:14:27,954 - INFO - step 21405, loss: 0.163113, best loss: 0.106677
2025-01-16 02:14:28,104 - INFO - step 21406, loss: 0.184315, best loss: 0.106677
2025-01-16 02:14:28,254 - INFO - step 21407, loss: 0.166977, best loss: 0.106677
2025-01-16 02:14:28,404 - INFO - step 21408, loss: 0.184344, best loss: 0.106677
2025-01-16 02:14:28,555 - INFO - step 21409, loss: 0.158894, best loss: 0.106677
2025-01-16 02:14:28,705 - INFO - step 21410, loss: 0.161171, best loss: 0.106677
2025-01-16 02:14:28,855 - INFO - step 21411, loss: 0.168089, best loss: 0.106677
2025-01-16 02:14:29,005 - INFO - step 21412, loss: 0.158262, best loss: 0.106677
2025-01-16 02:14:29,155 - INFO - step 21413, loss: 0.139146, best loss: 0.106677
2025-01-16 02:14:29,305 - INFO - step 21414, loss: 0.135829, best loss: 0.106677
2025-01-16 02:14:29,456 - INFO - step 21415, loss: 0.173997, best loss: 0.106677
2025-01-16 02:14:29,606 - INFO - step 21416, loss: 0.157403, best loss: 0.106677
2025-01-16 02:14:29,756 - INFO - step 21417, loss: 0.149146, best loss: 0.106677
2025-01-16 02:14:29,906 - INFO - step 21418, loss: 0.153383, best loss: 0.106677
2025-01-16 02:14:30,056 - INFO - step 21419, loss: 0.150939, best loss: 0.106677
2025-01-16 02:14:30,207 - INFO - step 21420, loss: 0.170531, best loss: 0.106677
2025-01-16 02:14:30,357 - INFO - step 21421, loss: 0.166481, best loss: 0.106677
2025-01-16 02:14:30,507 - INFO - step 21422, loss: 0.148243, best loss: 0.106677
2025-01-16 02:14:30,657 - INFO - step 21423, loss: 0.149566, best loss: 0.106677
2025-01-16 02:14:30,807 - INFO - step 21424, loss: 0.183882, best loss: 0.106677
2025-01-16 02:14:30,957 - INFO - step 21425, loss: 0.127817, best loss: 0.106677
2025-01-16 02:14:31,107 - INFO - step 21426, loss: 0.141879, best loss: 0.106677
2025-01-16 02:14:31,258 - INFO - step 21427, loss: 0.125051, best loss: 0.106677
2025-01-16 02:14:31,408 - INFO - step 21428, loss: 0.153068, best loss: 0.106677
2025-01-16 02:14:31,558 - INFO - step 21429, loss: 0.140472, best loss: 0.106677
2025-01-16 02:14:31,708 - INFO - step 21430, loss: 0.170892, best loss: 0.106677
2025-01-16 02:14:31,858 - INFO - step 21431, loss: 0.165460, best loss: 0.106677
2025-01-16 02:14:32,008 - INFO - step 21432, loss: 0.230134, best loss: 0.106677
2025-01-16 02:14:32,158 - INFO - step 21433, loss: 0.140208, best loss: 0.106677
2025-01-16 02:14:32,309 - INFO - step 21434, loss: 0.197003, best loss: 0.106677
2025-01-16 02:14:32,459 - INFO - step 21435, loss: 0.149782, best loss: 0.106677
2025-01-16 02:14:36,045 - INFO - step 21436, loss: 0.096971, best loss: 0.096971
2025-01-16 02:14:36,045 - INFO - Target loss reached! Training completed at step 21436
2025-01-16 02:14:36,046 - INFO - Training completed!
2025-01-16 02:14:36,046 - INFO - Final loss: 0.096971
2025-01-16 02:14:36,046 - INFO - Best loss achieved: 0.096971
2025-01-16 02:14:36,046 - INFO - Best model saved to: /kaggle/working/best_model.pth