Qwen2.5-0.5B-Open-R1-Distill / train_results.json
Commit 9596bf4 (verified) by herman66: Model save
{
  "total_flos": 1.900575720430633e+17,
  "train_loss": 1.19111062203986,
  "train_runtime": 26492.3013,
  "train_samples": 16610,
  "train_samples_per_second": 6.525,
  "train_steps_per_second": 0.408
}
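
The raw figures are easier to read once converted into a few derived quantities. The following is a minimal sketch, not part of the repository: it assumes `train_runtime` is in seconds (the usual convention for Hugging Face Trainer metrics) and that both throughput figures are averaged over that same runtime. The `derive` helper and its output key names are hypothetical, chosen only for illustration. The "samples per step" ratio is only an estimate of the effective global batch size, and "samples processed" need not equal `train_samples` times the number of epochs if the training script counted samples differently (for example, after sequence packing).

```python
import json

# Metrics copied verbatim from train_results.json above.
RAW = """
{
  "total_flos": 1.900575720430633e+17,
  "train_loss": 1.19111062203986,
  "train_runtime": 26492.3013,
  "train_samples": 16610,
  "train_samples_per_second": 6.525,
  "train_steps_per_second": 0.408
}
"""


def derive(metrics: dict) -> dict:
    """Compute rough derived quantities from the reported metrics.

    Assumption: train_runtime is in seconds and the two *_per_second
    values are averages over that same runtime.
    """
    runtime = metrics["train_runtime"]
    samples_per_s = metrics["train_samples_per_second"]
    steps_per_s = metrics["train_steps_per_second"]
    return {
        "runtime_hours": runtime / 3600,
        "approx_total_steps": steps_per_s * runtime,
        "approx_samples_processed": samples_per_s * runtime,
        # Ratio of the two throughputs: an estimate of samples per optimizer step.
        "approx_samples_per_step": samples_per_s / steps_per_s,
        "approx_flops_per_second": metrics["total_flos"] / runtime,
    }


metrics = json.loads(RAW)
derived = derive(metrics)
for name, value in derived.items():
    print(f"{name}: {value:.4g}")
```

Because the reported throughputs are rounded to three decimal places, the derived step and sample counts are approximate; treat them as sanity checks rather than exact values.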