trained on top scored grandmaster speedup on test grandmaster with official repo: 2.3771892594499193 vikhr-1.5b-smpo as a base model