trained on top-scored grandmaster
speedup on test grandmaster with official repo: 2.3771892594499193
vikhr-1.5b-smpo as a base model