v1_1000_STEPS_5e6_rate_01_beta_DPO / model-00002-of-00003.safetensors

Commit History

End of training
d4a5e13
verified

tsavage68 commited on