JHuel
/

Mistral-Nemo-Instruct-2407_DPO_qlora

Reinforcement Learning

Model card Files Files and versions Community

Mistral-Nemo-Instruct-2407_DPO_qlora

1 contributor

History: 8 commits

JHuel's picture

Update README.md

db7d153 verified 2 months ago