File size: 328 Bytes
36cd813 25a3b2e 36cd813 25a3b2e 36cd813 25a3b2e 36cd813 25a3b2e 36cd813 25a3b2e |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 |
---
library_name: transformers
tags:
- DPO
- reasoning
- mistral
license: apache-2.0
datasets:
- argilla/distilabel-intel-orca-dpo-pairs
pipeline_tag: text-generation
---
# Model Card for felladrin-tinymistral-248m-v4-dpo
SFT model trained with orca DPO
## Model Details
### Model Description
Experimental.
ChatML format. |