Update README.md
Browse files
README.md
CHANGED
@@ -115,7 +115,10 @@ model-index:
|
|
115 |
|
116 |
## This finetune
|
117 |
|
118 |
-
Qwen2-72B-Orpo-v0.1 is a QLoRA finetune of `Qwen/Qwen2-72B-Instruct` on 1.5k rows of `mlabonne/orpo-dpo-mix-40k`.
|
|
|
|
|
|
|
119 |
|
120 |

|
121 |
|
|
|
115 |
|
116 |
## This finetune
|
117 |
|
118 |
+
Qwen2-72B-Orpo-v0.1 is a QLoRA finetune of `Qwen/Qwen2-72B-Instruct` on 1.5k rows of `mlabonne/orpo-dpo-mix-40k`. It was trained as a generalist language model for a variety of text generation use cases, including support of agentic capabilities, roleplaying, reasoning, multi-turn conversations, long context coherence, and more.
|
119 |
+
|
120 |
+
Thanks to [mlabonne](https://huggingface.co/mlabonne), [Qwen](https://huggingface.com/Qwen), and all other contributors to the source dataset and base model.
|
121 |
+
|
122 |
|
123 |

|
124 |
|