dfurman committed on
Commit b5462e2 · verified · 1 Parent(s): 514c6b8

Update README.md

Files changed (1)
  1. README.md +16 -1
README.md CHANGED
@@ -14,4 +14,19 @@ base_model:
  - Qwen/Qwen2-72B-Instruct
  ---
 
- # dfurman/Qwen2-72B-Orpo-v0.1
+ # dfurman/Qwen2-72B-Orpo-v0.1
+
+ ## Introduction
+
+ Qwen2 is the new series of Qwen large language models. For Qwen2, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters, including a Mixture-of-Experts model. This repo contains the instruction-tuned 72B Qwen2 model.
+
+ Compared with state-of-the-art open-source language models, including the previously released Qwen1.5, Qwen2 has generally surpassed most open-source models and demonstrated competitiveness against proprietary models across a series of benchmarks targeting language understanding, language generation, multilingual capability, coding, mathematics, and reasoning.
+
+ Qwen2-72B-Instruct supports a context length of up to 131,072 tokens, enabling the processing of extensive inputs. Please refer to [this section](#processing-long-texts) for detailed instructions on how to deploy Qwen2 for handling long texts.
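For reference, the long-text deployment mentioned above typically involves enabling YaRN rope scaling. A sketch of the `rope_scaling` entry added to the model's `config.json`, based on the upstream Qwen2-72B-Instruct card (values shown are the ones documented there for that model, not something specific to this finetune):

```json
{
  "rope_scaling": {
    "factor": 4.0,
    "original_max_position_embeddings": 32768,
    "type": "yarn"
  }
}
```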
+
+ For more details, please refer to our [blog](https://qwenlm.github.io/blog/qwen2/), [GitHub](https://github.com/QwenLM/Qwen2), and [Documentation](https://qwen.readthedocs.io/en/latest/).
+ <br>
+
+ ## This finetune
+
+ Qwen2-72B-Orpo-v0.1 is a QLoRA finetune of `Qwen/Qwen2-72B-Instruct` on 1.5k rows of `mlabonne/orpo-dpo-mix-40k`.
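As context for prompting this finetune: Qwen2-Instruct models use the ChatML chat template. A minimal sketch of what that prompt layout looks like when built by hand (`build_chatml_prompt` is an illustrative helper, not part of this repo; in practice, `tokenizer.apply_chat_template` from `transformers` produces this for you):

```python
def build_chatml_prompt(system: str, user: str) -> str:
    """Build a single-turn ChatML prompt as used by Qwen2-Instruct models.

    Each turn is delimited by <|im_start|>{role} ... <|im_end|>; the prompt
    ends with an opened assistant turn for the model to complete.
    """
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{user}<|im_end|>\n"
        "<|im_start|>assistant\n"
    )

prompt = build_chatml_prompt("You are a helpful assistant.", "What is ORPO?")
print(prompt)
```

In normal use you would pass a list of `{"role": ..., "content": ...}` messages to the tokenizer's `apply_chat_template` method rather than formatting the string manually.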