# Model Card for Phoenix
**Phoenix** is a model trained using Direct Preference Optimization (DPO). This is the first version, trained following the process of the alignment-handbook from Hugging Face.

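DPO skips RLHF's separate reward model and optimizes the policy directly on preference pairs. As a minimal illustrative sketch (not the actual alignment-handbook training code; `beta` and the log-probability values below are stand-ins), the per-pair loss can be written as:

```python
import math

def dpo_loss(policy_chosen_logp: float, policy_rejected_logp: float,
             ref_chosen_logp: float, ref_rejected_logp: float,
             beta: float = 0.1) -> float:
    """DPO loss for one preference pair.

    Each argument is the total log-probability of the chosen or rejected
    response under the trained policy or the frozen reference model.
    """
    chosen_ratio = policy_chosen_logp - ref_chosen_logp
    rejected_ratio = policy_rejected_logp - ref_rejected_logp
    logits = beta * (chosen_ratio - rejected_ratio)
    # -log(sigmoid(logits)): the loss shrinks as the policy assigns
    # relatively more probability to the chosen response.
    return -math.log(1.0 / (1.0 + math.exp(-logits)))
```

When the policy and reference agree, the loss is `log 2`; it drops below that as the policy starts preferring the chosen answer.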
In contrast to Zephyr and Notus, this model was trained using German instruction and DPO data. Specifically, a German translation of HuggingFaceH4/ultrachat_200k and HuggingFaceH4/ultrafeedback_binarized was created, in addition to a series of instruction datasets. The LLM haoranxu/ALMA-13B was used for the translation.

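For DPO, the feedback data is stored as preference pairs rather than plain completions. A hypothetical record sketching the `prompt`/`chosen`/`rejected` layout of HuggingFaceH4/ultrafeedback_binarized (the German example text here is invented for illustration):

```python
# Hedged sketch: the shape of one preference record as consumed by DPO
# training. Column names follow HuggingFaceH4/ultrafeedback_binarized;
# the German contents are invented examples.
record = {
    "prompt": "Was ist die Hauptstadt von Deutschland?",
    "chosen": [
        {"role": "user", "content": "Was ist die Hauptstadt von Deutschland?"},
        {"role": "assistant", "content": "Die Hauptstadt von Deutschland ist Berlin."},
    ],
    "rejected": [
        {"role": "user", "content": "Was ist die Hauptstadt von Deutschland?"},
        {"role": "assistant", "content": "Deutschland hat keine Hauptstadt."},
    ],
}

def is_valid_pair(rec: dict) -> bool:
    """Check that a record carries distinct preferred and dispreferred replies."""
    return (
        isinstance(rec["prompt"], str)
        and rec["chosen"][-1]["role"] == "assistant"
        and rec["rejected"][-1]["role"] == "assistant"
        and rec["chosen"][-1]["content"] != rec["rejected"][-1]["content"]
    )
```

The same structure applies to the translated German pairs described above.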
While the Mistral model performs really well, it is not well suited to the German language. Therefore we used the fantastic LeoLM/leo-mistral-hessianai-7b.

Thanks to the new type of training, Phoenix is not only able to compete with the Mistral model from LeoLM but also **beats the Llama-70b-chat model in 2 MT-Bench categories**.

This model **wouldn't have been possible without the amazing work of Hugging Face, LeoLM, openbnb, Argilla, the ALMA team, and many others in the AI community**.

I would like to personally thank all AI researchers who make the training of such models possible.

## MT-Bench-DE Scores
Phoenix beats the LeoLM-Mistral model in all categories except for coding and humanities.