DRXD1000/Phoenix-7B

DRXD1000 committed on Jan 10, 2024

Commit f07212d · 1 Parent(s): 65131bf

Update README.md

Files changed (1)

README.md +20 -0

@@ -17,6 +17,26 @@ and HuggingFaceH4/ultrafeedback_binarized was created in addition to a series of
 Thanks to the new type of training, Phoenix is not only able to compete with the Mistral model from LeoLM but also **beats the Llama-70b-chat model in 2 mt-bench categories**. This model **wouldn't have been possible without the amazing work of Huggingface, LeoLM, openbnb, Argilla, the Alma-Team and many others of the AI community**.
 ## MT-Bench-DE Scores
+Phoenix beats the LeoLM-Mistral model in all categories except for coding and humanities.
+It also beats LeoLM/Llama-2-70b-chat in roleplay and reasoning, which shows the power of DPO.
+```
+{
+    "first_turn": 6.39375,
+    "second_turn": 5.1625,
+    "categories": {
+        "writing": 7.45,
+        "roleplay": 7.9,
+        "reasoning": 4.3,
+        "math": 3.25,
+        "coding": 2.5,
+        "extraction": 5.9,
+        "stem": 7.125,
+        "humanities": 7.8
+    },
+    "average": 5.778124999999999
+}
+```
 ## Model Details
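
The scores added in this commit can be sanity-checked with a few lines of Python. The sketch below is not part of the README or of any MT-Bench tooling; the helper names (`category_mean`, `turn_mean`, `ranked_categories`) are hypothetical and chosen only for illustration. It assumes that `average` is the unweighted mean of the eight category scores, which the numbers are consistent with, and it compares floats with a tolerance because the reported value (`5.778124999999999`) carries floating-point rounding.

```python
import json
import math

# Scores copied verbatim from the JSON block added in this commit.
REPORT = json.loads("""
{
    "first_turn": 6.39375,
    "second_turn": 5.1625,
    "categories": {
        "writing": 7.45,
        "roleplay": 7.9,
        "reasoning": 4.3,
        "math": 3.25,
        "coding": 2.5,
        "extraction": 5.9,
        "stem": 7.125,
        "humanities": 7.8
    },
    "average": 5.778124999999999
}
""")


def category_mean(report: dict) -> float:
    """Unweighted mean of the per-category scores (assumed aggregation)."""
    values = list(report["categories"].values())
    return sum(values) / len(values)


def turn_mean(report: dict) -> float:
    """Mean of the first-turn and second-turn scores."""
    return (report["first_turn"] + report["second_turn"]) / 2


def ranked_categories(report: dict) -> list:
    """Categories as (name, score) pairs, sorted from strongest to weakest."""
    return sorted(report["categories"].items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    # Both aggregations should agree with the reported average up to rounding.
    print("category mean matches:",
          math.isclose(category_mean(REPORT), REPORT["average"], rel_tol=1e-9))
    print("turn mean matches:",
          math.isclose(turn_mean(REPORT), REPORT["average"], rel_tol=1e-9))
    for name, score in ranked_categories(REPORT):
        print(f"{name:<12}{score}")
```

Sorting the categories this way makes the gap between the strongest areas (roleplay, humanities, writing) and the weakest (coding, math) easy to see at a glance.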