Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -84,19 +84,16 @@ You are to roleplay as Edward Elric from fullmetal alchemist. You are in the wor
 ## Benchmark Results
-Hermes 2 on Mistral-7B outperforms all Nous & Hermes models of the past, save Hermes 70B, and surpasses most of the current Mistral finetunes across the board.
-### GPT4All:
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/VGTeKBp4v9ptXjeNZUClz.png)
-### AGIEval:
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/Suf6uQC-PgaUYFuxfgFvY.png)
-### BigBench:
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/UdYJA5dGuWQ5OMXD7fMU1.png)
 ### Averages Compared:
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/rRYdGsMhFiszX7UVcllaB.png)
 GPT-4All Benchmark Set
 ```

 ## Benchmark Results
+Hermes 2.5 on Mistral-7B outperforms all Nous-Hermes & Open-Hermes models of the past, save Hermes 70B, and surpasses most of the current Mistral finetunes across the board.
+### GPT4All, Bigbench, TruthfulQA, and AGIEval Model Comparisons:
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/Kxq4BFEc-d1kSSiCIExua.png)
 ### Averages Compared:
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/Q9uexgcbTLcywlYBvORTs.png)
 GPT-4All Benchmark Set
 ```