cmarkea
/

Mixtral-8x7B-Instruct-v0.1-4bit

Text Generation

Mixture of Experts

text-generation-inference

4-bit precision

Model card Files Files and versions Community

Cyrile commited on Sep 19, 2024

Commit

5d27613

·

verified ·

1 Parent(s): d654c21

Update README.md

Files changed (1) hide show

README.md +14 -14

README.md CHANGED Viewed

@@ -19,20 +19,20 @@ Evaluation of the model was conducted using the PoLL (Pool of LLM) technique, as
 (two per evaluator). The evaluators included GPT-4o, Gemini-1.5-pro, and Claude3.5-sonnet.
 Performance Scores (on a scale of 5):
-| Model                                        | Score   | # params (Billion) | size (GB) |
-|---------------------------------------------:|:-------:|:------------------:|:---------:|
-| gpt-4o                                       | 4.13    | N/A                | N/A       |
-| mistralai/Mixtral-8x7B-Instruct-v0.1         | 3.71    | 46.7               | 93.4      |
-| **cmarkea/Mixtral-8x7B-Instruct-v0.1-4bit**  | 3.68    | 46.7               | 23.35     |
-| gpt-3.5-turbo                                | 3.66    | 175                | 350       |
-| TheBloke/Mixtral-8x7B-Instruct-v0.1-GPTQ     | 3.56    | 46.7               | 46.7      |
-| mistralai/Mistral-7B-Instruct-v0.2           | 1.98    | 7.25               | 14.5      |
-| cmarkea/bloomz-7b1-mt-sft-chat               | 1.69    | 7.07               | 14.14     |
-| cmarkea/bloomz-3b-dpo-chat                   | 1.68    | 3                  | 6         |
-| cmarkea/bloomz-3b-sft-chat                   | 1.51    | 3                  | 6         |
-| croissantllm/CroissantLLMChat-v0.1           | 1.19    | 1.3                | 2.7       |
-| cmarkea/bloomz-560m-sft-chat                 | 1.04    | 0.56               | 1.12      |
-| OpenLLM-France/Claire-Mistral-7B-0.1         | 0.38    | 7.25               | 14.5      |
 The impact of quantization is negligible.

 (two per evaluator). The evaluators included GPT-4o, Gemini-1.5-pro, and Claude3.5-sonnet.
 Performance Scores (on a scale of 5):
+| Model                                        | Score    | # params (Billion) | size (GB) |
+|---------------------------------------------:|:--------:|:------------------:|:---------:|
+| gpt-4o                                       | 4.13     | N/A                | N/A       |
+| mistralai/Mixtral-8x7B-Instruct-v0.1         | 3.71     | 46.7               | 93.4      |
+| **cmarkea/Mixtral-8x7B-Instruct-v0.1-4bit**  | **3.68** | **46.7**           | **23.35** |
+| gpt-3.5-turbo                                | 3.66     | 175                | 350       |
+| TheBloke/Mixtral-8x7B-Instruct-v0.1-GPTQ     | 3.56     | 46.7               | 46.7      |
+| mistralai/Mistral-7B-Instruct-v0.2           | 1.98     | 7.25               | 14.5      |
+| cmarkea/bloomz-7b1-mt-sft-chat               | 1.69     | 7.07               | 14.14     |
+| cmarkea/bloomz-3b-dpo-chat                   | 1.68     | 3                  | 6         |
+| cmarkea/bloomz-3b-sft-chat                   | 1.51     | 3                  | 6         |
+| croissantllm/CroissantLLMChat-v0.1           | 1.19     | 1.3                | 2.7       |
+| cmarkea/bloomz-560m-sft-chat                 | 1.04     | 0.56               | 1.12      |
+| OpenLLM-France/Claire-Mistral-7B-0.1         | 0.38     | 7.25               | 14.5      |
 The impact of quantization is negligible.