gobean
/

Mixtral-8x7B-Instruct-v0.1.llamafile

Model card Files Files and versions Community

gobean commited on Apr 3, 2024

Commit

ed482f9

·

verified ·

1 Parent(s): 26ed0c7

Update README.md

Files changed (1) hide show

README.md +8 -5

README.md CHANGED Viewed

@@ -1,11 +1,14 @@
 ---
 license: apache-2.0
 ---
-Coming soon! just learned about thebloke's quant issues, will update later.
-This is a llamafile for [Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1).
@@ -22,6 +25,6 @@ doesn't work, the file might be named something else so I had success with
 If that fails too, just navigate to  `/proc/sys/fs/binfmt_msc`  and see what files look like  `WSLInterop`  and echo a -1 to whatever it's called by changing that part of the recommended command.
-Llamafiles are a standalone executable that run an LLM server locally on a variety of operating systems.
-You just run it, open the chat interface in a browser, and interact.
-Options can be passed in to expose the api etc.

 ---
 license: apache-2.0
 ---
+This is a llamafile for [Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1).
+These were converted and quantized from source safetensors using llama.cpp on April 3, 2024.
+This matters because there are several GGUF files on HF which were created before llama.cpp's support for MoE quantization was fully debugged,
+even though it looked like it was producing working files at the time.
+I'll be uploading the quantized .gguf sources I created as well if anyone wants them as a reference or for further work.
 If that fails too, just navigate to  `/proc/sys/fs/binfmt_msc`  and see what files look like  `WSLInterop`  and echo a -1 to whatever it's called by changing that part of the recommended command.
+Llamafiles are a standalone executable that run an LLM server locally on a variety of operating systems including FreeBSD, Windows, Windows via WSL, Linux, and Mac.
+The same file works everywhere. You just download, run it, open the chat interface in a browser, and interact. Options can be passed in to expose the api etc.
+See their docs for details.