license: apache-2.0

This is a llamafile for WizardLM-2-7B.

Converted and tested on 4/15/2024.

The q3-k-l quant is under 4 GB, if you want something to share with your Windows-only users.

-= Llamafile =-

A llamafile is a standalone executable that runs an LLM server locally on a variety of operating systems, including FreeBSD, Windows, Windows via WSL, Linux, and Mac. The same file works everywhere; I've tested several of these on FreeBSD, Windows, Windows via WSL, and Linux. You just download the .llamafile, make it runnable (chmod +x, or rename it to .exe on Windows), run it, open the chat interface in a browser, and interact. Options can be passed in to expose the API, etc. See the llamafile docs for details.
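The download-and-run steps above can be sketched as a small shell helper for Linux, WSL, FreeBSD, or Mac. This is only an illustration: `run_llamafile` is a hypothetical name, the file name in the usage line is a placeholder for whichever quant you downloaded, and any extra arguments are passed through unchanged (check the llamafile docs for the actual option names).

```shell
# Sketch only: run_llamafile is a hypothetical helper, not part of llamafile.
# Usage: run_llamafile /path/to/model.llamafile [options passed to the llamafile]
run_llamafile() {
    file="$1"
    shift
    # Mark the downloaded file as executable, then start it with any
    # remaining arguments passed through untouched.
    chmod +x "$file" && "$file" "$@"
}
```

For example, `run_llamafile ./your-download.llamafile` starts the server, after which you open the chat interface in a browser.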

Mozilla Blog Announcement for Llamafile

  • Windows note: Windows won't run executables larger than 4 GB, so if the file is over 4 GB and you want to use it on Windows, you'll have to run it from WSL.

  • WSL note: If you get the error about APE, and the recommended command

    sudo sh -c 'echo -1 > /proc/sys/fs/binfmt_misc/WSLInterop'

    doesn't work, the WSLInterop file might be named something else. I had success with

    sudo sh -c 'echo -1 > /proc/sys/fs/binfmt_misc/WSLInterop-late'

    If that fails too, navigate to /proc/sys/fs/binfmt_misc, see which files have names that look like WSLInterop, and echo a -1 to each of them by changing that part of the recommended command.

  • FreeBSD note: Yes, it actually works on a fresh install of FreeBSD.
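For the WSL note above, here is a minimal sketch that writes -1 to every entry whose name starts with WSLInterop, so you don't have to guess the exact file name. `disable_wslinterop` is a hypothetical helper name, and the directory argument exists only so the function can be tried against a scratch directory first. Against the real /proc/sys/fs/binfmt_misc it has to run as root (for example from a root shell).

```shell
# Sketch only: disable_wslinterop is a hypothetical helper, not part of WSL
# or llamafile. It writes -1 to each WSLInterop* entry in the given directory
# (default: /proc/sys/fs/binfmt_misc). Needs root for the real directory.
disable_wslinterop() {
    dir="${1:-/proc/sys/fs/binfmt_misc}"
    for entry in "$dir"/WSLInterop*; do
        # Skip the literal pattern when nothing matches.
        [ -e "$entry" ] || continue
        echo "disabling $entry"
        echo -1 > "$entry"
    done
}
```

Calling `disable_wslinterop` with no argument targets the real binfmt_misc directory; it prints each entry it touches, so you can see what the interop file is actually called on your system.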