Ollama/llama.cpp support
#10 opened by dpreti
We observed that, at the moment, the model is not served correctly with Ollama and llama.cpp. We are currently investigating the reasons behind this unexpected behavior.
In the meantime, we strongly suggest serving the model with vLLM or the Transformers library, as shown in the model card.
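For reference, below is a minimal sketch of loading the model with the Transformers library. The repository ID Almawave/Velvet-14B and the use of the tokenizer's chat template are assumptions based on this repository, so please refer to the model card for the exact, recommended snippet.

```python
# Minimal sketch (assumptions: repository ID "Almawave/Velvet-14B" and a chat template
# defined in the tokenizer; see the model card for the authoritative example).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Almawave/Velvet-14B"  # assumed repository name

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # half precision so the 14B weights fit on a single large GPU
    device_map="auto",
)

messages = [{"role": "user", "content": "Qual è la capitale d'Italia?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```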
The Velvet-14B model has now been released on the Ollama library. It can be used in its q4_K_M quantized version with the command:
ollama run Almawave/velvet:14b
Other versions, including q8_0 and fp16, are available and can be found here:
https://ollama.com/Almawave/Velvet
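For completeness, once one of these builds has been pulled, it can also be queried programmatically through the official Ollama Python client. This is only a sketch, assuming the q4_K_M tag shown above and that the ollama package (pip install ollama) is installed.

```python
# Sketch only: query the locally served model via the Ollama Python client.
# Assumes the model was already pulled with `ollama run Almawave/velvet:14b`.
import ollama

response = ollama.chat(
    model="Almawave/velvet:14b",
    messages=[{"role": "user", "content": "Qual è la capitale d'Italia?"}],
)
print(response["message"]["content"])
```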
dpreti changed discussion status to closed