macadeliccc
/

Mistral-7B-v0.2-OpenHermes

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

macadeliccc commited on Mar 29, 2024

Commit

25f0062

·

verified ·

1 Parent(s): 9b718ec

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -48,7 +48,7 @@ install instructions for vllm can be found [here](https://docs.vllm.ai/en/latest
 ```bash
 python -m vllm.entrypoints.openai.api_server \
---model teknium/OpenHermes-2.5-Mistral-7B \
 --gpu-memory-utilization 0.9 \ # can go as low as 0.83-0.85 if you need a little more gpu for your application
 --max-model-len 16000 # 32000 if you can run it. This works on 4090
 --chat-template ./examples/template_chatml.jinja

 ```bash
 python -m vllm.entrypoints.openai.api_server \
+--model macadeliccc/Mistral-7B-v0.2-OpenHermes \
 --gpu-memory-utilization 0.9 \ # can go as low as 0.83-0.85 if you need a little more gpu for your application
 --max-model-len 16000 # 32000 if you can run it. This works on 4090
 --chat-template ./examples/template_chatml.jinja