Update README.md
Browse files
README.md
CHANGED
@@ -48,7 +48,7 @@ install instructions for vllm can be found [here](https://docs.vllm.ai/en/latest
|
|
48 |
|
49 |
```bash
|
50 |
python -m vllm.entrypoints.openai.api_server \
|
51 |
-
--model
|
52 |
--gpu-memory-utilization 0.9 \ # can go as low as 0.83-0.85 if you need a little more gpu for your application
|
53 |
--max-model-len 16000 # 32000 if you can run it. This works on 4090
|
54 |
--chat-template ./examples/template_chatml.jinja
|
|
|
48 |
|
49 |
```bash
|
50 |
python -m vllm.entrypoints.openai.api_server \
|
51 |
+
--model macadeliccc/Mistral-7B-v0.2-OpenHermes \
|
52 |
--gpu-memory-utilization 0.9 \ # can go as low as 0.83-0.85 if you need a little more gpu for your application
|
53 |
--max-model-len 16000 # 32000 if you can run it. This works on 4090
|
54 |
--chat-template ./examples/template_chatml.jinja
|