Update README.md
README.md
CHANGED
@@ -22,6 +22,7 @@ Only the weights and activations of the linear operators within transformers blo
 
 FP8 computation is supported on Nvidia GPUs with compute capability >= 8.9 (Ada Lovelace, Hopper).
 
+**DISCLAIMER: Be aware that quantised models show reduced response quality and possible hallucinations!**
 
 ## Use with vLLM
 
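The FP8 hardware requirement quoted in the context line can be checked before loading the model. The snippet below is a minimal sketch, not part of this commit: the helper names `supports_fp8` and `local_gpu_supports_fp8` are hypothetical, and the comparison is inclusive because Ada Lovelace reports compute capability 8.9 and Hopper reports 9.0. It assumes PyTorch is the way the capability is queried; if PyTorch or CUDA is missing, it simply reports `False`.

```python
from typing import Tuple

# Assumption: FP8 needs at least compute capability 8.9
# (Ada Lovelace is 8.9, Hopper is 9.0).
FP8_MIN_CAPABILITY = (8, 9)


def supports_fp8(capability: Tuple[int, int]) -> bool:
    """Return True if a (major, minor) compute capability meets the FP8 minimum."""
    return tuple(capability) >= FP8_MIN_CAPABILITY


def local_gpu_supports_fp8() -> bool:
    """Check the first visible CUDA device; False if PyTorch or CUDA is unavailable."""
    try:
        import torch  # optional dependency, imported lazily
    except ImportError:
        return False
    if not torch.cuda.is_available():
        return False
    # get_device_capability returns a (major, minor) tuple, e.g. (9, 0) on Hopper
    return supports_fp8(torch.cuda.get_device_capability(0))


if __name__ == "__main__":
    print("FP8 supported on this machine:", local_gpu_supports_fp8())
```

On GPUs below this threshold (for example Ampere at 8.0 or 8.6), `supports_fp8` returns `False`, so a caller could fall back to a non-FP8 checkpoint instead.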