Update README.md
Browse files
README.md
CHANGED
@@ -27,7 +27,7 @@ widget:
|
|
27 |
It uses dynamic quantization for lighter deployment and faster inference.
|
28 |
|
29 |
Original model: **float16**, ~6.4GB
|
30 |
-
Quantized model: **int8 dynamic**, ~6.4GB
|
31 |
|
32 |
## ⚡️ Quickstart
|
33 |
|
|
|
27 |
It uses dynamic quantization for lighter deployment and faster inference.
|
28 |
|
29 |
Original model: **float16**, ~6.4GB
|
30 |
+
Quantized model: **int8 dynamic**, ~6.4GB, ~20% faster inference
|
31 |
|
32 |
## ⚡️ Quickstart
|
33 |
|