RobAgrees commited on
Commit
8eb0be6
·
verified ·
1 Parent(s): e9325c8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -27,7 +27,7 @@ widget:
27
  It uses dynamic quantization for lighter deployment and faster inference.
28
 
29
  Original model: **float16**, ~6.4GB
30
- Quantized model: **int8 dynamic**, ~6.4GB
31
 
32
  ## ⚡️ Quickstart
33
 
 
27
  It uses dynamic quantization for lighter deployment and faster inference.
28
 
29
  Original model: **float16**, ~6.4GB
30
+ Quantized model: **int8 dynamic**, ~6.4GB, ~20% faster inference
31
 
32
  ## ⚡️ Quickstart
33