Update README.md
README.md
CHANGED
@@ -32,8 +32,7 @@ python3 quantize_quark.py \
 ## Deployment
 Quark has its own export format and allows FP8 quantized models to be efficiently deployed using the SGLang backend.
 ## Evaluation
-
-The quantization evaluation results are conducted in pseudo-quantization mode, which may slightly differ from the actual quantized inference accuracy. These results are provided for reference only.
+
 #### Evaluation scores
 <table>
 <tr>
@@ -45,11 +44,11 @@ The quantization evaluation results are conducted in pseudo-quantization mode, w
 </td>
 </tr>
 <tr>
-<td>
+<td>gsm8k
 </td>
-<td>
+<td>0.821
 </td>
-<td>
+<td>0.817
 </td>
 </tr>
 </table>