Update README.md
README.md CHANGED
@@ -129,19 +129,15 @@ We rely on [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-h
 
 | Benchmark | | |
 |----------------------------------|----------------|---------------------------|
-| | Qwen3-32B
+| | Qwen3-32B | Qwen3-32B-float8dq |
 | **General** | | |
-| mmlu |
-|
-| bbh | WIP | WIP |
+| mmlu | 80.71 | 80.67 |
+| bbh | 37.49 | 38.01 |
 | **Multilingual** | | |
-| mgsm_en_cot_en |
-| m_mmlu (avg) | WIP | WIP |
+| mgsm_en_cot_en | 64.40 | WIP |
 | **Math** | | |
-| gpqa_main_zeroshot |
-|
-| leaderboard_math_hard (v3) | WIP | WIP |
-| **Overall** | WIP | WIP |
+| gpqa_main_zeroshot | 41.96 | 42.63 |
+| **Overall** | 56.14 | WIP |
 
 <details>
 <summary> Reproduce Model Quality Results </summary>
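As the surrounding section notes, the numbers in this table come from lm-evaluation-harness. The commands below are a minimal sketch of how one row (mmlu) could be reproduced from the command line; the repo ids `Qwen/Qwen3-32B` and `pytorch/Qwen3-32B-float8dq`, and the need for `torchao` to load the quantized checkpoint, are assumptions, so defer to the actual ids and pinned setup in the README's "Reproduce Model Quality Results" section.

```bash
# Install the evaluation harness; torchao is assumed to be required
# to load the float8 dynamically quantized checkpoint.
pip install lm-eval torchao

# Baseline Qwen3-32B on MMLU.
lm_eval --model hf \
  --model_args pretrained=Qwen/Qwen3-32B \
  --tasks mmlu \
  --device cuda:0 \
  --batch_size 8

# Float8 dynamically quantized checkpoint on MMLU (repo id is illustrative).
lm_eval --model hf \
  --model_args pretrained=pytorch/Qwen3-32B-float8dq \
  --tasks mmlu \
  --device cuda:0 \
  --batch_size 8
```

The other rows use the harness task names shown in the table (`bbh`, `mgsm_en_cot_en`, `gpqa_main_zeroshot`); swap the `--tasks` argument accordingly.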