ControlLLM
/

Llama3.1-8B-OpenMath16-Instruct

Text Generation

Model card Files Files and versions

hawei commited on Jan 10

Commit

0d94b52

·

verified ·

1 Parent(s): ba08dcc

Update README.md

Files changed (1) hide show

README.md +0 -2

README.md CHANGED Viewed

@@ -91,9 +91,7 @@ The plot below highlights the alignment comparison of the model trained with Con
 The table below summarizes the evaluation results across mathematical tasks and original capabilities for various models and training approaches.
 | **Model**                | **Math Tasks**             |          |           |          | **Original Capabilities**   |         |         |           | **Overall Avg.** |
-|--------------------------|----------------------------|----------|-----------|----------|-----------------------------|---------|---------|-----------|------------------|
 |                          | **MathHard**              | **Math** | **GSM8K** | **Avg.** | **ARC**                     | **GPQA**| **MMLU**| **MMLUP** |                  |
-|--------------------------|----------------------------|----------|-----------|----------|-----------------------------|---------|---------|-----------|------------------|
 | Llama3.1-8B-Instruct     | 23.7                      | 50.9     | 85.6      | 52.1     | 83.4                        | 29.9    | 72.4    | 46.7      | 56.3             |
 | OpenMath2-Llama3.1       | 38.4                      | 64.1     | 90.3      | 64.3     | 45.8                        | 1.3     | 4.5     | 19.5      | 38.6             |
 | **Full Param Tune**       | **38.5**                  | **63.7** | 90.2      | **63.9** | 58.2                        | 1.1     | 7.3     | 23.5      | 40.1             |

 The table below summarizes the evaluation results across mathematical tasks and original capabilities for various models and training approaches.
 | **Model**                | **Math Tasks**             |          |           |          | **Original Capabilities**   |         |         |           | **Overall Avg.** |
 |                          | **MathHard**              | **Math** | **GSM8K** | **Avg.** | **ARC**                     | **GPQA**| **MMLU**| **MMLUP** |                  |
 | Llama3.1-8B-Instruct     | 23.7                      | 50.9     | 85.6      | 52.1     | 83.4                        | 29.9    | 72.4    | 46.7      | 56.3             |
 | OpenMath2-Llama3.1       | 38.4                      | 64.1     | 90.3      | 64.3     | 45.8                        | 1.3     | 4.5     | 19.5      | 38.6             |
 | **Full Param Tune**       | **38.5**                  | **63.7** | 90.2      | **63.9** | 58.2                        | 1.1     | 7.3     | 23.5      | 40.1             |