ControlLLM
/

Llama3.1-8B-OpenMath16-Instruct

hawei commited on Jan 9

Commit

1cfd3f7

verified ·

1 Parent(s): 50abae0

Update model card with benchmark data source

Files changed (1) hide show

README.md CHANGED Viewed

@@ -12,8 +12,10 @@ model-index:
   - task:
       type: math-evaluation
     dataset:
-      type: nvidia/OpenMathInstruct-2
       name: OpenMathInstruct
     metrics:
     - name: exact_match,none
       type: exact_match
@@ -40,6 +42,8 @@ model-index:
     dataset:
       type: meta/arc-dataset
       name: Meta-ARC Dataset
     metrics:
     - name: exact_match,strict-match
       type: exact_match
@@ -68,4 +72,4 @@ model-index:
       verified: false
 ---
 # Control-LLM-Llama3.1-8B-Math16
-This is a fine-tuned model of Llama-3.1-8B-Instruct for mathematical tasks on OpenMath2 dataset.

   - task:
       type: math-evaluation
     dataset:
+      type: parquet
       name: OpenMathInstruct
+      dataset_kwargs:
+        data_files: "/home/jobuser/controlllm/inference/llm_eval_harness/additional_tasks/math/joined_math.parquet"
     metrics:
     - name: exact_match,none
       type: exact_match
     dataset:
       type: meta/arc-dataset
       name: Meta-ARC Dataset
+      dataset_path: "meta-llama/llama-3.1-8_b-instruct-evals"
+      dataset_name: "Llama-3.1-8B-Instruct-evals__arc_challenge__details"
     metrics:
     - name: exact_match,strict-match
       type: exact_match
       verified: false
 ---
 # Control-LLM-Llama3.1-8B-Math16
+This is a fine-tuned model of Llama-3.1-8B-Instruct for mathematical tasks on OpenMath2 dataset.