open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-08T11-11-56.054920.json with huggingface_hub
67f172f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-08T11-11-46.501705.json with huggingface_hub
5bfafcb
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-08T11-11-29.220999.json with huggingface_hub
ae822c9
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-08T11-11-24.243296.json with huggingface_hub
1d7d9d5
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-08T11-11-20.947090.json with huggingface_hub
7811ca8
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-08T11-11-18.757860.json with huggingface_hub
5fe887c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-08T11-11-09.814885.json with huggingface_hub
b419469
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-08T11-10-36.509469.json with huggingface_hub
c2cda35
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-08T11-10-32.393806.json with huggingface_hub
9851eee
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-08T11-10-18.625315.json with huggingface_hub
74a5853
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-08T11-10-18.504078.json with huggingface_hub
492998f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.26/aimo_kaggle_hard/results_2024-06-08T09-45-06.157549.json with huggingface_hub
eae9906
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.26/aimo_kaggle_medium/results_2024-06-08T09-44-55.801095.json with huggingface_hub
ea71c2f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.27/aimo_kaggle_medium/results_2024-06-08T09-43-34.879678.json with huggingface_hub
2c43624
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.27/aimo_kaggle_hard/results_2024-06-08T09-43-16.416429.json with huggingface_hub
b4686f3
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.23/aimo_kaggle_hard/results_2024-06-08T09-32-23.677766.json with huggingface_hub
bef77ce
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.23/aimo_kaggle_medium/results_2024-06-08T09-32-08.382575.json with huggingface_hub
8867a8e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.28/aimo_kaggle_medium/results_2024-06-08T09-31-55.673827.json with huggingface_hub
9d4b0a1
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.28/aimo_kaggle_hard/results_2024-06-08T09-31-51.824312.json with huggingface_hub
365c2bb
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.12/aimo_kaggle_hard/results_2024-06-08T09-14-40.330477.json with huggingface_hub
3023c1b
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.12/aimo_kaggle_medium/results_2024-06-08T09-14-01.087112.json with huggingface_hub
441fd67
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.31/aimo_kaggle_hard_pot/results_2024-06-08T08-07-49.076149.json with huggingface_hub
fabae68
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.29/aimo_kaggle_hard_pot/results_2024-06-08T07-50-07.161021.json with huggingface_hub
4ef3754
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.31/aimo_kaggle_medium_pot/results_2024-06-08T07-47-34.945814.json with huggingface_hub
ca11903
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.18/aimo_kaggle_medium/results_2024-06-08T07-36-34.975945.json with huggingface_hub
7952109
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.30/aimo_kaggle_hard_pot/results_2024-06-08T07-36-25.842843.json with huggingface_hub
ba861f1
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.18/aimo_kaggle_hard/results_2024-06-08T07-35-55.927304.json with huggingface_hub
711ee40
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.15/aimo_kaggle_hard/results_2024-06-08T07-35-27.256528.json with huggingface_hub
43afcea
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.15/aimo_kaggle_medium/results_2024-06-08T07-35-22.821895.json with huggingface_hub
b2d509a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.29/aimo_kaggle_medium_pot/results_2024-06-08T07-25-41.843259.json with huggingface_hub
07bb11a
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.30/aimo_kaggle_medium_pot/results_2024-06-08T07-24-00.792820.json with huggingface_hub
f26e5f6
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.24/aimo_kaggle_medium/results_2024-06-08T07-01-04.390649.json with huggingface_hub
6e06b08
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.27/aimo_kaggle_hard_pot/results_2024-06-08T07-00-59.123923.json with huggingface_hub
c7bf2bc
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.24/aimo_kaggle_hard/results_2024-06-08T07-00-54.420302.json with huggingface_hub
61df9e6
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.28/aimo_kaggle_hard_pot/results_2024-06-08T06-59-44.155533.json with huggingface_hub
8f25fe7
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.37/aimo_kaggle_hard/results_2024-06-08T06-50-02.535733.json with huggingface_hub
d6c4d0e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.37/aimo_kaggle_medium/results_2024-06-08T06-49-51.426788.json with huggingface_hub
9aa9508
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.34/aimo_kaggle_hard/results_2024-06-08T06-46-23.842961.json with huggingface_hub
ae8b2f2
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.34/aimo_kaggle_medium/results_2024-06-08T06-46-05.034713.json with huggingface_hub
4e74b8e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.33/aimo_kaggle_hard/results_2024-06-08T06-43-47.675530.json with huggingface_hub
76456f8
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.33/aimo_kaggle_medium/results_2024-06-08T06-43-41.652288.json with huggingface_hub
4e37a7a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.26/aimo_kaggle_hard_pot/results_2024-06-08T06-42-56.854816.json with huggingface_hub
abb16de
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.28/aimo_kaggle_medium_pot/results_2024-06-08T06-40-27.560584.json with huggingface_hub
b316d2e
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.27/aimo_kaggle_medium_pot/results_2024-06-08T06-39-31.726755.json with huggingface_hub
2d17701
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.31/aimo_kaggle_hard/results_2024-06-08T06-39-24.218088.json with huggingface_hub
0ea51ee
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.31/aimo_kaggle_medium/results_2024-06-08T06-38-46.787947.json with huggingface_hub
7f81bb5
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.29/aimo_kaggle_hard/results_2024-06-08T06-34-55.619694.json with huggingface_hub
ea42f84
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.29/aimo_kaggle_medium/results_2024-06-08T06-34-49.129609.json with huggingface_hub
cd5d44c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.19/aimo_kaggle_medium/results_2024-06-08T06-30-19.806902.json with huggingface_hub
a0d6556
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.19/aimo_kaggle_hard/results_2024-06-08T06-30-19.467876.json with huggingface_hub
3a8cc70
verified

edbeeching HF staff commited on