open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-08T11-10-18.504078.json with huggingface_hub
492998f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.26/aimo_kaggle_hard/results_2024-06-08T09-45-06.157549.json with huggingface_hub
eae9906
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.26/aimo_kaggle_medium/results_2024-06-08T09-44-55.801095.json with huggingface_hub
ea71c2f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.27/aimo_kaggle_medium/results_2024-06-08T09-43-34.879678.json with huggingface_hub
2c43624
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.27/aimo_kaggle_hard/results_2024-06-08T09-43-16.416429.json with huggingface_hub
b4686f3
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.23/aimo_kaggle_hard/results_2024-06-08T09-32-23.677766.json with huggingface_hub
bef77ce
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.23/aimo_kaggle_medium/results_2024-06-08T09-32-08.382575.json with huggingface_hub
8867a8e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.28/aimo_kaggle_medium/results_2024-06-08T09-31-55.673827.json with huggingface_hub
9d4b0a1
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.28/aimo_kaggle_hard/results_2024-06-08T09-31-51.824312.json with huggingface_hub
365c2bb
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.12/aimo_kaggle_hard/results_2024-06-08T09-14-40.330477.json with huggingface_hub
3023c1b
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.12/aimo_kaggle_medium/results_2024-06-08T09-14-01.087112.json with huggingface_hub
441fd67
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.31/aimo_kaggle_hard_pot/results_2024-06-08T08-07-49.076149.json with huggingface_hub
fabae68
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.29/aimo_kaggle_hard_pot/results_2024-06-08T07-50-07.161021.json with huggingface_hub
4ef3754
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.31/aimo_kaggle_medium_pot/results_2024-06-08T07-47-34.945814.json with huggingface_hub
ca11903
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.18/aimo_kaggle_medium/results_2024-06-08T07-36-34.975945.json with huggingface_hub
7952109
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.30/aimo_kaggle_hard_pot/results_2024-06-08T07-36-25.842843.json with huggingface_hub
ba861f1
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.18/aimo_kaggle_hard/results_2024-06-08T07-35-55.927304.json with huggingface_hub
711ee40
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.15/aimo_kaggle_hard/results_2024-06-08T07-35-27.256528.json with huggingface_hub
43afcea
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.15/aimo_kaggle_medium/results_2024-06-08T07-35-22.821895.json with huggingface_hub
b2d509a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.29/aimo_kaggle_medium_pot/results_2024-06-08T07-25-41.843259.json with huggingface_hub
07bb11a
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.30/aimo_kaggle_medium_pot/results_2024-06-08T07-24-00.792820.json with huggingface_hub
f26e5f6
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.24/aimo_kaggle_medium/results_2024-06-08T07-01-04.390649.json with huggingface_hub
6e06b08
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.27/aimo_kaggle_hard_pot/results_2024-06-08T07-00-59.123923.json with huggingface_hub
c7bf2bc
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.24/aimo_kaggle_hard/results_2024-06-08T07-00-54.420302.json with huggingface_hub
61df9e6
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.28/aimo_kaggle_hard_pot/results_2024-06-08T06-59-44.155533.json with huggingface_hub
8f25fe7
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.37/aimo_kaggle_hard/results_2024-06-08T06-50-02.535733.json with huggingface_hub
d6c4d0e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.37/aimo_kaggle_medium/results_2024-06-08T06-49-51.426788.json with huggingface_hub
9aa9508
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.34/aimo_kaggle_hard/results_2024-06-08T06-46-23.842961.json with huggingface_hub
ae8b2f2
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.34/aimo_kaggle_medium/results_2024-06-08T06-46-05.034713.json with huggingface_hub
4e74b8e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.33/aimo_kaggle_hard/results_2024-06-08T06-43-47.675530.json with huggingface_hub
76456f8
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.33/aimo_kaggle_medium/results_2024-06-08T06-43-41.652288.json with huggingface_hub
4e37a7a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.26/aimo_kaggle_hard_pot/results_2024-06-08T06-42-56.854816.json with huggingface_hub
abb16de
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.28/aimo_kaggle_medium_pot/results_2024-06-08T06-40-27.560584.json with huggingface_hub
b316d2e
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.27/aimo_kaggle_medium_pot/results_2024-06-08T06-39-31.726755.json with huggingface_hub
2d17701
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.31/aimo_kaggle_hard/results_2024-06-08T06-39-24.218088.json with huggingface_hub
0ea51ee
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.31/aimo_kaggle_medium/results_2024-06-08T06-38-46.787947.json with huggingface_hub
7f81bb5
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.29/aimo_kaggle_hard/results_2024-06-08T06-34-55.619694.json with huggingface_hub
ea42f84
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.29/aimo_kaggle_medium/results_2024-06-08T06-34-49.129609.json with huggingface_hub
cd5d44c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.19/aimo_kaggle_medium/results_2024-06-08T06-30-19.806902.json with huggingface_hub
a0d6556
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.19/aimo_kaggle_hard/results_2024-06-08T06-30-19.467876.json with huggingface_hub
3a8cc70
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.26/aimo_kaggle_medium_pot/results_2024-06-08T06-15-10.610754.json with huggingface_hub
de9dc2f
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.24/aimo_kaggle_hard_pot/results_2024-06-08T06-12-55.885231.json with huggingface_hub
a45bb52
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.23/aimo_kaggle_hard_pot/results_2024-06-08T06-12-22.717919.json with huggingface_hub
524036f
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.22/aimo_kaggle_hard_pot/results_2024-06-08T05-58-41.856863.json with huggingface_hub
1a65ff9
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.24/aimo_kaggle_medium_pot/results_2024-06-08T05-56-56.085049.json with huggingface_hub
817a989
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.23/aimo_kaggle_medium_pot/results_2024-06-08T05-50-51.873250.json with huggingface_hub
2fc68e6
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.40/aimo_kaggle_hard/results_2024-06-08T05-38-38.203732.json with huggingface_hub
472747f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.40/aimo_kaggle_medium/results_2024-06-08T05-37-32.637446.json with huggingface_hub
bfd319e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.21/aimo_kaggle_hard_pot/results_2024-06-08T05-32-59.251832.json with huggingface_hub
29c5c4e
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.22/aimo_kaggle_medium_pot/results_2024-06-08T05-30-48.920199.json with huggingface_hub
5104588
verified

lewtun HF staff commited on