open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.23/aimo_kaggle_medium_pot/results_2024-06-09T18-47-05.048895.json with huggingface_hub
4af2607
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.24/aimo_kaggle_hard_pot/results_2024-06-09T18-45-45.726335.json with huggingface_hub
235837a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.24/aimo_kaggle_medium_pot/results_2024-06-09T18-39-48.926794.json with huggingface_hub
c9bced8
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.21/aimo_kaggle_hard_pot/results_2024-06-09T18-34-49.367127.json with huggingface_hub
db06ff8
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.21/aimo_kaggle_medium_pot/results_2024-06-09T18-32-48.067713.json with huggingface_hub
21affea
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.20/aimo_kaggle_hard_pot/results_2024-06-09T18-32-07.026805.json with huggingface_hub
e7806d7
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.18/aimo_kaggle_medium_pot/results_2024-06-09T18-26-24.612005.json with huggingface_hub
a87cceb
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.20/aimo_kaggle_medium_pot/results_2024-06-09T18-26-08.491117.json with huggingface_hub
fafa03f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.18/aimo_kaggle_hard_pot/results_2024-06-09T18-17-35.715547.json with huggingface_hub
d78750a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.19/aimo_kaggle_hard_pot/results_2024-06-09T18-04-03.707738.json with huggingface_hub
6750423
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.19/aimo_kaggle_medium_pot/results_2024-06-09T17-59-16.549383.json with huggingface_hub
ed23bbd
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.17/aimo_kaggle_hard_pot/results_2024-06-09T17-51-23.100258.json with huggingface_hub
5e52a96
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.17/aimo_kaggle_medium_pot/results_2024-06-09T17-45-17.773759.json with huggingface_hub
fe23677
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.15/aimo_kaggle_hard_pot/results_2024-06-09T17-37-53.246010.json with huggingface_hub
948a3f2
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.14/aimo_kaggle_hard_pot/results_2024-06-09T17-32-22.006131.json with huggingface_hub
bafe814
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.15/aimo_kaggle_medium_pot/results_2024-06-09T17-31-26.776685.json with huggingface_hub
ff4bf60
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.14/aimo_kaggle_medium_pot/results_2024-06-09T17-31-00.147508.json with huggingface_hub
e992945
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.16/aimo_kaggle_hard_pot/results_2024-06-09T17-30-21.244206.json with huggingface_hub
3d005f1
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.13/aimo_kaggle_hard_pot/results_2024-06-09T17-26-30.671337.json with huggingface_hub
d8bba59
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.16/aimo_kaggle_medium_pot/results_2024-06-09T17-24-33.910527.json with huggingface_hub
6b4d26d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.10/aimo_kaggle_hard_pot/results_2024-06-09T17-24-02.556963.json with huggingface_hub
69c7a38
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.12/aimo_kaggle_hard_pot/results_2024-06-09T17-23-28.688208.json with huggingface_hub
866d2ed
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.13/aimo_kaggle_medium_pot/results_2024-06-09T17-21-24.446886.json with huggingface_hub
feb75fd
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.10/aimo_kaggle_medium_pot/results_2024-06-09T17-18-35.485834.json with huggingface_hub
2fdfcfe
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.12/aimo_kaggle_medium_pot/results_2024-06-09T17-16-59.109451.json with huggingface_hub
598a30c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.11/aimo_kaggle_hard_pot/results_2024-06-09T17-16-48.983341.json with huggingface_hub
0629231
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.93/aimo_kaggle_hard/results_2024-06-09T17-12-43.537646.json with huggingface_hub
a9a2e83
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.11/aimo_kaggle_medium_pot/results_2024-06-09T17-10-53.013815.json with huggingface_hub
7c9c93e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.98/aimo_kaggle_hard/results_2024-06-09T17-09-17.942571.json with huggingface_hub
a4fd76f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.93/aimo_kaggle_medium/results_2024-06-09T17-09-13.721441.json with huggingface_hub
eb9269c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.98/aimo_kaggle_medium/results_2024-06-09T17-08-05.264154.json with huggingface_hub
7631e99
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.100/aimo_kaggle_hard/results_2024-06-09T17-03-43.681130.json with huggingface_hub
d157e3c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.100/aimo_kaggle_medium/results_2024-06-09T17-00-46.587216.json with huggingface_hub
04cda61
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.26/aimo_kaggle_hard/results_2024-06-09T15-31-48.575360.json with huggingface_hub
f5d020b
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.97/aimo_kaggle_hard/results_2024-06-09T15-26-49.504520.json with huggingface_hub
1825c4b
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.97/aimo_kaggle_medium/results_2024-06-09T15-26-09.865000.json with huggingface_hub
3dacc7c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.26/aimo_kaggle_medium/results_2024-06-09T15-25-12.248382.json with huggingface_hub
5a0450b
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.00/aimo_kaggle_medium_pot/results_2024-06-09T15-22-00.582971.json with huggingface_hub
91c4974
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.89/aimo_kaggle_hard/results_2024-06-09T15-16-31.074145.json with huggingface_hub
cb341cf
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.89/aimo_kaggle_medium/results_2024-06-09T15-16-25.139490.json with huggingface_hub
ac3859c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.96/aimo_kaggle_hard/results_2024-06-09T15-12-14.985259.json with huggingface_hub
f288043
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.96/aimo_kaggle_medium/results_2024-06-09T15-11-46.967792.json with huggingface_hub
87f4c1d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.92/aimo_kaggle_hard/results_2024-06-09T15-11-04.004933.json with huggingface_hub
5539f3c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.25/aimo_kaggle_tora_hard/results_2024-06-09T15-11-05.983526.json with huggingface_hub
5f6c235
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.92/aimo_kaggle_medium/results_2024-06-09T15-10-16.297817.json with huggingface_hub
0f79d4c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.83/aimo_kaggle_hard/results_2024-06-09T15-05-46.801457.json with huggingface_hub
26d9cd9
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.83/aimo_kaggle_medium/results_2024-06-09T15-04-48.128028.json with huggingface_hub
06ada59
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.105/aimo_kaggle_hard/results_2024-06-09T15-02-18.988715.json with huggingface_hub
451f7d4
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.24/aimo_kaggle_hard/results_2024-06-09T15-01-21.002841.json with huggingface_hub
6b89d4c
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.105/aimo_kaggle_medium/results_2024-06-09T15-00-22.329396.json with huggingface_hub
9d88cf9
verified

edbeeching HF staff commited on