open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.23/aimo_kaggle_hard_pot/results_2024-06-04T03-19-16.087024.json with huggingface_hub
b5bd8be
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.15/aimo_kaggle_hard/results_2024-06-04T03-17-17.338502.json with huggingface_hub
61ebe52
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.15/aimo_kaggle_medium_pot/results_2024-06-04T03-16-42.350775.json with huggingface_hub
1484d8f
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.15/aimo_kaggle_medium/results_2024-06-04T03-16-37.593667.json with huggingface_hub
ae45585
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.19/aimo_kaggle_medium_pot/results_2024-06-04T03-14-49.833700.json with huggingface_hub
490a852
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.19/aimo_kaggle_hard/results_2024-06-04T03-13-26.582917.json with huggingface_hub
a915602
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.01/aimo_kaggle_hard_pot/results_2024-06-04T03-12-35.717169.json with huggingface_hub
8b50349
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.23/aimo_kaggle_medium_pot/results_2024-06-04T03-12-15.168984.json with huggingface_hub
8ad274d
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.19/aimo_kaggle_medium/results_2024-06-04T03-11-42.337354.json with huggingface_hub
a37fadb
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.23/aimo_kaggle_hard/results_2024-06-04T03-10-12.636287.json with huggingface_hub
53cc407
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.23/aimo_kaggle_medium/results_2024-06-04T03-10-06.025216.json with huggingface_hub
6808a55
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.08/aimo_kaggle_hard_pot/results_2024-06-04T03-07-21.697649.json with huggingface_hub
fba9505
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.01/aimo_kaggle_medium_pot/results_2024-06-04T03-05-45.274051.json with huggingface_hub
b2e3a05
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.06/aimo_kaggle_hard_pot/results_2024-06-04T03-04-15.560023.json with huggingface_hub
eab64f4
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.01/aimo_kaggle_hard/results_2024-06-04T03-04-18.442187.json with huggingface_hub
87f40d2
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.01/aimo_kaggle_medium/results_2024-06-04T03-04-13.249795.json with huggingface_hub
0872029
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.08/aimo_kaggle_medium_pot/results_2024-06-04T03-03-10.882328.json with huggingface_hub
6fbbca6
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.08/aimo_kaggle_medium/results_2024-06-04T03-02-51.439079.json with huggingface_hub
57c97f2
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.08/aimo_kaggle_hard/results_2024-06-04T03-02-49.069461.json with huggingface_hub
cc2216d
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.06/aimo_kaggle_hard/results_2024-06-04T02-59-26.303066.json with huggingface_hub
37d567f
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.06/aimo_kaggle_medium_pot/results_2024-06-04T02-58-36.936636.json with huggingface_hub
ac9693d
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-kto/aimo_v24.34.06/aimo_kaggle_medium/results_2024-06-04T02-58-08.862068.json with huggingface_hub
9c08750
verified

vwxyzjn commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.12/aimo_kaggle_hard/results_2024-06-04T02-57-05.997610.json with huggingface_hub
62b36a9
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.12/aimo_kaggle_medium/results_2024-06-04T02-54-35.567386.json with huggingface_hub
16f6968
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.12/aimo_kaggle/results_2024-06-04T02-53-17.446360.json with huggingface_hub
8bcd94c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.23/aimo_kaggle/results_2024-06-04T02-30-04.636029.json with huggingface_hub
b8a9d99
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.23/aimo_kaggle_hard/results_2024-06-04T02-29-56.350970.json with huggingface_hub
feefa48
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.23/aimo_kaggle_medium/results_2024-06-04T02-29-27.151871.json with huggingface_hub
32ad4ea
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v03.31/aimo_kaggle_hard_pot/results_2024-06-04T02-15-05.348304.json with huggingface_hub
b9eb92f
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.33/aimo_kaggle_medium/results_2024-06-04T02-08-38.598739.json with huggingface_hub
f532515
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.33/aimo_kaggle/results_2024-06-04T02-07-40.414272.json with huggingface_hub
022782f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.33/aimo_kaggle_hard/results_2024-06-04T02-07-31.805200.json with huggingface_hub
8262191
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v03.17/aimo_kaggle_hard_pot/results_2024-06-04T02-04-49.923963.json with huggingface_hub
5ef0607
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.24/aimo_kaggle_hard/results_2024-06-04T02-00-10.996861.json with huggingface_hub
9a64304
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.24/aimo_kaggle/results_2024-06-04T01-59-24.140902.json with huggingface_hub
96804a4
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.24/aimo_kaggle_medium/results_2024-06-04T01-58-38.071755.json with huggingface_hub
88dd501
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.19/aimo_kaggle_medium/results_2024-06-04T01-57-41.988800.json with huggingface_hub
cdd40f0
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.19/aimo_kaggle/results_2024-06-04T01-57-20.546699.json with huggingface_hub
eb491b5
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.19/aimo_kaggle_hard/results_2024-06-04T01-57-14.308136.json with huggingface_hub
5d814be
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v03.17/aimo_kaggle_medium_pot/results_2024-06-04T01-54-47.497433.json with huggingface_hub
9e297d8
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v03.30/aimo_kaggle_medium_pot/results_2024-06-04T01-53-23.856570.json with huggingface_hub
31ab834
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v03.29/aimo_kaggle_hard_pot/results_2024-06-04T01-52-08.944733.json with huggingface_hub
c5ca70d
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.30/aimo_kaggle_hard/results_2024-06-04T01-51-25.176765.json with huggingface_hub
2ec3122
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v03.23/aimo_kaggle_hard_pot/results_2024-06-04T01-50-54.706437.json with huggingface_hub
4205019
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.30/aimo_kaggle/results_2024-06-04T01-49-28.941489.json with huggingface_hub
69e1db9
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.39/aimo_kaggle/results_2024-06-04T01-49-21.408433.json with huggingface_hub
b65b0a5
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.30/aimo_kaggle_medium/results_2024-06-04T01-49-17.328395.json with huggingface_hub
3eb589a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.39/aimo_kaggle_hard/results_2024-06-04T01-49-00.239847.json with huggingface_hub
e1f982e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.39/aimo_kaggle_medium/results_2024-06-04T01-48-37.228564.json with huggingface_hub
a0cb0b5
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v27.35/aimo_kaggle/results_2024-06-04T01-48-20.628908.json with huggingface_hub
fb1fa6b
verified

edbeeching HF staff commited on