open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.65/aimo_kaggle_hard/results_2024-06-09T07-34-29.935846.json with huggingface_hub
d7ab3f0
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.21/aimo_kaggle_hard/results_2024-06-09T07-32-24.172891.json with huggingface_hub
5673e56
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.65/aimo_kaggle_medium/results_2024-06-09T07-30-13.645336.json with huggingface_hub
4241ae5
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.21/aimo_kaggle_medium/results_2024-06-09T07-28-32.391286.json with huggingface_hub
fb11d6c
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.53/aimo_kaggle_hard/results_2024-06-09T07-04-56.025946.json with huggingface_hub
9aa5d88
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.53/aimo_kaggle_medium/results_2024-06-09T07-04-14.181927.json with huggingface_hub
9ca6543
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.67/aimo_kaggle_hard/results_2024-06-09T07-02-53.225928.json with huggingface_hub
908a85c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.67/aimo_kaggle_medium/results_2024-06-09T07-02-40.135486.json with huggingface_hub
bafd29b
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.68/aimo_kaggle_hard/results_2024-06-09T06-53-41.810988.json with huggingface_hub
f54a72a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.44/aimo_kaggle_hard/results_2024-06-09T06-52-51.636261.json with huggingface_hub
fb3316d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.68/aimo_kaggle_medium/results_2024-06-09T06-52-29.817702.json with huggingface_hub
2c9222c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.44/aimo_kaggle_medium/results_2024-06-09T06-50-31.009368.json with huggingface_hub
d208460
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.32/aimo_kaggle_hard/results_2024-06-09T06-47-47.928312.json with huggingface_hub
362d42f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.32/aimo_kaggle_medium/results_2024-06-09T06-46-59.541745.json with huggingface_hub
395fd40
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.60/aimo_kaggle_hard/results_2024-06-09T06-36-08.923545.json with huggingface_hub
73ef207
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.60/aimo_kaggle_medium/results_2024-06-09T06-36-07.204329.json with huggingface_hub
63b7b3f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.59/aimo_kaggle_hard/results_2024-06-09T06-34-28.078608.json with huggingface_hub
c098fbc
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.59/aimo_kaggle_medium/results_2024-06-09T06-34-18.379294.json with huggingface_hub
ae39472
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.43/aimo_kaggle_hard/results_2024-06-09T06-30-31.307553.json with huggingface_hub
90a408b
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.43/aimo_kaggle_medium/results_2024-06-09T06-30-21.575010.json with huggingface_hub
f5260e3
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.45/aimo_kaggle_hard/results_2024-06-09T06-22-23.190560.json with huggingface_hub
3d1ea10
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.48/aimo_kaggle_hard/results_2024-06-09T06-02-02.477117.json with huggingface_hub
b65262a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.48/aimo_kaggle_medium/results_2024-06-09T06-01-20.905333.json with huggingface_hub
6dff63b
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.46/aimo_kaggle_hard/results_2024-06-09T05-56-34.596011.json with huggingface_hub
0745d4d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.42/aimo_kaggle_hard/results_2024-06-09T05-55-27.523809.json with huggingface_hub
acfd81b
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.46/aimo_kaggle_medium/results_2024-06-09T05-55-17.910034.json with huggingface_hub
8c37e31
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.42/aimo_kaggle_medium/results_2024-06-09T05-54-59.230520.json with huggingface_hub
90383d1
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.22/aimo_kaggle_hard/results_2024-06-09T05-51-07.670119.json with huggingface_hub
ead45b4
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.22/aimo_kaggle_medium/results_2024-06-09T05-49-48.084865.json with huggingface_hub
4a69cd7
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.61/aimo_kaggle_hard/results_2024-06-09T05-49-41.848743.json with huggingface_hub
e99ff26
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.61/aimo_kaggle_medium/results_2024-06-09T05-48-32.197750.json with huggingface_hub
0b7fa61
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.54/aimo_kaggle_hard/results_2024-06-09T05-31-51.468973.json with huggingface_hub
02e839f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.54/aimo_kaggle_medium/results_2024-06-09T05-26-19.322929.json with huggingface_hub
b57e6f2
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.36/aimo_kaggle_hard/results_2024-06-09T05-25-47.206881.json with huggingface_hub
0c3a329
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.36/aimo_kaggle_medium/results_2024-06-09T05-25-46.362071.json with huggingface_hub
6774e00
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.13/aimo_kaggle_hard/results_2024-06-09T05-14-49.731174.json with huggingface_hub
163d4b5
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.13/aimo_kaggle_medium/results_2024-06-09T05-03-15.006719.json with huggingface_hub
92bda1e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.25/aimo_kaggle_hard/results_2024-06-09T04-56-57.466760.json with huggingface_hub
8892280
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.25/aimo_kaggle_medium/results_2024-06-09T04-52-25.848654.json with huggingface_hub
d946ca5
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v24.34/aimo_kaggle_tora_medium/results_2024-06-08T11-56-05.272216.json with huggingface_hub
bfaa74d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v24.34/aimo_kaggle_tora_medium/results_2024-06-08T11-50-11.830580.json with huggingface_hub
ab8ccd0
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v24.34/aimo_kaggle_tora_medium/results_2024-06-08T11-50-00.718762.json with huggingface_hub
54aaccb
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v24.34/aimo_kaggle_tora_medium/results_2024-06-08T11-50-00.532071.json with huggingface_hub
1f61c4d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v24.34/aimo_kaggle_tora_medium/results_2024-06-08T11-49-57.384969.json with huggingface_hub
27813f5
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v24.34/aimo_kaggle_tora_medium/results_2024-06-08T11-49-54.602357.json with huggingface_hub
29f66ae
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v24.34/aimo_kaggle_tora_medium/results_2024-06-08T11-49-50.620028.json with huggingface_hub
60ac077
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v24.34/aimo_kaggle_tora_medium/results_2024-06-08T11-49-49.124237.json with huggingface_hub
aafd7eb
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v24.34/aimo_kaggle_tora_medium/results_2024-06-08T11-49-47.209064.json with huggingface_hub
96b7d69
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v24.34/aimo_kaggle_tora_medium/results_2024-06-08T11-49-46.450280.json with huggingface_hub
56bf30d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v24.34/aimo_kaggle_tora_medium/results_2024-06-08T11-49-45.598983.json with huggingface_hub
2415b40
verified

edbeeching HF staff commited on