open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.57/aimo_kaggle_hard/results_2024-06-09T09-30-58.592522.json with huggingface_hub
582dd38
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.57/aimo_kaggle_medium/results_2024-06-09T09-28-16.887450.json with huggingface_hub
3a224f9
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.47/aimo_kaggle_hard/results_2024-06-09T09-25-42.967711.json with huggingface_hub
c13f20d
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.47/aimo_kaggle_medium/results_2024-06-09T09-24-06.130835.json with huggingface_hub
ceb5025
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.85/aimo_kaggle_hard/results_2024-06-09T09-19-42.549440.json with huggingface_hub
871b09c
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.85/aimo_kaggle_medium/results_2024-06-09T09-18-55.325649.json with huggingface_hub
025ef0e
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.56/aimo_kaggle_hard/results_2024-06-09T09-02-22.762929.json with huggingface_hub
4cce876
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.56/aimo_kaggle_medium/results_2024-06-09T09-01-20.750905.json with huggingface_hub
120ef1b
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.63/aimo_kaggle_hard/results_2024-06-09T09-00-26.840062.json with huggingface_hub
0a38b6e
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.63/aimo_kaggle_medium/results_2024-06-09T08-57-37.858954.json with huggingface_hub
1b24e33
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.72/aimo_kaggle_hard/results_2024-06-09T08-47-38.552685.json with huggingface_hub
be61790
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.72/aimo_kaggle_medium/results_2024-06-09T08-45-38.440327.json with huggingface_hub
acd2e94
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.50/aimo_kaggle_medium/results_2024-06-09T08-39-57.934404.json with huggingface_hub
7099751
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.50/aimo_kaggle_hard/results_2024-06-09T08-39-20.518256.json with huggingface_hub
e25a52a
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.10/aimo_kaggle_hard/results_2024-06-09T08-22-28.687858.json with huggingface_hub
2ec8366
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.10/aimo_kaggle_medium/results_2024-06-09T08-21-17.441264.json with huggingface_hub
b17f427
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.62/aimo_kaggle_hard/results_2024-06-09T08-06-52.625696.json with huggingface_hub
747f2dd
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.62/aimo_kaggle_medium/results_2024-06-09T08-05-46.855329.json with huggingface_hub
31735cc
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.18/aimo_kaggle_medium/results_2024-06-09T08-03-53.900804.json with huggingface_hub
32fb34a
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.18/aimo_kaggle_hard/results_2024-06-09T08-03-31.527206.json with huggingface_hub
58e03ef
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.74/aimo_kaggle_hard/results_2024-06-09T08-01-48.193586.json with huggingface_hub
ee050fb
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.74/aimo_kaggle_medium/results_2024-06-09T08-01-12.031761.json with huggingface_hub
7a5812e
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.14/aimo_kaggle_hard/results_2024-06-09T08-00-18.794056.json with huggingface_hub
2a83b47
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.14/aimo_kaggle_medium/results_2024-06-09T07-58-20.498815.json with huggingface_hub
230bcbc
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.58/aimo_kaggle_hard/results_2024-06-09T07-55-14.964657.json with huggingface_hub
3bcb878
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.58/aimo_kaggle_medium/results_2024-06-09T07-53-50.377909.json with huggingface_hub
07040f5
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.51/aimo_kaggle_hard/results_2024-06-09T07-52-37.868001.json with huggingface_hub
503ff01
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.51/aimo_kaggle_medium/results_2024-06-09T07-49-57.770574.json with huggingface_hub
d8c2441
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.77/aimo_kaggle_hard/results_2024-06-09T07-46-28.714990.json with huggingface_hub
129a9d9
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.77/aimo_kaggle_medium/results_2024-06-09T07-45-07.253381.json with huggingface_hub
f6e3734
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.65/aimo_kaggle_hard/results_2024-06-09T07-34-29.935846.json with huggingface_hub
d7ab3f0
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.21/aimo_kaggle_hard/results_2024-06-09T07-32-24.172891.json with huggingface_hub
5673e56
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.65/aimo_kaggle_medium/results_2024-06-09T07-30-13.645336.json with huggingface_hub
4241ae5
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.21/aimo_kaggle_medium/results_2024-06-09T07-28-32.391286.json with huggingface_hub
fb11d6c
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.53/aimo_kaggle_hard/results_2024-06-09T07-04-56.025946.json with huggingface_hub
9aa5d88
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.53/aimo_kaggle_medium/results_2024-06-09T07-04-14.181927.json with huggingface_hub
9ca6543
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.67/aimo_kaggle_hard/results_2024-06-09T07-02-53.225928.json with huggingface_hub
908a85c
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.67/aimo_kaggle_medium/results_2024-06-09T07-02-40.135486.json with huggingface_hub
bafd29b
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.68/aimo_kaggle_hard/results_2024-06-09T06-53-41.810988.json with huggingface_hub
f54a72a
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.44/aimo_kaggle_hard/results_2024-06-09T06-52-51.636261.json with huggingface_hub
fb3316d
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.68/aimo_kaggle_medium/results_2024-06-09T06-52-29.817702.json with huggingface_hub
2c9222c
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.44/aimo_kaggle_medium/results_2024-06-09T06-50-31.009368.json with huggingface_hub
d208460
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.32/aimo_kaggle_hard/results_2024-06-09T06-47-47.928312.json with huggingface_hub
362d42f
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.32/aimo_kaggle_medium/results_2024-06-09T06-46-59.541745.json with huggingface_hub
395fd40
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.60/aimo_kaggle_hard/results_2024-06-09T06-36-08.923545.json with huggingface_hub
73ef207
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.60/aimo_kaggle_medium/results_2024-06-09T06-36-07.204329.json with huggingface_hub
63b7b3f
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.59/aimo_kaggle_hard/results_2024-06-09T06-34-28.078608.json with huggingface_hub
c098fbc
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.59/aimo_kaggle_medium/results_2024-06-09T06-34-18.379294.json with huggingface_hub
ae39472
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.43/aimo_kaggle_hard/results_2024-06-09T06-30-31.307553.json with huggingface_hub
90a408b
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.43/aimo_kaggle_medium/results_2024-06-09T06-30-21.575010.json with huggingface_hub
f5260e3
verified

edbeeching HF staff committed on