open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-11T16-41-03.559549.json with huggingface_hub
da56e88
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.47/aimo_kaggle_hard_pot/results_2024-06-11T16-40-50.644269.json with huggingface_hub
ffaec33
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-11T16-39-49.848448.json with huggingface_hub
d445fbb
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-11T16-38-50.859864.json with huggingface_hub
14cc116
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-11T16-38-54.812312.json with huggingface_hub
d579fe6
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-11T16-38-03.424851.json with huggingface_hub
ebda22f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-11T16-37-22.958393.json with huggingface_hub
51ef95d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-11T16-36-18.220230.json with huggingface_hub
706f2e9
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-11T16-36-03.147264.json with huggingface_hub
e850165
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-11T16-35-55.863224.json with huggingface_hub
8d1a367
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-11T16-35-13.247654.json with huggingface_hub
57f736b
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-11T16-34-49.508327.json with huggingface_hub
7edce1e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-11T16-34-07.760195.json with huggingface_hub
afeacc9
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-11T16-32-05.331996.json with huggingface_hub
52d7f2f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.43/aimo_kaggle_medium_pot/results_2024-06-11T16-14-11.351425.json with huggingface_hub
93e27f8
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.36/aimo_kaggle_hard_pot/results_2024-06-11T16-09-30.388243.json with huggingface_hub
e2d59a8
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.47/aimo_kaggle_medium_pot/results_2024-06-11T15-56-18.544278.json with huggingface_hub
8e4a74a
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.35/aimo_kaggle_hard_pot/results_2024-06-11T15-52-28.158077.json with huggingface_hub
4eb82e9
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.36/aimo_kaggle_medium_pot/results_2024-06-11T15-33-22.151676.json with huggingface_hub
589af84
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.41/aimo_kaggle_hard_pot/results_2024-06-11T15-15-54.440514.json with huggingface_hub
21c5f19
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-11T15-09-53.105293.json with huggingface_hub
c4aaebf
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.41/aimo_kaggle_medium_pot/results_2024-06-11T14-54-48.619994.json with huggingface_hub
32246a6
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.35/aimo_kaggle_medium_pot/results_2024-06-11T14-47-41.620034.json with huggingface_hub
5b22d10
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.38/aimo_kaggle_hard_pot/results_2024-06-11T13-57-56.732400.json with huggingface_hub
464e10b
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.34/aimo_kaggle_hard_pot/results_2024-06-11T13-55-27.508404.json with huggingface_hub
c153f93
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.38/aimo_kaggle_medium_pot/results_2024-06-11T13-32-17.006411.json with huggingface_hub
37565ce
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.33/aimo_kaggle_hard_pot/results_2024-06-11T13-19-28.808693.json with huggingface_hub
855af17
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.34/aimo_kaggle_medium_pot/results_2024-06-11T13-12-45.552192.json with huggingface_hub
75e0ed7
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.32/aimo_kaggle_hard_pot/results_2024-06-11T13-08-41.321953.json with huggingface_hub
34ceca7
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.33/aimo_kaggle_medium_pot/results_2024-06-11T12-44-59.658236.json with huggingface_hub
9ba48cc
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.32/aimo_kaggle_medium_pot/results_2024-06-11T12-44-13.632886.json with huggingface_hub
cd51dcc
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/CodeLlama-34b-Python-hf-sft/aimo_v01.31/aimo_kaggle_tora_hard/results_2024-06-11T12-11-56.395186.json with huggingface_hub
ce7b978
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/CodeLlama-34b-Python-hf-sft/aimo_v01.00/aimo_kaggle_tora_hard/results_2024-06-11T12-00-18.557167.json with huggingface_hub
287d6d6
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl/main/aimo_kaggle_tora_medium/results_2024-06-11T07-33-21.097376.json with huggingface_hub
12f6b21
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl/main/aimo_kaggle_tora_medium/results_2024-06-11T07-33-20.195180.json with huggingface_hub
d30cb6d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl/main/aimo_kaggle_tora_medium/results_2024-06-11T07-33-15.621176.json with huggingface_hub
2f68638
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl/main/aimo_kaggle_tora_medium/results_2024-06-11T07-33-01.378911.json with huggingface_hub
902e2c6
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl/main/aimo_kaggle_tora_medium/results_2024-06-11T07-33-00.213700.json with huggingface_hub
bac681e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl/main/aimo_kaggle_tora_medium/results_2024-06-11T07-32-05.929183.json with huggingface_hub
23d4095
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl/main/aimo_kaggle_tora_medium/results_2024-06-11T07-31-15.770796.json with huggingface_hub
57c20f8
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl/main/aimo_kaggle_tora_medium/results_2024-06-11T07-31-06.024755.json with huggingface_hub
cf9159d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl/main/aimo_kaggle_tora_medium/results_2024-06-11T07-30-55.704851.json with huggingface_hub
1d1b072
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl/main/aimo_kaggle_tora_medium/results_2024-06-11T07-30-50.153120.json with huggingface_hub
ab35180
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.102/aimo_kaggle_tora_medium/results_2024-06-10T21-00-07.434375.json with huggingface_hub
b5ff137
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.102/aimo_kaggle_tora_medium/results_2024-06-10T20-59-57.952934.json with huggingface_hub
b9dd172
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.104/aimo_kaggle_tora_medium/results_2024-06-10T20-59-49.103360.json with huggingface_hub
339788d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.70/aimo_kaggle_tora_medium/results_2024-06-10T20-59-46.581324.json with huggingface_hub
5144f08
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.104/aimo_kaggle_tora_medium/results_2024-06-10T20-59-44.382240.json with huggingface_hub
5246b47
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.104/aimo_kaggle_tora_medium/results_2024-06-10T20-59-40.559814.json with huggingface_hub
8b18542
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.102/aimo_kaggle_tora_medium/results_2024-06-10T20-59-36.999327.json with huggingface_hub
9fec9e9
verified

edbeeching HF staff commited on