open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.30/aimo_kaggle_hard_pot/results_2024-06-04T09-49-14.242541.json with huggingface_hub
dcdad32
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.21/aimo_kaggle_hard_pot/results_2024-06-04T09-49-04.818628.json with huggingface_hub
a568979
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.26/aimo_kaggle_medium_pot/results_2024-06-04T09-48-15.389375.json with huggingface_hub
c2ed697
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.17/aimo_kaggle_hard_pot/results_2024-06-04T09-48-09.632034.json with huggingface_hub
7e7b389
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.29/aimo_kaggle_hard_pot/results_2024-06-04T09-47-16.158211.json with huggingface_hub
f37ad74
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.19/aimo_kaggle_medium_pot/results_2024-06-04T09-46-20.924466.json with huggingface_hub
399e9d7
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.30/aimo_kaggle_medium_pot/results_2024-06-04T09-44-27.005735.json with huggingface_hub
0b708a9
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-13b-Python-hf-sft/aimo_v01.00/aimo_kaggle_hard_pot/results_2024-06-04T09-44-20.633206.json with huggingface_hub
a2850e4
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.41/aimo_kaggle_hard_pot/results_2024-06-04T09-44-15.602809.json with huggingface_hub
94a7d7a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.39/aimo_kaggle_hard_pot/results_2024-06-04T09-44-09.085156.json with huggingface_hub
c369427
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.35/aimo_kaggle_hard_pot/results_2024-06-04T09-43-50.291179.json with huggingface_hub
d478b00
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.38/aimo_kaggle_hard_pot/results_2024-06-04T09-43-04.561644.json with huggingface_hub
2195127
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.21/aimo_kaggle_medium_pot/results_2024-06-04T09-42-19.890313.json with huggingface_hub
0c70c86
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.16/aimo_kaggle_hard_pot/results_2024-06-04T09-42-14.648715.json with huggingface_hub
fb36785
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.29/aimo_kaggle_medium_pot/results_2024-06-04T09-41-49.440105.json with huggingface_hub
073e86e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.15/aimo_kaggle_hard_pot/results_2024-06-04T09-41-36.855613.json with huggingface_hub
c7ef4b0
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.17/aimo_kaggle_medium_pot/results_2024-06-04T09-41-36.548136.json with huggingface_hub
eab24e9
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.33/aimo_kaggle_hard_pot/results_2024-06-04T09-39-31.320932.json with huggingface_hub
de5c1e4
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.22/aimo_kaggle_hard_pot/results_2024-06-04T09-38-46.081979.json with huggingface_hub
8a5d9df
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.39/aimo_kaggle_medium_pot/results_2024-06-04T09-38-38.313777.json with huggingface_hub
f3dfdf5
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.41/aimo_kaggle_medium_pot/results_2024-06-04T09-38-28.924640.json with huggingface_hub
75fb158
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.35/aimo_kaggle_medium_pot/results_2024-06-04T09-37-21.605497.json with huggingface_hub
37c65a9
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.38/aimo_kaggle_medium_pot/results_2024-06-04T09-37-15.133439.json with huggingface_hub
241a0c9
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-13b-Python-hf-sft/aimo_v01.00/aimo_kaggle_medium_pot/results_2024-06-04T09-37-05.905394.json with huggingface_hub
cbab02b
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.15/aimo_kaggle_medium_pot/results_2024-06-04T09-36-00.723160.json with huggingface_hub
86db1d5
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.14/aimo_kaggle_hard_pot/results_2024-06-04T09-35-48.468073.json with huggingface_hub
47c4db7
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v29.16/aimo_kaggle_medium_pot/results_2024-06-04T09-35-39.269398.json with huggingface_hub
6ec92ed
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.13/aimo_kaggle_hard_pot/results_2024-06-04T09-35-34.784672.json with huggingface_hub
642e777
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.40/aimo_kaggle_hard_pot/results_2024-06-04T09-35-29.443415.json with huggingface_hub
1eddc0f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.33/aimo_kaggle_medium_pot/results_2024-06-04T09-33-47.305791.json with huggingface_hub
bbeed30
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.18/aimo_kaggle_hard_pot/results_2024-06-04T09-33-28.726446.json with huggingface_hub
4b7f489
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.25/aimo_kaggle_hard_pot/results_2024-06-04T09-33-14.286365.json with huggingface_hub
7424500
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.10/aimo_kaggle_hard_pot/results_2024-06-04T09-33-08.558196.json with huggingface_hub
376e9c2
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.22/aimo_kaggle_medium_pot/results_2024-06-04T09-32-35.846327.json with huggingface_hub
5c85b98
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.24/aimo_kaggle_hard_pot/results_2024-06-04T09-32-13.391163.json with huggingface_hub
f090043
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.40/aimo_kaggle_medium_pot/results_2024-06-04T09-30-35.329350.json with huggingface_hub
19b08a7
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.13/aimo_kaggle_medium_pot/results_2024-06-04T09-30-28.004381.json with huggingface_hub
1725f4d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.12/aimo_kaggle_hard_pot/results_2024-06-04T09-29-53.922386.json with huggingface_hub
ca00b9b
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.19/aimo_kaggle_hard_pot/results_2024-06-04T09-29-33.685311.json with huggingface_hub
b1db590
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.14/aimo_kaggle_medium_pot/results_2024-06-04T09-29-18.988673.json with huggingface_hub
ba4accc
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.23/aimo_kaggle_hard_pot/results_2024-06-04T09-29-05.754131.json with huggingface_hub
0ce4946
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.31/aimo_kaggle_hard_pot/results_2024-06-04T09-28-37.472540.json with huggingface_hub
64baafd
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.25/aimo_kaggle_medium_pot/results_2024-06-04T09-28-21.981094.json with huggingface_hub
011ff57
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.18/aimo_kaggle_medium_pot/results_2024-06-04T09-27-29.564298.json with huggingface_hub
f2e24f1
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.10/aimo_kaggle_medium_pot/results_2024-06-04T09-27-13.709246.json with huggingface_hub
1318a39
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.41/aimo_kaggle_hard_pot/results_2024-06-04T09-26-50.561342.json with huggingface_hub
98fd143
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.24/aimo_kaggle_medium_pot/results_2024-06-04T09-26-35.484824.json with huggingface_hub
1d45615
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.29/aimo_kaggle_hard_pot/results_2024-06-04T09-25-36.010350.json with huggingface_hub
6a4f8d0
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.12/aimo_kaggle_medium_pot/results_2024-06-04T09-24-29.394944.json with huggingface_hub
b379695
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v28.19/aimo_kaggle_medium_pot/results_2024-06-04T09-23-27.201638.json with huggingface_hub
4fa512a
verified

edbeeching HF staff commited on