open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.34/aimo_kaggle_hard_pot/results_2024-05-25T11-41-01.819061.json with huggingface_hub
8636181
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.34/aimo_kaggle_medium_pot/results_2024-05-25T11-40-45.662100.json with huggingface_hub
c47a0c3
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.40/aimo_kaggle_hard_pot/results_2024-05-25T11-40-36.845786.json with huggingface_hub
3ba2ec1
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.33/aimo_kaggle_hard_pot/results_2024-05-25T11-40-31.946905.json with huggingface_hub
4c5aee9
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.19/aimo_kaggle_hard_pot/results_2024-05-25T11-40-15.479319.json with huggingface_hub
c51db1f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.33/aimo_kaggle_medium_pot/results_2024-05-25T11-40-12.232484.json with huggingface_hub
1135411
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.40/aimo_kaggle_pot/results_2024-05-25T11-40-08.068339.json with huggingface_hub
f443253
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_medium_pot/results_2024-05-25T11-40-08.471267.json with huggingface_hub
1a0b683
verified

edbeeching HF staff commited on

Upload eval_results/meta-llama/Meta-Llama-3-70B-Instruct/main/ifeval/results_2024-05-25T11-39-50.035526.json with huggingface_hub
dbc840a
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.33/aimo_kaggle_pot/results_2024-05-25T11-39-41.085587.json with huggingface_hub
1d21c00
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.19/aimo_kaggle_medium_pot/results_2024-05-25T11-39-28.048621.json with huggingface_hub
d24fde8
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.40/aimo_kaggle_medium_pot/results_2024-05-25T11-39-31.520214.json with huggingface_hub
3155a9a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.29/aimo_kaggle_hard_pot/results_2024-05-25T11-39-22.629754.json with huggingface_hub
629a8f7
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_pot/results_2024-05-25T11-39-17.762875.json with huggingface_hub
0c6b99a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.30/aimo_kaggle_hard_pot/results_2024-05-25T11-38-52.490352.json with huggingface_hub
f3bd29a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.29/aimo_kaggle_medium_pot/results_2024-05-25T11-38-40.302637.json with huggingface_hub
fcbf325
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.30/aimo_kaggle_pot/results_2024-05-25T11-38-22.337612.json with huggingface_hub
fa7c0d9
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.17/aimo_kaggle_hard_pot/results_2024-05-25T11-38-25.496139.json with huggingface_hub
ca24ae8
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.29/aimo_kaggle_pot/results_2024-05-25T11-38-24.408979.json with huggingface_hub
fa90e25
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.30/aimo_kaggle_medium_pot/results_2024-05-25T11-38-15.370127.json with huggingface_hub
f13d1ea
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.19/aimo_kaggle_pot/results_2024-05-25T11-38-09.236731.json with huggingface_hub
0c1bde4
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.17/aimo_kaggle_medium_pot/results_2024-05-25T11-38-01.298586.json with huggingface_hub
e4a06d9
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.17/aimo_kaggle_pot/results_2024-05-25T11-37-48.243850.json with huggingface_hub
7e68c9e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.41/aimo_kaggle_hard_pot/results_2024-05-25T11-37-38.866375.json with huggingface_hub
376b8f2
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.21/aimo_kaggle_hard_pot/results_2024-05-25T11-37-36.674975.json with huggingface_hub
69cb88a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.39/aimo_kaggle_hard_pot/results_2024-05-25T11-37-19.358590.json with huggingface_hub
3a6e676
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.20/aimo_kaggle_hard_pot/results_2024-05-25T11-37-18.262121.json with huggingface_hub
b9a2e6f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.21/aimo_kaggle_medium_pot/results_2024-05-25T11-37-11.698342.json with huggingface_hub
17350a9
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.21/aimo_kaggle_pot/results_2024-05-25T11-37-04.251113.json with huggingface_hub
92bf64e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.38/aimo_kaggle_hard_pot/results_2024-05-25T11-36-47.635876.json with huggingface_hub
2f329fe
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.41/aimo_kaggle_medium_pot/results_2024-05-25T11-36-35.882220.json with huggingface_hub
7046815
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.41/aimo_kaggle_pot/results_2024-05-25T11-36-26.040126.json with huggingface_hub
237d066
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.35/aimo_kaggle_hard_pot/results_2024-05-25T11-36-26.696007.json with huggingface_hub
55ddcd1
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.39/aimo_kaggle_pot/results_2024-05-25T11-36-22.355000.json with huggingface_hub
111251c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.39/aimo_kaggle_medium_pot/results_2024-05-25T11-36-15.833534.json with huggingface_hub
587e799
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.20/aimo_kaggle_medium_pot/results_2024-05-25T11-36-15.944044.json with huggingface_hub
1affbe0
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.20/aimo_kaggle_pot/results_2024-05-25T11-36-10.990397.json with huggingface_hub
46b1ca9
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.11/aimo_kaggle_hard_pot/results_2024-05-25T11-35-47.222717.json with huggingface_hub
6830a3e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.38/aimo_kaggle_medium_pot/results_2024-05-25T11-35-43.654368.json with huggingface_hub
a0cdec3
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.16/aimo_kaggle_hard_pot/results_2024-05-25T11-35-37.871835.json with huggingface_hub
c751de2
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.11/aimo_kaggle_pot/results_2024-05-25T11-35-32.747284.json with huggingface_hub
adcbf4c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.35/aimo_kaggle_medium_pot/results_2024-05-25T11-35-17.797649.json with huggingface_hub
3f326c4
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.38/aimo_kaggle_pot/results_2024-05-25T11-35-14.036257.json with huggingface_hub
e92ebc6
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.11/aimo_kaggle_medium_pot/results_2024-05-25T11-35-12.034306.json with huggingface_hub
758fb16
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.16/aimo_kaggle_medium_pot/results_2024-05-25T11-35-00.398166.json with huggingface_hub
cab5b8a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.16/aimo_kaggle_pot/results_2024-05-25T11-34-26.447439.json with huggingface_hub
1898fa9
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.00/aimo_kaggle_hard_pot/results_2024-05-25T11-34-16.448440.json with huggingface_hub
bd2a426
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.35/aimo_kaggle_pot/results_2024-05-25T11-34-04.748642.json with huggingface_hub
b9b5e2b
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.00/aimo_kaggle_medium_pot/results_2024-05-25T11-33-28.584808.json with huggingface_hub
6c5f33a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.00/aimo_kaggle_pot/results_2024-05-25T11-33-26.472110.json with huggingface_hub
f1ed1e6
verified

edbeeching HF staff commited on