open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/AI-MO/CodeLlama-34b-Python-hf-sft/aimo_v01.18/aimo_kaggle_tora_medium/results_2024-06-10T08-26-57.560570.json with huggingface_hub
b4d0c53
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/CodeLlama-34b-Python-hf-sft/aimo_v01.12/aimo_kaggle_tora_medium/results_2024-06-10T08-25-35.925130.json with huggingface_hub
7d43151
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/CodeLlama-34b-Python-hf-sft/aimo_v01.20/aimo_kaggle_tora_medium/results_2024-06-10T08-21-11.066595.json with huggingface_hub
7cadfd5
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/CodeLlama-34b-Python-hf-sft/aimo_v01.14/aimo_kaggle_tora_medium/results_2024-06-10T08-21-08.401441.json with huggingface_hub
f1b8372
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/CodeLlama-34b-Python-hf-sft/aimo_v01.15/aimo_kaggle_tora_medium/results_2024-06-10T08-21-06.154970.json with huggingface_hub
6394449
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/CodeLlama-34b-Python-hf-sft/aimo_v01.17/aimo_kaggle_tora_medium/results_2024-06-10T08-14-59.848488.json with huggingface_hub
5b3445e
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/CodeLlama-34b-Python-hf-sft/aimo_v01.16/aimo_kaggle_tora_medium/results_2024-06-10T08-14-42.382319.json with huggingface_hub
37d203a
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/CodeLlama-34b-Python-hf-sft/aimo_v01.11/aimo_kaggle_tora_medium/results_2024-06-10T08-11-23.386542.json with huggingface_hub
8b19d33
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/CodeLlama-34b-Python-hf-sft/aimo_v01.10/aimo_kaggle_tora_medium/results_2024-06-10T08-09-42.924553.json with huggingface_hub
dde11a7
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.29/aimo_kaggle_hard/results_2024-06-10T08-04-10.825188.json with huggingface_hub
c4573a6
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.29/aimo_kaggle_medium/results_2024-06-10T08-00-07.428432.json with huggingface_hub
9ea47cd
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.30/aimo_kaggle_hard/results_2024-06-10T07-59-47.598520.json with huggingface_hub
d1fb0d6
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.30/aimo_kaggle_medium/results_2024-06-10T07-56-26.587943.json with huggingface_hub
50b8e9a
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.19/aimo_kaggle_hard_pot/results_2024-06-10T07-51-48.786760.json with huggingface_hub
d26d63f
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.14/aimo_kaggle_hard_pot/results_2024-06-10T07-47-09.278466.json with huggingface_hub
e5b4948
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.25/aimo_kaggle_hard_pot/results_2024-06-10T07-46-31.141405.json with huggingface_hub
03f02cc
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.28/aimo_kaggle_hard/results_2024-06-10T07-44-13.268485.json with huggingface_hub
eedc258
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.28/aimo_kaggle_medium/results_2024-06-10T07-42-10.502774.json with huggingface_hub
fb7104e
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.07/aimo_kaggle_medium_pot/results_2024-06-10T07-38-45.201968.json with huggingface_hub
3ab66bc
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.84/aimo_kaggle_hard/results_2024-06-10T07-35-08.389220.json with huggingface_hub
f5eea95
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.84/aimo_kaggle_medium/results_2024-06-10T07-34-14.177004.json with huggingface_hub
a9d63b1
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.104/aimo_kaggle_hard_pot/results_2024-06-10T07-31-27.438374.json with huggingface_hub
41c3d09
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.103/aimo_kaggle_hard_pot/results_2024-06-10T07-30-47.551568.json with huggingface_hub
59893c2
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.25/aimo_kaggle_medium_pot/results_2024-06-10T07-29-01.536013.json with huggingface_hub
d54fdf1
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.102/aimo_kaggle_hard_pot/results_2024-06-10T07-28-30.919334.json with huggingface_hub
426c700
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.14/aimo_kaggle_medium_pot/results_2024-06-10T07-27-58.003343.json with huggingface_hub
c487a63
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.19/aimo_kaggle_medium_pot/results_2024-06-10T07-27-52.201822.json with huggingface_hub
f1ef325
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.105/aimo_kaggle_hard_pot/results_2024-06-10T07-27-12.652397.json with huggingface_hub
cff6046
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.100/aimo_kaggle_hard_pot/results_2024-06-10T07-26-50.903698.json with huggingface_hub
779c2d2
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.94/aimo_kaggle_hard_pot/results_2024-06-10T07-26-33.068337.json with huggingface_hub
cafee1c
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.84/aimo_kaggle_hard_pot/results_2024-06-10T07-25-45.010064.json with huggingface_hub
6e37392
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.104/aimo_kaggle_medium_pot/results_2024-06-10T07-25-14.138543.json with huggingface_hub
9fce914
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.103/aimo_kaggle_medium_pot/results_2024-06-10T07-24-44.273479.json with huggingface_hub
fb94b99
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.102/aimo_kaggle_medium_pot/results_2024-06-10T07-24-34.262897.json with huggingface_hub
10ead05
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.97/aimo_kaggle_hard_pot/results_2024-06-10T07-24-06.455665.json with huggingface_hub
d9222fa
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.98/aimo_kaggle_hard_pot/results_2024-06-10T07-24-07.457562.json with huggingface_hub
39d2221
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.93/aimo_kaggle_hard_pot/results_2024-06-10T07-22-52.251896.json with huggingface_hub
ad00f63
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.96/aimo_kaggle_hard_pot/results_2024-06-10T07-22-19.897510.json with huggingface_hub
6393442
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.105/aimo_kaggle_medium_pot/results_2024-06-10T07-22-14.056861.json with huggingface_hub
f69f25e
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.15/aimo_kaggle_hard/results_2024-06-10T07-21-06.291854.json with huggingface_hub
5e7b299
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.99/aimo_kaggle_hard_pot/results_2024-06-10T07-20-56.012416.json with huggingface_hub
bf75ece
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.94/aimo_kaggle_medium_pot/results_2024-06-10T07-20-54.840526.json with huggingface_hub
5421abe
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.100/aimo_kaggle_medium_pot/results_2024-06-10T07-20-40.635283.json with huggingface_hub
7779c2b
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.84/aimo_kaggle_medium_pot/results_2024-06-10T07-20-11.716175.json with huggingface_hub
c69e452
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.27/aimo_kaggle_hard/results_2024-06-10T07-19-35.200405.json with huggingface_hub
a867e5b
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.89/aimo_kaggle_hard_pot/results_2024-06-10T07-19-23.020106.json with huggingface_hub
b49e612
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.97/aimo_kaggle_medium_pot/results_2024-06-10T07-18-49.092637.json with huggingface_hub
e6724b4
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.92/aimo_kaggle_hard_pot/results_2024-06-10T07-17-46.409227.json with huggingface_hub
6d1bbc2
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.98/aimo_kaggle_medium_pot/results_2024-06-10T07-17-44.167671.json with huggingface_hub
1944d6c
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v32.93/aimo_kaggle_medium_pot/results_2024-06-10T07-17-16.911342.json with huggingface_hub
c4a28c4
verified

edbeeching HF staff committed on
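
Every upload path in this history appears to follow the same five-part layout: an organisation, a model name, a version tag, a task name, and a timestamped results file. The sketch below splits such a path into those parts using only the Python standard library. The field names (`org`, `model`, `revision`, `task`) and the helper `parse_result_path` are hypothetical labels chosen for illustration; they are inferred from the paths listed here, not taken from the leaderboard's own code.

```python
import re
from datetime import datetime
from typing import NamedTuple


class EvalResultPath(NamedTuple):
    """Components of one results path (field names are assumed labels)."""
    org: str
    model: str
    revision: str
    task: str
    timestamp: datetime


# Layout inferred from the commit messages above:
# eval_results/<org>/<model>/<revision>/<task>/results_<timestamp>.json
_PATTERN = re.compile(
    r"^eval_results/(?P<org>[^/]+)/(?P<model>[^/]+)/(?P<revision>[^/]+)/"
    r"(?P<task>[^/]+)/results_(?P<ts>[^/]+)\.json$"
)


def parse_result_path(path: str) -> EvalResultPath:
    """Split a results path into its parts; raise ValueError if it does not match."""
    match = _PATTERN.match(path)
    if match is None:
        raise ValueError(f"unexpected results path: {path!r}")
    # The file names use '-' instead of ':' in the time portion.
    timestamp = datetime.strptime(match["ts"], "%Y-%m-%dT%H-%M-%S.%f")
    return EvalResultPath(
        match["org"], match["model"], match["revision"], match["task"], timestamp
    )


if __name__ == "__main__":
    example = (
        "eval_results/AI-MO/qwen-14b-sft/aimo_v00.29/aimo_kaggle_hard/"
        "results_2024-06-10T08-04-10.825188.json"
    )
    print(parse_result_path(example))
```

Parsing the timestamp into a `datetime` makes it possible to sort entries chronologically, since several uploads in this list landed within the same minute.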