open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.15/aimo_kaggle_medium/results_2024-06-02T21-16-31.925206.json with huggingface_hub
2d8f788
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.00/aimo_kaggle_tora_medium/results_2024-06-02T21-15-31.579265.json with huggingface_hub
e6f7bfc
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.16/aimo_kaggle_medium/results_2024-06-02T21-13-03.119629.json with huggingface_hub
261a741
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.12/aimo_kaggle_hard/results_2024-06-02T21-08-13.197075.json with huggingface_hub
04cece9
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.13/aimo_kaggle_hard/results_2024-06-02T21-04-16.041322.json with huggingface_hub
4372ca3
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.12/aimo_kaggle_medium/results_2024-06-02T21-02-21.002626.json with huggingface_hub
f217395
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.26/aimo_kaggle_hard/results_2024-06-02T20-53-42.909221.json with huggingface_hub
3d058ab
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.18/aimo_kaggle_hard/results_2024-06-02T20-52-21.520324.json with huggingface_hub
adbf66e
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.02/aimo_kaggle_hard/results_2024-06-02T20-51-53.488192.json with huggingface_hub
6d64820
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.18/aimo_kaggle_medium/results_2024-06-02T20-46-52.827967.json with huggingface_hub
df0a8dd
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.26/aimo_kaggle_medium/results_2024-06-02T20-46-29.786558.json with huggingface_hub
a32f055
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.13/aimo_kaggle_medium/results_2024-06-02T20-45-46.109505.json with huggingface_hub
6563f7a
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.04/aimo_kaggle_hard/results_2024-06-02T20-42-16.326524.json with huggingface_hub
0788b04
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.02/aimo_kaggle_medium/results_2024-06-02T20-42-00.478164.json with huggingface_hub
31e1607
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.04/aimo_kaggle_medium/results_2024-06-02T20-33-18.192995.json with huggingface_hub
3cf90e7
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.11/aimo_kaggle_hard/results_2024-06-02T20-31-55.039540.json with huggingface_hub
c3ea81d
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.00/aimo_kaggle_hard/results_2024-06-02T20-31-44.139144.json with huggingface_hub
e3fca3f
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.21/aimo_kaggle_hard/results_2024-06-02T20-31-18.763693.json with huggingface_hub
73c43a9
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.00/aimo_kaggle_medium/results_2024-06-02T20-29-35.319612.json with huggingface_hub
79eef1e
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.21/aimo_kaggle_medium/results_2024-06-02T20-24-25.899584.json with huggingface_hub
74a8b7c
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.25/aimo_kaggle_hard/results_2024-06-02T20-23-19.037929.json with huggingface_hub
45f945c
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.11/aimo_kaggle_medium/results_2024-06-02T20-21-32.012950.json with huggingface_hub
f0e0eaa
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.25/aimo_kaggle_medium/results_2024-06-02T20-10-02.902299.json with huggingface_hub
745c50d
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.07/aimo_kaggle_hard/results_2024-06-02T20-07-43.571119.json with huggingface_hub
8fee5b5
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/CodeLlama-13b-Python-hf-sft/aimo_v00.18/aimo_kaggle_hard/results_2024-06-02T20-04-32.429315.json with huggingface_hub
9925da9
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-13b-Python-hf-sft/aimo_v00.18/aimo_kaggle_medium/results_2024-06-02T20-03-59.774755.json with huggingface_hub
ed0c1ad
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-13b-Python-hf-sft/aimo_v00.18/aimo_kaggle/results_2024-06-02T20-03-07.489443.json with huggingface_hub
3a78d26
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-13b-Python-hf-sft/aimo_v00.10/aimo_kaggle_hard/results_2024-06-02T20-00-55.391068.json with huggingface_hub
5e19930
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-13b-Python-hf-sft/aimo_v00.10/aimo_kaggle_medium/results_2024-06-02T19-58-49.662787.json with huggingface_hub
45de243
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.03/aimo_kaggle_hard/results_2024-06-02T19-57-46.560546.json with huggingface_hub
e3db704
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/CodeLlama-13b-Python-hf-sft/aimo_v00.10/aimo_kaggle/results_2024-06-02T19-57-15.384420.json with huggingface_hub
8654275
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.08/aimo_kaggle_hard/results_2024-06-02T19-54-48.845623.json with huggingface_hub
ba98fde
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.03/aimo_kaggle_medium/results_2024-06-02T19-54-11.538221.json with huggingface_hub
19a7f3f
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/CodeLlama-13b-Python-hf-sft/aimo_v00.14/aimo_kaggle_hard/results_2024-06-02T19-53-35.519132.json with huggingface_hub
bd8c080
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.08/aimo_kaggle_medium/results_2024-06-02T19-51-32.490934.json with huggingface_hub
372e591
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/CodeLlama-13b-Python-hf-sft/aimo_v00.14/aimo_kaggle_medium/results_2024-06-02T19-47-54.814443.json with huggingface_hub
2042727
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-13b-Python-hf-sft/aimo_v00.14/aimo_kaggle/results_2024-06-02T19-46-44.922456.json with huggingface_hub
37de752
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.10/aimo_kaggle_hard/results_2024-06-02T19-41-43.508063.json with huggingface_hub
a30ecd2
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.10/aimo_kaggle_medium/results_2024-06-02T19-39-43.943937.json with huggingface_hub
917cc4c
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.01/aimo_kaggle_medium/results_2024-06-02T19-39-38.224181.json with huggingface_hub
3948a94
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.14/aimo_kaggle_hard/results_2024-06-02T19-39-23.654335.json with huggingface_hub
fb3afb4
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.01/aimo_kaggle_hard/results_2024-06-02T19-38-15.068105.json with huggingface_hub
9df11dc
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.19/aimo_kaggle_hard/results_2024-06-02T19-35-03.845660.json with huggingface_hub
f08e2aa
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.19/aimo_kaggle_medium/results_2024-06-02T19-33-00.537881.json with huggingface_hub
700b969
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl/main/aimo_kaggle_tora_medium/results_2024-06-02T19-30-48.388556.json with huggingface_hub
9e3cc1d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-13b-Python-hf-sft/aimo_v00.32/aimo_kaggle_medium/results_2024-06-02T19-16-14.022090.json with huggingface_hub
c1fbdbf
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-13b-Python-hf-sft/aimo_v00.32/aimo_kaggle_hard/results_2024-06-02T19-15-54.352849.json with huggingface_hub
de28287
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-13b-Python-hf-sft/aimo_v00.36/aimo_kaggle_hard/results_2024-06-02T19-13-23.663602.json with huggingface_hub
8c07027
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-13b-Python-hf-sft/aimo_v00.36/aimo_kaggle_medium/results_2024-06-02T19-13-06.737980.json with huggingface_hub
dd8396c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-13b-Python-hf-sft/aimo_v00.32/aimo_kaggle/results_2024-06-02T19-12-56.283293.json with huggingface_hub
abaf377
verified

edbeeching HF staff commited on