open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Llama-8B/main/aime25_part1/results_2025-02-10T14-25-44.819964.json with huggingface_hub
7d42b7e
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.19/main-step-000000200/math_500/results_2025-02-10T13-04-08.704563.json with huggingface_hub
3151cf3
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.19/main-step-000000200/gpqa/results_2025-02-10T13-02-58.838191.json with huggingface_hub
3543a02
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.19/main-step-000000200/aime24/results_2025-02-10T13-02-58.213467.json with huggingface_hub
6946fd9
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.16/main-step-000000150/aime24/results_2025-02-10T12-02-28.019665.json with huggingface_hub
bd97bd8
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.12/main-step-000000150/aime24/results_2025-02-10T11-16-36.506103.json with huggingface_hub
23548a9
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.12/main-step-000000150/math_500/results_2025-02-10T11-16-21.763577.json with huggingface_hub
21745bf
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.12/main-step-000000150/gpqa/results_2025-02-10T11-14-44.024514.json with huggingface_hub
ea4b77d
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.08/main-step-000000200/math_500/results_2025-02-10T10-46-08.084089.json with huggingface_hub
4c7f235
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.08/main-step-000000200/gpqa/results_2025-02-10T10-44-18.855777.json with huggingface_hub
ce4b0fb
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.08/main-step-000000200/aime24/results_2025-02-10T10-44-13.954665.json with huggingface_hub
5400461
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.15/main-step-000000227/math_500/results_2025-02-10T10-36-02.315232.json with huggingface_hub
8abdcbc
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.15/main-step-000000227/gpqa/results_2025-02-10T10-35-34.349032.json with huggingface_hub
16a04dd
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.15/main-step-000000227/aime24/results_2025-02-10T10-35-02.676984.json with huggingface_hub
ec2673b
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.15/main-step-000000225/gpqa/results_2025-02-10T10-19-53.114292.json with huggingface_hub
cc28b63
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.15/main-step-000000225/math_500/results_2025-02-10T10-20-33.669529.json with huggingface_hub
c1dc0c6
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.15/main-step-000000225/aime24/results_2025-02-10T10-18-53.116618.json with huggingface_hub
81c0d1d
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.11/main-step-000000227/math_500/results_2025-02-10T10-02-13.578716.json with huggingface_hub
2373bf0
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.11/main-step-000000227/aime24/results_2025-02-10T10-01-10.649296.json with huggingface_hub
f149827
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.11/main-step-000000227/gpqa/results_2025-02-10T10-01-00.634842.json with huggingface_hub
7faf257
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.19/main-step-000000175/math_500/results_2025-02-10T10-00-02.564666.json with huggingface_hub
38171aa
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.19/main-step-000000175/aime24/results_2025-02-10T09-59-46.916506.json with huggingface_hub
0537872
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.19/main-step-000000175/gpqa/results_2025-02-10T09-58-57.850164.json with huggingface_hub
64b15a5
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.11/main-step-000000225/aime24/results_2025-02-10T09-48-00.568109.json with huggingface_hub
a785893
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.11/main-step-000000225/math_500/results_2025-02-10T09-47-54.576070.json with huggingface_hub
67bbd02
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.11/main-step-000000225/gpqa/results_2025-02-10T09-47-01.186733.json with huggingface_hub
10cdb11
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v00.00/main-step-000000200/gpqa/results_2025-02-10T08-25-08.858638.json with huggingface_hub
30b39c5
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.16/main-step-000000125/gpqa/results_2025-02-10T07-14-21.022877.json with huggingface_hub
2d82297
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.16/main-step-000000125/math_500/results_2025-02-10T07-14-13.572863.json with huggingface_hub
22027cb
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.16/main-step-000000125/aime24/results_2025-02-10T07-13-13.305402.json with huggingface_hub
b0d7d75
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.15/main-step-000000200/math_500/results_2025-02-10T07-12-59.080924.json with huggingface_hub
2d65eb4
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.15/main-step-000000200/aime24/results_2025-02-10T07-12-28.123915.json with huggingface_hub
4c33a0a
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.15/main-step-000000200/gpqa/results_2025-02-10T07-12-19.400193.json with huggingface_hub
86b5032
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.19/main-step-000000150/math_500/results_2025-02-10T06-59-51.220609.json with huggingface_hub
0d10da9
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.19/main-step-000000150/gpqa/results_2025-02-10T06-58-47.293921.json with huggingface_hub
3f4c019
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.19/main-step-000000150/aime24/results_2025-02-10T06-58-17.138801.json with huggingface_hub
dfe9e85
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.11/main-step-000000200/math_500/results_2025-02-10T06-43-21.364875.json with huggingface_hub
e59f0c6
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.11/main-step-000000200/gpqa/results_2025-02-10T06-42-09.781083.json with huggingface_hub
482ac54
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.12/main-step-000000125/math_500/results_2025-02-10T06-41-56.221025.json with huggingface_hub
75aba0b
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.11/main-step-000000200/aime24/results_2025-02-10T06-41-46.508146.json with huggingface_hub
1ebba94
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.12/main-step-000000125/gpqa/results_2025-02-10T06-40-24.048581.json with huggingface_hub
bdfbca8
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.12/main-step-000000125/aime24/results_2025-02-10T06-40-08.504151.json with huggingface_hub
996c474
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000001227/math_500/results_2025-02-10T06-24-30.151934.json with huggingface_hub
c4ba71d
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000001227/gpqa/results_2025-02-10T06-22-30.346597.json with huggingface_hub
f7cb860
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000001227/aime24/results_2025-02-10T06-22-10.280014.json with huggingface_hub
6c73a85
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000001220/math_500/results_2025-02-10T06-16-50.923236.json with huggingface_hub
9a7a676
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000001220/gpqa/results_2025-02-10T06-14-28.936384.json with huggingface_hub
6fd7715
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000001220/aime24/results_2025-02-10T06-13-57.388006.json with huggingface_hub
531ef3f
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v00.02/main-step-000000060/aime24/results_2025-02-10T06-08-11.513964.json with huggingface_hub
b83b629
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v00.02/main-step-000000060/math_500/results_2025-02-10T06-07-52.223343.json with huggingface_hub
85fcaac
verified

lewtun HF staff commited on