open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.07/main-step-000000200/aime24/results_2025-02-09T19-41-13.115064.json with huggingface_hub
2a577ea
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000660/math_500/results_2025-02-09T19-37-15.400366.json with huggingface_hub
cb5f059
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000660/gpqa/results_2025-02-09T19-36-56.772757.json with huggingface_hub
3d21aa0
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000660/aime24/results_2025-02-09T19-36-52.263684.json with huggingface_hub
a9c7d4f
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000600/math_500/results_2025-02-09T19-28-28.228020.json with huggingface_hub
8e8c9b0
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000600/gpqa/results_2025-02-09T19-26-10.279530.json with huggingface_hub
ab34a55
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000600/aime24/results_2025-02-09T19-25-22.976601.json with huggingface_hub
2260c3d
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000640/gpqa/results_2025-02-09T19-22-03.933966.json with huggingface_hub
0942f43
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000640/aime24/results_2025-02-09T19-20-47.819205.json with huggingface_hub
2c7392b
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000640/math_500/results_2025-02-09T19-18-47.919180.json with huggingface_hub
f181032
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.18/main-step-000000114/math_500/results_2025-02-09T19-18-23.565237.json with huggingface_hub
f78394d
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.18/main-step-000000114/gpqa/results_2025-02-09T19-18-13.882441.json with huggingface_hub
5fbd107
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.08/main-step-000000125/aime24/results_2025-02-09T19-17-53.234327.json with huggingface_hub
a8fc7ba
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.18/main-step-000000114/aime24/results_2025-02-09T19-17-27.164525.json with huggingface_hub
c141ae6
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.08/main-step-000000125/gpqa/results_2025-02-09T19-16-47.816383.json with huggingface_hub
64b3221
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.15/main-step-000000100/gpqa/results_2025-02-09T19-12-57.360146.json with huggingface_hub
7992c26
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.15/main-step-000000100/aime24/results_2025-02-09T19-11-38.442717.json with huggingface_hub
0bb1033
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000580/math_500/results_2025-02-09T19-08-20.774905.json with huggingface_hub
dc14e84
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000580/gpqa/results_2025-02-09T19-07-54.858438.json with huggingface_hub
a7439e5
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000620/math_500/results_2025-02-09T19-06-57.398553.json with huggingface_hub
59e0930
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000580/aime24/results_2025-02-09T19-06-16.088850.json with huggingface_hub
6aa92dd
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000620/gpqa/results_2025-02-09T19-01-56.421399.json with huggingface_hub
ca163e1
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000620/aime24/results_2025-02-09T19-01-40.495482.json with huggingface_hub
bd03d3a
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.11/main-step-000000100/math_500/results_2025-02-09T18-57-45.857252.json with huggingface_hub
5ca4a33
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.11/main-step-000000100/aime24/results_2025-02-09T18-56-39.660251.json with huggingface_hub
25834e3
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000560/gpqa/results_2025-02-09T18-47-30.496310.json with huggingface_hub
80d032c
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000560/math_500/results_2025-02-09T18-45-30.787353.json with huggingface_hub
6f8c52b
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000560/aime24/results_2025-02-09T18-45-17.207421.json with huggingface_hub
9560ad6
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000600/gpqa/results_2025-02-09T18-45-08.391133.json with huggingface_hub
8e58e15
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000600/math_500/results_2025-02-09T18-43-36.160611.json with huggingface_hub
25a2db8
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000600/aime24/results_2025-02-09T18-42-47.570969.json with huggingface_hub
3aa0fb8
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.19/main-step-000000050/math_500/results_2025-02-09T18-41-01.844034.json with huggingface_hub
1f478f5
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.19/main-step-000000050/aime24/results_2025-02-09T18-39-54.313040.json with huggingface_hub
e787e70
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.19/main-step-000000050/gpqa/results_2025-02-09T18-39-37.337910.json with huggingface_hub
cd06fe6
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.17/main-step-000000453/math_500/results_2025-02-09T18-37-29.171222.json with huggingface_hub
52ab814
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.17/main-step-000000453/gpqa/results_2025-02-09T18-36-20.497580.json with huggingface_hub
2d6aef4
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.17/main-step-000000453/aime24/results_2025-02-09T18-35-28.849275.json with huggingface_hub
123dd5d
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.17/main-step-000000450/math_500/results_2025-02-09T18-34-16.424151.json with huggingface_hub
e8a01f0
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.17/main-step-000000450/aime24/results_2025-02-09T18-33-36.140604.json with huggingface_hub
c1c2691
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.18/main-step-000000100/math_500/results_2025-02-09T18-30-49.972021.json with huggingface_hub
2dd431b
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.18/main-step-000000100/aime24/results_2025-02-09T18-30-38.071773.json with huggingface_hub
7fe9c07
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000580/gpqa/results_2025-02-09T18-29-56.626543.json with huggingface_hub
6c2a417
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.18/main-step-000000100/gpqa/results_2025-02-09T18-29-42.446545.json with huggingface_hub
7841139
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000580/math_500/results_2025-02-09T18-28-14.434322.json with huggingface_hub
044a3ee
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000580/aime24/results_2025-02-09T18-28-03.375083.json with huggingface_hub
ed366f7
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000540/math_500/results_2025-02-09T18-25-29.943613.json with huggingface_hub
85c7bd6
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000560/gpqa/results_2025-02-09T18-14-11.229098.json with huggingface_hub
874b895
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000540/gpqa/results_2025-02-09T18-24-19.946716.json with huggingface_hub
fe2925e
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000540/aime24/results_2025-02-09T18-23-58.725055.json with huggingface_hub
4f6e791
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v00.01/main-step-000000100/math_500/results_2025-02-09T18-16-43.930923.json with huggingface_hub
2d07480
verified

lewtun HF staff commited on