open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v00.01/main-step-000000100/gpqa/results_2025-02-09T18-16-39.336193.json with huggingface_hub
a4109d8
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v00.01/main-step-000000100/aime24/results_2025-02-09T18-16-04.104613.json with huggingface_hub
e297139
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.17/main-step-000000425/math_500/results_2025-02-09T18-15-20.803883.json with huggingface_hub
40a6b21
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.17/main-step-000000425/gpqa/results_2025-02-09T18-13-47.565282.json with huggingface_hub
3b40ef5
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000615/aime24/results_2025-02-09T18-13-36.298667.json with huggingface_hub
35d4fb7
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.17/main-step-000000425/aime24/results_2025-02-09T18-13-19.670188.json with huggingface_hub
221c277
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000615/gpqa/results_2025-02-09T18-13-11.502397.json with huggingface_hub
b484aa4
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000615/math_500/results_2025-02-09T18-11-52.792822.json with huggingface_hub
43bf427
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000560/math_500/results_2025-02-09T18-09-25.524439.json with huggingface_hub
b6e73c5
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000560/aime24/results_2025-02-09T18-09-26.454116.json with huggingface_hub
6dfe41c
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000520/gpqa/results_2025-02-09T18-06-59.043943.json with huggingface_hub
90c102c
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000520/aime24/results_2025-02-09T18-06-15.144414.json with huggingface_hub
7b64928
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000520/math_500/results_2025-02-09T18-05-56.120615.json with huggingface_hub
3aef575
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000600/math_500/results_2025-02-09T18-04-56.423721.json with huggingface_hub
74e3e0c
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000600/gpqa/results_2025-02-09T18-04-26.099400.json with huggingface_hub
305df20
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000600/aime24/results_2025-02-09T18-03-58.690620.json with huggingface_hub
0f05d55
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000580/gpqa/results_2025-02-09T17-57-55.625431.json with huggingface_hub
33b159c
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.17/main-step-000000400/math_500/results_2025-02-09T17-54-41.102551.json with huggingface_hub
1d8dacd
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000540/math_500/results_2025-02-09T17-54-30.071248.json with huggingface_hub
a2b070e
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.17/main-step-000000400/aime24/results_2025-02-09T17-54-01.852634.json with huggingface_hub
5043107
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000580/aime24/results_2025-02-09T17-53-34.960418.json with huggingface_hub
bdb9818
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000580/math_500/results_2025-02-09T17-52-12.073876.json with huggingface_hub
64219a9
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000540/gpqa/results_2025-02-09T17-49-07.267167.json with huggingface_hub
938aad6
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000480/aime24/results_2025-02-09T17-37-46.963788.json with huggingface_hub
cf1fd85
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000540/aime24/results_2025-02-09T17-48-17.658717.json with huggingface_hub
55c6560
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000500/math_500/results_2025-02-09T17-43-58.806527.json with huggingface_hub
b18a0bf
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000500/gpqa/results_2025-02-09T17-42-38.756526.json with huggingface_hub
5682a84
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000560/math_500/results_2025-02-09T17-42-26.706939.json with huggingface_hub
d702675
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000560/aime24/results_2025-02-09T17-42-07.809125.json with huggingface_hub
849fe9f
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000560/gpqa/results_2025-02-09T17-42-01.492959.json with huggingface_hub
a9589b1
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000500/aime24/results_2025-02-09T17-41-46.702705.json with huggingface_hub
c940ff2
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000540/aime24/results_2025-02-09T17-37-41.268815.json with huggingface_hub
6872b90
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000540/math_500/results_2025-02-09T17-37-31.650478.json with huggingface_hub
d756c2d
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000520/math_500/results_2025-02-09T17-37-31.120281.json with huggingface_hub
c8442d5
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000540/gpqa/results_2025-02-09T17-37-29.386493.json with huggingface_hub
55c5cfc
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000520/aime24/results_2025-02-09T17-37-29.426159.json with huggingface_hub
d3040d7
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000520/gpqa/results_2025-02-09T17-37-02.393793.json with huggingface_hub
02d7cc9
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000480/gpqa/results_2025-02-09T17-35-33.459604.json with huggingface_hub
6cc3b10
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.17/main-step-000000375/gpqa/results_2025-02-09T17-35-01.688199.json with huggingface_hub
0649815
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.17/main-step-000000375/math_500/results_2025-02-09T17-33-58.797215.json with huggingface_hub
9bcabf1
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.17/main-step-000000375/aime24/results_2025-02-09T17-32-28.901556.json with huggingface_hub
591cf29
verified

edbeeching HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.01/main-step-000000480/math_500/results_2025-02-09T17-31-56.266339.json with huggingface_hub
4eada79
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000500/aime24/results_2025-02-09T17-31-53.834623.json with huggingface_hub
5c637e7
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000500/math_500/results_2025-02-09T17-31-50.715064.json with huggingface_hub
5c2c41a
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000520/gpqa/results_2025-02-09T17-31-40.375052.json with huggingface_hub
e76597d
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.02/main-step-000000500/gpqa/results_2025-02-09T17-31-30.043143.json with huggingface_hub
c705a76
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000500/math_500/results_2025-02-09T17-31-26.201864.json with huggingface_hub
01a2936
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000520/math_500/results_2025-02-09T17-31-25.197359.json with huggingface_hub
e82009c
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000500/gpqa/results_2025-02-09T17-31-12.032194.json with huggingface_hub
0759416
verified

lewtun HF staff commited on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-7B-GRPO-LIMO-v02.00/main-step-000000520/aime24/results_2025-02-09T17-30-55.859323.json with huggingface_hub
15a269e
verified

lewtun HF staff commited on