open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.08/main-step-000000050/gpqa/results_2025-02-09T05-48-02.598952.json with huggingface_hub
a62ec38
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.07/main-step-000000075/math_500/results_2025-02-09T04-59-11.433133.json with huggingface_hub
603761e
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.07/main-step-000000075/gpqa/results_2025-02-09T04-58-01.069809.json with huggingface_hub
652f6c7
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-LIMO-v00.02/main-step-000000160/math_500/results_2025-02-09T02-44-56.854928.json with huggingface_hub
88d4de0
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-LIMO-v00.02/main-step-000000160/gpqa/results_2025-02-09T02-44-17.028777.json with huggingface_hub
f78e0a6
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-LIMO-v00.02/main-step-000000160/aime24/results_2025-02-09T02-44-03.222533.json with huggingface_hub
8d8da27
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.06/main-step-000000114/math_500/results_2025-02-09T02-36-44.146698.json with huggingface_hub
51fc187
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.06/main-step-000000114/aime24/results_2025-02-09T02-34-25.946747.json with huggingface_hub
dd4e64b
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.06/main-step-000000114/gpqa/results_2025-02-09T02-33-25.937639.json with huggingface_hub
e3bcd1e
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000453/math_500/results_2025-02-09T02-31-17.858922.json with huggingface_hub
eec6769
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000453/aime24/results_2025-02-09T02-29-29.172084.json with huggingface_hub
4af87ec
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000450/math_500/results_2025-02-09T02-28-35.850624.json with huggingface_hub
4b7a6e3
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000450/gpqa/results_2025-02-09T02-27-22.031257.json with huggingface_hub
4499ec8
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000450/aime24/results_2025-02-09T02-27-20.453605.json with huggingface_hub
da117a0
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000425/math_500/results_2025-02-09T02-06-30.087854.json with huggingface_hub
763d6a0
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000425/aime24/results_2025-02-09T02-06-08.691670.json with huggingface_hub
861cd2a
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000425/gpqa/results_2025-02-09T02-05-58.020260.json with huggingface_hub
6ae3377
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.07/main-step-000000050/math_500/results_2025-02-09T01-59-51.753907.json with huggingface_hub
6b56945
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.07/main-step-000000050/aime24/results_2025-02-09T01-57-54.710331.json with huggingface_hub
8e21b95
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.07/main-step-000000050/gpqa/results_2025-02-09T01-57-41.520292.json with huggingface_hub
e40388c
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.06/main-step-000000100/math_500/results_2025-02-09T01-46-37.245823.json with huggingface_hub
59fba83
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000400/math_500/results_2025-02-09T01-46-12.830353.json with huggingface_hub
7c71521
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.06/main-step-000000100/aime24/results_2025-02-09T01-45-33.238755.json with huggingface_hub
19572e7
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.06/main-step-000000100/gpqa/results_2025-02-09T01-44-50.361099.json with huggingface_hub
a5c4760
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000400/gpqa/results_2025-02-09T01-44-47.643691.json with huggingface_hub
caada0a
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000400/aime24/results_2025-02-09T01-44-37.266854.json with huggingface_hub
fbe8b89
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000375/math_500/results_2025-02-09T01-24-45.244747.json with huggingface_hub
6826fd6
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000375/aime24/results_2025-02-09T01-24-10.505828.json with huggingface_hub
1074055
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000375/gpqa/results_2025-02-09T01-23-39.242914.json with huggingface_hub
5c44412
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000350/math_500/results_2025-02-09T01-03-32.568858.json with huggingface_hub
a947c72
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000350/gpqa/results_2025-02-09T01-02-14.938002.json with huggingface_hub
d20365c
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000350/aime24/results_2025-02-09T01-02-16.844339.json with huggingface_hub
51fdfe8
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000325/math_500/results_2025-02-09T00-41-55.157683.json with huggingface_hub
cfe014b
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000325/aime24/results_2025-02-09T00-41-11.600339.json with huggingface_hub
e45c4b1
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000325/gpqa/results_2025-02-09T00-41-08.173580.json with huggingface_hub
189e059
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.08/main-step-000000025/math_500/results_2025-02-09T00-38-18.652825.json with huggingface_hub
29e5b25
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.08/main-step-000000025/aime24/results_2025-02-09T00-36-14.877552.json with huggingface_hub
e1682af
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.08/main-step-000000025/gpqa/results_2025-02-09T00-36-06.712840.json with huggingface_hub
3505488
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000300/math_500/results_2025-02-09T00-21-19.074479.json with huggingface_hub
51b664b
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.06/main-step-000000075/math_500/results_2025-02-09T00-21-11.440625.json with huggingface_hub
f520918
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.06/main-step-000000075/aime24/results_2025-02-09T00-20-30.478623.json with huggingface_hub
61a20d9
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000300/gpqa/results_2025-02-09T00-20-16.599496.json with huggingface_hub
194bfed
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000300/aime24/results_2025-02-09T00-20-12.429992.json with huggingface_hub
ad5d8c7
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.06/main-step-000000075/gpqa/results_2025-02-09T00-20-08.096943.json with huggingface_hub
439b98b
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000275/math_500/results_2025-02-08T23-59-54.645896.json with huggingface_hub
74f4c3c
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000275/gpqa/results_2025-02-08T23-59-27.495923.json with huggingface_hub
e808f38
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000250/math_500/results_2025-02-08T23-38-42.215081.json with huggingface_hub
cbca4b0
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000250/aime24/results_2025-02-08T23-37-30.114460.json with huggingface_hub
5cc10c8
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v00.05/main-step-000000250/gpqa/results_2025-02-08T23-37-27.146704.json with huggingface_hub
fceda48
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-LIMO-v00.02/main-step-000000140/math_500/results_2025-02-08T23-21-40.695977.json with huggingface_hub
10ff6da
verified

lewtun HF Staff committed on