open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.91/aimo_kaggle_medium/results_2024-06-09T10-36-28.927894.json with huggingface_hub
53732d5
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.66/aimo_kaggle_medium/results_2024-06-09T10-36-25.884247.json with huggingface_hub
6d98b46
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.20/aimo_kaggle_hard/results_2024-06-09T10-29-09.839820.json with huggingface_hub
f5b0ec0
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.20/aimo_kaggle_medium/results_2024-06-09T10-25-54.873057.json with huggingface_hub
e19a8d1
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.95/aimo_kaggle_hard/results_2024-06-09T10-25-26.870090.json with huggingface_hub
b8e3dfd
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.95/aimo_kaggle_medium/results_2024-06-09T10-21-20.175029.json with huggingface_hub
b4171d3
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.80/aimo_kaggle_hard/results_2024-06-09T10-15-31.175696.json with huggingface_hub
c1c640f
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.80/aimo_kaggle_medium/results_2024-06-09T10-14-01.100076.json with huggingface_hub
ba7b473
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.23/aimo_kaggle_hard/results_2024-06-09T10-08-48.165810.json with huggingface_hub
c9306de
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.71/aimo_kaggle_hard/results_2024-06-09T10-02-20.529009.json with huggingface_hub
be7723b
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.71/aimo_kaggle_medium/results_2024-06-09T10-01-56.231691.json with huggingface_hub
eb3c2b1
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/qwen-14b-sft/aimo_v00.23/aimo_kaggle_medium/results_2024-06-09T10-00-56.801358.json with huggingface_hub
e74480b
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.64/aimo_kaggle_hard/results_2024-06-09T09-52-35.682832.json with huggingface_hub
f97d438
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.64/aimo_kaggle_medium/results_2024-06-09T09-50-23.171888.json with huggingface_hub
48c7bd8
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.88/aimo_kaggle_hard/results_2024-06-09T09-43-47.123570.json with huggingface_hub
9b457bd
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.88/aimo_kaggle_medium/results_2024-06-09T09-41-34.082924.json with huggingface_hub
cad90ca
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.49/aimo_kaggle_hard/results_2024-06-09T09-37-32.559097.json with huggingface_hub
173c7be
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.49/aimo_kaggle_medium/results_2024-06-09T09-36-17.506567.json with huggingface_hub
e6680b8
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.75/aimo_kaggle_hard/results_2024-06-09T09-33-56.256092.json with huggingface_hub
3bc6feb
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.75/aimo_kaggle_medium/results_2024-06-09T09-31-59.894996.json with huggingface_hub
8ee3f7a
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.57/aimo_kaggle_hard/results_2024-06-09T09-30-58.592522.json with huggingface_hub
582dd38
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.57/aimo_kaggle_medium/results_2024-06-09T09-28-16.887450.json with huggingface_hub
3a224f9
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.47/aimo_kaggle_hard/results_2024-06-09T09-25-42.967711.json with huggingface_hub
c13f20d
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.47/aimo_kaggle_medium/results_2024-06-09T09-24-06.130835.json with huggingface_hub
ceb5025
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.85/aimo_kaggle_hard/results_2024-06-09T09-19-42.549440.json with huggingface_hub
871b09c
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.85/aimo_kaggle_medium/results_2024-06-09T09-18-55.325649.json with huggingface_hub
025ef0e
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.56/aimo_kaggle_hard/results_2024-06-09T09-02-22.762929.json with huggingface_hub
4cce876
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.56/aimo_kaggle_medium/results_2024-06-09T09-01-20.750905.json with huggingface_hub
120ef1b
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.63/aimo_kaggle_hard/results_2024-06-09T09-00-26.840062.json with huggingface_hub
0a38b6e
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.63/aimo_kaggle_medium/results_2024-06-09T08-57-37.858954.json with huggingface_hub
1b24e33
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.72/aimo_kaggle_hard/results_2024-06-09T08-47-38.552685.json with huggingface_hub
be61790
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.72/aimo_kaggle_medium/results_2024-06-09T08-45-38.440327.json with huggingface_hub
acd2e94
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.50/aimo_kaggle_medium/results_2024-06-09T08-39-57.934404.json with huggingface_hub
7099751
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.50/aimo_kaggle_hard/results_2024-06-09T08-39-20.518256.json with huggingface_hub
e25a52a
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.10/aimo_kaggle_hard/results_2024-06-09T08-22-28.687858.json with huggingface_hub
2ec8366
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.10/aimo_kaggle_medium/results_2024-06-09T08-21-17.441264.json with huggingface_hub
b17f427
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.62/aimo_kaggle_hard/results_2024-06-09T08-06-52.625696.json with huggingface_hub
747f2dd
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.62/aimo_kaggle_medium/results_2024-06-09T08-05-46.855329.json with huggingface_hub
31735cc
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.18/aimo_kaggle_medium/results_2024-06-09T08-03-53.900804.json with huggingface_hub
32fb34a
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.18/aimo_kaggle_hard/results_2024-06-09T08-03-31.527206.json with huggingface_hub
58e03ef
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.74/aimo_kaggle_hard/results_2024-06-09T08-01-48.193586.json with huggingface_hub
ee050fb
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.74/aimo_kaggle_medium/results_2024-06-09T08-01-12.031761.json with huggingface_hub
7a5812e
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.14/aimo_kaggle_hard/results_2024-06-09T08-00-18.794056.json with huggingface_hub
2a83b47
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.14/aimo_kaggle_medium/results_2024-06-09T07-58-20.498815.json with huggingface_hub
230bcbc
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.58/aimo_kaggle_hard/results_2024-06-09T07-55-14.964657.json with huggingface_hub
3bcb878
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.58/aimo_kaggle_medium/results_2024-06-09T07-53-50.377909.json with huggingface_hub
07040f5
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.51/aimo_kaggle_hard/results_2024-06-09T07-52-37.868001.json with huggingface_hub
503ff01
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.51/aimo_kaggle_medium/results_2024-06-09T07-49-57.770574.json with huggingface_hub
d8c2441
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.77/aimo_kaggle_hard/results_2024-06-09T07-46-28.714990.json with huggingface_hub
129a9d9
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.77/aimo_kaggle_medium/results_2024-06-09T07-45-07.253381.json with huggingface_hub
f6e3734
verified

edbeeching HF staff committed on