open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.14/mini_math_v2/results_2024-04-29T21-34-25.358229.json with huggingface_hub
0f9ead8
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.25/aimo_kaggle/results_2024-04-29T21-31-32.624794.json with huggingface_hub
7de9612
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.14/aimo_kaggle/results_2024-04-29T21-27-59.870726.json with huggingface_hub
e6c9858
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.10/aimo_kaggle/results_2024-04-29T21-25-03.633141.json with huggingface_hub
6786a93
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.17/mini_math_v2/results_2024-04-29T21-11-11.105505.json with huggingface_hub
a1dd73d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.17/aimo_kaggle/results_2024-04-29T21-00-28.299808.json with huggingface_hub
48037af
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.03/mini_math_v2/results_2024-04-29T19-25-47.349255.json with huggingface_hub
7a8a16b
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.03/aimo_kaggle/results_2024-04-29T19-18-49.129733.json with huggingface_hub
fbccfc0
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/Eurus-7b-sft/aimo_v01.00/mini_math_v2_pot/results_2024-04-29T18-56-30.425559.json with huggingface_hub
47e5379
verified

kashif HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.02/mini_math_v2/results_2024-04-29T18-53-10.349482.json with huggingface_hub
9847b90
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.02/aimo_kaggle/results_2024-04-29T18-41-14.055385.json with huggingface_hub
0ed89ec
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.01/mini_math_v2/results_2024-04-29T17-24-38.693499.json with huggingface_hub
5cda036
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/Eurus-7b-sft/aimo_v01.00/mini_math_v2_pot/results_2024-04-29T17-13-28.855058.json with huggingface_hub
b56bbce
verified

kashif HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.01/aimo_kaggle/results_2024-04-29T17-13-14.904093.json with huggingface_hub
430a2ec
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/Eurus-7b-sft/aimo_v01.00/aimo_kaggle_pot/results_2024-04-29T16-32-29.451670.json with huggingface_hub
fb80fb7
verified

kashif HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.40/mini_math_v2/results_2024-04-29T14-54-43.079462.json with huggingface_hub
e40b8c2
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v00.00/math_v2/results_2024-04-29T14-39-12.406581.json with huggingface_hub
a141fc8
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v00.01/math_v2/results_2024-04-29T14-03-18.023791.json with huggingface_hub
0b7bb81
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/Eurus-7b-sft/aimo_v01.00/mini_math_v2_cot/results_2024-04-29T13-29-15.050137.json with huggingface_hub
8836bbc
verified

kashif HF staff commited on

Upload eval_results/AI-MO/Eurus-7b-sft/aimo_v01.00/aimo_kaggle_cot/results_2024-04-29T13-03-30.759017.json with huggingface_hub
caae418
verified

kashif HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Mathcode-2epoch-ohp-15k-strat-1-1epoch/main/ifeval/results_2024-04-29T10-26-31.639549.json with huggingface_hub
d51bb07
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Mathcode-1epoch-ohp-15k-strat-1-2epoch/main/ifeval/results_2024-04-29T10-25-44.870513.json with huggingface_hub
a9e074d
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch-capybara-1epoch/main/ifeval/results_2024-04-29T10-24-41.782918.json with huggingface_hub
4c8feb3
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-1epoch-math-2epoch/main/ifeval/results_2024-04-29T10-24-16.304411.json with huggingface_hub
819f78d
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta0.05-2epoch-ohp-15k-strat-1-beta0.2-1epoch/main/ifeval/results_2024-04-29T10-23-46.422894.json with huggingface_hub
f14d9df
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch-math-1epoch/main/ifeval/results_2024-04-29T10-23-39.363862.json with huggingface_hub
7455052
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta0.05-1epoch-ohp-15k-strat-1-beta0.2-2epoch/main/ifeval/results_2024-04-29T10-22-48.705080.json with huggingface_hub
331a203
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-1epoch-capybara-2epoch/main/ifeval/results_2024-04-29T10-22-06.612661.json with huggingface_hub
e8a171f
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2/main/ifeval/results_2024-04-29T10-21-39.119505.json with huggingface_hub
9224f42
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.1/main/ifeval/results_2024-04-29T10-21-28.056053.json with huggingface_hub
9253413
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch/main/ifeval/results_2024-04-29T10-19-54.552148.json with huggingface_hub
fcc4d54
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.05/main/ifeval/results_2024-04-29T10-19-42.213798.json with huggingface_hub
d2e47ff
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-1epoch/main/ifeval/results_2024-04-29T10-19-13.447992.json with huggingface_hub
7f5bf74
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.05/main/ifeval/results_2024-04-29T10-18-59.247695.json with huggingface_hub
7479df6
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.2/main/ifeval/results_2024-04-29T10-18-47.393352.json with huggingface_hub
2be22ba
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.1/main/ifeval/results_2024-04-29T10-18-20.798552.json with huggingface_hub
811aa27
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.05-1epoch/main/ifeval/results_2024-04-29T10-17-48.555715.json with huggingface_hub
28323e8
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.05-2epoch/main/ifeval/results_2024-04-29T10-17-25.835438.json with huggingface_hub
4283692
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Mathcode-2epoch-ohp-15k-strat-1-1epoch/main/agieval/results_2024-04-29T10-16-38.511689.json with huggingface_hub
3cbd398
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Mathcode-1epoch-ohp-15k-strat-1-2epoch/main/agieval/results_2024-04-29T10-16-19.112706.json with huggingface_hub
8c84bd0
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch-capybara-1epoch/main/agieval/results_2024-04-29T10-15-31.076717.json with huggingface_hub
338848e
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch-math-1epoch/main/agieval/results_2024-04-29T10-15-08.162984.json with huggingface_hub
4e2c70c
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta0.05-2epoch-ohp-15k-strat-1-beta0.2-1epoch/main/agieval/results_2024-04-29T10-14-50.366272.json with huggingface_hub
732585a
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-1epoch-math-2epoch/main/agieval/results_2024-04-29T10-14-33.662520.json with huggingface_hub
17df96e
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta0.05-1epoch-ohp-15k-strat-1-beta0.2-2epoch/main/agieval/results_2024-04-29T10-14-27.170182.json with huggingface_hub
ce27413
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-1epoch-capybara-2epoch/main/agieval/results_2024-04-29T10-14-19.555573.json with huggingface_hub
e8a7d2c
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Mathcode-1epoch-ohp-15k-strat-1-2epoch/main/bbh/results_2024-04-29T10-14-32.462450.json with huggingface_hub
705a764
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Mathcode-2epoch-ohp-15k-strat-1-1epoch/main/bbh/results_2024-04-29T10-14-29.750473.json with huggingface_hub
c3496f3
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2/main/agieval/results_2024-04-29T10-13-31.084828.json with huggingface_hub
d2cd107
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch-math-1epoch/main/bbh/results_2024-04-29T10-13-58.641639.json with huggingface_hub
33868ad
verified

lewtun HF staff commited on