open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.1/main/agieval/results_2024-04-30T07-11-40.984933.json with huggingface_hub
58b8555
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.1/main/agieval/results_2024-04-29T10-10-00.788731.json with huggingface_hub
9d2fcee
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch/main/agieval/results_2024-04-30T07-09-22.314160.json with huggingface_hub
08df480
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.05/main/bbh/results_2024-04-30T07-08-53.771754.json with huggingface_hub
ac39200
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.16/mini_math_v2/results_2024-04-30T05-16-36.771565.json with huggingface_hub
ce65131
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.16/aimo_kaggle/results_2024-04-30T05-07-48.551420.json with huggingface_hub
fc7f250
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.06/mini_math_v2/results_2024-04-30T03-05-51.163452.json with huggingface_hub
7605555
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.06/aimo_kaggle/results_2024-04-30T03-01-07.259895.json with huggingface_hub
d7ac1ee
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.22/mini_math_v2/results_2024-04-30T02-42-39.244083.json with huggingface_hub
f1a8b88
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.22/aimo_kaggle/results_2024-04-30T02-37-32.599587.json with huggingface_hub
2b319ab
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.11/mini_math_v2/results_2024-04-30T01-59-16.174382.json with huggingface_hub
2b8bd84
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.11/aimo_kaggle/results_2024-04-30T01-50-31.125806.json with huggingface_hub
79143ff
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.21/mini_math_v2/results_2024-04-30T01-21-46.645950.json with huggingface_hub
5f8916d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.21/aimo_kaggle/results_2024-04-30T01-17-53.501856.json with huggingface_hub
441dbe5
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.12/mini_math_v2/results_2024-04-30T01-09-05.115927.json with huggingface_hub
ed61634
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.39/mini_math_v2/results_2024-04-30T01-03-10.665017.json with huggingface_hub
798ca8c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.12/aimo_kaggle/results_2024-04-30T01-01-01.245040.json with huggingface_hub
1f78a6e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.39/aimo_kaggle/results_2024-04-30T00-54-26.896164.json with huggingface_hub
857c8a2
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.09/mini_math_v2/results_2024-04-30T00-04-15.407234.json with huggingface_hub
fb0b9ef
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.20/mini_math_v2/results_2024-04-30T00-00-35.769928.json with huggingface_hub
ec2ef74
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.09/aimo_kaggle/results_2024-04-29T23-54-21.693438.json with huggingface_hub
78a1ea3
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.20/aimo_kaggle/results_2024-04-29T23-52-35.032099.json with huggingface_hub
c99e520
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.08/aimo_kaggle/results_2024-04-29T23-51-16.342703.json with huggingface_hub
29ef150
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.23/mini_math_v2/results_2024-04-29T23-50-42.071132.json with huggingface_hub
ab1e349
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.23/aimo_kaggle/results_2024-04-29T23-40-14.254139.json with huggingface_hub
4286737
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.04/mini_math_v2/results_2024-04-29T23-32-30.318063.json with huggingface_hub
d4cd409
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.15/mini_math_v2/results_2024-04-29T23-29-01.738204.json with huggingface_hub
773a922
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.04/aimo_kaggle/results_2024-04-29T23-22-48.294970.json with huggingface_hub
2868ed6
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.15/aimo_kaggle/results_2024-04-29T23-19-54.451368.json with huggingface_hub
c2175c2
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.30/mini_math_v2/results_2024-04-29T23-18-28.034969.json with huggingface_hub
4817973
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.27/mini_math_v2/results_2024-04-29T23-15-13.075768.json with huggingface_hub
f2a7316
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.30/aimo_kaggle/results_2024-04-29T23-11-11.399807.json with huggingface_hub
3d567d2
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.27/aimo_kaggle/results_2024-04-29T23-09-51.090480.json with huggingface_hub
ae77c87
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.18/mini_math_v2/results_2024-04-29T22-56-11.506453.json with huggingface_hub
f226f6d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.29/mini_math_v2/results_2024-04-29T22-48-53.567460.json with huggingface_hub
9146f6d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.18/aimo_kaggle/results_2024-04-29T22-46-34.009351.json with huggingface_hub
50c0316
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.07/mini_math_v2/results_2024-04-29T22-46-24.177054.json with huggingface_hub
02e8812
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.07/aimo_kaggle/results_2024-04-29T22-39-17.784710.json with huggingface_hub
d7b3c20
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.05/mini_math_v2/results_2024-04-29T21-50-27.970315.json with huggingface_hub
dcf5524
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.31/mini_math_v2/results_2024-04-29T21-47-30.583806.json with huggingface_hub
ee0459b
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.13/mini_math_v2/results_2024-04-29T21-47-15.908535.json with huggingface_hub
1a7b167
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.00/mini_math_v2/results_2024-04-29T21-46-40.481169.json with huggingface_hub
b8710a0
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.00/aimo_kaggle/results_2024-04-29T21-43-41.701670.json with huggingface_hub
2f95b90
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.05/aimo_kaggle/results_2024-04-29T21-43-31.340988.json with huggingface_hub
dd9d5dc
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.31/aimo_kaggle/results_2024-04-29T21-40-29.137801.json with huggingface_hub
2c434cf
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.25/mini_math_v2/results_2024-04-29T21-39-19.800167.json with huggingface_hub
709eb6c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.13/aimo_kaggle/results_2024-04-29T21-37-49.467956.json with huggingface_hub
bb42885
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.10/mini_math_v2/results_2024-04-29T21-34-58.105508.json with huggingface_hub
1f7b48b
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.14/mini_math_v2/results_2024-04-29T21-34-25.358229.json with huggingface_hub
0f9ead8
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.25/aimo_kaggle/results_2024-04-29T21-31-32.624794.json with huggingface_hub
7de9612
verified

edbeeching HF staff commited on