open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/alpaca_eval/results_2024-04-30T16-47-29.json with huggingface_hub
0ff2e8c
verified

lewtun HF staff committed on

Upload eval_results/openbmb/Eurus-7b-sft/main/mini_math_v2_cot/results_2024-04-30T09-30-28.546746.json with huggingface_hub
b924051
verified

kashif HF staff committed on

Upload eval_results/openbmb/Eurus-7b-sft/main/aimo_kaggle_cot/results_2024-04-30T09-13-09.617333.json with huggingface_hub
4c95547
verified

kashif HF staff committed on

Upload eval_results/openbmb/Eurus-7b-sft/main/aimo_kaggle_pot/results_2024-04-30T09-01-43.293808.json with huggingface_hub
8120af4
verified

kashif HF staff committed on

Upload eval_results/openbmb/Eurus-7b-sft/main/mini_math_v2_pot/results_2024-04-30T08-53-06.445146.json with huggingface_hub
8daa67f
verified

kashif HF staff committed on

Upload eval_results/openbmb/Eurus-7b-sft/main/mini_math_v2_pot/results_2024-04-30T08-44-25.864782.json with huggingface_hub
3d89280
verified

kashif HF staff committed on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-1-2-3-beta-0.2-epoch-3/main/mmlu/results_2024-04-30T07-31-14.764120.json with huggingface_hub
bc899ba
verified

lewtun HF staff committed on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-3-beta-0.2-epoch-1/main/mmlu/results_2024-04-30T07-30-47.581731.json with huggingface_hub
aaabe84
verified

lewtun HF staff committed on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-3-beta-0.2-epoch-1/main/ifeval/results_2024-04-30T07-30-38.066174.json with huggingface_hub
a481d61
verified

lewtun HF staff committed on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-1-2-3-beta-0.2-epoch-3/main/ifeval/results_2024-04-30T07-29-36.522189.json with huggingface_hub
816ef05
verified

lewtun HF staff committed on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-1-2-3-beta-0.2-epoch-3/main/gsm8k/results_2024-04-30T07-24-14.540105.json with huggingface_hub
0c73a45
verified

lewtun HF staff committed on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-3-beta-0.2-epoch-1/main/gsm8k/results_2024-04-30T07-23-58.557875.json with huggingface_hub
68823e1
verified

lewtun HF staff committed on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-1-2-3-beta-0.2-epoch-3/main/agieval/results_2024-04-30T07-21-47.713758.json with huggingface_hub
335c220
verified

lewtun HF staff committed on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-3-beta-0.2-epoch-1/main/agieval/results_2024-04-30T07-21-14.889377.json with huggingface_hub
d94d9ca
verified

lewtun HF staff committed on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-1-2-3-beta-0.2-epoch-3/main/bbh/results_2024-04-30T07-19-53.540998.json with huggingface_hub
e1b4e8d
verified

lewtun HF staff committed on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-3-beta-0.2-epoch-1/main/bbh/results_2024-04-30T07-19-31.330635.json with huggingface_hub
cab3eaf
verified

lewtun HF staff committed on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.05-1epoch/main/ifeval/results_2024-04-30T07-16-28.966848.json with huggingface_hub
40078bf
verified

lewtun HF staff committed on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.05-1epoch/main/ifeval/results_2024-04-29T10-18-44.402139.json with huggingface_hub
0dfd5b6
verified

lewtun HF staff committed on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.1/main/agieval/results_2024-04-30T07-11-40.984933.json with huggingface_hub
58b8555
verified

lewtun HF staff committed on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.1/main/agieval/results_2024-04-29T10-10-00.788731.json with huggingface_hub
9d2fcee
verified

lewtun HF staff committed on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch/main/agieval/results_2024-04-30T07-09-22.314160.json with huggingface_hub
08df480
verified

lewtun HF staff committed on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.05/main/bbh/results_2024-04-30T07-08-53.771754.json with huggingface_hub
ac39200
verified

lewtun HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.16/mini_math_v2/results_2024-04-30T05-16-36.771565.json with huggingface_hub
ce65131
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.16/aimo_kaggle/results_2024-04-30T05-07-48.551420.json with huggingface_hub
fc7f250
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.06/mini_math_v2/results_2024-04-30T03-05-51.163452.json with huggingface_hub
7605555
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.06/aimo_kaggle/results_2024-04-30T03-01-07.259895.json with huggingface_hub
d7ac1ee
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.22/mini_math_v2/results_2024-04-30T02-42-39.244083.json with huggingface_hub
f1a8b88
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.22/aimo_kaggle/results_2024-04-30T02-37-32.599587.json with huggingface_hub
2b319ab
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.11/mini_math_v2/results_2024-04-30T01-59-16.174382.json with huggingface_hub
2b8bd84
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.11/aimo_kaggle/results_2024-04-30T01-50-31.125806.json with huggingface_hub
79143ff
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.21/mini_math_v2/results_2024-04-30T01-21-46.645950.json with huggingface_hub
5f8916d
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.21/aimo_kaggle/results_2024-04-30T01-17-53.501856.json with huggingface_hub
441dbe5
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.12/mini_math_v2/results_2024-04-30T01-09-05.115927.json with huggingface_hub
ed61634
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.39/mini_math_v2/results_2024-04-30T01-03-10.665017.json with huggingface_hub
798ca8c
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.12/aimo_kaggle/results_2024-04-30T01-01-01.245040.json with huggingface_hub
1f78a6e
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.39/aimo_kaggle/results_2024-04-30T00-54-26.896164.json with huggingface_hub
857c8a2
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.09/mini_math_v2/results_2024-04-30T00-04-15.407234.json with huggingface_hub
fb0b9ef
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.20/mini_math_v2/results_2024-04-30T00-00-35.769928.json with huggingface_hub
ec2ef74
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.09/aimo_kaggle/results_2024-04-29T23-54-21.693438.json with huggingface_hub
78a1ea3
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.20/aimo_kaggle/results_2024-04-29T23-52-35.032099.json with huggingface_hub
c99e520
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.08/aimo_kaggle/results_2024-04-29T23-51-16.342703.json with huggingface_hub
29ef150
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.23/mini_math_v2/results_2024-04-29T23-50-42.071132.json with huggingface_hub
ab1e349
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.23/aimo_kaggle/results_2024-04-29T23-40-14.254139.json with huggingface_hub
4286737
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.04/mini_math_v2/results_2024-04-29T23-32-30.318063.json with huggingface_hub
d4cd409
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.15/mini_math_v2/results_2024-04-29T23-29-01.738204.json with huggingface_hub
773a922
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.04/aimo_kaggle/results_2024-04-29T23-22-48.294970.json with huggingface_hub
2868ed6
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.15/aimo_kaggle/results_2024-04-29T23-19-54.451368.json with huggingface_hub
c2175c2
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.30/mini_math_v2/results_2024-04-29T23-18-28.034969.json with huggingface_hub
4817973
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.27/mini_math_v2/results_2024-04-29T23-15-13.075768.json with huggingface_hub
f2a7316
verified

edbeeching HF staff committed on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v04.30/aimo_kaggle/results_2024-04-29T23-11-11.399807.json with huggingface_hub
3d567d2
verified

edbeeching HF staff committed on