open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/orpo-explorers/mistral-7b-orpo-v2.0/main/ifeval/results_2024-04-15T05-53-54.501754.json with huggingface_hub
1f9fd18
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/mistral-7b-orpo-v0.0/main/ifeval/results_2024-04-15T05-52-49.125822.json with huggingface_hub
438db6d
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/mistral-7b-orpo-v2.0/main/agieval/results_2024-04-15T05-44-13.273757.json with huggingface_hub
542e83e
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/mistral-7b-orpo-v0.0/main/agieval/results_2024-04-15T05-44-04.161924.json with huggingface_hub
b8f383b
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/mistral-7b-orpo-v2.0/main/bbh/results_2024-04-15T05-42-31.597473.json with huggingface_hub
3eae87b
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/mistral-7b-orpo-v0.0/main/bbh/results_2024-04-15T05-42-00.602862.json with huggingface_hub
cc8fe93
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/mistral-7b-orpo-v3.0/main/ifeval/results_2024-04-14T20-36-55.973171.json with huggingface_hub
8962620
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/mistral-7b-orpo-v3.0/main/agieval/results_2024-04-14T20-28-32.975338.json with huggingface_hub
ddaafa7
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/mistral-7b-orpo-v3.0/main/bbh/results_2024-04-14T20-27-12.348807.json with huggingface_hub
2cc5784
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/mistral-7b-orpo-v1.0/main/ifeval/results_2024-04-14T20-23-01.492456.json with huggingface_hub
bc793dd
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/mistral-7b-orpo-v1.0/main/agieval/results_2024-04-14T20-15-26.437168.json with huggingface_hub
55311e3
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/mistral-7b-orpo-v1.0/main/bbh/results_2024-04-14T20-13-44.098182.json with huggingface_hub
72fe263
verified

lewtun HF staff commited on

Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.2/ifeval/results_2024-04-14T15-56-38.588846.json with huggingface_hub
1eed69f
verified

edbeeching HF staff commited on

Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.2/gsm8k/results_2024-04-14T15-29-17.543930.json with huggingface_hub
22ec489
verified

edbeeching HF staff commited on

Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.1/ifeval/results_2024-04-14T15-25-19.825529.json with huggingface_hub
e4cb942
verified

edbeeching HF staff commited on

Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.1/gsm8k/results_2024-04-14T15-10-24.321834.json with huggingface_hub
2d79842
verified

edbeeching HF staff commited on

Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.2/mmlu/results_2024-04-14T14-49-52.152190.json with huggingface_hub
54d6b51
verified

edbeeching HF staff commited on

Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.2/hellaswag/results_2024-04-14T14-12-35.699557.json with huggingface_hub
4d178ea
verified

edbeeching HF staff commited on

Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.1/mmlu/results_2024-04-14T14-08-28.050405.json with huggingface_hub
3efea34
verified

edbeeching HF staff commited on

Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.2/agieval/results_2024-04-14T13-40-15.764250.json with huggingface_hub
b2d7e7f
verified

edbeeching HF staff commited on

Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.1/hellaswag/results_2024-04-14T13-31-19.509663.json with huggingface_hub
745a1ac
verified

edbeeching HF staff commited on

Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.2/truthfulqa/results_2024-04-14T13-16-37.146644.json with huggingface_hub
bcebac4
verified

edbeeching HF staff commited on

Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.2/bbh/results_2024-04-14T13-12-19.351329.json with huggingface_hub
cfa5621
verified

edbeeching HF staff commited on

Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.2/arc/results_2024-04-14T13-11-24.629818.json with huggingface_hub
fc1ee87
verified

edbeeching HF staff commited on

Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.2/winogrande/results_2024-04-14T13-06-04.592120.json with huggingface_hub
bf55f3b
verified

edbeeching HF staff commited on

Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.1/agieval/results_2024-04-14T12-57-28.880808.json with huggingface_hub
eb6ec4c
verified

edbeeching HF staff commited on

Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.1/truthfulqa/results_2024-04-14T12-36-35.480410.json with huggingface_hub
d5bc32d
verified

edbeeching HF staff commited on

Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.1/bbh/results_2024-04-14T12-32-13.013639.json with huggingface_hub
30e42bd
verified

edbeeching HF staff commited on

Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.1/arc/results_2024-04-14T12-29-11.059162.json with huggingface_hub
fcd4027
verified

edbeeching HF staff commited on

Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.1/winogrande/results_2024-04-14T12-24-04.643448.json with huggingface_hub
d892612
verified

edbeeching HF staff commited on

Upload eval_results/orpo-explorers/Mixtral-8x22B-capybara-v0.3/main/ifeval/results_2024-04-11T17-51-46.583512.json with huggingface_hub
28f2375
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/Mixtral-8x22B-capybara-v0.2/main/ifeval/results_2024-04-11T15-32-24.008762.json with huggingface_hub
a65e067
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/Mixtral-8x22B-capybara-v0.2/main/agieval/results_2024-04-11T13-24-29.183943.json with huggingface_hub
a309480
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/Mixtral-8x22B-capybara-v0.3/main/agieval/results_2024-04-11T13-19-38.418934.json with huggingface_hub
aba4e4e
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/Mixtral-8x22B-capybara-v0.3/main/bbh/results_2024-04-11T12-24-53.725864.json with huggingface_hub
3f0f214
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/Mixtral-8x22B-capybara-v0.2/main/bbh/results_2024-04-11T12-23-21.420988.json with huggingface_hub
6eccf39
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/Mixtral-8x22B-capybara-v0.1/main/ifeval/results_2024-04-11T09-14-05.400570.json with huggingface_hub
3638843
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/Mixtral-8x22B-capybara-v0.1/main/agieval/results_2024-04-11T06-59-22.134906.json with huggingface_hub
1a6f459
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/Mixtral-8x22B-capybara-v0.1/main/bbh/results_2024-04-11T06-01-58.145964.json with huggingface_hub
3f4aa0d
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/Mistral-7B-capybara-v0.3/main/ifeval/results_2024-04-10T21-21-23.127994.json with huggingface_hub
e3a066e
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/Mistral-7B-capybara-v0.3/main/agieval/results_2024-04-10T21-14-21.591491.json with huggingface_hub
a1e3727
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/Mistral-7B-capybara-v0.3/main/bbh/results_2024-04-10T21-12-50.380362.json with huggingface_hub
151263a
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/Mistral-7B-capybara-v0.2/main/ifeval/results_2024-04-10T19-53-51.745306.json with huggingface_hub
b5aa26d
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/Mistral-7B-capybara-v0.2/main/agieval/results_2024-04-10T19-46-57.528981.json with huggingface_hub
8137e1e
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/Mistral-7B-capybara-v0.2/main/bbh/results_2024-04-10T19-45-32.032239.json with huggingface_hub
c72b1b3
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/Mistral-7B-capybara-v0.1/main/ifeval/results_2024-04-10T16-32-32.193221.json with huggingface_hub
4543408
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/Mistral-7B-capybara-v0.1/main/agieval/results_2024-04-10T16-25-46.375479.json with huggingface_hub
d60b535
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/Mistral-7B-capybara-v0.1/main/bbh/results_2024-04-10T16-24-14.917515.json with huggingface_hub
5132f89
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/Mistral-7B-v0.1-ORPO-Capybara-TRL-wo-pl/main/ifeval/results_2024-04-10T10-50-27.200179.json with huggingface_hub
3022d75
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/Mistral-7B-v0.1-ORPO-Capybara-TRL-w-pl/main/ifeval/results_2024-04-10T10-46-47.172367.json with huggingface_hub
ec6372d
verified

lewtun HF staff commited on