open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.52/winogrande/results_2024-03-07T11-25-42.859916.json with huggingface_hub
4e027de
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.50/ifeval/results_2024-03-07T11-24-54.075881.json with huggingface_hub
24111b9
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.47/hellaswag/results_2024-03-07T11-21-40.723095.json with huggingface_hub
4f7378e
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.50/hellaswag/results_2024-03-07T11-20-13.011059.json with huggingface_hub
7cc3ce2
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.51/arc/results_2024-03-07T11-19-45.457331.json with huggingface_hub
507dc45
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.51/truthfulqa/results_2024-03-07T11-19-29.666574.json with huggingface_hub
ea5afca
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.51/winogrande/results_2024-03-07T11-18-42.958760.json with huggingface_hub
609f2e0
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.47/arc/results_2024-03-07T11-15-05.332425.json with huggingface_hub
7fa02dc
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.47/truthfulqa/results_2024-03-07T11-14-54.540653.json with huggingface_hub
c080cb3
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.47/winogrande/results_2024-03-07T11-14-25.354707.json with huggingface_hub
0f8779c
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.50/arc/results_2024-03-07T11-13-34.995315.json with huggingface_hub
7eafc1d
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.50/truthfulqa/results_2024-03-07T11-13-22.520571.json with huggingface_hub
c67c6f7
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.50/winogrande/results_2024-03-07T11-12-55.238310.json with huggingface_hub
cf15cbf
verified

edbeeching HF staff commited on

Upload eval_results/alignment-handbook/zephyr-2b-gemma-dpo-mix7-beta-0.05-epoch-6/main/mmlu/results_2024-03-07T11-11-18.431956.json with huggingface_hub
46bdccb
verified

lewtun HF staff commited on

Upload eval_results/alignment-handbook/zephyr-2b-gemma-dpo-mix7-beta-0.05-epoch-6/main/gsm8k/results_2024-03-07T11-09-56.244336.json with huggingface_hub
a570ae3
verified

lewtun HF staff commited on

Upload eval_results/alignment-handbook/zephyr-2b-gemma-dpo-mix7-beta-0.05-epoch-6/main/ifeval/results_2024-03-07T11-09-17.029159.json with huggingface_hub
8da1bfb
verified

lewtun HF staff commited on

Upload eval_results/alignment-handbook/zephyr-2b-gemma-dpo-mix7-beta-0.05-epoch-6/main/hellaswag/results_2024-03-07T11-05-53.241131.json with huggingface_hub
9170ff4
verified

lewtun HF staff commited on

Upload eval_results/alignment-handbook/zephyr-2b-gemma-dpo-mix7-beta-0.05-epoch-6/main/truthfulqa/results_2024-03-07T10-59-32.209037.json with huggingface_hub
00ce4d1
verified

lewtun HF staff commited on

Upload eval_results/alignment-handbook/zephyr-2b-gemma-dpo-mix7-beta-0.05-epoch-6/main/arc/results_2024-03-07T10-59-25.152270.json with huggingface_hub
3e8c776
verified

lewtun HF staff commited on

Upload eval_results/alignment-handbook/zephyr-2b-gemma-dpo-mix7-beta-0.05-epoch-6/main/winogrande/results_2024-03-07T10-59-13.198277.json with huggingface_hub
a95481b
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.45/gsm8k/results_2024-03-07T10-56-00.237466.json with huggingface_hub
f14d281
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.45/mmlu/results_2024-03-07T10-52-18.064538.json with huggingface_hub
e0325f0
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.45/ifeval/results_2024-03-07T10-50-57.964180.json with huggingface_hub
4522f84
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.45/hellaswag/results_2024-03-07T10-46-29.887987.json with huggingface_hub
dbec42c
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.43/gsm8k/results_2024-03-07T10-45-55.818517.json with huggingface_hub
8262d6e
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.43/mmlu/results_2024-03-07T10-43-15.523483.json with huggingface_hub
e1345ac
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.43/ifeval/results_2024-03-07T10-40-50.392013.json with huggingface_hub
b9b977b
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.45/arc/results_2024-03-07T10-39-56.810038.json with huggingface_hub
e7c1acd
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.45/truthfulqa/results_2024-03-07T10-39-46.569217.json with huggingface_hub
7550774
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.45/winogrande/results_2024-03-07T10-39-10.797310.json with huggingface_hub
ce922b8
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.43/hellaswag/results_2024-03-07T10-36-50.436692.json with huggingface_hub
bc38da0
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.44/gsm8k/results_2024-03-07T10-32-46.678253.json with huggingface_hub
876fbfd
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.44/mmlu/results_2024-03-07T10-30-09.277619.json with huggingface_hub
4a1dd32
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.43/arc/results_2024-03-07T10-30-19.682784.json with huggingface_hub
04c0277
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.43/truthfulqa/results_2024-03-07T10-30-12.283704.json with huggingface_hub
1310352
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.43/winogrande/results_2024-03-07T10-29-35.814308.json with huggingface_hub
897c0a5
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.44/ifeval/results_2024-03-07T10-27-58.017058.json with huggingface_hub
2f71675
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.44/hellaswag/results_2024-03-07T10-23-54.476196.json with huggingface_hub
3cff561
verified

edbeeching HF staff commited on

Upload eval_results/alignment-handbook/zephyr-2b-gemma-sft-mix7/main/mmlu/results_2024-03-07T10-16-58.764677.json with huggingface_hub
abf00e0
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.44/arc/results_2024-03-07T10-17-18.448579.json with huggingface_hub
cd2fda2
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.44/truthfulqa/results_2024-03-07T10-17-00.990346.json with huggingface_hub
a82045d
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.44/winogrande/results_2024-03-07T10-16-51.068310.json with huggingface_hub
410f813
verified

edbeeching HF staff commited on

Upload eval_results/alignment-handbook/zephyr-2b-gemma-sft-mix7/main/gsm8k/results_2024-03-07T10-07-08.412008.json with huggingface_hub
c66e7c0
verified

lewtun HF staff commited on

Upload eval_results/alignment-handbook/zephyr-2b-gemma-sft-mix7/main/ifeval/results_2024-03-07T10-04-38.729778.json with huggingface_hub
5e91b53
verified

lewtun HF staff commited on

Upload eval_results/alignment-handbook/zephyr-2b-gemma-sft-mix7/main/hellaswag/results_2024-03-07T10-00-44.951624.json with huggingface_hub
24e1d5a
verified

lewtun HF staff commited on

Upload eval_results/alignment-handbook/zephyr-2b-gemma-sft-mix7/main/truthfulqa/results_2024-03-07T09-55-44.796157.json with huggingface_hub
4c7c5c6
verified

lewtun HF staff commited on

Upload eval_results/alignment-handbook/zephyr-2b-gemma-sft-mix7/main/arc/results_2024-03-07T09-55-37.842476.json with huggingface_hub
59a1b06
verified

lewtun HF staff commited on

Upload eval_results/alignment-handbook/zephyr-2b-gemma-sft-mix7/main/winogrande/results_2024-03-07T09-55-18.274244.json with huggingface_hub
b8ba85d
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.42/gsm8k/results_2024-03-07T09-54-48.555480.json with huggingface_hub
bdba349
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.42/mmlu/results_2024-03-07T09-50-43.602985.json with huggingface_hub
d5d65a6
verified

edbeeching HF staff commited on