open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.118/winogrande/results_2024-03-13T00-12-31.506928.json with huggingface_hub
fca7720
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.117/arc/results_2024-03-13T00-10-31.401436.json with huggingface_hub
6248701
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.117/truthfulqa/results_2024-03-13T00-10-19.296405.json with huggingface_hub
973ce11
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.116/ifeval/results_2024-03-13T00-10-13.646648.json with huggingface_hub
613dbf8
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.117/winogrande/results_2024-03-13T00-09-51.206742.json with huggingface_hub
8a337b5
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.109/hellaswag/results_2024-03-13T00-08-41.899199.json with huggingface_hub
45d3459
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.116/hellaswag/results_2024-03-13T00-04-55.963560.json with huggingface_hub
30bbb20
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.115/gsm8k/results_2024-03-13T00-04-06.626024.json with huggingface_hub
20b17d7
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.109/winogrande/results_2024-03-13T00-03-52.302738.json with huggingface_hub
9ed8e7b
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.109/truthfulqa/results_2024-03-13T00-02-33.611589.json with huggingface_hub
3fd5e2e
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.109/arc/results_2024-03-13T00-02-06.545971.json with huggingface_hub
21c631a
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.115/mmlu/results_2024-03-12T23-59-24.234040.json with huggingface_hub
bffe451
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.116/truthfulqa/results_2024-03-12T23-59-41.450873.json with huggingface_hub
a41c66b
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.108/gsm8k/results_2024-03-12T23-59-32.299994.json with huggingface_hub
6673a35
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.116/winogrande/results_2024-03-12T23-59-10.194987.json with huggingface_hub
844b914
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.115/ifeval/results_2024-03-12T23-58-10.347065.json with huggingface_hub
199a571
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.116/arc/results_2024-03-12T23-56-40.399599.json with huggingface_hub
2c37e60
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.99/gsm8k/results_2024-03-12T23-55-27.050960.json with huggingface_hub
d08d6b9
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.108/mmlu/results_2024-03-12T23-54-00.912861.json with huggingface_hub
c2d3971
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.114/gsm8k/results_2024-03-12T23-53-30.248107.json with huggingface_hub
80f776c
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.108/ifeval/results_2024-03-12T23-53-21.719905.json with huggingface_hub
9f1c53c
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.99/mmlu/results_2024-03-12T23-51-37.721129.json with huggingface_hub
73798c4
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.115/hellaswag/results_2024-03-12T23-51-25.941401.json with huggingface_hub
ee872cb
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.99/ifeval/results_2024-03-12T23-49-49.394074.json with huggingface_hub
193ee18
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.108/hellaswag/results_2024-03-12T23-48-15.967717.json with huggingface_hub
fd7de0e
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.107/gsm8k/results_2024-03-12T23-48-54.090709.json with huggingface_hub
97111ad
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.114/ifeval/results_2024-03-12T23-48-34.504087.json with huggingface_hub
688ad3c
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.114/mmlu/results_2024-03-12T23-46-51.690142.json with huggingface_hub
41bd497
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.115/truthfulqa/results_2024-03-12T23-47-04.436792.json with huggingface_hub
5458cb1
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.99/hellaswag/results_2024-03-12T23-45-51.511248.json with huggingface_hub
1e45d1e
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.115/winogrande/results_2024-03-12T23-46-23.803706.json with huggingface_hub
92d59ce
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.107/mmlu/results_2024-03-12T23-44-40.675276.json with huggingface_hub
ae83d19
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.107/ifeval/results_2024-03-12T23-44-59.878849.json with huggingface_hub
d2c0a82
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.115/arc/results_2024-03-12T23-44-45.339669.json with huggingface_hub
993491b
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.108/winogrande/results_2024-03-12T23-42-18.690766.json with huggingface_hub
dc8da71
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.108/truthfulqa/results_2024-03-12T23-41-35.678242.json with huggingface_hub
c1e5570
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.108/arc/results_2024-03-12T23-41-26.521251.json with huggingface_hub
35188ca
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.114/hellaswag/results_2024-03-12T23-40-33.600808.json with huggingface_hub
9a0bd36
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.97/gsm8k/results_2024-03-12T23-40-17.994984.json with huggingface_hub
d055818
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.99/winogrande/results_2024-03-12T23-39-16.931324.json with huggingface_hub
beb078f
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.99/arc/results_2024-03-12T23-39-12.549311.json with huggingface_hub
b12d7e7
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.99/truthfulqa/results_2024-03-12T23-39-12.876931.json with huggingface_hub
c822cdd
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.107/hellaswag/results_2024-03-12T23-38-15.779173.json with huggingface_hub
401df3d
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.97/mmlu/results_2024-03-12T23-36-27.044869.json with huggingface_hub
1d03b80
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.114/winogrande/results_2024-03-12T23-36-51.248905.json with huggingface_hub
2124f7b
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.97/ifeval/results_2024-03-12T23-36-33.110078.json with huggingface_hub
d9dc59d
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.114/truthfulqa/results_2024-03-12T23-36-06.912940.json with huggingface_hub
19cb904
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.113/gsm8k/results_2024-03-12T23-34-48.721843.json with huggingface_hub
e9343e3
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.114/arc/results_2024-03-12T23-33-53.866481.json with huggingface_hub
bff3d6b
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.107/truthfulqa/results_2024-03-12T23-32-14.408236.json with huggingface_hub
6298a51
verified

edbeeching HF staff committed on