open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.107/winogrande/results_2024-03-12T23-31-35.028480.json with huggingface_hub
8baa207
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.113/mmlu/results_2024-03-12T23-29-10.852634.json with huggingface_hub
2490a01
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.97/hellaswag/results_2024-03-12T23-29-06.552939.json with huggingface_hub
a09084b
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.113/ifeval/results_2024-03-12T23-29-49.714820.json with huggingface_hub
ca585cf
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.107/arc/results_2024-03-12T23-29-25.928843.json with huggingface_hub
ca3e03d
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.96/gsm8k/results_2024-03-12T23-26-41.950552.json with huggingface_hub
13d071e
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.97/truthfulqa/results_2024-03-12T23-24-29.889437.json with huggingface_hub
43edac2
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.97/winogrande/results_2024-03-12T23-23-58.930224.json with huggingface_hub
e8df2b5
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.113/hellaswag/results_2024-03-12T23-22-22.685397.json with huggingface_hub
2bbb8d6
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.97/arc/results_2024-03-12T23-22-10.105946.json with huggingface_hub
abfc39b
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.96/mmlu/results_2024-03-12T23-20-54.659666.json with huggingface_hub
cf14229
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.96/ifeval/results_2024-03-12T23-21-48.075495.json with huggingface_hub
a518307
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.112/gsm8k/results_2024-03-12T23-19-54.354947.json with huggingface_hub
8400094
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.113/truthfulqa/results_2024-03-12T23-17-10.672117.json with huggingface_hub
c06916d
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.113/winogrande/results_2024-03-12T23-16-45.192235.json with huggingface_hub
1d874cc
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.112/ifeval/results_2024-03-12T23-16-17.360891.json with huggingface_hub
5d4fcda
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.113/arc/results_2024-03-12T23-15-04.184530.json with huggingface_hub
7e7c3d1
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.112/mmlu/results_2024-03-12T23-13-39.655638.json with huggingface_hub
df21edf
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.96/hellaswag/results_2024-03-12T23-13-40.694921.json with huggingface_hub
e1417b5
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.105/gsm8k/results_2024-03-12T23-13-10.042323.json with huggingface_hub
63c9be3
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.96/winogrande/results_2024-03-12T23-11-48.512097.json with huggingface_hub
f0cf78f
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.96/truthfulqa/results_2024-03-12T23-09-51.269307.json with huggingface_hub
bd241f7
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.105/mmlu/results_2024-03-12T23-08-37.335109.json with huggingface_hub
2cacec3
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.105/ifeval/results_2024-03-12T23-09-44.557791.json with huggingface_hub
852c4d8
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.112/hellaswag/results_2024-03-12T23-06-52.938730.json with huggingface_hub
53d95f4
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.96/arc/results_2024-03-12T23-06-23.351951.json with huggingface_hub
3678f4c
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.112/truthfulqa/results_2024-03-12T23-04-30.643363.json with huggingface_hub
af81a14
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.112/winogrande/results_2024-03-12T23-04-00.981875.json with huggingface_hub
970d01a
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.105/hellaswag/results_2024-03-12T23-01-02.035266.json with huggingface_hub
67175d0
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.111/gsm8k/results_2024-03-12T23-00-11.672274.json with huggingface_hub
3a9a7dc
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.112/arc/results_2024-03-12T22-59-22.646801.json with huggingface_hub
714adc0
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.105/winogrande/results_2024-03-12T22-57-01.276630.json with huggingface_hub
58b81e8
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.105/truthfulqa/results_2024-03-12T22-56-25.716165.json with huggingface_hub
a1f90a8
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.111/ifeval/results_2024-03-12T22-55-59.984337.json with huggingface_hub
c56937a
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.104/gsm8k/results_2024-03-12T22-55-14.505433.json with huggingface_hub
ab1c48d
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.111/mmlu/results_2024-03-12T22-53-51.709171.json with huggingface_hub
e981935
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.105/arc/results_2024-03-12T22-54-13.145928.json with huggingface_hub
049eb43
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.94/gsm8k/results_2024-03-12T22-51-52.085505.json with huggingface_hub
25a3940
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.104/mmlu/results_2024-03-12T22-50-28.486884.json with huggingface_hub
e3f49d1
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.104/ifeval/results_2024-03-12T22-50-37.106178.json with huggingface_hub
459b390
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.111/hellaswag/results_2024-03-12T22-47-34.229342.json with huggingface_hub
3209079
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.94/mmlu/results_2024-03-12T22-46-21.243177.json with huggingface_hub
4600e0e
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.93/gsm8k/results_2024-03-12T22-47-01.628915.json with huggingface_hub
590994b
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.94/ifeval/results_2024-03-12T22-46-12.969498.json with huggingface_hub
5b618f9
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.104/hellaswag/results_2024-03-12T22-44-22.157283.json with huggingface_hub
a1f0d9e
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.93/mmlu/results_2024-03-12T22-43-02.471193.json with huggingface_hub
5da66ec
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.93/ifeval/results_2024-03-12T22-43-32.914063.json with huggingface_hub
ae8f2c1
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.111/winogrande/results_2024-03-12T22-42-25.470250.json with huggingface_hub
5c51008
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.111/truthfulqa/results_2024-03-12T22-42-08.641778.json with huggingface_hub
5306542
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.111/arc/results_2024-03-12T22-41-00.795683.json with huggingface_hub
22dc175
verified

edbeeching HF staff commited on