open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.3/mmlu/results_2024-03-14T12-37-10.426455.json with huggingface_hub
3bf1be5
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.2/mmlu/results_2024-03-14T12-37-03.903529.json with huggingface_hub
ad5695c
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.1/mmlu/results_2024-03-14T12-36-53.798690.json with huggingface_hub
f99fd3d
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.0/mmlu/results_2024-03-14T12-36-38.178613.json with huggingface_hub
437d3fa
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.3/gsm8k/results_2024-03-14T12-37-21.372718.json with huggingface_hub
d487bc1
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.2/gsm8k/results_2024-03-14T12-36-53.394926.json with huggingface_hub
52947ec
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.1/gsm8k/results_2024-03-14T12-36-34.373733.json with huggingface_hub
83ca56e
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.0/gsm8k/results_2024-03-14T12-36-02.642572.json with huggingface_hub
6195b1a
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.1/ifeval/results_2024-03-14T12-35-23.992867.json with huggingface_hub
762554e
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.2/ifeval/results_2024-03-14T12-35-00.673167.json with huggingface_hub
75550f7
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.3/ifeval/results_2024-03-14T12-34-41.592801.json with huggingface_hub
0bb5e21
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.0/ifeval/results_2024-03-14T12-34-18.153949.json with huggingface_hub
44af62b
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.3/hellaswag/results_2024-03-14T12-32-18.464907.json with huggingface_hub
a090614
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.2/hellaswag/results_2024-03-14T12-32-01.877387.json with huggingface_hub
227c0e3
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.1/hellaswag/results_2024-03-14T12-31-46.174056.json with huggingface_hub
d54b1f0
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.0/hellaswag/results_2024-03-14T12-31-30.979512.json with huggingface_hub
b4c2627
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.3/truthfulqa/results_2024-03-14T12-27-18.034006.json with huggingface_hub
65aa28a
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.3/arc/results_2024-03-14T12-27-12.858154.json with huggingface_hub
bbd7dca
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.2/arc/results_2024-03-14T12-27-07.029748.json with huggingface_hub
0d9d2f3
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.1/arc/results_2024-03-14T12-27-01.804181.json with huggingface_hub
de47d5d
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.1/truthfulqa/results_2024-03-14T12-26-52.673017.json with huggingface_hub
2497c3c
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.3/winogrande/results_2024-03-14T12-26-52.201850.json with huggingface_hub
ea318d8
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.0/arc/results_2024-03-14T12-26-40.564284.json with huggingface_hub
17f3829
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.2/winogrande/results_2024-03-14T12-26-42.332976.json with huggingface_hub
2a492c7
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.0/truthfulqa/results_2024-03-14T12-26-40.687156.json with huggingface_hub
d786778
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.1/winogrande/results_2024-03-14T12-26-29.765581.json with huggingface_hub
5438db5
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.0/winogrande/results_2024-03-14T12-26-10.213615.json with huggingface_hub
d6afff9
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.5/gsm8k/results_2024-03-14T01-21-36.364019.json with huggingface_hub
39856cd
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.5/ifeval/results_2024-03-14T01-13-35.332888.json with huggingface_hub
20befac
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v0.1/gsm8k/results_2024-03-13T22-43-28.269257.json with huggingface_hub
cdbdf3d
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v0.1/ifeval/results_2024-03-13T22-36-50.799870.json with huggingface_hub
3bc5814
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta-dpo/v0.2.4/gsm8k/results_2024-03-13T22-28-19.904141.json with huggingface_hub
7aa3e77
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta-dpo/v0.2.4/ifeval/results_2024-03-13T22-20-56.658219.json with huggingface_hub
8acb1a4
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta-dpo/v0.2.3/gsm8k/results_2024-03-13T21-06-15.157181.json with huggingface_hub
ce11a4b
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta-dpo/v0.2.3/ifeval/results_2024-03-13T20-57-47.478024.json with huggingface_hub
9fdd3fa
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.4/gsm8k/results_2024-03-13T20-34-36.123511.json with huggingface_hub
de82110
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-sft/v0.2/ifeval/results_2024-03-13T20-29-43.433964.json with huggingface_hub
4177656
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.3/gsm8k/results_2024-03-13T20-28-46.423506.json with huggingface_hub
ab45db4
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.4/ifeval/results_2024-03-13T20-27-54.934967.json with huggingface_hub
dfd9a4b
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.3/ifeval/results_2024-03-13T20-23-24.862475.json with huggingface_hub
64a33a1
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-sft/v0.1/ifeval/results_2024-03-13T20-17-22.544457.json with huggingface_hub
7ef5ee4
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v0.0/ifeval/results_2024-03-13T19-29-27.337757.json with huggingface_hub
7c3e359
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-sft/v0.0/ifeval/results_2024-03-13T18-57-58.086756.json with huggingface_hub
5a9151b
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.2/gsm8k/results_2024-03-13T17-44-33.997746.json with huggingface_hub
1686724
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.2/ifeval/results_2024-03-13T17-38-04.664431.json with huggingface_hub
24f6998
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.1/gsm8k/results_2024-03-13T16-35-12.278537.json with huggingface_hub
6cd996d
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.1/ifeval/results_2024-03-13T16-29-54.450273.json with huggingface_hub
98804fd
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/qwen-1.5-0.5b-ift/v1.0/ifeval/results_2024-03-13T16-26-49.501942.json with huggingface_hub
c8fc538
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.106/gsm8k/results_2024-03-13T10-13-31.201691.json with huggingface_hub
95a1a1b
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.106/mmlu/results_2024-03-13T10-06-14.269649.json with huggingface_hub
a403a0c
verified

edbeeching HF staff commited on