open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Mathcode-1epoch-ohp-15k-strat-1-2epoch/main/gsm8k/results_2024-04-30T21-18-49.829093.json with huggingface_hub
738b3ce
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-1epoch-capybara-2epoch/main/mmlu/results_2024-04-30T21-17-32.402186.json with huggingface_hub
319132d
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch-math-1epoch/main/gsm8k/results_2024-04-30T21-18-34.590037.json with huggingface_hub
0755df9
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.1/main/mmlu/results_2024-04-30T21-17-25.542636.json with huggingface_hub
6a8dadd
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2/main/mmlu/results_2024-04-30T21-17-21.258934.json with huggingface_hub
9ff1a4c
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.2/main/alpaca_eval/results_2024-04-30T21-18-21.json with huggingface_hub
a9daca7
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.05/main/alpaca_eval/results_2024-04-30T21-18-21.json with huggingface_hub
e5062f4
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.1/main/alpaca_eval/results_2024-04-30T21-18-21.json with huggingface_hub
4c0b16a
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.05/main/mmlu/results_2024-04-30T21-16-56.062870.json with huggingface_hub
489c8bb
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.2/main/mmlu/results_2024-04-30T21-16-51.996873.json with huggingface_hub
036de22
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-1epoch-capybara-2epoch/main/alpaca_eval/results_2024-04-30T21-18-02.json with huggingface_hub
de96e5e
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-1epoch-math-2epoch/main/gsm8k/results_2024-04-30T21-17-49.554870.json with huggingface_hub
430da40
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta0.05-2epoch-ohp-15k-strat-1-beta0.2-1epoch/main/gsm8k/results_2024-04-30T21-17-27.508549.json with huggingface_hub
e2fb931
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.1/main/mmlu/results_2024-04-30T21-16-13.047248.json with huggingface_hub
7402a13
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.05/main/mmlu/results_2024-04-30T21-15-58.624589.json with huggingface_hub
5e440e6
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch/main/mmlu/results_2024-04-30T21-15-49.897827.json with huggingface_hub
ea69df4
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-1epoch/main/mmlu/results_2024-04-30T21-15-49.538453.json with huggingface_hub
0318cc0
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.05-1epoch/main/mmlu/results_2024-04-30T21-15-39.455009.json with huggingface_hub
5bcc50f
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.05-2epoch/main/mmlu/results_2024-04-30T21-15-34.495525.json with huggingface_hub
bd03d79
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.05-1epoch/main/mmlu/results_2024-04-30T21-15-12.791854.json with huggingface_hub
4554fd0
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta0.05-1epoch-ohp-15k-strat-1-beta0.2-2epoch/main/gsm8k/results_2024-04-30T21-13-37.213263.json with huggingface_hub
83211c8
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch-capybara-1epoch/main/gsm8k/results_2024-04-30T21-13-02.811893.json with huggingface_hub
f0a30fc
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-1epoch-capybara-2epoch/main/gsm8k/results_2024-04-30T21-12-43.502655.json with huggingface_hub
e5704c2
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2/main/gsm8k/results_2024-04-30T21-12-23.276217.json with huggingface_hub
331c032
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.1/main/gsm8k/results_2024-04-30T21-12-21.365030.json with huggingface_hub
b6bde75
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.05/main/gsm8k/results_2024-04-30T21-11-50.651160.json with huggingface_hub
0ea2679
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.1/main/gsm8k/results_2024-04-30T21-11-39.576371.json with huggingface_hub
9054754
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.2/main/gsm8k/results_2024-04-30T21-11-36.103729.json with huggingface_hub
525ba9d
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.05/main/gsm8k/results_2024-04-30T21-11-09.365386.json with huggingface_hub
53ac45b
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch/main/gsm8k/results_2024-04-30T21-10-56.507816.json with huggingface_hub
7ce3601
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-1epoch/main/gsm8k/results_2024-04-30T21-10-54.357669.json with huggingface_hub
3c96bc7
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.05-2epoch/main/gsm8k/results_2024-04-30T21-10-39.251644.json with huggingface_hub
9d753b1
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.05-1epoch/main/gsm8k/results_2024-04-30T21-10-26.931701.json with huggingface_hub
af93059
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.05-1epoch/main/gsm8k/results_2024-04-30T21-10-22.767105.json with huggingface_hub
dd6a1ec
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.05-1epoch/main/alpaca_eval/results_2024-04-30T21-05-29.json with huggingface_hub
e1b8bcc
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta/main/alpaca_eval/results_2024-04-30T20-39-34.json with huggingface_hub
198b839
verified

lewtun HF staff commited on

Upload eval_results/mistralai/Mistral-7B-Instruct-v0.2/main/alpaca_eval/results_2024-04-30T20-33-21.json with huggingface_hub
9945698
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-7B-Chat/main/alpaca_eval/results_2024-04-30T20-09-50.json with huggingface_hub
ca6b90b
verified

lewtun HF staff commited on

Delete eval_results/Qwen/Qwen1.5-7B-Chat/main/alpaca_eval
d609a3e
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-7B-Chat/main/alpaca_eval/results_2024-04-30T19-42-21.json with huggingface_hub
80fcbb1
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/alpaca_eval/results_2024-04-30T19-28-46.json with huggingface_hub
f127539
verified

lewtun HF staff commited on

Remove annotations
9e76baa

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/alpaca_eval/results_2024-04-30T19-01-22.json with huggingface_hub
11f42cd
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/alpaca_eval/annotations.json with huggingface_hub
f00767b
verified

lewtun HF staff commited on

Delete eval_results/Qwen/Qwen1.5-0.5B-Chat/main/alpaca_eval
66ff641
verified

lewtun HF staff commited on

Delete eval_results/Qwen/Qwen1.5-7B-Chat/main/alpaca_eval
f888870
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-7B-Chat/main/alpaca_eval/results_2024-04-30T18-07-44.json with huggingface_hub
0c3e915
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-7B-Chat/main/alpaca_eval/results_2024-04-30T17-39-25.json with huggingface_hub
4062e32
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-7B-Chat/main/alpaca_eval/annotations.json with huggingface_hub
3aa4a80
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/alpaca_eval/results_2024-04-30T16-47-29.json with huggingface_hub
0ff2e8c
verified

lewtun HF staff commited on