open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch/main/mmlu/results_2024-04-30T21-15-49.897827.json with huggingface_hub
ea69df4
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-1epoch/main/mmlu/results_2024-04-30T21-15-49.538453.json with huggingface_hub
0318cc0
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.05-1epoch/main/mmlu/results_2024-04-30T21-15-39.455009.json with huggingface_hub
5bcc50f
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.05-2epoch/main/mmlu/results_2024-04-30T21-15-34.495525.json with huggingface_hub
bd03d79
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.05-1epoch/main/mmlu/results_2024-04-30T21-15-12.791854.json with huggingface_hub
4554fd0
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta0.05-1epoch-ohp-15k-strat-1-beta0.2-2epoch/main/gsm8k/results_2024-04-30T21-13-37.213263.json with huggingface_hub
83211c8
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch-capybara-1epoch/main/gsm8k/results_2024-04-30T21-13-02.811893.json with huggingface_hub
f0a30fc
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-1epoch-capybara-2epoch/main/gsm8k/results_2024-04-30T21-12-43.502655.json with huggingface_hub
e5704c2
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2/main/gsm8k/results_2024-04-30T21-12-23.276217.json with huggingface_hub
331c032
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.1/main/gsm8k/results_2024-04-30T21-12-21.365030.json with huggingface_hub
b6bde75
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.05/main/gsm8k/results_2024-04-30T21-11-50.651160.json with huggingface_hub
0ea2679
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.1/main/gsm8k/results_2024-04-30T21-11-39.576371.json with huggingface_hub
9054754
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.2/main/gsm8k/results_2024-04-30T21-11-36.103729.json with huggingface_hub
525ba9d
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.05/main/gsm8k/results_2024-04-30T21-11-09.365386.json with huggingface_hub
53ac45b
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch/main/gsm8k/results_2024-04-30T21-10-56.507816.json with huggingface_hub
7ce3601
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-1epoch/main/gsm8k/results_2024-04-30T21-10-54.357669.json with huggingface_hub
3c96bc7
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.05-2epoch/main/gsm8k/results_2024-04-30T21-10-39.251644.json with huggingface_hub
9d753b1
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.05-1epoch/main/gsm8k/results_2024-04-30T21-10-26.931701.json with huggingface_hub
af93059
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.05-1epoch/main/gsm8k/results_2024-04-30T21-10-22.767105.json with huggingface_hub
dd6a1ec
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta-0.05-1epoch/main/alpaca_eval/results_2024-04-30T21-05-29.json with huggingface_hub
e1b8bcc
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta/main/alpaca_eval/results_2024-04-30T20-39-34.json with huggingface_hub
198b839
verified

lewtun HF staff commited on

Upload eval_results/mistralai/Mistral-7B-Instruct-v0.2/main/alpaca_eval/results_2024-04-30T20-33-21.json with huggingface_hub
9945698
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-7B-Chat/main/alpaca_eval/results_2024-04-30T20-09-50.json with huggingface_hub
ca6b90b
verified

lewtun HF staff commited on

Delete eval_results/Qwen/Qwen1.5-7B-Chat/main/alpaca_eval
d609a3e
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-7B-Chat/main/alpaca_eval/results_2024-04-30T19-42-21.json with huggingface_hub
80fcbb1
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/alpaca_eval/results_2024-04-30T19-28-46.json with huggingface_hub
f127539
verified

lewtun HF staff commited on

Remove annotations
9e76baa

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/alpaca_eval/results_2024-04-30T19-01-22.json with huggingface_hub
11f42cd
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/alpaca_eval/annotations.json with huggingface_hub
f00767b
verified

lewtun HF staff commited on

Delete eval_results/Qwen/Qwen1.5-0.5B-Chat/main/alpaca_eval
66ff641
verified

lewtun HF staff commited on

Delete eval_results/Qwen/Qwen1.5-7B-Chat/main/alpaca_eval
f888870
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-7B-Chat/main/alpaca_eval/results_2024-04-30T18-07-44.json with huggingface_hub
0c3e915
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-7B-Chat/main/alpaca_eval/results_2024-04-30T17-39-25.json with huggingface_hub
4062e32
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-7B-Chat/main/alpaca_eval/annotations.json with huggingface_hub
3aa4a80
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/alpaca_eval/results_2024-04-30T16-47-29.json with huggingface_hub
0ff2e8c
verified

lewtun HF staff commited on

Upload eval_results/openbmb/Eurus-7b-sft/main/mini_math_v2_cot/results_2024-04-30T09-30-28.546746.json with huggingface_hub
b924051
verified

kashif HF staff commited on

Upload eval_results/openbmb/Eurus-7b-sft/main/aimo_kaggle_cot/results_2024-04-30T09-13-09.617333.json with huggingface_hub
4c95547
verified

kashif HF staff commited on

Upload eval_results/openbmb/Eurus-7b-sft/main/aimo_kaggle_pot/results_2024-04-30T09-01-43.293808.json with huggingface_hub
8120af4
verified

kashif HF staff commited on

Upload eval_results/openbmb/Eurus-7b-sft/main/mini_math_v2_pot/results_2024-04-30T08-53-06.445146.json with huggingface_hub
8daa67f
verified

kashif HF staff commited on

Upload eval_results/openbmb/Eurus-7b-sft/main/mini_math_v2_pot/results_2024-04-30T08-44-25.864782.json with huggingface_hub
3d89280
verified

kashif HF staff commited on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-1-2-3-beta-0.2-epoch-3/main/mmlu/results_2024-04-30T07-31-14.764120.json with huggingface_hub
bc899ba
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-3-beta-0.2-epoch-1/main/mmlu/results_2024-04-30T07-30-47.581731.json with huggingface_hub
aaabe84
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-3-beta-0.2-epoch-1/main/ifeval/results_2024-04-30T07-30-38.066174.json with huggingface_hub
a481d61
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-1-2-3-beta-0.2-epoch-3/main/ifeval/results_2024-04-30T07-29-36.522189.json with huggingface_hub
816ef05
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-1-2-3-beta-0.2-epoch-3/main/gsm8k/results_2024-04-30T07-24-14.540105.json with huggingface_hub
0c73a45
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-3-beta-0.2-epoch-1/main/gsm8k/results_2024-04-30T07-23-58.557875.json with huggingface_hub
68823e1
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-1-2-3-beta-0.2-epoch-3/main/agieval/results_2024-04-30T07-21-47.713758.json with huggingface_hub
335c220
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-3-beta-0.2-epoch-1/main/agieval/results_2024-04-30T07-21-14.889377.json with huggingface_hub
d94d9ca
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-1-2-3-beta-0.2-epoch-3/main/bbh/results_2024-04-30T07-19-53.540998.json with huggingface_hub
e1b4e8d
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-3-beta-0.2-epoch-1/main/bbh/results_2024-04-30T07-19-31.330635.json with huggingface_hub
cab3eaf
verified

lewtun HF staff commited on