open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/abhishek/autotrain-mixtral-8x7b-orpo-v2/main/winogrande/results_2024-05-02T06-11-19.440174.json with huggingface_hub
312225a
verified

abhishek commited on

Upload eval_results/AI-MO/internlm-math-20b-sft/aimo_v00.00/math_v2/results_2024-05-02T02-34-55.213377.json with huggingface_hub
38564c3
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/internlm-math-20b-sft/aimo_v00.00/mini_math_v2/results_2024-05-02T00-06-43.336916.json with huggingface_hub
51609fe
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/internlm-math-20b-sft/aimo_v00.00/aimo_kaggle/results_2024-05-01T23-52-48.250595.json with huggingface_hub
c9cae79
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/internlm-math-7b-sft/aimo_v00.01/math_v2/results_2024-05-01T22-28-40.931116.json with huggingface_hub
b47db93
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/internlm-math-7b-sft/aimo_v00.02/math_v2/results_2024-05-01T22-18-48.673223.json with huggingface_hub
d6766b2
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/internlm-math-7b-sft/aimo_v00.02/mini_math_v2/results_2024-05-01T20-56-55.525944.json with huggingface_hub
e4a803f
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/internlm-math-7b-sft/aimo_v00.02/aimo_kaggle/results_2024-05-01T20-48-07.028513.json with huggingface_hub
5fb1d51
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/internlm-math-7b-sft/aimo_v00.01/mini_math_v2/results_2024-05-01T20-42-44.533414.json with huggingface_hub
6725339
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/internlm-math-7b-sft/aimo_v00.01/aimo_kaggle/results_2024-05-01T20-35-30.808328.json with huggingface_hub
b20e556
verified

lewtun HF staff commited on

Upload eval_results/abhishek/autotrain-mixtral-8x7b-orpo-v1/main/ifeval/results_2024-05-01T20-19-35.471494.json with huggingface_hub
fb67266
verified

abhishek commited on

Upload eval_results/abhishek/autotrain-mixtral-8x7b-orpo-v1/main/mmlu/results_2024-05-01T18-50-41.016030.json with huggingface_hub
242f4aa
verified

abhishek commited on

Upload eval_results/abhishek/autotrain-mixtral-8x7b-orpo-v1/main/gsm8k/results_2024-05-01T18-51-23.575547.json with huggingface_hub
d0ffe1b
verified

abhishek commited on

Upload eval_results/AI-MO/internlm-math-7b-sft/aimo_v00.00/mini_math_v2/results_2024-05-01T18-15-44.926897.json with huggingface_hub
835021a
verified

lewtun HF staff commited on

Upload eval_results/abhishek/autotrain-mixtral-8x7b-orpo-v1/main/hellaswag/results_2024-05-01T18-10-18.674527.json with huggingface_hub
46ed9bc
verified

abhishek commited on

Upload eval_results/AI-MO/internlm-math-7b-sft/aimo_v00.00/aimo_kaggle/results_2024-05-01T18-07-17.755219.json with huggingface_hub
3432998
verified

lewtun HF staff commited on

Upload eval_results/abhishek/autotrain-mixtral-8x7b-orpo-v1/main/agieval/results_2024-05-01T17-53-21.084520.json with huggingface_hub
6abc6d6
verified

abhishek commited on

Upload eval_results/abhishek/autotrain-mixtral-8x7b-orpo-v1/main/arc/results_2024-05-01T17-43-45.291244.json with huggingface_hub
7978a51
verified

abhishek commited on

Upload eval_results/abhishek/autotrain-mixtral-8x7b-orpo-v1/main/bbh/results_2024-05-01T17-42-51.018793.json with huggingface_hub
0246ea9
verified

abhishek commited on

Upload eval_results/abhishek/autotrain-mixtral-8x7b-orpo-v1/main/truthfulqa/results_2024-05-01T17-42-45.527465.json with huggingface_hub
fc3d7cc
verified

abhishek commited on

Upload eval_results/abhishek/autotrain-mixtral-8x7b-orpo-v1/main/winogrande/results_2024-05-01T17-41-08.262432.json with huggingface_hub
6be9a3d
verified

abhishek commited on

Upload eval_results/abhishek/autotrain-llama3-oh-sft-v0-2/main/mmlu/results_2024-05-01T16-38-40.409433.json with huggingface_hub
82ab148
verified

abhishek commited on

Upload eval_results/abhishek/autotrain-llama3-oh-sft-v0-2/main/gsm8k/results_2024-05-01T14-50-25.787339.json with huggingface_hub
42158a3
verified

abhishek commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.05-2epoch/main/ifeval/results_2024-05-01T14-19-29.885553.json with huggingface_hub
fe1951c
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.05-2epoch/main/mmlu/results_2024-05-01T14-17-55.248549.json with huggingface_hub
0fa60c9
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1/main/alpaca_eval/results_2024-05-01T14-18-34.json with huggingface_hub
e39a639
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.05-2epoch/main/gsm8k/results_2024-05-01T14-13-35.615524.json with huggingface_hub
c623015
verified

lewtun HF staff commited on

Upload eval_results/abhishek/autotrain-llama3-oh-sft-v0-2/main/truthfulqa/results_2024-05-01T12-28-36.409527.json with huggingface_hub
2ebbd23
verified

abhishek commited on

Upload eval_results/abhishek/autotrain-llama3-oh-sft-v0-2/main/truthfulqa/results_2024-05-01T14-03-51.831774.json with huggingface_hub
ec28199
verified

abhishek commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.05-2epoch/main/alpaca_eval/results_2024-05-01T12-13-44.json with huggingface_hub
3e89a3f
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.05-2epoch/main/agieval/results_2024-05-01T11-58-00.352912.json with huggingface_hub
7eb92ed
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.05-2epoch/main/bbh/results_2024-05-01T11-56-47.533473.json with huggingface_hub
cae8e94
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-3-beta-0.2-epoch-1/main/alpaca_eval/results_2024-05-01T09-37-04.json with huggingface_hub
0061baf
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-1-2-3-beta-0.2-epoch-3/main/alpaca_eval/results_2024-05-01T09-33-20.json with huggingface_hub
cf08ee0
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/argilla-mistral-orpo-OHP-15k-Stratified-3-beta-0.2-epoch-1/main/gsm8k/results_2024-05-01T09-17-19.636318.json with huggingface_hub
3d707d2
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-1epoch/main/alpaca_eval/results_2024-05-01T09-00-50.json with huggingface_hub
1133de8
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Mathcode-2epoch-ohp-15k-strat-1-1epoch/main/alpaca_eval/results_2024-05-01T08-16-25.json with huggingface_hub
3a9a6b2
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Mathcode-1epoch-ohp-15k-strat-1-2epoch/main/alpaca_eval/results_2024-05-01T08-16-16.json with huggingface_hub
add73f4
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch-math-1epoch/main/alpaca_eval/results_2024-05-01T08-16-11.json with huggingface_hub
2449336
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-1epoch-math-2epoch/main/alpaca_eval/results_2024-05-01T08-16-08.json with huggingface_hub
91f5922
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta0.05-2epoch-ohp-15k-strat-1-beta0.2-1epoch/main/alpaca_eval/results_2024-05-01T08-15-55.json with huggingface_hub
f3bb8f2
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2/main/alpaca_eval/results_2024-05-01T08-15-32.json with huggingface_hub
8213847
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch-capybara-1epoch/main/alpaca_eval/results_2024-05-01T08-15-30.json with huggingface_hub
9e5c48a
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch/main/alpaca_eval/results_2024-05-01T08-14-48.json with huggingface_hub
866e203
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.05-1epoch/main/alpaca_eval/results_2024-05-01T08-13-27.json with huggingface_hub
9052f53
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch-math-1epoch/main/alpaca_eval/results_2024-05-01T07-59-16.json with huggingface_hub
19c87ca
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Mathcode-2epoch-ohp-15k-strat-1-1epoch/main/alpaca_eval/results_2024-05-01T07-58-44.json with huggingface_hub
5a3427f
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-2epoch/main/alpaca_eval/results_2024-05-01T07-58-27.json with huggingface_hub
f57f99f
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-OHP-15k-Stratified-1-beta-0.2-1epoch-math-2epoch/main/alpaca_eval/results_2024-05-01T07-58-24.json with huggingface_hub
8f42345
verified

lewtun HF staff commited on

Upload eval_results/orpo-explorers/kaist-mistral-orpo-capybara-beta0.05-2epoch-ohp-15k-strat-1-beta0.2-1epoch/main/alpaca_eval/results_2024-05-01T07-58-22.json with huggingface_hub
eafd9f5
verified

lewtun HF staff commited on