open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.05/aimo_kaggle_medium_pot/results_2024-06-08T01-43-45.330171.json with huggingface_hub
0db24d7
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.06/aimo_kaggle_medium_pot/results_2024-06-08T01-41-32.531482.json with huggingface_hub
0781b9e
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.16/aimo_kaggle_hard/results_2024-06-08T01-37-37.235102.json with huggingface_hub
7134bd7
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v31.16/aimo_kaggle_medium/results_2024-06-08T01-36-17.423236.json with huggingface_hub
ee00ff5
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-34b-Python-hf-sft/aimo_v00.10/aimo_kaggle_hard/results_2024-06-08T01-27-57.428004.json with huggingface_hub
d89a9a3
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-34b-Python-hf-sft/aimo_v00.10/aimo_kaggle_medium/results_2024-06-08T01-25-49.978327.json with huggingface_hub
a65a88a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.04/aimo_kaggle_hard_pot/results_2024-06-08T01-20-06.290515.json with huggingface_hub
9b643c5
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.04/aimo_kaggle_medium_pot/results_2024-06-08T01-03-04.641549.json with huggingface_hub
86c3eba
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/CodeLlama-34b-Python-hf-sft/aimo_v00.14/aimo_kaggle_hard/results_2024-06-08T00-49-42.456750.json with huggingface_hub
61f3695
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-34b-Python-hf-sft/aimo_v00.14/aimo_kaggle_medium/results_2024-06-08T00-44-45.087140.json with huggingface_hub
6c1a4d3
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-34b-Python-hf-sft/aimo_v00.25/aimo_kaggle_hard/results_2024-06-07T23-33-42.515609.json with huggingface_hub
3ea7920
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.03/aimo_kaggle_hard_pot/results_2024-06-07T21-54-53.301991.json with huggingface_hub
444e7c3
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.02/aimo_kaggle_hard_pot/results_2024-06-07T21-54-24.913249.json with huggingface_hub
5f2ae9c
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.03/aimo_kaggle_medium_pot/results_2024-06-07T21-33-02.425825.json with huggingface_hub
ac91cec
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.02/aimo_kaggle_medium_pot/results_2024-06-07T21-23-45.411316.json with huggingface_hub
f203e2c
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.00/aimo_kaggle_hard_pot/results_2024-06-07T21-20-21.565764.json with huggingface_hub
1c9c32e
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.01/aimo_kaggle_hard_pot/results_2024-06-07T21-06-56.319479.json with huggingface_hub
5d9596e
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.00/aimo_kaggle_medium_pot/results_2024-06-07T20-57-07.159782.json with huggingface_hub
9aa4dbe
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-50-01.635421.json with huggingface_hub
f510068
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/mixtral-47b-sft/aimo_v03.01/aimo_kaggle_medium_pot/results_2024-06-07T20-49-53.514860.json with huggingface_hub
ea152c1
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-49-56.825025.json with huggingface_hub
43a1fd4
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-49-47.472007.json with huggingface_hub
9c8c437
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-49-41.459261.json with huggingface_hub
b642f06
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-49-42.074976.json with huggingface_hub
35df9ee
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-49-41.776663.json with huggingface_hub
04152c8
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-49-39.399786.json with huggingface_hub
2d06726
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-49-32.135273.json with huggingface_hub
17476fd
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-49-32.033648.json with huggingface_hub
2c29865
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-49-24.834826.json with huggingface_hub
95603b3
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-49-08.086466.json with huggingface_hub
697f07d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-49-05.671433.json with huggingface_hub
cf86542
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-49-05.462917.json with huggingface_hub
e688f9b
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-49-04.772194.json with huggingface_hub
7607d2f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-49-04.650538.json with huggingface_hub
515b294
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-49-02.147244.json with huggingface_hub
ca2b9f8
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-49-01.766608.json with huggingface_hub
acf6a26
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-48-20.010104.json with huggingface_hub
8fb4db0
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-47-49.128809.json with huggingface_hub
0763702
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-47-17.164651.json with huggingface_hub
9b118e5
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-34b-Python-hf-sft/aimo_v00.18/aimo_kaggle_hard/results_2024-06-07T20-44-06.583459.json with huggingface_hub
62b73b6
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-34b-Python-hf-sft/aimo_v00.18/aimo_kaggle_medium/results_2024-06-07T20-38-12.961733.json with huggingface_hub
1ea070b
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.awq_tora_calib_512_4096/aimo_kaggle_tora_medium/results_2024-06-07T20-01-21.637282.json with huggingface_hub
8d216f3
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-07T20-00-50.606026.json with huggingface_hub
cada001
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-07T20-00-50.460297.json with huggingface_hub
947611a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-07T20-00-45.035670.json with huggingface_hub
1c91595
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-07T20-00-37.307520.json with huggingface_hub
3e4a69c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-07T20-00-34.969894.json with huggingface_hub
91f7f7a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-07T20-00-30.792767.json with huggingface_hub
2c0333e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.awq_tora_calib_512_4096/aimo_kaggle_tora_medium/results_2024-06-07T20-00-30.118345.json with huggingface_hub
ad0bd91
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.gptq-calib-tora-chosen-v0.1-256-8bits/aimo_kaggle_tora_medium/results_2024-06-07T20-00-29.360253.json with huggingface_hub
63e1847
verified

edbeeching HF staff commited on