open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.36/aimo_kaggle_tora_hard/results_2024-06-02T22-42-52.737095.json with huggingface_hub
e51f3c0
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.awq_baseline/aimo_kaggle_tora_medium/results_2024-06-02T22-42-46.691615.json with huggingface_hub
fe88704
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.awq_baseline/aimo_kaggle_tora_medium/results_2024-06-02T22-42-46.553796.json with huggingface_hub
448cb49
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.34/aimo_kaggle_tora_hard/results_2024-06-02T22-42-46.292941.json with huggingface_hub
357a5f3
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-02T22-42-38.667567.json with huggingface_hub
d74ff06
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-02T22-42-23.684782.json with huggingface_hub
281cb46
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31.awq_baseline/aimo_kaggle_tora_medium/results_2024-06-02T22-40-56.893002.json with huggingface_hub
c9eac96
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v20.31/aimo_kaggle_tora_medium/results_2024-06-02T22-40-28.734946.json with huggingface_hub
38b8bd8
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.31/aimo_kaggle_tora_hard/results_2024-06-02T22-39-51.700178.json with huggingface_hub
7ba9d5e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.32/aimo_kaggle_tora_hard/results_2024-06-02T22-38-48.232654.json with huggingface_hub
c55b113
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.33/aimo_kaggle_tora_hard/results_2024-06-02T22-38-05.467295.json with huggingface_hub
46dd6fa
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.35/aimo_kaggle_tora_hard/results_2024-06-02T22-36-16.371703.json with huggingface_hub
04f7670
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.36/aimo_kaggle_tora_medium/results_2024-06-02T22-36-03.792778.json with huggingface_hub
5ac14b9
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.37/aimo_kaggle_tora_medium/results_2024-06-02T22-35-07.541058.json with huggingface_hub
d0d41cc
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.40/aimo_kaggle_tora_medium/results_2024-06-02T22-33-47.166831.json with huggingface_hub
239622d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.24/aimo_kaggle_tora_hard/results_2024-06-02T22-32-08.051447.json with huggingface_hub
401dc9d
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.29/aimo_kaggle_tora_hard/results_2024-06-02T22-31-50.143789.json with huggingface_hub
c9746b9
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.33/aimo_kaggle_tora_medium/results_2024-06-02T22-30-50.104411.json with huggingface_hub
51f273c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.27/aimo_kaggle_tora_hard/results_2024-06-02T22-30-31.522905.json with huggingface_hub
85cf7f1
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.34/aimo_kaggle_tora_medium/results_2024-06-02T22-29-43.776246.json with huggingface_hub
9975902
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.28/aimo_kaggle_tora_hard/results_2024-06-02T22-29-31.798736.json with huggingface_hub
4f20fe0
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.41/aimo_kaggle_tora_medium/results_2024-06-02T22-29-12.501872.json with huggingface_hub
75156f4
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.30/aimo_kaggle_tora_hard/results_2024-06-02T22-26-57.767202.json with huggingface_hub
f48a6c5
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.31/aimo_kaggle_tora_medium/results_2024-06-02T22-24-43.665257.json with huggingface_hub
3de2f58
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.32/aimo_kaggle_tora_medium/results_2024-06-02T22-24-37.711568.json with huggingface_hub
9481522
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.35/aimo_kaggle_tora_medium/results_2024-06-02T22-24-03.942825.json with huggingface_hub
b5df23a
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/internlm-math-20b-sft/aimo_v03.13/aimo_kaggle_hard/results_2024-06-02T22-23-45.288208.json with huggingface_hub
0236eb4
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.23/aimo_kaggle_tora_hard/results_2024-06-02T22-21-07.158391.json with huggingface_hub
4a41b03
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.25/aimo_kaggle_tora_hard/results_2024-06-02T22-20-56.376083.json with huggingface_hub
bd0230c
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.27/aimo_kaggle_hard/results_2024-06-02T22-20-32.982898.json with huggingface_hub
b74340d
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/internlm-math-20b-sft/aimo_v03.13/aimo_kaggle_medium/results_2024-06-02T22-20-26.336239.json with huggingface_hub
ce34e77
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.26/aimo_kaggle_tora_hard/results_2024-06-02T22-18-10.202613.json with huggingface_hub
8b9db55
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.27/aimo_kaggle_medium/results_2024-06-02T22-17-16.414041.json with huggingface_hub
513948f
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/internlm-math-20b-sft/aimo_v03.00/aimo_kaggle_hard/results_2024-06-02T22-15-10.617497.json with huggingface_hub
1de8ea7
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.30/aimo_kaggle_tora_medium/results_2024-06-02T22-14-48.740358.json with huggingface_hub
e3c1a2f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.22/aimo_kaggle_tora_hard/results_2024-06-02T22-13-18.388402.json with huggingface_hub
b3a07e5
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.28/aimo_kaggle_hard/results_2024-06-02T22-10-21.393682.json with huggingface_hub
e7656df
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.28/aimo_kaggle_tora_medium/results_2024-06-02T22-09-42.415540.json with huggingface_hub
27164c1
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.19/aimo_kaggle_tora_hard/results_2024-06-02T22-09-36.552560.json with huggingface_hub
db949c4
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.29/aimo_kaggle_tora_medium/results_2024-06-02T22-08-30.267746.json with huggingface_hub
a227b32
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.20/aimo_kaggle_tora_hard/results_2024-06-02T22-06-55.051165.json with huggingface_hub
58786ff
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.27/aimo_kaggle_tora_medium/results_2024-06-02T22-06-36.616670.json with huggingface_hub
ec841d7
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.24/aimo_kaggle_tora_medium/results_2024-06-02T22-06-00.946075.json with huggingface_hub
2d81a7e
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.21/aimo_kaggle_tora_hard/results_2024-06-02T22-05-24.619676.json with huggingface_hub
11f4bd6
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.25/aimo_kaggle_tora_medium/results_2024-06-02T22-04-49.229067.json with huggingface_hub
394d5ee
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.18/aimo_kaggle_tora_hard/results_2024-06-02T22-04-33.160784.json with huggingface_hub
1ca63bf
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/deepseek-coder-33b-sft/aimo_v02.28/aimo_kaggle_medium/results_2024-06-02T22-03-51.744370.json with huggingface_hub
5363e61
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.23/aimo_kaggle_tora_medium/results_2024-06-02T22-03-06.937576.json with huggingface_hub
c8fe33f
verified

edbeeching HF staff commited on

Upload eval_results/AI-MO/internlm-math-20b-sft/aimo_v03.00/aimo_kaggle_medium/results_2024-06-02T22-01-38.360337.json with huggingface_hub
8d3201b
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/CodeLlama-7b-Python-hf-sft/aimo_v01.26/aimo_kaggle_tora_medium/results_2024-06-02T22-01-44.170016.json with huggingface_hub
e76a13a
verified

edbeeching HF staff commited on