open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/AI-MO/NuminaMath-7B-TIR/main/math_tora/results_2024-07-11T07-37-53.638852.json with huggingface_hub
3b88562
verified

lewtun HF Staff commited on

Upload eval_results/AI-MO/NuminaMath-7B-TIR/main/math_tora/results_2024-07-10T09-00-16.233745.json with huggingface_hub
9ff6df6
verified

lewtun HF Staff commited on

Upload eval_results/AI-MO/NuminaMath-7B-TIR/main/math_tora/results_2024-07-10T08-55-01.368723.json with huggingface_hub
968d2e8
verified

lewtun HF Staff commited on

Upload eval_results/AI-MO/NuminaMath-7B-TIR/main/math_tora/results_2024-07-10T08-49-03.776874.json with huggingface_hub
b7653bb
verified

lewtun HF Staff commited on

Upload eval_results/AI-MO/NuminaMath-7B-TIR/main/math_tora/results_2024-07-10T08-43-18.454038.json with huggingface_hub
0a9a0f7
verified

lewtun HF Staff commited on

Upload eval_results/AI-MO/NuminaMath-7B-TIR/main/math_tora/results_2024-07-10T08-39-37.784816.json with huggingface_hub
7c5eaed
verified

lewtun HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.04/aimo_kaggle_medium/results_2024-07-09T12-50-46.437737.json with huggingface_hub
6c0b2b0
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.24/aimo_kaggle_hard/results_2024-07-09T12-10-02.863659.json with huggingface_hub
f532fc5
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.15/aimo_kaggle_medium/results_2024-07-09T10-25-57.552908.json with huggingface_hub
ecced74
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.22/aimo_kaggle_medium/results_2024-07-09T08-49-49.103281.json with huggingface_hub
00a67bb
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.02/aimo_kaggle_hard/results_2024-07-09T08-34-31.701351.json with huggingface_hub
a1fb67a
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.18/aimo_kaggle_hard/results_2024-07-09T08-30-19.841996.json with huggingface_hub
f43fa6d
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.28/aimo_kaggle_hard/results_2024-07-09T08-30-07.799123.json with huggingface_hub
19d238f
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.07/aimo_kaggle_medium/results_2024-07-09T08-28-18.211754.json with huggingface_hub
c492ac2
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.13/aimo_kaggle_hard/results_2024-07-09T08-27-13.166146.json with huggingface_hub
accfa91
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.05/aimo_kaggle_medium/results_2024-07-09T08-24-45.550927.json with huggingface_hub
0af4e8b
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.11/aimo_kaggle_medium/results_2024-07-09T08-22-59.589964.json with huggingface_hub
7560913
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.08/aimo_kaggle_medium/results_2024-07-09T08-21-06.645228.json with huggingface_hub
d95c637
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.14/aimo_kaggle_medium/results_2024-07-09T08-21-00.706612.json with huggingface_hub
ccfc2f0
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.33/aimo_kaggle_medium/results_2024-07-09T08-18-59.298419.json with huggingface_hub
60b452d
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.00/aimo_kaggle_hard/results_2024-07-09T07-50-56.036093.json with huggingface_hub
e7ffbef
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.09/aimo_kaggle_medium/results_2024-07-08T23-16-40.493272.json with huggingface_hub
03c4f53
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.13/aimo_kaggle_medium/results_2024-07-08T22-38-29.146924.json with huggingface_hub
17fed7f
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.03/aimo_kaggle_medium/results_2024-07-08T22-31-51.744896.json with huggingface_hub
71dc81a
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.07/aimo_kaggle_hard/results_2024-07-08T22-23-10.130701.json with huggingface_hub
ae2d3de
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.12/aimo_kaggle_hard/results_2024-07-08T22-08-59.347932.json with huggingface_hub
d762066
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.12/aimo_kaggle_medium/results_2024-07-08T22-06-59.758528.json with huggingface_hub
25d8cb9
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.11/aimo_kaggle_hard/results_2024-07-08T21-43-19.376164.json with huggingface_hub
3c90b22
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.18/aimo_kaggle_medium/results_2024-07-08T20-39-09.863677.json with huggingface_hub
099e7ad
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.33/aimo_kaggle_hard/results_2024-07-08T17-54-21.173202.json with huggingface_hub
17aa200
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.02/aimo_kaggle_medium/results_2024-07-08T17-53-24.262525.json with huggingface_hub
d709941
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.14/aimo_kaggle_hard/results_2024-07-08T17-41-09.106575.json with huggingface_hub
c196a50
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.08/aimo_kaggle_hard/results_2024-07-08T16-51-08.183447.json with huggingface_hub
ee680f2
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.28/aimo_kaggle_medium/results_2024-07-08T16-48-38.834311.json with huggingface_hub
abe6a28
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v01.01/aimo_math_integer_lvl5/results_2024-07-08T16-12-04.680230.json with huggingface_hub
667e05f
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.27/aimo_kaggle_medium/results_2024-07-08T15-36-38.497136.json with huggingface_hub
c56fa45
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v01.01/aimo_math_integer_lvl4/results_2024-07-08T15-06-41.582458.json with huggingface_hub
d91fc33
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v01.00/aimo_math_integer_lvl5/results_2024-07-08T13-20-07.815311.json with huggingface_hub
b6ff441
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.10/aimo_kaggle_medium/results_2024-07-08T13-19-10.863632.json with huggingface_hub
e34de62
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v00.05/aimo_kaggle_hard/results_2024-07-08T12-31-29.435076.json with huggingface_hub
8e6fea3
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v01.00/aimo_math_integer_lvl4/results_2024-07-08T12-18-36.739256.json with huggingface_hub
6c5d7e4
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v01.01/aimo_kaggle_tora_hard_extended/results_2024-07-08T12-07-16.347727.json with huggingface_hub
4fccd9c
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v01.01/aimo_kaggle_tora_hard_extended/results_2024-07-08T11-54-53.411347.json with huggingface_hub
ca95aa6
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v01.01/aimo_kaggle_tora_medium_extended/results_2024-07-08T11-54-32.379641.json with huggingface_hub
2c78825
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v01.01/aimo_kaggle_tora_hard_extended/results_2024-07-08T11-51-04.852053.json with huggingface_hub
74a6219
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v01.00/aimo_kaggle_tora_hard_extended/results_2024-07-08T11-50-29.407275.json with huggingface_hub
f387cbc
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v01.01/aimo_kaggle_tora_hard_extended/results_2024-07-08T11-50-24.597950.json with huggingface_hub
90c626a
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v01.01/aimo_kaggle_tora_hard_extended/results_2024-07-08T11-50-07.646204.json with huggingface_hub
1ef7531
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v01.01/aimo_kaggle_tora_medium_extended/results_2024-07-08T11-37-12.737137.json with huggingface_hub
72faff0
verified

edbeeching HF Staff commited on

Upload eval_results/AI-MO/DeepSeek-Coder-V2-Lite-Base-sft/aimo_v01.01/aimo_kaggle_tora_medium_extended/results_2024-07-08T11-36-36.342556.json with huggingface_hub
0f9f14e
verified

edbeeching HF Staff commited on