open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/data/deepseek-math-7b-kto-v00.00/checkpoint-1100/main/aimo_kaggle_hard_pot/results_2024-05-29T11-53-40.101219.json with huggingface_hub
d61134f
verified

kashif HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.08/aimo_kaggle_medium_pot/results_2024-05-29T11-52-05.215208.json with huggingface_hub
735df6a
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.15/aimo_kaggle_medium_pot/results_2024-05-29T11-51-29.220941.json with huggingface_hub
341f86c
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.11/aimo_kaggle_hard_pot/results_2024-05-29T11-50-42.929974.json with huggingface_hub
219626b
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.14/aimo_kaggle_medium_pot/results_2024-05-29T11-50-28.874157.json with huggingface_hub
cef3312
verified

lewtun HF staff commited on

Upload eval_results/data/deepseek-math-7b-kto-v00.00/checkpoint-1100/main/aimo_kaggle_medium_pot/results_2024-05-29T11-48-26.048131.json with huggingface_hub
219dd12
verified

kashif HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.11/aimo_kaggle_medium_pot/results_2024-05-29T11-47-50.222565.json with huggingface_hub
fa73fe3
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.13/aimo_kaggle_hard_pot/results_2024-05-29T11-38-48.360863.json with huggingface_hub
7283d38
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.06/aimo_kaggle_hard_pot/results_2024-05-29T11-38-30.995797.json with huggingface_hub
070ba2d
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.03/aimo_kaggle_hard_pot/results_2024-05-29T11-38-02.069309.json with huggingface_hub
d0a180b
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.13/aimo_kaggle_medium_pot/results_2024-05-29T11-35-58.678490.json with huggingface_hub
86a29ea
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.12/aimo_kaggle_hard_pot/results_2024-05-29T11-33-56.880741.json with huggingface_hub
9a55778
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.06/aimo_kaggle_medium_pot/results_2024-05-29T11-33-30.213628.json with huggingface_hub
e3299bb
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.03/aimo_kaggle_medium_pot/results_2024-05-29T11-31-43.192841.json with huggingface_hub
5d576a8
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.12/aimo_kaggle_medium_pot/results_2024-05-29T11-29-47.875044.json with huggingface_hub
21e6c92
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.02/aimo_kaggle_medium_pot/results_2024-05-29T11-28-29.335398.json with huggingface_hub
55405be
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.01/aimo_kaggle_hard_pot/results_2024-05-29T11-23-52.961253.json with huggingface_hub
c6ce381
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.00/aimo_kaggle_hard_pot/results_2024-05-29T11-21-51.579924.json with huggingface_hub
5640d9e
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.01/aimo_kaggle_medium_pot/results_2024-05-29T11-21-12.893665.json with huggingface_hub
4bde84c
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.00/aimo_kaggle_medium_pot/results_2024-05-29T11-16-16.296513.json with huggingface_hub
bb630d4
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.04/aimo_kaggle_hard_pot/results_2024-05-29T11-07-43.719040.json with huggingface_hub
04da1f4
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v03.04/aimo_kaggle_medium_pot/results_2024-05-29T11-05-09.182422.json with huggingface_hub
4ad43eb
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/ppo_aimo_vllm_extract_answer_warmup_1e-6_promising/main/aimo_kaggle_hard_pot/results_2024-05-28T18-00-39.130142.json with huggingface_hub
988412f
verified

vwxyzjn commited on

Upload eval_results/AI-MO/ppo_aimo_vllm_extract_answer_warmup_1e-6_promising/main/aimo_kaggle_medium_pot/results_2024-05-28T17-57-19.923847.json with huggingface_hub
706b768
verified

vwxyzjn commited on

Upload eval_results/AI-MO/tora-code-34b-v1.0/main/aimo_kaggle_hard_pot/results_2024-05-28T13-50-54.020181.json with huggingface_hub
3f30f85
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/tora-code-34b-v1.0/main/aimo_kaggle_medium_pot/results_2024-05-28T13-35-50.440403.json with huggingface_hub
9fc03e5
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v18.07/aimo_kaggle_hard_pot/results_2024-05-28T13-02-50.288282.json with huggingface_hub
0a02732
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v18.06/aimo_kaggle_hard_pot/results_2024-05-28T13-01-09.601125.json with huggingface_hub
78d0a90
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v18.07/aimo_kaggle_medium_pot/results_2024-05-28T12-58-31.352244.json with huggingface_hub
b7e865f
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v18.05/aimo_kaggle_hard_pot/results_2024-05-28T12-57-46.618208.json with huggingface_hub
a0fd79e
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v18.05/aimo_kaggle_medium_pot/results_2024-05-28T12-57-06.873167.json with huggingface_hub
78c29f6
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v18.04/aimo_kaggle_hard_pot/results_2024-05-28T12-56-58.057675.json with huggingface_hub
2dfd7ea
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v18.06/aimo_kaggle_medium_pot/results_2024-05-28T12-56-46.573145.json with huggingface_hub
eed7499
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v18.03/aimo_kaggle_hard_pot/results_2024-05-28T12-56-21.516367.json with huggingface_hub
89be26e
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v18.00/aimo_kaggle_hard_pot/results_2024-05-28T12-55-36.093434.json with huggingface_hub
59f344f
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v18.02/aimo_kaggle_hard_pot/results_2024-05-28T12-55-01.397305.json with huggingface_hub
70593ef
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v18.04/aimo_kaggle_medium_pot/results_2024-05-28T12-51-40.862457.json with huggingface_hub
72c1268
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v18.01/aimo_kaggle_medium_pot/results_2024-05-28T12-50-58.201617.json with huggingface_hub
ceed061
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v18.03/aimo_kaggle_medium_pot/results_2024-05-28T12-50-44.346114.json with huggingface_hub
4e7eccc
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v18.02/aimo_kaggle_medium_pot/results_2024-05-28T12-50-39.846365.json with huggingface_hub
ff4ce63
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-sft/aimo_v18.00/aimo_kaggle_medium_pot/results_2024-05-28T12-50-35.721838.json with huggingface_hub
339cb8b
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v02.31/aimo_kaggle_hard_pot/results_2024-05-28T12-08-56.600318.json with huggingface_hub
9221f96
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v02.30/aimo_kaggle_hard_pot/results_2024-05-28T12-08-21.827835.json with huggingface_hub
2d7afd6
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v02.29/aimo_kaggle_hard_pot/results_2024-05-28T12-07-58.800926.json with huggingface_hub
b53c727
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v02.28/aimo_kaggle_hard_pot/results_2024-05-28T12-06-07.332698.json with huggingface_hub
03990ac
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v02.31/aimo_kaggle_medium_pot/results_2024-05-28T12-05-51.343604.json with huggingface_hub
da18187
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v02.30/aimo_kaggle_medium_pot/results_2024-05-28T12-03-52.793974.json with huggingface_hub
ea2f336
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v02.29/aimo_kaggle_medium_pot/results_2024-05-28T12-03-20.120048.json with huggingface_hub
7ba9ef8
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v02.28/aimo_kaggle_medium_pot/results_2024-05-28T12-02-20.235857.json with huggingface_hub
b4813c0
verified

lewtun HF staff commited on

Upload eval_results/AI-MO/deepseek-math-7b-rl-sft/aimo_v02.27/aimo_kaggle_hard_pot/results_2024-05-28T12-02-19.613725.json with huggingface_hub
b79f255
verified

lewtun HF staff commited on