open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v31.0/eval_bbh.json with huggingface_hub
5ccf3a0
verified

lewtun HF staff commited on

Upload eval_results/teknium/OpenHermes-2.5-Mistral-7B/main/eval_bbh.json with huggingface_hub
3b64438
verified

lewtun HF staff commited on

Upload eval_results/teknium/OpenHermes-2.5-Mistral-7B/main/eval_mmlu.json with huggingface_hub
c131383
verified

lewtun HF staff commited on

Upload eval_results/teknium/OpenHermes-2.5-Mistral-7B/main/eval_gsm8k.json with huggingface_hub
27e557c
verified

lewtun HF staff commited on

Upload eval_results/teknium/OpenHermes-2.5-Mistral-7B/main/eval_truthfulqa.json with huggingface_hub
73e79f8
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v31.0/eval_gsm8k.json with huggingface_hub
3014ec7
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v31.0/eval_mmlu.json with huggingface_hub
345c2ae
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-ift/v31.0/eval_truthfulqa.json with huggingface_hub
c159a5e
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-72B-Chat/main/eval_truthfulqa.json with huggingface_hub
5f0c9a9
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/eval_truthfulqa.json with huggingface_hub
24a786c
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/eval_gsm8k.json with huggingface_hub
c176a43
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/eval_mmlu.json with huggingface_hub
bacab84
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/eval_ifeval.json with huggingface_hub
2c689aa
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/eval_truthfulqa.json with huggingface_hub
4a871bc
verified

lewtun HF staff commited on