Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
HuggingFaceH4/lm-eval-leaderboard
open-r1
/
open-r1-eval-leaderboard
like
26
Running
App
Files
Files
Community
1
Fetching metadata from the HF Docker repository...
0896c95
open-r1-eval-leaderboard
/
eval_results
2 contributors
History:
7519 commits
edbeeching
HF staff
Upload eval_results/AI-MO/CodeLlama-13b-Python-hf-sft/aimo_v01.20/aimo_kaggle_tora_hard/results_2024-06-06T11-27-49.943734.json with huggingface_hub
0896c95
verified
8 months ago
AI-MO
Upload eval_results/AI-MO/CodeLlama-13b-Python-hf-sft/aimo_v01.20/aimo_kaggle_tora_hard/results_2024-06-06T11-27-49.943734.json with huggingface_hub
8 months ago
HuggingFaceH4
Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v2.8/gsm8k/results_2024-05-02T20-41-30.188527.json with huggingface_hub
10 months ago
Nexusflow
Upload eval_results/Nexusflow/Starling-LM-7B-beta/main/ifeval/results_2024-03-28T19-55-03.124753.json with huggingface_hub
11 months ago
NousResearch
Upload eval_results/NousResearch/Hermes-2-Pro-Mistral-7B/main/ifeval/results_2024-03-28T19-53-41.347489.json with huggingface_hub
11 months ago
Qwen
Upload eval_results/Qwen/Qwen1.5-7B-Chat/main/alpaca_eval/results_2024-05-01T07-35-15.json with huggingface_hub
10 months ago
abacaj
Upload eval_results/abacaj/phi-2-super/main/gsm8k/results_2024-03-02T15-50-45.205069.json with huggingface_hub
12 months ago
abhishek
Upload eval_results/abhishek/autotrain-llama3-70b-math-v1/main/gsm8k/results_2024-05-07T10-49-46.881112.json with huggingface_hub
9 months ago
alignment-handbook
Upload eval_results/alignment-handbook/zephyr-2b-gemma-dpo-mix7-beta-0.05-epoch-6/main/mmlu/results_2024-03-07T11-11-18.431956.json with huggingface_hub
11 months ago
alpindale
Upload eval_results/alpindale/WizardLM-2-8x22B/main/ifeval/results_2024-04-16T11-06-30.245588.json with huggingface_hub
10 months ago
alvarobartt
Upload eval_results/alvarobartt/mistral-7b-orpo-capybara-reproduction/main/ifeval/results_2024-03-28T17-38-53.532479.json with huggingface_hub
11 months ago
bigcode
Upload eval_results/bigcode/starcoder2-15b/main/gsm8k/results_2024-03-07T16-00-15.667709.json with huggingface_hub
11 months ago
codellama
Upload eval_results/codellama/CodeLlama-70b-Instruct-hf/main/ifeval/results_2024-03-04T22-20-43.443022.json with huggingface_hub
12 months ago
databricks
Upload eval_results/databricks/dbrx-instruct/main/ifeval/results_2024-04-25T16-10-40.919095.json with huggingface_hub
10 months ago
deepseek-ai
Upload eval_results/deepseek-ai/deepseek-math-7b-rl/main/aimo_kaggle_hard_pot/results_2024-05-28T09-06-30.789083.json with huggingface_hub
9 months ago
edbeeching
Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.2/ifeval/results_2024-04-14T15-56-38.588846.json with huggingface_hub
10 months ago
google
Upload eval_results/google/gemma-7b-it/main/gsm8k/results_2024-03-18T20-47-15.598662.json with huggingface_hub
11 months ago
kaist-ai
Upload eval_results/kaist-ai/mistral-orpo-capybara-7k/main/ifeval/results_2024-03-28T19-24-15.810436.json with huggingface_hub
11 months ago
kashif
Upload eval_results/kashif/ppo_aimo_vllm_python_eval_warmup_1e-6_promising/main/aimo_kaggle_hard_pot/results_2024-05-30T07-46-56.590830.json with huggingface_hub
9 months ago
lewtun
Upload eval_results/lewtun/zephyr-2b-gemma-sft-mix6/main/mmlu/results_2024-03-06T15-57-58.551054.json with huggingface_hub
11 months ago
meta-llama
Upload eval_results/meta-llama/Meta-Llama-3-70B-Instruct/main/ifeval/results_2024-05-25T11-39-50.035526.json with huggingface_hub
9 months ago
mistralai
Upload eval_results/mistralai/Mixtral-8x22B-Instruct-v0.1/main/alpaca_eval/results_2024-05-25T17-34-07.json with huggingface_hub
9 months ago
openbmb
Upload eval_results/openbmb/Eurus-7b-sft/main/mini_math_v2_cot/results_2024-04-30T09-30-28.546746.json with huggingface_hub
10 months ago
openchat
Upload eval_results/openchat/openchat-3.5-0106/main/agieval/results_2024-03-28T16-28-08.688920.json with huggingface_hub
11 months ago
orpo-explorers
Upload eval_results/orpo-explorers/zephyr-orpo-llama3-8b-base-chatml-beta-0.05-lr-5e-5/main/alpaca_eval/results_2024-05-30T18-10-28.json with huggingface_hub
9 months ago
stabilityai
Upload eval_results/stabilityai/stablelm-zephyr-3b/main/mmlu/results_2024-03-05T21-42-00.735420.json with huggingface_hub
12 months ago
teknium
Upload eval_results/teknium/OpenHermes-2.5-Mistral-7B/main/agieval/results_2024-03-28T15-53-33.021821.json with huggingface_hub
11 months ago