Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
HuggingFaceH4/lm-eval-leaderboard
open-r1
/
open-r1-eval-leaderboard
like
26
Running
App
Files
Files
Community
1
Fetching metadata from the HF Docker repository...
5a92797
open-r1-eval-leaderboard
/
eval_results
2 contributors
History:
2447 commits
edbeeching
HF staff
Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/mini_math/results_2024-04-23T12-52-50.054036.json with huggingface_hub
5a92797
verified
10 months ago
AI-MO
Upload eval_results/AI-MO/mistral-7b-sft/aimo_v03.00/mini_math/results_2024-04-23T12-07-13.400858.json with huggingface_hub
10 months ago
HuggingFaceH4
Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v3.2/gsm8k/results_2024-04-02T22-34-30.468109.json with huggingface_hub
11 months ago
Nexusflow
Upload eval_results/Nexusflow/Starling-LM-7B-beta/main/ifeval/results_2024-03-28T19-55-03.124753.json with huggingface_hub
11 months ago
NousResearch
Upload eval_results/NousResearch/Hermes-2-Pro-Mistral-7B/main/ifeval/results_2024-03-28T19-53-41.347489.json with huggingface_hub
11 months ago
Qwen
Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/mini_math/results_2024-04-23T12-52-50.054036.json with huggingface_hub
10 months ago
abacaj
Upload eval_results/abacaj/phi-2-super/main/gsm8k/results_2024-03-02T15-50-45.205069.json with huggingface_hub
12 months ago
alignment-handbook
Upload eval_results/alignment-handbook/zephyr-2b-gemma-dpo-mix7-beta-0.05-epoch-6/main/mmlu/results_2024-03-07T11-11-18.431956.json with huggingface_hub
11 months ago
alpindale
Upload eval_results/alpindale/WizardLM-2-8x22B/main/ifeval/results_2024-04-16T11-06-30.245588.json with huggingface_hub
10 months ago
alvarobartt
Upload eval_results/alvarobartt/mistral-7b-orpo-capybara-reproduction/main/ifeval/results_2024-03-28T17-38-53.532479.json with huggingface_hub
11 months ago
bigcode
Upload eval_results/bigcode/starcoder2-15b/main/gsm8k/results_2024-03-07T16-00-15.667709.json with huggingface_hub
11 months ago
codellama
Upload eval_results/codellama/CodeLlama-70b-Instruct-hf/main/ifeval/results_2024-03-04T22-20-43.443022.json with huggingface_hub
12 months ago
databricks
Upload eval_results/databricks/dbrx-base/main/agieval/results_2024-03-30T21-38-32.137846.json with huggingface_hub
11 months ago
deepseek-ai
Upload eval_results/deepseek-ai/deepseek-llm-67b-chat/main/agieval/results_2024-03-28T17-11-01.242076.json with huggingface_hub
11 months ago
edbeeching
Upload eval_results/edbeeching/mixtral-8x7b-instruct-v0.1_merged/v0.2/ifeval/results_2024-04-14T15-56-38.588846.json with huggingface_hub
10 months ago
google
Upload eval_results/google/gemma-7b-it/main/gsm8k/results_2024-03-18T20-47-15.598662.json with huggingface_hub
11 months ago
kaist-ai
Upload eval_results/kaist-ai/mistral-orpo-capybara-7k/main/ifeval/results_2024-03-28T19-24-15.810436.json with huggingface_hub
11 months ago
lewtun
Upload eval_results/lewtun/zephyr-2b-gemma-sft-mix6/main/mmlu/results_2024-03-06T15-57-58.551054.json with huggingface_hub
11 months ago
meta-llama
Upload eval_results/meta-llama/Meta-Llama-3-8B-Instruct/main/ifeval/results_2024-04-22T11-59-54.794597.json with huggingface_hub
10 months ago
mistralai
Upload eval_results/mistralai/Mixtral-8x22B-Instruct-v0.1/main/ifeval/results_2024-04-17T17-55-40.247263.json with huggingface_hub
10 months ago
openchat
Upload eval_results/openchat/openchat-3.5-0106/main/agieval/results_2024-03-28T16-28-08.688920.json with huggingface_hub
11 months ago
orpo-explorers
Upload eval_results/orpo-explorers/hf-llama3-8b-orpo-v0.0/main/ifeval/results_2024-04-22T10-36-22.008357.json with huggingface_hub
10 months ago
stabilityai
Upload eval_results/stabilityai/stablelm-zephyr-3b/main/mmlu/results_2024-03-05T21-42-00.735420.json with huggingface_hub
12 months ago
teknium
Upload eval_results/teknium/OpenHermes-2.5-Mistral-7B/main/agieval/results_2024-03-28T15-53-33.021821.json with huggingface_hub
11 months ago