Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
HuggingFaceH4/lm-eval-leaderboard
open-r1
/
open-r1-eval-leaderboard
like
28
Running
App
Files
Files
Community
1
Fetching metadata from the HF Docker repository...
80d2acf
open-r1-eval-leaderboard
/
eval_results
3 contributors
History:
2102 commits
lewtun
HF staff
Upload eval_results/alvarobartt/mistral-7b-orpo-airoboros-pref-10k/main/agieval/results_2024-03-28T16-53-48.794914.json with huggingface_hub
80d2acf
verified
11 months ago
HuggingFaceH4
Upload eval_results/HuggingFaceH4/starcoder2-15b-dpo/v4.1/agieval/results_2024-03-28T16-48-35.789845.json with huggingface_hub
11 months ago
NousResearch
Upload eval_results/NousResearch/Nous-Hermes-2-Yi-34B/main/agieval/results_2024-03-28T16-50-16.504147.json with huggingface_hub
11 months ago
Qwen
Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/agieval/results_2024-03-28T16-38-49.297471.json with huggingface_hub
11 months ago
abacaj
Upload eval_results/abacaj/phi-2-super/main/gsm8k/results_2024-03-02T15-50-45.205069.json with huggingface_hub
12 months ago
alignment-handbook
Upload eval_results/alignment-handbook/zephyr-2b-gemma-dpo-mix7-beta-0.05-epoch-6/main/mmlu/results_2024-03-07T11-11-18.431956.json with huggingface_hub
12 months ago
alvarobartt
Upload eval_results/alvarobartt/mistral-7b-orpo-airoboros-pref-10k/main/agieval/results_2024-03-28T16-53-48.794914.json with huggingface_hub
11 months ago
bigcode
Upload eval_results/bigcode/starcoder2-15b/main/gsm8k/results_2024-03-07T16-00-15.667709.json with huggingface_hub
12 months ago
codellama
Upload eval_results/codellama/CodeLlama-70b-Instruct-hf/main/ifeval/results_2024-03-04T22-20-43.443022.json with huggingface_hub
12 months ago
databricks
Upload eval_results/databricks/dbrx-instruct/main/ifeval/results_2024-03-27T21-57-19.694730.json with huggingface_hub
11 months ago
deepseek-ai
Upload eval_results/deepseek-ai/deepseek-llm-67b-chat/main/bbh/results_2024-03-28T16-35-56.180836.json with huggingface_hub
11 months ago
google
Upload eval_results/google/gemma-7b-it/main/gsm8k/results_2024-03-18T20-47-15.598662.json with huggingface_hub
11 months ago
lewtun
Upload eval_results/lewtun/zephyr-2b-gemma-sft-mix6/main/mmlu/results_2024-03-06T15-57-58.551054.json with huggingface_hub
12 months ago
meta-llama
Upload eval_results/meta-llama/Llama-2-7b-chat-hf/main/agieval/results_2024-03-28T16-52-55.106765.json with huggingface_hub
11 months ago
mistralai
Upload eval_results/mistralai/Mistral-7B-Instruct-v0.2/main/agieval/results_2024-03-28T16-44-41.848289.json with huggingface_hub
11 months ago
openchat
Upload eval_results/openchat/openchat-3.5-0106/main/agieval/results_2024-03-28T16-28-08.688920.json with huggingface_hub
11 months ago
stabilityai
Upload eval_results/stabilityai/stablelm-zephyr-3b/main/mmlu/results_2024-03-05T21-42-00.735420.json with huggingface_hub
12 months ago
teknium
Upload eval_results/teknium/OpenHermes-2.5-Mistral-7B/main/agieval/results_2024-03-28T15-53-33.021821.json with huggingface_hub
11 months ago