Hugging Face
Spaces: open-r1/open-r1-eval-leaderboard (duplicated from HuggingFaceH4/lm-eval-leaderboard)
open-r1-eval-leaderboard/eval_results (revision 05025ca)
2 contributors
History: 1709 commits
Latest commit by edbeeching (HF staff), 11 months ago, verified, 05025ca:
Upload eval_results/HuggingFaceH4/Qwen1.5-1.8B-Chat-dpo/v0.2/mmlu/results_2024-03-22T14-39-41.575312.json with huggingface_hub
Name | Last commit | Updated
HuggingFaceH4 | Upload eval_results/HuggingFaceH4/Qwen1.5-1.8B-Chat-dpo/v0.2/mmlu/results_2024-03-22T14-39-41.575312.json with huggingface_hub | 11 months ago
NousResearch | Upload eval_results/NousResearch/Nous-Hermes-2-Yi-34B/main/bbh/results_2024-03-18T20-43-57.392460.json with huggingface_hub | 11 months ago
Qwen | Upload eval_results/Qwen/Qwen1.5-72B-Chat/main/bbh/results_2024-03-18T20-40-36.554904.json with huggingface_hub | 11 months ago
abacaj | Upload eval_results/abacaj/phi-2-super/main/gsm8k/results_2024-03-02T15-50-45.205069.json with huggingface_hub | 12 months ago
alignment-handbook | Upload eval_results/alignment-handbook/zephyr-2b-gemma-dpo-mix7-beta-0.05-epoch-6/main/mmlu/results_2024-03-07T11-11-18.431956.json with huggingface_hub | 11 months ago
bigcode | Upload eval_results/bigcode/starcoder2-15b/main/gsm8k/results_2024-03-07T16-00-15.667709.json with huggingface_hub | 11 months ago
codellama | Upload eval_results/codellama/CodeLlama-70b-Instruct-hf/main/ifeval/results_2024-03-04T22-20-43.443022.json with huggingface_hub | 12 months ago
deepseek-ai | Upload eval_results/deepseek-ai/deepseek-llm-67b-chat/main/bbh/results_2024-03-18T20-51-01.093533.json with huggingface_hub | 11 months ago
google | Upload eval_results/google/gemma-7b-it/main/gsm8k/results_2024-03-18T20-47-15.598662.json with huggingface_hub | 11 months ago
lewtun | Upload eval_results/lewtun/zephyr-2b-gemma-sft-mix6/main/mmlu/results_2024-03-06T15-57-58.551054.json with huggingface_hub | 11 months ago
meta-llama | Upload eval_results/meta-llama/Llama-2-70b-chat-hf/main/ifeval/results_2024-03-05T02-24-29.062223.json with huggingface_hub | 12 months ago
mistralai | Upload eval_results/mistralai/Mixtral-8x7B-Instruct-v0.1/main/bbh/results_2024-03-18T20-58-12.014656.json with huggingface_hub | 11 months ago
openchat | Upload eval_results/openchat/openchat-3.5-0106/main/gsm8k/results_2024-03-02T19-12-59.806609.json with huggingface_hub | 12 months ago
stabilityai | Upload eval_results/stabilityai/stablelm-zephyr-3b/main/mmlu/results_2024-03-05T21-42-00.735420.json with huggingface_hub | 12 months ago
teknium | Upload eval_results/teknium/OpenHermes-2.5-Mistral-7B/main/bbh/results_2024-03-18T19-49-31.908303.json with huggingface_hub | 11 months ago
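
The upload paths in the listing above all appear to follow the layout `eval_results/<org>/<model>/<revision>/<task>/results_<timestamp>.json`. That layout is inferred from the commit messages shown here, not from any documented schema, so treat the following as a minimal sketch. The names `ResultPath` and `parse_result_path` are hypothetical helpers introduced only for illustration.

```python
import re
from datetime import datetime
from typing import NamedTuple


class ResultPath(NamedTuple):
    """Components of one result file path (hypothetical helper type)."""
    org: str
    model: str
    revision: str
    task: str
    timestamp: datetime


# Assumed layout, inferred from the listing:
# eval_results/<org>/<model>/<revision>/<task>/results_<timestamp>.json
_PATTERN = re.compile(
    r"^eval_results/(?P<org>[^/]+)/(?P<model>[^/]+)/(?P<revision>[^/]+)/"
    r"(?P<task>[^/]+)/results_(?P<ts>[^/]+)\.json$"
)


def parse_result_path(path: str) -> ResultPath:
    """Split a result path into org, model, revision, task, and timestamp."""
    match = _PATTERN.match(path)
    if match is None:
        raise ValueError(f"path does not match the assumed layout: {path!r}")
    # Timestamps in the listing use dashes in the time part,
    # e.g. 2024-03-22T14-39-41.575312
    timestamp = datetime.strptime(match["ts"], "%Y-%m-%dT%H-%M-%S.%f")
    return ResultPath(
        match["org"], match["model"], match["revision"], match["task"], timestamp
    )


if __name__ == "__main__":
    example = (
        "eval_results/HuggingFaceH4/Qwen1.5-1.8B-Chat-dpo/v0.2/mmlu/"
        "results_2024-03-22T14-39-41.575312.json"
    )
    print(parse_result_path(example))
```

Most entries in the listing use `main` as the revision segment; the `HuggingFaceH4` entry uses `v0.2`, which is why the sketch keeps the revision as a free-form string instead of assuming a fixed value.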