Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
HuggingFaceH4/lm-eval-leaderboard
open-r1
/
open-r1-eval-leaderboard
like
26
Running
App
Files
Files
Community
1
Fetching metadata from the HF Docker repository...
abc6af8
open-r1-eval-leaderboard
/
eval_results
2 contributors
History:
588 commits
lewtun
HF staff
Upload eval_results/alignment-handbook/zephyr-2b-gemma-sft-hermes-epoch-1-block-4096/main/mmlu/results_2024-03-05T18-14-37.723820.json with huggingface_hub
abc6af8
verified
12 months ago
HuggingFaceH4
Upload eval_results/HuggingFaceH4/mistral-7b-ift/v48.21/ifeval/results_2024-03-05T16-09-47.181822.json with huggingface_hub
12 months ago
NousResearch
Upload eval_results/NousResearch/Nous-Hermes-2-Yi-34B/main/ifeval/results_2024-03-05T00-32-10.317065.json with huggingface_hub
12 months ago
Qwen
Upload eval_results/Qwen/Qwen1.5-72B-Chat/main/mmlu/results_2024-03-02T21-44-42.749463.json with huggingface_hub
12 months ago
abacaj
Upload eval_results/abacaj/phi-2-super/main/gsm8k/results_2024-03-02T15-50-45.205069.json with huggingface_hub
12 months ago
alignment-handbook
Upload eval_results/alignment-handbook/zephyr-2b-gemma-sft-hermes-epoch-1-block-4096/main/mmlu/results_2024-03-05T18-14-37.723820.json with huggingface_hub
12 months ago
codellama
Upload eval_results/codellama/CodeLlama-70b-Instruct-hf/main/ifeval/results_2024-03-04T22-20-43.443022.json with huggingface_hub
12 months ago
deepseek-ai
Upload eval_results/deepseek-ai/deepseek-llm-67b-chat/main/ifeval/results_2024-03-05T03-30-18.059805.json with huggingface_hub
12 months ago
google
Upload eval_results/google/gemma-7b-it/main/mmlu/results_2024-03-02T15-50-24.914824.json with huggingface_hub
12 months ago
meta-llama
Upload eval_results/meta-llama/Llama-2-70b-chat-hf/main/ifeval/results_2024-03-05T02-24-29.062223.json with huggingface_hub
12 months ago
mistralai
Upload eval_results/mistralai/Mixtral-8x7B-Instruct-v0.1/main/gsm8k/results_2024-03-02T19-44-12.500885.json with huggingface_hub
12 months ago
openchat
Upload eval_results/openchat/openchat-3.5-0106/main/gsm8k/results_2024-03-02T19-12-59.806609.json with huggingface_hub
12 months ago
teknium
Upload eval_results/teknium/OpenHermes-2.5-Mistral-7B/main/gsm8k/results_2024-03-02T15-46-34.932666.json with huggingface_hub
12 months ago