Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
HuggingFaceH4/lm-eval-leaderboard
open-r1
/
open-r1-eval-leaderboard
like
30
Running
App
Files
Files
Community
1
Fetching metadata from the HF Docker repository...
8dbf476
open-r1-eval-leaderboard
/
eval_results
3 contributors
History:
388 commits
lewtun
HF staff
Upload eval_results/openchat/openchat-3.5-0106/main/ifeval/results_2024-03-02T19-08-01.536771.json with huggingface_hub
8dbf476
verified
12 months ago
HuggingFaceH4
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-dpo/v0.2.1/mmlu/results_2024-03-02T19-06-24.999981.json with huggingface_hub
12 months ago
NousResearch
Upload eval_results/NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO/main/truthfulqa/results_2024-03-02T18-43-49.033814.json with huggingface_hub
12 months ago
Qwen
Upload eval_results/Qwen/Qwen1.5-72B-Chat/main/hellaswag/results_2024-03-02T16-25-19.857293.json with huggingface_hub
12 months ago
abacaj
Upload eval_results/abacaj/phi-2-super/main/gsm8k/results_2024-03-02T15-50-45.205069.json with huggingface_hub
12 months ago
google
Upload eval_results/google/gemma-7b-it/main/mmlu/results_2024-03-02T15-50-24.914824.json with huggingface_hub
12 months ago
mistralai
Upload eval_results/mistralai/Mixtral-8x7B-Instruct-v0.1/main/truthfulqa/results_2024-03-02T18-36-54.862980.json with huggingface_hub
12 months ago
openchat
Upload eval_results/openchat/openchat-3.5-0106/main/ifeval/results_2024-03-02T19-08-01.536771.json with huggingface_hub
12 months ago
teknium
Upload eval_results/teknium/OpenHermes-2.5-Mistral-7B/main/gsm8k/results_2024-03-02T15-46-34.932666.json with huggingface_hub
12 months ago