Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
HuggingFaceH4/lm-eval-leaderboard
open-r1
/
open-r1-eval-leaderboard
like
33
Running
App
Files
Files
Community
1
Fetching metadata from the HF Docker repository...
e5da1ed
open-r1-eval-leaderboard
/
eval_results
/
Qwen
/
Qwen1.5-1.8B-Chat
/
main
3 contributors
History:
8 commits
lewtun
HF staff
Upload eval_results/Qwen/Qwen1.5-1.8B-Chat/main/bbh/results_2024-03-18T20-11-24.511185.json with huggingface_hub
754f9ed
verified
11 months ago
arc
Upload eval_results/Qwen/Qwen1.5-1.8B-Chat/main/arc/results_2024-03-02T15-25-07.149843.json with huggingface_hub
12 months ago
bbh
Upload eval_results/Qwen/Qwen1.5-1.8B-Chat/main/bbh/results_2024-03-18T20-11-24.511185.json with huggingface_hub
11 months ago
gsm8k
Upload eval_results/Qwen/Qwen1.5-1.8B-Chat/main/gsm8k/results_2024-03-02T15-34-38.810277.json with huggingface_hub
12 months ago
hellaswag
Upload eval_results/Qwen/Qwen1.5-1.8B-Chat/main/hellaswag/results_2024-03-02T15-30-02.600431.json with huggingface_hub
12 months ago
ifeval
Upload eval_results/Qwen/Qwen1.5-1.8B-Chat/main/ifeval/results_2024-03-02T15-33-11.860503.json with huggingface_hub
12 months ago
mmlu
Upload eval_results/Qwen/Qwen1.5-1.8B-Chat/main/mmlu/results_2024-03-02T15-35-07.778605.json with huggingface_hub
12 months ago
truthfulqa
Upload eval_results/Qwen/Qwen1.5-1.8B-Chat/main/truthfulqa/results_2024-03-02T15-25-06.033335.json with huggingface_hub
12 months ago
winogrande
Upload eval_results/Qwen/Qwen1.5-1.8B-Chat/main/winogrande/results_2024-03-02T15-24-35.629848.json with huggingface_hub
12 months ago