Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
HuggingFaceH4/lm-eval-leaderboard
open-r1
/
open-r1-eval-leaderboard
like
30
Running
App
Files
Files
Community
1
Fetching metadata from the HF Docker repository...
2f08769
open-r1-eval-leaderboard
/
eval_results
/
Qwen
/
Qwen1.5-4B-Chat
/
main
3 contributors
History:
10 commits
lewtun
HF staff
Upload eval_results/Qwen/Qwen1.5-4B-Chat/main/agieval/results_2024-03-28T16-36-45.773188.json with huggingface_hub
7fc097e
verified
11 months ago
agieval
Upload eval_results/Qwen/Qwen1.5-4B-Chat/main/agieval/results_2024-03-28T16-36-45.773188.json with huggingface_hub
11 months ago
arc
Upload eval_results/Qwen/Qwen1.5-4B-Chat/main/arc/results_2024-03-02T15-25-48.691363.json with huggingface_hub
12 months ago
bbh
Upload eval_results/Qwen/Qwen1.5-4B-Chat/main/bbh/results_2024-03-28T16-35-21.201380.json with huggingface_hub
11 months ago
gsm8k
Upload eval_results/Qwen/Qwen1.5-4B-Chat/main/gsm8k/results_2024-03-02T15-38-58.924347.json with huggingface_hub
12 months ago
hellaswag
Upload eval_results/Qwen/Qwen1.5-4B-Chat/main/hellaswag/results_2024-03-02T15-31-55.303310.json with huggingface_hub
12 months ago
ifeval
Upload eval_results/Qwen/Qwen1.5-4B-Chat/main/ifeval/results_2024-03-02T15-33-34.597523.json with huggingface_hub
12 months ago
mmlu
Upload eval_results/Qwen/Qwen1.5-4B-Chat/main/mmlu/results_2024-03-02T15-37-56.167343.json with huggingface_hub
12 months ago
truthfulqa
Upload eval_results/Qwen/Qwen1.5-4B-Chat/main/truthfulqa/results_2024-03-02T15-25-55.226760.json with huggingface_hub
12 months ago
winogrande
Upload eval_results/Qwen/Qwen1.5-4B-Chat/main/winogrande/results_2024-03-02T15-25-12.447850.json with huggingface_hub
12 months ago