Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
HuggingFaceH4/lm-eval-leaderboard
open-r1
/
open-r1-eval-leaderboard
like
26
Running
App
Files
Files
Community
1
Fetching metadata from the HF Docker repository...
7ac902b
open-r1-eval-leaderboard
/
eval_results
/
HuggingFaceH4
/
qwen-1.5-1.8b-odpo
/
v0.3
2 contributors
History:
6 commits
edbeeching
HF staff
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-odpo/v0.3/mmlu/results_2024-03-23T06-40-33.361734.json with huggingface_hub
3b13635
verified
11 months ago
arc
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-odpo/v0.3/arc/results_2024-03-23T06-27-44.236175.json with huggingface_hub
11 months ago
gsm8k
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-odpo/v0.3/gsm8k/results_2024-03-23T06-38-04.381758.json with huggingface_hub
11 months ago
hellaswag
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-odpo/v0.3/hellaswag/results_2024-03-23T06-33-13.580213.json with huggingface_hub
11 months ago
mmlu
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-odpo/v0.3/mmlu/results_2024-03-23T06-40-33.361734.json with huggingface_hub
11 months ago
truthfulqa
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-odpo/v0.3/truthfulqa/results_2024-03-23T06-28-07.226987.json with huggingface_hub
11 months ago
winogrande
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-odpo/v0.3/winogrande/results_2024-03-23T06-27-34.539629.json with huggingface_hub
11 months ago