Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
HuggingFaceH4/lm-eval-leaderboard
open-r1
/
open-r1-eval-leaderboard
like
33
Running
App
Files
Files
Community
1
Fetching metadata from the HF Docker repository...
0896c95
open-r1-eval-leaderboard
/
eval_results
/
HuggingFaceH4
/
qwen-1.5-1.8b-dpo
/
v0.6
3 contributors
History:
8 commits
edbeeching
HF staff
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.6/gsm8k/results_2024-03-22T22-05-37.451133.json with huggingface_hub
5fc2ec6
verified
11 months ago
arc
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.6/arc/results_2024-03-22T21-47-58.817787.json with huggingface_hub
11 months ago
gsm8k
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.6/gsm8k/results_2024-03-22T22-05-37.451133.json with huggingface_hub
11 months ago
hellaswag
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.6/hellaswag/results_2024-03-22T21-53-34.188268.json with huggingface_hub
11 months ago
ifeval
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.6/ifeval/results_2024-03-16T22-42-33.074718.json with huggingface_hub
11 months ago
mmlu
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.6/mmlu/results_2024-03-22T22-01-48.156691.json with huggingface_hub
11 months ago
truthfulqa
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.6/truthfulqa/results_2024-03-22T21-53-49.135432.json with huggingface_hub
11 months ago
winogrande
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.6/winogrande/results_2024-03-22T21-54-39.064441.json with huggingface_hub
11 months ago