Spaces: ar08/zzz
main: zzz/evaluation/benchmarks
6.23 MB, 1 contributor, History: 1 commit
ar08: Upload 1040 files (246d201, verified), 10 months ago
EDA                  Upload 1040 files  10 months ago
agent_bench          Upload 1040 files  10 months ago
aider_bench          Upload 1040 files  10 months ago
biocoder             Upload 1040 files  10 months ago
bird                 Upload 1040 files  10 months ago
browsing_delegation  Upload 1040 files  10 months ago
commit0_bench        Upload 1040 files  10 months ago
discoverybench       Upload 1040 files  10 months ago
gaia                 Upload 1040 files  10 months ago
gorilla              Upload 1040 files  10 months ago
gpqa                 Upload 1040 files  10 months ago
humanevalfix         Upload 1040 files  10 months ago
logic_reasoning      Upload 1040 files  10 months ago
miniwob              Upload 1040 files  10 months ago
mint                 Upload 1040 files  10 months ago
ml_bench             Upload 1040 files  10 months ago
scienceagentbench    Upload 1040 files  10 months ago
swe_bench            Upload 1040 files  10 months ago
the_agent_company    Upload 1040 files  10 months ago
toolqa               Upload 1040 files  10 months ago
webarena             Upload 1040 files  10 months ago