Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
jwilles/leaderboard-test
vector-institute
/
eval-leaderboard
like
3
Running
App
Files
Files
Community
1
Fetching metadata from the HF Docker repository...
363cbd2
eval-leaderboard
/
refactor_eval_results.py
Commit History
Change model names to reflect version
954d8ee
xeon27
commited on
Jan 31
Add agentharm and swe-bench tasks
1289818
xeon27
commited on
Jan 29
Add results for GAIA and GDM tasks
2718fde
xeon27
commited on
Jan 28
Add model name links and change single-turn to base
9c55d6d
xeon27
commited on
Jan 27
Change nomenclature to single-turn
eb538cb
xeon27
commited on
Jan 24
Replace missing values by None
18638a9
xeon27
commited on
Jan 24
Add relevant model links
5438c77
xeon27
commited on
Jan 21
Add tmp code
e004342
xeon27
commited on
Jan 20
Add script for refactoring results from log files
8b91831
xeon27
commited on
Jan 20