Commit History
fix: show partial results even if some evaluations haven't finished
7fdb5f5
Update app.py
9a10727
verified
add wrapping to leaderboard
a5bd804
remove submit tab
117d89c
Update app.py
9b8b426
verified
debug restart interval
fdb1fcf
verified
fix: type hints for styling function
0be9d2f
Factor out floating point styling to a function
90021e9
fix: filtering support for models missing details
5e8e87c
remove intro text and citation block
dcb54b6
Increase floating point number in benchmark metrics
7fcf611
show private models by default
2bd1158
revert to correct usage of ModelDetails (without api)
24c8d00
Added model API to submission screen
20fd601
verified
Update app.py
84582a1
verified
simplified the template
24622c4
Clémentine
commited on
CPU, TOKEN, env variables (#4)
55cc480
verified
Update app.py
4879b93
verified
removed last restart
daf60ae
Clémentine
commited on
simplified calls
50df158
Clémentine
commited on
now with a functionning backend
1ffc326
Clémentine
commited on
fix
1257fc3
Clémentine
commited on
updated leaderboard
efeee6d
Clémentine
commited on
Simplified leaderboard v0
9833cdb
Clémentine
commited on
adding pull back
d084b26
Clémentine
commited on
simplified some parts of the code + updated requirements
9d22eee
Clémentine
commited on
make faster thanks to no concurrency limit
d4aa996
Clémentine
commited on
fix order of request file vs request file list, to avoid resubmitting issues
976f398
Clémentine
commited on
cache
4ff9eef
Clémentine
commited on
update for caching
395eff6
Clémentine
commited on
simplify launcher + remove dataframe warning on boolean columns
ab6f548
Clémentine
commited on
add model architecture as column
3dfaf22
Clémentine
commited on
Try concurrency management
bb149ba
Clémentine
commited on
fix
be0d7e4
Clémentine
commited on
Refactor 2 - added plotting back
b1a1395
Clémentine
commited on
Update app.py
a163e5c
fix col width
fc1e99b
Clémentine
commited on
refacto style + rate limit
df66f6e
Clémentine
commited on
adding collections back
ae85651
Clémentine
commited on
refacto part 1
2a5f9fb
Clémentine
commited on
add new evals to the leaderboard
e3aaf53
Nathan Habib
commited on
add safefail for when we cannot download datasets, will simply restart the space
26286b2
Nathan Habib
commited on
token for checking gated base models
f3cda22
Clémentine
commited on
Merge branch 'main' of https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard
0f4fbd6
Nathan Habib
commited on
reorg to simplify nav in code base
6e56e0d
Clémentine
commited on
Creating functions for plotting results over time (#295)
f2bc0a5
added automatic update of the best LLM models
e295ac3
Clémentine
commited on
reformat files, put metadata in request files
adb0416
Nathan Habib
commited on
updated GPTQ display!
5491f2d
Clémentine
commited on