eval-leaderboard / src /populate.py

Commit History

Fix bug
6bf1f8e

xeon27 commited on

[WIP] Fix bug
cd7b8dd

xeon27 commited on

Fix bug
25f1697

xeon27 commited on

Add model name links and change single-turn to base
9c55d6d

xeon27 commited on

Remove commented code
aa87c61

xeon27 commited on

Change .eval path
37ebe4e

jwilles commited on

Fix bug
8ad1a09

xeon27 commited on

Add separate tab for agentic benchmark
1d1f5e9

xeon27 commited on

Use dash symbol for markdown
0796d85

xeon27 commited on

Use dash symbol for markdown
a319d81

xeon27 commited on

Fix bug
e7a2635

xeon27 commited on

Log df shape
f066ed8

xeon27 commited on

Fix bug
b1f9063

xeon27 commited on

Log df shape
116683a

xeon27 commited on

Add '-' for empty results
8555000

xeon27 commited on

Fix bug
323e17d

xeon27 commited on

Fix bug
e7fe9f8

xeon27 commited on

Remove column for average
a2f2df3

xeon27 commited on

Replace missing values by None
18638a9

xeon27 commited on

Change extension of web log file to .eval
cd53742

xeon27 commited on

Remove links to col names due to issues
cdca101

xeon27 commited on

Make task names clickable and link to inspect-evals repo
36244aa

xeon27 commited on

Clean up
2a314d2

xeon27 commited on

Fix bug
a2189ab

xeon27 commited on

Fix bug
ca19cea

xeon27 commited on

Fix bug
346f5e5

xeon27 commited on

Make values clickable
bbde2b0

xeon27 commited on

Debug
2c5e9d1

xeon27 commited on

Debug
c054278

xeon27 commited on

Debug
7c6bd6c

xeon27 commited on

Debug
3a37ec7

xeon27 commited on

Debug
d7d56ae

xeon27 commited on

Remove debug code
40ac9c7

xeon27 commited on

Debug
dea22be

xeon27 commited on

Duplicate from demo-leaderboard-backend/leaderboard
4a78d34
verified

jwilles clefourrier HF Staff commited on