demo/backend/tasks/evaluation_task.py

Commit History

add qwen2.5 32b as the default
eee5a9a

sumuks committed on

add llama 8b to models to evaluate as a baseline
afbd2d5
verified

sumuks committed on

update light eval task
1728789

tfrere committed on

update evaluation progress
22c253b

tfrere committed on

update evaluation progress
79407fd

tfrere committed on

remove unused get_model_provider
57683d7

tfrere committed on

add model provider switching to eval
4fb52f5

tfrere committed on

block >1 MB files | translate comments into English
d6f0b38

tfrere committed on

add downloadable documents | add full demo link
a8a8975

tfrere committed on

update lighteval results
39acd70

tfrere committed on

add prerendered documents | update filename | refactor
ffa4ae8

tfrere committed on