demo / backend / tasks

Commit History

speedup for demo!
85cc54e
verified

sumuks committed on

update question download format
e64aebd

tfrere committed on

improve security on hf token check
c4411e8

tfrere committed on

update token test
aae1c13

tfrere committed on

update model testing at server startup
97bea1c

tfrere committed on

update model handling in benchmark generation
4759fe1

tfrere committed on

update yourbench error handling
95bf1fc

tfrere committed on

improve get available model provider
d2805fc

tfrere committed on

normalize config file and update default benchmark model
8695aa8

tfrere committed on

add config file for models | fix github link in intro
c2b7f1b

tfrere committed on

add url importer | improve yourbench error handling | refactor
c750639

tfrere committed on

remove bill_to in get available model provider
e98040e

tfrere committed on

update intro | fix evaluation
a86c1f9

tfrere committed on

update models
ab8d3f5

tfrere committed on

update error handling, improve upload security checks
81e0b0c

tfrere committed on

improve get available model provider standalone test
e097fac

tfrere committed on

add qwen2.5 32b as the default
eee5a9a

sumuks committed on

Update backend/tasks/get_available_model_provider.py
4724e8f
verified

sumuks committed on

add llama 8b to models to evaluate as a baseline
afbd2d5
verified

sumuks committed on

change prompt for more creativity, and reduce overall number of questions generated
da80f69
verified

sumuks committed on

add fireworks
6025399
verified

sumuks committed on

temporarily change to llama 8b to allow more scale
89c3b6d
verified

sumuks committed on

update light eval task
1728789

tfrere committed on

update evaluation progress
22c253b

tfrere committed on

update evaluation progress
79407fd

tfrere committed on

update error message and avoid double benchmark generation
7f7e436

tfrere committed on

remove unused get_model_provider
57683d7

tfrere committed on

refactor get available model provider
13efede

tfrere committed on

add model provider switching to eval
4fb52f5

tfrere committed on

add get available model provider to benchmark generation
0e34dc4

tfrere committed on

block >1 MB files | translate comments into English
d6f0b38

tfrere committed on

add downloadable documents | add full demo link
a8a8975

tfrere committed on

cleanup generation logs
7e389db

tfrere committed on

update lighteval results
39acd70

tfrere committed on

update progress handling | fix logo link zone | prioritize sambanova as a provider
8e3f969

tfrere committed on

update
0874ba3

tfrere committed on

add prerendered documents | update filename | refactor
ffa4ae8

tfrere committed on

update evaluationTask to remove local storage
83d60af

tfrere committed on

update production api url
d6b6619

tfrere committed on

update dockerfile
f8ec36f

tfrere committed on

update on tasks
2a8ebbd

tfrere committed on

update frontend
ebdfd67

tfrere committed on

first commit
970eef1

tfrere committed on