demo / backend

Commit History

speedup for demo!
85cc54e
Running
verified

sumuks commited on

update benchmark displayed questions
7839773

tfrere commited on

update benchmark model
da60a9e

tfrere commited on

update question download format
e64aebd

tfrere commited on

improve security on hf token check
c4411e8

tfrere commited on

update token test
aae1c13

tfrere commited on

update model testing at server startup
97bea1c

tfrere commited on

update model handling in benchmark generation
4759fe1

tfrere commited on

update yourbench error handling
95bf1fc

tfrere commited on

improve get available model provider
d2805fc

tfrere commited on

normalize config file and update default benchmark model
8695aa8

tfrere commited on

add config file for models | fix github link in intro
c2b7f1b

tfrere commited on

improve security on upload-url route
b1e6db8

tfrere commited on

add url importer | improve yourbench error handling | refactor
c750639

tfrere commited on

remove bill_to in get available model provider
e98040e

tfrere commited on

update intro | fix evaluation
a86c1f9

tfrere commited on

update models
ab8d3f5

tfrere commited on

update error handling, improve upload security checks
81e0b0c

tfrere commited on

improve upload validation
d88a570

tfrere commited on

improve upload file security
2a342ed

tfrere commited on

improve get available model provider standalone test
e097fac

tfrere commited on

add qwen2.5 32b as the default
eee5a9a

sumuks commited on

Update backend/tasks/get_available_model_provider.py
4724e8f
verified

sumuks commited on

add llama 8b to models to evaluate as a baseline
afbd2d5
verified

sumuks commited on

change prompt for more creativity, and reduce overall number of questions generated
da80f69
verified

sumuks commited on

add fireworks
6025399
verified

sumuks commited on

temporarily change to llama 8b to allow more scale
89c3b6d
verified

sumuks commited on

update light eval task
1728789

tfrere commited on

update card
17aa340

tfrere commited on

update evaluation progress
22c253b

tfrere commited on

remove cleanup to try results
ff91163

tfrere commited on

always print benchmark question 2 and 3
47f7bc8

tfrere commited on

update evaluation progress
79407fd

tfrere commited on

update error message and avoid double benchmark generation
7f7e436

tfrere commited on

remove unusued get_model_provider
57683d7

tfrere commited on

cleanup
7d26143

tfrere commited on

refactor get available model provider
13efede

tfrere commited on

add moder provider switching to eval
4fb52f5

tfrere commited on

add get available model provider to benchmark generation
0e34dc4

tfrere commited on

add session clenaup and timeout messages
373381c

tfrere commited on

block >1mo files | translate comments in english
d6f0b38

tfrere commited on

add downloadable documents | add full demo link
a8a8975

tfrere commited on

cleanup generation logs
7e389db

tfrere commited on

update lighteval results
39acd70

tfrere commited on

update progress handling | fix logo link zone | prioritize sambanova as a provider
8e3f969

tfrere commited on

update
0874ba3

tfrere commited on

add prerendered documents | update filename | refactor
ffa4ae8

tfrere commited on

update eveluationTask to remove local storage
83d60af

tfrere commited on

update
debda0e

tfrere commited on

remove docker dev and update backend prod url
3c1419a

tfrere commited on