MEDIC-Benchmark / src /populate.py

Commit History

[ADD] Submission of private models
b50c184

tathagataraha commited on

[ADD] CI intervals for med-safety
ba515db

tathagataraha commited on

[MODIFY] Med-Safety: Average -> Harmfulness Score
2a7ac72

tathagataraha commited on

[MODIFY] Metrics for medical summarization, aci bench and soap notes
c92b14d

tathagataraha commited on

[MODIFY] Cross-evaluation framework column names
faceee1

tathagataraha commited on

[ADD] Cross-examination framework
553b217

tathagataraha commited on

[ADD] Open-ended evaluation
0da5ee3

tathagataraha commited on

[MODIFY] Added support for other frameworks in submit, evaluation queue and harness results displau
d86ca68

tathagataraha commited on

[FIX] Filters and search
d8147b8

tathagataraha commited on

[ADD] Submit form, upload requests to requests dataset
b3eff40

tathagataraha commited on

[ADD] Harness tasks, data display
09b313f

tathagataraha commited on