Commit History

[ADD] Dataset descriptions for cross-examination framework
5c80286

tathagataraha commited on

[MODIFY] Column descriptions for the cross examination framework
23fd02c

tathagataraha commited on

[ADD] CI intervals for med-safety
ba515db

tathagataraha commited on

[ADD] CSS for logo and MEDIC 5-pillar diagram
85b4142

tathagataraha commited on

Rename assets/MEDIC_Diagram_v1p6-cropped_pages-to-jpg-0001.jpg to assets/MEDIC_Diagram.jpg
3b28b62
verified

tathagataraha commited on

Upload MEDIC_Diagram_v1p6-cropped_pages-to-jpg-0001.jpg
fbe9353
verified

tathagataraha commited on

[ADD] MEDIC 5-pillar diagram
ef49d36

tathagataraha commited on

Merge branch 'main' of https://huggingface.co/spaces/m42-health/MEDIC-Benchmark
32eaa7c

tathagataraha commited on

Update src/about.py
a111e91
verified

cchristophe commited on

[MODIFY] Med-Safety: Average -> Harmfulness Score
2a7ac72

tathagataraha commited on

[MODIFY] Metrics for medical summarization, aci bench and soap notes
c92b14d

tathagataraha commited on

Merge branch 'main' of https://huggingface.co/spaces/m42-health/MEDIC-Benchmark
7d6aad6

tathagataraha commited on

Update src/about.py
818cb65
verified

cchristophe commited on

Update src/about.py
8c295d9
verified

cchristophe commited on

[MODIFY] Cross-evaluation framework column names
faceee1

tathagataraha commited on

[ADD] Cross-examination framework
553b217

tathagataraha commited on

Merge branch 'main' of https://huggingface.co/spaces/m42-health/MEDIC-Benchmark
8b771ed

tathagataraha commited on

Update src/about.py
9e77e60
verified

cchristophe commited on

[ADD] Open-ended evaluation
0da5ee3

tathagataraha commited on

Merge branch 'main' of https://huggingface.co/spaces/m42-health/MEDIC-Benchmark
b5701cc

tathagataraha commited on

[FIX] handled cases where one of the results is not present
34c150d

tathagataraha commited on

Update src/about.py
a2d8d52
verified

cchristophe commited on

[MODIFY] Added support for other frameworks in submit, evaluation queue and harness results display
d86ca68

tathagataraha commited on

Remove About text
acb30f3
verified

cchristophe commited on

Upload image.png
380b27f
verified

cchristophe commited on

Delete assets/image.png
083b876
verified

cchristophe commited on

Update about.py
dfd63f4
verified

cchristophe commited on

[FIX] Minor bug in app.py
78e5f8b

tathagataraha commited on

[FIX] Minor bug in app.py
73a4cc2

tathagataraha commited on

Merge branch 'main' of https://huggingface.co/spaces/m42-health/MEDIC-Benchmark
df906f3

tathagataraha commited on

[ADD] Model submission guide and citation
27e5b96

tathagataraha commited on

Update README.md
5872c94
verified

tathagataraha commited on

Update README.md
2d7478d
verified

tathagataraha commited on

[FIX] Filters and search
d8147b8

tathagataraha commited on

[FIX] Preference tuned model symbol
e1cdc4b

tathagataraha commited on

[ADD] Auto Precision for loading directly from model
671e1a6

tathagataraha commited on

Merge branch 'main' of https://huggingface.co/spaces/m42-health/MEDIC-Benchmark
61cd814

tathagataraha commited on

Update README.md
be0620b
verified

tathagataraha commited on

[REMOVED] Average column
c63935d

tathagataraha commited on

[ADD] Support for slurm id
8a76c2c

tathagataraha commited on

[ADD] Submit form, upload requests to requests dataset
b3eff40

tathagataraha commited on

[ADD] Harness tasks, data display
09b313f

tathagataraha commited on