m42-health/MEDIC-Benchmark

MEDIC-Benchmark / src / populate.py

Commit History

[ADD] Closed ended arabic

20dad4a

tathagataraha committed on Jan 29

[ADD] Submission of private models

b50c184

tathagataraha committed on Jan 20

[ADD] CI intervals for med-safety

ba515db

tathagataraha committed on Jan 14

[MODIFY] Med-Safety: Average -> Harmfulness Score

2a7ac72

tathagataraha committed on Jan 9

[MODIFY] Metrics for medical summarization, aci bench and soap notes

c92b14d

tathagataraha committed on Jan 9

[MODIFY] Cross-evaluation framework column names

faceee1

tathagataraha committed on Jan 6

[ADD] Cross-examination framework

553b217

tathagataraha committed on Jan 3

[TEMP] Offline

6c10fa6

tathagataraha committed on Dec 29, 2024

[ADD] Med Safety

0a14325

tathagataraha committed on Nov 25, 2024

[ADD] Open-ended evaluation

0da5ee3

tathagataraha committed on Nov 12, 2024

[MODIFY] Added support for other frameworks in submit, evaluation queue and harness results display

d86ca68

tathagataraha committed on Nov 11, 2024

[FIX] Filters and search

d8147b8

tathagataraha committed on Oct 24, 2024

[ADD] Submit form, upload requests to requests dataset

b3eff40

tathagataraha committed on Oct 17, 2024

[ADD] Harness tasks, data display

09b313f

tathagataraha committed on Oct 16, 2024

Duplicate from demo-leaderboard-backend/leaderboard

9ae8d89
verified

clefourrier HF staff committed on Oct 14, 2024