home |
| archive | weekly digest
0:00 -:--
the best model scored 0.31 and kept ordering the same labs idle
jun
nav
0:00 -:--
the best model scored 0.31 and kept ordering the same labs idle
jun
01
the best model scored 0.31
and kept ordering the same labs
02
clinenv gave seven models
a real inpatient admission
with stages and four agents to query
03
diagnosis f1
0.51
04
management f1
0.17
05
the model knew the patient had pneumonia
by discharge
but could not decide what to do in the meantime
06
every usmle pass rate
hid this gap
07
fifty years after mycin
diagnosis is still not management