Superhuman performance on sepsis MIMIC-III data by distributional reinforcement learning

Böck, Markus; Malle, Julien; Pasterk, Daniel; Kukina, Hrvoje; Hasani, Ramin; Heitzinger, Clemens

doi:10.1371/journal.pone.0275358

DC Field

Value

Language

dc.contributor.author

Böck, Markus

dc.contributor.author

Malle, Julien

dc.contributor.author

Pasterk, Daniel

dc.contributor.author

Kukina, Hrvoje

dc.contributor.author

Hasani, Ramin

dc.contributor.author

Heitzinger, Clemens

dc.date.accessioned

2023-01-06T14:36:23Z

dc.date.available

2023-01-06T14:36:23Z

dc.date.issued

2022-11-03

dc.identifier.citation

Böck, M., Malle, J., Pasterk, D., Kukina, H., Hasani, R., & Heitzinger, C. (2022). Superhuman performance on sepsis MIMIC-III data by distributional reinforcement learning. PLoS ONE, 17(11), Article e0275358. https://doi.org/10.1371/journal.pone.0275358

dc.identifier.issn

1932-6203

dc.identifier.uri

http://hdl.handle.net/20.500.12708/139405

dc.description.abstract

We present a novel setup for treating sepsis using distributional reinforcement learning (RL). Sepsis is a life-threatening medical emergency. Its treatment is considered a challenging, high-stakes decision-making problem that has to account for risk procedurally. Treating sepsis with machine learning algorithms is difficult for several reasons: the initial data are limited and error-prone, the underlying biological system is highly complex, and decisions must be robust, transparent, and safe. We demonstrate a suitable method that combines data imputation by a kNN model using a custom distance with state representation by discretization using clustering, and that enables superhuman decision-making using speedy Q-learning in the framework of distributional RL. Compared to clinicians, the recovery rate is increased by more than 3% on the test data set. Our results illustrate how risk-aware RL agents can play a decisive role in critical situations such as the treatment of sepsis patients, a situation exacerbated by the COVID-19 pandemic (Martineau 2020). In addition, we emphasize the tractability of the methodology and the learning behavior while addressing some criticisms of the previous work (Komorowski et al. 2018) on this topic.

dc.description.sponsorship

Fonds zur Förderung der wissenschaftlichen Forschung (FWF)

dc.language.iso

dc.publisher

PUBLIC LIBRARY SCIENCE

dc.relation.ispartof

PLoS ONE

dc.subject

Pandemics

dc.subject

Reinforcement, Psychology

dc.subject

Algorithms

dc.subject

COVID-19

dc.subject

Sepsis

dc.title

Superhuman performance on sepsis MIMIC-III data by distributional reinforcement learning

dc.type

Article

dc.type

Artikel

dc.identifier.pmid

36327195

dc.identifier.scopus

2-s2.0-85141890310

dc.identifier.url

https://api.elsevier.com/content/abstract/scopus_id/85141890310

dc.contributor.affiliation

TU Wien, Austria

dc.relation.grantno

Y660-N25

dcterms.dateSubmitted

2021-04-28

dc.type.category

Original Research Article

tuw.container.volume

tuw.container.issue

tuw.journal.peerreviewed

true

tuw.peerreviewed

true

wb.publication.intCoWork

International Co-publication

tuw.project.title

Partielle Differentialgleichungen für Nanotechnologie

tuw.researchTopic.id

tuw.researchTopic.name

Modeling and Simulation

tuw.researchTopic.value

100

dcterms.isPartOf.title

PLoS ONE

tuw.publication.orgunit

E101-03-2 - Forschungsgruppe Maschinelles Lernen und Unsicherheitsquantifizierung

tuw.publisher.doi

10.1371/journal.pone.0275358

dc.identifier.articleid

e0275358

dc.identifier.eissn

1932-6203

dc.description.numberOfPages

tuw.author.orcid

0000-0001-9533-1379

tuw.author.orcid

0000-0002-9889-5222

wb.sci

true

wb.sciencebranch

Mathematik

wb.sciencebranch.oefos

1010

wb.sciencebranch.value

100

item.cerifentitytype

Publications

item.languageiso639-1

item.fulltext

no Fulltext

item.openairetype

research article

item.openairecristype

http://purl.org/coar/resource_type/c_2df8fbb1

item.grantfulltext

none

crisitem.project.funder

FWF Fonds zur Förderung der wissenschaftlichen Forschung (FWF)

crisitem.project.grantno

Y660-N25

crisitem.author.dept

E194-01 - Forschungsbereich Software Engineering

crisitem.author.dept

E101-03 - Forschungsbereich Scientific Computing and Modelling

crisitem.author.dept

TU Wien

crisitem.author.dept

E191-01 - Forschungsbereich Cyber-Physical Systems

crisitem.author.dept

E194-06 - Forschungsbereich Machine Learning

crisitem.author.orcid

0000-0002-9889-5222

crisitem.author.parentorg

E194 - Institut für Information Systems Engineering

crisitem.author.parentorg

E101 - Institut für Analysis und Scientific Computing

crisitem.author.parentorg

E191 - Institut für Computer Engineering

crisitem.author.parentorg

E194 - Institut für Information Systems Engineering

Appears in Collections:

Article
