Does Reinforcement Learning Improve Outcomes for Critically Ill Patients? A Systematic Review and Level-of-Readiness Assessment-Reference-Cited by-同舟云学术

Does Reinforcement Learning Improve Outcomes for Critically Ill Patients? A Systematic Review and Level-of-Readiness Assessment

Published:2023-11-08 Issue:2 Volume:52 Page:e79-e88
ISSN:0090-3493
Container-title:Critical Care Medicine
language:en
Short-container-title:

Author:

Otten Martijn¹²,Jagesar Ameet R.¹²,Dam Tariq A.¹²,Biesheuvel Laurens A.¹²,den Hengst Floris²,Ziesemer Kirsten A.³,Thoral Patrick J.¹,de Grooth Harm-Jan¹,Girbes Armand R.J.¹,François-Lavet Vincent²,Hoogendoorn Mark²,Elbers Paul W.G.¹

Affiliation:

1. Department of Intensive Care Medicine, Center for Critical Care Computational Intelligence, Amsterdam Medical Data Science (AMDS), Amsterdam Cardiovascular Science (ACS), Amsterdam UMC, Vrije Universiteit, Amsterdam, The Netherlands.

2. Quantitative Data Analytics Group, Department of Computer Science, Faculty of Science, Vrije Universiteit, Amsterdam, The Netherlands.

3. University Library, Vrije Universiteit, Amsterdam, The Netherlands.

Abstract

OBJECTIVE: Reinforcement learning (RL) is a machine learning technique uniquely effective at sequential decision-making, which makes it potentially relevant to ICU treatment challenges. We set out to systematically review, assess level-of-readiness and meta-analyze the effect of RL on outcomes for critically ill patients. DATA SOURCES: A systematic search was performed in PubMed, Embase.com, Clarivate Analytics/Web of Science Core Collection, Elsevier/SCOPUS and the Institute of Electrical and Electronics Engineers Xplore Digital Library from inception to March 25, 2022, with subsequent citation tracking. DATA EXTRACTION: Journal articles that used an RL technique in an ICU population and reported on patient health-related outcomes were included for full analysis. Conference papers were included for level-of-readiness assessment only. Descriptive statistics, characteristics of the models, outcome compared with clinician’s policy and level-of-readiness were collected. RL-health risk of bias and applicability assessment was performed. DATA SYNTHESIS: A total of 1,033 articles were screened, of which 18 journal articles and 18 conference papers, were included. Thirty of those were prototyping or modeling articles and six were validation articles. All articles reported RL algorithms to outperform clinical decision-making by ICU professionals, but only in retrospective data. The modeling techniques for the state-space, action-space, reward function, RL model training, and evaluation varied widely. The risk of bias was high in all articles, mainly due to the evaluation procedure. CONCLUSION: In this first systematic review on the application of RL in intensive care medicine we found no studies that demonstrated improved patient outcomes from RL-based technologies. All studies reported that RL-agent policies outperformed clinician policies, but such assessments were all based on retrospective off-policy evaluation.

Publisher

Ovid Technologies (Wolters Kluwer Health)

Subject

Critical Care and Intensive Care Medicine

Reference44 articles.

1. Mastering the game of Go with deep neural networks and tree search.;Silver;Nature,2016

2. Highly accurate protein structure prediction with AlphaFold.;Jumper;Nature,2021

3. An introduction to deep reinforcement learning.;François-Lavet;Found Trends® Mach Learn,2018

4. Reinforcement learning for clinical decision support in critical care: Comprehensive review.;Liu;J Med Internet Res,2020

5. Time to stop randomized and large pragmatic trials for intensive care medicine syndromes: The case of sepsis and acute respiratory distress syndrome.;Girbes;J Thorac Dis,2020