Informative presence and observation in routine health data: A review of methodology for clinical risk prediction

Author:

Sisk Rose1ORCID,Lin Lijing1,Sperrin Matthew1,Barrett Jessica K23,Tom Brian2,Diaz-Ordaz Karla4,Peek Niels156ORCID,Martin Glen P1

Affiliation:

1. Division of Informatics, Imaging and Data Sciences, School of Health Sciences, University of Manchester, Manchester, United Kingdom

2. MRC Biostatistics Unit, University of Cambridge, Cambridge, United Kingdom

3. Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom

4. Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, United Kingdom

5. NIHR Biomedical Research Centre, Manchester Academic Health Science Centre, University of Manchester, Manchester, United Kingdom

6. Alan Turing Institute, University of Manchester, London, United Kingdom

Abstract

AbstractObjectiveInformative presence (IP) is the phenomenon whereby the presence or absence of patient data is potentially informative with respect to their health condition, with informative observation (IO) being the longitudinal equivalent. These phenomena predominantly exist within routinely collected healthcare data, in which data collection is driven by the clinical requirements of patients and clinicians. The extent to which IP and IO are considered when using such data to develop clinical prediction models (CPMs) is unknown, as is the existing methodology aiming at handling these issues. This review aims to synthesize such existing methodology, thereby helping identify an agenda for future methodological work.Materials and MethodsA systematic literature search was conducted by 2 independent reviewers using prespecified keywords.ResultsThirty-six articles were included. We categorized the methods presented within as derived predictors (including some representation of the measurement process as a predictor in the model), modeling under IP, and latent structures. Including missing indicators or summary measures as predictors is the most commonly presented approach amongst the included studies (24 of 36 articles).DiscussionThis is the first review to collate the literature in this area under a prediction framework. A considerable body relevant of literature exists, and we present ways in which the described methods could be developed further. Guidance is required for specifying the conditions under which each method should be used to enable applied prediction modelers to use these methods.ConclusionsA growing recognition of IP and IO exists within the literature, and methodology is increasingly becoming available to leverage these phenomena for prediction purposes. IP and IO should be approached differently in a prediction context than when the primary goal is explanation. The work included in this review has demonstrated theoretical and empirical benefits of incorporating IP and IO, and therefore we recommend that applied health researchers consider incorporating these methods in their work.

Funder

Medical Research Council

Alan Turing Institute under the “Predictive Healthcare” project

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

Reference72 articles.

1. Opportunities and challenges in developing risk prediction models with electronic health records data: a systematic review;Goldstein;J Am Med Inform Assoc,2017

2. A general framework for considering selection bias in EHR-based studies: what data are observed and why?;Haneuse;EGEMS (Wash DC),2016

3. Sick patients have more data: the non-random completeness of electronic health records;Weiskopf;AMIA Annu Symp Proceedings AMIA Symp,2013

4. MIMIC-III, a freely accessible critical care database;Johnson;Sci Data,2016

5. Illustrating informed presence bias in electronic health records data: how patient interactions with a health system can impact inference;Phelan;EGEMS (Wash DC),2017

Cited by 23 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3