Early Detection of Heart Failure Using Electronic Health Records

Author:

Ng Kenney1,Steinhubl Steven R.1,deFilippi Christopher1,Dey Sanjoy1,Stewart Walter F.1

Affiliation:

1. From the Center for Computational Health, IBM Research, T.J. Watson Research Center, Cambridge, MA (K.N.); Cardiovascular Wellness, Geisinger Health System, Danville, PA (S.R.S.); Digital Medicine, Scripps Health, San Diego, CA (S.R.S.); Cardiology, Inova Heart and Vascular Institute, Fairfax, VA (C.d.); Center for Computational Health, IBM Research, T.J. Watson Research Center, Yorktown Heights, NY (S.D.); and Research, Sutter Health Research, Walnut Creek, CA (W.F.S.).

Abstract

Background— Using electronic health records data to predict events and onset of diseases is increasingly common. Relatively little is known, although, about the tradeoffs between data requirements and model utility. Methods and Results— We examined the performance of machine learning models trained to detect prediagnostic heart failure in primary care patients using longitudinal electronic health records data. Model performance was assessed in relation to data requirements defined by the prediction window length (time before clinical diagnosis), the observation window length (duration of observation before prediction window), the number of different data domains (data diversity), the number of patient records in the training data set (data quantity), and the density of patient encounters (data density). A total of 1684 incident heart failure cases and 13 525 sex, age-category, and clinic matched controls were used for modeling. Model performance improved as (1) the prediction window length decreases, especially when <2 years; (2) the observation window length increases but then levels off after 2 years; (3) the training data set size increases but then levels off after 4000 patients; (4) more diverse data types are used, but, in order, the combination of diagnosis, medication order, and hospitalization data was most important; and (5) data were confined to patients who had ≥10 phone or face-to-face encounters in 2 years. Conclusions— These empirical findings suggest possible guidelines for the minimum amount and type of data needed to train effective disease onset predictive models using longitudinal electronic health records data.

Publisher

Ovid Technologies (Wolters Kluwer Health)

Subject

Cardiology and Cardiovascular Medicine

Reference20 articles.

1. American Heart Association Statistics Committee; Stroke Statistics Subcommittee, Heart Disease and Stroke Statistics—2016 Update: A Report From the American Heart Association.;Writing Group Members;Circulation,2015

2. Trends in Heart Failure Incidence and Survival in a Community-Based Population

3. Deaths: final data for 2010.;Murphy SL;Natl Vital Stat Rep,2013

4. Early detection of heart failure with varying prediction windows by structured and unstructured data in electronic health records.;Wang Y;Conf Proc IEEE Eng Med Biol Soc,2015

5. Prevalence of Heart Failure Signs and Symptoms in a Large Primary Care Population Identified Through the Use of Text and Data Mining of the Electronic Health Record

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3