Deriving Weight From Big Data: Comparison of Body Weight Measurement–Cleaning Algorithms-Reference-Cited by-同舟云学术

Deriving Weight From Big Data: Comparison of Body Weight Measurement–Cleaning Algorithms

Published:2022-03-09 Issue:3 Volume:10 Page:e30328
ISSN:2291-9694
Container-title:JMIR Medical Informatics
language:en
Short-container-title:JMIR Med Inform

Author:

Evans Richard^ORCID,Burns Jennifer^ORCID,Damschroder Laura^ORCID,Annis Ann^ORCID,Freitag Michelle B^ORCID,Raffa Susan^ORCID,Wiitala Wyndy^ORCID

Abstract

Background Patient body weight is a frequently used measure in biomedical studies, yet there are no standard methods for processing and cleaning weight data. Conflicting documentation on constructing body weight measurements presents challenges for research and program evaluation. Objective In this study, we aim to describe and compare methods for extracting and cleaning weight data from electronic health record databases to develop guidelines for standardized approaches that promote reproducibility. Methods We conducted a systematic review of studies published from 2008 to 2018 that used Veterans Health Administration electronic health record weight data and documented the algorithms for constructing patient weight. We applied these algorithms to a cohort of veterans with at least one primary care visit in 2016. The resulting weight measures were compared at the patient and site levels. Results We identified 496 studies and included 62 (12.5%) that used weight as an outcome. Approximately 48% (27/62) included a replicable algorithm. Algorithms varied from cutoffs of implausible weights to complex models using measures within patients over time. We found differences in the number of weight values after applying the algorithms (71,961/1,175,995, 6.12% to 1,175,177/1,175,995, 99.93% of raw data) but little difference in average weights across methods (93.3, SD 21.0 kg to 94.8, SD 21.8 kg). The percentage of patients with at least 5% weight loss over 1 year ranged from 9.37% (4933/52,642) to 13.99% (3355/23,987). Conclusions Contrasting algorithms provide similar results and, in some cases, the results are not different from using raw, unprocessed data despite algorithm complexity. Studies using point estimates of weight may benefit from a simple cleaning rule based on cutoffs of implausible values; however, research questions involving weight trajectories and other, more complex scenarios may benefit from a more nuanced algorithm that considers all available weight data.

Publisher

JMIR Publications Inc.

Subject

Health Information Management,Health Informatics

Reference52 articles.

1. Impact of the HITECH Act on physicians’ adoption of electronic health records

2. Next-generation phenotyping of electronic health records

3. Exploiting the potential of large databases of electronic health records for research using rapid search algorithms and an intuitive query interface

4. ZozusMRichessonRHammondWAcquiring and using electronic health record dataNIH Collaboratory Electronic Health Records2022-02-12https://rethinkingclinicaltrials.org/resources/acquiring-and-using-electronic-health-record-data/

5. Opportunities and challenges in developing risk prediction models with electronic health records data: a systematic review

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Perioperative Nursing Informatics Relevant Data Standard Research in the Context of Medical Big Data: Improving Patients? Health Behavior;American Journal of Health Behavior;2023-06-30

2. Comparison of weight captured via electronic health record and cellular scales to the gold‐standard clinical method;Obesity Science & Practice;2023-01-12

3. Cleaning of anthropometric data from PCORnet electronic health records using automated algorithms;JAMIA Open;2022-10-04