Author:
Rabin Alexander S.,Weinstein Julien B.,Seelye Sarah M.,Whittington Taylor N.,Hogan Cainnear K.,Prescott Hallie C.
Abstract
Abstract
Objective
Pulmonary function test (PFT) results are recorded variably across hospitals in the Department of Veterans Affairs (VA) electronic health record (EHR), using both unstructured and semi-structured notes. We developed and validated a hospital-specific code to extract pre-bronchodilator measures of obstruction (ratio of forced expiratory volume in one second [FEV1] to forced vital capacity [FVC]) and severity of obstruction (percent predicted of FEV1).
Results
Among 36 VA facilities with the most PFTs completed between 2018 and 2022 from a parent cohort of veterans receiving long-acting controller inhalers, 12 had a consistent syntactical convention or template for reporting PFT data in the EHR. Of the 42,718 PFTs identified from these 12 facilities, the hospital-specific text processing pipeline yielded 24,860 values for the FEV1:FVC ratio and 23,729 values for FEV1. A ratio of FEV1:FVC less than 0.7 was identified in 17,615 of 24,922 studies (70.7%); 8864 of 24,922 (35.6%) had a severe or very severe reduction in FEV1 (< 50% of the predicted value). Among 100 randomly selected PFT reports reviewed by two pulmonary physicians, the coding solution correctly identified the presence of obstruction in 99 out of 100 studies and the degree of obstruction in 96 out of 100 studies.
Funder
U.S. Department of Veterans Affairs
Publisher
Springer Science and Business Media LLC
Reference11 articles.
1. Sauer BC, Jones BE, Gary Globe, Leng J, Lu C-C, He T et al. Performance of an NLP Tool to extract PFT reports from structured and semi-structured VA data. EGEMS (Wash, DC). 2016;4:10.
2. Akgün KM, Sigel K, Cheung K-H, Kidwai-Khan F, Bryant AK, Brandt C, et al. Extracting lung function measurements to enhance phenotyping of chronic obstructive pulmonary disease (COPD) in an electronic health record using automated tools. PLoS ONE. 2020;15:e0227730.
3. England BR, Roul P, Yang Y, Hershberger D, Sayles H, Rojas J, et al. Extracting forced vital capacity from the electronic health record through natural language processing in rheumatoid arthritis-associated interstitial lung disease. Pharmacoepidemiol Drug Saf. 2023. https://doi.org/10.1002/pds.5744.
4. Corporate Data Warehouse (CDW). 2023. https://www.hsrd.research.va.gov/for_researchers/vinci/cdw.cfm. Accessed 14 Feb 2024.
5. IIR 23–032– HSR study. 2024. https://www.hsrd.research.va.gov/research/abstracts.cfm?Project_ID=2141709943. Accessed 27 Feb 2024.