Author:
Joo Hyonsoo,Lee Daeun,Lee Sang Haak,Kim Young Kyoon,Rhee Chin Kook
Abstract
Abstract
Introduction
Analysis of the National Health Insurance data has been actively carried out for the purpose of academic research and establishing scientific evidences for health care service policy in asthma. However, there has been a limitation for the accuracy of the data extracted through conventional operational definition. In this study, we verified the accuracy of conventional operational definition of asthma, by applying it to a real hospital setting. And by using a machine learning technique, we established an appropriate operational definition that predicts asthma more accurately.
Methods
We extracted asthma patients using the conventional operational definition of asthma at Seoul St. Mary’s hospital and St. Paul’s hospital at the Catholic University of Korea between January 2017 and January 2018. Among these extracted patients of asthma, 10% of patients were randomly sampled. We verified the accuracy of the conventional operational definition for asthma by matching actual diagnosis through medical chart review. And then we operated machine learning approaches to predict asthma more accurately.
Results
A total of 4,235 patients with asthma were identified using a conventional asthma definition during the study period. Of these, 353 patients were collected. The patients of asthma were 56% of study population, 44% of patients were not asthma. The use of machine learning techniques improved the overall accuracy. The XGBoost prediction model for asthma diagnosis showed an accuracy of 87.1%, an AUC of 93.0%, sensitivity of 82.5%, and specificity of 97.9%. Major explanatory variable were ICS/LABA,LAMA and LTRA for proper diagnosis of asthma.
Conclusions
The conventional operational definition of asthma has limitation to extract true asthma patients in real world. Therefore, it is necessary to establish an accurate standardized operational definition of asthma. In this study, machine learning approach could be a good option for building a relevant operational definition in research using claims data.
Funder
Ministry of Health & Welfare, Republic of Korea
Publisher
Springer Science and Business Media LLC
Subject
Pulmonary and Respiratory Medicine
Reference14 articles.
1. Ferver K, Burton B, Jesilow P. The use of claims data in healthcare research. Open Public Health J. 2009;2(1):11–24.
2. Lee J, Lee JS, Park SH, Shin SA, Kim K. Cohort profile: the National Health Insurance Service-National Sample Cohort (NHIS-NSC), South Korea. Int J Epidemiol. 2017;46(2):e15.
3. Choi JY, Yoon HK, Lee JH, Yoo KH, Kim BY, Bae HW, Kim YK, Rhee CK. Current status of asthma care in South Korea: nationwide the health insurance review and assessment service database. J Thorac Dis. 2017;9(9):3208–14.
4. Choi JY, Yoon HK, Lee JH, Yoo KH, Kim BY, Bae HW, Kim YK, Rhee CK. Nationwide pulmonary function test rates in South Korean asthma patients. J Thorac Dis. 2018;10(7):4360–7.
5. Choi JY, Yoon HK, Lee JH, Yoo KH, Kim BY, Bae HW, Kim YK, Rhee CK. Nationwide use of inhaled corticosteroids by South Korean asthma patients: an examination of the health insurance review and service database. J Thorac Dis. 2018;10(9):5405–13.
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献