Abstract
Abstract
Background
Chronic obstructive pulmonary disease (COPD) is combination of progressive lung diseases. The diagnosis of COPD is generally based on the pulmonary function testing, however, difficulties underlie in prognosis of smokers or early stage of COPD patients due to the complexity and heterogeneity of the pathogenesis. Computational analyses of omics technologies are expected as one of the solutions to resolve such complexities.
Methods
We obtained transcriptomic data by in vitro testing with exposures of human bronchial epithelial cells to the inducers for early events of COPD to identify the potential descriptive marker genes. With the identified genes, the machine learning technique was employed with the publicly available transcriptome data obtained from the lung specimens of COPD and non-COPD patients to develop the model that can reflect the risk continuum across smoking and COPD.
Results
The expression levels of 15 genes were commonly altered among in vitro tissues exposed to known inducible factors for earlier events of COPD (exposure to cigarette smoke, DNA damage, oxidative stress, and inflammation), and 10 of these genes and their corresponding proteins have not previously reported as COPD biomarkers. Although these genes were able to predict each group with 65% accuracy, the accuracy with which they were able to discriminate COPD subjects from smokers was only 29%.
Furthermore, logistic regression enabled the conversion of gene expression levels to a numerical index, which we named the “potential risk factor (PRF)” index. The highest significant index value was recorded in COPD subjects (0.56 at the median), followed by smokers (0.30) and non-smokers (0.02). In vitro tissues exposed to cigarette smoke displayed dose-dependent increases of PRF, suggesting its utility for prospective risk estimation of tobacco products.
Conclusions
Our experimental-based transcriptomic analysis identified novel genes associated with COPD, and the 15 genes could distinguish smokers and COPD subjects from non-smokers via machine-learning classification with remarkable accuracy. We also suggested a PRF index that can quantitatively reflect the risk continuum across smoking and COPD pathogenesis, and we believe it will provide an improved understanding of smoking effects and new insights into COPD.
Publisher
Springer Science and Business Media LLC
Subject
Pulmonary and Respiratory Medicine
Reference66 articles.
1. Hurd S. The impact of COPD on lung health worldwide: epidemiology and incidence. Chest. 2000;117(2):1S–4S.
2. Castaldi PJ, Dy J, Ross J, Chang Y, Washko GR, Curran-Everett D, Williams A, Lynch DA, Make BJ, Crapo JD, et al. Cluster analysis in the COPDGene study identifies subtypes of smokers with distinct patterns of airway disease and emphysema. Thorax. 2014;69(5):415–22.
3. Adkison SE, O'Connor RJ, Bansal-Travers M, Hyland A, Borland R, Yong HH, Cummings KM, McNeill A, Thrasher JF, Hammond D, et al. Electronic nicotine delivery systems: international tobacco control four-country survey. Am J Prev Med. 2013;44(3):207–15.
4. Agaku IT, King BA, Husten CG, Bunnell R, Ambrose BK, Hu SS, Holder-Hayes E, Day HR. Tobacco product use among adults--United States, 2012-2013. MMWR Morb Mortal Wkly Rep. 2014;63(25):542–7.
5. Murphy J, Gaca M, Lowe F, Minet E, Breheny D, Prasad K, Camacho O, Fearon IM, Liu C, Wright C, et al. Assessing modified risk tobacco and nicotine products: description of the scientific framework and assessment of a closed modular electronic cigarette. Regul Toxicol Pharmacol. 2017;90:342–57.
Cited by
15 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献