Explainable artificial intelligence (XAI) for exploring spatial variability of lung and bronchus cancer (LBC) mortality rates in the contiguous USA

Published:2021-12 Issue:1 Volume:11
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Ahmed Zia U., Sun Kang, Shelly Michael, Mu Lina

Abstract

Machine learning (ML) has demonstrated promise in predicting mortality; however, understanding spatial variation in risk factor contributions to mortality rates requires explainability. We applied explainable artificial intelligence (XAI) to a stack-ensemble machine learning framework to explore and visualize the spatial distribution of the contributions of known risk factors to lung and bronchus cancer (LBC) mortality rates in the conterminous United States. We used five base learners to develop the stack-ensemble models: a generalized linear model (GLM), random forest (RF), gradient boosting machine (GBM), extreme gradient boosting machine (XGBoost), and deep neural network (DNN). We then applied several model-agnostic approaches to interpret and visualize the stack-ensemble model's output at global and local (county-level) scales. The stack ensemble generally performed better than all the base learners and three spatial regression models. A permutation-based feature importance technique ranked smoking prevalence as the most important predictor, followed by poverty and elevation. However, the impact of these risk factors on LBC mortality rates varies spatially. This is the first study to use ensemble machine learning with explainable algorithms to explore and visualize the spatial heterogeneity of the relationships between LBC mortality and risk factors in the contiguous USA.
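
The two techniques named in the abstract, a stacked ensemble and permutation-based feature importance, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes scikit-learn, uses only three base learners (a linear model standing in for the GLM, a random forest, and a gradient boosting machine), and runs on synthetic data whose column names are hypothetical labels chosen to echo the abstract.

```python
import numpy as np
from sklearn.ensemble import (
    GradientBoostingRegressor,
    RandomForestRegressor,
    StackingRegressor,
)
from sklearn.inspection import permutation_importance
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

# Synthetic stand-in data; the feature names are hypothetical, not the study's dataset.
rng = np.random.default_rng(0)
feature_names = ["smoking", "poverty", "elevation", "noise"]
X = rng.normal(size=(600, 4))
# The response depends most strongly on the first column and not at all on the last.
y = 3.0 * X[:, 0] + 1.0 * X[:, 1] - 0.5 * X[:, 2] + rng.normal(scale=0.1, size=600)

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Base learners feed a linear meta-learner through cross-validated predictions.
stack = StackingRegressor(
    estimators=[
        ("glm", LinearRegression()),
        ("rf", RandomForestRegressor(n_estimators=50, random_state=0)),
        ("gbm", GradientBoostingRegressor(random_state=0)),
    ],
    final_estimator=LinearRegression(),
    cv=5,
)
stack.fit(X_train, y_train)

# Permutation importance: mean drop in held-out R^2 when one column is shuffled.
result = permutation_importance(stack, X_test, y_test, n_repeats=10, random_state=0)
ranking = [feature_names[i] for i in np.argsort(result.importances_mean)[::-1]]
print(ranking)
```

Because permutation importance only needs predictions from a fitted model, it is model-agnostic and applies to the stacked ensemble as a whole rather than to any single base learner. It gives a global ranking; the county-level (local) explanations described in the abstract would need a separate local method.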

Publisher

Springer Science and Business Media LLC

Subject

Multidisciplinary

Link

https://www.nature.com/articles/s41598-021-03198-8.pdf

References: 96 articles.

1. Bray, F. et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: Cancer J. Clin. 68, 394–424. https://doi.org/10.3322/caac.21492 (2018).

2. Wang, H. et al. Global, regional, and national life expectancy, all-cause mortality, and cause-specific mortality for 249 causes of death, 1980–2015: A systematic analysis for the global burden of disease study 2015. Lancet 388, 1459–1544. https://doi.org/10.1016/S0140-6736(16)31012-1 (2016).

3. Siegel, R. L., Miller, K. D. & Jemal, A. Cancer statistics, 2019. CA: Cancer J. Clin. 69, 7–34. https://doi.org/10.3322/caac.21551 (2019).

4. Centers for Disease Control and Prevention (CDC). U.S. Cancer Statistics Working Group. https://www.cdc.gov/cancer/lung/statistics/ (2019).

5. Mokdad, A. H. et al. Trends and patterns of disparities in cancer mortality among US counties, 1980–2014. JAMA 317, 388–406. https://doi.org/10.1001/jama.2016.20324 (2017).

Cited by 23 articles.

1. Predicting blood lead in Uruguayan children: Individual- vs neighborhood-level ensemble learners;PLOS Global Public Health;2024-09-04

2. Prediction of spatial heterogeneity in nutrient-limited sub-tropical maize yield: Implications for precision management in the eastern Indo-Gangetic Plains;Artificial Intelligence in Agriculture;2024-09

3. An explainable AI-assisted web application in cancer drug value prediction;MethodsX;2024-06

4. Spatiotemporal Characteristics and Influencing Factors of Talent Inflow in Northeast China from the Perspective of Urban Amenity;Journal of Urban Planning and Development;2024-06

5. Feature Engineering-Assisted Drug Repurposing on Disease–Drug Transcriptome Profiles in Gastric Cancer;ASSAY and Drug Development Technologies;2024-06-01