Pseudo datasets explain artificial neural networks

Published:2024-04-10
ISSN:2364-415X
Container-title:International Journal of Data Science and Analytics
language:en
Short-container-title:Int J Data Sci Anal

Author:

Chu Yi-Chi, Chen Yi-Hau, Guo Chao-Yu

Abstract

Machine learning often improves predictive ability over conventional statistical approaches across many research areas. Regression models, however, have the advantage that the effect of each predictor is straightforward to interpret. Interpretable machine-learning models are therefore desirable as deep-learning techniques advance. Although many studies have proposed ways to explain neural networks, this research proposes an intuitive and feasible algorithm that interprets any set of input features of an artificial neural network in terms of changes at the population-mean level. The algorithm introduces the concept of generating pseudo datasets and evaluating the impact of changes in the input features. The approach can accurately estimate the effect of one or several input neurons and depict the association between the predictors and the outcome variable. In computer simulation studies, the explanatory effects of the predictors derived from the neural network approximated, as a particular case, the general linear model estimates. We also applied the new method to three real-life analyses. The results demonstrated that the new algorithm obtained effect estimates from the neural networks similar to those from regression models, with lower prediction errors than the conventional regression models. The new pipeline is also much less computationally intensive than SHapley Additive exPlanations (SHAP), which could not simultaneously measure the impact of two or more inputs while adjusting for other features.
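The abstract does not spell out the algorithm, so the following is only a minimal sketch of one plausible reading of the pseudo-dataset idea: copy the observed data, shift one or more chosen inputs, and compare the mean prediction before and after. The function name `pseudo_dataset_effect`, the `shifts` argument, and the linear `toy_model` standing in for a fitted neural network are all assumptions made for illustration, not the authors' implementation.

```python
import random
from statistics import mean


def pseudo_dataset_effect(predict, rows, shifts):
    """Population-mean change in prediction when selected inputs are shifted.

    predict: callable mapping one row (list of floats) to a float prediction
    rows:    observed dataset, a list of rows
    shifts:  dict {column index: amount to add}; several columns may be
             shifted at once while the remaining columns stay as observed
    """
    # Pseudo dataset: a copy of the data with only the chosen inputs changed.
    pseudo = [[v + shifts.get(j, 0.0) for j, v in enumerate(row)] for row in rows]
    baseline = mean(predict(r) for r in rows)
    shifted = mean(predict(r) for r in pseudo)
    return shifted - baseline


# Hypothetical stand-in for a trained network; any fitted model's
# prediction function could be passed in its place.
def toy_model(row):
    x0, x1 = row
    return 2.0 * x0 + 3.0 * x1


random.seed(0)
data = [[random.gauss(0, 1), random.gauss(0, 1)] for _ in range(500)]

# Effect of a one-unit increase in the first input alone.
effect_x0 = pseudo_dataset_effect(toy_model, data, {0: 1.0})
# Joint effect of a one-unit increase in both inputs.
effect_joint = pseudo_dataset_effect(toy_model, data, {0: 1.0, 1: 1.0})
```

Because the stand-in model is linear, the single-input effect recovers its coefficient and the joint effect recovers the sum of both coefficients, which mirrors the abstract's remark that the neural-network effects can approximate general linear model estimates as a particular case.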

Funder

The National Science and Technology Council

National Yang Ming Chiao Tung University

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s41060-024-00526-9.pdf

References: 41 articles.

1. Jordan, M.I., Mitchell, T.M.: Machine learning: trends, perspectives, and prospects. Science 349(6245), 255–260 (2015)

2. Raita, Y., et al.: Emergency department triage prediction of clinical outcomes using machine learning models. Crit. Care 23(1), 1–13 (2019)

3. Giuste, F.O., et al.: Early and fair COVID-19 outcome risk assessment using robust feature selection. Sci. Rep. 13(1), 18981 (2023)

4. Yarkoni, T., Westfall, J.: Choosing prediction over explanation in psychology: lessons from machine learning. Perspect. Psychol. Sci. 12(6), 1100–1122 (2017)

5. Guo, C.Y., Chou, Y.C.: A novel machine learning strategy for model selections: stepwise support vector machine (StepSVM). PLoS ONE 15(8), e0238384 (2020). https://doi.org/10.1371/journal.pone.0238384