Statistical quantification of confounding bias in machine learning models-Reference-Cited by-同舟云学术

Statistical quantification of confounding bias in machine learning models

Published:2022 Issue: Volume:11 Page:
ISSN:2047-217X
Container-title:GigaScience
language:en
Short-container-title:

Author:

Spisak Tamas¹^ORCID

Affiliation:

1. Center for Translational Neuro- and Behavioral Sciences, Institute for Diagnostic and Interventional Radiology and Neuroradiology, Center University Hospital Essen, Essen , D-45147, Germany

Abstract

Abstract Background The lack of nonparametric statistical tests for confounding bias significantly hampers the development of robust, valid, and generalizable predictive models in many fields of research. Here I propose the partial confounder test, which, for a given confounder variable, probes the null hypotheses of the model being unconfounded. Results The test provides a strict control for type I errors and high statistical power, even for nonnormally and nonlinearly dependent predictions, often seen in machine learning. Applying the proposed test on models trained on large-scale functional brain connectivity data (N= 1,865) (i) reveals previously unreported confounders and (ii) shows that state-of-the-art confound mitigation approaches may fail preventing confounder bias in several cases. Conclusions The proposed test (implemented in the package mlconfound; https://mlconfound.readthedocs.io) can aid the assessment and improvement of the generalizability and validity of predictive models and, thereby, fosters the development of clinically useful machine learning biomarkers.

Funder

Deutsche Forschungsgemeinschaft

Publisher

Oxford University Press (OUP)

Subject

Computer Science Applications,Health Informatics

Link

https://academic.oup.com/gigascience/article-pdf/doi/10.1093/gigascience/giac082/45550171/giac082.pdf

Reference69 articles.

1. Machine learning in neuroscience;Vogt;Nat Methods,2018

2. Personalized evidence based medicine: predictive approaches to heterogeneous treatment effects;Kent;BMJ,2018

3. Pain-free resting-state functional brain connectivity predicts individual pain sensitivity;Spisak;Nat Communications,2020