Controlling for effects of confounding variables on machine learning predictions

Published: 2020-08-18

Authors:

Dinga Richard, Schmaal Lianne, Penninx Brenda W.J.H., Veltman Dick J., Marquand Andre F.

Abstract

Machine learning predictive models are being used in neuroimaging to predict information about the task or stimuli or to identify potentially clinically useful biomarkers. However, the predictions can be driven by confounding variables unrelated to the signal of interest, such as scanner effect or head motion, limiting the clinical usefulness and interpretation of machine learning models. The most common method to control for confounding effects is regressing out the confounding variables separately from each input variable before machine learning modeling. However, we show that this method is insufficient because machine learning models can learn information from the data that cannot be regressed out. Instead of regressing out confounding effects from each input variable, we propose controlling for confounds post-hoc on the level of machine learning predictions. This allows partitioning of the predictive performance into the performance that can be explained by confounds and performance independent of confounds. This approach is flexible and allows for parametric and non-parametric confound adjustment. We show in real and simulated data that this method correctly controls for confounding effects even when traditional input variable adjustment produces false-positive findings.
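The partitioning described in the abstract can be illustrated with a small simulation. The sketch below is one possible reading of the idea, not the authors' implementation: it compares the R² of a linear model of the outcome on the confounds alone with the R² of a model on the confounds plus the predictions, and treats the difference as the performance independent of confounds. The function names (ols_r2, partition_r2), the simulated variables, and the choice of linear models and R² are all assumptions made for illustration. The simulated "predictions" stand in for out-of-sample predictions of an already trained model.

```python
import random


def ols_r2(columns, y):
    """R^2 of an ordinary least-squares fit of y on the given columns plus an intercept."""
    n = len(y)
    X = [[1.0] + [col[i] for col in columns] for i in range(n)]
    p = len(X[0])
    # Augmented normal equations: [X'X | X'y]
    A = [
        [sum(X[i][r] * X[i][c] for i in range(n)) for c in range(p)]
        + [sum(X[i][r] * y[i] for i in range(n))]
        for r in range(p)
    ]
    # Gauss-Jordan elimination with partial pivoting
    for k in range(p):
        pivot = max(range(k, p), key=lambda r: abs(A[r][k]))
        A[k], A[pivot] = A[pivot], A[k]
        for r in range(p):
            if r != k:
                factor = A[r][k] / A[k][k]
                A[r] = [a - factor * b for a, b in zip(A[r], A[k])]
    beta = [A[r][p] / A[r][r] for r in range(p)]
    fitted = [sum(b * x for b, x in zip(beta, row)) for row in X]
    mean_y = sum(y) / n
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot


def partition_r2(y, predictions, confounds):
    """Split the variance in y explained by predictions and confounds (hypothetical helper)."""
    r2_confounds = ols_r2(confounds, y)
    r2_predictions = ols_r2([predictions], y)
    r2_full = ols_r2(confounds + [predictions], y)
    return {
        "confounds_alone": r2_confounds,
        "predictions_alone": r2_predictions,
        # Variance explained by predictions over and above the confounds
        "predictions_beyond_confounds": r2_full - r2_confounds,
    }


# Simulated data (assumption): the outcome depends on a confound and on a true signal.
random.seed(0)
n = 2000
confound = [random.gauss(0, 1) for _ in range(n)]
signal = [random.gauss(0, 1) for _ in range(n)]
outcome = [c + s + random.gauss(0, 1) for c, s in zip(confound, signal)]

# Stand-ins for out-of-sample predictions of two different models:
pred_confounded = [c + random.gauss(0, 0.5) for c in confound]  # driven only by the confound
pred_signal = [s + random.gauss(0, 0.5) for s in signal]        # driven only by the signal

result_confounded = partition_r2(outcome, pred_confounded, [confound])
result_signal = partition_r2(outcome, pred_signal, [confound])
print("confound-driven model:", result_confounded)
print("signal-driven model:  ", result_signal)
```

In this simulation both sets of predictions correlate with the outcome when evaluated alone, but only the signal-driven predictions should retain explanatory power once the confound is included in the model. The abstract also mentions non-parametric adjustment; that would replace the linear fits here with a more flexible model and is not shown.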

Publisher

Cold Spring Harbor Laboratory

References: 41 articles.

1. Image processing and Quality Control for the first 10,000 brain imaging datasets from UK Biobank; NeuroImage, 2018

2. Inflation of the type I error rate when a continuous confounding variable is categorized in logistic regression analyses

3. Chyzhyk, D., Varoquaux, G., Thirion, B., Milham, M., 2018. Controlling a confound in predictive models with a test set minimizing its effect, in: 2018 International Workshop on Pattern Recognition in Neuroimaging, PRNI 2018. IEEE, pp. 1–4. https://doi.org/10.1109/PRNI.2018.8423961

4. Craddock, C., Benhajali, Y., Chu, C., Chouinard, F., Evans, A., Jakab, A., Khundrakpam, B., Lewis, J., Li, Q., Milham, M., Yan, C., Bellec, P., 2013. The Neuro Bureau Preprocessing Initiative: open sharing of preprocessed neuroimaging data and derivatives. Front. Neuroinform. 7. https://doi.org/10.3389/conf.fninf.2013.09.00041

5. The autism brain imaging data exchange: towards a large-scale evaluation of the intrinsic brain architecture in autism

Cited by 37 articles.

1. Identifying Patterns of Smoking Cessation App Feature Use That Predict Successful Quitting: Secondary Analysis of Experimental Data Leveraging Machine Learning; JMIR AI; 2024-05-22

2. A Biomarker-Based Framework for the Prediction of Future Chronic Pain; 2024-05-17

3. A Biomarker-Centric Framework for the Prediction of Future Chronic Pain; 2024-04-20

4. A survey of explainable knowledge tracing; Applied Intelligence; 2024-04

5. Unraveling Metabolic Changes following Stroke: Insights from a Urinary Metabolomics Analysis; Metabolites; 2024-02-28