Evaluating Machine Learning Stability in Predicting Depression and Anxiety Amidst Subjective Response Errors

Published:2024-03-10 Issue:6 Volume:12 Page:625
ISSN:2227-9032
Container-title:Healthcare
language:en
Short-container-title:Healthcare

Author:

Ku Wai Lim¹, Min Hua²

Affiliation:

1. Systems Biology Center, National Heart, Lung and Blood Institute, NIH, Bethesda, MD 20892, USA

2. Department of Health Administration and Policy, College of Public Health, George Mason University, Fairfax, VA 22030, USA

Abstract

Major Depressive Disorder (MDD) and Generalized Anxiety Disorder (GAD) pose significant burdens on individuals and society, necessitating accurate prediction methods. Machine learning (ML) algorithms utilizing electronic health records and survey data offer promising tools for forecasting these conditions. However, potential bias and inaccuracies inherent in subjective survey responses can undermine the precision of such predictions. This research investigates the reliability of five prominent ML algorithms—a Convolutional Neural Network (CNN), Random Forest, XGBoost, Logistic Regression, and Naive Bayes—in predicting MDD and GAD. A dataset rich in biomedical, demographic, and self-reported survey information is used to assess the algorithms’ performance under different levels of subjective response inaccuracies. These inaccuracies simulate scenarios with potential memory recall bias and subjective interpretations. While all algorithms demonstrate commendable accuracy with high-quality survey data, their performance diverges significantly when encountering erroneous or biased responses. Notably, the CNN exhibits superior resilience in this context, maintaining performance and even achieving enhanced accuracy, Cohen’s kappa score, and positive precision for both MDD and GAD. This highlights the CNN’s superior ability to handle data unreliability, making it a potentially advantageous choice for predicting mental health conditions based on self-reported data. These findings underscore the critical importance of algorithmic resilience in mental health prediction, particularly when relying on subjective data. They emphasize the need for careful algorithm selection in such contexts, with the CNN emerging as a promising candidate due to its robustness and improved performance under data uncertainties.
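The evaluation idea described in the abstract, scoring a predictor at increasing levels of survey response error, can be illustrated with a minimal sketch. This is not the authors' pipeline. The noise model (independent flips of binary answers), the threshold predictor, and all function names are assumptions made for illustration only. The three metrics are the ones the abstract names: accuracy, Cohen's kappa, and precision on the positive class.

```python
import random


def inject_response_errors(responses, error_rate, rng):
    """Flip each binary survey answer with probability error_rate (assumed noise model)."""
    return [[1 - a if rng.random() < error_rate else a for a in row]
            for row in responses]


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the reference labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def cohens_kappa(y_true, y_pred):
    """Agreement between two label lists, corrected for chance agreement."""
    n = len(y_true)
    observed = accuracy(y_true, y_pred)
    labels = set(y_true) | set(y_pred)
    expected = sum((y_true.count(c) / n) * (y_pred.count(c) / n) for c in labels)
    if expected == 1.0:
        return 0.0  # kappa is undefined here; 0.0 is a convention chosen for this sketch
    return (observed - expected) / (1 - expected)


def positive_precision(y_true, y_pred):
    """Share of predicted positives that are true positives."""
    predicted_pos = sum(p == 1 for p in y_pred)
    true_pos = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    return true_pos / predicted_pos if predicted_pos else 0.0


def threshold_predictor(responses, threshold):
    """Hypothetical stand-in for a trained model: positive if enough items are endorsed."""
    return [1 if sum(row) >= threshold else 0 for row in responses]


if __name__ == "__main__":
    rng = random.Random(0)
    # Synthetic 9-item binary questionnaire; not real survey data.
    clean = [[rng.randint(0, 1) for _ in range(9)] for _ in range(500)]
    y_true = threshold_predictor(clean, 5)
    for rate in (0.0, 0.1, 0.3):
        noisy = inject_response_errors(clean, rate, rng)
        y_pred = threshold_predictor(noisy, 5)
        print(rate, accuracy(y_true, y_pred),
              cohens_kappa(y_true, y_pred),
              positive_precision(y_true, y_pred))
```

In a study like the one summarized here, the threshold predictor would be replaced by each trained model (for example a CNN or Random Forest), and the same loop over error rates would show how quickly each model's metrics degrade.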

Publisher

MDPI AG

Link

https://www.mdpi.com/2227-9032/12/6/625/pdf

References: 55 articles.

1. Comorbid generalized anxiety disorder and its association with quality of life in patients with major depressive disorder;Zhou;Sci. Rep.,2017

2. Depressive symptoms, anxiety and cognitive impairment: Emerging evidence in multiple sclerosis;Margoni;Transl. Psychiatry,2023

3. Prognosis and Improved Outcomes in Major Depression: A Review;Kraus;Focus,2020

4. Detecting Careless Responding in Survey Data Using Stochastic Gradient Boosting;Schroeders;Educ. Psychol. Meas.,2022

5. Potential Biases in Machine Learning Algorithms Using Electronic Health Record Data;Gianfrancesco;JAMA Intern. Med.,2018

Cited by 1 article.

1. An Improved Expeditious Meta-Heuristic Clustering Method for Classifying Student Psychological Issues with Homogeneous Characteristics;Mathematics;2024-05-22