Feature group partitioning: an approach for depression severity prediction with class balancing using machine learning algorithms

Author:

Shaha Tumpa Rani,Begum Momotaz,Uddin Jia,Torres Vanessa Yélamos,Iturriaga Josep Alemany,Ashraf Imran,Samad Md. Abdus

Abstract

AbstractIn contemporary society, depression has emerged as a prominent mental disorder that exhibits exponential growth and exerts a substantial influence on premature mortality. Although numerous research applied machine learning methods to forecast signs of depression. Nevertheless, only a limited number of research have taken into account the severity level as a multiclass variable. Besides, maintaining the equality of data distribution among all the classes rarely happens in practical communities. So, the inevitable class imbalance for multiple variables is considered a substantial challenge in this domain. Furthermore, this research emphasizes the significance of addressing class imbalance issues in the context of multiple classes. We introduced a new approach Feature group partitioning (FGP) in the data preprocessing phase which effectively reduces the dimensionality of features to a minimum. This study utilized synthetic oversampling techniques, specifically Synthetic Minority Over-sampling Technique (SMOTE) and Adaptive Synthetic (ADASYN), for class balancing. The dataset used in this research was collected from university students by administering the Burn Depression Checklist (BDC). For methodological modifications, we implemented heterogeneous ensemble learning stacking, homogeneous ensemble bagging, and five distinct supervised machine learning algorithms. The issue of overfitting was mitigated by evaluating the accuracy of the training, validation, and testing datasets. To justify the effectiveness of the prediction models, balanced accuracy, sensitivity, specificity, precision, and f1-score indices are used. Overall, comprehensive analysis demonstrates the discrimination between the Conventional Depression Screening (CDS) and FGP approach. In summary, the results show that the stacking classifier for FGP with SMOTE approach yields the highest balanced accuracy, with a rate of 92.81%. The empirical evidence has demonstrated that the FGP approach, when combined with the SMOTE, able to produce better performance in predicting the severity of depression. Most importantly the optimization of the training time of the FGP approach for all of the classifiers is a significant achievement of this research.

Funder

This study is funded by the European University of Atlantic.

Publisher

Springer Science and Business Media LLC

Reference61 articles.

1. Zafar A, Chitnis S. Survey of depression detection using social networking sites via data mining. In: 2020 10th international conference on cloud computing, data science \& engineering (confluence). 2020:88-93. https://doi.org/10.1109/Confluence47617.2020.9058189.

2. World health organization-what you can do-mental health. https://www.emro.who.int/mnh/what-we-do/index.html. Accessed 13 Dec 2023.

3. Mohit M, Maruf M, Ahmed H, Alam M. Depression and physical illnesses: an update. Bangladesh Med J. 2011;40(1):53–8.

4. Whooley MA, Wong JM. Depression and cardiovascular disorders. Annu Rev Clin Psychol. 2013;9:327–54.

5. Stacy Mosel LMSW. Alcohol and Depression: The Link Between Alcoholism and Depression. 2023. https://americanaddictioncenters.org/alcoholism-treatment/depression. Accessed 13 Dec 2023.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3