Affiliation:
1. Industrial and Management Systems Engineering, West Virginia University, Morgantown, WV 26506, USA
Abstract
Suicide is the second leading cause of death among individuals aged 5 to 24 in the United States (US). However, the precursors to suicide often do not surface, making suicide prevention challenging. This study aims to develop a machine learning (ML) model for predicting suicide ideation (SI), suicide planning (SP), and suicide attempts (SA) among adolescents in the US during the coronavirus pandemic. We used data from the 2021 Adolescent Behaviors and Experiences Survey. Class imbalance was addressed with a proposed data augmentation method tailored to binary variables, the Modified Synthetic Minority Over-Sampling Technique. Five different ML models were trained and compared, and SHapley Additive exPlanations (SHAP) analysis was conducted for explainability. The Logistic Regression model, identified as the most effective, performed best across all targets, achieving a recall of 0.82, an accuracy of 0.80, and an area under the Receiver Operating Characteristic curve of 0.88. Sad feelings, hopelessness, sexual behavior, and being overweight were among the most important predictors. Our model holds promise in helping health policymakers design effective public health interventions. By identifying vulnerable sub-groups within regions, it can guide the implementation of tailored interventions that facilitate early identification and referral to medical treatment.
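The abstract does not describe how the Modified Synthetic Minority Over-Sampling Technique works, so the following is only a minimal sketch of one way oversampling can be kept valid for binary variables. It is an assumption for illustration, not the authors' method. The function names `hamming` and `oversample_binary`, the parameter `k`, and the toy rows are all hypothetical. Each synthetic row takes every feature from either a minority-class row or one of its nearest minority neighbours, so no fractional values are produced.

```python
import random


def hamming(a, b):
    """Number of positions at which two equal-length binary vectors differ."""
    return sum(x != y for x, y in zip(a, b))


def oversample_binary(minority, n_new, k=3, seed=0):
    """Create n_new synthetic binary rows from minority-class rows.

    Hypothetical illustration, NOT the paper's Modified SMOTE: each synthetic
    row copies every feature from either a base row or one of its k nearest
    minority neighbours (Hamming distance), so values stay in {0, 1}.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority rows")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority rows by Hamming distance to the base row.
        others = [row for row in minority if row is not base]
        neighbours = sorted(others, key=lambda row: hamming(base, row))[:k]
        mate = rng.choice(neighbours)
        # A feature-wise coin flip between base and neighbour keeps values binary.
        synthetic.append([rng.choice((b, m)) for b, m in zip(base, mate)])
    return synthetic


# Toy data (made up): 4 minority rows with 5 binary survey answers each.
minority_rows = [
    [1, 1, 0, 1, 0],
    [1, 0, 0, 1, 0],
    [1, 1, 1, 1, 0],
    [0, 1, 0, 1, 1],
]
new_rows = oversample_binary(minority_rows, n_new=6)
```

Standard SMOTE interpolates between neighbours and would yield values such as 0.4 for a yes/no survey item. Choosing each feature from one of two real rows avoids that, at the cost of only recombining answer patterns already present in the minority class.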