Machine Learning Driven Mental Stress Detection on Reddit Posts Using Natural Language Processing

Author:

Inamdar Shaunak,Chapekar Rishikesh,Gite Shilpa,Pradhan BiswajeetORCID

Abstract

Abstract People’s mental conditions are often reflected in their social media activity due to the internet's anonymity. Psychiatric issues are often detected through such activities and can be addressed in their early stages, potentially preventing the consequences of unattended mental disorders like depression and anxiety. In this paper, the authors have implemented machine learning models and used various embedding techniques to classify posts from the famous social media blog site Reddit as stressful and non-stressful. The dataset used contains user posts that can be analyzed to detect patterns in the social media activity of those diagnosed with mental disorders. This paper uses different NLP (Natural Language Processing) tools such as ELMo (Embeddings from Language Models) word embeddings, BERT (Bidirectional Encoder Representations from Transformers) tokenizers, and BoW (Bag of Words) approach to create word/sentence data that can be fed to machine learning models. The results of each method have been discussed. The results achieved a top F1 score of 0.76, a Precision score of 0.71, and a Recall of 0.74 using only the preprocessed texts and machine learning algorithms to classify the posts. The results achieved by this paper are significant and have the potential to be applied in real-world scenarios to analyze mental stress among social media users. Although this paper focuses on data from Reddit, the techniques used can be transferred to similar social media platforms and could help solve the growing mental health crisis.

Funder

Centre for Advanced Modelling and Geospatial lnformation Systems, University of Technology Sydney

Publisher

Springer Science and Business Media LLC

Reference52 articles.

1. American Psychological Association. (2021, March 11). One year of Unhealthy weight gains and increased drinking were reported by Americans coping with pandemic stress [Press release]. http://www.apa.org/news/press/releases/2021/03/one-year-pandemic-stress

2. Turcan, E., & McKeown, K. (2019, October 31). Dreaddit: A Reddit dataset for stress analysis in Social Media. arXiv.org. Retrieved November 7, 2021, from https://arxiv.org/abs/1911.00133.

3. Stirman S, Pennebaker J. Word use in the poetry of suicidal and nonsuicidal poets. Psychosom Med. 2001;63:517–22. https://doi.org/10.1097/00006842-200107000-00001.

4. Zinken J, Zinken K, Wilson J, Butler L, Skinner T. Analysis of syntax and word use to predict successful participation in guided self-help for anxiety and depression. Psychiatry Res. 2010;179:181–6. https://doi.org/10.1016/j.psychres.2010.04.011.

5. Rude S, Gortner E-M, Pennebaker J. Language use of depressed and depression-vulnerable college students. Cogn Emot. 2004;18:1121–33. https://doi.org/10.1080/02699930441000030.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3