Abstract
Abstract
People’s mental conditions are often reflected in their social media activity due to the internet's anonymity. Psychiatric issues are often detected through such activities and can be addressed in their early stages, potentially preventing the consequences of unattended mental disorders like depression and anxiety. In this paper, the authors have implemented machine learning models and used various embedding techniques to classify posts from the famous social media blog site Reddit as stressful and non-stressful. The dataset used contains user posts that can be analyzed to detect patterns in the social media activity of those diagnosed with mental disorders. This paper uses different NLP (Natural Language Processing) tools such as ELMo (Embeddings from Language Models) word embeddings, BERT (Bidirectional Encoder Representations from Transformers) tokenizers, and BoW (Bag of Words) approach to create word/sentence data that can be fed to machine learning models. The results of each method have been discussed. The results achieved a top F1 score of 0.76, a Precision score of 0.71, and a Recall of 0.74 using only the preprocessed texts and machine learning algorithms to classify the posts. The results achieved by this paper are significant and have the potential to be applied in real-world scenarios to analyze mental stress among social media users. Although this paper focuses on data from Reddit, the techniques used can be transferred to similar social media platforms and could help solve the growing mental health crisis.
Funder
Centre for Advanced Modelling and Geospatial lnformation Systems, University of Technology Sydney
Publisher
Springer Science and Business Media LLC
Reference52 articles.
1. American Psychological Association. (2021, March 11). One year of Unhealthy weight gains and increased drinking were reported by Americans coping with pandemic stress [Press release]. http://www.apa.org/news/press/releases/2021/03/one-year-pandemic-stress
2. Turcan, E., & McKeown, K. (2019, October 31). Dreaddit: A Reddit dataset for stress analysis in Social Media. arXiv.org. Retrieved November 7, 2021, from https://arxiv.org/abs/1911.00133.
3. Stirman S, Pennebaker J. Word use in the poetry of suicidal and nonsuicidal poets. Psychosom Med. 2001;63:517–22. https://doi.org/10.1097/00006842-200107000-00001.
4. Zinken J, Zinken K, Wilson J, Butler L, Skinner T. Analysis of syntax and word use to predict successful participation in guided self-help for anxiety and depression. Psychiatry Res. 2010;179:181–6. https://doi.org/10.1016/j.psychres.2010.04.011.
5. Rude S, Gortner E-M, Pennebaker J. Language use of depressed and depression-vulnerable college students. Cogn Emot. 2004;18:1121–33. https://doi.org/10.1080/02699930441000030.
Cited by
11 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献