Affiliation:
1. University of Johannesburg
Abstract
Abstract
The Covid-19 pandemic has caused a significant impact on society, with discussions about the virus taking place on various social media platforms. In this study, different machine learning techniques for sentiment analysis of COVID-19 Subvariant XBB.1.5 were explored. The datasets of tweets containing hashtags related to Covid-19 Subvariant XBB.1.5 were collected and natural language processing techniques were used as processing techniques to pre-process the text. In this research, all tweets related to COVID-19 Subvariant XBB.1.5 from October 15th, 2022 are collected using the Twitter API. Different machine learning algorithms were later used to classify the tweets as positive, neutral, or negative in sentiment. The different algorithms used includes Stochastic Gradient Descent, Logistic regression, Naïve Bayes, Random Forest, Support Vector Machine and Extreme Gradient Boosting Our results showed that Logistic Regression achieved the highest accuracy, with an overall accuracy of 89% testing accuracy and 100% training accuracy, with positive sentiments having 0.95 Precision, 0.9 recall, and 0.93 F1-Score. The higher percentage of Positive tweets sentiments about COVID-19 Subvariant XBB.1.5, showed that most people were not disturbed about the negative impact the virus could have on them in comparison to the first and other previous Covid variants. The statistical performance of the different machine learning algorithms is measured using Accuracy, F1-Score, recall, precision, and ROC. The accuracy of the different classifiers applied is high.
Publisher
Research Square Platform LLC
Reference54 articles.
1. 1. S. Das and A. K. Kolya, “Predicting the pandemic: sentiment evaluation and predictive analysis from large-scale tweets on Covid-19 by deep convolutional neural network,” Evol. Intell., vol. 15, no. 3, pp. 1913–1934, 2022, doi: 10.1007/s12065-021-00598-7.
2. 2. M. Ghiassi, D. Zimbra, and S. Lee, “Targeted Twitter Sentiment Analysis for Brands Using Supervised Feature Engineering and the Dynamic Architecture for Artificial Neural Networks,” J. Manag. Inf. Syst., vol. 33, no. 4, pp. 1034–1058, 2016, doi: 10.1080/07421222.2016.1267526.
3. 3. J. Gibbons et al., “Twitter-based measures of neighborhood sentiment as predictors of residential population health,” PLoS One, vol. 14, no. 7, Jul. 2019, doi: 10.1371/journal.pone.0219550.
4. 4. B. K. Norambuena, … E. L.-I. data, and undefined 2019, “Sentiment analysis and opinion mining applied to scientific paper reviews,” content.iospress.com, vol. 23, pp. 191–214, 2019, doi: 10.3233/IDA-173807.
5. 5. “Alowaidi S, Saleh M, Abulnaja O (2017) Semantic sentiment analysis of Arabic texts. Int J Adv Comput Sci Appl 8(2):256–262 - Google Search.” https://www.google.com/search?q=Alowaidi+S%2C+Saleh+M%2C+Abulnaja+O+(2017)+Semantic+sentiment+analysis+of+Arabic+texts.+Int+J+Adv+Comput+Sci+Appl+8(2)%3A256–262&rlz=1C1CHBF_enZA1035ZA1035&oq=Alowaidi+S%2C+Saleh+M%2C+Abulnaja+O+(2017)+Semantic+sentiment+analysis+of+Arabic+texts.+Int+J+Adv+Comput+Sci+Appl+8(2)%3A256–262&aqs=chrome..69i57.1012j0j7&sourceid=chrome&ie=UTF-8 (accessed Feb. 13, 2023).