Empirical Analysis of Supervised and Unsupervised Machine Learning Algorithms with Aspect-Based Sentiment Analysis

Author:

Singh Satwinder1ORCID,Kaur Harpreet1,Kanozia Rubal2ORCID,Kaur Gurpreet3

Affiliation:

1. Department of CST , Central University of Punjab , Bathinda

2. 3 Department of Mass Communications & Journalism , Central University of Punjab , Bathinda

3. 4 Faculty of Law , Guru Kashi University , Talwandi Sabo , Bathinda

Abstract

Abstract Machine learning based sentiment analysis is an interdisciplinary approach in opinion mining, particularly in the field of media and communication research. In spite of their different backgrounds, researchers have collaborated to test, train and again retest the machine learning approach to collect, analyse and withdraw a meaningful insight from large datasets. This research classifies the texts of micro-blog (tweets) into positive and negative responses about a particular phenomenon. The study also demonstrates the process of compilation of corpus for review of sentiments, cleaning the body of text to make it a meaningful text, find people’s emotions about it, and interpret the findings. Till date the public sentiment after abrogation of Article 370 has not been studied, which adds the novelty to this scientific study. This study includes the dataset collection from Twitter that comprises 66.7 % of positive tweets and 34.3 % of negative tweets of the people about the abrogation of Article 370. Experimental testing reveals that the proposed methodology is much more effective than the previously proposed methodology. This study focuses on comparison of unsupervised lexicon-based models (TextBlob, AFINN, Vader Sentiment) and supervised machine learning models (KNN, SVM, Random Forest and Naïve Bayes) for sentiment analysis. This is the first study with cyber public opinion over the abrogation of Article 370. Twitter data of more than 2 lakh tweets were collected by the authors. After cleaning, 29732 tweets were selected for analysis. As per the results among supervised learning, Random Forest performs the best, whereas among unsupervised learning TextBlob achieves the highest accuracy of 99 % and 88 %, respectively. Performance parameters of the proposed supervised machine learning models also surpass the result of the recent study performed in 2023 for sentiment analysis.

Publisher

Walter de Gruyter GmbH

Reference27 articles.

1. The Hindu, “Abrogation of Article 370 led to breakdown of law and order in J&K,” 2020. [Online]. Available: https://www.thehindu.com/news/cities/Visakhapatnam/abrogation-of-article-370-led-to-breakdown-of-law-and-order-in-jk/article30669954.ece. (Accessed on: 26 June 2020).

2. S. Bhat, “J&K administration ends house arrest of political leaders in Jammu,” Feb. 2022. [Online]. Available: https://www.indiatoday.in/india/story/j-k-administration-ends-house-arrest-of-political-leaders-in-jammu-1605412-2019-10-02

3. The Hindu, “Left parties protest amendment to Article 370, vow to continue fighting,” Aug. 2019. [Online]. Available: https://www.thehindu.com/news/national/left-parties-protest-scrapping-of-article-370-vow-to-continue-the-fight/article28825167.ece.

4. Y. Dang, Y. Zhang, and H. Chen, “A lexicon-enhanced method for sentiment classification,” IEEE Intell. Syst., vol .25, no. 4, pp. 46–53, Nov. 2010. https://doi.org/10.1109/MIS.2009.105

5. M. Taboada, J. Brooke, M. Tofiloski, K. Voll, and M. Stede, “Lexicon-based methods for sentiment analysis,” Comput. Linguist., vol. 37, no. 2, pp. 267–307, Jun. 2011. https://doi.org/10.1162/COLI_a_00049

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3