Enhancing Sentiment Analysis via Random Majority Under-Sampling with Reduced Time Complexity for Classifying Tweet Reviews

Author:

Almuayqil Saleh Naif, Humayun Mamoona, Jhanjhi N. Z., Almufareh Maram Fahaad, Khan Navid Ali

Abstract

Twitter has become a unique platform for social interaction among people all around the world, producing an extensive body of knowledge that can be used for various purposes. People share and spread their own ideologies and points of view on unique topics, which generates a large amount of content. Sentiment analysis is extremely important to businesses because it can directly influence their key decisions. Challenges in sentiment analysis research include imbalanced datasets, lexical uniqueness, and processing time complexity. Most machine learning models are sequential: they need a considerable amount of time to complete execution. Therefore, we propose a sentiment analysis model specifically designed for imbalanced datasets that can reduce the time complexity of the task by using various sequenced text preprocessing techniques combined with random majority under-sampling. Our proposed model provides results competitive with other models while simultaneously reducing the time complexity of sentiment analysis. The experimental results corroborate this: with XGB, the model achieves an accuracy of 86.5% and an F1 score of 0.874.
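The abstract names random majority under-sampling but does not spell out the procedure. The following is a minimal sketch of the general technique, not the authors' implementation: it assumes each larger class is randomly reduced to the size of the smallest class. The function name `random_majority_undersample`, the `seed` parameter, and the toy tweets are all hypothetical and chosen only for illustration.

```python
import random
from collections import Counter, defaultdict


def random_majority_undersample(texts, labels, seed=0):
    """Randomly drop samples from larger classes until every class
    has as many samples as the smallest class (assumed target size)."""
    rng = random.Random(seed)

    # Group sample indices by class label.
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    # The minority class size is the per-class target.
    target = min(len(idxs) for idxs in by_class.values())

    # Keep a random subset of `target` indices from each class.
    kept = []
    for idxs in by_class.values():
        kept.extend(rng.sample(idxs, target))
    kept.sort()  # preserve the original ordering of the retained samples

    return [texts[i] for i in kept], [labels[i] for i in kept]


# Made-up toy data: 6 positive tweets, 2 negative tweets.
texts = [
    "love this phone", "great service", "works well", "very happy",
    "fast delivery", "would buy again", "terrible battery", "never again",
]
labels = ["pos", "pos", "pos", "pos", "pos", "pos", "neg", "neg"]

balanced_texts, balanced_labels = random_majority_undersample(texts, labels)
print(Counter(balanced_labels))
```

Because the majority class shrinks before training, the classifier sees fewer rows, which is one plausible reading of how under-sampling contributes to the reduced processing time the abstract reports.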

Funder

Jouf University

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering

Cited by 5 articles.

1. Comparative Analysis of Machine Learning Algorithms for Arabic Sentiment Analysis on Imbalanced Social Media Data;2024 ASU International Conference in Emerging Technologies for Sustainability and Intelligent Systems (ICETSIS);2024-01-28

2. Learning Vector Quantization-Based Fuzzy Rules Oversampling Method;Computers, Materials & Continua;2024

3. Decoding consumer voice : Sentiment analysis of web-scraped product reviews;Journal of Information and Optimization Sciences;2024

4. Analyzing Trendy Twitter Hashtags in the 2022 French Election;Studies in Computational Intelligence;2024

5. Mitigating Class Imbalance in Sentiment Analysis through GPT-3-Generated Synthetic Sentences;Applied Sciences;2023-08-29
