An automated learning model for sentiment analysis and data classification of Twitter data using balanced CA-SVM

Author:

Cyril C Pretty Diana1,Beulah J Rene1,Subramani Neelakandan2,Mohan Prakash3ORCID,Harshavardhan A4,Sivabalaselvamani D5

Affiliation:

1. Department of Computer Science and Engineering, College of Engineering and Technology, Faculty of Engineering and Technology, SRM Institute of Science and Technology, Chennai, Tamil Nadu, India

2. Department of Information Technology, Jeppiaar Institute of Technology, Chennai, Tamil Nadu, India

3. Data Science and Analytics Center, Karpagam College of Engineering, Coimbatore, Tamil Nadu, India

4. Department of Computer Science and Engineering, VNR Vignana Jyothi Institute of Engineering and Technology, Hyderabad, Telangana, India

5. Department of Computer Applications, Kongu Engineering College, Erode, Tamil Nadu, India

Abstract

The modern society runs over the social media for their most time of every day. The web users spend their most time in social media and they share many details with their friends. Such information obtained from their chat has been used in several applications. The sentiment analysis is the one which has been applied with Twitter data set toward identifying the emotion of any user and based on those different problems can be solved. Primarily, the data as of the Twitter database is preprocessed. In this step, tokenization, stemming, stop word removal, and number removal are done. The proposed automated learning with CA-SVM based sentiment analysis model reads the Twitter data set. After that they have been processed to extract the features which yield set of terms. Using the terms, the tweets are clustered using TGS-K means clustering which measures Euclidean distance according to different features like semantic sentiment score (SSS), gazetteer and symbolic sentiment support (GSSS), and topical sentiment score (TSS). Further, the method classifies the tweets according to support vector machine (CA-SVM) which classifies the tweet according to the support value which is measured based on the above two measures. The attained results are validated utilizing k-fold cross-validation methodology. Then, the classification is performed by utilizing the Balanced CA-SVM (Deep Learning Modified Neural Network). The results are evaluated and compared with the existing works. The Proposed model achieved 92.48 % accuracy and 92.05% sentiment score contrasted with the existing works.

Publisher

SAGE Publications

Subject

Computer Science Applications,General Engineering,Modeling and Simulation

Cited by 65 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Considerations on sentiment of social network posts as a feature of destructive impacts;AI Communications;2024-08-13

2. Using the equal sentiment enhancement with distribution (ESED) algorithm in text sentiment analysis: predicting customers purchasing intention (CPI) for IT services on freelance platforms;Third International Conference on Electronic Information Engineering and Data Processing (EIEDP 2024);2024-07-05

3. Sentiment Analysis Model Using Deep Learning;Algorithms for Intelligent Systems;2024

4. Improving Digital Marketing Using Sentiment Analysis with Deep LSTM;Lecture Notes in Networks and Systems;2024

5. Endometriosis Labelling using Machine learning;2023 4th International Conference on Communication, Computing and Industry 6.0 (C216);2023-12-15

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3