Real-Time Twitter Spam Detection and Sentiment Analysis using Machine Learning and Deep Learning Techniques

Author:

Rodrigues Anisha P1ORCID,Fernandes Roshan1ORCID,A Aakash1,B Abhishek1,Shetty Adarsh1,K Atul1,Lakshmanna Kuruva2ORCID,Shafi R. Mahammad3ORCID

Affiliation:

1. Department of Computer Science and Engineering, NMAM Institute of Technology, Nitte, Karkala, India

2. SITE, Vellore Institute of Technology, Vellore, Tamilnadu, India

3. Department of Electrical and Computer Engineering, College of Engineering and Technology, Tepi Campus, Mizan-Tepi University, Tepi, Ethiopia

Abstract

In this modern world, we are accustomed to a constant stream of data. Major social media sites like Twitter, Facebook, or Quora face a huge dilemma as a lot of these sites fall victim to spam accounts. These accounts are made to trap unsuspecting genuine users by making them click on malicious links or keep posting redundant posts by using bots. This can greatly impact the experiences that users have on these sites. A lot of time and research has gone into effective ways to detect these forms of spam. Performing sentiment analysis on these posts can help us in solving this problem effectively. The main purpose of this proposed work is to develop a system that can determine whether a tweet is “spam” or “ham” and evaluate the emotion of the tweet. The extracted features after preprocessing the tweets are classified using various classifiers, namely, decision tree, logistic regression, multinomial naïve Bayes, support vector machine, random forest, and Bernoulli naïve Bayes for spam detection. The stochastic gradient descent, support vector machine, logistic regression, random forest, naïve Bayes, and deep learning methods, namely, simple recurrent neural network (RNN) model, long short-term memory (LSTM) model, bidirectional long short-term memory (BiLSTM) model, and 1D convolutional neural network (CNN) model are used for sentiment analysis. The performance of each classifier is analyzed. The classification results showed that the features extracted from the tweets can be satisfactorily used to identify if a certain tweet is spam or not and create a learning model that will associate tweets with a particular sentiment.

Publisher

Hindawi Limited

Subject

General Mathematics,General Medicine,General Neuroscience,General Computer Science

Reference34 articles.

1. A real time spam classification of twitter data with comparative analysis of classifiers;S. K. Rawat;IJSTE - International Journal of Science Technology & Engineering,2016

2. A framework for real-time spam detection in Twitter

3. Making the most of tweet-inherent features for social spam detection on twitter;B. Wang,2015

4. A social network spam detection model;O. O. Helen;International Journal of Scientific Engineering and Research,2017

5. Spam Detection in Social Media Networking Sites using Ensemble Methodology with Cross Validation

Cited by 99 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3