Affiliation:
1. Department of Computer Science and Engineering, NMAM Institute of Technology, Nitte, Karkala, India
2. SITE, Vellore Institute of Technology, Vellore, Tamilnadu, India
3. Department of Electrical and Computer Engineering, College of Engineering and Technology, Tepi Campus, Mizan-Tepi University, Tepi, Ethiopia
Abstract
We are now accustomed to a constant stream of data. Major social media sites such as Twitter, Facebook, and Quora face a serious problem: many of them fall victim to spam accounts. These accounts are created to trap unsuspecting genuine users into clicking malicious links, or they use bots to post redundant content repeatedly. This can greatly degrade the experience that users have on these sites. Considerable time and research have gone into finding effective ways to detect these forms of spam, and performing sentiment analysis on the posts can help address the problem. The main purpose of this proposed work is to develop a system that can determine whether a tweet is “spam” or “ham” and evaluate the emotion of the tweet. For spam detection, the features extracted from the preprocessed tweets are classified using several classifiers, namely, decision tree, logistic regression, multinomial naïve Bayes, support vector machine, random forest, and Bernoulli naïve Bayes. For sentiment analysis, stochastic gradient descent, support vector machine, logistic regression, random forest, and naïve Bayes are used, together with deep learning methods, namely, a simple recurrent neural network (RNN) model, a long short-term memory (LSTM) model, a bidirectional long short-term memory (BiLSTM) model, and a 1D convolutional neural network (CNN) model. The performance of each classifier is analyzed. The classification results show that the features extracted from the tweets can be used satisfactorily to identify whether a given tweet is spam and to build a learning model that associates tweets with a particular sentiment.
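As a rough illustration of the spam-detection pipeline the abstract outlines (preprocess the tweets, extract features, classify), the following is a minimal sketch of a multinomial naïve Bayes classifier over bag-of-words counts, written with the Python standard library only. The preprocessing rules, the `urltoken` placeholder, the class and function names, and the toy tweets are all assumptions made for illustration; they are not the features, data, or implementation used in the paper.

```python
import math
import re
from collections import Counter, defaultdict


def preprocess(tweet):
    """Lowercase, replace URLs with a placeholder token, and split into words.

    These rules are illustrative assumptions, not the paper's preprocessing.
    """
    tweet = re.sub(r"https?://\S+", " urltoken ", tweet.lower())
    return re.findall(r"[a-z']+", tweet)


class MultinomialNB:
    """Multinomial naive Bayes with Laplace (add-alpha) smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha

    def fit(self, docs, labels):
        self.class_counts = Counter(labels)      # documents per class
        self.word_counts = defaultdict(Counter)  # word counts per class
        self.vocab = set()
        for doc, label in zip(docs, labels):
            tokens = preprocess(doc)
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, doc):
        # Words never seen in training are ignored.
        tokens = [t for t in preprocess(doc) if t in self.vocab]
        n_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, count in self.class_counts.items():
            total = sum(self.word_counts[label].values())
            denom = total + self.alpha * len(self.vocab)
            # log prior + sum of smoothed log likelihoods
            score = math.log(count / n_docs)
            for t in tokens:
                score += math.log((self.word_counts[label][t] + self.alpha) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy training data, invented for this sketch.
train = [
    ("Win a free iPhone now, click http://spam.example/win", "spam"),
    ("Free followers!!! click here http://spam.example/f", "spam"),
    ("Claim your free prize now", "spam"),
    ("Had a great lunch with friends today", "ham"),
    ("Looking forward to the conference talk tomorrow", "ham"),
    ("My cat is sleeping on the keyboard again", "ham"),
]
model = MultinomialNB().fit([t for t, _ in train], [y for _, y in train])
print(model.predict("Click now for a free prize http://x.example"))
print(model.predict("Lunch with my friends tomorrow"))
```

The other classifiers named in the abstract (decision tree, logistic regression, support vector machine, random forest, Bernoulli naïve Bayes) would consume the same kind of token-count features; only the model fitted on top of them changes.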
Subject
General Mathematics, General Medicine, General Neuroscience, General Computer Science