Affiliation:
1. CHMES Society’s, Dr. Moonje Institute, Nashik, India
Abstract
Many social media platforms have emerged as a result of the online social network's (OSN) rapid expansion. They have become important in day-to-day life, and spammers have turned their attention to them. Spam detection is done in two different ways, such as machine learning (ML) and expert-based detection. The expert-based detection technique’s accuracy depends on expert knowledge, and the manual process is a time-consuming task. Thus, ML-based spam detection is preferred in OSN. Spam identification on social networks is a difficult operation involving a variety of factors, and spam and ham have resulted in an imbalanced data distribution, which gives an advantage to spammers for corrupting our devices. Spam detection based on ML algorithms like Logistic Regression (LR), K-Nearest Neighbour (KNN), Decision Trees (DT), Random Forest (RF), Support Vector Machine (SVM), and XGB, Voting Classifier (VC), and many other algorithms are used to design the address balance and to attain high assessment accuracy. There is a non-balance issue. Text is vectorized by vectorizers and all the relative results are stored. The experimental result shows that, as compared to KN, NB, ETC, RF, SVC, LR, XGB, and DT, the proposed VC provides a higher classification accuracy rate of 97.96%. The proposed methods are effective in identifying balanced and imbalanced datasets, as evidenced by the validation results. The website was created to detect messages as spam or not.
Reference10 articles.
1. [1] RohitV.Adagale, AniketC.Sanap, Anil V.Gitte, Prof. R. H. Kulkarni, “A Survey on Statistical Twitter Spam Detection Demystified: Performance, Stability and Scalability”, International Journal of Interdisciplinary Innovative Research & Development (IJIIRD), ISSN: 2456-236X Vol. 02 Issue 02 | 2018.
2. [2] Z. Miller, B. Dickinson, W. Deitrick, W. Hu, and A. H. Wang, “Twitter spammer detection using data stream clustering,” Inf. Sci., vol. 260, pp. 64–73, Mar. 2014.
3. [3] NambouriSravya, ChavanaSaipraneetha, S. Saraswathi, “Identify the Human or Bots Twitter Data using Machine Learning Algorithms”, International Research Journal of Engineering and Technology (IRJET), Volume: 06 Issue: 03 | Mar 2019.
4. [4] CLAUDIA MEDA, FEDERICA BISIO, PAOLO GASTALDO, RODOLFO ZUNINO DITEN, “Machine Learning Techniques applied to Twitter Spammers Detection”, Recent Advances in Electrical and Electronic Engineering, ISBN: 978- 960-474-399-5, AUGUST, 2016.
5. [5] PatxiGal´an-Garc´ıa, Jos´eGaviria de la Puerta, Carlos LaordenG´omez, Igor Santos and Pablo Garc´ıaBringas, “Supervised Machine Learning for the Detection of Troll Profiles in Twitter Social Network: Cyberbullying”, IET Software, Vol. 6, Iss. 6, MAY 2014.