Spam Detection in Social Networks Using Machine Learning

Author:

Miss. Sneha Bajirao Sahane 1

Affiliation:

1. CHMES Society’s, Dr. Moonje Institute, Nashik, India

Abstract

Many social media platforms have emerged as a result of the online social network's (OSN) rapid expansion. They have become important in day-to-day life, and spammers have turned their attention to them. Spam detection is done in two different ways, such as machine learning (ML) and expert-based detection. The expert-based detection technique’s accuracy depends on expert knowledge, and the manual process is a time-consuming task. Thus, ML-based spam detection is preferred in OSN. Spam identification on social networks is a difficult operation involving a variety of factors, and spam and ham have resulted in an imbalanced data distribution, which gives an advantage to spammers for corrupting our devices. Spam detection based on ML algorithms like Logistic Regression (LR), K-Nearest Neighbour (KNN), Decision Trees (DT), Random Forest (RF), Support Vector Machine (SVM), and XGB, Voting Classifier (VC), and many other algorithms are used to design the address balance and to attain high assessment accuracy. There is a non-balance issue. Text is vectorized by vectorizers and all the relative results are stored. The experimental result shows that, as compared to KN, NB, ETC, RF, SVC, LR, XGB, and DT, the proposed VC provides a higher classification accuracy rate of 97.96%. The proposed methods are effective in identifying balanced and imbalanced datasets, as evidenced by the validation results. The website was created to detect messages as spam or not.

Publisher

Naksh Solutions

Reference10 articles.

1. [1] RohitV.Adagale, AniketC.Sanap, Anil V.Gitte, Prof. R. H. Kulkarni, “A Survey on Statistical Twitter Spam Detection Demystified: Performance, Stability and Scalability”, International Journal of Interdisciplinary Innovative Research & Development (IJIIRD), ISSN: 2456-236X Vol. 02 Issue 02 | 2018.

2. [2] Z. Miller, B. Dickinson, W. Deitrick, W. Hu, and A. H. Wang, “Twitter spammer detection using data stream clustering,” Inf. Sci., vol. 260, pp. 64–73, Mar. 2014.

3. [3] NambouriSravya, ChavanaSaipraneetha, S. Saraswathi, “Identify the Human or Bots Twitter Data using Machine Learning Algorithms”, International Research Journal of Engineering and Technology (IRJET), Volume: 06 Issue: 03 | Mar 2019.

4. [4] CLAUDIA MEDA, FEDERICA BISIO, PAOLO GASTALDO, RODOLFO ZUNINO DITEN, “Machine Learning Techniques applied to Twitter Spammers Detection”, Recent Advances in Electrical and Electronic Engineering, ISBN: 978- 960-474-399-5, AUGUST, 2016.

5. [5] PatxiGal´an-Garc´ıa, Jos´eGaviria de la Puerta, Carlos LaordenG´omez, Igor Santos and Pablo Garc´ıaBringas, “Supervised Machine Learning for the Detection of Troll Profiles in Twitter Social Network: Cyberbullying”, IET Software, Vol. 6, Iss. 6, MAY 2014.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3