Hybrid feature selection approach to identify optimal features of profile metadata to detect social bots in Twitter-Reference-Cited by-同舟云学术

Hybrid feature selection approach to identify optimal features of profile metadata to detect social bots in Twitter

Published:2021-09-19 Issue:1 Volume:11 Page:
ISSN:1869-5450
Container-title:Social Network Analysis and Mining
language:en
Short-container-title:Soc. Netw. Anal. Min.

Author:

Alothali Eiman,Hayawi Kadhim,Alashwal Hany^ORCID

Abstract

AbstractThe last few years have revealed that social bots in social networks have become more sophisticated in design as they adapt their features to avoid detection systems. The deceptive nature of bots to mimic human users is due to the advancement of artificial intelligence and chatbots, where these bots learn and adjust very quickly. Therefore, finding the optimal features needed to detect them is an area for further investigation. In this paper, we propose a hybrid feature selection (FS) method to evaluate profile metadata features to find these optimal features, which are evaluated using random forest, naïve Bayes, support vector machines, and neural networks. We found that the cross-validation attribute evaluation performance was the best when compared to other FS methods. Our results show that the random forest classifier with six optimal features achieved the best score of 94.3% for the area under the curve. The results maintained overall 89% accuracy, 83.8% precision, and 83.3% recall for the bot class. We found that using four features: favorites_count, verified, statuses_count, and average_tweets_per_day, achieves good performance metrics for bot detection (84.1% precision, 81.2% recall).

Funder

Zayed University

Publisher

Springer Science and Business Media LLC

Subject

Computer Science Applications,Human-Computer Interaction,Media Technology,Communication,Information Systems

Link

https://link.springer.com/content/pdf/10.1007/s13278-021-00786-4.pdf

Reference47 articles.

1. Abokhodair, N, Daisy Y, McDonald DW (2015) Dissecting a social botnet. In: Proceedings of the 18th ACM conference on computer supported cooperative work and social computing, New York, NY, USA: ACM, 839–51. https://doi.org/10.1145/2675133.2675208

2. Alothali E, Nazar Z, Mohamed EA, Hany A (2018) Detecting social bots on Twitter: a literature review. In: 2018 international conference on innovations in information technology (IIT), IEEE, 175–80. https://doi.org/10.1109/INNOVATIONS.2018.8605995

3. H Ariyaluran A Riyaz N Fariza G Abdullah IAT Hashem A Ejaz I Muhammad 2019 Real-time big data processing for anomaly detection: a survey Int J Inf Manag 45 289 307 https://doi.org/10.1016/j.ijinfomgt.2018.08.006

4. DM Beskow KM Carley 2019 Its all in a name: detecting and labeling bots by their name Comput Math Organ Theory 25 1 24 35 https://doi.org/10.1007/s10588-018-09290-1

5. Botometer (2020) Datasets 2020. https://botometer.osome.iu.edu/bot-repository/datasets.html

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. TL‐PBot: Twitter bot profile detection using transfer learning based on DNN model;Engineering Reports;2024-01-10

2. An Efficient Hybrid Feature Selection Technique Toward Prediction of Suspicious URLs in IoT Environment;IEEE Access;2024

3. Machine Learning Classifiers for Social Media Bots Detection on Twitter using Explainable AI;2023 Second International Conference on Informatics (ICI);2023-11-23

4. Systematic Literature Review of Social Media Bots Detection Systems;Journal of King Saud University - Computer and Information Sciences;2023-05

5. Towards a Comprehensive Approach for Socialbot Detection on Twitter: Integrating Multiple Features;2023-04-05