Author:
Dehghan Ashkan,Siuta Kinga,Skorupka Agata,Dubey Akshat,Betlen Andrei,Miller David,Xu Wei,Kamiński Bogumił,Prałat Paweł
Abstract
AbstractUsers on social networks such as Twitter interact with each other without much knowledge of the real-identity behind the accounts they interact with. This anonymity has created a perfect environment for bot accounts to influence the network by mimicking real-user behaviour. Although not all bot accounts have malicious intent, identifying bot accounts in general is an important and difficult task. In the literature there are three distinct types of feature sets one could use for building machine learning models for classifying bot accounts. These feature-sets are: user profile metadata, natural language features (NLP) extracted from user tweets and finally features extracted from the the underlying social network. Profile metadata and NLP features are typically explored in detail in the bot-detection literature. At the same time less attention has been given to the predictive power of features that can be extracted from the underlying network structure. To fill this gap we explore and compare two classes of embedding algorithms that can be used to take advantage of information that network structure provides. The first class are classical embedding techniques, which focus on learning proximity information. The second class are structural embedding algorithms, which capture the local structure of node neighbourhood. We show that features created using structural embeddings have higher predictive power when it comes to bot detection. This supports the hypothesis that the local social network formed around bot accounts on Twitter contains valuable information that can be used to identify bot accounts.
Publisher
Springer Science and Business Media LLC
Subject
Information Systems and Management,Computer Networks and Communications,Hardware and Architecture,Information Systems
Reference48 articles.
1. Ahmed Nesreen K, Rossi Ryan A, Lee John Boaz, Willke Theodore L, Zhou Rong, Kong Xiangnan, Eldardiry Hoda. role2vec: Role-based network embeddings. In Proc. DLG KDD, 2019;1–7.
2. Aiello Luca Maria, Deplano Martina, Schifanella Rossano, Ruffo Giancarlo. People are strange when you’re a stranger: Impact and influence of bots on social networks. In Sixth International AAAI Conference on Weblogs and Social Media, 2012.
3. Ali Alhosseini Seyed, Bin Tareaf Raad, Najafi Pejman, Meinel Christoph. Detect me if you can: Spam bot detection using inductive representation learning. In Companion Proceedings of The 2019 World Wide Web Conference, 2019;pages 148–153.
4. Alkulaib Lulwah, Zhang Lei, Sun Yanshen, Lu Chang-Tien. Twitter bot identification: An anomaly detection approach. In 2022 IEEE International Conference on Big Data (Big Data), pages 3577–3585. IEEE, 2022.
5. Bail Christopher A, Guay Brian, Maloney Emily, Combs Aidan, Hillygus D Sunshine, Merhout Friedolin, Freelon Deen, Volfovsky Alexander. Assessing the Russian internet research agency’s impact on the political attitudes and behaviors of American twitter users in late 2017. Proc Natl Acad Sci. 2020;117(1):243–50.
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献