Abstract
The use of online social networks (OSNs) undoubtedly brings the world closer. OSNs like Twitter provide a public platform for expressing one’s opinions. This great potential is misused through the creation of bot accounts, which spread fake news and manipulate opinions. Hence, distinguishing genuine human accounts from bot accounts has become a pressing issue for researchers. In this paper, we propose a deep learning framework to classify Twitter accounts as either ‘human’ or ‘bot.’ We name the framework ‘DeeProBot,’ which stands for Deep Profile-based Bot detection framework. It uses information from the user profile metadata of a Twitter account, such as the description, follower count, and tweet count. The raw text of the description field is also used as a feature for training the model, embedded with pre-trained Global Vectors (GloVe) for word representation. Using only user profile-based features considerably reduces the feature engineering overhead compared with user timeline-based features such as tweets and retweets. DeeProBot handles mixed feature types, including numerical, binary, and text data, which makes the model hybrid. The network is built from long short-term memory (LSTM) units and dense layers so that it can accept and process the mixed input types. We designed the model to generalize across different datasets and evaluated it on a collection of publicly available labeled datasets in two ways: testing on a hold-out set of the same dataset, and training on one dataset and testing on a different one. In these experiments, the proposed model achieved an AUC as high as 0.97 with a selected set of features.
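The mixed input types mentioned in the abstract can be illustrated with a minimal sketch of how one profile record might be split into a text input (for an embedding and LSTM branch) and numerical and binary inputs (for dense layers). This is not the authors' code. The field names `followers_count`, `statuses_count`, and `verified` follow the Twitter user object, but the choice of `verified` as the binary feature, the `log1p` scaling, the tokenizer, and the helper names are all assumptions made for illustration. The GloVe lookup and the network itself are not shown; the integer ids would index rows of an embedding matrix.

```python
import math
import re
from typing import Dict, List, Tuple

MAX_LEN = 8      # hypothetical fixed description length, in tokens
PAD, UNK = 0, 1  # reserved ids for padding and out-of-vocabulary words


def tokenize(text: str) -> List[str]:
    """Lowercase the text and keep runs of letters, digits, and apostrophes."""
    return re.findall(r"[a-z0-9']+", text.lower())


def build_vocab(descriptions: List[str]) -> Dict[str, int]:
    """Assign each distinct token an id starting at 2 (0 and 1 are reserved)."""
    vocab: Dict[str, int] = {}
    for text in descriptions:
        for tok in tokenize(text):
            vocab.setdefault(tok, len(vocab) + 2)
    return vocab


def encode_text(text: str, vocab: Dict[str, int], max_len: int = MAX_LEN) -> List[int]:
    """Map tokens to ids, truncate to max_len, and right-pad with PAD."""
    ids = [vocab.get(tok, UNK) for tok in tokenize(text)][:max_len]
    return ids + [PAD] * (max_len - len(ids))


def split_profile(
    profile: dict, vocab: Dict[str, int]
) -> Tuple[List[int], List[float], List[int]]:
    """Split one profile into (text ids, numerical features, binary features)."""
    text_ids = encode_text(profile.get("description") or "", vocab)
    # Assumption: log-scale the heavy-tailed counts before the dense layers.
    numeric = [
        math.log1p(profile["followers_count"]),
        math.log1p(profile["statuses_count"]),
    ]
    binary = [int(bool(profile["verified"]))]
    return text_ids, numeric, binary


# Hypothetical example record, not taken from any dataset in the paper.
vocab = build_vocab(["News bot, posting 24/7"])
example = {
    "description": "bot news fan",
    "followers_count": 0,
    "statuses_count": 99,
    "verified": False,
}
text_ids, numeric, binary = split_profile(example, vocab)
print(text_ids, numeric, binary)
```

Keeping the three groups separate lets each one feed the branch suited to it: the padded id sequence goes to the recurrent branch, while the numerical and binary vectors can be concatenated and passed to dense layers.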
Publisher
Springer Science and Business Media LLC
Subject
Computer Science Applications, Human-Computer Interaction, Media Technology, Communication, Information Systems
Cited by
39 articles.