Author detection: Analyzing tweets by using a Naïve Bayes classifier-Reference-Cited by-同舟云学术

Author detection: Analyzing tweets by using a Naïve Bayes classifier

Published:2020-08-31 Issue:2 Volume:39 Page:2331-2339
ISSN:1064-1246
Container-title:Journal of Intelligent & Fuzzy Systems
language:
Short-container-title:IFS

Author:

Abascal-Mena Rocío¹,López-Ornelas Erick¹

Affiliation:

1. Department of Information Technologies, Universidad Autónoma Metropolitana, Unidad Cuajimalpa. Avenida Vasco de Quiroga 4871, Colonia. Santa Fe, Cuajimalpa. Delegación Cuajimalpa de Morelos. C.P. 05348, Ciudad de México, México

Abstract

In the context of digital social media, where users have multiple ways to obtain information, it is important to have tools to detect the authorship within a corpus supposedly created by a single author. With the tremendous amount of information coming from social networks there is a lot of research concerning author profiling, but there is a lack of research about the authorship identification. In order to detect the author of a group of tweets, a Naïve Bayes classifier is proposed which is an automatic algorithm based on Bayes’ theorem. The main objective is to determine if a particular tweet was made by a specific user or not, based on its content. The data used correspond to a simple data set, obtained with the Twitter API, composed of four political accounts accompanied by their username and tweet identifier as it is mixed with multiple user tweets. To describe the performance of the classification model and interpret the obtained results, a confusion matrix is used as it contains values like accuracy, sensitivity, specificity, Kappa measure, the positive predictive and negative predictive value. These results show that the prediction model, after several cases of use, have acceptable values against the observed probabilities.

Publisher

IOS Press

Subject

Artificial Intelligence,General Engineering,Statistics and Probability

Reference3 articles.

1. Murthy D. , Twitter: Social Communication in the Twitter Age. Cambridge, UK: Polity Press, (2013), pp. 193.

2. Investigating the use of machine learning algorithms in detecting gender of the Arabic tweet author;AlSukhni;International Journal of Advanced Computer Science & Applications,2016

3. Supervised machine learning for the detection of troll profiles in twitter social network: Application to a real case of cyberbullying;Galán-García;Logic Journal of the IGPL,2016

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The potential of automated classification to categorise police force tweeting behaviours: leading the way to large scale analysis;Policing and Society;2023-09-28

2. Convolutional Neural Networks Based Turkish Text Classification;Bilişim Teknolojileri Dergisi;2023-01-31

3. A Method for Bengali Author Detection Using State of the Arts Supervised Machine Learning Classifiers;Artificial Intelligence and Industrial Applications;2023