Affiliation:
1. Department of Information Technologies, Universidad Autónoma Metropolitana, Unidad Cuajimalpa. Avenida Vasco de Quiroga 4871, Colonia. Santa Fe, Cuajimalpa. Delegación Cuajimalpa de Morelos. C.P. 05348, Ciudad de México, México
Abstract
In the context of digital social media, where users have multiple ways to obtain information, it is important to have tools to detect the authorship within a corpus supposedly created by a single author. With the tremendous amount of information coming from social networks there is a lot of research concerning author profiling, but there is a lack of research about the authorship identification. In order to detect the author of a group of tweets, a Naïve Bayes classifier is proposed which is an automatic algorithm based on Bayes’ theorem. The main objective is to determine if a particular tweet was made by a specific user or not, based on its content. The data used correspond to a simple data set, obtained with the Twitter API, composed of four political accounts accompanied by their username and tweet identifier as it is mixed with multiple user tweets. To describe the performance of the classification model and interpret the obtained results, a confusion matrix is used as it contains values like accuracy, sensitivity, specificity, Kappa measure, the positive predictive and negative predictive value. These results show that the prediction model, after several cases of use, have acceptable values against the observed probabilities.
Subject
Artificial Intelligence,General Engineering,Statistics and Probability
Reference3 articles.
1. Murthy D. , Twitter: Social Communication in the Twitter Age. Cambridge, UK: Polity Press, (2013), pp. 193.
2. Investigating the use of machine learning algorithms in detecting gender of the Arabic tweet author;AlSukhni;International Journal of Advanced Computer Science & Applications,2016
3. Supervised machine learning for the detection of troll profiles in twitter social network: Application to a real case of cyberbullying;Galán-García;Logic Journal of the IGPL,2016
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献