Author:
Peters Uwe,Quintana Ignacio Ojea
Abstract
AbstractMany philosophers hold that generics (i.e., unquantified generalizations) are pervasive in communication and that when they are about social groups, this may offend and polarize people because generics gloss over variations between individuals. Generics about social groups might be particularly common on Twitter (X). This remains unexplored, however. Using machine learning (ML) techniques, we therefore developed an automatic classifier for social generics, applied it to 1.1 million tweets about people, and analyzed the tweets. While it is often suggested that generics are ubiquitous in everyday communication, we found that most tweets (78%) about people contained no generics. However, tweets with generics received more “likes” and retweets. Furthermore, while recent psychological research may lead to the prediction that tweets with generics about political groups are more common than tweets with generics about ethnic groups, we found the opposite. However, consistent with recent claims that political animosity is less constrained by social norms than animosity against gender and ethnic groups, negative tweets with generics about political groups were significantly more prevalent and retweeted than negative tweets about ethnic groups. Our study provides the first ML-based insights into the use and impact of social generics on Twitter.
Publisher
Springer Science and Business Media LLC
Reference83 articles.
1. Ahmed, B. (2014). Lexical normalisation of twitter data. Science and Information Conference (SAI), 2015, 326–328.
2. Allaway, E., Hwang, J. D., Bhagavatula, C., McKeown, K., Downey, D., & Choi, Y. (2023). Penguins don’t fly: Reasoning about generics through instantiations and exceptions. Conference of the European Chapter of the Association for Computational Linguistics. https://doi.org/10.48550/arXiv.2205.11658
3. Antonakaki, D., Fragopoulou, P., & Ioannidis, S. (2021). A survey of twitter research: Data model, graph structure, sentiment analysis and attacks. Expert Systems with Applications. https://doi.org/10.1016/j.eswa.2020.114006
4. Bailly, A., Blanc, C., Francis, É., Guillotin, T., Jamal, F., Wakim, B., & Roy, P. (2022). Effects of dataset size and interactions on the prediction performance of logistic regression and deep learning models. Computer Methods and Programs in Biomedicine, 213, 106504. https://doi.org/10.1016/j.cmpb.2021.106504
5. Barbieri, F., Camacho-Collados, J., Neves, L., & Espinosa-Anke, L. (2020). TweetEval: Unified benchmark and comparative evaluation for tweet classification. Findings of the Association for Computational Linguistics: EMNLP, 2020, 1644–1650.