Author:
Brglez Mojca, Zayed Omnia, Buitelaar Paul
Abstract
The COVID pandemic spurred the use of various metaphors, some very common and universal, others depending on the language, country and culture. The use of metaphors by the general public, especially in languages other than English, has not yet been sufficiently investigated, one of the reasons being the lack of resources and automatic tools for metaphor analysis. To fill this gap, we introduce TCMeta, a dataset of tweets annotated for metaphors around COVID-19, in two languages from ten different countries. The dataset contains metaphoric phrases covering four source domains. Furthermore, we introduce a semi-automatic methodology to annotate more than 2000 tweets in English and Slovene. To the best of our knowledge, this is the first multilingual semi-automatically compiled dataset of user-generated texts aimed at investigating metaphorical language about the pandemic. It is also the first Slovene dataset of tweets annotated for metaphors.
Funder
Javna Agencija za Raziskovalno Dejavnost RS
Science Foundation Ireland
Horizon 2020
Publisher
Springer Science and Business Media LLC