The limitations of irony detection in Dutch social media-Reference-Cited by-同舟云学术

The limitations of irony detection in Dutch social media

Published:2023-07-23 Issue: Volume: Page:
ISSN:1574-020X
Container-title:Language Resources and Evaluation
language:en
Short-container-title:Lang Resources & Evaluation

Author:

Maladry Aaron,Lefever Els,Van Hee Cynthia,Hoste Véronique

Abstract

AbstractIn this paper, we explore the feasibility of irony detection in Dutch social media. To this end, we investigate both transformer models with embedding representations, as well as traditional machine learning classifiers with extensive feature sets. Our feature-based methodology implements a variety of information sources including lexical, semantic, syntactic, sentiment features, as well as two new data-driven features to model common sense. Based on patterns in the syntactic structure of tweets, we aim to model the presence of contrasting sentiments, a phenomenon that is known to be indicative of verbal irony and sarcasm. Feature selection, as well as voting ensemble techniques were implemented to enhance the classification performance. The final systems reach F1-scores up to 0.79, which are promising results for a task as difficult as irony detection. Besides a quantitative analysis, this paper also describes a thorough qualitative analysis of the system output. Although lexical cues appear to be very important to express irony, our analysis also revealed the need for more advanced modeling of common-sense knowledge to detect more subtle examples of irony.

Funder

Universiteit Gent

Publisher

Springer Science and Business Media LLC

Subject

Library and Information Sciences,Linguistics and Language,Education,Language and Linguistics

Link

https://link.springer.com/content/pdf/10.1007/s10579-023-09656-1.pdf

Reference48 articles.

1. Abnar, S., & Zuidema, W. (2020). Quantifying attention flow in transformers. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, (pp. 4190–4197). Association for Computational Linguistics. https://doi.org/10.18653/v1/2020.acl-main.385. https://aclanthology.org/2020.acl-main.385

2. Babanejad, N., Davoudi, H., An, A., & Papagelis, M. (2020). Affective and contextual embedding for sarcasm detection. In Proceedings of the 28th International Conference on Computational Linguistics, (pp. 225–243). International Committee on Computational Linguistics, Barcelona. https://doi.org/10.18653/v1/2020.coling-main.20. https://aclanthology.org/2020.coling-main.20

3. Barbieri, F., Espinosa Anke, L., & Camacho-Collados, J. (2022). XLM-T: Multilingual language models in Twitter for sentiment analysis and beyond. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, (pp. 258–266). European Language Resources Association, Marseille. Retrieved from https://aclanthology.org/2022.lrec-1.27

4. Barbieri, F., Saggion, H., & Ronzano, F. (2014). Modelling sarcasm in twitter, a novel approach. In Proceedings of the 5th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, (pp. 50–58)

5. Bouazizi, M., & Ohtsuki, T. (2015). Sarcasm detection in twitter:“ all your products are incredibly amazing!!!”-are they really? In 2015 IEEE Global Communications Conference (GLOBECOM), (pp. 1–6). IEEE

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Sabiá in Action: An Investigation of its Abilities in Aspect-Based Sentiment Analysis, Hate Speech Detection, Irony Detection, and Question-Answering;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

2. GreenRu: A Russian Dataset for Detecting Mentions of Green Practices in Social Media Posts;Applied Sciences;2024-05-23