A qualitative analysis of sarcasm, irony and related #hashtags on Twitter-Reference-Cited by-同舟云学术

A qualitative analysis of sarcasm, irony and related #hashtags on Twitter

Published:2020-07 Issue:2 Volume:7 Page:205395172097273
ISSN:2053-9517
Container-title:Big Data & Society
language:en
Short-container-title:Big Data & Society

Author:

Sykora Martin^ORCID,Elayan Suzanne,Jackson Thomas W¹

Affiliation:

1. Centre for Information Management, School of Business and Economics, Loughborough University, Loughborough, UK

Abstract

As the use of automated social media analysis tools surges, concerns over accuracy of analytics have increased. Some tentative evidence suggests that sarcasm alone could account for as much as a 50% drop in accuracy when automatically detecting sentiment. This paper assesses and outlines the prevalence of sarcastic and ironic language within social media posts. Several past studies proposed models for automatic sarcasm and irony detection for sentiment analysis; however, these approaches result in models trained on training data of highly questionable quality, with little qualitative appreciation of the underlying data. To understand the issues and scale of the problem, we are the first to conduct and present results of a focused manual semantic annotation analysis of two datasets of Twitter messages (in total 4334 tweets), associated with; (i) hashtags commonly employed in automated sarcasm and irony detection approaches, and (ii) tweets relating to 25 distinct events, including, scandals, product releases, cultural events, accidents, terror incidents, etc. We also highlight the contextualised use of multi-word hashtags in the communication of humour, sarcasm and irony, pointing out that many sentiment analysis tools simply fail to recognise such hashtag-based expressions. Our findings also offer indicative evidence regarding the quality of training data used for automated machine learning models in sarcasm, irony and sentiment detection. Worryingly only 15% of tweets labelled as sarcastic were truly sarcastic. We highlight the need for future research studies to rethink their approach to data preparation and a more careful interpretation of sentiment analysis.

Publisher

SAGE Publications

Subject

Library and Information Sciences,Information Systems and Management,Computer Science Applications,Communication,Information Systems

Link

http://journals.sagepub.com/doi/pdf/10.1177/2053951720972735

Reference38 articles.

1. A Survey of Figurative Language and Its Computational Detection in Online Social Networks

2. Carvalho P, Sarmento L, Silva M, et al. (2009) Clues for detecting irony in user-generated contents: Oh……!! It’s “so easy”. In: Proceedings of the 1st international CIKM workshop on topic-sentiment analysis for mass opinion, Hong Kong, China, 6 November.

Cited by 39 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Digital intermediaries in pandemic times: social media and the role of bots in communicating emotions and stress about Coronavirus;Journal of Computational Social Science;2024-08-09

2. Changes in Online Moral Discourse About Public Figures During #MeToo;Affective Science;2024-08-01

3. Twitter and the projection of political personalities in India;Commonwealth & Comparative Politics;2024-04-02

4. Online Evaluation Information Cascade and Its Impact on Consumer Decision Making: Analyzing Movie Reviews Using Sentiment Corpus;IEEE Access;2024

5. Emotions Matter: A Systematic Review and Meta-Analysis of the Detection and Classification of Students’ Emotions in STEM during Online Learning;Education Sciences;2023-09-08