Corpus Annotation and Analysis of Sarcasm in Twitter: #CatsMovie vs. #TheRiseOfSkywalker-Reference-Cited by-同舟云学术

Corpus Annotation and Analysis of Sarcasm in Twitter: #CatsMovie vs. #TheRiseOfSkywalker

Published:2022-06-29 Issue:1 Volume:44 Page:186-207
ISSN:1989-6840
Container-title:Atlantis. Journal of the Spanish Association for Anglo-American Studies
language:
Short-container-title:Atlantis

Author:

Moreno-Ortiz Antonio^ORCID, ,García-Gámez María^ORCID,

Abstract

Sentiment analysis is a natural language processing task that has received increased attention in the last decade due to the vast amount of opinionated data on social media platforms such as Twitter. Although the methodologies employed have grown in number and sophistication, analysing irony and sarcasm still poses a severe problem. From the linguistic perspective, sarcasm has been studied in discourse analysis from several perspectives, but little attention has been given to specific metrics that measure its relevance. In this paper we describe the creation of a manually-annotated dataset where detailed text markers are included. This dataset is a sample from a larger corpus of tweets (n= 76,764) on two highly controversial films: Cats and Star Wars: The Rise of Skywalker. We took two different samples for each film, one before and one after their release, to compare reception and presence of sarcasm. We then used a sentiment analysis tool to measure the impact of sarcasm in polarity detection and then manually classified the mechanisms of sarcasm generation. The resulting corpus will be useful for machine learning approaches to sarcasm detection as well as discourse analysis studies on irony and sarcasm.

Publisher

AEDEAN (Asociacion Espanola de Estudios Anglo-Norteamericanos)

Subject

Literature and Literary Theory,Linguistics and Language,Language and Linguistics,Cultural Studies

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. “She’ll Never Be a Man” A Corpus-Based Forensic Linguistic Analysis of Misgendering Discrimination on X;Languages;2024-08-30

2. Managing Large Twitter Datasets;Making Sense of Large Social Media Corpora;2024