Negation Detection on Mexican Spanish Tweets: The T-MexNeg Corpus

Published:2021-04-25 Issue:9 Volume:11 Page:3880
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Bel-Enguix Gemma, Gómez-Adorno Helena, Pimentel Alejandro, Ojeda-Trueba Sergio-Luis, Aguilar-Vizuet Brian

Abstract

In this paper, we introduce the T-MexNeg corpus of Tweets written in Mexican Spanish. It consists of 13,704 Tweets, of which 4,895 contain negation structures. We performed an analysis of negation statements embedded in the language employed on social media. This research paper aims to present the annotation guidelines along with a novel resource targeted at the negation detection task. The corpus was manually annotated with labels of negation cue, scope, and event. We report the analysis of the inter-annotator agreement for all the components of the negation structure. This resource is freely available. Furthermore, we performed various experiments to automatically identify negation using the T-MexNeg corpus and the SFU ReviewSP-NEG for training a machine learning algorithm. By comparing two different methodologies, one based on a dictionary and the other based on the Conditional Random Fields algorithm, we found that the results of negation identification on Twitter are lower when the model is trained on the SFU ReviewSP-NEG Corpus. Therefore, this paper shows the importance of having resources built specifically to deal with social media language.
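
As a rough illustration of what a dictionary-based approach to negation cue detection can look like, the sketch below matches tokens in a Tweet against a small list of Spanish negation words. The cue list, the tokenizer, and the function names `find_negation_cues` and `contains_negation` are assumptions made for this example; they are not the dictionary or the code used by the authors, and the sketch does not cover scope or event detection.

```python
import re

# Hypothetical cue list for illustration only; NOT the dictionary used in the paper.
NEGATION_CUES = {
    "no", "ni", "nunca", "jamás", "nada", "nadie",
    "tampoco", "sin", "ningún", "ninguna", "ninguno",
}

# Simple word tokenizer; \w matches accented letters in Python 3 strings.
TOKEN_RE = re.compile(r"\w+")


def find_negation_cues(tweet):
    """Return (token, start_offset) pairs for tokens found in the cue list."""
    return [
        (match.group(0), match.start())
        for match in TOKEN_RE.finditer(tweet)
        if match.group(0).lower() in NEGATION_CUES
    ]


def contains_negation(tweet):
    """Return True if at least one dictionary cue occurs in the tweet."""
    return bool(find_negation_cues(tweet))


if __name__ == "__main__":
    print(find_negation_cues("No me gusta nada"))
    print(contains_negation("Hoy hace sol"))
```

A lookup of this kind flags every occurrence of a listed word, including uses that do not negate anything, and it misses multiword or non-standard cues, which is one reason a sequence-labelling model such as Conditional Random Fields is a natural point of comparison.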

Funder

PAPIIT - UNAM

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/11/9/3880/pdf

References: 53 articles.

1. Analysis on the word-formation of English netspeak neologism;Liu;J. Arts Humanit.,2014

2. What is CMC? An overview of scholarly definitions;Ferris;Comput. Mediat. Commun. Mag.,1997

3. Ciberpragmática 2.0. Nuevos usos del lenguaje en Internet;Yus Ramos,2010

4. Corpora Annotated with Negation: An Overview

Cited by 7 articles.

1. Augmenting a Spanish clinical dataset for transformer-based linking of negations and their out-of-scope references;Natural Language Processing;2024-05-17

2. Unraveling Negation in Modern Greek Using Machine Learning: A Comprehensive Analysis and Detection Framework;IFIP Advances in Information and Communication Technology;2024

3. Clinical Text Mining in Spanish Enhanced by Negation Detection and Named Entity Recognition;Computación y Sistemas;2023-12-27

4. Negation and speculation processing: A study on cue-scope labelling and assertion classification in Spanish clinical text;Artificial Intelligence in Medicine;2023-11

5. NoNiRes: A Catalan corpus annotated with negation;PROCES LENG NAT;2023