A versatile framework for resource-limited sentiment articulation, annotation, and analysis of short texts-Reference-Cited by-同舟云学术

A versatile framework for resource-limited sentiment articulation, annotation, and analysis of short texts

Published:2020-11-12 Issue:11 Volume:15 Page:e0242050
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Batanović Vuk^ORCID,Cvetanović Miloš,Nikolić Boško

Abstract

Choosing a comprehensive and cost-effective way of articulating and annotating the sentiment of a text is not a trivial task, particularly when dealing with short texts, in which sentiment can be expressed through a wide variety of linguistic and rhetorical phenomena. This problem is especially conspicuous in resource-limited settings and languages, where design options are restricted either in terms of manpower and financial means required to produce appropriate sentiment analysis resources, or in terms of available language tools, or both. In this paper, we present a versatile approach to addressing this issue, based on multiple interpretations of sentiment labels that encode information regarding the polarity, subjectivity, and ambiguity of a text, as well as the presence of sarcasm or a mixture of sentiments. We demonstrate its use on Serbian, a resource-limited language, via the creation of a main sentiment analysis dataset focused on movie comments, and two smaller datasets belonging to the movie and book domains. In addition to measuring the quality of the annotation process, we propose a novel metric to validate its cost-effectiveness. Finally, the practicality of our approach is further validated by training, evaluating, and determining the optimal configurations of several different kinds of machine-learning models on a range of sentiment classification tasks using the produced dataset.

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary

Reference66 articles.

1. Pang B, Lee L, Vaithyanathan S. Thumbs up? Sentiment Classification using Machine Learning Techniques. Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP 2002). Philadelphia, Pennsylvania, USA: Association for Computational Linguistics; 2002. pp. 79–86. http://dl.acm.org/citation.cfm?id=1118704

2. Turney PD. Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews. Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL 2002). Philadelphia, Pennsylvania, USA: Association for Computational Linguistics; 2002. pp. 417–424.

3. Pang B, Lee L. A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts. Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL 2004). Morristown, New Jersey, USA: Association for Computational Linguistics; 2004. p. Article No. 271.

4. Maas AL, Daly RE, Pham PT, Huang D, Ng AY, Potts C. Learning Word Vectors for Sentiment Analysis. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL 2011). Portland, Oregon, USA: Association for Computational Linguistics; 2011. pp. 142–150. http://dl.acm.org/citation.cfm?id=2002491

5. Maynard D, Greenwood MA. Who cares about sarcastic tweets? Investigating the impact of sarcasm on sentiment analysis. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014). Reykjavik, Iceland: European Language Resources Association (ELRA); 2014. pp. 4238–4243. http://www.lrec-conf.org/proceedings/lrec2014/pdf/67_Paper.pdf

Cited by 14 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Mitigating Large Language Model Bias: Automated Dataset Augmentation and Prejudice Quantification;Computers;2024-06-04

2. Analysis of the retraining strategies for multi-label text message classification in call/contact center systems;Scientific Reports;2024-05-02

3. Automated stance detection in complex topics and small languages: The challenging case of immigration in polarizing news media;PLOS ONE;2024-04-26

4. RETRACTED: Multi-modal sarcasm detection based on emotion perception and cross-modality attention fusion;Journal of Intelligent & Fuzzy Systems;2024-04-18

5. SUH-AIFRD: A self-training-based hybrid approach for individual fake reviewer detection;Multimedia Tools and Applications;2024-01-26