Exploring Multi-lingual, Multi-task, and Adversarial Learning for Low-resource Sentiment Analysis

Author:

Mamta 1ORCID,Ekbal Asif1ORCID,Bhattacharyya Pushpak1ORCID

Affiliation:

1. Indian Institute of Technology Patna, Patna, India

Abstract

Deep learning has become most prominent in solving various Natural Language Processing (NLP) tasks including sentiment analysis. However, these techniques require a considerably large amount of annotated corpus, which is not easy to obtain for most of the languages, especially under the scenario of low-resource settings. In this article, we propose a deep multi-task multi-lingual adversarial framework to solve the resource-scarcity problem of sentiment analysis by leveraging the useful and relevant knowledge from a high-resource language. To transfer the knowledge between the different languages, both the languages are mapped to the shared semantic space using cross-lingual word embeddings. We evaluate our proposed architecture on a low-resource language, Hindi, using English as the high-resource language. Experiments show that our proposed model achieves an accuracy of 60.09% for the movie review dataset and 72.14% for the product review dataset. The effectiveness of our proposed approach is demonstrated with significant performance gains over the state-of-the-art systems and translation-based baselines.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Reference52 articles.

1. Mamta, Asif Ekbal, Pushpak Bhattacharyya, Shikha Srivastava, Alka Kumar, and Tista Saha. 2020. Multi-domain tweet corpora for sentiment analysis: Resource creation and evaluation. In Proceedings of the 12th Language Resources and Evaluation Conference. European Language Resources Association, Marseille, France, 5046–5054.

2. Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, et al. 2016. Tensorflow: A system for large-scale machine learning. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI’16). 265–283.

3. Borrow from rich cousin: transfer learning for emotion detection using cross lingual embedding

4. How Intense Are You? Predicting Intensities of Emotions and Sentiments using Stacked Ensemble [Application Notes]

5. Feature selection and ensemble construction: A two-step method for aspect based sentiment analysis

Cited by 9 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Quality achhi hai (is good), satisfied! Towards aspect based sentiment analysis in code-mixed language;Computer Speech & Language;2025-01

2. Recent advancements and challenges of NLP-based sentiment analysis: A state-of-the-art review;Natural Language Processing Journal;2024-03

3. Multilinguality in Misinformation Detection;The Information Retrieval Series;2024

4. Adversarial Training Method for Machine Learning Model in a Resource-Constrained Environment;Proceedings of the 19th ACM International Symposium on QoS and Security for Wireless and Mobile Networks;2023-10-30

5. Transformer based multilingual joint learning framework for code-mixed and english sentiment analysis;Journal of Intelligent Information Systems;2023-09-15

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3