Toward Integrated CNN-based Sentiment Analysis of Tweets for Scarce-resource Language

Toward Integrated CNN-based Sentiment Analysis of Tweets for Scarce-resource Language—Hindi

Published:2021-09-30 Issue:5 Volume:20 Page:1-23
ISSN:2375-4699
Container-title:ACM Transactions on Asian and Low-Resource Language Information Processing
language:en
Short-container-title:ACM Trans. Asian Low-Resour. Lang. Inf. Process.

Author:

Gupta Vedika¹^ORCID,Jain Nikita¹,Shubham Shubham¹,Madan Agam¹,Chaudhary Ankit¹,Xin Qin²

Affiliation:

1. Department of Computer Science & Engineering, Bharati Vidyapeeth's College of Engineering, New Delhi, India

2. Faculty of Science and Technology, University of the Faroe Islands, Faroe Islands

Abstract

Linguistic resources for commonly used languages such as English and Mandarin Chinese are available in abundance, hence the existing research in these languages. However, there are languages for which linguistic resources are scarcely available. One of these languages is the Hindi language. Hindi, being the fourth-most popular language, still lacks in richly populated linguistic resources, owing to the challenges involved in dealing with the Hindi language. This article first explores the machine learning-based approaches—Naïve Bayes, Support Vector Machine, Decision Tree, and Logistic Regression—to analyze the sentiment contained in Hindi language text derived from Twitter. Further, the article presents lexicon-based approaches (Hindi Senti-WordNet, NRC Emotion Lexicon) for sentiment analysis in Hindi while also proposing a Domain-specific Sentiment Dictionary. Finally, an integrated convolutional neural network (CNN)—Recurrent Neural Network and Long Short-term Memory—is proposed to analyze sentiment from Hindi language tweets, a total of 23,767 tweets classified into positive, negative, and neutral. The proposed CNN approach gives an accuracy of 85%.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3450447

Reference35 articles.

1. Aspect-based sentiment analysis of mobile reviews

2. R. Piryani V. Gupta V. K. Singh and U. Ghose. 2017. A linguistic rule-based approach for aspect-level sentiment analysis of movie reviews. In Advances in Computer and Computational Sciences. Springer Singapore 201–109. R. Piryani V. Gupta V. K. Singh and U. Ghose. 2017. A linguistic rule-based approach for aspect-level sentiment analysis of movie reviews. In Advances in Computer and Computational Sciences. Springer Singapore 201–109.

3. Movie Prism: A novel system for aspect level sentiment profiling of movies

Cited by 30 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Sentimental impact of fake news on social media using an integrated ensemble framework;Social Network Analysis and Mining;2024-09-09

2. DoSLex: automatic generation of all domain semantically rich sentiment lexicon;Language Resources and Evaluation;2024-07-18

3. Multimodal sentiment analysis of english and hinglish memes;Multimedia Tools and Applications;2024-06-20

4. Which words are important?: an empirical study of Assamese sentiment analysis;Language Resources and Evaluation;2024-06-19

5. Sentence Annotation for Aspect-oriented Sentiment Analysis: A Lexicon based Approach with Marathi Movie Reviews;Journal of The Institution of Engineers (India): Series B;2024-05-18