Sentiment analysis in Nepali: Exploring machine learning and lexicon-based approaches-Reference-Cited by-同舟云学术

Sentiment analysis in Nepali: Exploring machine learning and lexicon-based approaches

Published:2020-08-31 Issue:2 Volume:39 Page:2201-2212
ISSN:1064-1246
Container-title:Journal of Intelligent & Fuzzy Systems
language:
Short-container-title:IFS

Author:

Piryani Rajesh¹,Piryani Bhawna²,Singh Vivek Kumar³,Pinto David⁴

Affiliation:

1. Department of Computer Science, South Asian University, New Delhi, India

2. Department of Graduate Studies, Nepal College of Information Technology (NCIT), Kathmandu, Nepal

3. Department of Computer Science, Banaras Hindu University, Varanasi, India

4. Faculty of Computer Science, Benemerita Universidad Autonoma de Puebla, Puebla (Mexico)

Abstract

In recent times, sentiment analysis research has achieved tremendous impetus on English textual data, however, a very less amount of research has been focused on Nepali textual data. This work is focused towards Nepali textual data. We have explored machine learning approaches and proposed a lexicon-based approach using linguistic features and lexical resources to perform sentiment analysis for tweets written in Nepali language. This lexicon-based approach, first pre-process the tweet, locate the opinion-oriented features and then compute the sentiment polarity of tweet. We have investigated both conventional machine learning models (Multinomial Naïve Bayes (NB), Decision Tree, Support Vector Machine (SVM) and logistic regression) and deep learning models (Convolution Neural Network (CNN), Long Short-Term Memory (LSTM) and CNN-LSTM) for sentiment analysis of Nepali text. These machine learning models and lexicon-based approach have been evaluated on tweet dataset related to Nepal Earthquake 2015 and Nepal blockade 2015. Lexicon based approach has outperformed than conventional machine learning models. Deep learning models have outperformed than conventional machine learning models and lexicon-based approach. We have also created Nepali SentiWordNet and Nepali SenticNet sentiment lexicon from existing English language resources as by-product.

Publisher

IOS Press

Subject

Artificial Intelligence,General Engineering,Statistics and Probability

Reference15 articles.

1. Construction and annotation of a corpus of contemporary Nepali;Yadava;Corpora,2008

2. Nepali Spellchecker;Bal;PAN Localization Working Papers,2004

3. A morphological analyzer and a stemmer for Nepali. PAN localization;Bal;Working Papers,2004

4. A hybrid algorithm for stemming of Nepali text;Sitaula;Intelligent Information Management,2013

5. Hardie A. , A collocation-based approach to Nepali postpositions, Corpus Linguistics and Linguistic Theory 4(1) (2008).

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Surveying the Use of Social Media Data and Natural Language Processing Techniques to Investigate Natural Disasters;Natural Hazards Review;2024-11

2. Share What You Already Know: Cross-Language-Script Transfer and Alignment for Sentiment Detection in Code-Mixed Data;ACM Transactions on Asian and Low-Resource Language Information Processing;2024-07-12

3. Text-Based Emotion Analysis Approach for Understanding Human Behavior;2023 International Conference on Integration of Computational Intelligent System (ICICIS);2023-11-01

4. A survey on sentiment analysis and its applications;Neural Computing and Applications;2023-08-17

5. Review on positional significance of LSTM and CNN in the multilayer deep neural architecture for efficient sentiment classification;Journal of Intelligent & Fuzzy Systems;2023-07-21