Sentiment lexicon for cross-domain adaptation with multi-domain dataset in Indian languages enhanced with BERT classification model-Reference-Cited by-同舟云学术

Sentiment lexicon for cross-domain adaptation with multi-domain dataset in Indian languages enhanced with BERT classification model

Published:2022-09-22 Issue:5 Volume:43 Page:6433-6450
ISSN:1064-1246
Container-title:Journal of Intelligent & Fuzzy Systems
language:
Short-container-title:IFS

Author:

Suresh Kumar K.¹,Helen Sulochana C.²,Radhamani A.S.³,Ananth Kumar T.¹

Affiliation:

1. IFET College of Engineering (Autonomous), Villupuram, Tamilnadu, India

2. St. Xavier’s Catholic College of Engineering, Kanyakumari, Tamilnadu, India

3. Amrita College of Engineering & Technology, Nagerkoil, India

Abstract

Many websites are attempting to offer a platform for users or customers to leave their reviews and comments about the products or services in their native languages. The cross-domain adaptation (CDA) analyses sentiment across domains. The sentiment lexicon falls short resulting in issues like feature mismatch, sparsity, polarity mismatch and polysemy. In this research, an augmented sentiment dictionary is developed in our native regional language (Tamil) that intends to construct the contextual links between terms in multi-domain datasets to reduce problems like polarity mismatch, feature mismatch, and polysemy. Data from the source domain and target domain both labeled and unlabeled are used in the proposed dictionary. To be more specific, the initial dictionary uses normalised pointwise mutual information (nPMI) to derive contextual weight, whereas the final dictionary uses the value of terms across all reviews to compute the accurate rank score. Here, a deep learning model called BERT is used for sentiment classification. For cross-domain adaptation, a modified multi-layer fuzzy-based convolutional neural network (M-FCNN) is deployed. This work aims to build a single dictionary using large number of vocabularies for classifying the reviews in Tamil for several target domains. This extendible dictionary enhances the accuracy of CDA greatly when compared to existing baseline techniques and easily handles a large number of terms in different domains.

Publisher

IOS Press

Subject

Artificial Intelligence,General Engineering,Statistics and Probability

Reference28 articles.

1. Customer reviews sentiment-based analysis and clustering for market-oriented tourism services and products development or positioning,} };Jardim;Procedia Computer Science,2022

2. A review of natural language processing techniques for opinion mining systems;Sun;Information Fusion,2017

3. Unified Cross-domain Classification via Geometric and Statistical Adaptations;Liu;Pattern Recognition,2021

4. 360 degree view of cross-domain opinion classification: a survey;Singh;Artificial Intelligence Review,2021

5. Approaches to cross-domain sentiment analysis: A systematic literature review;Al-Moslmi;IEEE Access,2017

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Machine Learning and Sentiment Analysis;Advances in Marketing, Customer Relationship Management, and E-Services;2024-04-19

2. Sentiment Analysis of Short Texts Using SVMs and VSMs-Based Multiclass Semantic Classification;Applied Artificial Intelligence;2024-03-14

3. A coarse-to-fine unsupervised domain adaptation method based on metric learning;Journal of Intelligent & Fuzzy Systems;2024-01-10

4. Predicting Mortgage-Backed Securities Prepayment Risk Using Machine Learning Models;2023 2nd International Conference on Smart Technologies and Systems for Next Generation Computing (ICSTSN);2023-04-21