SSTSA: A Self-Supervised Topic Sentiment Analysis Using Semantic Similarity Measures and Transformers-Reference-Cited by-同舟云学术

SSTSA: A Self-Supervised Topic Sentiment Analysis Using Semantic Similarity Measures and Transformers

Published:2023-08-02 Issue: Volume: Page:1-39
ISSN:0219-6220
Container-title:International Journal of Information Technology & Decision Making
language:en
Short-container-title:Int. J. Info. Tech. Dec. Mak.

Author:

Seilsepour Azam¹,Ravanmehr Reza¹^ORCID,Nassiri Ramin¹

Affiliation:

1. Department of Computer Engineering, Central Tehran Branch, Islamic Azad University, Tehran, Iran

Abstract

The exponentially increasing amount of data generated by the public on social media platforms is a precious source of information. It can be used to find the topics and analyze the comments. Some researchers have extended the Latent Dirichlet Allocation (LDA) method by adding a sentiment layer to simultaneously find the topics and their related sentiments. However, most of these approaches do not achieve admirable accuracy in Topic Sentiment Analysis (TSA), particularly when there is insufficient training data or the texts are complex, ambiguous, and short. In this paper, a self-supervised novel approach called SSTSA is proposed for TSA that extracts the hidden topics and analyzes the total sentiment related to each topic. The SSTSA proposes a new method called Pseudo-label Generator. For this purpose, first, it employs semantic similarity and Word Mover’s Distance (WMD) measures. Then, the document embedding technique is employed to semantically estimate the sentiment orientation of samples and generate the pseudo-labels (positive or negative). Afterward, a hybrid classifier composed of a pre-trained Robustly Optimized BERT (RoBERTa) and a Long Short-Term Memory (LSTM) model is trained to predict the sentiment of unseen data. The evaluation results on different datasets of various domains demonstrate that the SSTSA outperforms similar unsupervised/self-supervised methods.

Publisher

World Scientific Pub Co Pte Ltd

Subject

Computer Science (miscellaneous),Computer Science (miscellaneous)

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0219622023500736

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A novel self-supervised sentiment classification approach using semantic labeling based on contextual embeddings;Multimedia Tools and Applications;2024-05-13

2. A personalized context and sequence aware point of interest recommendation;Multimedia Tools and Applications;2024-02-27

3. A Topic Mapping-based framework to analyze textual risk reports from social media big data contents;The Journal of Supercomputing;2023-12-14