O$$^2$$-Bert: Two-Stage Target-Based Sentiment Analysis-Reference-Cited by-同舟云学术

O$$^2$$-Bert: Two-Stage Target-Based Sentiment Analysis

Published:2023-09-01 Issue:1 Volume:16 Page:158-176
ISSN:1866-9956
Container-title:Cognitive Computation
language:en
Short-container-title:Cogn Comput

Author:

Yan Yan^ORCID,Zhang Bo-Wen,Ding Guanwen,Li Wenjie,Zhang Jie,Li Jia-Jing,Gao Wenchao

Abstract

AbstractTarget-based sentiment analysis (TBSA) is one of the most important NLP research topics for widespread applications. However, the task is challenging, especially when the targets contain multiple words or do not exist in the sequences. Conventional approaches cannot accurately extract the (target, sentiment) pairs due to the limitations of the fixed end-to-end architecture design. In this paper, we propose a framework named O

$$^2$$

-Bert, which consists of Opinion target extraction (OTE-Bert) and Opinion sentiment classification (OSC-Bert) to complete the task in two stages. More specifically, we divide the OTE-Bert into three modules. First, an entity number prediction module predicts the number of entities in a sequence, even in an extreme situation where no entities are contained. Afterwards, with predicted number of entities, an entity starting annotation module is responsible for predicting their starting positions. Finally, an entity length prediction module predicts the lengths of these entities, and thus, accomplishes target extraction. In OSC-Bert, the sentiment polarities of extracted targets from OTE-Bert. According to the characteristics of BERT encoders, our framework can be adapted to short English sequences without domain limitations. For other languages, our approach might work through altering the tokenization. Experimental results on the SemEval 2014-16 benchmarks show that the proposed model achieves competitive performances on both domains (restaurants and laptops) and both tasks (target extraction and sentiment classification), with F1-score as evaluated metrics. Specifically, OTE-Bert achieves 84.63%, 89.20%, 83.16%, and 86.88% F1 scores for target extraction, while OSC-Bert achieves 82.90%, 80.73%, 76.94%, and 83.58% F1 scores for sentiment classification, on the chosen benchmarks. The statistics validate the effectiveness and robustness of our approach and the new “two-stage paradigm”. In future work, we will explore more possibilities of the new paradigm on other NLP tasks.

Funder

Fundamental Research Funds for Central Universities of the Central South University

Publisher

Springer Science and Business Media LLC

Subject

Cognitive Neuroscience,Computer Science Applications,Computer Vision and Pattern Recognition

Link

https://link.springer.com/content/pdf/10.1007/s12559-023-10191-y.pdf

Reference47 articles.

1. Wang D, Fan H, Liu J. Learning with joint cross-document information via multi-task learning for named entity recognition. Inf Sci. 2021;579:454–67.

2. Tang H, Ji D, Zhou Q. End-to-end masked graph-based crf for joint slot filling and intent detection. Neurocomputing. 2020;413:348–59.

3. Ni J, Huang Z, Hu Y, Lin C. A two-stage embedding model for recommendation with multimodal auxiliary information. Inf Sci. 2022;582:22–37.

4. Zhang Y, Du J, Ma X, Wen H, Fortino G. Aspect-based sentiment analysis for user reviews. Cogn Comput. 2021;13(5):1114–27.

5. Guo L , Jiang S , Du W , Gan S. Recurrent neural crf for aspect term extraction with dependency transmission. In: CCF International Conference on Natural Language Processing and Chinese Computing. Springer; 2018 p. 378–90.