Sentiment analysis of video danmakus based on MIBE-RoBERTa-FF-BiLSTM
Published:2024-03-09
Issue:1
Volume:14
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep
Author:
Zhao Jianbo, Liu Huailiang, Wang Yakai, Zhang Weili, Zhang Xiaojin, Li Bowei, Sun Tong, Qi Yanwei, Zhang Shanzhuang
Abstract
Danmakus are user-generated comments overlaid on videos, enabling real-time interaction between viewers and video content. The emotional orientation of danmakus reflects viewers’ attitudes and opinions toward video segments, which can help video platforms optimize content recommendation and evaluate users’ abnormal emotion levels. To address the low transferability of traditional sentiment analysis methods to the danmaku domain, the low accuracy of danmaku text segmentation, the poor consistency of sentiment annotation, and insufficient semantic feature extraction, this paper proposes a video danmaku sentiment analysis method based on MIBE-RoBERTa-FF-BiLSTM. We construct the “Bilibili Must-Watch List and Top Video Danmaku Sentiment Dataset”, covering 10,000 positive and negative sentiment danmaku texts across 18 themes. A new-word recognition algorithm based on mutual information (MI) and branch entropy (BE) is used to discover 2610 irregular, popular internet neologisms, ranging from trigrams to heptagrams, in the dataset; these form a domain lexicon. Maslow’s hierarchy of needs theory is applied to guide consistent sentiment annotation. The domain lexicon is integrated into the feature fusion layer of the RoBERTa-FF-BiLSTM model so that the model fully learns the semantic features of the word, character, and context information in danmaku texts and performs sentiment classification. Comparative experiments on the dataset show that the proposed model has the best overall performance among mainstream models for video danmaku text sentiment classification, with an F1 value of 94.06%, and that its accuracy and robustness are also better than those of the other models. The limitations of this paper are that the construction of the domain lexicon still requires manual participation and review, and that the semantic information of the danmaku video content and the positive case preference are ignored.
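The abstract names mutual information (MI) and branch entropy (BE) as the two signals used for new-word discovery but gives no formulas. The following is a minimal Python sketch of how such a filter is commonly built, not the authors' implementation. All function names, the thresholds (`min_count`, `pmi_min`, `be_min`), the boundary tokens, and the choice to normalize every n-gram count by the total character count are illustrative assumptions; only the 3-to-7 character candidate range comes from the abstract.

```python
import math
from collections import Counter


def ngram_counts(texts, max_n):
    """Count every character n-gram of length 1..max_n in the corpus."""
    counts = Counter()
    for t in texts:
        for n in range(1, max_n + 1):
            for i in range(len(t) - n + 1):
                counts[t[i:i + n]] += 1
    return counts


def min_pmi(cand, counts, total_chars):
    """Smallest pointwise mutual information over all two-way splits of cand.

    Assumption: probabilities are approximated as count / total_chars.
    A high value suggests the parts co-occur more often than chance.
    """
    p_whole = counts[cand] / total_chars
    return min(
        math.log2(
            p_whole
            / ((counts[cand[:k]] / total_chars) * (counts[cand[k:]] / total_chars))
        )
        for k in range(1, len(cand))
    )


def _entropy(counter):
    """Shannon entropy (bits) of a neighbor-character distribution."""
    n = sum(counter.values())
    if n == 0:
        return 0.0
    return -sum(v / n * math.log2(v / n) for v in counter.values())


def branch_entropy(cand, texts):
    """Minimum of left and right neighbor entropy for cand.

    Assumption: text boundaries are treated as the single tokens
    "<s>" and "</s>". A high value means cand appears in varied contexts.
    """
    left, right = Counter(), Counter()
    for t in texts:
        start = t.find(cand)
        while start != -1:
            end = start + len(cand)
            left[t[start - 1] if start > 0 else "<s>"] += 1
            right[t[end] if end < len(t) else "</s>"] += 1
            start = t.find(cand, start + 1)
    return min(_entropy(left), _entropy(right))


def discover_new_words(texts, min_n=3, max_n=7, min_count=2,
                       pmi_min=1.0, be_min=0.5):
    """Keep candidates that are frequent, cohesive (MI) and free (BE).

    The thresholds are hypothetical defaults, not values from the paper.
    """
    counts = ngram_counts(texts, max_n)
    total_chars = sum(len(t) for t in texts)
    return [
        cand
        for cand, c in counts.items()
        if min_n <= len(cand) <= max_n
        and c >= min_count
        and min_pmi(cand, counts, total_chars) >= pmi_min
        and branch_entropy(cand, texts) >= be_min
    ]
```

In this sketch a candidate survives only if its internal cohesion (the weakest split's MI) and its contextual freedom (the lower of its left and right branch entropy) both clear their thresholds. Candidates that pass would still need the manual review that the abstract lists as a limitation.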
Funder
Ministry of Science and Technology of the People's Republic of China
Xi'an Municipal Bureau of Science and Technology, China
Publisher
Springer Science and Business Media LLC