Sentiment time series clustering of Danmu videos based on BERT fine-tuning and SBD-K-shape-Reference-Cited by-同舟云学术

Sentiment time series clustering of Danmu videos based on BERT fine-tuning and SBD-K-shape

Published:2024-04-22 Issue:4 Volume:42 Page:553-575
ISSN:0264-0473
Container-title:The Electronic Library
language:en
Short-container-title:EL

Author:

Zhang Ruoxi,Ren Chenhan

Abstract

Purpose This study aims to construct a sentiment series generation method for danmu comments based on deep learning, and explore the features of sentiment series after clustering. Design/methodology/approach This study consisted of two main parts: danmu comment sentiment series generation and clustering. In the first part, the authors proposed a sentiment classification model based on BERT fine-tuning to quantify danmu comment sentiment polarity. To smooth the sentiment series, they used methods, such as comprehensive weights. In the second part, the shaped-based distance (SBD)-K-shape method was used to cluster the actual collected data. Findings The filtered sentiment series or curves of the microfilms on the Bilibili website could be divided into four major categories. There is an apparently stable time interval for the first three types of sentiment curves, while the fourth type of sentiment curve shows a clear trend of fluctuation in general. In addition, it was found that “disputed points” or “highlights” are likely to appear at the beginning and the climax of films, resulting in significant changes in the sentiment curves. The clustering results show a significant difference in user participation, with the second type prevailing over others. Originality/value Their sentiment classification model based on BERT fine-tuning outperformed the traditional sentiment lexicon method, which provides a reference for using deep learning as well as transfer learning for danmu comment sentiment analysis. The BERT fine-tuning–SBD-K-shape algorithm can weaken the effect of non-regular noise and temporal phase shift of danmu text.

Publisher

Emerald

Reference36 articles.

1. A new framework for predicting customer behavior in terms of RFM by considering the temporal aspect based on time series techniques;Journal of Ambient Intelligence and Humanized Computing,2021

2. Sentiment analysis of movie reviews using machine learning techniques;International Journal of Computer Applications,2017

3. FCM: the fuzzy c-means clustering algorithm;Computers and Geosciences,1984

4. Time series sentiment analysis (SA) of relief operations using social media (SM) platform for efficient resource management;International Journal of Disaster Risk Reduction,2022

5. Efficient agglomerative hierarchical clustering;Expert Systems with Applications,2015

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Consumer segmentation with large language models;Journal of Retailing and Consumer Services;2025-01