Author:
Tang Xiaobo,Mou Hao,Liu Jiangnan,Du Xin
Abstract
AbstractDue to its potential impact on business efficiency, automated customer complaint labeling and classification are of great importance for management decision making and business applications. The majority of the current research on automated labeling uses large and well-balanced datasets. However, customer complaint labels are hierarchical in structure, with many labels at the lowest hierarchy level. Relying on lower-level labels leads to small and imbalanced samples, thus rendering the current automatic labeling practices inapplicable to customer complaints. This article proposes an automatic labeling model incorporating the BERT and word2vec methods. The model is validated on electric utility customer complaint data. Within the model, the BERT method serves to obtain shallow text tags. Furthermore, text enhancement is used to mitigate the problem of imbalanced samples that emerge when the number of labels is large. Finally, the word2vec model is utilized for deep text analysis. Experiments demonstrate the proposed model's efficiency in automating customer complaint labeling. Consequently, the proposed model supports enterprises in improving their service quality while simultaneously reducing labor costs.
Funder
National Natural Science Foundation of China
Publisher
Springer Science and Business Media LLC
Reference13 articles.
1. Atliha, V. & Sesok, D. Text augmentation using BERT for image captioning. Appl. Sci. Basel 10, 17 (2020).
2. Kim, S., Park, H. & Lee, J. Word2vec-based latent semantic analysis (W2V-LSA) for topic modeling: A study on blockchain technology trend analysis. Expert Syst. Appl. 152, 12 (2020).
3. Bharti S. K., & Babu K. S. Automatic keyword extraction for text summarization: A survey. arXiv:1704.03242
(arXiv preprint) 2017.
4. Luhn, H. P. A statistical approach to mechanized encoding and searching of literary information. IBM J. Res. Dev. 1(4), 309–317 (1957).
5. Lois, L. E. Experiments in automatic indexing and extracting. Inf. Storage Retr. 6(4), 313–330 (1970).
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献