Abstract
This paper focuses on the automatic analysis of conversation transcriptions in the call center of a customer care service. The goal is to recognize topics related to problems and complaints discussed in several dialogues between customers and agents. Our study aims to implement a framework able to automatically cluster conversation transcriptions into cohesive and well-separated groups based on the content of the data. The framework can alleviate the analyst selecting proper values for the analysis and the clustering processes. To pursue this goal, we consider a probabilistic model based on the latent Dirichlet allocation, which associates transcriptions with a mixture of topics in different proportions. A case study consisting of transcriptions in the Italian natural language, and collected in a customer support center of an energy supplier, is considered in the paper. Performance comparison of different inference techniques is discussed using the case study. The experimental results demonstrate the approach’s efficacy in clustering Italian conversation transcriptions. It also results in a practical tool to simplify the analytic process and off-load the parameter tuning from the end-user. According to recent works in the literature, this paper may be valuable for introducing latent Dirichlet allocation approaches in topic modeling for the Italian natural language.
Subject
Computational Mathematics,Computational Theory and Mathematics,Numerical Analysis,Theoretical Computer Science
Reference46 articles.
1. A review of natural language processing techniques for opinion mining systems
2. From Classical Machine Learning to Deep Neural Networks: A Simplified Scientometric Review
3. Role of text mining in business intelligence;Gupta;Gian Jyoti E-J.,2012
4. Probabilistic latent semantic indexing;Hofmann;Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval,1999
5. Document clustering based on non-negative matrix factorization;Xu;Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval,2003
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献