Affiliation:
1. School of Information Technology, Anqing Vocational & Technical College, Anqing, China
Abstract
With the development of the Internet and mobile networks, social networks have gradually become an essential tool and widespread application. Therefore, the research on short text semantic modelling of social networks has attracted widespread attention. However, modelling short texts encounter the semantics sparsity and multiple meanings of a word in social networks. To solve the above problems, we propose a user-based topic model with topical word embeddings semantic modelling method, namely SM-UTM. Firstly, we construct the user topic model to aggregate short text. Secondly, we build word pair in the user topic model to alleviate semantics sparsity in social networks. In addition, we introduce the time information of social networks into the topic model to jointly constrain the generation process of topics, to improve the quality of semantic representation of social network short texts. Finally, we use the topic word embedding learning based on deep learning to train and optimize the word vector according to the learning results of the user topic model, to alleviate the problem of polysemy in social networks. We build multiple groups of quantitative and qualitative experiments based on the crawled real Sina Weibo data. The experimental results show that our SM-UTM is significantly better than the comparison method in the evaluation indicators of topic consistency, purity and entropy.
Subject
Artificial Intelligence,General Engineering,Statistics and Probability
Reference41 articles.
1. Latent dirichlet allocation;Blei;Journal of Machine Learning Research,2003
2. A survey of unsupervised generative models for exploratory data analysis and representation learning;Abukmeil;ACM Computing Surveys,2021
3. Short text topic modelling by exploring original documents;Li;Knowledge and Information Systems,2018
4. BTM: Topic modelling over short texts;Cheng;IEEE Transactions on Knowledge and Data Engineering,2014
5. Short text topic modelling techniques, applications, and performance: a survey;Qiang;IEEE Transactions on Knowledge and Data Engineering,2020