Multiple Relational Topic Modeling for Noisy Short Texts-Reference-Cited by-同舟云学术

Multiple Relational Topic Modeling for Noisy Short Texts

Published:2018-11 Issue:11n12 Volume:28 Page:1559-1574
ISSN:0218-1940
Container-title:International Journal of Software Engineering and Knowledge Engineering
language:en
Short-container-title:Int. J. Soft. Eng. Knowl. Eng.

Author:

Liu Zheng¹,Liu Chiyu¹,Xia Bin¹,Li Tao¹

Affiliation:

1. Jiangsu Key Laboratory of Big Data Security & Intelligent Processing School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, P. R. China

Abstract

Understanding contents in social networks by inferring high-quality latent topics from short texts is a significant task in social analysis, which is challenging because social network contents are usually extremely short, noisy and full of informal vocabularies. Due to the lack of sufficient word co-occurrence instances, well-known topic modeling methods such as LDA and LSA cannot uncover high-quality topic structures. Existing research works seek to pool short texts from social networks into pseudo documents or utilize the explicit relations among these short texts such as hashtags in tweets to make classic topic modeling methods work. In this paper, we explore this problem by proposing a topic model for noisy short texts with multiple relations called MRTM (Multiple Relational Topic Modeling). MRTM exploits both explicit and implicit relations by introducing a document-attribute distribution and a two-step random sampling strategy. Extensive experiments, compared with the state-of-the-art topic modeling approaches, demonstrate that MRTM can alleviate the word co-occurrence sparsity and uncover high-quality latent topics from noisy short texts.

Publisher

World Scientific Pub Co Pte Lt

Subject

Artificial Intelligence,Computer Graphics and Computer-Aided Design,Computer Networks and Communications,Software

Link

https://www.worldscientific.com/doi/pdf/10.1142/S021819401840017X

Reference6 articles.

1. Topic and Role Discovery in Social Networks with Experiments on Enron and Academic Email

2. Word network topic model: a simple but general solution for short and imbalanced texts

3. Improving Topic Models with Latent Feature Word Representations

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Research on Microblog Comment Clustering Algorithm Based on Emotional Topic Feature Word Weighting;2024 IEEE 2nd International Conference on Control, Electronics and Computer Technology (ICCECT);2024-04-26

2. Short text topic modelling approaches in the context of big data: taxonomy, survey, and analysis;Artificial Intelligence Review;2022-10-26

3. Fault Diagnosis of Signal Equipment on the Lanzhou-Xinjiang High-Speed Railway Using Machine Learning for Natural Language Processing;Complexity;2021-07-28

4. Sentiment word co-occurrence and knowledge pair feature extraction based LDA short text clustering algorithm;Journal of Intelligent Information Systems;2020-05-25

5. Collaboratively Modeling and Embedding of Latent Topics for Short Texts;IEEE Access;2020