Emerging Research Topic Detection Using Filtered-LDA-Reference-Cited by-同舟云学术

Emerging Research Topic Detection Using Filtered-LDA

Published:2021-10-31 Issue:4 Volume:2 Page:578-599
ISSN:2673-2688
Container-title:AI
language:en
Short-container-title:AI

Author:

Alattar Fuad^ORCID,Shaalan Khaled^ORCID

Abstract

Comparing two sets of documents to identify new topics is useful in many applications, like discovering trending topics from sets of scientific papers, emerging topic detection in microblogs, and interpreting sentiment variations in Twitter. In this paper, the main topic-modeling-based approaches to address this task are examined to identify limitations and necessary enhancements. To overcome these limitations, we introduce two separate frameworks to discover emerging topics through a filtered latent Dirichlet allocation (filtered-LDA) model. The model acts as a filter that identifies old topics from a timestamped set of documents, removes all documents that focus on old topics, and keeps documents that discuss new topics. Filtered-LDA also genuinely reduces the chance of using keywords from old topics to represent emerging topics. The final stage of the filter uses multiple topic visualization formats to improve human interpretability of the filtered topics, and it presents the most-representative document for each topic.

Publisher

MDPI AG

Link

https://www.mdpi.com/2673-2688/2/4/35/pdf

Reference38 articles.

1. Indexing by latent semantic analysis

2. Learning the parts of objects by non-negative matrix factorization

3. Latent Dirichlet Allocation;Blei;J. Mach. Learn. Res.,2003

4. A Survey on Opinion Reason Mining and Interpreting Sentiment Variations

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Noise-aware celestial clustering for hot topic detection from microblog datasets with not well-separated topics;Knowledge and Information Systems;2024-08-09

2. Risk Topics Discovery and Trend Analysis in Air Traffic Control Operations—Air Traffic Control Incident Reports from 2000 to 2022;Sustainability;2023-08-07

3. Sentiment Reason Mining Framework for Analyzing Twitter Discourse on Critical Issues in US Healthcare Industry;2023 International Research Conference on Smart Computing and Systems Engineering (SCSE);2023-06-29

4. Dot-Product Similarity based Centroid Clustering for Popular Topic Detection;2022 IEEE International Conference on Big Data (Big Data);2022-12-17