Affiliation:
1. State Key Lab of IoTSC and University of Macau, Taipa, Macau S.A.R, China
2. University of Technology Sydney, Ultimo, New South Wales, Sydney, Australia
Abstract
The online event discovery in social media based documents is useful, such as for disaster recognition and intervention. However, the diverse events incrementally identified from social media streams remain accumulated, ad hoc, and unstructured. They cannot assist users in digesting the tremendous amount of information and finding their interested events. Further, most of the existing work is challenged by jointly identifying incremental events and dynamically organizing them in an adaptive hierarchy. To address these problems, this article proposes
d
ynamic and
h
ierarchical
C
ategorization
M
odeling (dhCM) for social media stream. Instead of manually dividing the timeframe, a multimodal event miner exploits a density estimation technique to continuously capture the temporal influence between documents and incrementally identify online events in textual, temporal, and spatial spaces. At the same time, an adaptive categorization hierarchy is formed to automatically organize the documents into proper categories at multiple levels of granularities. In a nonparametric manner, dhCM accommodates the increasing complexity of data streams with automatically growing the categorization hierarchy over adaptive growth. A sequential Monte Carlo algorithm is used for the online inference of the dhCM parameters. Extensive experiments show that dhCM outperforms the state-of-the-art models in terms of term coherence, category abstraction and specialization, hierarchical affinity, and event categorization and discovery accuracy.
Funder
National Key D&R Program of China
FDCT
GSTIC
University of Macau
GDST
Publisher
Association for Computing Machinery (ACM)
Subject
Artificial Intelligence,Theoretical Computer Science
Reference59 articles.
1. The nested chinese restaurant process and bayesian nonparametric inference of topic hierarchies
2. Kevin Canini Lei Shi and Thomas Griffiths. 2009. Online inference of topics with latent Dirichlet allocation. In Artificial Intelligence and Statistics. 65–72. Kevin Canini Lei Shi and Thomas Griffiths. 2009. Online inference of topics with latent Dirichlet allocation. In Artificial Intelligence and Statistics. 65–72.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献