Affiliation:
1. Concordia Institute for Information Systems Engineering, Concordia University, Montreal, Quebec, Canada H3G 1M8, Canada
Abstract
Clustering as an exploratory technique has been a promising approach for performing data analysis. In this paper, we propose a non-parametric Bayesian inference to address clustering problem. This approach is based on infinite multivariate Beta mixture models constructed through the framework of Dirichlet process. We apply an accelerated variational method to learn the model. The motivation behind proposing this technique is that Dirichlet process mixture models are capable to fit the data where the number of components is unknown. For large-scale data, this approach is computationally expensive. We overcome this problem with the help of accelerated Dirichlet process mixture models. Moreover, the truncation is managed using kd-trees. The performance of the model is validated on real medical applications and compared to three other similar alternatives. The results show the outperformance of our proposed framework.
Publisher
World Scientific Pub Co Pte Ltd
Subject
Artificial Intelligence,Computer Networks and Communications,Computer Science Applications,Linguistics and Language,Information Systems,Software
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献