RASCL: a randomised approach to subspace clusters

Published:2022-05-11 Issue:3 Volume:14 Page:243-259
ISSN:2364-415X
Container-title:International Journal of Data Science and Analytics
language:en
Short-container-title:Int J Data Sci Anal

Author:

Moens Sandy, Cule Boris, Goethals Bart

Abstract

Subspace clustering aims to discover clusters in projections of highly dimensional numerical data. In this paper, we focus on discovering small collections of highly interesting subspace clusters that do not try to cluster all data points, leaving noisy data points unclustered. To this end, we propose a randomised method that first converts the highly dimensional database to a binarised one using projected samples of the original database. Subsequently, this database is mined for frequent itemsets, which we show can be translated back to subspace clusters. In this way, we are able to explore multiple subspaces of different sizes at the same time. In our extensive experimental analysis, we show on synthetic as well as real-world data that our method is capable of discovering highly interesting subspace clusters efficiently.
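
The pipeline outlined in the abstract (binarise via projected samples, mine frequent itemsets, map itemsets back to subspace clusters) can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify how samples are drawn or how itemsets are translated back, so the k-nearest-neighbour transactions, the capped Apriori-style miner, the majority-vote rule for dimensions, and all function and parameter names below are assumptions made purely for illustration.

```python
import random
from itertools import combinations

def binarise(data, n_samples, k, rng):
    """Assumed binarisation: each transaction holds the ids of the k points
    nearest to a random anchor point inside a random subset of dimensions."""
    n, d = len(data), len(data[0])
    transactions = []
    for _ in range(n_samples):
        dims = rng.sample(range(d), rng.randint(1, d))  # random subspace
        anchor = data[rng.randrange(n)]                 # random anchor point
        def dist(i):
            return sum((data[i][j] - anchor[j]) ** 2 for j in dims)
        neighbours = sorted(range(n), key=dist)[:k]
        transactions.append((frozenset(neighbours), tuple(sorted(dims))))
    return transactions

def frequent_itemsets(transactions, min_support, max_size):
    """Level-wise (Apriori-style) mining of point-id sets, capped at max_size."""
    def support(itemset):
        return sum(1 for t, _ in transactions if itemset <= t)
    items = sorted({i for t, _ in transactions for i in t})
    level = [frozenset([i]) for i in items
             if support(frozenset([i])) >= min_support]
    result, size = {}, 1
    while level and size <= max_size:
        for s in level:
            result[s] = support(s)
        candidates = {a | b for a, b in combinations(level, 2)
                      if len(a | b) == size + 1}
        level = [c for c in candidates if support(c) >= min_support]
        size += 1
    return result

def supporting_dims(itemset, transactions):
    """Hypothetical translation step: keep the dimensions that occur in at
    least half of the transactions containing the itemset."""
    hits = [dims for t, dims in transactions if itemset <= t]
    counts = {}
    for dims in hits:
        for j in dims:
            counts[j] = counts.get(j, 0) + 1
    return sorted(j for j, c in counts.items() if 2 * c >= len(hits))

# Toy data: points 0-5 lie close together in dimensions 0 and 1 only;
# dimension 2 and points 6-11 are uniform noise.
rng = random.Random(0)
data = [[0.5 + rng.uniform(-0.02, 0.02), 0.5 + rng.uniform(-0.02, 0.02),
         rng.random()] for _ in range(6)]
data += [[rng.random() for _ in range(3)] for _ in range(6)]

transactions = binarise(data, n_samples=200, k=4, rng=random.Random(1))
itemsets = frequent_itemsets(transactions, min_support=10, max_size=3)
for points in sorted(itemsets, key=itemsets.get, reverse=True)[:5]:
    print(sorted(points), itemsets[points], supporting_dims(points, transactions))
```

Each printed row lists a candidate point set, its support, and the dimensions voted for it. Capping the itemset size keeps the toy miner tractable; a realistic setting would need a proper frequent-itemset miner and a principled translation step.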

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics, Computational Theory and Mathematics, Computer Science Applications, Modeling and Simulation, Information Systems

Link

https://link.springer.com/content/pdf/10.1007/s41060-022-00327-y.pdf

References (31 articles)

1. Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. (CSUR) 31(3), 264–323 (1999)

2. Bellman, R.E.: Adaptive Control Processes: A Guided Tour. Princeton University Press, Princeton (2015)

3. Parsons, L., Haque, E., Liu, H.: Subspace clustering for high dimensional data: a review. ACM SIGKDD Explor. Newslett. 6(1), 90–105 (2004)

4. Moise, G., Sander, J., Ester, M.: P3C: a robust projected clustering algorithm. In: Sixth International Conference on Data Mining (ICDM'06), pp. 414–425. IEEE (2006)

5. Aksehirli, E., Goethals, B., Muller, E., Vreeken, J.: Cartification: a neighborhood preserving transformation for mining high dimensional data. In: 2013 IEEE 13th International Conference on Data Mining (ICDM), pp. 937–942. IEEE (2013)

Cited by 1 article.

1. On-line outer bounding ellipsoid algorithm for clustering of hyperplanes in the presence of bounded noise. Cluster Computing (2023-01-30)