Author:
Zhang Kunhui,Chen Yen-Chi
Abstract
In this paper, we propose a new clustering method inspired by mode-clustering that not only finds clusters, but also assigns each cluster with an attribute label. Clusters obtained from our method show connectivity of the underlying distribution. We also design a local two-sample test based on the clustering result that has more power than a conventional method. We apply our method to the Astronomy and GvHD data and show that our method finds meaningful clusters. We also derive the statistical and computational theory of our method.
Funder
National Science Foundation
Reference43 articles.
1. A Nonparametric Statistical Approach to Clustering via Mode Identification;Li;J. Mach. Learn. Res.,2007
2. Clusters and water flows: A novel approach to modal clustering through Morse theory;Chacón;arXiv,2012
3. On the Estimation of the Gradient Lines of a Density and the Consistency of the Mean-Shift Algorithm;Arias-Castro;J. Mach. Learn. Res.,2016
4. A comprehensive approach to mode clustering
5. The estimation of the gradient of a density function, with applications in pattern recognition