Affiliation:
1. INRIA Saclay -- Île-de-France
2. Stanford University
Abstract
We present a clustering scheme that combines a mode-seeking phase with a cluster merging phase in the corresponding density map. While mode detection is done by a standard graph-based hill-climbing scheme, the novelty of our approach resides in its use of
topological persistence
to guide the merging of clusters. Our algorithm provides additional feedback in the form of a set of points in the plane, called a
persistence diagram
(PD), which provably reflects the prominences of the modes of the density. In practice, this feedback enables the user to choose relevant parameter values, so that under mild sampling conditions the algorithm will output the
correct
number of clusters, a notion that can be made formally sound within persistence theory. In addition, the output clusters have the property that their spatial locations are bound to the ones of the basins of attraction of the peaks of the density.
The algorithm only requires rough estimates of the density at the data points, and knowledge of (approximate) pairwise distances between them. It is therefore applicable in any metric space. Meanwhile, its complexity remains practical: although the size of the input distance matrix may be up to quadratic in the number of data points, a careful implementation only uses a linear amount of memory and takes barely more time to run than to read through the input.
Funder
Google
National Science Foundation
Stanford University
Division of Computer and Network Systems
Agence Nationale de la Recherche
Division of Computing and Communication Foundations
Division of Mathematical Sciences
Publisher
Association for Computing Machinery (ACM)
Subject
Artificial Intelligence,Hardware and Architecture,Information Systems,Control and Systems Engineering,Software
Reference41 articles.
1. A weighted k-nearest neighbor density estimate for geometric inference
2. Topology and data
3. On the Local Behavior of Spaces of Natural Images
4. Persistence barcodes for shapes
5. F. Chazal and D. Cohen-Steiner. 2007. Geometric Inference. (To appear as a book chapter Springer.) http://www-sop.inria.fr/geometrica/team/Frederic.Chazal/. F. Chazal and D. Cohen-Steiner. 2007. Geometric Inference . (To appear as a book chapter Springer.) http://www-sop.inria.fr/geometrica/team/Frederic.Chazal/.
Cited by
110 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献