Affiliation:
1. Faculty of Science and Engineering, University of Groningen, Groningen 9747AA, The Netherlands mohammadimathstar@gmail.com
2. Department of Computer Science, University of Birmingham, Birmingham B15 2TT p.tino@cs.bham.ac.uk
3. Faculty of Science and Engineering, University of Groningen, Groningen 9747 AG, The Netherlands k.bunte@rug.nl
Abstract
Abstract
The presence of manifolds is a common assumption in many applications, including astronomy and computer vision. For instance, in astronomy, low-dimensional stellar structures, such as streams, shells, and globular clusters, can be found in the neighborhood of big galaxies such as the Milky Way. Since these structures are often buried in very large data sets, an algorithm, which can not only recover the manifold but also remove the background noise (or outliers), is highly desirable. While other works try to recover manifolds either by pushing all points toward manifolds or by downsampling from dense regions, aiming to solve one of the problems, they generally fail to suppress the noise on manifolds and remove background noise simultaneously. Inspired by the collective behavior of biological ants in food-seeking process, we propose a new algorithm that employs several random walkers equipped with a local alignment measure to detect and denoise manifolds. During the walking process, the agents release pheromone on data points, which reinforces future movements. Over time the pheromone concentrates on the manifolds, while it fades in the background noise due to an evaporation procedure. We use the Markov chain (MC) framework to provide a theoretical analysis of the convergence of the algorithm and its performance. Moreover, an empirical analysis, based on synthetic and real-world data sets, is provided to demonstrate its applicability in different areas, such as improving the performance of t-distributed stochastic neighbor embedding (t-SNE) and spectral clustering using the underlying MC formulas, recovering astronomical low-dimensional structures, and improving the performance of the fast Parzen window density estimator.
Subject
Cognitive Neuroscience,Arts and Humanities (miscellaneous)
Reference43 articles.
1. Laplacian eigenmaps for dimensionality reduction and data representation;Belkin;Neural Computation,2003
2. Multidimensional binary search trees used for associative searching;Bentley;Communications of the ACM,1975
3. Local kernels and the geometric structure of data;Berry;Applied and Computational Harmonic Analysis,2016
4. Bishop, C. M.
(2006). Pattern recognition and machine learning. Berlin: Springer.
5. Blum, C., Roli, A., & Dorigo, M. (2001). HC–ACO: The hyper-cube framework for ant colony optimization. In Proceedings of the Metaheuristics International Conference, vol. 2 (pp. 399–403).
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献