Abstract
The k-means method is one of the most widely used clustering algorithms, drawing its popularity from its speed in practice. Recently, however, it was shown to have exponential worst-case running time. In order to close the gap between practical performance and theoretical analysis, the k-means method has been studied in the model of smoothed analysis. But even the smoothed analyses so far are unsatisfactory, as the bounds are still super-polynomial in the number n of data points.

In this article, we settle the smoothed running time of the k-means method. We show that the smoothed number of iterations is bounded by a polynomial in n and 1/σ, where σ is the standard deviation of the Gaussian perturbations. This means that if an arbitrary input data set is randomly perturbed, then the k-means method will run in expected polynomial time on that input set.
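Below is a minimal illustrative sketch, in Python, of the two ingredients the abstract refers to: the k-means method (Lloyd's algorithm), with its iteration count taken as the running-time measure, and the smoothed-analysis model in which every coordinate of the input is perturbed by independent Gaussian noise with standard deviation σ. The function names `kmeans_iterations` and `smoothed_instance` are hypothetical and not taken from the paper; this is not the authors' analysis, only a way to see the quantities the result bounds.

```python
import numpy as np

def kmeans_iterations(points, k, seed=0, max_iter=10_000):
    """Run the k-means method (Lloyd's algorithm) and return the number of
    iterations until the clustering stops changing (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    # Start from k centers chosen uniformly at random from the data points.
    centers = points[rng.choice(len(points), size=k, replace=False)]
    prev_labels = None
    for iteration in range(1, max_iter + 1):
        # Assignment step: each point is assigned to its nearest center.
        dists = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        if prev_labels is not None and np.array_equal(labels, prev_labels):
            return iteration  # clustering is stable, so the method terminates
        # Update step: move each center to the mean of its assigned cluster.
        for j in range(k):
            members = points[labels == j]
            if len(members) > 0:
                centers[j] = members.mean(axis=0)
        prev_labels = labels
    return max_iter

def smoothed_instance(points, sigma, seed=0):
    """Perturb every coordinate by independent Gaussian noise N(0, sigma^2),
    as in the smoothed-analysis model described in the abstract."""
    rng = np.random.default_rng(seed)
    return points + rng.normal(scale=sigma, size=points.shape)

# Example: perturb an (arbitrary) input set and count iterations on the
# perturbed instance; the paper bounds the expectation of this count by a
# polynomial in the number of points and 1/sigma.
data = np.random.default_rng(1).random((200, 2))
perturbed = smoothed_instance(data, sigma=0.01, seed=2)
print(kmeans_iterations(perturbed, k=5))
```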
Publisher
Association for Computing Machinery (ACM)
Subject
Artificial Intelligence, Hardware and Architecture, Information Systems, Control and Systems Engineering, Software
Cited by
49 articles.