Self-Adjusting Variable Neighborhood Search Algorithm for Near-Optimal k-Means Clustering-Reference-Cited by-同舟云学术

Self-Adjusting Variable Neighborhood Search Algorithm for Near-Optimal k-Means Clustering

Published:2020-11-05 Issue:4 Volume:8 Page:90
ISSN:2079-3197
Container-title:Computation
language:en
Short-container-title:Computation

Author:

Kazakovtsev Lev,Rozhnov Ivan^ORCID,Popov Aleksey,Tovbis Elena^ORCID

Abstract

The k-means problem is one of the most popular models in cluster analysis that minimizes the sum of the squared distances from clustered objects to the sought cluster centers (centroids). The simplicity of its algorithmic implementation encourages researchers to apply it in a variety of engineering and scientific branches. Nevertheless, the problem is proven to be NP-hard which makes exact algorithms inapplicable for large scale problems, and the simplest and most popular algorithms result in very poor values of the squared distances sum. If a problem must be solved within a limited time with the maximum accuracy, which would be difficult to improve using known methods without increasing computational costs, the variable neighborhood search (VNS) algorithms, which search in randomized neighborhoods formed by the application of greedy agglomerative procedures, are competitive. In this article, we investigate the influence of the most important parameter of such neighborhoods on the computational efficiency and propose a new VNS-based algorithm (solver), implemented on the graphics processing unit (GPU), which adjusts this parameter. Benchmarking on data sets composed of up to millions of objects demonstrates the advantage of the new algorithm in comparison with known local search algorithms, within a fixed time, allowing for online computation.

Publisher

MDPI AG

Subject

Applied Mathematics,Modeling and Simulation,General Computer Science,Theoretical Computer Science

Link

https://www.mdpi.com/2079-3197/8/4/90/pdf

Reference128 articles.

1. Survey of Clustering Data Mining Techniques;Berkhin,2002

2. A Review of Classification

3. Least squares quantization in PCM

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Multiobjective Variable Neighborhood Strategy Adaptive Search to Optimize the Dynamic EMS Location–Allocation Problem;Computation;2022-06-20

2. Dynamic Uncertainty Study of Multi-Center Location and Route Optimization for Medicine Logistics Company;Mathematics;2022-03-16

3. A GENETIC ALGORITHM WITH GREEDY CROSSOVER AND ELITISM FOR CAPACITY PLANNING;FACTA UNIV-SER MATH;2022

4. Self-Configuring (1 + 1)-Evolutionary Algorithm for the Continuous p-Median Problem with Agglomerative Mutation;Algorithms;2021-04-22

5. Self-adjusting Genetic Algorithm with Greedy Agglomerative Crossover for Continuous p-Median Problems;Communications in Computer and Information Science;2021