K-RBBSO Algorithm: A Result-Based Stochastic Search Algorithm in Big Data
-
Published:2022-12-05
Issue:23
Volume:12
Page:12451
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Park SungjinORCID,
Kim Sangkyun
Abstract
Clustering is widely used in client-facing businesses to categorize their customer base and deliver personalized services. This study proposes an algorithm to stochastically search for an optimum solution based on the outcomes of a data clustering process. Fundamentally, the aforementioned goal is achieved using a result-based stochastic search algorithm. Hence, shortcomings of existing stochastic search algorithms are identified, and the k-means-initiated rapid biogeography-based silhouette optimization (K-RBBSO) algorithm is proposed to overcome them. The proposed algorithm is validated by creating a data clustering engine and comparing the performance of the K-RBBSO algorithm with those of currently used stochastic search techniques, such as simulated annealing and artificial bee colony, on a validation dataset. The results indicate that K-RBBSO is more effective with larger volumes of data compared to the other algorithms. Finally, we describe some prospective beneficial uses of a data clustering algorithm in unsupervised learning based on the findings of this study.
Funder
National Research Foundation of Korea and funded by the Korean Government
Ministry of Science and ICT, Korea, under the Information Technology Research Center support program
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference22 articles.
1. A survey on data Stream clustering and classification;Nguyen;Knowl. Inf. Syst.,2015
2. Meta-heuristic algorithm for K-means clustering: Areview;Jahwar;PalArchs J. Archaeol. Egypt Egyptol.,2020
3. HA-CCP: A hybrid algorithm for solving capacitated clustering problem;Liu;Comput. Intell. Neurosci.,2022
4. Why simheuristics? Benefits, limitations, and best practices when combining metaheuristics with simulation;Chica;SSRN,2017
5. Metaheuristic research: A comprehensive survey;Hussain;Artif. Intell. Rev.,2019