Affiliation:
1. Institute for Advanced Modelling and Simulation, University of Nicosia, Nicosia CY-2417, Cyprus
2. Laboratory of Applied Mathematics, University of Crete, GR-70013 Heraklion, Greece
Abstract
This paper presents RUN-ICON (Reduce UNcertainty and Increase CONfidence), a novel algorithm for unsupervised learning whose primary objective is to enhance the reliability and confidence of unsupervised clustering. RUN-ICON repeats the K-means++ method many times and identifies the dominant centres, i.e., those that occur most frequently across the repetitions. It differs from existing K-means variants by introducing novel metrics, such as the Clustering Dominance Index and Uncertainty, instead of relying solely on the Sum of Squared Errors, to identify the most dominant clusters. The algorithm is robust, automated, and flexible, and it produces high-quality clustering. Extensive testing on diverse data sets with varying characteristics demonstrates its capability to determine the optimal number of clusters under different scenarios. The algorithm will soon be deployed in real-world scenarios, where it will be tested rigorously against data sets based on measurements and simulations to further assess its effectiveness.
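The idea of repeating K-means++ and keeping the most frequently recurring set of centres can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not define the Clustering Dominance Index or Uncertainty, so the "dominance fraction" below (the share of runs that converge to the most common set of centres) is only an assumed stand-in. The function names, the rounding tolerance `decimals`, the number of runs, and the synthetic three-blob data are all hypothetical choices made for illustration.

```python
import random
from collections import Counter


def sq_dist(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans_pp_init(points, k, rng):
    """K-means++ seeding: each new centre is drawn with probability
    proportional to its squared distance from the nearest chosen centre.
    Assumes the data contain at least k distinct points."""
    centres = [rng.choice(points)]
    while len(centres) < k:
        weights = [min(sq_dist(p, c) for c in centres) for p in points]
        centres.append(rng.choices(points, weights=weights, k=1)[0])
    return centres


def kmeans(points, k, rng, max_iter=100):
    """Lloyd iterations started from a K-means++ seeding."""
    centres = kmeans_pp_init(points, k, rng)
    for _ in range(max_iter):
        groups = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: sq_dist(p, centres[i]))
            groups[nearest].append(p)
        # An empty cluster keeps its previous centre.
        new_centres = [
            tuple(sum(col) / len(g) for col in zip(*g)) if g else centres[i]
            for i, g in enumerate(groups)
        ]
        if new_centres == centres:
            break
        centres = new_centres
    return centres


def dominant_centres(points, k, runs=50, decimals=1, seed=0):
    """Repeat K-means++ `runs` times and return the most frequent set of
    centres with the fraction of runs that produced it.
    ASSUMPTION: this fraction is an illustrative stand-in, not the paper's
    Clustering Dominance Index."""
    rng = random.Random(seed)
    counts = Counter()
    for _ in range(runs):
        centres = kmeans(points, k, rng)
        # Round and sort so that equivalent solutions share one key.
        key = tuple(sorted(tuple(round(x, decimals) for x in c) for c in centres))
        counts[key] += 1
    best, freq = counts.most_common(1)[0]
    return best, freq / runs


# Hypothetical test data: three well-separated 2-D blobs of 30 points each.
data_rng = random.Random(1)
points = [
    (cx + data_rng.gauss(0, 0.5), cy + data_rng.gauss(0, 0.5))
    for cx, cy in [(0, 0), (10, 10), (20, 0)]
    for _ in range(30)
]

results = {k: dominant_centres(points, k) for k in (2, 3, 4)}
for k, (centres, fraction) in results.items():
    print(k, fraction)
```

Under these assumptions, a number of clusters whose most frequent solution recurs in nearly every run is a more stable candidate than one whose runs scatter across many different solutions. The paper's own metrics should be consulted for the actual selection rule.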
Funder
European Union’s Horizon Europe Research and Innovation Actions programme
Subject
General Mathematics, Engineering (miscellaneous), Computer Science (miscellaneous)
Cited by
5 articles.