Affiliation:
1. Computer Science and Engineering, P.E.S. College of Engineering, Aurangabad, Maharashtra, India
2. Assistant Professor, Vivekanand College, Aurangabad, Maharashtra, India
Abstract
Artificial Intelligence (AI) and Machine Learning (ML), which are becoming a part of interest rapidly for various researchers. ML is the field of Computer Science study, which gives capability to learn without being absolutely programmed. This work focuses on the standard k-means clustering algorithm and analysis the shortcomings of the standard k-means algorithm. The k-means clustering algorithm calculates the distance between each data object and not all cluster centres in every iteration, which makes the efficiency of clustering is high. In this work, we have to try to improve the k-means algorithm to solve simple data to store some information in every iteration, which is to be used in the next interaction. This method avoids computing distance of data object to the cluster centre repeatedly, saving the running time. An experimental result shows the enhanced speed of clustering, accuracy, reducing the computational complexity of the k-means. In this, we have work on iris dataset extracted from Kaggle.
Reference14 articles.
1. Bhattacharya, Sambit & Czejdo, Bogdan & Agrawal, Rajeev & Erdemir, Erdem & Gokaraju, Balakrishna. (2018). 1-4. 10.1109/SECON.2018.8479098. Sambit Bhattacharya,
2. Mohamed Alloghani, Dhiya Al-Jumeily, Jamila Mustafina “A Systematic Review on Supervised and Unsupervised Machine Learning Algorithms for Data Science” January 2020 DOI: 10.1007/978-3-030-22475-2_1 In book: Supervised and Unsupervised Learning for Data Science (pp.3-21)
3. https://www.ibm.com/cloud/learn/unsupervised-learning
4. L. B. Goncalves, M. M. B. R. Vellasco, M. A. C. Pacheco and Flavio Joaquim de Souza, "Inverted hierarchical neuro-fuzzy BSP system: a novel neuro-fuzzy model for pattern classification and rule extraction in databases," in IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), vol. 36, no. 2, pp. 236-248, March 2006.
5. P. H. Ahmad and S. Dang, "Performance evaluation of clustering algorithm using different datasets", Int. J. Adv. Res. Comput. Sci. Manag. Stud., vol. 3, no. 1, pp. 167-173, 2015.