Text Document Clustering Approach by Improved Sine Cosine Algorithm
-
Published:2023-07-15
Issue:2
Volume:52
Page:541-561
-
ISSN:2335-884X
-
Container-title:Information Technology and Control
-
language:
-
Short-container-title:ITC
Author:
Radomirović Branislav,Jovanović Vuk,Nikolić Bosko,Stojanović Sasa,Venkatachalam K.,Zivkovic Miodrag,Njeguš Angelina,Bacanin Nebojsa,Strumberger Ivana
Abstract
Due to the vast amounts of textual data available in various forms such as online content, social media comments, corporate data, public e-services and media data, text clustering has been experiencing rapid development. Text clustering involves categorizing and grouping similar content. It is a process of identifying significant patterns from unstructured textual data. Algorithms are being developed globally to extract useful and relevant information from large amounts of text data. Measuring the significance of content in documents to partition the collection of text data is one of the most important obstacles in text clustering. This study suggests utilizing an improved metaheuristics algorithm to fine-tune the K-means approach for text clustering task. The suggested technique is evaluated using the first 30 unconstrained test functions from the CEC2017 test-suite and six standard criterion text datasets. The simulation results and comparison with existing techniques demonstrate the robustness and supremacy of the suggested method.
Publisher
Kaunas University of Technology (KTU)
Subject
Electrical and Electronic Engineering,Computer Science Applications,Control and Systems Engineering
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献