Hybrid Fruit-Fly Optimization Algorithm with K-Means for Text Document Clustering-Reference-Cited by-同舟云学术

Hybrid Fruit-Fly Optimization Algorithm with K-Means for Text Document Clustering

Published:2021-08-13 Issue:16 Volume:9 Page:1929
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Bezdan Timea^ORCID,Stoean Catalin^ORCID,Naamany Ahmed Al^ORCID,Bacanin Nebojsa^ORCID,Rashid Tarik A.^ORCID,Zivkovic Miodrag^ORCID,Venkatachalam K.^ORCID

Abstract

The fast-growing Internet results in massive amounts of text data. Due to the large volume of the unstructured format of text data, extracting relevant information and its analysis becomes very challenging. Text document clustering is a text-mining process that partitions the set of text-based documents into mutually exclusive clusters in such a way that documents within the same group are similar to each other, while documents from different clusters differ based on the content. One of the biggest challenges in text clustering is partitioning the collection of text data by measuring the relevance of the content in the documents. Addressing this issue, in this work a hybrid swarm intelligence algorithm with a K-means algorithm is proposed for text clustering. First, the hybrid fruit-fly optimization algorithm is tested on ten unconstrained CEC2019 benchmark functions. Next, the proposed method is evaluated on six standard benchmark text datasets. The experimental evaluation on the unconstrained functions, as well as on text-based documents, indicated that the proposed approach is robust and superior to other state-of-the-art methods.

Funder

Romanian Ministry of Education and Research

Ministarstvo Prosvete, Nauke i Tehnološkog Razvoja

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/9/16/1929/pdf

Reference74 articles.

1. A new Fruit Fly Optimization Algorithm: Taking the financial distress model as an example

2. Firefly Algorithms for Multimodal Optimization;Yang,2009

3. Some Methods for Classification and Analysis of MultiVariate Observations;MacQueen,1967

4. A Comprehensive Survey of Clustering Algorithms

Cited by 90 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Clustering Method with Graph Maximum Decoding Information;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

2. Optimizing Machine Learning for Breast Cancer Detection by Hybrid Metaheuristic Approach;2024 12th International Symposium on Digital Forensics and Security (ISDFS);2024-04-29

3. Design and optimization of haze prediction model based on particle swarm optimization algorithm and graphics processor;Scientific Reports;2024-04-26

4. A novel text clustering model based on topic modelling and social network analysis;Chaos, Solitons & Fractals;2024-04

5. A Novel Artificial Electric Field Algorithm for Solving Global Optimization and Real-World Engineering Problems;Biomimetics;2024-03-19