An Improved B-hill Climbing Optimization Technique for Solving the Text Documents Clustering Problem

Author:

Abualigah Laith Mohammad1,Hanandeh Essam Said2,Khader Ahamad Tajudin3,Otair Mohammed Abdallh1,Shandilya Shishir Kumar4

Affiliation:

1. Faculty of Computer Sciences and Informatics, Amman Arab University, Amman - 11953, Jordan

2. Department of Computer Information System, Zarqa University, P.O. Box 13132, Zarqa, Jordan

3. School of Computer Science, Universiti Sains Malaysia, Penang, Malaysia

4. Department of Computer Science & Engineering, NRI Institute of Information Science and Technology, Bhopal, India

Abstract

Background: Considering the increasing volume of text document information on Internet pages, dealing with such a tremendous amount of knowledge becomes totally complex due to its large size. Text clustering is a common optimization problem used to manage a large amount of text information into a subset of comparable and coherent clusters. Aims: This paper presents a novel local clustering technique, namely, β-hill climbing, to solve the problem of the text document clustering through modeling the β-hill climbing technique for partitioning the similar documents into the same cluster. Methods: The β parameter is the primary innovation in β-hill climbing technique. It has been introduced in order to perform a balance between local and global search. Local search methods are successfully applied to solve the problem of the text document clustering such as; k-medoid and kmean techniques. Results: Experiments were conducted on eight benchmark standard text datasets with different characteristics taken from the Laboratory of Computational Intelligence (LABIC). The results proved that the proposed β-hill climbing achieved better results in comparison with the original hill climbing technique in solving the text clustering problem. Conclusion: The performance of the text clustering is useful by adding the β operator to the hill climbing.

Publisher

Bentham Science Publishers Ltd.

Subject

Radiology, Nuclear Medicine and imaging

Reference47 articles.

1. Shehab M.; Khader A.T.; Al-Betar M.A.; Abualigah L.M.; Hybridizing cuckoo search algorithm with hill climbing for numerical optimization problems. In: 8th International Conference on Information Technology (ICIT) ,36-43

2. Bolaji A.L.A.; Al-Betar M.A.; Awadallah M.A.; Khader A.T.; Abualigah L.M.; A comprehensive review: Krill Herd algorithm (KH) and its applications. Appl Soft Comput 2016,49,437-446

3. Abualigah L.M.; Khader A.T.; Al-Betar M.A.; Multi-objectives-based text clustering technique using K-mean algorithm. In: 7th International Conference on Computer Science and Information Technology (CSIT) IEEE 2016; pp. 1-6

4. Tunali A.V.; Bilgin T.; Camurcu A.; An improved clustering algorithm for text mining: Multi-cluster spherical k-means. Int Arab J Inf Technol 2016,13(1),12-19

5. Abualigah L.M.; Khader A.T.; Al-Betar M.A.; Alomari O.A.; Text feature selection with a robust weight scheme and dynamic dimension reduction to text document clustering. Expert Syst Appl 2017,84,24-36

Cited by 10 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3