Chameleon 2

Author:

Barton Tomas1ORCID,Bruna Tomas2,Kordik Pavel2

Affiliation:

1. Czech Technical University in Prague, Institute of Molecular Genetics ASCR

2. Czech Technical University in Prague, Prague, Czech Republic

Abstract

Traditional clustering algorithms fail to produce human-like results when confronted with data of variable density, complex distributions, or in the presence of noise. We propose an improved graph-based clustering algorithm called Chameleon 2, which overcomes several drawbacks of state-of-the-art clustering approaches. We modified the internal cluster quality measure and added an extra step to ensure algorithm robustness. Our results reveal a significant positive impact on the clustering quality measured by Normalized Mutual Information on 32 artificial datasets used in the clustering literature. This significant improvement is also confirmed on real-world datasets. The performance of clustering algorithms such as DBSCAN is extremely parameter sensitive, and exhaustive manual parameter tuning is necessary to obtain a meaningful result. All hierarchical clustering methods are very sensitive to cutoff selection, and a human expert is often required to find the true cutoff for each clustering result. We present an automated cutoff selection method that enables the Chameleon 2 algorithm to generate high-quality clustering in autonomous mode.

Funder

Youth and Sports of Czech Republic

Agency of the Czech Republic

Ministry of Education

CTU

ERDF Grant Upgrade of National Infrastructure for Chemical Biology

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Reference54 articles.

Cited by 27 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. A comprehensive review of clustering techniques in artificial intelligence for knowledge discovery: Taxonomy, challenges, applications and future prospects;Advanced Engineering Informatics;2024-10

2. Hierarchical clustering algorithm based on natural local density peaks;Signal, Image and Video Processing;2024-08-11

3. Object Based Analytics for Finding Homogeneous Collections in Aerial Imagery Datasets;IGARSS 2024 - 2024 IEEE International Geoscience and Remote Sensing Symposium;2024-07-07

4. Graph Convolutional Spectral Clustering for Electricity Market Data Clustering;Applied Sciences;2024-06-18

5. Finding Homogeneous Collections in Aerial Object Detection Datasets;2024 11th International Conference on Signal Processing and Integrated Networks (SPIN);2024-03-21

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3