Normalised Clustering Accuracy: An Asymmetric External Cluster Validity Measure-Reference-Cited by-同舟云学术

Normalised Clustering Accuracy: An Asymmetric External Cluster Validity Measure

Published:2024-06-28 Issue: Volume: Page:
ISSN:0176-4268
Container-title:Journal of Classification
language:en
Short-container-title:J Classif

Author:

Gagolewski Marek^ORCID

Abstract

AbstractThere is no, nor will there ever be, single best clustering algorithm. Nevertheless, we would still like to be able to distinguish between methods that work well on certain task types and those that systematically underperform. Clustering algorithms are traditionally evaluated using either internal or external validity measures. Internal measures quantify different aspects of the obtained partitions, e.g., the average degree of cluster compactness or point separability. However, their validity is questionable because the clusterings they endorse can sometimes be meaningless. External measures, on the other hand, compare the algorithms’ outputs to fixed ground truth groupings provided by experts. In this paper, we argue that the commonly used classical partition similarity scores, such as the normalised mutual information, Fowlkes–Mallows, or adjusted Rand index, miss some desirable properties. In particular, they do not identify worst-case scenarios correctly, nor are they easily interpretable. As a consequence, the evaluation of clustering algorithms on diverse benchmark datasets can be difficult. To remedy these issues, we propose and analyse a new measure: a version of the optimal set-matching accuracy, which is normalised, monotonic with respect to some similarity relation, scale-invariant, and corrected for the imbalancedness of cluster sizes (but neither symmetric nor adjusted for chance).

Funder

Australian Research Council

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s00357-024-09482-2.pdf

Reference62 articles.

1. Ackerman, M., Ben-David, S., Brânzei, S., & Loker, D. (2021). Weighted clustering: Towards solving the user’s dilemma. Pattern Recognition, 120, 108152. https://doi.org/10.1016/j.patcog.2021.108152

2. Andrews, J., Browne, R., & Hvingelby, C. (2022). On assessments of agreement between fuzzy partitions. Journal of Classification, 39, 326–342.

3. Arbelaitz, O., Gurrutxaga, I., Muguerza, J., Pérez, J. M., & Perona, I. (2013). An extensive comparative study of cluster validity indices. Pattern Recognition, 46(1), 243–256. https://doi.org/10.1016/j.patcog.2012.07.021

4. Arinik, N., Labatut, V., & Figueiredo, R. (2021). Characterizing and comparing external measures for the assessment of cluster analysis and community detection. IEEE Access, 9, 20255–20276. https://doi.org/10.1109/ACCESS.2021.3054621

5. Arnold, B. C. (2015). Pareto distributions. New York, USA: Chapman and Hall/CRC. https://doi.org/10.1201/b18141

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. genieclust: Fast and Robust Hierarchical Clustering with Noise Points Detection;CRAN: Contributed Packages;2020-07-30