Abstract
We introduce a new approach to clustering categorical data: Condorcet clustering with a fixed number of groups, denoted α-Condorcet. As k-modes, this approach is essentially based on similarity and dissimilarity measures. The paper is divided into three parts: first, we propose a new Condorcet criterion, with a fixed number of groups (to select cases into clusters). In the second part, we propose a heuristic algorithm to carry out the task. In the third part, we compare α-Condorcet clustering with k-modes clustering. The comparison is made with a quality’s index, accuracy of a measurement, and a within-cluster sum-of-squares index. Our findings are illustrated using real datasets: the feline dataset and the US Census 1990 dataset.
Funder
Agencia Nacional de Investigación y Desarrollo
DIUBB
Subject
General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)
Reference40 articles.
1. Zur Differentialdiagnose der Neandertalgruppe;Czekanowski;Korespondentblatt der Deutschen Gesellschaft für Anthropologie Ethnologie und Urgeschichte,1909
2. A survey on clustering methods and algorithms;Harkanth;Int. J. Comput. Sci. Inf. Technol.,2013
3. AN OVERVIEW ON CLUSTERING METHODS
4. AN OVERVIEW ON CLUSTERING METHODS
5. Survey of Clustering Algorithms
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献