K‐Plus anticlustering: An improved <i>k</i>‐means criterion for maximizing between‐group similarity-Reference-Cited by-同舟云学术

K‐Plus anticlustering: An improved k‐means criterion for maximizing between‐group similarity

Published:2023-07-11 Issue:1 Volume:77 Page:80-102
ISSN:0007-1102
Container-title:British Journal of Mathematical and Statistical Psychology
language:en
Short-container-title:Brit J Math & Statis

Author:

Papenberg Martin¹^ORCID

Affiliation:

1. Heinrich Heine University Düsseldorf Düsseldorf Germany

Abstract

AbstractAnticlustering refers to the process of partitioning elements into disjoint groups with the goal of obtaining high between‐group similarity and high within‐group heterogeneity. Anticlustering thereby reverses the logic of its better known twin—cluster analysis—and is usually approached by maximizing instead of minimizing a clustering objective function. This paper presents k‐plus, an extension of the classical k‐means objective of maximizing between‐group similarity in anticlustering applications. K‐plus represents between‐group similarity as discrepancy in distribution moments (means, variance, and higher‐order moments), whereas the k‐means criterion only reflects group differences with regard to means. While constituting a new criterion for anticlustering, it is shown that k‐plus anticlustering can be implemented by optimizing the original k‐means criterion after the input data have been augmented with additional variables. A computer simulation and practical examples show that k‐plus anticlustering achieves high between‐group similarity with regard to multiple objectives. In particular, optimizing between‐group similarity with regard to variances usually does not compromise similarity with regard to means; the k‐plus extension is therefore generally preferred over classical k‐means anticlustering. Examples are given on how k‐plus anticlustering can be applied to real norming data using the open source R package anticlust, which is freely available via CRAN.

Publisher

Wiley

Subject

General Psychology,Arts and Humanities (miscellaneous),General Medicine,Statistics and Probability

Reference45 articles.

1. Graphs in Statistical Analysis

2. Aust F. &Barth M.(2018).papaja: Create APA manuscripts with R Markdown.https://github.com/crsh/papaja

3. Methods for assigning students to groups: a study of alternative objective functions

4. Intense Beauty Requires Intense Pleasure

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Efficient neighborhood evaluation for the maximally diverse grouping problem;Annals of Operations Research;2024-08-21

2. anticlust: Subset Partitioning via Anticlustering;CRAN: Contributed Packages;2020-06-29