Affiliation:
1. Mathematisch Instituut Universiteit Leiden Leiden Netherlands
2. Department of Statistical Sciences University of Padua Padua Italy
Abstract
AbstractWe consider the problem of clustering functional data according to their covariance structure. We contribute a soft clustering methodology based on the Wasserstein–Procrustes distance, where the in‐between cluster variability is penalized by a term proportional to the entropy of the partition matrix. In this way, each covariance operator can be partially classified into more than one group. Such soft classification allows for clusters to overlap, and arises naturally in situations where the separation between all or some of the clusters is not well‐defined. We also discuss how to estimate the number of groups and to test for the presence of any cluster structure. The algorithm is illustrated using simulated and real data. An R implementation is available in the Appendix S1.
Subject
Statistics, Probability and Uncertainty,Statistics and Probability
Reference38 articles.
1. Barycenters in the Wasserstein space;Agueh M.;Society for Industrial and Applied Mathematics,2011
2. On functional data analysis and related topics;Aneiros G.;Journal of Multivariate Analysis,2022
3. An extensive comparative study of cluster validity indices;Arbelaitz O.;Pattern Recognition,2013
4. Clustering with the average silhouette width;Batool F.;Computational Statistics & Data Analysis,2021
5. Testing equality between several populations covariance operators;Boente G.;Annals of the Institute of Statistical Mathematics,2018
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献