Affiliation:
1. Department of Mathematics, Aarhus University , Ny Munkegade 118, DK-8000 Aarhus C, Denmark
Abstract
Summary
There is growing empirical evidence that spherical $k$-means clustering performs well at identifying groups of concomitant extremes in high dimensions, thereby leading to sparse models. We provide one of the first theoretical results supporting this approach, but also demonstrate some pitfalls. Furthermore, we show that an alternative cost function may be more appropriate for identifying concomitant extremes, and it results in a novel spherical $k$-principal-components clustering algorithm. Our main result establishes a broadly satisfied sufficient condition guaranteeing the success of this method, albeit in a rather basic setting. Finally, we illustrate in simulations that $k$-principal components clustering outperforms $k$-means clustering in the difficult case of weak asymptotic dependence within the groups.
Publisher
Oxford University Press (OUP)
Subject
Applied Mathematics,Statistics, Probability and Uncertainty,General Agricultural and Biological Sciences,Agricultural and Biological Sciences (miscellaneous),General Mathematics,Statistics and Probability
Reference27 articles.
1. Dimension reduction in multivariate extreme value analysis;Chautru,;Electron. J. Statist.,2015
2. A multivariate extreme value theory approach to anomaly clustering and visualization;Chiapino,;Comp. Statist.,2020
3. Decompositions of dependence for high-dimensional extremes;Cooley,;Biometrika,2019
4. Statistics of extremes;Davison,;Ann. Rev. Statist. Appl.,2015
5. Geostatistics of dependent and asymptotically independent extremes;Davison,;Math. Geosci.,2013
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Total positivity in multivariate extremes;The Annals of Statistics;2023-06-01
2. Structure Learning for Extremal Tree Models;Journal of the Royal Statistical Society Series B: Statistical Methodology;2022-11-01