Estimation of the Complexity of a Finite Mixture Distribution: From Well- to Less Known Methods

Published: 2022-08-25 Issue: 4 Volume: 16
ISSN: 1559-8608
Container-title: Journal of Statistical Theory and Practice
Language: en
Short-container-title: J Stat Theory Pract

Author:

Balabdaoui Fadoua, Kolar Andrei, Kulagina Yulia, Müller Lilian

Abstract

Mixture models occur in numerous settings, including random and fixed effects models, clustering, deconvolution, empirical Bayes problems and many others. They are often used to model data originating from a heterogeneous population consisting of several homogeneous subpopulations, and the problem of finding a good estimator for the number of components in the mixture arises naturally. Estimation of the order of a finite mixture model is a hard statistical task, and multiple techniques have been suggested for solving it. We will concentrate on several methods that have not gained much popularity yet deserve the attention of practitioners. These can be categorized into three groups: tools built upon the determinant of the Hankel matrix of moments of the mixing distribution, minimum distance estimators, and likelihood ratio tests. We will address the theoretical pillars underlying each of the methods, provide some useful modifications for enhancing their performance, and present the results of a comparative numerical study conducted under various scenarios. According to the results, none of the methods proves to be a “magic pill”. The results uncover limitations of the techniques and provide practical hints for choosing the best-suited tool under specific conditions.

Funder

Swiss Federal Institute of Technology Zurich

Publisher

Springer Science and Business Media LLC

Subject

Statistics and Probability

Link

https://link.springer.com/content/pdf/10.1007/s42519-022-00289-1.pdf

References: 76 articles.

1. Aitkin M, Anderson D, Hinde J (1981) Statistical modelling of data on teaching styles. J R Stat Soc Ser A (General) 144(4):419–448. https://doi.org/10.2307/2981826

2. Akaike H (1998) Information theory and an extension of the maximum likelihood principle. In: Selected papers of Hirotugu Akaike, pp 199–213. Springer

3. Aldrich J (1997) R.A. Fisher and the making of maximum likelihood 1912–1922. Stat Sci 12(3):162–176

4. Azzalini A, Bowman AW (1990) A look at some data on the Old Faithful geyser. J R Stat Soc Ser C (Appl Stat) 39(3):357–365. https://doi.org/10.2307/2347385

5. Balabdaoui F, Butucea C (2014) On location mixtures with Pólya frequency components. Stat Probab Lett 95:144–149. https://doi.org/10.1016/j.spl.2014.08.013