Missing data imputation in multivariate t distribution with unknown degrees of freedom using expectation maximization algorithm and its stochastic variants-Reference-Cited by-同舟云学术

Missing data imputation in multivariate t distribution with unknown degrees of freedom using expectation maximization algorithm and its stochastic variants

Published:2020-10-09 Issue:3 Volume:15 Page:263-272
ISSN:1574-1699
Container-title:Model Assisted Statistics and Applications
language:
Short-container-title:MAS

Author:

Kinyanjui Paul Kimani,Tamba Cox Lwaka,Orawo Luke Akong’o,Okenye Justin Obwoge

Abstract

Many researchers encounter the missing data problem. The phenomenon may be occasioned by data omission, non-response, death of respondents, recording errors, among others. It is important to find an appropriate data imputation technique to fill in the missing positions. In this study, the Expectation Maximization (EM) algorithm and two of its stochastic variants, stochastic EM (SEM) and Monte Carlo EM (MCEM), are employed in missing data imputation and parameter estimation in multivariate t distribution with unknown degrees of freedom. The imputation efficiencies of the three methods are then compared using mean square error (MSE) criterion. SEM yields the lowest MSE, making it the most efficient method in data imputation when the data assumes the multivariate t distribution. The algorithm’s stochastic nature enables it to avoid local saddle points and achieve global maxima; ultimately increasing its efficiency. The EM and MCEM techniques yield almost similar results. Large sample draws in the MCEM’s E-step yield more or less the same results as the deterministic EM. In parameter estimation, it is observed that the parameter estimates for EM and MCEM are relatively close to the simulated data’s maximum likelihood (ML) estimates. This is not the case in SEM, owing to the random nature of the algorithm.

Publisher

IOS Press

Subject

Applied Mathematics,Modelling and Simulation,Statistics and Probability

Reference31 articles.

1. Biscarat, J. C., Celeux, G., & Diebolt, J. (1992). Stochastic versions of the EM algorithm (No. TR-227). Washington University Seattle Department of Statistics.

2. The SEM algorithm: a probabilistic teacher algorithm derived from the EM algorithm for the mixture problem;Celeux;Computational Statistics Quarterly,1985

3. Convergence of a stochastic approximation version of the EM algorithm;Delyon;Annals of Statistics,1999

4. Maximum likelihood from incomplete data via the EM algorithm;Dempster;Journal of the royal statistical society. Series B (methodological),1977

5. A new REML (parameter expanded) EM algorithm for linear mixed models;Diffey;Australian & New Zealand Journal of Statistics,2017

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. CondMVT: Conditional Multivariate t Distribution, Expectation Maximization Algorithm, and Its Stochastic Variants;CRAN: Contributed Packages;2022-06-28