Abstract
In this work, we present a rigorous application of the Expectation Maximization algorithm to determine the marginal distributions and the dependence structure in a Gaussian copula model with missing data. We further show how to circumvent a priori assumptions on the marginals with semiparametric modeling. Further, we outline how expert knowledge on the marginals and the dependency structure can be included. A simulation study shows that the distribution learned through this algorithm is closer to the true distribution than that obtained with existing methods and that the incorporation of domain knowledge provides benefits.
Subject
General Physics and Astronomy
Reference48 articles.
1. Imputing missings in official statistics for general tasks–our vote for distributional accuracy;Thurow;Stat. J. IAOS,2021
2. Missing value imputation for industrial IoT sensor data with large gaps;Liu;IEEE Internet Things J.,2020
3. Silverman, B. (2018). Density Estimation for Statistics and Data Analysis, Routledge.
4. Kertel, M., Harmeling, S., and Pauly, M. (2022). Learning causal graphs in manufacturing domains using structural equation models. arXiv.
5. A semiparametric estimation procedure of dependence parameters in multivariate families of distributions;Genest;Biometrika,1995
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献