Abstract
Missing data is a common and inevitable phenomenon. In practical applications, the datasets usually contain noises for various reasons. Most of the existing missing data imputing algorithms are affected by noises which reduce the accuracy of the imputation. This paper proposes a noise-aware missing data multiple imputation algorithm NPMI in static data. Different multiple imputation models are proposed according to the missing mechanism of data. Secondly, the method to determine the imputation order of multivariablesmissing is given. A random sampling consistency algorithm is proposed to estimate the initial values of the parameters of the multiple imputation model to reduce the influence of noise data and improve the algorithm’s robustness. Experiments on two real datasets and two synthetic datasets verify the accuracy and efficiency of the proposed NPMI algorithm, and the results are analyzed.
Funder
National Natural Science Foundation of China
Fundamental Research Funds of the Central Universities
Subject
General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)
Reference36 articles.
1. Lindig, S., Louwen, A., Moser, D., and Topic, M. (2020). Outdoor PV system monitoring—Input data quality, data imputation and filtering approaches. Energies, 13.
2. Proposing suitable data imputation methods by adopting a Stage wise approach for various classes of smart meters missing data–Practical approach;Hemanth;Expert Syst. Appl.,2022
3. Data gaps, data incomparability, and data imputation: A review of poverty measurement methods for data-scarce environments;Dang;J. Econ. Surv.,2019
4. Missing data imputation using an iterative denoising autoencoder (IDAE) for dissolved gas analysis;Seo;Electr. Power Syst. Res.,2022
5. Missing Data Imputation: A Survey;Kelkar;Int. J. Decis. Support Syst. Technol.,2022
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献