Reviewing Autoencoders for Missing Data Imputation: Technical Trends, Applications and Outcomes

Published: 2020-12-14 Volume: 69 Pages: 1255-1285
ISSN: 1076-9757
Container-title: Journal of Artificial Intelligence Research
Short-container-title: jair

Author:

Cardoso Pereira Ricardo, Seoane Santos Miriam, Pereira Rodrigues Pedro, Henriques Abreu Pedro

Abstract

Missing data is a problem often found in real-world datasets and it can degrade the performance of most machine learning models. Several deep learning techniques have been used to address this issue, and one of them is the Autoencoder and its Denoising and Variational variants. These models are able to learn a representation of the data with missing values and generate plausible new ones to replace them. This study surveys the use of Autoencoders for the imputation of tabular data and considers 26 works published between 2014 and 2020. The analysis is mainly focused on discussing patterns and recommendations for the architecture, hyperparameters and training settings of the network, while providing a detailed discussion of the results obtained by Autoencoders when compared to other state-of-the-art methods, and of the data contexts where they have been applied. The conclusions include a set of recommendations for the technical settings of the network, and show that Denoising Autoencoders outperform their competitors, particularly the often used statistical methods.

Publisher

AI Access Foundation

Subject

Artificial Intelligence

Cited by 39 articles.

1. Imputation of data Missing Not at Random: Artificial generation and benchmark analysis;Expert Systems with Applications;2024-09

2. SWIR based estimation of TIR emissivity of bare soil surfaces using deep conditional generative adversarial network in Landsat data;Plant and Soil;2024-08-06

3. XU-NetI: Simple U-Shaped Encoder-Decoder Network for Accurate Imputation of Multivariate Missing Data;Franklin Open;2024-08

4. Ephemeris accuracy improvement for moons of gas giants: a deep learning based method;Discover Space;2024-07-03

5. A transferred spatio-temporal deep model based on multi-LSTM auto-encoder for air pollution time series missing value imputation;Future Generation Computer Systems;2024-07