Autoencoders reloaded-Reference-Cited by-同舟云学术

Autoencoders reloaded

Published:2022-06-21 Issue:4 Volume:116 Page:389-406
ISSN:1432-0770
Container-title:Biological Cybernetics
language:en
Short-container-title:Biol Cybern

Author:

Bourlard Hervé,Kabil Selen Hande^ORCID

Abstract

AbstractIn Bourlard and Kamp (Biol Cybern 59(4):291–294, 1998), it was theoretically proven that autoencoders (AE) with single hidden layer (previously called “auto-associative multilayer perceptrons”) were, in the best case, implementing singular value decomposition (SVD) Golub and Reinsch (Linear algebra, Singular value decomposition and least squares solutions, pp 134–151. Springer, 1971), equivalent to principal component analysis (PCA) Hotelling (Educ Psychol 24(6/7):417–441, 1993); Jolliffe (Principal component analysis, springer series in statistics, 2nd edn. Springer, New York ). That is, AE are able to derive the eigenvalues that represent the amount of variance covered by each component even with the presence of the nonlinear function (sigmoid-like, or any other nonlinear functions) present on their hidden units. Today, with the renewed interest in “deep neural networks” (DNN), multiple types of (deep) AE are being investigated as an alternative to manifold learning Cayton (Univ California San Diego Tech Rep 12(1–17):1, 2005) for conducting nonlinear feature extraction or fusion, each with its own specific (expected) properties. Many of those AE are currently being developed as powerful, nonlinear encoder–decoder models, or used to generate reduced and discriminant feature sets that are more amenable to different modeling and classification tasks. In this paper, we start by recalling and further clarifying the main conclusions of Bourlard and Kamp (Biol Cybern 59(4):291–294, 1998), supporting them by extensive empirical evidences, which were not possible to be provided previously (in 1988), due to the dataset and processing limitations. Upon full understanding of the underlying mechanisms, we show that it remains hard (although feasible) to go beyond the state-of-the-art PCA/SVD techniques for auto-association. Finally, we present a brief overview on different autoencoder models that are mainly in use today and discuss their rationale, relations and application areas.

Funder

Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung

Publisher

Springer Science and Business Media LLC

Subject

General Computer Science,Biotechnology

Link

https://link.springer.com/content/pdf/10.1007/s00422-022-00937-6.pdf

Reference65 articles.

1. Ashby WR (1961) An introduction to cybernetics. Chapman & Hall Ltd, New York

2. Baldi P (2012) Autoencoders, unsupervised learning, and deep architectures. In: Proceedings of ICML workshop on unsupervised and transfer learning. JMLR Workshop and Conference Proceedings, pp 37–49

3. Baldi P, Hornik K (1989) Neural networks and principal component analysis: Learning from examples without local minima. Neural Netw 2(1):53–58

4. Baldi PF, Hornik K (1995) Learning in linear neural networks: a survey. IEEE Trans Neural Netw 6(4):837–858

5. Bengio Y, Lamblin P, Popovici D, Larochelle H (2007) Greedy layer-wise training of deep networks. In: Advances in neural information processing systems, pp 153–160

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Deep learning based decoding of single local field potential events;NeuroImage;2024-08

2. Data-Driven Nonintrusive Model-Order Reduction for Aerodynamic Design Optimization;AIAA Journal;2024-07

3. Chatter monitoring method of Ti-6Al-4V thin-walled parts based on MAML optimized transfer learning;The International Journal of Advanced Manufacturing Technology;2024-06-15

4. Conditional Aggregation Operator Defined by the Power Information Concerning Type-2 Fuzzy Deep Learning Algorithm for Financial Investment Data Decision-Making;IEEE Access;2024

5. Fault Detection via Autoencoder Latent Space Differences Between Reference Model and the Plant Operation;IFAC-PapersOnLine;2024