From NeurODEs to AutoencODEs: A mean-field control framework for width-varying neural networks-Reference-Cited by-同舟云学术

From NeurODEs to AutoencODEs: A mean-field control framework for width-varying neural networks

Published:2024-02-08 Issue: Volume: Page:1-43
ISSN:0956-7925
Container-title:European Journal of Applied Mathematics
language:en
Short-container-title:Eur. J. Appl. Math

Author:

Cipriani Cristina^ORCID,Fornasier Massimo,Scagliotti Alessandro^ORCID

Abstract

Abstract The connection between Residual Neural Networks (ResNets) and continuous-time control systems (known as NeurODEs) has led to a mathematical analysis of neural networks, which has provided interesting results of both theoretical and practical significance. However, by construction, NeurODEs have been limited to describing constant-width layers, making them unsuitable for modelling deep learning architectures with layers of variable width. In this paper, we propose a continuous-time Autoencoder, which we call AutoencODE, based on a modification of the controlled field that drives the dynamics. This adaptation enables the extension of the mean-field control framework originally devised for conventional NeurODEs. In this setting, we tackle the case of low Tikhonov regularisation, resulting in potentially non-convex cost landscapes. While the global results obtained for high Tikhonov regularisation may not hold globally, we show that many of them can be recovered in regions where the loss function is locally convex. Inspired by our theoretical findings, we develop a training method tailored to this specific type of Autoencoders with residual connections, and we validate our approach through numerical experiments conducted on various examples.

Publisher

Cambridge University Press (CUP)

Reference47 articles.

1. Deep learning as optimal control problems: models and numerical methods;Benning;J. Comput. Dyn.,2019

2. [23] Goodfellow, I. J. , Shlens, J. & Szegedy, C. (2014) Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572.

3. Modeling the influence of data structure on learning in neural networks: the hidden manifold model;Goldt;Phys. Rev. X,2020

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Minimax Optimal Control Approach for Robust Neural ODEs;2024 European Control Conference (ECC);2024-06-25