Abstract
Mean field theory has been successfully used to analyze deep neural networks (DNNs) in the infinite-size limit. Given the finite size of realistic DNNs, we utilize large deviation theory and path integral analysis to study the deviation of functions represented by DNNs from their typical mean field solutions. The parameter perturbations investigated include weight sparsification (dilution) and binarization, which are commonly used in model simplification, for both ReLU and sign activation functions. We find that random networks with ReLU activation are more robust to parameter perturbations than their counterparts with sign activation, which is arguably reflected in the simplicity of the functions they generate.
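A minimal numerical sketch of the setup described above (not the paper's path integral or large deviation calculation): propagate inputs through a deep random network, apply weight dilution or binarization, and measure how far the represented function moves from the unperturbed one. The layer width, depth, dilution rate, and deviation measure below are illustrative assumptions, not values taken from the article.

```python
# Sketch: output deviation of a random deep network under weight
# sparsification (dilution) and binarization, for ReLU vs sign activations.
import numpy as np

rng = np.random.default_rng(0)

def forward(x, weights, act):
    """Propagate inputs through a deep random network with activation `act`."""
    h = x
    for W in weights:
        pre = h @ W / np.sqrt(W.shape[0])  # 1/sqrt(N) scaling keeps pre-activations O(1)
        h = np.maximum(pre, 0.0) if act == "relu" else np.sign(pre)
    return h

def dilute(weights, p, rng):
    """Sparsify: keep each weight independently with probability 1 - p."""
    return [W * (rng.random(W.shape) > p) for W in weights]

def binarize(weights):
    """Binarize: replace each weight by its sign, discarding magnitudes."""
    return [np.sign(W) for W in weights]

N, depth, n_inputs = 500, 5, 200           # illustrative sizes
x = rng.standard_normal((n_inputs, N))
weights = [rng.standard_normal((N, N)) for _ in range(depth)]

for act in ("relu", "sign"):
    y0 = forward(x, weights, act)
    for name, pert in [("diluted 30%", dilute(weights, 0.3, rng)),
                       ("binarized", binarize(weights))]:
        y = forward(x, pert, act)
        # relative deviation of the represented function from the unperturbed one
        dev = np.linalg.norm(y - y0) / np.linalg.norm(y0)
        print(f"{act:5s} | {name:12s} | relative output deviation = {dev:.3f}")
```

In this toy comparison the relative deviation plays the role of a distance between the perturbed and unperturbed functions; the article's analysis instead characterizes the full distribution of such deviations for finite-size networks.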
Funder
Leverhulme Trust
Engineering and Physical Sciences Research Council
H2020 Marie Skłodowska-Curie Actions
Subject
General Physics and Astronomy, Mathematical Physics, Modelling and Simulation, Statistics and Probability, Statistical and Nonlinear Physics
Cited by: 7 articles.