Flexible, non-parametric modeling using regularized neural networks-Reference-Cited by-同舟云学术

Flexible, non-parametric modeling using regularized neural networks

Published:2022-01-07 Issue:4 Volume:37 Page:2029-2047
ISSN:0943-4062
Container-title:Computational Statistics
language:en
Short-container-title:Comput Stat

Author:

Allerbo Oskar^ORCID,Jörnsten Rebecka

Abstract

AbstractNon-parametric, additive models are able to capture complex data dependencies in a flexible, yet interpretable way. However, choosing the format of the additive components often requires non-trivial data exploration. Here, as an alternative, we propose PrAda-net, a one-hidden-layer neural network, trained with proximal gradient descent and adaptive lasso. PrAda-net automatically adjusts the size and architecture of the neural network to reflect the complexity and structure of the data. The compact network obtained by PrAda-net can be translated to additive model components, making it suitable for non-parametric statistical modelling with automatic model selection. We demonstrate PrAda-net on simulated data, where we compare the test error performance, variable importance and variable subset identification properties of PrAda-net to other lasso-based regularization approaches for neural networks. We also apply PrAda-net to the massive U.K. black smoke data set, to demonstrate how PrAda-net can be used to model complex and heterogeneous data with spatial and temporal components. In contrast to classical, statistical non-parametric approaches, PrAda-net requires no preliminary modeling to select the functional forms of the additive components, yet still results in an interpretable model representation.

Funder

Vetenskapsrådet

Stiftelsen för Strategisk Forskning

Publisher

Springer Science and Business Media LLC

Subject

Computational Mathematics,Statistics, Probability and Uncertainty,Statistics and Probability

Link

https://link.springer.com/content/pdf/10.1007/s00180-021-01190-4.pdf

Reference23 articles.

1. Ainsworth SK, Foti NJ, Lee AK, Fox EB (2018) oi-vae: output interpretable vaes for nonlinear group factor analysis. In: international conference on machine learning, pp. 119–128

2. Cybenko G (1989) Approximation by superpositions of a sigmoidal function. Math Control Signals Syst 2(4):303–314

3. Friedman J, Hastie T, Tibshirani R (2010) Regularization paths for generalized linear models via coordinate descent. J Stat Softw 33(1):1

4. Friedman JH, Stuetzle W (1981) Projection pursuit regression. J Am Stat Assoc 76(376):817–823

5. Hastie TJ, Tibshirani RJ (1990) Generalized additive models, vol 43. CRC Press, London

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. 3D Cable Intelligent Management Platform Based on Parametric Modeling Technology;Lecture Notes in Electrical Engineering;2024