Gradient free stochastic training of ANNs, with local approximation in partitions-Reference-Cited by-同舟云学术

Gradient free stochastic training of ANNs, with local approximation in partitions

Published:2023-03-07 Issue:7 Volume:37 Page:2603-2617
ISSN:1436-3240
Container-title:Stochastic Environmental Research and Risk Assessment
language:en
Short-container-title:Stoch Environ Res Risk Assess

Author:

Bakas N. P.,Langousis A.,Nicolaou M. A.,Chatzichristofis S. A.

Abstract

AbstractWe present a numerical scheme for computation of Artificial Neural Networks (ANN) weights, which stems from the Universal Approximation Theorem, avoiding costly iterations. The proposed algorithm adheres to the underlying theory, is highly fast, and results in remarkably low errors when applied to regression and classification problems of complex data sets with

$${\textbf{x}} \in {\mathbb {R}}^{n}$$

x ∈ R n (e.g. Griewank, Gomez-Levy, Shekel, and Polynomial functions) with random noise addition (i.e. Uniform, Normal, Generalized Pareto, Log-Normal, and a mixture of Log-Normal, Exponential, and Frechet), as well as the database for handwritten digits recognition MNIST (Modified National Institute of Standards and Technology) with

$$7\times 10^4$$

7 × 10 4 images. The same mathematical formulation was found capable of approximating highly nonlinear functions in multiple dimensions, with low errors (e.g.

$$10^{-10}$$

10 - 10 ) for the test set of the unknown functions, their higher-order partial derivatives, as well as numerically solving Partial Differential Equations, such as those appearing in Physics, Engineering, Environmental Sciences, etc. The method is based on the calculation of the weights of each neuron in small neighbourhoods of the data. Accordingly, optimization of hyperparameters is not necessary, as the number of neurons stems directly from the dimensionality of the data, further improving the algorithmic speed. Under this setting, overfitting is inherently avoided, and the results are interpretable and reproducible. The complexity of the proposed algorithm is of class P with

$${\mathcal {O}}(mNni_{cl} + Nmn^2+Nn^3 + mN^2+N^3)$$

O ( m N n i cl + N m n 2 + N n 3 + m N 2 + N 3 ) computing time, with respect to the observations m, features n, and Neurons N, contrary to the NP-Complete class of standard algorithms for ANN training. The performance of the method is high, irrespective of the size of the data set, and the test set errors are similar or smaller than the training errors, indicating the generalization efficiency of the algorithm. A supplementary computer code in Julia and Python Languages is provided, which can be used to reproduce the validation examples, and/or apply the algorithm to other data sets.

Funder

European Commission

European Regional Development Fund

University of Patras

Publisher

Springer Science and Business Media LLC

Subject

General Environmental Science,Safety, Risk, Reliability and Quality,Water Science and Technology,Environmental Chemistry,Environmental Engineering

Link

https://link.springer.com/content/pdf/10.1007/s00477-023-02407-2.pdf

Reference59 articles.

1. Arthur D, Vassilvitskii S (2006) How slow is the k-means method? In: Proceedings of the twenty-second annual symposium on computational geometry, ACM, New York, NY, USA, SCG ’06, pp 144–153. https://doi.org/10.1145/1137856.1137880

2. Arthur D, Vassilvitskii S (2007) k-means++: The advantages of careful seeding. In: Proceedings of the eighteenth annual ACM-SIAM symposium on discrete algorithms. Society for Industrial and Applied Mathematics, pp 1027–1035

3. Babouskos NG, Katsikadelis JT (2015) Optimum design of thin plates via frequency optimization using BEM. Arch Appl Mech 85(9–10):1175–1190. https://doi.org/10.1007/s00419-014-0962-7

4. Bakas NP (2019) Numerical solution for the extrapolation problem of analytic functions. Research 2019(3903187):1–10. https://doi.org/10.34133/2019/3903187

5. Bakas NP, Plevris V, Langousis A, Chatzichristofis SA (2022) ITSO: A novel inverse transform sampling-based optimization algorithm for stochastic search. Stoch Env Res Risk Assess 36(1):67–76

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Developing predictive models for the load-displacement response of laterally loaded reinforced concrete piles in stiff unsaturated clay using machine learning algorithms;Structures;2024-06

2. Fine-Tuning Large-Scale Project Scheduling;Lecture Notes in Business Information Processing;2024

3. Experimental investigation and predictive modeling of shear performance for concrete-encased steel beams using artificial neural networks;Materials and Structures;2023-09-02