Dynamic multilayer growth: Parallel vs. sequential approaches

Published: 2024-05-09 Issue: 5 Volume: 19 Page: e0301513
ISSN: 1932-6203
Container-title: PLOS ONE
Language: en
Short-container-title: PLoS ONE

Author:

Ross Matt, Berberian Nareg, Nikolla Albino, Chartier Sylvain

Abstract

Deciding when to add a new hidden unit or layer is a fundamental challenge for constructive algorithms, and it becomes more complex with multiple hidden layers. Growing both network width and depth offers a robust framework for capturing more information from the data and modeling more complex representations. With multiple hidden layers, should units be grown sequentially, in only one layer at a time, or in parallel, across multiple layers simultaneously? The effects of sequential and parallel growth are investigated using a population dynamics-inspired growing algorithm in a multilayer context, and a modified version of the constructive growing algorithm capable of growing in parallel is presented. Sequential and parallel growth methodologies are compared in a multilayer perceptron with three hidden layers on several benchmark classification tasks. Several variants of these approaches are developed for a more in-depth comparison, based on the type of hidden layer initialization and the weight update method employed. Comparisons are then made to another sequential growing approach, Dynamic Node Creation. Growing hidden layers in parallel resulted in comparable or higher performance than sequential approaches and promoted narrower deep architectures tailored to the task. Dynamic growth inspired by population dynamics offers the potential to grow the width and depth of deeper neural networks in either a sequential or parallel fashion.
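The distinction between the two growth schedules can be illustrated with a toy sketch. It is not the authors' algorithm: the function names, the fixed `new_units` increment, and the choice to grow only the first layer in the sequential case are all assumptions made for illustration. The population dynamics-inspired criterion that decides when growth happens is not described in the abstract and is not modeled here.

```python
from typing import List


def grow_sequential(widths: List[int], active_layer: int, new_units: int = 1) -> List[int]:
    """Hypothetical helper: add units to a single hidden layer only."""
    grown = list(widths)
    grown[active_layer] += new_units
    return grown


def grow_parallel(widths: List[int], new_units: int = 1) -> List[int]:
    """Hypothetical helper: add units to every hidden layer in the same step."""
    return [w + new_units for w in widths]


# Three hidden layers, matching the setting compared in the abstract.
initial_widths = [2, 2, 2]

# Sequential schedule (assumption: only layer 0 is currently being grown).
seq_widths = initial_widths
for _ in range(3):
    seq_widths = grow_sequential(seq_widths, active_layer=0)

# Parallel schedule: all layers grow together at each growth step.
par_widths = initial_widths
for _ in range(3):
    par_widths = grow_parallel(par_widths)

print("sequential:", seq_widths)
print("parallel:  ", par_widths)
```

The sketch only contrasts which layers change at a growth step. In a real constructive algorithm each step would also initialize the new units' weights and continue training, and growth would stop according to a task-dependent criterion.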

Funder

Natural Sciences and Engineering Research Council of Canada

Publisher

Public Library of Science (PLoS)
