Dynamic multilayer growth: Parallel vs. sequential approaches

Author:

Ross MattORCID,Berberian NaregORCID,Nikolla Albino,Chartier Sylvain

Abstract

The decision of when to add a new hidden unit or layer is a fundamental challenge for constructive algorithms. It becomes even more complex in the context of multiple hidden layers. Growing both network width and depth offers a robust framework for leveraging the ability to capture more information from the data and model more complex representations. In the context of multiple hidden layers, should growing units occur sequentially with hidden units only being grown in one layer at a time or in parallel with hidden units growing across multiple layers simultaneously? The effects of growing sequentially or in parallel are investigated using a population dynamics-inspired growing algorithm in a multilayer context. A modified version of the constructive growing algorithm capable of growing in parallel is presented. Sequential and parallel growth methodologies are compared in a three-hidden layer multilayer perceptron on several benchmark classification tasks. Several variants of these approaches are developed for a more in-depth comparison based on the type of hidden layer initialization and the weight update methods employed. Comparisons are then made to another sequential growing approach, Dynamic Node Creation. Growing hidden layers in parallel resulted in comparable or higher performances than sequential approaches. Growing hidden layers in parallel promotes growing narrower deep architectures tailored to the task. Dynamic growth inspired by population dynamics offers the potential to grow the width and depth of deeper neural networks in either a sequential or parallel fashion.

Funder

Natural Sciences and Engineering Research Council of Canada

Publisher

Public Library of Science (PLoS)

Reference69 articles.

1. From shallow feature learning to deep learning: Benefits from the width and depth of deep architectures;G Zhong;Wiley Interdiscip Rev Data Min Knowl Discov [Internet].,2019

2. Multilayer feedforward networks are universal approximators.;K Hornik;Neural Networks.,1989

3. On the approximate realization of continuous mappings by neural networks.;KI Funahashi;Neural Networks.,1989

4. On the complexity of neural network classifiers: A comparison between shallow and deep architectures;M Bianchini;IEEE Trans Neural Networks Learn Syst,2014

5. Deep learning.;Y Lecun;Nat 2015 5217553 [Internet].,2015

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3