Junk-neuron-deletion strategy for hyperparameter optimization of neural networks

Author:

Huang Ying,Gu Chang-Gui,Yang Hui-Jie,

Abstract

With the complexity of problems in reality increasing, the sizes of deep learning neural networks, including the number of layers, neurons, and connections, are increasing in an explosive way. Optimizing hyperparameters to improve the prediction performance of neural networks has become an important task. In literatures, the methods of finding optimal parameters, such as sensitivity pruning and grid search, are complicated and cost a large amount of computation time. In this paper, a hyperparameter optimization strategy called junk neuron deletion is proposed. A neuron with small mean weight in the weight matrix can be ignored in the prediction, and is defined subsequently as a junk neuron. This strategy is to obtain a simplified network structure by deleting the junk neurons, to effectively shorten the computation time and improve the prediction accuracy and model the generalization capability. The LSTM model is used to train the time series data generated by Logistic, Henon and Rossler dynamical systems, and the relatively optimal parameter combination is obtained by grid search with a certain step length. The partial weight matrix that can influence the model output is extracted under this parameter combination, and the neurons with smaller mean weights are eliminated with different thresholds. It is found that using the weighted mean value of 0.1 as the threshold, the identification and deletion of junk neurons can significantly improve the prediction efficiency. Increasing the threshold accuracy will gradually fall back to the initial level, but with the same prediction effect, more operating costs will be saved. Further reduction will result in prediction ability lower than the initial level due to lack of fitting. Using this strategy, the prediction performance of LSTM model for several typical chaotic dynamical systems is improved significantly.

Publisher

Acta Physica Sinica, Chinese Physical Society and Institute of Physics, Chinese Academy of Sciences

Subject

General Physics and Astronomy

Reference20 articles.

1. Deng S 2019 Appl. Res. Comput. 36 1984
邓帅 2019 计算机应用研究 36 1984

2. Shao E Z, Wu Z Y, Wang C 2020 Ind. Contrl. Comput. 33 11
邵恩泽, 吴正勇, 王灿 2020 工业控制计算机 33 11

3. Qiao J F, Fan R Y, Han H G, Ruan X G 2010 Contl. Theor. Appl. 27 111
乔俊飞, 樊瑞元, 韩红桂, 阮晓钢 2010 控制理论与应用 27 111

4. Chen G M, Yu T T, Liu X W 2021 J. Num. Method. Comp. Appl. 42 215
陈国茗, 于腾腾, 刘新为 2021 数值计算与计算机应用 42 215

5. Wei D Z, Chen F J, Zheng X X 2015 Acta Phys. Sin. 64 110503
魏德志, 陈福集, 郑小雪 2015 物理学报 64 110503

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3