Author:
Huang Ying, Gu Chang-Gui, Yang Hui-Jie
Abstract
As real-world problems grow more complex, the size of deep learning neural networks, including the number of layers, neurons, and connections, is growing explosively. Optimizing hyperparameters to improve the prediction performance of neural networks has become an important task. In the literature, methods for finding optimal parameters, such as sensitivity pruning and grid search, are complicated and computationally expensive. In this paper, a hyperparameter optimization strategy called junk neuron deletion is proposed. A neuron with a small mean weight in the weight matrix can be ignored in the prediction and is therefore defined as a junk neuron. The strategy obtains a simplified network structure by deleting the junk neurons, which effectively shortens the computation time and improves the prediction accuracy and the generalization capability of the model. An LSTM model is trained on time series data generated by the Logistic, Henon, and Rossler dynamical systems, and a relatively optimal parameter combination is obtained by grid search with a certain step length. Under this parameter combination, the part of the weight matrix that can influence the model output is extracted, and the neurons with smaller mean weights are eliminated using different thresholds. It is found that with a mean weight value of 0.1 as the threshold, identifying and deleting junk neurons significantly improves the prediction efficiency. As the threshold increases, the accuracy gradually falls back to the initial level, but more operating costs are saved for the same prediction performance. Further reduction of the network leads to prediction ability below the initial level because of underfitting. Using this strategy, the prediction performance of the LSTM model for several typical chaotic dynamical systems is improved significantly.
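The thresholding step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that each row of the weight matrix holds the weights of one neuron, that "mean weight" means the mean of the absolute weights, and that a neuron is kept when its score is at least the threshold. The names `mean_abs_weights` and `delete_junk_neurons` and the toy matrix are hypothetical.

```python
from typing import List, Tuple

# Assumed layout: one row per neuron, one column per outgoing weight.
Matrix = List[List[float]]


def mean_abs_weights(weights: Matrix) -> List[float]:
    """Return the mean absolute weight of each neuron (one score per row)."""
    return [sum(abs(w) for w in row) / len(row) for row in weights]


def delete_junk_neurons(
    weights: Matrix, threshold: float = 0.1
) -> Tuple[Matrix, List[int]]:
    """Drop rows whose mean absolute weight is below `threshold`.

    Returns the pruned matrix and the indices of the neurons that were kept.
    The default of 0.1 mirrors the threshold quoted in the abstract.
    """
    scores = mean_abs_weights(weights)
    kept = [i for i, score in enumerate(scores) if score >= threshold]
    return [weights[i] for i in kept], kept


# Toy 4-neuron layer (made-up numbers): neurons 1 and 3 have small mean weights.
toy = [
    [0.50, -0.40, 0.30],
    [0.02, -0.01, 0.03],
    [-0.20, 0.25, 0.15],
    [0.05, 0.00, -0.04],
]
pruned, kept = delete_junk_neurons(toy, threshold=0.1)
print(kept)         # indices of the neurons that survive the threshold
print(len(pruned))  # number of rows left in the simplified layer
```

In a real LSTM the weights are split across input, recurrent, and gate blocks, so choosing which part of the matrix to score is a separate step. The abstract says only that the part influencing the model output is extracted.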
Publisher
Acta Physica Sinica, Chinese Physical Society and Institute of Physics, Chinese Academy of Sciences
Subject
General Physics and Astronomy