Affiliation:
1. Biocomputing and Developmental Systems Research Group, University of Limerick, V94 T9PX Limerick, Ireland
Abstract
Neural networks have revolutionised the way we approach problem solving across multiple domains; however, their effective design and efficient use of computational resources is still a challenging task. One of the most important factors influencing this process is model hyperparameters which vary significantly with models and datasets. Recently, there has been an increased focus on automatically tuning these hyperparameters to reduce complexity and to optimise resource utilisation. From traditional human-intuitive tuning methods to random search, grid search, Bayesian optimisation, and evolutionary algorithms, significant advancements have been made in this direction that promise improved performance while using fewer resources. In this article, we propose HyperGE, a two-stage model for automatically tuning hyperparameters driven by grammatical evolution (GE), a bioinspired population-based machine learning algorithm. GE provides an advantage in that it allows users to define their own grammar for generating solutions, making it ideal for defining search spaces across datasets and models. We test HyperGE to fine-tune VGG-19 and ResNet-50 pre-trained networks using three benchmark datasets. We demonstrate that the search space is significantly reduced by a factor of ~90% in Stage 2 with fewer number of trials. HyperGE could become an invaluable tool within the deep learning community, allowing practitioners greater freedom when exploring complex problem domains for hyperparameter fine-tuning.
Funder
Science Foundation Ireland
Subject
Computational Mathematics,Computational Theory and Mathematics,Numerical Analysis,Theoretical Computer Science
Reference41 articles.
1. Kshirsagar, M., More, T., Lahoti, R., Adgaonkar, S., Jain, S., and Ryan, C. (2022, January 3–5). Rethinking Traffic Management with Congestion Pricing and Vehicular Routing for Sustainable and Clean Transport. Proceedings of the 14th International Conference on Agents and Artificial Intelligence—Volume 3: ICAART, Online.
2. Bahja, M. (2020). E-Business-Higher Education and Intelligence Applications, BoD–Books on Demand.
3. Recurrent Neural Networks for Time Series Forecasting: Current status and future directions;Hewamalage;Int. J. Forecast.,2021
4. Xiao, Y., Wu, L., Guo, J., Li, J., Zhang, M., Qin, T., and Liu, T.Y. (2023). A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond. IEEE Trans. Pattern Anal. Mach. Intell., 1–20.
5. An effective algorithm for hyperparameter optimisation of neural networks;Diaz;IBM J. Res. Dev.,2017