Author:
Chantrapornchai Chantana,Kajkamhaeng Supasit,Romphet Phattharaphon
Abstract
AbstractConvolutional Neural Network (CNN) models have been commonly used primarily in image recognition tasks in the deep learning area. Finding the right architecture needs a lot of hand-tune experiments which are time-consuming. In this paper, we exploit an AutoML framework that adds to the exploration of the micro-architecture block and the multi-input option. The proposed adaption has been applied to SqueezeNet with SE blocks combined with the residual block combinations. The experiments assume three search strategies: Random, Hyperband, and Bayesian algorithms. Such combinations can lead to solutions with superior accuracy while the model size can be monitored. We demonstrate the application of the approach against benchmarks: CIFAR-10 and Tsinghua Facial Expression datasets. The searches allow the designer to find the architectures with better accuracy than the traditional architectures without hand-tune efforts. For example, CIFAR-10, leads to the SqueezeNet architecture using only 4 fire modules with 59% accuracy. When exploring SE block insertion, the model with good insertion points can lead to an accuracy of 78% while the traditional SqueezeNet can achieve an accuracy of around 50%. For other tasks, such as facial expression recognition, the proposed approach can lead up to an accuracy of 71% with the proper insertion of SE blocks, the appropriate number of fire modules, and adequate input merging, while the traditional model can achieve the accuracy under 20%.
Funder
Office of National Higher Education Science Research and Innovation Policy Council
National Research Council of Thailand, Thailand Research Fund
Kasetsart University Research and Development Institute
Faculty of Engineering, Kasetsart University
NVIDIA Hardware grant
Publisher
Springer Science and Business Media LLC
Reference41 articles.
1. Iandola, F.N., Han, S., Moskewicz, M.W., Ashraf, K., Dally, W.J., Keutzer, K.: SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and $$<$$0.5MB model size. arXiv:1602.07360 (2016)
2. Goodfellow, I. J., Bengio, Y. & Courville, A. Deep Learning (MIT Press, 2016).
3. Song, G., Long, Q., Luo, Y., Wang, Y. & Jin, Y. Deep convolutional neural network based medical concept normalization. IEEE Trans. Big Datahttps://doi.org/10.1109/TBDATA.2020.3021389 (2020).
4. Yan, Q. et al. Covid-19 chest CT image segmentation network by multi-scale fusion and enhancement operations. IEEE Trans. Big Data 7(1), 13–24. https://doi.org/10.1109/TBDATA.2021.3056564 (2021).
5. Babiker, M.A.A., Elawad, M.A.O., Ahmed, A.H.M.: Convolutional neural network for a self-driving car in a virtual environment. In: 2019 International Conference on Computer, Control, Electrical, and Electronics Engineering (ICCCEEE), pp. 1–6 (2019). https://doi.org/10.1109/ICCCEEE46830.2019.9070826
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献