Adaptive Modular Convolutional Neural Network for Image Recognition

Published: 2022-07-22
Volume: 22
Issue: 15
Page: 5488
ISSN: 1424-8220
Container-title: Sensors
Language: en
Short-container-title: Sensors

Author:

Wu Wenbo, Pan Yun

Abstract

Image recognition has long been a research hotspot in computer vision. Deep learning has developed rapidly in recent years, and convolutional neural networks are usually designed under a fixed resource budget. When more resources are available, a model can be scaled up to achieve higher accuracy, as with VggNet, ResNet, and GoogLeNet. Although large-scale models are more accurate, scaling up a model brings the following problems: (1) possible over-fitting; (2) a growing number of parameters; (3) slow convergence. This paper proposes a design method for a modular convolutional neural network that addresses over-fitting and large parameter counts by connecting multiple modules in parallel. Each module contains several submodules (three in this paper) and fuses the features they extract. Because the fused features contain more image information, using them accelerates convergence. We also add a gate unit based on the attention mechanism, which optimizes the model structure by selecting the optimal number of modules, so that the model learns an optimal network structure and dynamically reduces its FLOPs (floating-point operations). Compared with VggNet, ResNet, and GoogLeNet, the proposed model has a simple structure and few parameters. It achieves good results on the Kaggle datasets Cats-vs.-Dogs (99.3%), 10-Monkey Species (99.26%), and Birds-400 (99.13%).
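
The abstract gives no implementation details, so the following is only a minimal sketch of the idea it describes: parallel modules, three submodules per module, feature fusion, and a gate that decides how many modules run. Everything concrete here is an assumption made for illustration, not the authors' design: linear maps with ReLU stand in for convolutional submodules, fusion is done by concatenation, the gate is a per-module sigmoid score with a fixed threshold, and all names (`submodule`, `module_forward`, `gated_forward`, `threshold`) are hypothetical.

```python
import math
import random


def submodule(x, weights):
    """Stand-in feature extractor (assumed): one linear map followed by ReLU."""
    return [max(0.0, sum(w * v for w, v in zip(row, x))) for row in weights]


def module_forward(x, sub_weights):
    """Run every submodule and fuse their features by concatenation (assumed)."""
    fused = []
    for weights in sub_weights:
        fused.extend(submodule(x, weights))
    return fused


def gate_scores(x, gate_weights):
    """One sigmoid score per module, computed from the input alone (assumed)."""
    return [
        1.0 / (1.0 + math.exp(-sum(w * v for w, v in zip(row, x))))
        for row in gate_weights
    ]


def gated_forward(x, modules, gate_weights, threshold=0.5):
    """Evaluate only the modules whose gate score reaches the threshold."""
    scores = gate_scores(x, gate_weights)
    active = [i for i, s in enumerate(scores) if s >= threshold]
    if not active:
        # Always keep the highest-scoring module so the output is defined.
        active = [max(range(len(scores)), key=scores.__getitem__)]
    out_dim = sum(len(weights) for weights in modules[0])
    output = [0.0] * out_dim
    for i in active:
        # Skipped modules are never evaluated, which is where work is saved.
        feats = module_forward(x, modules[i])
        output = [o + scores[i] * f for o, f in zip(output, feats)]
    return output, active


def random_matrix(rng, rows, cols):
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
IN_DIM, FEAT_DIM, N_MODULES, N_SUB = 8, 4, 4, 3
modules = [
    [random_matrix(rng, FEAT_DIM, IN_DIM) for _ in range(N_SUB)]
    for _ in range(N_MODULES)
]
gate_weights = random_matrix(rng, N_MODULES, IN_DIM)
x = [rng.uniform(-1.0, 1.0) for _ in range(IN_DIM)]

output, active = gated_forward(x, modules, gate_weights)
print("fused feature length:", len(output))
print("modules evaluated:", active)
```

In this sketch the gate is scored before any module runs, so a module that falls below the threshold costs nothing beyond its gate score. In a trained network the gate weights would be learned jointly with the modules; here they are random and only show the control flow.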

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/22/15/5488/pdf

References: 40 articles.

1. Reducing the Dimensionality of Data with Neural Networks

2. Very deep convolutional networks for large-scale image recognition; Simonyan; arXiv, 2014

3. Deep residual learning for image recognition; He; Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016

4. Wide Residual Networks; Zagoruyko; Proceedings of the British Machine Vision Conference 2016, 2016

5. EfficientNet: Rethinking model scaling for convolutional neural networks; Tan; Proceedings of the International Conference on Machine Learning, PMLR, 2019

Cited by 10 articles.

1. Bird species recognition using transfer learning with a hybrid hyperparameter optimization scheme (HHOS); Ecological Informatics; 2024-05

2. A Deep Learning Approach for Detection and Classification of Ten Species of Monkeys; 2023 International Conference on Smart Systems for applications in Electrical Sciences (ICSSES); 2023-07-07

3. Multi-agent Knowledge Transfer in a Society of Interpretable Neural Network Minds for Dynamic Context Formation in Swarm Shepherding; 2023 International Joint Conference on Neural Networks (IJCNN); 2023-06-18

4. A Comparative Study on Different Transfer Learning Approaches for Identification of Plant Diseases; 2023 International Conference on Next-Generation Computing, IoT and Machine Learning (NCIM); 2023-06-16

5. A Transfer Learning Approach to Bird Species Recognition using MobileNetV2; 2023 7th International Conference on Intelligent Computing and Control Systems (ICICCS); 2023-05-17