Adaptive Modular Convolutional Neural Network for Image Recognition

Author:

Wu Wenbo,Pan Yun

Abstract

Image recognition has long been one of the research hotspots in computer vision tasks. The development of deep learning is rapid in recent years, and convolutional neural networks usually need to be designed with fixed resources. If sufficient resources are available, the model can be scaled up to achieve higher accuracy, for example, VggNet, ResNet, GoogLeNet, etc. Although the accuracy of large-scale models has been improved, the following problems will occur with the expansion of model scale: (1) There may be over-fitting; (2) increasing model parameters; (3) slow model convergence. This paper proposes a design method for a modular convolutional neural network model which solves the problem of over-fitting and large model parameters by connecting multiple modules in parallel. Moreover, each module contains several submodules (three submodules in this paper) and fuses the features extracted from the submodules. The model convergence can be accelerated by using the fused features (the fused features contain more image information). In this study, we add a gate unit based on the attention mechanism to the model, which aims to optimize the structure of the model (select the optimal number of modules), allowing the model to select an optimum network structure by learning and dynamically reducing FLOPs (floating-point operations per second) of the model. Compared to VggNet, ResNet, and GoogLeNet, the structure of the model proposed in this paper is simple and the parameters are small. The proposed model achieves good results in the Kaggle datasets Cats-vs.-Dogs (99.3%), 10-Monkey Species (99.26%), and Birds-400 (99.13%).

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Reference40 articles.

1. Reducing the Dimensionality of Data with Neural Networks

2. Very deep convolutional networks for large-scale image recognition;Simonyan;arXiv,2014

3. Deep residual learning for image recognition;He;Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition,2016

4. Wide Residual Networks;Zagoruyko;Proceedings of the British Machine Vision Conference 2016,2016

5. Efficientnet: Rethinking model scaling for convolutional neural networks;Tan;Proceedings of the International Conference on Machine Learning, PMLR,2019

Cited by 10 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Bird species recognition using transfer learning with a hybrid hyperparameter optimization scheme (HHOS);Ecological Informatics;2024-05

2. A Deep Learning Approach for Detection and Classification of Ten Species of Monkeys;2023 International Conference on Smart Systems for applications in Electrical Sciences (ICSSES);2023-07-07

3. Multi-agent Knowledge Transfer in a Society of Interpretable Neural Network Minds for Dynamic Context Formation in Swarm Shepherding;2023 International Joint Conference on Neural Networks (IJCNN);2023-06-18

4. A Comparative Study on Different Transfer Learning Approaches for Identification of Plant Diseases;2023 International Conference on Next-Generation Computing, IoT and Machine Learning (NCIM);2023-06-16

5. A Transfer Learning Approach to Bird Species Recognition using MobileNetV2;2023 7th International Conference on Intelligent Computing and Control Systems (ICICCS);2023-05-17

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3