Authors:
Chen Hong-Yen, Su Chung-Yen
Abstract
Deep, complex neural network models can achieve high accuracy in image recognition. However, they require a large number of computations and model parameters, which makes them unsuitable for mobile and embedded devices. MobileNet was proposed to address this; it dramatically reduces both the number of parameters and the computational cost. The main idea of MobileNet is to use depthwise separable convolutions. Two hyper-parameters, a width multiplier and a resolution multiplier, are used to trade off accuracy against latency. In this paper, we propose a new architecture that improves on MobileNet. Instead of the resolution multiplier, we use a depth multiplier and combine it with either Fractional Max Pooling or standard max pooling. Experimental results on the CIFAR database show that the proposed architecture reduces the computational cost and increases the accuracy at the same time.
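To make the cost argument concrete, the following minimal sketch compares the multiply-accumulate count of a standard convolution with that of a depthwise separable convolution, and shows how the width and resolution multipliers scale it. The cost formulas follow the original MobileNet formulation; the function names and the example layer sizes are illustrative assumptions. The depth multiplier and pooling variants proposed in this paper are not modeled, because the abstract does not define them.

```python
def standard_conv_macs(k, m, n, f):
    """Multiply-accumulates of a k x k standard convolution with m input
    channels, n output channels, and an f x f output feature map."""
    return k * k * m * n * f * f


def separable_conv_macs(k, m, n, f, alpha=1.0, rho=1.0):
    """Multiply-accumulates of a depthwise separable convolution.

    alpha is the width multiplier (scales both channel counts) and rho is
    the resolution multiplier (scales the feature-map side length), as in
    the original MobileNet. Names here are hypothetical, for illustration.
    """
    m_s = int(alpha * m)  # thinned input channels
    n_s = int(alpha * n)  # thinned output channels
    f_s = int(rho * f)    # reduced feature-map side length
    depthwise = k * k * m_s * f_s * f_s  # one k x k filter per input channel
    pointwise = m_s * n_s * f_s * f_s    # 1 x 1 convolution mixing channels
    return depthwise + pointwise


if __name__ == "__main__":
    # Assumed example layer: 3 x 3 kernel, 32 -> 64 channels, 16 x 16 map.
    k, m, n, f = 3, 32, 64, 16
    full = standard_conv_macs(k, m, n, f)
    sep = separable_conv_macs(k, m, n, f)
    print("standard:", full)
    print("separable:", sep, "ratio:", sep / full)
    print("alpha=0.5:", separable_conv_macs(k, m, n, f, alpha=0.5))
    print("rho=0.5:", separable_conv_macs(k, m, n, f, rho=0.5))
```

With both multipliers set to 1, the ratio of separable to standard cost equals 1/n + 1/k², so a 3 x 3 layer with many output channels costs roughly one eighth to one ninth of its standard counterpart.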
Cited by
8 articles.
1. Reconstructing Pruned Filters using Cheap Spatial Transformations;2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW);2023-10-02
2. Heritage of India: Advanced Monuments Classification using Artificial Intelligence;2023 3rd International Conference on Computing and Information Technology (ICCIT);2023-09-13
3. FIDGAN: A Generative Adversarial Network with An Inception Distance;2023 International Conference on Artificial Intelligence in Information and Communication (ICAIIC);2023-02-20
4. A High Performance FPGA-based Accelerator for MobileNet;2022 International Conference on Informatics, Networking and Computing (ICINC);2022-10
5. Less Is More: Matched Wavelet Pooling-Based Light-Weight CNNs With Application to Image Classification;IEEE Access;2022