Abstract
This paper compares the performance of popular convolutional neural network (CNN) architectures for image classification on the CIFAR-10 dataset. The comparison covers Inception V3, Inception-ResNet-v2, ResNet V1 and V2, ResNeXt, MobileNet, and DenseNet, each combined with two attention mechanisms: the Convolutional Block Attention Module (CBAM) and Squeeze-and-Excitation (SE). CBAM and SE are expected to improve CNN performance, particularly on complex images containing multiple objects and cluttered backgrounds. The models are evaluated using loss and accuracy. The main focus of this study is to identify the most effective CNN architecture for image classification on CIFAR-10 when attention mechanisms are added, to compare the accuracy of the architectures with and without attention, and to identify the critical differences between them in their ability to handle complex images. The findings could inform the development of advanced CNN architectures that improve the accuracy of computer vision systems across a range of applications.
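To make the attention mechanisms named above concrete, the following is a minimal sketch of a Squeeze-and-Excitation block in PyTorch. It is illustrative only and not the exact configuration used in the paper; the class name, reduction ratio of 16, and framework choice are assumptions.

```python
import torch
import torch.nn as nn


class SEBlock(nn.Module):
    """Squeeze-and-Excitation: channel-wise attention via global pooling.

    Hypothetical minimal sketch, not the paper's exact implementation.
    """

    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction, bias=False),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels, bias=False),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, _, _ = x.shape
        # Squeeze: global average pooling yields one descriptor per channel.
        s = x.mean(dim=(2, 3))
        # Excitation: two fully connected layers produce per-channel gates in (0, 1).
        w = self.fc(s).view(b, c, 1, 1)
        # Reweight the feature map channel-wise.
        return x * w


# Usage example on a CIFAR-10-sized feature map (batch of 8, 64 channels, 32x32).
features = torch.randn(8, 64, 32, 32)
out = SEBlock(channels=64)(features)
print(out.shape)  # torch.Size([8, 64, 32, 32])
```

CBAM extends this idea by applying a similar channel-attention step followed by a spatial-attention map computed from pooled feature statistics.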
Publisher
Research Square Platform LLC
Cited by
2 articles.