Multibranch Attention Mechanism Based on Channel and Spatial Attention Fusion-Reference-Cited by-同舟云学术

Multibranch Attention Mechanism Based on Channel and Spatial Attention Fusion

Published:2022-11-06 Issue:21 Volume:10 Page:4150
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Mao Guojun,Liao Guanyi,Zhu Hengliang,Sun Bo

Abstract

Recently, it has been demonstrated that the performance of an object detection network can be improved by embedding an attention module into it. In this work, we propose a lightweight and effective attention mechanism named multibranch attention (M3Att). For the input feature map, our M3Att first uses the grouped convolutional layer with a pyramid structure for feature extraction, and then calculates channel attention and spatial attention simultaneously and fuses them to obtain more complementary features. It is a “plug and play” module that can be easily added to the object detection network and significantly improves the performance of the object detection network with a small increase in parameters. We demonstrate the effectiveness of M3Att on various challenging object detection tasks, including PASCAL VOC2007, PASCAL VOC2012, KITTI, and Zhanjiang Underwater Robot Competition. The experimental results show that this method dramatically improves the object detection effect, especially for the PASCAL VOC2007, and the mapping index of the original network increased by 4.93% when embedded in the YOLOV4 (You Only Look Once v4) network.

Funder

National Natural Science Foundation of China

National Key Research and Development Program of China

Science and Technology Project of Fujian University of Technology

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/10/21/4150/pdf

Reference37 articles.

1. Recent advances in deep learning for object detection;Wu;Neurocomputing,2020

2. Zou, Z., Shi, Z., Guo, Y., and Ye, J. Object detection in 20 years: A survey. arXiv, 2019.

3. A novel nonlocal-aware pyramid and multiscale multitask refinement detector for object detection in remote sensing images;Huang;IEEE Trans. Geosci. Remote Sens.,2021

4. Guo, M., Xu, T., Liu, J., Liu, Z., Jiang, P., Mu, T., Zhang, S., Martin, R., Cheng, M., and Hu, S. Attention mechanisms in computer vision: A survey. arXiv, 2021.

5. Hu, J., Shen, L., and Sun, G. Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Hand-object pose estimation based on HOPE-net and attention mechanism;Journal of Physics: Conference Series;2024-08-01

2. Global context aware dual channel pyramid model for robust image shadow removal;Engineering Applications of Artificial Intelligence;2024-07

3. Tire defect detection based on low and high-level feature fusion;Measurement Science and Technology;2024-06-04

4. DiffusionFR: Species Recognition of Fish in Blurry Scenarios via Diffusion and Attention;Animals;2024-02-02

5. Dermoscopic Image Classification Using Attention Mechanism and Ensemble Learning Approaches;2023 IEEE International Conference on Big Data (BigData);2023-12-15