A New Multi-Scale Convolutional Model Based on Multiple Attention for Image Classification-Reference-Cited by-同舟云学术

A New Multi-Scale Convolutional Model Based on Multiple Attention for Image Classification

Published:2019-12-20 Issue:1 Volume:10 Page:101
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Yang Yadong^ORCID,Xu Chengji^ORCID,Dong Feng,Wang Xiaofeng

Abstract

Computer vision systems are insensitive to the scale of objects in natural scenes, so it is important to study the multi-scale representation of features. Res2Net implements hierarchical multi-scale convolution in residual blocks, but its random grouping method affects the robustness and intuitive interpretability of the network. We propose a new multi-scale convolution model based on multiple attention. It introduces the attention mechanism into the structure of a Res2-block to better guide feature expression. First, we adopt channel attention to score channels and sort them in descending order of the feature’s importance (Channels-Sort). The sorted residual blocks are grouped and intra-block hierarchically convolved to form a single attention and multi-scale block (AMS-block). Then, we implement channel attention on the residual small blocks to constitute a dual attention and multi-scale block (DAMS-block). Introducing spatial attention before sorting the channels to form multi-attention multi-scale blocks(MAMS-block). A MAMS-convolutional neural network (CNN) is a series of multiple MAMS-blocks. It enables significant information to be expressed at more levels, and can also be easily grafted into different convolutional structures. Limited by hardware conditions, we only prove the validity of the proposed ideas through convolutional networks of the same magnitude. The experimental results show that the convolution model with an attention mechanism and multi-scale features is superior in image classification.

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/10/1/101/pdf

Reference54 articles.

1. Recent Advances of Generative Adversarial Networks in Computer Vision

2. Real-time visual tracking by deep reinforced decision making

3. Multiple relations extraction among multiple entities in unstructured text

4. Unpaired Speech Enhancement by Acoustic and Adversarial Supervision for Speech Recognition

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. VT-3DCapsNet: Visual tempos 3D-Capsule network for video-based facial expression recognition;PLOS ONE;2024-08-23

2. Using Hybrid Models of AI for Identification of Trees by UAV Images of Forests: I. Machine-learning Component of the Models;WSEAS TRANSACTIONS ON SIGNAL PROCESSING;2024-07-04

3. MEDMCN: a novel multi-modal EfficientDet with multi-scale CapsNet for object detection;The Journal of Supercomputing;2024-02-23

4. A Multi-Scaling Reinforcement Learning Trading System Based on Multi-Scaling Convolutional Neural Networks;Mathematics;2023-05-27

5. A Novel Convolutional Neural Networks for Stock Trading Based on DDQN Algorithm;IEEE Access;2023