Author:
Guo Meng-Hao,Xu Tian-Xing,Liu Jiang-Jiang,Liu Zheng-Ning,Jiang Peng-Tao,Mu Tai-Jiang,Zhang Song-Hai,Martin Ralph R.,Cheng Ming-Ming,Hu Shi-Min
Abstract
AbstractHumans can naturally and effectively find salient regions in complex scenes. Motivated by this observation, attention mechanisms were introduced into computer vision with the aim of imitating this aspect of the human visual system. Such an attention mechanism can be regarded as a dynamic weight adjustment process based on features of the input image. Attention mechanisms have achieved great success in many visual tasks, including image classification, object detection, semantic segmentation, video understanding, image generation, 3D vision, multimodal tasks, and self-supervised learning. In this survey, we provide a comprehensive review of various attention mechanisms in computer vision and categorize them according to approach, such as channel attention, spatial attention, temporal attention, and branch attention; a related repositoryhttps://github.com/MenghaoGuo/Awesome-Vision-Attentionsis dedicated to collecting related work. We also suggest future directions for attention mechanism research.
Publisher
Springer Science and Business Media LLC
Subject
Artificial Intelligence,Computer Graphics and Computer-Aided Design,Computer Vision and Pattern Recognition
Cited by
1131 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献