Authors:
Chen Zhiwei, Wang Changan, Wang Yabiao, Jiang Guannan, Shen Yunhang, Tai Ying, Wang Chengjie, Zhang Wei, Cao Liujuan
Abstract
Weakly supervised object localization (WSOL) aims to learn an object localizer using only image-level labels. Convolutional neural network (CNN) based techniques often highlight only the most discriminative part of an object while ignoring its full extent. Recently, the transformer architecture has been applied to WSOL to capture long-range feature dependencies through its self-attention mechanism and multilayer perceptron structure. Nevertheless, transformers lack the locality inductive bias inherent to CNNs and may therefore deteriorate local feature details in WSOL. In this paper, we propose a novel framework built upon the transformer, termed LCTR (Local Continuity TRansformer), which aims to enhance the local perception capability of global features among long-range feature dependencies. To this end, we propose a relational patch-attention module (RPAM), which considers cross-patch information on a global basis. We further design a cue digging module (CDM), which utilizes local features to guide the model toward highlighting weak local responses. Finally, comprehensive experiments are carried out on two widely used datasets, i.e., CUB-200-2011 and ILSVRC, to verify the effectiveness of our method.
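The abstract describes RPAM only at a high level, so below is a minimal, hypothetical PyTorch sketch of the cross-patch relational idea: patch tokens from a vision transformer are reweighted by their global pairwise affinities so that weakly responding but related patches are reinforced. The class name, interface, and exact computation are assumptions for illustration, not the paper's published implementation.

```python
# Hypothetical sketch of the cross-patch relational idea behind RPAM.
# Module name and computation are assumed; only the high-level goal
# (considering cross-patch information on a global basis) comes from
# the abstract.
import torch
import torch.nn as nn

class RelationalPatchAttention(nn.Module):
    """Reweights patch tokens by their global pairwise affinities.

    Assumed interface: tokens of shape (B, N, D) from a vision
    transformer backbone, where N is the number of image patches
    and D is the embedding dimension.
    """
    def __init__(self, dim: int):
        super().__init__()
        self.proj = nn.Linear(dim, dim)

    def forward(self, tokens: torch.Tensor) -> torch.Tensor:
        # Pairwise patch affinities over all patches: (B, N, N).
        q = self.proj(tokens)
        scale = tokens.shape[-1] ** 0.5
        affinity = torch.softmax(q @ tokens.transpose(1, 2) / scale, dim=-1)
        # Each patch is refined by the patches it relates to, so weak
        # local responses tied to strong ones are amplified rather
        # than suppressed.
        return tokens + affinity @ tokens

if __name__ == "__main__":
    x = torch.randn(2, 196, 384)  # e.g., 14x14 patches, DeiT-S-sized dim
    out = RelationalPatchAttention(384)(x)
    print(out.shape)  # torch.Size([2, 196, 384])
```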
Publisher
Association for the Advancement of Artificial Intelligence (AAAI)
Cited by
20 articles.