Context-aware Attentional Pooling (CAP) for Fine-grained Visual Classification-Reference-Cited by-同舟云学术

Context-aware Attentional Pooling (CAP) for Fine-grained Visual Classification

Published:2021-05-18 Issue:2 Volume:35 Page:929-937
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Behera Ardhendu,Wharton Zachary,Hewage Pradeep R P G,Bera Asish

Abstract

Deep convolutional neural networks (CNNs) have shown a strong ability in mining discriminative object pose and parts information for image recognition. For fine-grained recognition, context-aware rich feature representation of object/scene plays a key role since it exhibits a significant variance in the same subcategory and subtle variance among different subcategories. Finding the subtle variance that fully characterizes the object/scene is not straightforward. To address this, we propose a novel context-aware attentional pooling (CAP) that effectively captures subtle changes via sub-pixel gradients, and learns to attend informative integral regions and their importance in discriminating different subcategories without requiring the bounding-box and/or distinguishable part annotations. We also introduce a novel feature encoding by considering the intrinsic consistency between the informativeness of the integral regions and their spatial structures to capture the semantic correlation among them. Our approach is simple yet extremely effective and can be easily applied on top of a standard classification backbone network. We evaluate our approach using six state-of-the-art (SotA) backbone networks and eight benchmark datasets. Our method significantly outperforms the SotA approaches on six datasets and is very competitive with the remaining two.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 49 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Fine-grained recognition via submodular optimization regulated progressive training;Pattern Recognition;2024-12

2. XMNet: XGBoost with Multitasking Network for Classification and Segmentation of Ultra-Fine-Grained Datasets;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

3. Discrete Structure Aggregation and Global-region Query-located Network for Fine-grained Visual Classification;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

4. TransFGVC: transformer-based fine-grained visual classification;The Visual Computer;2024-06-28

5. INT-FUP: Intuitionistic Fuzzy Pooling;Mathematics;2024-06-03