Filtration and Distillation: Enhancing Region Attention for Fine-Grained Visual Categorization

Published:2020-04-03 Issue:07 Volume:34 Page:11555-11562
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title:AAAI

Author:

Liu Chuanbin,Xie Hongtao,Zha Zheng-Jun,Ma Lingfeng,Yu Lingyun,Zhang Yongdong

Abstract

Delicate attention to discriminative regions plays a critical role in Fine-Grained Visual Categorization (FGVC). Unfortunately, most existing attention models perform poorly in FGVC because of two pivotal limitations, one in proposing discriminative regions and one in region-based feature learning. 1) The discriminative regions are predominantly located based on filter responses over the images, which cannot be directly optimized with a performance metric. 2) Existing methods train the region-based feature extractor individually as a one-hot classification task, neglecting the knowledge from the entire object. To address these issues, we propose a novel “Filtration and Distillation Learning” (FDL) model to enhance the region attention of discriminative parts for FGVC. Firstly, a Filtration Learning (FL) method is put forward for proposing discriminative part regions based on the matchability between proposing and predicting. Specifically, we utilize the proposing-predicting matchability as the performance metric of the Region Proposal Network (RPN), thus enabling direct optimization of the RPN to filtrate the most discriminative regions. Secondly, for distillation, the object-based feature learning and region-based feature learning are formulated as “teacher” and “student”, respectively, which furnishes better supervision for region-based feature learning. Accordingly, our FDL can enhance the region attention effectively, and the overall framework can be trained end-to-end without either object or part annotations. Extensive experiments verify that FDL yields state-of-the-art performance on several FGVC tasks when compared with the most competitive approaches under the same backbone.
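The abstract does not give the loss used for the “teacher” and “student” formulation. As an illustration only, the following minimal sketch shows a generic temperature-softened knowledge-distillation loss, in which object-level logits act as the teacher and region-level logits as the student. The function names, the `temperature` and `alpha` parameters, and the way the two terms are combined are assumptions made for this example, not details taken from the paper.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, label,
                      temperature=2.0, alpha=0.5):
    """Hypothetical loss: hard-label cross-entropy plus KL(teacher || student).

    teacher_logits: class scores from the object-level branch (assumed).
    student_logits: class scores from the region-level branch (assumed).
    label:          index of the ground-truth class.
    """
    # Ordinary one-hot cross-entropy on the student's unsoftened prediction.
    hard = -math.log(softmax(student_logits)[label])

    # KL divergence between the temperature-softened distributions.
    t = softmax(teacher_logits, temperature)
    s = softmax(student_logits, temperature)
    soft = sum(ti * math.log(ti / si) for ti, si in zip(t, s) if ti > 0)

    # The temperature**2 factor is the usual rescaling of the soft term.
    return (1 - alpha) * hard + alpha * temperature ** 2 * soft


region_logits = [2.0, 0.5, -1.0]
object_logits = [1.5, 1.0, -0.5]
loss = distillation_loss(object_logits, region_logits, label=0)
```

In this sketch the soft term vanishes when the region-level prediction already matches the object-level one, so the student is penalized only where it disagrees with the teacher or with the ground-truth label.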

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 86 articles.

1. DACBN: Dual attention convolutional broad network for fine-grained visual recognition;Pattern Recognition;2024-12

2. Enhance Fine-Grained Visual Classification with Attention-Guided Region Selection and Contrastive Feature Alignment;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

3. TransFGVC: transformer-based fine-grained visual classification;The Visual Computer;2024-06-28

4. Multi-level information fusion Transformer with background filter for fine-grained image recognition;Applied Intelligence;2024-06-20

5. SwinFG: A fine-grained recognition scheme based on swin transformer;Expert Systems with Applications;2024-06