Improved Fine-Grained Image Classification in Few-Shot Learning Based on Channel-Spatial Attention and Grouped Bilinear Convolution

Author:

Zeng Ziwei1,Li Lihong1,Zhao Zoufei1,Liu Qingqing1

Affiliation:

1. Hebei University of Engineering

Abstract

Abstract

In the context of the complexities of fine-grained image classification intertwined with the constraints of few-shot learning, this paper focuses on overcoming the challenges posed by subtle inter-class differences. To enhance the model's capability to recognize key visual patterns, such as eyes and beaks, this research ingeniously integrates spatial and channel attention mechanisms along with grouped bilinear convolution techniques to adapt to the few-shot learning environment. Specifically, a novel neural network architecture is designed that integrates channel and spatial information, and interactively applies these two types of information to collaboratively optimize the weights of channel and spatial attention. Additionally, to further explore the complex dependencies among features, a grouped bilinear convolution strategy is introduced. This algorithm divides the weighted feature maps into multiple independent groups, where bilinear operations are performed within each group. This strategy captures higher-order feature interactions while reducing network parameters. Comprehensive experiments conducted on three fine-grained benchmark datasets for two few-shot tasks demonstrate the superiority of our algorithm in handling fine-grained features. Notably, in the experiments on the Stanford Cars dataset, a classification accuracy of 95.42% was achieved, confirming its effectiveness and applicability in few shot learning scenarios. Codes are available at: https://github.com/204503zzw/atb.

Publisher

Springer Science and Business Media LLC

Reference51 articles.

1. Self-reconstruction network for fine-grained few-shot classification;Li XX;Pattern Recognition,2024

2. Yang, L.F., Li, X., Song, R.J., Zhao, B.R., Tao, J.T., Zhou, S.H., Liang, J.J., Yang, J.: Dynamic mlp for fine-grained image classification by leveraging geographical and temporal information. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 10945–10954 (2022)

3. Jiang, J.J., Chen, Z.W., Lei, F.Y., Xu, L., Huang, J.H., Yuan,X.C.: Multi-Granularity Hypergraph Enhanced Hierarchical Neural Network Framework for Visual Classification. The Visual Computer (2024)

4. Revisiting Local and Global Descriptor-Based Metric Network for Few-Shot SAR Target Classification;Zheng J;IEEE Transactions on Geoscience and Remote Sensing,2024

5. Disentangled feature representation for few-shot image classification;Cheng H;IEEE Transactions on Neural Networks and Learning Systems,2023

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3