Attention-Assisted Feature Comparison and Feature Enhancement for Class-Agnostic Counting
Author:
Dong Liang1, Yu Yian1, Zhang Di1, Huo Yan1
Affiliation:
1. College of Information Engineering, Shenyang University, Shenyang 110044, China
Abstract
In this study, we address the class-agnostic counting (CAC) challenge, aiming to count instances in a query image, using just a few exemplars. Recent research has shifted towards few-shot counting (FSC), which involves counting previously unseen object classes. We present ACECount, an FSC framework that combines attention mechanisms and convolutional neural networks (CNNs). ACECount identifies query image–exemplar similarities, using cross-attention mechanisms, enhances feature representations with a feature attention module, and employs a multi-scale regression head, to handle scale variations in CAC. ACECount’s experiments on the FSC-147 dataset exhibited the expected performance. ACECount achieved a reduction of 0.3 in the mean absolute error (MAE) on the validation set and a reduction of 0.26 on the test set of FSC-147, compared to previous methods. Notably, ACECount also demonstrated convincing performance in class-specific counting (CSC) tasks. Evaluation on crowd and vehicle counting datasets revealed that ACECount surpasses FSC algorithms like GMN, FamNet, SAFECount, LOCA, and SPDCN, in terms of performance. These results highlight the robust dataset generalization capabilities of our proposed algorithm.
Funder
the China Postdoctoral Science Foundation the PhD Research Startup Foundation project of Liaoning Province of China the funding project of Northeast Geological S&T Innovation Center of China Geological Survey
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference46 articles.
1. Class-Agnostic Counting;Lu;Lecture Notes in Computer Science, Proceedings of the 14th Asian Conference on Computer Vision (ACCV), Perth, Australia, 2–6 December 2018,2019 2. Shi, M., Lu, H., Feng, C., Liu, C.X., and Cao, Z.G. (2022, January 18–24). Represent, Compare, and Learn: A Similarity-Aware Framework for Class-Agnostic Counting. Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA. 3. You, Z.Y., Yang, K., Luo, W.H., Lu, X., Cui, L., and Le, X.Y. (2023, January 2–7). Few-shot Object Counting with Similarity-Aware Feature Enhancement. Proceedings of the 23rd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, USA. 4. Huo, Y., Gang, S., and Guan, C. (2023). FCIHMRT: Feature Cross-Layer Interaction Hybrid Method Based on Res2Net and Transformer for Remote Sensing Scene Classification. Electronics, 12. 5. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., and Polosukhin, I. (2017, January 4–9). Attention Is All You Need. Proceedings of the 31st Annual Conference on Neural Information Processing Systems (NIPS), Long Beach, CA, USA.
|
|