Generalized Category Discovery in Aerial Image Classification via Slot Attention
Author:
Zhou Yifan (1), Zhu Haoran (1), Zhang Yan (1), Liang Shuo (2), Wang Yujing (2), Yang Wen (1)
Affiliation:
1. School of Electronic Information, Wuhan University, Wuhan 430072, China
2. The 54th Research Institution of CETC, Shijiazhuang 050081, China
Abstract
Aerial images record the dynamic terrain of the Earth and reflect changes in land cover caused by natural processes and human activities. Most existing aerial image classification methods, however, operate in a closed-set setting and therefore struggle to identify newly emerging scenes. To address this, this paper studies Generalized Category Discovery (GCD) for aerial images: given a dataset that contains both labeled and unlabeled images, the goal is to classify every image in the unlabeled subset, where an unlabeled image may belong either to one of the labeled classes or to a novel class. We first build a contrastive learning framework based on state-of-the-art GCD algorithms. Because aerial images typically contain multiple objects, we then propose a slot attention-based GCD training process (Slot-GCD) that performs contrastive learning at both the object level and the image level. Slot-GCD uses slots to decouple multiple local object features from the feature map, then reconstructs the overall semantic feature of the image from the slot confidence scores and the feature map. Finally, the object-level and image-level features are fed into the contrastive learning module so that the model learns more precise image semantic features. Evaluations on three public aerial image datasets show that our approach outperforms state-of-the-art methods. In particular, on the AID dataset Slot-GCD achieves a recognition accuracy of 91.5% on known (old) classes and 81.9% on novel classes.
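The two feature-extraction steps named in the abstract (decoupling object features with slots, then rebuilding an image-level feature from slot confidence scores and the feature map) can be illustrated with a minimal sketch. This is not the authors' implementation: it is a stripped-down slot attention loop without the learned projections and recurrent update used in full slot attention, and the confidence definition (each slot's share of the total attention mass) and the pooling rule are assumptions made purely for illustration. The names `slot_attention` and `image_feature` are hypothetical.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def slot_attention(features, num_slots, iters=3, seed=0):
    """Simplified slot attention (no learned projections, no GRU update).

    features: list of N vectors of dimension D (a flattened feature map).
    Returns (slots, attn), where attn[i][k] is the weight of slot k at
    location i, taken from the final iteration.
    """
    if iters < 1:
        raise ValueError("iters must be at least 1")
    rng = random.Random(seed)
    dim = len(features[0])
    scale = 1.0 / math.sqrt(dim)
    # Assumption: slots start from random Gaussian vectors.
    slots = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_slots)]
    for _ in range(iters):
        # Slots compete for each location: softmax over the slot axis.
        attn = [softmax([scale * dot(f, s) for s in slots]) for f in features]
        # Each slot becomes the attention-weighted mean of all locations.
        new_slots = []
        for k in range(num_slots):
            weights = [row[k] for row in attn]
            total = sum(weights) + 1e-8
            new_slots.append(
                [sum(w * f[d] for w, f in zip(weights, features)) / total
                 for d in range(dim)]
            )
        slots = new_slots
    return slots, attn


def image_feature(features, attn):
    """Pool the feature map into one vector using slot confidences.

    Assumption: a slot's confidence is its share of the total attention
    mass; each location is weighted by the confidence-weighted attention
    it receives, and the image feature is the weighted mean of locations.
    """
    n = len(features)
    num_slots = len(attn[0])
    dim = len(features[0])
    conf = [sum(row[k] for row in attn) / n for k in range(num_slots)]
    loc_w = [dot(conf, row) for row in attn]
    total = sum(loc_w)
    pooled = [sum(w * f[d] for w, f in zip(loc_w, features)) / total
              for d in range(dim)]
    return conf, pooled


if __name__ == "__main__":
    # Toy "feature map": 6 locations with 4-dimensional features.
    toy = [[1.0, 0.0, 0.0, 0.0], [0.9, 0.1, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0],
           [0.0, 0.0, 1.0, 0.2], [0.0, 0.0, 0.8, 0.1], [0.1, 0.0, 0.0, 1.0]]
    slots, attn = slot_attention(toy, num_slots=3)
    conf, pooled = image_feature(toy, attn)
    print("slot confidences:", conf)
    print("image-level feature:", pooled)
```

In a setup like the one the abstract describes, the per-slot vectors would serve as object-level features and the pooled vector as the image-level feature, with both passed to a contrastive loss; that loss is omitted here because the abstract does not specify its form.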
Funder
National Natural Science Foundation of China (NSFC) Regional Innovation and Development Joint Fund; CETC Key Laboratory of Aerospace Information Applications