Affiliation:
1. Department of Electrical Engineering and Computer Science, Technische Universität Berlin, 10623 Berlin, Germany
2. Department of Electrical Engineering, National Ilan University, Yilan 260007, Taiwan
Abstract
Few-Shot Semantic Segmentation (FSS) has attracted considerable attention recently because it can segment novel-class objects given only a handful of support samples. However, current FSS methods focus mainly on natural images and pay little attention to more practical and challenging scenarios such as remote sensing image segmentation. Remote sensing images typically have complex backgrounds and tiny foreground objects, which makes novel-class segmentation difficult. To address these obstacles, we propose a Class-Aware Self- and Cross-Attention Network (CSCANet) for FSS in remote sensing imagery, consisting of a lightweight self-attention module and a supervised prior-guided cross-attention module. Concretely, the self-attention module extracts robust unseen-class information from support features, while the cross-attention module generates a high-quality query attention map that directs the network to focus on novel objects. Experiments demonstrate that CSCANet achieves outstanding performance on the standard remote sensing FSS benchmark iSAID-5i, surpassing existing state-of-the-art FSS models across all combinations of backbone networks and K-shot settings.
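To make the idea of a query attention map concrete, the following is a minimal sketch of generic mask-weighted cross-attention between query and support features. It is not the CSCANet module itself: the abstract does not specify the architecture, so the function names (`softmax`, `query_attention_map`), the scaled dot-product scoring, and the toy feature vectors are all assumptions made for illustration only.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def query_attention_map(query_feats, support_feats, support_mask):
    """Hypothetical helper: for each query position, return the share of
    attention that falls on foreground (mask == 1) support positions.

    query_feats, support_feats: lists of equal-length feature vectors,
    one per spatial position. support_mask: list of 0/1 labels, one per
    support position.
    """
    dim = len(support_feats[0])
    scale = 1.0 / math.sqrt(dim)  # standard scaled dot-product factor
    attn_map = []
    for q in query_feats:
        # Similarity of this query position to every support position.
        scores = [scale * sum(a * b for a, b in zip(q, s)) for s in support_feats]
        weights = softmax(scores)
        # Attention mass assigned to foreground support positions.
        attn_map.append(sum(w * m for w, m in zip(weights, support_mask)))
    return attn_map


# Toy data (assumed): one foreground and two background support positions.
support_feats = [[4.0, 0.0], [0.0, 4.0], [0.0, 4.0]]
support_mask = [1, 0, 0]
# First query position resembles the foreground, second the background.
query_feats = [[4.0, 0.0], [0.0, 4.0]]

attn = query_attention_map(query_feats, support_feats, support_mask)
```

In this toy setting, the query position that resembles the masked support feature receives a value close to 1 and the other a value close to 0, which is the kind of signal a query attention map is meant to provide.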
Funder
National Science and Technology Council, Taiwan