Author:
Zeng Sifu,Yang Jie,Luo Wang,Ruan Yudi
Abstract
Establishing the relationship between a limited number of samples and segmented objects in diverse scenarios is the primary challenge in few-shot segmentation. However, many previous works overlooked the crucial support-query set interaction and the deeper information that needs to be explored. This oversight can lead to model failure when confronted with complex scenarios, such as ambiguous boundaries. To solve this problem, a duplex network that utilizes the suppression and focus concept is proposed to effectively suppress the background and focus on the foreground. Our network includes dynamic convolution to enhance the support-query interaction and a prototype match structure to fully extract information from support and query. The proposed model is called dynamic prototype mixture convolutional networks (DPMC). To minimize the impact of redundant information, we have incorporated a hybrid attentional module called double-layer attention augmented convolutional module (DAAConv) into DPMC. This module enables the network to concentrate more on foreground information. Our experiments on PASCAL-5i and COCO-20i datasets suggested that DPMC and DAAConv outperform traditional prototype-based methods by up to 5–8% on average.
Subject
Artificial Intelligence,Biomedical Engineering
Reference53 articles.
1. Few-shot semantic segmentation via mask aggregation;Ao;arXiv:2202.07231.,2022
2. “Attention augmented convolutional networks,”;Bello,2019
3. “Few-shot segmentation without meta-learning: a good transductive inference is all you need?,”;Boudiaf,2021
4. “Crossvit: cross-attention multi-scale vision transformer for image classification,”;Chen,2021
5. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs;Chen;IEEE Trans. Pattern Anal. Mach. Intell.