Affiliation:
1. National Key Laboratory of Science and Technology of Underwater Vehicle Harbin Engineering University Harbin China
2. Fourth Affiliated Hospital Harbin Medical University Harbin China
3. College of Computer Science and Artificial Intelligence Wenzhou University Wenzhou China
4. State Key Laboratory of Management and Control for Complex System Institute of Automation Chinese Academy of Sciences Beijing China
Abstract
AbstractBackgroundU‐Net and its variations have achieved remarkable performances in medical image segmentation. However, they have two limitations. First, the shallow layer feature of the encoder always contains background noise. Second, semantic gaps exist between the features of the encoder and the decoder. Skip‐connections directly connect the encoder to the decoder, which will lead to the fusion of semantically dissimilar feature maps.PurposeTo overcome these two limitations, this paper proposes a novel medical image segmentation algorithm, called feature‐guided attention network, which consists of U‐Net, the cross‐level attention filtering module (CAFM), and the attention‐guided upsampling module (AUM).MethodsIn the proposed method, the AUM and the CAFM were introduced into the U‐Net, where the AUM learns to filter the background noise in the low‐level feature map of the encoder and the CAFM tries to eliminate the semantic gap between the encoder and the decoder. Specifically, the AUM adopts a top‐down pathway to use the high‐level feature map so as to filter the background noise in the low‐level feature map of the encoder. The AUM uses the encoder features to guide the upsampling of the corresponding decoder features, thus eliminating the semantic gap between them. Four medical image segmentation tasks, including coronary atherosclerotic plaque segmentation (Dataset A), retinal vessel segmentation (Dataset B), skin lesion segmentation (Dataset C), and multiclass retinal edema lesions segmentation (Dataset D), were used to validate the proposed method.ResultsFor Dataset A, the proposed method achieved higher Intersection over Union (IoU) (), dice (), accuracy (), and sensitivity () than the previous best method: CA‐Net. For Dataset B, the proposed method achieved higher sensitivity (83.50%) and accuracy (97.55%) than the previous best method: SCS‐Net. For Dataset C, the proposed method had highest IoU () and dice () than those of all compared previous methods. For Dataset D, the proposed method had highest dice (average: 81.53%; retina edema area [REA]: 83.78%; pigment epithelial detachment [PED] 77.13%), sensitivity (REA: 89.01%; SRF: 85.50%), specificity (REA: 99.35%; PED: 100.00), and accuracy (98.73%) among all compared previous networks. In addition, the number of parameters of the proposed method was 2.43 M, which is less than CA‐Net (3.21 M) and CPF‐Net (3.07 M).ConclusionsThe proposed method demonstrated state‐of‐the‐art performance, outperforming other top‐notch medical image segmentation algorithms. The CAFM filtered the background noise in the low‐level feature map of the encoder, while the AUM eliminated the semantic gap between the encoder and the decoder. Furthermore, the proposed method was of high computational efficiency.
Funder
Beijing Science and Technology Planning Project
National Natural Science Foundation of China
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献