Feature ensemble network for medical image segmentation with multi‐scale atrous transformer

Published: 2024-06-25 Issue: 11 Volume: 18 Page: 3082-3092
ISSN: 1751-9659
Container-title: IET Image Processing
Language: en
Short-container-title: IET Image Processing

Author:

Gai Di¹, Geng Yuhan², Huang Xia³, Huang Zheng¹, Xiong Xin³, Zhou Ruihua⁴, Wang Qi¹

Affiliation:

1. School of Mathematics and Computer Sciences, Nanchang University, Nanchang, China

2. School of Public Health, University of Michigan, Ann Arbor, Michigan, USA

3. The First Affiliated Hospital, Jiangxi Medical College, Nanchang University, Nanchang, China

4. School of Software, Nanchang University, Nanchang, China

Abstract

Recent years have witnessed notable advancements in medical image segmentation through deep convolutional neural networks. However, convolution is a local operation, which limits the ability to fully exploit global semantic information. To overcome these challenges, a feature ensemble network with a multi‐scale atrous transformer is proposed. At the core of the approach lies the multi‐scale contextual integration module, which is based on the multi‐scale atrous transformer and facilitates contextual integration of multi‐level features. To extract discriminative fine‐grained features of the target region, a hybrid attention mechanism is incorporated that combines spatial and channel attention, sharpening the model's focus on crucial target information within high‐level features. Additionally, a channel‐aware feature reconstruction module is introduced to tackle feature similarity across different categories. This module performs feature reconstruction based on channel perception, widening the feature gap between categories and enhancing segmentation capability. The proposed approach surpasses state‐of‐the‐art methods on three benchmark medical image segmentation datasets.
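
As a rough illustration of the "multi‐scale atrous" idea named in the abstract, the sketch below applies one small kernel at several dilation rates so that each branch covers a different context width. It shows generic atrous (dilated) convolution in one dimension only; it is not the authors' transformer module, whose details the abstract does not give. The function names, the kernel, and the rates (1, 2, 4) are assumptions chosen for the example.

```python
def receptive_field(kernel_size, rate):
    """Input span covered by one output of a dilated kernel."""
    return rate * (kernel_size - 1) + 1


def atrous_conv1d(signal, kernel, rate):
    """Dilated 1D cross-correlation with no padding ('valid' mode)."""
    span = receptive_field(len(kernel), rate)
    out = []
    for start in range(len(signal) - span + 1):
        # Kernel taps are spaced `rate` samples apart.
        out.append(sum(w * signal[start + i * rate] for i, w in enumerate(kernel)))
    return out


# Hypothetical input and kernel, for illustration only.
signal = [float(i) for i in range(12)]
kernel = [1.0, 0.0, -1.0]
rates = (1, 2, 4)  # assumed dilation rates, not taken from the paper

# One branch per rate: same kernel, increasingly wide context.
multi_scale = {r: atrous_conv1d(signal, kernel, r) for r in rates}
```

With a 3-tap kernel, the three branches span 3, 5, and 9 input samples respectively while using the same number of weights, which is the property that makes atrous operations attractive for gathering context at several scales.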

Funder

National Natural Science Foundation of China

Publisher

Institution of Engineering and Technology (IET)
