FDB-Net: Fusion double branch network combining CNN and transformer for medical image segmentation-Reference-Cited by-同舟云学术

FDB-Net: Fusion double branch network combining CNN and transformer for medical image segmentation

Published:2024-08-16 Issue:4 Volume:32 Page:931-951
ISSN:0895-3996
Container-title:Journal of X-Ray Science and Technology
language:
Short-container-title:XST

Author:

Jiang Zhongchuan¹²,Wu Yun¹²,Huang Lei¹²,Gu Maohua¹²

Affiliation:

1. State Key Laboratory of Public Big Data, Guiyang, China

2. College of Computer Science and Technology, Guizhou University, Guiyang, China

Abstract

BACKGROUND: The rapid development of deep learning techniques has greatly improved the performance of medical image segmentation, and medical image segmentation networks based on convolutional neural networks and Transformer have been widely used in this field. However, due to the limitation of the restricted receptive field of convolutional operation and the lack of local fine information extraction ability of the self-attention mechanism in Transformer, the current neural networks with pure convolutional or Transformer structure as the backbone still perform poorly in medical image segmentation. METHODS: In this paper, we propose FDB-Net (Fusion Double Branch Network, FDB-Net), a double branch medical image segmentation network combining CNN and Transformer, by using a CNN containing gnConv blocks and a Transformer containing Varied-Size Window Attention (VWA) blocks as the feature extraction backbone network, the dual-path encoder ensures that the network has a global receptive field as well as access to the target local detail features. We also propose a new feature fusion module (Deep Feature Fusion, DFF), which helps the image to simultaneously fuse features from two different structural encoders during the encoding process, ensuring the effective fusion of global and local information of the image. CONCLUSION: Our model achieves advanced results in all three typical tasks of medical image segmentation, which fully validates the effectiveness of FDB-Net.

Publisher

IOS Press

Reference47 articles.

1. D-former: A ushaped dilated transformer for 3d medical image segmentation;Wu;Neural Computing and Applications,2022

2. Weighted res-unet for high-quality retina vessel segmentation;Xiao;2018 9th International Conference on Information Technology in Medicine and Education (ITME),2018

3. Fully dense unet for 2-d sparse photoacoustic tomography artifact removal;Guan;IEEE Journal of Biomedical and Health Informatics,2019

4. D-unet: a dimension-fusion u shape network for chronic stroke lesion segmentation;Zhou;IEEE/ACM Transactions on Computational Biology and Bioinformatics,2019