A Multi-Scale Cross-Fusion Medical Image Segmentation Network Based on Dual-Attention Mechanism Transformer
-
Published:2023-09-30
Issue:19
Volume:13
Page:10881
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Cui Jianguo1, Wang Liejun1ORCID, Jiang Shaochen1
Affiliation:
1. College of Information Science and Engineering, Xinjiang University, Urumqi 830049, China
Abstract
The U-net network, with its simple and powerful encoder–decoder structure, dominates the field of medical image segmentation. However, convolution operations are limited by receptive fields. They do not have the ability to model long-range dependencies, but Transformer has the capability of long-term modeling thanks to its core self-attention mechanism, which has been widely applied in the field of medical image segmentation. However, both CNNs and Transformer can only perform correlation calculations for a single sample, ignoring the correlation between different samples. To address these problems, we propose a new Transformer, which we call the Dual-Attention Transformer (DAT). This module captures correlations within a single sample while also learning correlations between different samples. The current U-net and some of its variant models have the problem of inadequate feature fusion, so we also improve the skip connection to strengthen the association between feature maps at different scales, reduce the semantic gap between the encoder and decoder, and further improve the segmentation performance. We refer to this structure as DATUnet. We conducted extensive experiments on the Synapse and ACDC datasets to validate the superior performance of our network, and we achieved an average DSC (%) of 83.6 and 90.9 and an average HD95 of 13.99 and 1.466 for the Synapse and ACDC datasets, respectively.
Funder
Special Funds for the Central Government to Guide Local Science and Technology Development Scientific and Technological Innovation 2030 Major Project
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference46 articles.
1. Zhu, J., Yang, G., and Lio, P. (2023). A residual dense vision transformer for medical image super-resolution with segmentation-based perceptual loss fine-tuning. arXiv. 2. Isensee, F., Kickingereder, P., Wick, W., Bendszus, M., and Maier-Hein, K.H. (2018). Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries: Proceedings of the Third International Workshop, BrainLes 2017, Held in Conjunction with MICCAI 2017, Quebec City, QC, Canada, 14 September 2017, Springer. Revised Selected Papers. 3. Zhang, Y., Liu, H., and Hu, Q. (2021). Medical Image Computing and Computer Assisted Intervention—MICCAI 2021: Proceedings of the 24th International Conference, Strasbourg, France, 27 September–1 October 2021, Proceedings, Part I, Springer. 4. Zhao, Z., Zhu, A., Zeng, Z., Veeravalli, B., and Guan, C. (2022, January 16–19). Act-net: Asymmetric co-teacher network for semi-supervised memory-efficient medical image segmentation. Proceedings of the 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France. 5. Tragakis, A., Kaul, C., Murray-Smith, R., and Husmeier, D. (2023, January 3–7). The fully convolutional transformer for medical image segmentation. Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, Waikoloa, HI, USA.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|