Automatic Medical Image Segmentation with Vision Transformer

Published: 2024-03-25 Volume: 14 Issue: 7 Page: 2741
ISSN: 2076-3417
Journal: Applied Sciences
Language: en

Author:

Zhang Jie¹,², Li Fan¹, Zhang Xin¹, Wang Huaijun¹, Hei Xinhong¹

Affiliation:

1. School of Computer Science and Engineering, Xi’an University of Technology, Xi’an 710048, China

2. Provincial Key Laboratory of Network Computing and Security Technology, Xi’an University of Technology, Xi’an 710048, China

Abstract

Automatic image segmentation is vital for the computer-aided determination of treatment direction, particularly for labelling lesions or infected areas. However, manual labelling of disease regions is inconsistent and time-consuming, and radiologists' assessments are highly subjective, often shaped by personal clinical experience. To address these issues, we propose a transformer-based learning strategy that automatically recognizes infected areas in medical images. We first use a parallel partial decoder to aggregate high-level features and generate a global feature map. Explicit edge attention and implicit reverse attention are then applied to model boundaries and enhance their representation. Additionally, to alleviate the need for extensive labeled data, we propose a segmentation network combining propagation and transformer architectures that requires only a small amount of labeled data while leveraging mostly unlabeled images. The attention mechanisms are integrated within convolutional networks, keeping their global structures intact. Standalone transformers applied directly to image patches can also achieve impressive segmentation performance. Our network improves learning ability and achieves higher-quality segmentation. We conducted a variety of ablation studies to demonstrate the effectiveness of each model component. Experiments across various medical imaging modalities show that our model outperforms the most popular segmentation models. The comprehensive results also show that our transformer architecture surpasses established frameworks in accuracy while better preserving natural anatomical variation. Both quantitatively and qualitatively, our model achieves higher overlap with ground-truth segmentations and improved boundary adherence.
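
The abstract reports higher overlap with ground-truth segmentations but does not name the overlap metric. As an illustration only, the sketch below computes the Dice coefficient, a common overlap measure for binary segmentation masks. Assuming Dice is the relevant metric is our assumption, not something the abstract states, and the function name `dice_coefficient` and the toy masks are hypothetical.

```python
def dice_coefficient(pred, truth):
    """Dice overlap between two binary masks given as nested lists of 0/1.

    Illustrative only: the abstract does not say which overlap metric
    the authors used; Dice is assumed here as a common choice.
    """
    flat_pred = [p for row in pred for p in row]
    flat_truth = [t for row in truth for t in row]
    if len(flat_pred) != len(flat_truth):
        raise ValueError("masks must have the same number of pixels")

    # Pixels marked as foreground in both masks.
    intersection = sum(1 for p, t in zip(flat_pred, flat_truth) if p and t)
    # Total foreground pixels across both masks.
    total = sum(1 for p in flat_pred if p) + sum(1 for t in flat_truth if t)

    # Two empty masks agree perfectly; define Dice as 1.0 in that case.
    if total == 0:
        return 1.0
    return 2.0 * intersection / total


# Hypothetical 3x3 masks: the prediction misses one foreground pixel.
pred_mask = [[0, 1, 1],
             [0, 1, 0],
             [0, 0, 0]]
truth_mask = [[0, 1, 1],
              [0, 1, 1],
              [0, 0, 0]]
score = dice_coefficient(pred_mask, truth_mask)
print(f"Dice = {score:.3f}")
```

A score of 1.0 means the predicted and reference masks coincide exactly, and 0.0 means they share no foreground pixels.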

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Link

https://www.mdpi.com/2076-3417/14/7/2741/pdf
