Improved UNet with Attention for Medical Image Segmentation

Published: 2023-10-20 Issue: 20 Volume: 23 Page: 8589
ISSN: 1424-8220
Container-title: Sensors
Language: en
Short-container-title: Sensors

Author:

AL Qurri Ahmed¹, Almekkawy Mohamed¹

Affiliation:

1. School of Electrical Engineering and Computer Science, Pennsylvania State University, University Park, PA 16802, USA

Abstract

Medical image segmentation is crucial for medical image processing and the development of computer-aided diagnostics. In recent years, deep Convolutional Neural Networks (CNNs) have been widely adopted for medical image segmentation and have achieved significant success. UNet, which is based on CNNs, is the mainstream method for medical image segmentation. However, its performance suffers from its inability to capture long-range dependencies. Transformers, initially designed for Natural Language Processing (NLP) and sequence-to-sequence applications, have demonstrated the ability to capture long-range dependencies. However, their ability to acquire local information is limited. Hybrid architectures of CNNs and Transformers, such as TransUNet, have been proposed to benefit from the Transformer’s long-range dependencies and the CNN’s low-level details. Nevertheless, automatic medical image segmentation remains a challenging task due to factors such as blurred boundaries, the low-contrast tissue environment, and, in the context of ultrasound, issues like speckle noise and attenuation. In this paper, we propose a new model that combines the strengths of both CNNs and Transformers, with network architectural improvements designed to enrich the feature representation captured by the skip connections and the decoder. To this end, we devised a new attention module called Three-Level Attention (TLA). This module is composed of an Attention Gate (AG), channel attention, and a spatial normalization mechanism. The AG preserves structural information, whereas channel attention helps to model the interdependencies between channels. Spatial normalization employs the spatial coefficient of the Transformer to improve spatial attention, akin to TransNorm. To further improve the skip connection and reduce the semantic gap, skip connections between the encoder and decoder were redesigned in a manner similar to that of the UNet++ dense connection. Moreover, deep supervision using a side-output channel was introduced, analogous to BASNet, which was originally used for saliency predictions. Two datasets from different modalities, a CT scan dataset and an ultrasound dataset, were used to evaluate the proposed UNet architecture. The experimental results showed that our model consistently improved the prediction performance of the UNet across different datasets.
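
The abstract names the parts of the TLA module but does not give their equations, so the following is only a rough sketch of two of them under stated assumptions: an additive attention gate in the style of Attention U-Net, followed by squeeze-and-excitation style channel attention. The function names, tensor shapes, and random weights are hypothetical and are not taken from the paper; the spatial normalization step is omitted because the abstract does not specify it.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def attention_gate(skip, gate, w_skip, w_gate, psi):
    """Additive attention gate (assumed Attention U-Net style, not the paper's exact AG).

    skip: (C, H, W) encoder features carried by the skip connection.
    gate: (Cg, H, W) decoder features, assumed already resampled to H x W.
    w_skip: (F, C), w_gate: (F, Cg), psi: (F,) play the role of 1x1 convolutions.
    """
    # A 1x1 convolution is a per-pixel linear map over the channel axis.
    q = np.einsum("fc,chw->fhw", w_skip, skip) + np.einsum("fc,chw->fhw", w_gate, gate)
    # One attention coefficient per pixel, in the range (0, 1).
    alpha = sigmoid(np.einsum("f,fhw->hw", psi, np.maximum(q, 0.0)))
    return skip * alpha[None, :, :]


def channel_attention(x, w1, w2):
    """Squeeze-and-excitation style channel attention (an assumed variant).

    x: (C, H, W); w1: (C // r, C); w2: (C, C // r) for a reduction ratio r.
    """
    squeezed = x.mean(axis=(1, 2))                         # global average pool -> (C,)
    scale = sigmoid(w2 @ np.maximum(w1 @ squeezed, 0.0))   # one weight per channel
    return x * scale[:, None, None]


# Hypothetical shapes and random weights, for illustration only.
rng = np.random.default_rng(0)
skip = rng.standard_normal((8, 16, 16))
gate = rng.standard_normal((4, 16, 16))
gated = attention_gate(
    skip,
    gate,
    rng.standard_normal((6, 8)),
    rng.standard_normal((6, 4)),
    rng.standard_normal(6),
)
refined = channel_attention(
    gated,
    rng.standard_normal((2, 8)),
    rng.standard_normal((8, 2)),
)
print(refined.shape)  # (8, 16, 16)
```

Both steps only rescale the skip features by factors between 0 and 1, so the output keeps the input shape and can be concatenated with decoder features as in a standard UNet.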

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/23/20/8589/pdf

References: 63 articles.

1. Gao, Q., and Almekkawy, M. (2021). ASUNet++: A nested UNet with adaptive feature extractions for liver tumor segmentation. Comput. Biol. Med., 136.

2. Current and emerging trends in medical image segmentation with deep learning;Conze;IEEE Trans. Radiat. Plasma Med. Sci.,2023

3. Statistical shape models for 3D medical image segmentation: A review;Heimann;Med. Image Anal.,2009

4. Kakumani, A.K., Sree, L.P., Kumar, B.V., Rao, S.K., Garrepally, M., and Chandrakanth, M. (2022, January 7–9). Segmentation of Cell Nuclei in Microscopy Images using Modified ResUNet. Proceedings of the 2022 IEEE 3rd Global Conference for Advancement in Technology (GCAT), Bangalore, India.

5. Active contour model based on local and global intensity information for medical image segmentation;Zhou;Neurocomputing,2016

Cited by 15 articles.

1. Deep learning revealed statistics of the MgO particles dissolution rate in a CaO–Al2O3–SiO2–MgO slag;Scientific Reports;2024-09-11

2. Deep Learning-driven Automatic Nuclei Segmentation of Label-free Live Cell Chromatin-sensitive Partial Wave Spectroscopic Microscopy Imaging;2024-08-21

3. Application of the bicharacteristic attention residual pyramid for the treatment of brain tumors;Heliyon;2024-08

4. Automatic cancer nuclei segmentation on histological images: comparison study of deep learning methods;Biotechnology and Bioprocess Engineering;2024-07-04

5. Improving Surgical Scene Semantic Segmentation through a Deep Learning Architecture with Attention to Class Imbalance;Biomedicines;2024-06-13