SEF-UNet: advancing abdominal multi-organ segmentation with SEFormer and depthwise cascaded upsampling

Author:

Zhao Yaping1,Jiang Yizhang1,Huang Lijun2,Xia Kaijian34

Affiliation:

1. School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi, Jiangsu, China

2. Department of Medical Imaging, The Changshu Affiliated Hospital of Soochow University, Suzhou, Jiangsu, China

3. Department of Scientific Research, The Changshu Affiliated Hospital of Soochow University, Suzhou, Jiangsu, China

4. Changshu Key Laboratory of Medical Artificial Intelligence and Big Data, Suzhou, Jiangsu, China

Abstract

The abdomen houses multiple vital organs, which are associated with various diseases posing significant risks to human health. Early detection of abdominal organ conditions allows for timely intervention and treatment, preventing deterioration of patients’ health. Segmenting abdominal organs aids physicians in more accurately diagnosing organ lesions. However, the anatomical structures of abdominal organs are relatively complex, with organs overlapping each other, sharing similar features, thereby presenting challenges for segmentation tasks. In real medical scenarios, models must demonstrate real-time and low-latency features, necessitating an improvement in segmentation accuracy while minimizing the number of parameters. Researchers have developed various methods for abdominal organ segmentation, ranging from convolutional neural networks (CNNs) to Transformers. However, these methods often encounter difficulties in accurately identifying organ segmentation boundaries. MetaFormer abstracts the framework of Transformers, excluding the multi-head Self-Attention, offering a new perspective for solving computer vision problems and overcoming the limitations of Vision Transformers and CNN backbone networks. To further enhance segmentation effectiveness, we propose a U-shaped network, integrating SEFormer and depthwise cascaded upsampling (dCUP) as the encoder and decoder, respectively, into the UNet structure, named SEF-UNet. SEFormer combines Squeeze-and-Excitation modules with depthwise separable convolutions, instantiating the MetaFormer framework, enhancing the capture of local details and texture information, thereby improving edge segmentation accuracy. dCUP further integrates shallow and deep information layers during the upsampling process. Our model significantly improves segmentation accuracy while reducing the parameter count and exhibits superior performance in segmenting organ edges that overlap each other, thereby offering potential deployment in real medical scenarios.

Funder

Suzhou Key Supporting Subjects, Health Informatics

Changshu Science and Technology Program

Changshu Key Laboratory of Medical Artificial Intelligence and Big Data

Publisher

PeerJ

Reference27 articles.

1. A review on the use of deep learning for medical images segmentation;Aljabri;Neurocomputing,2022

2. Segnet: a deep convolutional encoder-decoder architecture for image segmentation;Badrinarayanan;IEEE Transactions on Pattern Analysis and Machine Intelligence,2017

3. Swin-Unet: unet-like pure transformer for medical image segmentation;Cao,2022

4. Transunet: transformers make strong encoders for medical image segmentation;Chen,2021

5. An image is worth 16×16 words: transformers for image recognition at scale;Dosovitskiy,2020

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3