A new architecture combining convolutional and transformer‐based networks for automatic 3D multi‐organ segmentation on CT images

Author:

Li Chengyin1,Bagher‐Ebadian Hassan2345,Sultan Rafi Ibn1,Elshaikh Mohamed2,Movsas Benjamin2,Zhu Dongxiao1,Chetty Indrin J.26

Affiliation:

1. College of Engineering ‐ Dept. of Computer Science Wayne State University Detroit Michigan USA

2. Department of Radiation Oncology Henry Ford Cancer Institute Detroit Michigan USA

3. Department of Radiology Michigan State University East Lansing Michigan USA

4. Department of Osteopathic Medicine Michigan State University East Lansing Michigan USA

5. Department of Physics Oakland University Rochester Michigan USA

6. Department of Radiation Oncology Cedars Sinai Medical Center Los Angeles CA USA

Abstract

AbstractPurposeDeep learning‐based networks have become increasingly popular in the field of medical image segmentation. The purpose of this research was to develop and optimize a new architecture for automatic segmentation of the prostate gland and normal organs in the pelvic, thoracic, and upper gastro‐intestinal (GI) regions.MethodsWe developed an architecture which combines a shifted‐window (Swin) transformer with a convolutional U‐Net. The network includes a parallel encoder, a cross‐fusion block, and a CNN‐based decoder to extract local and global information and merge related features on the same scale. A skip connection is applied between the cross‐fusion block and decoder to integrate low‐level semantic features. Attention gates (AGs) are integrated within the CNN to suppress features in image background regions. Our network is termed “SwinAttUNet.” We optimized the architecture for automatic image segmentation. Training datasets consisted of planning‐CT datasets from 300 prostate cancer patients from an institutional database and 100 CT datasets from a publicly available dataset (CT‐ORG). Images were linearly interpolated and resampled to a spatial resolution of (1.0 × 1.0× 1.5) mm3. A volume patch (192 × 192 × 96) was used for training and inference, and the dataset was split into training (75%), validation (10%), and test (15%) cohorts. Data augmentation transforms were applied consisting of random flip, rotation, and intensity scaling. The loss function comprised Dice and cross‐entropy equally weighted and summed. We evaluated Dice coefficients (DSC), 95th percentile Hausdorff Distances (HD95), and Average Surface Distances (ASD) between results of our network and ground truth data.ResultsSwinAttUNet, DSC values were 86.54 ± 1.21, 94.15 ± 1.17, and 87.15 ± 1.68% and HD95 values were 5.06 ± 1.42, 3.16 ± 0.93, and 5.54 ± 1.63 mm for the prostate, bladder, and rectum, respectively. Respective ASD values were 1.45 ± 0.57, 0.82 ± 0.12, and 1.42 ± 0.38 mm. For the lung, liver, kidneys and pelvic bones, respective DSC values were: 97.90 ± 0.80, 96.16 ± 0.76, 93.74 ± 2.25, and 89.31 ± 3.87%. Respective HD95 values were: 5.13 ± 4.11, 2.73 ± 1.19, 2.29 ± 1.47, and 5.31 ± 1.25 mm. Respective ASD values were: 1.88 ± 1.45, 1.78 ± 1.21, 0.71 ± 0.43, and 1.21 ± 1.11 mm. Our network outperformed several existing deep learning approaches using only attention‐based convolutional or Transformer‐based feature strategies, as detailed in the results section.ConclusionsWe have demonstrated that our new architecture combining Transformer‐ and convolution‐based features is able to better learn the local and global context for automatic segmentation of multi‐organ, CT‐based anatomy.

Publisher

Wiley

Subject

General Medicine

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3