Affiliation:
1. Department of Computer Science, Mathematics, Physics and Statistics University of British Columbia Okanagan Kelowna British Columbia Canada
2. School of Engineering The University of British Columbia Okanagan Campus Kelowna British Columbia Canada
3. Department of Medical Physics BC Cancer – Kelowna Kelowna Canada
Abstract
AbstractPurposeDeep learning‐based auto‐segmentation algorithms can improve clinical workflow by defining accurate regions of interest while reducing manual labor. Over the past decade, convolutional neural networks (CNNs) have become prominent in medical image segmentation applications. However, CNNs have limitations in learning long‐range spatial dependencies due to the locality of the convolutional layers. Transformers were introduced to address this challenge. In transformers with self‐attention mechanism, even the first layer of information processing makes connections between distant image locations. Our paper presents a novel framework that bridges these two unique techniques, CNNs and transformers, to segment the gross tumor volume (GTV) accurately and efficiently in computed tomography (CT) images of non‐small cell‐lung cancer (NSCLC) patients.MethodsUnder this framework, input of multiple resolution images was used with multi‐depth backbones to retain the benefits of high‐resolution and low‐resolution images in the deep learning architecture. Furthermore, a deformable transformer was utilized to learn the long‐range dependency on the extracted features. To reduce computational complexity and to efficiently process multi‐scale, multi‐depth, high‐resolution 3D images, this transformer pays attention to small key positions, which were identified by a self‐attention mechanism. We evaluated the performance of the proposed framework on a NSCLC dataset which contains 563 training images and 113 test images. Our novel deep learning algorithm was benchmarked against five other similar deep learning models.ResultsThe experimental results indicate that our proposed framework outperforms other CNN‐based, transformer‐based, and hybrid methods in terms of Dice score (0.92) and Hausdorff Distance (1.33). Therefore, our proposed model could potentially improve the efficiency of auto‐segmentation of early‐stage NSCLC during the clinical workflow. This type of framework may potentially facilitate online adaptive radiotherapy, where an efficient auto‐segmentation workflow is required.ConclusionsOur deep learning framework, based on CNN and transformer, performs auto‐segmentation efficiently and could potentially assist clinical radiotherapy workflow.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献