A new visual State Space Model for low‐dose CT denoising

Author:

Huang Jiexing (1), Zhong Anni (2), Wei Yajing (3)

Affiliation:

1. Department of Radiation Oncology, The First Affiliated Hospital, Sun Yat‐sen University, Guangzhou, China

2. Department of Digital Hospital Construction, The Sixth Affiliated Hospital, Sun Yat‐sen University, Guangzhou, China

3. Department of Obstetrics and Gynecology, The First Affiliated Hospital, Sun Yat‐sen University, Guangzhou, China

Abstract

Background: Low‐dose computed tomography (LDCT) can mitigate potential health risks to the public. However, the severe noise and artifacts in LDCT images can impede subsequent clinical diagnosis and analysis. Convolutional neural networks (CNNs) and Transformers are the two most popular backbones in LDCT denoising. Nonetheless, CNNs lack long‐range modeling capabilities, while Transformers are hindered by high computational complexity.

Purpose: Our main goal is to develop a simple and efficient model for LDCT denoising that can both focus on local spatial context and model long‐range dependencies with linear computational complexity.

Methods: We make the first attempt to apply the State Space Model to LDCT denoising and propose a novel LDCT denoising model named Visual Mamba Encoder‐Decoder Network (ViMEDnet). To capture both local and global features efficiently and effectively, we propose the Mixed State Space Module (MSSM), in which depth‐wise convolution, max‐pooling, and a 2D Selective Scan Module (2DSSM) are coupled together through a partial channel splitting mechanism. The 2DSSM captures global information with linear computational complexity, while convolution and max‐pooling learn local signals to facilitate detail restoration. Furthermore, the network uses a weighted gradient‐sensitive hybrid loss function to help preserve image details, improving the overall denoising performance.

Results: The performance of the proposed ViMEDnet is compared with five state‐of‐the‐art LDCT denoising methods: an iterative algorithm, two CNN‐based methods, and two Transformer‐based methods. The comparative experimental results demonstrate that ViMEDnet achieves better visual quality and quantitative assessment outcomes. In visual evaluation, ViMEDnet effectively removes noise and artifacts while exhibiting superior performance in restoring fine structures and low‐contrast structural edges, resulting in minimal deviation of the denoised images from normal‐dose CT (NDCT) images. In quantitative assessment, ViMEDnet obtains the lowest RMSE and the highest PSNR, SSIM, and FSIM scores, further substantiating its superiority.

Conclusions: The proposed ViMEDnet possesses excellent LDCT denoising performance and provides a new alternative to LDCT denoising models beyond the existing CNN and Transformer options.
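
The quantitative assessment in the abstract relies on RMSE and PSNR computed between a denoised image and its NDCT reference. The abstract does not give the authors' evaluation code, so the following is only a minimal sketch of the standard definitions of the two metrics. The function names `rmse` and `psnr`, the nested-list image representation, and the `data_range` parameter (the assumed maximum possible intensity span) are all illustrative assumptions, not details from the paper.

```python
import math


def rmse(reference, estimate):
    """Root-mean-square error between two equally sized 2D images.

    Images are given as lists of rows (an illustrative choice, not the
    paper's data format).
    """
    if len(reference) != len(estimate):
        raise ValueError("images must have the same number of rows")
    total, count = 0.0, 0
    for ref_row, est_row in zip(reference, estimate):
        if len(ref_row) != len(est_row):
            raise ValueError("rows must have the same length")
        for r, e in zip(ref_row, est_row):
            total += (r - e) ** 2
            count += 1
    if count == 0:
        raise ValueError("images must not be empty")
    return math.sqrt(total / count)


def psnr(reference, estimate, data_range):
    """Peak signal-to-noise ratio in dB.

    data_range is the assumed maximum intensity span of the images
    (hypothetical parameter; the paper's intensity window is not stated
    in the abstract).
    """
    err = rmse(reference, estimate)
    if err == 0:
        return math.inf  # identical images
    return 20.0 * math.log10(data_range / err)


# Toy 2x2 example with intensities assumed to lie in [0, 1].
ndct = [[0.0, 0.5], [1.0, 0.5]]
denoised = [[0.1, 0.5], [0.9, 0.5]]
print(rmse(ndct, denoised), psnr(ndct, denoised, data_range=1.0))
```

A lower RMSE and a higher PSNR both indicate a denoised image closer to the reference, which is why the abstract reports the lowest RMSE together with the highest PSNR. SSIM and FSIM are structural measures and are not covered by this sketch.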

Publisher

Wiley
