Pediatric evaluations for deep learning CT denoising

Author:

Nelson Brandon J.1,Kc Prabhat1,Badal Andreu1,Jiang Lu2,Masters Shane C.3,Zeng Rongping1

Affiliation:

1. Center for Devices and Radiological Health Office of Science and Engineering Labs Division of Imaging Diagnostics, and Software Reliability U.S. Food and Drug Administration Silver Spring Maryland USA

2. Center for Devices and Radiological Health Office of Product Evaluation and Quality Office of Radiological Health U.S. Food and Drug Administration Silver Spring Maryland USA

3. Center for Drug Evaluation and Research Office of Specialty Medicine Division of Imaging and Radiation Medicine U.S. Food and Drug Administration Silver Spring Maryland USA

Abstract

AbstractBackgroundDeep learning (DL) CT denoising models have the potential to improve image quality for lower radiation dose exams. These models are generally trained with large quantities of adult patient image data. However, CT, and increasingly DL denoising methods, are used in both adult and pediatric populations. Pediatric body habitus and size can differ significantly from adults and vary dramatically from newborns to adolescents. Ensuring that pediatric subgroups of different body sizes are not disadvantaged by DL methods requires evaluations capable of assessing performance in each subgroup.PurposeTo assess DL CT denoising in pediatric and adult‐sized patients, we built a framework of computer simulated image quality (IQ) control phantoms and evaluation methodology.MethodsThe computer simulated IQ phantoms in the framework featured pediatric‐sized versions of standard CatPhan 600 and MITA‐LCD phantoms with a range of diameters matching the mean effective diameters of pediatric patients ranging from newborns to 18 years old. These phantoms were used in simulating CT images that were then inputs for a DL denoiser to evaluate performance in different sized patients. Adult CT test images were simulated using standard‐sized phantoms scanned with adult scan protocols. Pediatric CT test images were simulated with pediatric‐sized phantoms and adjusted pediatric protocols. The framework's evaluation methodology consisted of denoising both adult and pediatric test images then assessing changes in image quality, including noise, image sharpness, CT number accuracy, and low contrast detectability. To demonstrate the use of the framework, a REDCNN denoising model trained on adult patient images was evaluated. To validate that the DL model performance measured with the proposed pediatric IQ phantoms was representative of performance in more realistic patient anatomy, anthropomorphic pediatric XCAT phantoms of the same age range were also used to compare noise reduction performance.ResultsUsing the proposed pediatric‐sized IQ phantom framework, size differences between adult and pediatric‐sized phantoms were observed to substantially influence the adult trained DL denoising model's performance. When applied to adult images, the DL model achieved a 60% reduction in noise standard deviation without substantial loss in sharpness in mid or high spatial frequencies. However, in smaller phantoms the denoising performance dropped due to different image noise textures resulting from the smaller field of view (FOV) between adult and pediatric protocols. In the validation study, noise reduction trends in the pediatric‐sized IQ phantoms were found to be consistent with those found in anthropomorphic phantoms.ConclusionWe developed a framework of using pediatric‐sized IQ phantoms for pediatric subgroup evaluation of DL denoising models. Using the framework, we found the performance of an adult trained DL denoiser did not generalize well in the smaller diameter phantoms corresponding to younger pediatric patient sizes. Our work suggests noise texture differences from FOV changes between adult and pediatric protocols can contribute to poor generalizability in DL denoising and that the proposed framework is an effective means to identify these performance disparities for a given model.

Publisher

Wiley

Subject

General Medicine

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3