A Framework for Harmonization of Radiomics Data for Multicenter Studies and Clinical Trials

Author:

Soliman Moataz A.S.1ORCID,Kelahan Linda C.1,Magnetta Michael1ORCID,Savas Hatice1ORCID,Agrawal Rishi1ORCID,Avery Ryan J.1ORCID,Aouad Pascale1,Liu Benjamin1,Xue Yue2,Chae Young K.34ORCID,Salem Riad14ORCID,Benson Al B.34ORCID,Yaghmai Vahid5,Velichko Yuri S.14ORCID

Affiliation:

1. Department of Radiology, Feinberg School of Medicine, Northwestern University, Chicago, IL

2. Department of Pathology, Feinberg School of Medicine, Northwestern University, Chicago, IL

3. Department of Medicine, Division of Hematology/Oncology, Feinberg School of Medicine, Northwestern University, Chicago, IL

4. Robert H. Lurie Comprehensive Cancer Center, Northwestern University, Chicago, IL

5. Department of Radiological Sciences, University of California, Irvine UCI Health, University of California Irvine, Orange, CA

Abstract

PURPOSE Variability in computed tomography images intrinsic to individual scanners limits the application of radiomics in clinical and research settings. The development of reproducible and generalizable radiomics-based models to assess lesions requires harmonization of data. The purpose of this study was to develop, test, and analyze the efficacy of a radiomics data harmonization model. MATERIALS AND METHODS Radiomic features from biopsy-proven untreated hepatic metastasis (N = 380) acquired from 167 unique patients with pancreatic, colon, and breast cancers were analyzed. Radiomic features from volume-match 551 samples of normal liver tissue and 188 hepatic cysts were included as references. A novel linear mixed effect model was used to identify effects associated with lesion size, tissue type, and scanner model. Six separate machine learning models were then used to test the effectiveness of radiomic feature harmonization using multivariate analysis. RESULTS Proposed model identifies and removes scanner-associated effects while preserving cancer-specific functional dependence of radiomic features on the tumor size. Data harmonization improves the performance of classification models by reducing the scanner-associated variability. For example, the multiclass logistic regression model, LogitBoost, demonstrated the improvement in sensitivity in the range from 15% to 40% for each type of liver metastasis, whereas the overall model accuracy and the kappa coefficient increased by 5% and 8% accordingly. CONCLUSION The model removed scanner-associated effects while preserving cancer-specific functional dependence of radiomic features.

Publisher

American Society of Clinical Oncology (ASCO)

Subject

General Medicine

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3