Comparing Performances of Predictive Models of Toxicity after Radiotherapy for Breast Cancer Using Different Machine Learning Approaches

Author:

Ubeira-Gabellini Maria Giulia1ORCID,Mori Martina1ORCID,Palazzo Gabriele1,Cicchetti Alessandro2ORCID,Mangili Paola1,Pavarini Maddalena1,Rancati Tiziana2ORCID,Fodor Andrei3ORCID,del Vecchio Antonella1ORCID,Di Muzio Nadia Gisella34,Fiorino Claudio1ORCID

Affiliation:

1. Medical Physics, IRCCS San Raffaele Scientific Institute, 20132 Milan, Italy

2. Data Science Unit, Fondazione IRCCS Istituto Nazionale dei Tumori, 20133 Milan, Italy

3. Radiotherapy, IRCCS San Raffaele Scientific Institute, 20132 Milan, Italy

4. Department of Radiotherapy, Vita-Salute San Raffaele University, 20132 Milan, Italy

Abstract

Purpose. Different ML models were compared to predict toxicity in RT on a large cohort (n = 1314). Methods. The endpoint was RTOG G2/G3 acute toxicity, resulting in 204/1314 patients with the event. The dataset, including 25 clinical, anatomical, and dosimetric features, was split into 984 for training and 330 for internal tests. The dataset was standardized; features with a high p-value at univariate LR and with Spearman ρ>0.8 were excluded; synthesized data of the minority were generated to compensate for class imbalance. Twelve ML methods were considered. Model optimization and sequential backward selection were run to choose the best models with a parsimonious feature number. Finally, feature importance was derived for every model. Results. The model’s performance was compared on a training–test dataset over different metrics: the best performance model was LightGBM. Logistic regression with three variables (LR3) selected via bootstrapping showed performances similar to the best-performing models. The AUC of test data is slightly above 0.65 for the best models (highest value: 0.662 with LightGBM). Conclusions. No model performed the best for all metrics: more complex ML models had better performances; however, models with just three features showed performances comparable to the best models using many (n = 13–19) features.

Funder

Fondazione Regionale per la Ricerca Biomedica

Publisher

MDPI AG

Reference50 articles.

1. Grand challenges for medical physics in radiation oncology;Fiorino;Radiother. Oncol.,2020

2. Technology-driven research for radiotherapy innovation;Fiorino;Mol. Oncol.,2020

3. Artificial intelligence in radiotherapy;Siddique;Rep. Pract. Oncol. Radiother.,2020

4. Artificial intelligence in cancer therapy;Ho;Science,2020

5. Chow, J.C.L. (2021). Artificial Intelligence in Medicine, Springer International Publishing.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3