Evaluating Machine Learning Classifiers in Breast Cancer: Non-Linear Contributions of MR Diffusion-Perfusion Features to Molecular-based Prognostic Stratification

Author:

Amini Behnam,Ghasemi Moein,Farazandeh Dorreh,M. Mohammad H. Akbarizadeh,Farzaneh Hana,Torabi Sarah,Sedaghat Mona,Jafarimehrabady Niloofar,Hajiabbasi MobasherORCID,Azizi Ashkan,Gorjestani Omidreza,Naviafar Anahita,Hosseini Mohammad M.,Karimi Nastaran,Parsaei Amirhossein,Rahmani AlirezaORCID,Doshmanziari Reza,Vajihinezhad MaryamORCID,Rikhtehgar Masih1,Nokiani Alireza Almasi1

Affiliation:

1. Department of Radiology, Iran University of Medical Sciences

Abstract

Abstract Background Diffusion-weighted imaging (DWI) map the microenvironment of breast cancer (BC) into cellular density and membrane integrity, and captures the effects of capillary microcirculation and intracellular structures through multi b-value analyses. Amidst potential biases in the radiomics pipeline, we aim to discern clinically relevant features from artifacts, improving machine learning (ML) classifier applicability in BC diagnostics through informed feature selection. Methods We prospectively enrolled 148 BC patients for ML classifier training, with an additional 98 patients included retrospectively for validation, ensuring consistent imaging and post-processing standards. Tumor subtypes were classified based on hormone receptor (HR), Human Epidermal Growth Factor Receptor 2 (HER2), and Ki67 levels. Utilizing a wide range of ML classifiers, we pinpointed an optimal feature count range of 8–13 for maximal training efficacy and generalizability, given our training and validation cohort sizes. Specifically, 12 domain-specific multi b-value DWI features were selected, focusing on entropy and first-order statistics of the of apparent diffusion coefficient (ADC), and higher-order statistical features (intravoxel incoherent motion (IVIM) parameters Dt, fp, Dp; diffusion kurtosis imaging (DKI) metrics MD, MK). Classifier stability was gauged by the interfold range of 4-fold cross-validation area under the curve (AUC) for the training dataset, while performance was assessed based on validation dataset AUC. Significant DWI features for molecular-based stratifications were identified based on a combined criterion applied to the ML classifier with the highest validation AUC, prioritizing the top three features ranked by importance and with a stability score over 0.7 in subsampling. Results Among linear classifiers, Stochastic Gradient Descent (SGD) stood out by showing distinct predictive power for HR status, contrasting with the generally limited effectiveness of other linear models. Non-linear classifiers significantly outperformed linear models across other categories. Random Forest excelled in Ki67 and luminal A subtype, AdaBoost in triple-negative subtyping, and XGBoost in HER2 status and subtype. SVM with Radial Basis Function kernels and Feed-Forward Neural Network jointly showed proficiency in classifying luminal HER2. Notably, XGBoost and Random Forest demonstrated stable feature selection processes. The entropy and first-order features of ADC was pivotal across molecular-based prognostic stratifications. IVIM features significantly influenced HR and Ki67 statuses, along with their attributed subtypes (luminal A, luminal B, and triple-negative). Conversely, DKI features were uniquely predictive in the HER2 domain, highlighting their distinctive contributions to the stratification of luminal HER2 and HER2 subtypes. Conclusions Non-linear machine learning classifiers excel in BC stratification, leveraging complex DWI features to deepen insights into cancer subtypes and molecular characteristics, marking a strategic evolution towards precision diagnostics.

Publisher

Research Square Platform LLC

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3