Buyer Beware: confounding factors and biases abound when predicting omics-based biomarkers from histological images

Author:

Dawood Muhammad, Branson Kim, Tejpar Sabine, Rajpoot Nasir, Minhas Fayyaz ul Amir Afsar

Abstract

Summary

Background: Recent advancements in computational pathology have introduced deep learning methods to predict genomic, transcriptomic and molecular biomarkers from routine histology whole slide images (WSIs) for cancer diagnosis, prognosis, and treatment. However, existing methods often overlook the critical role of co-dependencies among biomarker statuses during training and inference. We hypothesize that this oversight results in models that predict the combined effect of multiple interdependent biomarkers rather than individual statuses independently, akin to attributing the quality of an orchestral symphony to a single instrument, highlighting limitations of current predictors.

Methods: Using large datasets (n = 8,221 patients), we conducted statistical co-dependence testing to demonstrate significant interdependencies among biomarker statuses in training datasets. Following standard protocols, we trained two machine learning models to predict biomarkers from WSIs, achieving or matching state-of-the-art predictive performance. We then employed permutation testing and stratification analysis to evaluate their predictive quality based on the principle of conditional independence, i.e., if a model accurately captures the phenotypic influence of a specific biomarker independent of other biomarkers, its performance should remain consistent across subgroups of patients stratified by other biomarkers, aligning with its overall performance on the entire dataset.

Findings: Our statistical analysis reveals significant interdependencies among biomarkers, reflecting expected co-occurrence and mutual exclusivity patterns influenced by pathological and biological processes that are consistent across datasets, as well as sampling artefacts that can be different across datasets. Our results indicate that the predictive quality of an image-based predictor for a biomarker is contingent on the status of other biomarkers, revealing that models capture aggregated influences rather than predicting individual statuses independently. For example, mutation predictions are confounded by the overall tumour mutation burden. We also show that, due to the presence of such correlations, deep learning models may not offer significant advantages in predicting certain biomarkers in comparison to simply using pathologist-assigned grades for their prediction.

Interpretation: We show that current deep learning models in computational pathology fall short in isolating individual biomarker effects, leading to confounded and less precise predictions. Our findings suggest revisiting model training protocols to recognize and adjust for biomarker interdependencies at all development stages, from problem definition to usage guidelines. This involves selecting diverse datasets to reflect clinical heterogeneity, defining prediction variables or grouping patients based on co-dependencies, designing models to disentangle complex relationships, and stringent stratification testing. Clinically, failure to account for interdependencies may lead to suboptimal decisions, necessitating appropriate usage guidelines for predictive models.
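The conditional-independence check described in the Methods can be illustrated with a small sketch. This is not the authors' code or their exact statistical procedure: the function names (`auroc`, `stratified_auroc`, `permutation_pvalue`), the choice of AUROC as the performance metric, and the toy cohort are all assumptions made for illustration. It compares a predictor's overall performance for a target biomarker with its performance inside subgroups defined by a second biomarker, and uses a simple label-shuffling permutation test within one subgroup.

```python
import random

def auroc(labels, scores):
    """AUROC as the fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes are required to compute AUROC")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def stratified_auroc(target, scores, other):
    """AUROC for the target biomarker inside each stratum of another biomarker."""
    result = {}
    for level in sorted(set(other)):
        idx = [i for i, o in enumerate(other) if o == level]
        result[level] = auroc([target[i] for i in idx], [scores[i] for i in idx])
    return result

def permutation_pvalue(labels, scores, n_perm=2000, seed=0):
    """One-sided p-value: how often shuffled labels reach the observed AUROC."""
    rng = random.Random(seed)
    observed = auroc(labels, scores)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if auroc(shuffled, scores) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)

# Hypothetical toy cohort (not data from the paper): the target biomarker
# co-occurs with a second biomarker, and the scores track the second one.
other = [1] * 8 + [0] * 8
target = [1, 1, 1, 1, 1, 1, 0, 0] + [0, 0, 0, 0, 0, 0, 1, 1]
scores = [0.9, 0.8, 0.7, 0.9, 0.8, 0.7, 0.8, 0.8] + [0.3, 0.2, 0.1, 0.3, 0.2, 0.1, 0.2, 0.2]

overall = auroc(target, scores)                        # 0.75
by_stratum = stratified_auroc(target, scores, other)   # {0: 0.5, 1: 0.5}
p_in_stratum = permutation_pvalue(target[:8], scores[:8])
print(overall, by_stratum, p_in_stratum)
```

In this constructed cohort the predictor reaches an overall AUROC of 0.75 for the target biomarker, yet performs at chance (0.5) once patients are stratified by the second biomarker. That is the pattern the abstract describes: apparent performance driven by a co-dependent biomarker rather than by the target itself.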

Publisher

Cold Spring Harbor Laboratory
