Improving prediction of survival and progression in metastatic non-small cell lung cancer following immunotherapy through machine learning of circulating tumor DNA dynamics

Author:

Ding HaolunORCID,Yuan MinORCID,Yang YaningORCID,Xu Xu StevenORCID

Abstract

AbstractObjectivesTo use modern machine-learning approaches to enhance and automate the feature extraction from the longitudinal ctDNA data and to improve the prediction of survival and disease progression, risk stratification, and treatment strategies for patients with 1L NSCLC.MethodsUsing IMpower150 trial data on untreated metastatic non-small cell lung cancer patients treated with atezolizumab and chemotherapies, we developed a machine-learning algorithm to extract predictive features from ctDNA kinetics, improving survival and progression prediction. We analyzed kinetic data from 17 ctDNA summary markers, including cell-free DNA concentration, allele frequency, tumor molecules in plasma, and mutation counts. Our machine-learning workflow (FPCRF) involved functional principal component analysis (FPCA) for automated feature extraction, random forest and bagging ensemble algorithms for feature selection, standard PCA for dimension reduction, and Cox proportional-hazards regression for survival analysis. The dataset was divided into training and test cohorts in the same way as a previous study.Results398 patients with ctDNA data (206 in training, 192 in validation) were analyzed. Our machine-learning models automated feature extraction, excelling in predicting overall survival (OS) and progression-free survival (PFS) at different landmarks. In identical train-test cohorts, our models outperformed existing ones using handcrafted ctDNA features, raising OS c-index to 0.72 and 0.71 from 0.67 and 0.63 for C3D1 and C4D1, and substantially improving PFS to ∼0.65 from the previous 0.54 - 0.58, a 12-20% increase. Our model enhanced risk stratification for NSCLC patients, achieving clear OS and PFS separation (e.g., on C3D1, HR: 2.65 [95%CI: 1.78–3.95, P < 0.001] for high vs. intermediate risk, 2.06 [95%CI: 1.29–3.29, P = 0.002] for intermediate vs. low risk; and PFS HR: 2.04 [95%CI: 1.41–2.94, P < 0.001], 1.56 [95%CI: 1.07–2.27, P = 0.02]). Distinct patterns of ctDNA kinetic characteristics (e.g., baseline ctDNA markers, depth of ctDNA responses, and timing of ctDNA clearance, etc.) were revealed across the risk groups. Rapid and complete ctDNA clearance appears essential for long-term clinical benefit.ConclusionsOur machine-learning approach offers a novel tool for analyzing ctDNA kinetics, extracting critical features from longitudinal data, improving our understanding of the link between ctDNA kinetics and progression/mortality risks, and optimizing personalized immunotherapies for 1L NSCLC.Research in contextEvidence before this studyThe longitudinal dynamics of ctDNA are showing promise as a biomarker for treatment outcomes and monitoring. However, despite of recent advances of machine learning, very limited applications have been reported in using machine learning-based approaches to analyze the longitudinal ctDNA data, improve the prediction of clinical outcomes, and refine the risk stratifications. We searched PubMed on Oct 8, 2023 for peer-reviewed, English-language journal and conference articles using the terms (“ctDNA”) AND (“deep learning” OR “machine learning” OR “artificial intelligence”). Fifty-nine (59) search results were found. After systematical review of these search results, we found only 4 research studies where longitudinal ctDNA dynamic/kinetic data were analyzed using machine-learning models to predict patient outcomes. These studies focused on building models using handcrafted features of ctDNA dynamics such as on-treatment ctDNA levels and early ctDNA changes and clearance, etc. So far, no studies have utilized machine- or deep-learning models to extract features from longitudinal ctDNA dynamics to inform and predict cancer patient outcomes.Added value of this studyWe developed a machine-learning algorithm to predict survival and disease progression using ctDNA data from the Impower150 trial on untreated metastatic non-small cell lung cancer patients receiving atezolizumab and chemotherapy. Our machine-learning models automatically extract informative features from longitudinal ctDNA dynamics, outperforming existing models based on handcrafted features in predicting overall survival and progression-free survival at various time points. They improved risk stratification and identified crucial ctDNA kinetic characteristics in 1L NSCLC, revealing the importance of rapid and complete ctDNA clearance for long-term clinical benefit.Implications of all the available evidenceMachine-learning models can automatically extract prognostic features from longitudinal ctDNA dynamic trajectories, enable refined risk stratification and prediction of clinical outcomes, and thereby enhance ctDNA data’s utility in clinical patient care and personalized treatment.

Publisher

Cold Spring Harbor Laboratory

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3