Abstract A116: A machine learning-derived transcriptomic-based biomarker identifies low grade pancreatic ductal adenocarcinoma tumors and may help in treatment decision making

Author:

Fu Daniel1,Zambirinis Constantinos P.2

Affiliation:

1. 1New Jersey Medical School, Rutgers University, Newark, NJ,

2. 2Rutgers Cancer Institute of New Jersey, New Brunswick, NJ.

Abstract

Abstract Introduction: Pancreatic ductal adenocarcinoma (PDAC) is a very aggressive and highly lethal malignancy. The majority of cases present with locally advanced unresectable or metastatic disease, which precludes surgical resection and makes cure virtually impossible. However, a subset of tumors with more indolent biology may be candidates for aggressive cytoreduction. Further, synchronous resection of oligometastatic disease is being investigated in prospective trials. We sought to develop a prognostic biomarker that could identify patients with low grade tumors. Methods: The Pancreatic Adenocarcinoma TCGA dataset was analyzed on the cBioPortal platform, comparing 173 patients with non-metastatic PDAC, known tumor grade, and available mRNAseq data. Well-differentiated (G1) tumors were compared to moderately (G2) or poorly differentiated (G3) patients to identify differentially expressed (ΔE) genes. The patients were split into training (n=138) and testing sets (n=35). Low-importance genes were eliminated using a combination of LASSO regression, recursive feature elimination, random forest (RF), and Boruta algorithms. SMOTE was employed to address minority class imbalance. Support vector machine (SVM), RF, and k-nearest neighbors (kNN) were independently trained for the binary classification of G1 vs G2/3 PDAC. The best model and hyperparameters were selected using the mean bootstrapped nested k-fold cross-validation F1-score and evaluated on the held-out test set. Lastly, the model predictions of grade were used to divide patients into two cohorts for Results: There were 1335 ΔE genes between G1 and G2/3 tumors. A panel of 7 genes were unanimously selected by the four low importance gene-elimination algorithms, and used for generation of predictive algorithms. In bootstrapped nested k-fold cross-validation of the training dataset, mean sensitivity and specificity were 0.98 and 0.91 for G1, respectively, and 0.91 and 0.97 for G2/3 PDAC. The kNN model performed best, with a mean F1-score of 0.942 (95% CI: [0.941, 0.943]) on the training set and 0.781 (95% CI: [0.776, 0.786]) for the test set. Conclusion: This study presents a novel machine learning-generated transcriptomic-based tool that predicts PDAC grade with good accuracy, and has the potential to identify less aggressive variants of PDAC that may be more amenable to aggressive cytoreductive approaches. Further testing and validation of the algorithm in larger datasets and using prospectively collected needle biopsy-acquired material is a worthwhile next step, to refine this into a potent clinical risk assessment tool. Citation Format: Daniel Fu, Constantinos P. Zambirinis. A machine learning-derived transcriptomic-based biomarker identifies low grade pancreatic ductal adenocarcinoma tumors and may help in treatment decision making [abstract]. In: Proceedings of the AACR Special Conference in Cancer Research: Pancreatic Cancer; 2023 Sep 27-30; Boston, Massachusetts. Philadelphia (PA): AACR; Cancer Res 2024;84(2 Suppl):Abstract nr A116.

Publisher

American Association for Cancer Research (AACR)

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3