Machine learning identifies cell-free DNA 5-hydroxymethylation biomarkers that detect occult colorectal cancer in PLCO Screening Trial subjects

Author:

West-Szymanski Diana C., Zhang Zhou, Cui Xiao-Long, Kowitwanich Krissana, Gao Lu, Deng Zifeng, Dougherty Urszula, Williams Craig, Merkle Shannon, Moore Matthew, He Chuan, Bissonnette Marc, Zhang Wei

Abstract

Background: Colorectal cancer (CRC) is a leading cause of cancer-related mortality, and CRC detection through screening improves survival rates. A promising avenue to improve patient screening compliance is the development of minimally invasive liquid biopsy assays that target CRC biomarkers on circulating cell-free DNA (cfDNA) in peripheral plasma. In this report, we identify cfDNA biomarker candidate genes bearing the epigenetic mark 5-hydroxymethylcytosine (5hmC) that detect occult CRC up to 36 months prior to clinical diagnosis using the Prostate, Lung, Colorectal and Ovarian (PLCO) Cancer Screening Trial samples.

Methods: Archived PLCO Trial plasma samples containing cfDNA were obtained from the National Cancer Institute (NCI) biorepositories. Study subjects included those who were diagnosed with CRC within 36 months of blood collection (i.e., cases, n = 201) and those who were not diagnosed with any cancer during an average of 16.3 years of follow-up (i.e., controls, n = 402). Following the extraction of 3-8 ng cfDNA from less than 300 microliters of plasma, we employed the sensitive 5hmC-Seal chemical labeling approach, followed by next-generation sequencing (NGS). We then conducted association studies and machine-learning modeling to analyze the genome-wide 5hmC profiles within training and validation groups that were randomly selected at a 2:1 ratio.

Results: Despite the technical challenges associated with the PLCO samples (e.g., limited plasma volumes, low cfDNA amounts, and long archival times), robust genome-wide 5hmC profiles were successfully obtained from these samples. Association analyses using Cox proportional hazards models suggested several epigenetic pathways relevant to CRC development that distinguish cases from controls. A weighted Cox model, comprising 32 associated gene bodies, showed predictive detection value for CRC as early as 24-36 months prior to overt tumor presentation, and a trend for increased predictive power was observed for blood samples collected closer to CRC diagnosis. Notably, the 5hmC-based predictive model showed comparable performance regardless of sex and self-reported race/ethnicity, and significantly outperformed risk factors such as age and obesity according to body mass index (BMI). Additionally, further improvement of predictive performance was achieved by combining the 5hmC-based model and risk factors for CRC.

Conclusions: An assay of 5hmC epigenetic signals on cfDNA revealed candidate biomarkers with the potential to predict CRC occurrence in the absence of clinical symptoms or of other effective predictors. Developing a minimally invasive clinical assay that detects 5hmC-modified biomarkers holds promise for improving early CRC detection and ultimately patient survival through higher screening compliance and earlier intervention. Future investigation to expand this strategy to prospectively collected samples is warranted.
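
As an illustration of the random 2:1 training/validation split mentioned in the Methods, the following is a minimal sketch only. It assumes the split is done separately within cases and controls (the abstract does not say whether the split was stratified), and the subject identifiers, seeds, and the helper name `split_two_to_one` are hypothetical, not taken from the study.

```python
import random


def split_two_to_one(ids, seed=0):
    """Randomly split ids into training and validation lists at about 2:1.

    Hypothetical helper: the abstract states only that groups were
    randomly selected at a 2:1 ratio, not how this was implemented.
    """
    rng = random.Random(seed)          # fixed seed for reproducibility
    shuffled = list(ids)
    rng.shuffle(shuffled)
    n_train = (2 * len(shuffled)) // 3  # two thirds go to training
    return shuffled[:n_train], shuffled[n_train:]


# Placeholder identifiers matching the reported group sizes.
cases = [f"case_{i}" for i in range(201)]
controls = [f"control_{i}" for i in range(402)]

# Assumption: split within each group so both sets keep the 1:2
# case-to-control ratio of the full cohort.
train_cases, valid_cases = split_two_to_one(cases, seed=1)
train_controls, valid_controls = split_two_to_one(controls, seed=2)

print(len(train_cases), len(valid_cases))
print(len(train_controls), len(valid_controls))
```

With the reported group sizes, this yields 134 training and 67 validation cases, and 268 training and 134 validation controls.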

Publisher

Cold Spring Harbor Laboratory
