Meta-analysis of Multi-functional Biomarkers for Discovery and Predictive Modeling of Colorectal Adenoma and Carcinoma

Author:

Peterson Scott N.1,Eroshkin Alexey M.1,Kozbial Piotr Z.1,Florio Ermanno1,Fouladi Farnaz2,Strom Noah2,Valdes Yacgley1,Kuehn Gregory1,Casaburi Giorgio3,Kuehn Thomas1

Affiliation:

1. Prescient Metabiomics

2. Diversigen

3. Exagen Inc

Abstract

Abstract Background: Despite the effectiveness of colonoscopy for reducing colorectal cancer (CRC) mortality, poor screening compliance ranks CRC as the second most deadly malignancy. There is a need to develop a preventative, non-invasive diagnostic test, such as a fecal microbiota test, for early detection of both pre-cancerous adenomas and carcinomas to effectively reduce mortality. Results: We conducted a clinical meta-analysis of published deep metagenomic stool sequence datasets including 1,670 subjects from 9 countries, including 703 healthy controls, 161 precancerous colorectal adenoma (CRA), 48 advanced precancerous colorectal adenoma (CRAA) and 758 CRC cases diagnosed by colonoscopy. We analyzed these data through a novel automated machine learning workflow using a two-stage feature importance ranking and ensemble modeling method to identify and select highly predictive taxonomic and functional biomarkers. Machine learning modeling of selected features differentiated the metagenomic profiles of healthy patients from CRA, CRAA and CRC cases with an average area under the curve (AUC) for external holdout testing of 0.84 (sensitivity=0.82; specificity=0.71, accuracy=0.77) for CRC; an AUC of 0.97 (sensitivity=0.78; specificity=0.98, accuracy=0.97) for CRAA; and an AUC of 0.90 (sensitivity=0.74, specificity=0.89, accuracy=0.86) for CRA. These performance outcomes represented a 2%, 3% and 8% increase in AUC, compared to baseline ML performance, respectively. The predictive features identified for each disease class were largely distinct and represented differing proportions of taxonomic and functional features. Conclusions: The predictive taxonomic features identified for each disease class were largely distinct, whereas many functional gene features were shared across disease classes but displayed differing direction of change. Application of our ensemble approach for feature selection increased the predictive power of each disease class and moreover may generate discriminatory models with greater generalizability.

Publisher

Research Square Platform LLC

Reference44 articles.

1. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries;Bray F;CA Cancer J Clin,2018

2. The IARC Perspective on Colorectal Cancer Screening;Lauby-Secretan B;N Engl J Med,2018

3. Accuracy of Fecal Immunochemical Tests for Colorectal Cancer: Systematic Review and Meta-analysis;Lee JK;Ann Intern Med,2014

4. Comparative evaluation of immunochemical fecal occult blood tests for colorectal adenoma detection;Hundt S;Ann Intern Med,2009

5. Multitarget Stool DNA Testing for Colorectal-Cancer Screening;Imperiale TF;N Engl J Med,2014

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3