Machine learning integrates genomic signatures for subclassification beyond primary and secondary acute myeloid leukemia

Author:

Awada Hassan1,Durmaz Arda1ORCID,Gurnari Carmelo1,Kishtagari Ashwin1,Meggendorfer Manja2,Kerr Cassandra M.1,Kuzmanovic Teodora1,Durrani Jibran1,Shreve Jacob1,Nagata Yasunobu1,Radivoyevitch Tomas3,Advani Anjali S.4ORCID,Ravandi Farhad5,Carraway Hetty E.4ORCID,Nazha Aziz46,Haferlach Claudia2,Saunthararajah Yogen14ORCID,Scott Jacob1ORCID,Visconte Valeria1,Kantarjian Hagop5,Kadia Tapan5,Sekeres Mikkael A.14,Haferlach Torsten2,Maciejewski Jaroslaw P.14

Affiliation:

1. Department of Translational Hematology and Oncology Research, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH;

2. Munich Leukemia Laboratory, Munich, Germany;

3. Department of Quantitative Health Sciences, Cleveland Clinic, Cleveland, OH;

4. Leukemia Program, Department of Hematology and Medical Oncology, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH;

5. Department of Leukemia, The University of Texas MD Anderson Cancer Center, Houston, TX; and

6. Center for Clinical Artificial Intelligence, Cleveland Clinic, Cleveland, OH

Abstract

Abstract Although genomic alterations drive the pathogenesis of acute myeloid leukemia (AML), traditional classifications are largely based on morphology, and prototypic genetic founder lesions define only a small proportion of AML patients. The historical subdivision of primary/de novo AML and secondary AML has shown to variably correlate with genetic patterns. The combinatorial complexity and heterogeneity of AML genomic architecture may have thus far precluded genomic-based subclassification to identify distinct molecularly defined subtypes more reflective of shared pathogenesis. We integrated cytogenetic and gene sequencing data from a multicenter cohort of 6788 AML patients that were analyzed using standard and machine learning methods to generate a novel AML molecular subclassification with biologic correlates corresponding to underlying pathogenesis. Standard supervised analyses resulted in modest cross-validation accuracy when attempting to use molecular patterns to predict traditional pathomorphologic AML classifications. We performed unsupervised analysis by applying the Bayesian latent class method that identified 4 unique genomic clusters of distinct prognoses. Invariant genomic features driving each cluster were extracted and resulted in 97% cross-validation accuracy when used for genomic subclassification. Subclasses of AML defined by molecular signatures overlapped current pathomorphologic and clinically defined AML subtypes. We internally and externally validated our results and share an open-access molecular classification scheme for AML patients. Although the heterogeneity inherent in the genomic changes across nearly 7000 AML patients was too vast for traditional prediction methods, machine learning methods allowed for the definition of novel genomic AML subclasses, indicating that traditional pathomorphologic definitions may be less reflective of overlapping pathogenesis.

Publisher

American Society of Hematology

Subject

Cell Biology,Hematology,Immunology,Biochemistry

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3