Manually-established abnormal karyotype dataset based on normal chromosomes effectively train artificial intelligence model for better cytogenetic abnormalities prediction

Author:

Deng Jinhai1,Peng Weixiong1,Lu Qinyang1,Wang Zheng2,Fu Qiang2,Zhou Xingang3,Cai Yufeng4,Mu Yang4,Pan Teng5,Liu Zaoqu6,Cai Zixing4,Yin Mingzhu7,Liu Lijue4,Lai Yueyun2

Affiliation:

1. Xiangjiang Laboratory

2. Peking University People's Hospital, Peking University Institute of Hematology, National Clinical Research Center for Hematologic Disease

3. Capital Medical University

4. Hunan Zixing Intelligent Medical Technology Co., Ltd

5. King's College London

6. The First Affiliated Hospital of Zhengzhou University

7. Chongqing University Three Gorges Hospital, Chongqing University

Abstract

Abstract With the advent of the utilization of machine learning techniques in the diagnosis of hematological diseases, endless potential can be foreseen, including digital images analysis. The application of machine-learning tool in cytogenetics contributes to the lightening of manpower burden, the improvement of recognition efficiency and the enrichment of cytogenetic maps, which paves the way for the development of digital pathology. Chromosome banding analysis is an essential technique for chromosome karyotyping, which comprises of one of important tools for the diagnostics in hematological malignancies. Its important role has been emphasized in clinic for dozens of years till now. The recognition of abnormal karyotypes is indispensable for disease classification and even diagnosis. However, a lack of abnormal karyotype images as reference dataset restricts its utilization in clinic, especially for uncommon hematological diseases. Here, to our best knowledge, we, for the first time, successfully generated abnormal karyotype images of t(9;22)(q34;q11)manually from normal karyotype images using machine learning, providing a proof-of-concept for establishing abnormal karyotypes of hematological malignancies as clinical reference. Moreover, to verify the reliability of generated abnormal dataset, artificial intelligence (AI)-recognizing models were further established based on ‘manually-built’ karyogram dataset and real karyotype dataset, respectively. The results showed that there was no difference between ‘manually-built’ karyotype dataset derived AI model (model-M) and real karyotype dataset derived AI model (model-R) regarding the recognition of t(9;22)(q34;q11) abnormality, with model-M (AUC=0.984, 95%CI 0.98-0.988) versus model-R (AUC=0.988, 95%CI 0.984-0.993) (p>0.05), which pointed out that our generated abnormal karyotype images were comparable to real images to assist the establishment of AI-recognising models. Collectively, our work demonstrates the potential application of machine learning in generating unlimited dataset from limited sources, helping to overcome the big challenge of AI in healthcare.

Publisher

Research Square Platform LLC

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3