NIPT-PG: empowering non-invasive prenatal testing to learn from population genomics through an incremental pan-genomic approach

Author:

Xue Zhengfa12,Zhou Aifen3454,Zhu Xiaoyan12,Li Linxuan67,Zhu Huanhuan6,Jin Xin68,Wang Jiayin12ORCID

Affiliation:

1. School of Computer Science and Technology, Xi’an Jiaotong University , Xi’an 710049 , China

2. Shaanxi Engineering Research Center of Medical and Health Big Data, Xi’an Jiaotong University , Xi’an 710049 , China

3. Institute of Maternal and Child Health , Wuhan Children’s Hospital (Wuhan Maternal and Child Health care Hospital), Tongji Medical College, , Wuhan 430015 , China

4. Huazhong University of Science and Technology , Wuhan Children’s Hospital (Wuhan Maternal and Child Health care Hospital), Tongji Medical College, , Wuhan 430015 , China

5. Department of Obstetrics , Wuhan Children’s Hospital (Wuhan Maternal and Child Health care Hospital), Tongji Medical College, , Wuhan 430015 , China

6. BGI Research , Shenzhen 518083 , China

7. College of Life Sciences, University of Chinese Academy of Sciences , Beijing 100049 , China

8. School of Medicine, South China University of Technology , Guangzhou 510006 , China

Abstract

Abstract Non-invasive prenatal testing (NIPT) is a quite popular approach for detecting fetal genomic aneuploidies. However, due to the limitations on sequencing read length and coverage, NIPT suffers a bottleneck on further improving performance and conducting earlier detection. The errors mainly come from reference biases and population polymorphism. To break this bottleneck, we proposed NIPT-PG, which enables the NIPT algorithm to learn from population data. A pan-genome model is introduced to incorporate variant and polymorphic loci information from tested population. Subsequently, we proposed a sequence-to-graph alignment method, which considers the read mis-match rates during the mapping process, and an indexing method using hash indexing and adjacency lists to accelerate the read alignment process. Finally, by integrating multi-source aligned read and polymorphic sites across the pan-genome, NIPT-PG obtains a more accurate z-score, thereby improving the accuracy of chromosomal aneuploidy detection. We tested NIPT-PG on two simulated datasets and 745 real-world cell-free DNA sequencing data sets from pregnant women. Results demonstrate that NIPT-PG outperforms the standard z-score test. Furthermore, combining experimental and theoretical analyses, we demonstrate the probably approximately correct learnability of NIPT-PG. In summary, NIPT-PG provides a new perspective for fetal chromosomal aneuploidies detection. NIPT-PG may have broad applications in clinical testing, and its detection results can serve as a reference for false positive samples approaching the critical threshold.

Funder

National Natural Science Foundation of China

Publisher

Oxford University Press (OUP)

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3