Author:
Yu Canqing, Lan Xianmei, Tao Ye, Guo Yu, Sun Dianjianyi, Qian Puyi, Zhou Yuwen, Walters Robin, Li Linxuan, Millwood Iona, Zeng Jingyu, Pei Pei, Guo Ruidong, Du Huaidong, Yang Tao, Yang Ling, Yang Fan, Chen Yiping, Chen Fengzhen, Jiang Xiaosen, Ye Zhiqiang, Ren Fangyi, Dai Lanlan, Wei Xiaofeng, Xu Xun, Yang Huanming, Wang Jian, Chen Zhengming, Zhu Huanhuan, Lv Jun, Jin Xin, Li Liming
Abstract
Precision medicine relies on high-accuracy individual-level genotype data. However, whole-genome sequencing (WGS) is currently not suitable for studies with very large sample sizes because of budget constraints. It is therefore particularly important to construct a highly accurate haplotype reference panel for genotype imputation. In this study, we selected 9,950 individuals from the China Kadoorie Biobank (CKB) cohort and 50 Chinese samples from the 1000 Genomes Project (1KGP) for medium-depth WGS to construct a CKB reference panel. When used to impute microarray datasets, the CKB panel outperformed the extended high-coverage 1KGP, TOPMed, ChinaMAP, and NyuWa panels in both the number of well-imputed variants and imputation accuracy. In addition, we have imputed over 100,000 CKB microarray samples with the CKB panel, and the resulting imputed genotype data form the largest whole-genome dataset of the Chinese population to date. Finally, we developed an online server that offers a free genotype imputation service based on the CKB reference panel (https://db.cngb.org/imputation/). We believe that the CKB reference panel is of great value for imputing microarray or low-depth genotype data from the Chinese population. The imputed data for the 100,000 microarray samples are a fundamental resource for population genetic studies of complex traits and diseases in the Chinese population.
Publisher
Cold Spring Harbor Laboratory
Cited by
1 article.