Characterization of large-scale genomic differences in the first complete human genome
-
Published:2023-07-04
Issue:1
Volume:24
Page:
-
ISSN:1474-760X
-
Container-title:Genome Biology
-
language:en
-
Short-container-title:Genome Biol
Author:
Yang Xiangyu, Wang Xuankai, Zou Yawen, Zhang Shilong, Xia Manying, Fu Lianting, Vollger Mitchell R., Chen Nae-Chyun, Taylor Dylan J., Harvey William T., Logsdon Glennis A., Meng Dan, Shi Junfeng, McCoy Rajiv C., Schatz Michael C., Li Weidong, Eichler Evan E., Lu Qing, Mao YafeiORCID
Abstract
Abstract
Background
The first telomere-to-telomere (T2T) human genome assembly (T2T-CHM13) release is a milestone in human genomics. The T2T-CHM13 genome assembly extends our understanding of telomeres, centromeres, segmental duplication, and other complex regions. The current human genome reference (GRCh38) has been widely used in various human genomic studies. However, the large-scale genomic differences between these two important genome assemblies are not characterized in detail yet.
Results
Here, in addition to the previously reported “non-syntenic” regions, we find 67 additional large-scale discrepant regions and precisely categorize them into four structural types with a newly developed website tool called SynPlotter. The discrepant regions (~ 21.6 Mbp) excluding telomeric and centromeric regions are highly structurally polymorphic in humans, where the deletions or duplications are likely associated with various human diseases, such as immune and neurodevelopmental disorders. The analyses of a newly identified discrepant region—the KLRC gene cluster—show that the depletion of KLRC2 by a single-deletion event is associated with natural killer cell differentiation in ~ 20% of humans. Meanwhile, the rapid amino acid replacements observed within KLRC3 are probably a result of natural selection in primate evolution.
Conclusion
Our study provides a foundation for understanding the large-scale structural genomic differences between the two crucial human reference genomes, and is thereby important for future human genomics studies.
Funder
Shanghai Pujiang Program Shanghai Jiao Tong University 2030 Program National Natural Science Foundation of China Opening research fund from Shanghai Key Laboratory of Stomatology, Shanghai Ninth People’s Hospital, College of Stomatology, Shanghai Jiao Tong University School of Medicine US National Institutes of Health (NIH) grant NIGMS grants
Publisher
Springer Science and Business Media LLC
Reference96 articles.
1. Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, et al. Initial sequencing and analysis of the human genome. Nature. 2001;409:860–921. 2. Venter JC, Adams MD, Myers EW, Li PW, Mural RJ, Sutton GG, et al. The sequence of the human genome. Science. 2001;291:1304–51. 3. Navarro Gonzalez J, Zweig AS, Speir ML, Schmelter D, Rosenbloom KR, Raney BJ, et al. The UCSC genome browser database: 2021 update. Nucleic Acids Res. 2021;49:D1046–57. 4. Nurk S, Koren S, Rhie A, Rautiainen M, Bzikadze AV, Mikheenko A, et al. The complete sequence of a human genome. Science. 2022;376:44–53. 5. Altemose N, Logsdon GA, Bzikadze AV, Sidhwani P, Langley SA, Caldas GV, et al. Complete genomic and epigenetic maps of human centromeres. Science. 2022;376:eabl4178.
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|