Increased mutation and gene conversion within human segmental duplications
Author:
Vollger Mitchell R.ORCID, Dishuck Philip C.ORCID, Harvey William T., DeWitt William S., Guitart Xavi, Goldberg Michael E., Rozanski Allison N., Lucas Julian, Asri Mobin, Abel Haley J., Antonacci-Fulton Lucinda L., Baid Gunjan, Baker Carl A., Belyaeva Anastasiya, Billis Konstantinos, Bourque Guillaume, Buonaiuto Silvia, Carroll Andrew, Chaisson Mark J. P., Chang Pi-Chuan, Chang Xian H., Cheng Haoyu, Chu Justin, Cody Sarah, Colonna Vincenza, Cook Daniel E., Cook-Deegan Robert M., Cornejo Omar E., Diekhans Mark, Doerr Daniel, Ebert Peter, Ebler Jana, Eizenga Jordan M., Fairley Susan, Fedrigo Olivier, Felsenfeld Adam L., Feng Xiaowen, Fischer Christian, Flicek Paul, Formenti Giulio, Frankish Adam, Fulton Robert S., Gao Yan, Garg Shilpa, Garrison Erik, Garrison Nanibaa’ A., Giron Carlos Garcia, Green Richard E., Groza Cristian, Guarracino Andrea, Haggerty Leanne, Hall Ira M., Haukness Marina, Haussler David, Heumos Simon, Hickey Glenn, Hourlier Thibaut, Howe Kerstin, Jain Miten, Jarvis Erich D., Ji Hanlee P., Kenny Eimear E., Koenig Barbara A., Kolesnikov Alexey, Korbel Jan O., Kordosky Jennifer, Koren Sergey, Lee HoJoon, Li Heng, Liao Wen-Wei, Lu Shuangjia, Lu Tsung-Yu, Lucas Julian K., Magalhães Hugo, Marco-Sola Santiago, Marijon Pierre, Markello Charles, Marschall Tobias, Martin Fergal J., McCartney Ann, McDaniel Jennifer, Miga Karen H., Mitchell Matthew W., Monlong Jean, Mountcastle Jacquelyn, Mwaniki Moses Njagi, Nattestad Maria, Novak Adam M., Nurk Sergey, Olsen Hugh E., Olson Nathan D., Paten Benedict, Pesout Trevor, Phillippy Adam M., Popejoy Alice B., Prins Pjotr, Puiu Daniela, Rautiainen Mikko, Regier Allison A., Rhie Arang, Sacco Samuel, Sanders Ashley D., Schneider Valerie A., Schultz Baergen I., Shafin Kishwar, Sibbesen Jonas A., Sirén Jouni, Smith Michael W., Sofia Heidi J., Abou Tayoun Ahmad N., Thibaud-Nissen Françoise, Tomlinson Chad, Tricomi Francesca Floriana, Villani Flavia, Vollger Mitchell R., Wagner Justin, Walenz Brian, Wang Ting, Wood Jonathan M. D., Zimin Aleksey V., Zook Justin M., Munson Katherine M.ORCID, Lewis Alexandra P., Hoekzema Kendra, Logsdon Glennis A.ORCID, Porubsky DavidORCID, Paten BenedictORCID, Harris KelleyORCID, Hsieh PingHsunORCID, Eichler Evan E.ORCID,
Abstract
AbstractSingle-nucleotide variants (SNVs) in segmental duplications (SDs) have not been systematically assessed because of the limitations of mapping short-read sequencing data1,2. Here we constructed 1:1 unambiguous alignments spanning high-identity SDs across 102 human haplotypes and compared the pattern of SNVs between unique and duplicated regions3,4. We find that human SNVs are elevated 60% in SDs compared to unique regions and estimate that at least 23% of this increase is due to interlocus gene conversion (IGC) with up to 4.3 megabase pairs of SD sequence converted on average per human haplotype. We develop a genome-wide map of IGC donors and acceptors, including 498 acceptor and 454 donor hotspots affecting the exons of about 800 protein-coding genes. These include 171 genes that have ‘relocated’ on average 1.61 megabase pairs in a subset of human haplotypes. Using a coalescent framework, we show that SD regions are slightly evolutionarily older when compared to unique sequences, probably owing to IGC. SNVs in SDs, however, show a distinct mutational spectrum: a 27.1% increase in transversions that convert cytosine to guanine or the reverse across all triplet contexts and a 7.6% reduction in the frequency of CpG-associated mutations when compared to unique DNA. We reason that these distinct mutational properties help to maintain an overall higher GC content of SD DNA compared to that of unique DNA, probably driven by GC-biased conversion between paralogous sequences5,6.
Publisher
Springer Science and Business Media LLC
Subject
Multidisciplinary
Reference77 articles.
1. Bailey, J. A., Yavor, A. M., Massa, H. F., Trask, B. J. & Eichler, E. E. Segmental duplications: organization and impact within the current human genome project assembly. Genome Res. 11, 1005–1017 (2001). 2. Fredman, D. et al. Complex SNP-related sequence variation in segmental genome duplications. Nat. Genet. 36, 861–866 (2004). 3. Liao, W.-W. et al. A draft human pangenome reference. Nature, https://doi.org/10.1038/s41586-023-05896-x (2023). 4. Ebert, P. et al. Haplotype-resolved diverse human genomes and integrated analysis of structural variation. Science 372, eabf7117 (2021). 5. Duret, L. & Galtier, N. Biased gene conversion and the evolution of mammalian genomic landscapes. Annu. Rev. Genomics Hum. Genet. 10, 285–311 (2009).
Cited by
18 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|