Impact and characterization of serial structural variations across humans and great apes
-
Published:2024-09-13
Issue:1
Volume:15
Page:
-
ISSN:2041-1723
-
Container-title:Nature Communications
-
language:en
-
Short-container-title:Nat Commun
Author:
Höps Wolfram, Rausch TobiasORCID, Jendrusch Michael, , Ashraf Hufsah, Audano Peter A., Austine Ola, Basile Anna O., Beck Christine R., Jan Bonder Marc, Byrska-Bishop Marta, Chaisson Mark J. P., Chong Zechen, Corvelo André, Devine Scott E., Ebert Peter, Ebler Jana, Eichler Evan E., Gerstein Mark B., Hallast Pille, Harvey William T., Hasenfeld Patrick, Hastie Alex R., Henglin Mir, Hoekzema Kendra, Hsieh PingHsun, Hunt Sarah, Konkel Miriam K., Kordosky Jennifer, Lansdorp Peter M., Lee Charles, Lee Wan-Ping, Lewis Alexandra P., Li Chong, Lin Jiadong, Loftus Mark, Logsdon Glennis A., Marschall Tobias, Mills Ryan E., Mostovoy Yulia, Munson Katherine M., Narzisi Giuseppe, Pang Andy, Porubsky David, Prodanov Timofey, Rodriguez-Martin Bernardo, Shi Xinghua, Surapaneni Likhitha, Talkowski Michael E., Yilmaz Feyza, Yoo DongAhn, Zhou Weichen, Zody Michael C., Korbel Jan O.ORCID, Sedlazeck Fritz J.ORCID
Abstract
AbstractModern sequencing technology enables the systematic detection of complex structural variation (SV) across genomes. However, extensive DNA rearrangements arising through a series of mutations, a phenomenon we refer to as serial SV (sSV), remain underexplored, posing a challenge for SV discovery. Here, we present NAHRwhals (https://github.com/WHops/NAHRwhals), a method to infer repeat-mediated series of SVs in long-read genomic assemblies. Applying NAHRwhals to haplotype-resolved human genomes from 28 individuals reveals 37 sSV loci of various length and complexity. These sSVs explain otherwise cryptic variation in medically relevant regions such as the TPSAB1 gene, 8p23.1, 22q11 and Sotos syndrome regions. Comparisons with great ape assemblies indicate that most human sSVs formed recently, after the human-ape split, and involved non-repeat-mediated processes in addition to non-allelic homologous recombination. NAHRwhals reliably discovers and characterizes sSVs at scale and independent of species, uncovering their genomic abundance and suggesting broader implications for disease.
Funder
U.S. Department of Health & Human Services | National Institutes of Health Bundesministerium für Bildung und Forschung
Publisher
Springer Science and Business Media LLC
Reference74 articles.
1. Alonge, M. et al. Major impacts of widespread structural variation on gene expression and crop improvement in tomato. Cell 182, 145–161.e23 (2020). 2. Du, H. et al. The multiple de novo copy number variant (MdnCNV) phenomenon presents with peri-zygotic DNA mutational signatures and multilocus pathogenic variation. Genome Med 14, 122 (2022). 3. Beck, C. R. et al. Megabase length hypermutation accompanies human structural variation at 17p11.2. Cell 176, 1310–1324.e10 (2019). 4. Sekar, S. et al. Complex mosaic structural variations in human fetal brains. Genome Res 30, 1695–1704 (2020). 5. Beyter, D. et al. Long-read sequencing of 3,622 Icelanders provides insight into the role of structural variants in human diseases and other traits. Nat. Genet. 53, 779–786 (2021).
|
|