A familial, telomere-to-telomere reference for humande novomutation and recombination from a four-generation pedigree
Author:
Porubsky DavidORCID, Dashnow HarrietORCID, Sasani Thomas A.ORCID, Logsdon Glennis A.ORCID, Hallast PilleORCID, Noyes Michelle D., Kronenberg Zev N., Mokveld Tom, Koundinya Nidhi, Nolan Cillian, Steely Cody J.ORCID, Guarracino AndreaORCID, Dolzhenko Egor, Harvey William T.ORCID, Rowell William J., Grigorev KirillORCID, Nicholas Thomas J.ORCID, Oshima Keisuke K., Lin JiadongORCID, Ebert PeterORCID, Watkins W. ScottORCID, Leung Tiffany Y., Hanlon Vincent C.T., McGee Sean, Pedersen Brent S., Goldberg Michael E.ORCID, Happ Hannah C., Jeong Hyeonsoo, Munson Katherine M.ORCID, Hoekzema KendraORCID, Chan Daniel D., Wang Yanni, Knuth Jordan, Garcia Gage H.ORCID, Fanslow Cairbre, Lambert Christine, Lee CharlesORCID, Smith Joshua D., Levy ShawnORCID, Mason Christopher E.ORCID, Garrison ErikORCID, Lansdorp Peter M., Neklason Deborah W., Jorde Lynn B., Quinlan Aaron R., Eberle Michael A.ORCID, Eichler Evan E.ORCID
Abstract
ABSTRACTUsing five complementary short- and long-read sequencing technologies, we phased and assembled >95% of each diploid human genome in a four-generation, 28-member family (CEPH 1463) allowing us to systematically assessde novomutations (DNMs) and recombination. From this family, we estimate an average of 192 DNMs per generation, including 75.5de novosingle-nucleotide variants (SNVs), 7.4 non-tandem repeat indels, 79.6de novoindels or structural variants (SVs) originating from tandem repeats, 7.7 centromericde novoSVs and SNVs, and 12.4de novoY chromosome events per generation. STRs and VNTRs are the most mutable with 32 loci exhibiting recurrent mutation through the generations. We accurately assemble 288 centromeres and six Y chromosomes across the generations, documentingde novoSVs, and demonstrate that the DNM rate varies by an order of magnitude depending on repeat content, length, and sequence identity. We show a strong paternal bias (75-81%) for all forms of germline DNM, yet we estimate that 17% ofde novoSNVs are postzygotic in origin with no paternal bias. We place all this variation in the context of a high-resolution recombination map (∼3.5 kbp breakpoint resolution). We observe a strong maternal recombination bias (1.36 maternal:paternal ratio) with a consistent reduction in the number of crossovers with increasing paternal (r=0.85) and maternal (r=0.65) age. However, we observe no correlation between meiotic crossover locations andde novoSVs, arguing against non-allelic homologous recombination as a predominant mechanism. The use of multiple orthogonal technologies, near-telomere-to-telomere phased genome assemblies, and a multi-generation family to assess transmission has created the most comprehensive, publicly available “truth set” of all classes of genomic variants. The resource can be used to test and benchmark new algorithms and technologies to understand the most fundamental processes underlying human genetic variation.
Publisher
Cold Spring Harbor Laboratory
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|