Rapid detection of identity-by-descent tracts for mega-scale datasets

Published: 2021-06-10 Issue: 1 Volume: 12
ISSN: 2041-1723
Container-title: Nature Communications
Language: en
Short-container-title: Nat Commun

Author:

Shemirani Ruhollah, Belbin Gillian M., Avery Christy L., Kenny Eimear E., Gignoux Christopher R., Ambite José Luis

Abstract

The ability to identify segments of genomes identical-by-descent (IBD) is a part of standard workflows in both statistical and population genetics. However, traditional methods for finding local IBD across all pairs of individuals scale poorly, leading to a lack of adoption in very large-scale datasets. Here, we present iLASH, an algorithm based on similarity detection techniques that shows equal or improved accuracy in simulations compared to current leading methods and speeds up analysis by several orders of magnitude on genomic datasets, making IBD estimation tractable for millions of individuals. We apply iLASH to the PAGE dataset of ~52,000 multi-ethnic participants, including several founder populations with elevated IBD sharing, identifying IBD segments in ~3 minutes per chromosome compared to over 6 days for a state-of-the-art algorithm. iLASH enables efficient analysis of very large-scale datasets, as we demonstrate by computing IBD across the UK Biobank (~500,000 individuals), detecting 12.9 billion pairwise connections.

Publisher

Springer Science and Business Media LLC

Subject

General Physics and Astronomy; General Biochemistry, Genetics and Molecular Biology; General Chemistry

Link

http://www.nature.com/articles/s41467-021-22910-w.pdf

References: 44 articles.

1. Carmi, S. et al. The variance of identity-by-descent sharing in the Wright-Fisher model. Genetics 193, 911–928 (2013).

2. Erlich, Y., Shor, T., Pe’er, I. & Carmi, S. Identity inference of genomic data using long-range familial searches. Science 362, 690–694 (2018).

3. Palamara, P. F., Lencz, T., Darvasi, A. & Pe’er, I. Length distributions of identity by descent reveal fine-scale demographic history. Am. J. Hum. Genet. 91, 809–822 (2012).

4. Browning, S. R. & Browning, B. L. Accurate non-parametric estimation of recent effective population size from segments of identity by descent. Am. J. Hum. Genet. 97, 404–418 (2015).

5. Browning, S. R. & Browning, B. L. Identity by descent between distant relatives: detection and applications. Annu. Rev. Genet. 46, 617–633 (2012).

Cited by 25 articles.

1. Fast variance component analysis using large-scale ancestral recombination graphs;2024-08-31

2. Detection of distant relatedness in biobanks to identify undiagnosed cases of Mendelian disease as applied to Long QT syndrome;Nature Communications;2024-08-29

3. Identity-by-descent (IBD) segment outlier detection in endogamous populations using pedigree cohorts;2024-08-09

4. Identity-by-descent segments in large samples;2024-06-08

5. Benchmarking and Optimization of Methods for the Detection of Identity-By-Descent in High-Recombining Plasmodium falciparum;2024-05-05