Author:
Nembot-Simo Annick,Graham Jinko,McNeney Brad
Abstract
Abstract
Background
In population association studies, standard methods of statistical inference assume that study subjects are independent samples. In genetic association studies, it is therefore of interest to diagnose undocumented close relationships in nominally unrelated study samples.
Results
We describe the R package CrypticIBDcheck to identify pairs of closely-related subjects based on genetic marker data from single-nucleotide polymorphisms (SNPs). The package is able to accommodate SNPs in linkage disequibrium (LD), without the need to thin the markers so that they are approximately independent in the population. Sample pairs are identified by superposing their estimated identity-by-descent (IBD) coefficients on plots of IBD coefficients for pairs of simulated subjects from one of several common close relationships.
Conclusions
The methods implemented in CrypticIBDcheck are particularly relevant to candidate-gene association studies, in which dependent SNPs cluster in a relatively small number of genes spread throughout the genome. The accommodation of LD allows the use of all available genetic data, a desirable property when working with a modest number of dependent SNPs within candidate genes. CrypticIBDcheck is available from the Comprehensive R Archive Network (CRAN).
Publisher
Springer Science and Business Media LLC
Subject
Information Systems and Management,Health Informatics,Computer Science Applications,Information Systems
Reference22 articles.
1. Devlin B, Roeder K: Genomic control for association studies. Biometrics. 1999, 55: 997-1004. 10.1111/j.0006-341X.1999.00997.x.
2. Voight B, Pritchard J: Confounding from cryptic relatedness in case-control association studies. Plos Genet. 2005, 1: e32-10.1371/journal.pgen.0010032.
3. Newman DL, Abney M, McPeek MS, Ober C, Cox NJ: The importance of genealogy in determining genetic associations with complex traits. Am J Hum Genet. 2001, 69 (5): 1146-1148. 10.1086/323659.
4. R Development Core Team: R: A Language and Environment for Statistical Computing. 2012, Vienna: R Foundation for Statistical Computing, [http://www.R-project.org/]. [ISBN 3-900051-07-0]
5. Weir B, Anderson A, Hepler A: Genetic relatedness analysis: modern data and new challenges. Nat Rev Genet. 2006, 7: 771-780. 10.1038/nrg1960.
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献