Correlation-Based Inference for Linkage Disequilibrium With Multiple Alleles-Reference-Cited by-同舟云学术

Correlation-Based Inference for Linkage Disequilibrium With Multiple Alleles

Published:2008-09-01 Issue:1 Volume:180 Page:533-545
ISSN:1943-2631
Container-title:Genetics
language:en
Short-container-title:

Author:

Zaykin Dmitri V¹,Pudovkin Alexander²,Weir Bruce S³

Affiliation:

1. National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, North Carolina 27709

2. Institute of Marine Biology, Vladivostok 690041, Russia and

3. Department of Biostatistics, University of Washington, Seattle, Washington 98195-7232

Abstract

Abstract The correlation between alleles at a pair of genetic loci is a measure of linkage disequilibrium. The square of the sample correlation multiplied by sample size provides the usual test statistic for the hypothesis of no disequilibrium for loci with two alleles and this relation has proved useful for study design and marker selection. Nevertheless, this relation holds only in a diallelic case, and an extension to multiple alleles has not been made. Here we introduce a similar statistic, R2, which leads to a correlation-based test for loci with multiple alleles: for a pair of loci with k and m alleles, and a sample of n individuals, the approximate distribution of n(k – 1)(m – 1)/(km)R2 under independence between loci is $\batchmode \documentclass[fleqn,10pt,legalpaper]{article} \usepackage{amssymb} \usepackage{amsfonts} \usepackage{amsmath} \pagestyle{empty} \begin{document} $\mathrm{{\chi}}_{(k{-}1)(m{-}1)}^{2}$ \end{document}$. One advantage of this statistic is that it can be interpreted as the total correlation between a pair of loci. When the phase of two-locus genotypes is known, the approach is equivalent to a test for the overall correlation between rows and columns in a contingency table. In the phase-known case, R2 is the sum of the squared sample correlations for all km 2 × 2 subtables formed by collapsing to one allele vs. the rest at each locus. We examine the approximate distribution under the null of independence for R2 and report its close agreement with the exact distribution obtained by permutation. The test for independence using R2 is a strong competitor to approaches such as Pearson's chi square, Fisher's exact test, and a test based on Cressie and Read's power divergence statistic. We combine this approach with our previous composite-disequilibrium measures to address the case when the genotypic phase is unknown. Calculation of the new multiallele test statistic and its P-value is very simple and utilizes the approximate distribution of R2. We provide a computer program that evaluates approximate as well as “exact” permutational P-values.

Publisher

Oxford University Press (OUP)

Subject

Genetics

Link

https://academic.oup.com/genetics/article-pdf/180/1/533/42094297/genetics0533.pdf

Reference40 articles.

1. Monte Carlo Evaluation of Resampling-Based Hypothesis Tests

2. Some Theorems on Quadratic Forms Applied in the Study of Analysis of Variance Problems, I. Effect of Inequality of Variance in the One-Way Classification

Cited by 63 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Genetic parallelism between European flat oyster populations at the edge of their natural range;Evolutionary Applications;2022-08-06

2. Population Structure of German Cockroaches (Blattodea: Ectobiidae) in an Urban Environment Based on Single Nucleotide Polymorphisms;Journal of Medical Entomology;2022-04-24

3. psBLUP: incorporating marker proximity for improving genomic prediction accuracy;Euphytica;2022-04-08

4. Extensive Recombination Suppression and Epistatic Selection Causes Chromosome-Wide Differentiation of a Selfish Sex Chromosome in Drosophila pseudoobscura;Genetics;2020-09-01

5. Insights into herpesvirus assembly from the structure of the pUL7:pUL51 complex;eLife;2020-05-11