Affiliation:
1. Huck Institutes of Life Sciences, Pennsylvania State University, University Park, PA
2. Department of Biology, Pennsylvania State University, University Park, PA
3. Department of Computer and Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL
Abstract
Abstract
Long-term balancing selection typically leaves narrow footprints of increased genetic diversity, and therefore most detection approaches only achieve optimal performances when sufficiently small genomic regions (i.e., windows) are examined. Such methods are sensitive to window sizes and suffer substantial losses in power when windows are large. Here, we employ mixture models to construct a set of five composite likelihood ratio test statistics, which we collectively term B statistics. These statistics are agnostic to window sizes and can operate on diverse forms of input data. Through simulations, we show that they exhibit comparable power to the best-performing current methods, and retain substantially high power regardless of window sizes. They also display considerable robustness to high mutation rates and uneven recombination landscapes, as well as an array of other common confounding scenarios. Moreover, we applied a specific version of the B statistics, termed B2, to a human population-genomic data set and recovered many top candidates from prior studies, including the then-uncharacterized STPG2 and CCDC169–SOHLH2, both of which are related to gamete functions. We further applied B2 on a bonobo population-genomic data set. In addition to the MHC-DQ genes, we uncovered several novel candidate genes, such as KLRD1, involved in viral defense, and SCN9A, associated with pain perception. Finally, we show that our methods can be extended to account for multiallelic balancing selection and integrated the set of statistics into open-source software named BalLeRMix for future applications by the scientific community.
Funder
Pennsylvania State University
National Institutes of Health
National Science Foundation
NIH
Publisher
Oxford University Press (OUP)
Subject
Genetics,Molecular Biology,Ecology, Evolution, Behavior and Systematics
Reference158 articles.
1. A global reference for human genetic variation,2015
2. Targets of balancing selection in the human genome;Andrés;Mol Biol Evol,2009
3. Balancing selection maintains a form of ERAP2 that undergoes nonsense-mediated decay and affects antigen presentation;Andrés;PLoS Genet,2010
4. Molecular evolution of genes associated with preeclampsia: genetic conflict, antagonistic coevolution and signals of selection;Arthur;J Evol Med,2018
Cited by
24 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献