Haplotype based testing for a better understanding of the selective architecture

Published: 2023-08-26 Issue: 1 Volume: 24
ISSN: 1471-2105
Container-title: BMC Bioinformatics
Language: en
Short-container-title: BMC Bioinformatics

Author:

Chen Haoyu, Pelizzola Marta, Futschik Andreas

Abstract

Background: The identification of genomic regions affected by selection is one of the most important goals in population genetics. If temporal data are available, allele frequency changes at SNP positions are often used for this purpose. Here we provide a new testing approach that uses haplotype frequencies instead of allele frequencies.

Results: Using simulated data, we show that, compared to SNP-based tests, our approach has higher power, especially when the number of candidate haplotypes is small or moderate. To improve power when the number of haplotypes is large, we investigate methods to combine them into a moderate number of haplotype subsets. Haplotype frequencies can often be recovered with less noise than SNP frequencies, especially under pool sequencing, giving our test an additional advantage. Furthermore, spurious outlier SNPs may lead to false positives, a problem usually not encountered when working with haplotypes. Post hoc tests for the number of selected haplotypes and for differences between their selection coefficients are also provided for a better understanding of the underlying selection dynamics. An application to a real data set further illustrates the performance benefits.

Conclusions: Because fewer tests require multiple testing correction and haplotype frequencies are less noisy, haplotype-based testing can outperform SNP-based tests in terms of power in most scenarios.
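The multiple-testing argument in the abstract can be illustrated with a small sketch. It is not the authors' test: it uses a plain Pearson chi-square statistic on haplotype counts at two time points, which ignores genetic drift and pool-sequencing noise, and a Bonferroni correction as a stand-in for whatever correction the paper applies. The function names and all counts below are hypothetical.

```python
from typing import Sequence


def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance level under a Bonferroni correction."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return alpha / n_tests


def chi_square_homogeneity(start: Sequence[int], end: Sequence[int]) -> float:
    """Pearson chi-square statistic for a 2 x k table of counts.

    Row 1 holds haplotype counts at the first time point, row 2 at the last.
    NOTE: a plain chi-square test assumes pure multinomial sampling; it is an
    assumption of this sketch, not the adapted test described in the paper.
    """
    if len(start) != len(end) or len(start) < 2:
        raise ValueError("need two count vectors of equal length >= 2")
    n_start, n_end = sum(start), sum(end)
    if n_start == 0 or n_end == 0:
        raise ValueError("each time point needs at least one observation")
    total = n_start + n_end
    stat = 0.0
    for a, b in zip(start, end):
        col = a + b
        if col == 0:
            continue  # haplotype unobserved at both time points
        exp_a = n_start * col / total  # expected count under homogeneity
        exp_b = n_end * col / total
        stat += (a - exp_a) ** 2 / exp_a + (b - exp_b) ** 2 / exp_b
    return stat


if __name__ == "__main__":
    # Hypothetical region: 500 SNPs versus 5 candidate haplotypes.
    print("per-SNP threshold:      ", bonferroni_threshold(0.05, 500))
    print("per-haplotype threshold:", bonferroni_threshold(0.05, 5))
    # Hypothetical haplotype counts at the first and last generation.
    print("chi-square statistic:   ",
          chi_square_homogeneity([40, 30, 20, 10], [70, 15, 10, 5]))
```

With far fewer hypotheses per region, the per-test threshold is much less strict, which is one of the two sources of power gain the abstract names. The statistic would be compared against a chi-square distribution with k - 1 degrees of freedom only under the sampling assumption stated in the comments.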

Funder

Austrian Science Fund

National Science Foundation

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics, Computer Science Applications, Molecular Biology, Biochemistry, Structural Biology

Link

https://link.springer.com/content/pdf/10.1186/s12859-023-05437-3.pdf

References (44 articles)

1. Turner TL, Stewart AD, Fields AT, Rice WR, Tarone AM. Population-based resequencing of experimentally evolved populations reveals the genetic basis of body size variation in Drosophila melanogaster. PLOS Genet. 2011;7(3):1–10. https://doi.org/10.1371/journal.pgen.1001336.

2. Griffin PC, Hangartner SB, Fournier-Level A, Hoffmann AA. Genomic trajectories to desiccation resistance: convergence and divergence among replicate selected Drosophila lines. Genetics. 2017;205(2):871–90. https://doi.org/10.1534/genetics.116.187104.

3. Spitzer K, Pelizzola M, Futschik A. Modifying the Chi-square and the CMH test for population genetic inference: adapting to overdispersion. Ann Appl Stat. 2020;14(1):202–20. https://doi.org/10.1214/19-AOAS1301.

4. Vlachos C, Burny C, Pelizzola M, Borges R, Futschik A, Kofler R, et al. Benchmarking software tools for detecting and quantifying selection in evolve and resequencing studies. Genome Biol. 2019. https://doi.org/10.1186/s13059-019-1770-8.

5. Kidd KK, Pakstis AJ. State of the art for microhaplotypes. Genes. 2022;13(8). https://www.mdpi.com/2073-4425/13/8/1322.