Fast and accurate population admixture inference from genotype data from a few microsatellites to millions of SNPs-Reference-Cited by-同舟云学术

Fast and accurate population admixture inference from genotype data from a few microsatellites to millions of SNPs

Published:2022-05-04 Issue:2 Volume:129 Page:79-92
ISSN:0018-067X
Container-title:Heredity
language:en
Short-container-title:Heredity

Author:

Wang Jinliang

Abstract

AbstractModel-based (likelihood and Bayesian) and non-model-based (PCA and K-means clustering) methods were developed to identify populations and assign individuals to the identified populations using marker genotype data. Model-based methods are favoured because they are based on a probabilistic model of population genetics with biologically meaningful parameters and thus produce results that are easily interpretable and applicable. Furthermore, they often yield more accurate structure inferences than non-model-based methods. However, current model-based methods either are computationally demanding and thus applicable to small problems only or use simplified admixture models that could yield inaccurate results in difficult situations such as unbalanced sampling. In this study, I propose new likelihood methods for fast and accurate population admixture inference using genotype data from a few multiallelic microsatellites to millions of diallelic SNPs. The methods conduct first a clustering analysis of coarse-grained population structure by using the mixture model and the simulated annealing algorithm, and then an admixture analysis of fine-grained population structure by using the clustering results as a starting point in an expectation maximisation algorithm. Extensive analyses of both simulated and empirical data show that the new methods compare favourably with existing methods in both accuracy and running speed. They can analyse small datasets with just a few multiallelic microsatellites but can also handle in parallel terabytes of data with millions of markers and millions of individuals. In difficult situations such as many and/or lowly differentiated populations, unbalanced or very small samples of individuals, the new methods are substantially more accurate than other methods.

Publisher

Springer Science and Business Media LLC

Subject

Genetics (clinical),Genetics

Link

https://www.nature.com/articles/s41437-022-00535-z.pdf

Reference45 articles.

1. Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE, Kang HM, Marth GT, McVean GA (2012) An integrated map of genetic variation from 1092 human genomes. Nature 491:56–65

2. Alexander DH, Novembre J, Lange K (2009) Fast model-based estimation of ancestry in unrelated individuals. Genome Res 19:1655–1664

3. Bose A, Kalantzis V, Kontopoulou EM, Elkady M, Paschou P, Drineas P (2019) TeraPCA: a fast and scalable software package to study genetic variation in tera-scale genotypes. Bioinformatics 35:3679–3683

4. Bryc K, Durand EY, Macpherson JM, Reich D, Mountain JL (2015) The genetic ancestry of African Americans, Latinos, and European Americans across the United States. Am J Hum Genet 96:37–53

5. Corander J, Waldmann P, Sillanpää MJ (2003) Bayesian analysis of genetic differentiation between populations. Genetics 163:367–374

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Genome-wide SNP analysis coupled with geographic and reproductive-phenological information reveals panmixia in a classical marine species, the Japanese jack mackerel (Trachurus japonicus);Fisheries Research;2024-11

2. Isolation, small population size, and management influence inbreeding and reduced genetic variation in K’gari dingoes;Conservation Genetics;2024-04-19

3. Inferring Ancestry with the Hierarchical Soft Clustering Approach tangleGen;2024-03-29

4. High inter-population connectivity and occasional gene flow between subspecies improves recovery potential for the endangered Least Bell’s Vireo;Ornithological Applications;2024-02-26

5. Genetic structuring and species boundaries in the Atlantic stony coral Favia (Scleractinia, Faviidae);Zoologica Scripta;2024-01-29