Abstract
AbstractA fundamental goal in population genetics is to understand how variation is arrayed over natural landscapes. From first principles we know that common features such as heterogeneous population densities and barriers to dispersal should shape genetic variation over space, however there are few tools currently available that can deal with these ubiquitous complexities. Geographically referenced single nucleotide polymorphism (SNP) data are increasingly accessible, presenting an opportunity to study genetic variation across geographic space in myriad species. We present a new inference method that uses geo-referenced SNPs and a deep neural network to estimate spatially heterogeneous maps of population density and dispersal rate. Our neural network trains on simulated input and output pairings, where the input consists of genotypes and sampling locations generated from a continuous space population genetic simulator, and the output is a map of the true demographic parameters. We benchmark our tool against existing methods and discuss qualitative differences between the different approaches; in particular, our program is unique because it infers the magnitude of both dispersal and density as well as their variation over the landscape, and it does so using SNP data. Similar methods are constrained to estimating relative migration rates, or require identity by descent blocks as input. We applied our tool to empirical data from North American grey wolves, for which it estimated mostly reasonable demographic parameters, but was affected by incomplete spatial sampling. Genetic based methods like ours complement other, direct methods for estimating past and present demography, and we believe will serve as valuable tools for applications in conservation, ecology, and evolutionary biology. An open source software package implementing our method is available fromhttps://github.com/kr-colab/mapNN.
Publisher
Cold Spring Harbor Laboratory
Reference50 articles.
1. Martín Abadi , Ashish Agarwal , Paul Barham , Eugene Brevdo , Zhifeng Chen , Craig Citro , Greg S Corrado , Andy Davis , Jeffrey Dean , Matthieu Devin , et al. Tensorflow: Large-scale machine learning on heterogeneous distributed systems. arXiv preprint arXiv:1603.04467, 2016.
2. Clare IM Adams , Michael Knapp , Neil J Gemmell , Gert-Jan Jeunen , Michael Bunce , Miles D Lamare , and Helen R Taylor . Beyond biodiversity: Can environmental dna (edna) cut it as a population genetics tool? Genes, 10(3):192, 2019.
3. Predicting the landscape of recombination using deep learning;Molecular biology and evolution,2020
4. Estimating recent migration and population-size surfaces;PLoS genetics,2019
5. Kara J Andres , David M Lodge , Suresh A Sethi , and Jose Andrés . Detecting and analysing intraspecific genetic variation with edna: From population genetics to species abundance. Molecular Ecology, 2023.