Abstract
AbstractAncient environmental DNA (aeDNA) is becoming a powerful tool to gain insights about past ecosystems. However, several methodological challenges remain, particularly for classifying the DNA to species level and conducting phylogenetic placement. Current methods, primarily tailored for modern datasets, fail to capture several idiosyncrasies of aeDNA, including species mixtures from closely related species and ancestral divergence. We introducesoibean, a novel tool that utilises pangenomic graphs for identifying species from ancient environmental mitochondrial reads. It outperforms existing methods in accurately identifying species from multiple sources within a sample, enhancing phylogenetic analysis for aeDNA.soibeanemploys a damage-aware likelihood model for precise identification at low-coverage with high damage rate, demonstrating effectiveness through simulated data tests and empirical validation. Notably, our method uncovered new empirical results in published datasets, including using porpoise whales as food in a Mesolithic community in Sweden, demonstrating its potential to reveal previously unrecognised findings in aeDNA studies.
Publisher
Cold Spring Harbor Laboratory
Reference71 articles.
1. Ashkenazy, H. , et al. “FastML: a web server for probabilistic reconstruction of ancestral sequences”. In: Nucleic acids research 40.W1 (2012), W580–W584.
2. Lowest common ancestors in trees and directed acyclic graphs;In: Journal of Algorithms,2005
3. Bouckaert, R. , et al. “BEAST 2.5: An advanced software platform for Bayesian evolutionary analysis”. In: PLoS computational biology 15.4 (2019), e1006650.
4. “Patterns of damage in genomic DNA sequences from a Neandertal”;In: Proceedings of the National Academy of Sciences,2007
5. General methods for monitoring convergence of iterative simulations;In: Journal of computational and graphical statistics,1998