Author:
Kong Sungsik,Swofford David L.,Kubatko Laura S.
Abstract
AbstractWhile phylogenies have been essential in understanding how species evolve, they do not adequately describe some evolutionary processes. For instance, hybridization, a common phenomenon where interbreeding between two species leads to formation of a new species, must be depicted by a phylogenetic network, a structure that modifies a phylogeny by allowing two branches to merge into one, resulting in reticulation. However, existing methods for estimating networks are computationally expensive as the dataset size and/or topological complexity increase. The lack of methods for scalable inference hampers phylogenetic networks from being widely used in practice, despite accumulating evidence that hybridization occurs frequently in nature. Here, we propose a novel method, PhyNEST (Phylogenetic Network Estimation using SiTe patterns), that estimates phylogenetic networks directly from sequence data. PhyNEST achieves computational efficiency by using composite likelihood as well as accuracy by using the full genomic data to incorporate all sources of variability, rather than first summarizing the data by estimating a set of gene trees, as is required by most of the existing methods. To efficiently search network space, we implement both hill-climbing and simulated annealing algorithms. Simulation studies show that PhyNEST can accurately estimate parameters given the true network topology and that it has comparable accuracy to two popular methods that use composite likelihood and a set of gene trees as input, implemented in SNaQ and PhyloNet. For datasets with a large number of loci, PhyNEST is more efficient than SNaQ and PhyloNet when considering the time required for gene tree estimation. We applied PhyNEST to reconstruct the evolutionary relationships amongHeliconiusbutterflies and Papionini primates, characterized by hybrid speciation and widespread introgression, respectively. PhyNEST is implemented in an open-source Julia package and publicly available athttps://github.com/sungsik-kong/PhyNEST.jl.
Publisher
Cold Spring Harbor Laboratory
Reference115 articles.
1. Aarts, E. H. L. and J. Korst . 1989. Simulated Annealing and Boltzmann Machines: A Stochastic Approach to Combinatorial Optimization and Neural Computing. Wiley-Interscience Series in Discrete Mathematics and Optimization Wiley, Chichester [England]; New York.
2. Model selection may not be a mandatory step for phylogeny reconstruction;Nature Communications,2019
3. Hybridization and speciation
4. NANUQ: A method for inferring species networks from gene trees under the coalescent model;Algorithms for Molecular Biology,2019
5. INTROGRESSIVE HYBRIDIZATION
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献