A Comparison of Deep Learning Architectures for Inferring Parameters of Diversification Models from Extant Phylogenies

Published: 2023-03-06

Author:

Lajaaiti Ismaël, Lambert Sophia, Voznica Jakub, Morlon Hélène, Hartig Florian

Abstract

To infer the processes that gave rise to past speciation and extinction rates across taxa, space and time, we often formulate hypotheses in the form of stochastic diversification models and estimate their parameters from extant phylogenies using Maximum Likelihood or Bayesian inference. However, likelihoods can easily become intractable, limiting our ability to consider more complicated diversification processes. Recently, it has been proposed that deep learning (DL) could be used in this case as a likelihood-free inference technique. Here, we explore this idea in more detail, with a particular focus on understanding the ideal network architecture and data representation for using DL in phylogenetic inference. We evaluate the performance of different neural network architectures (DNN, CNN, RNN, GNN) and phylogeny representations (summary statistics, Lineage Through Time or LTT, phylogeny encoding and phylogeny graph) for inferring rates of the Constant Rate Birth-Death (CRBD) and the Binary State Speciation and Extinction (BiSSE) models. We find that deep learning methods can reach similar or even higher accuracy than Maximum Likelihood Estimation, provided that network architectures and phylogeny representations are appropriately tuned to the respective model. For example, for the CRBD model we find that CNNs and RNNs fed with LTTs outperform other combinations of network architecture and phylogeny representation, presumably because the LTT is a sufficient and therefore less redundant statistic for homogeneous BD models. For the more complex BiSSE model, however, it was necessary to feed the network with both topology and tip-state information to reach acceptable performance. Overall, our results suggest that deep learning provides a promising alternative for phylogenetic inference, but that data representation and architecture have strong effects on inferential performance.
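
To make the LTT representation mentioned in the abstract concrete, here is a minimal Python sketch of how a lineages-through-time curve can be derived from the internal node ages of an ultrametric, binary extant phylogeny. The function name `lineages_through_time` and the input format (a plain list of node ages, root included) are assumptions made for illustration; this is not the authors' encoding or pipeline.

```python
def lineages_through_time(node_ages):
    """Return (times, counts) describing an LTT curve.

    node_ages: ages (time before present) of all internal nodes of an
    ultrametric binary tree, including the root. This input format is
    an assumption for this sketch.
    """
    if not node_ages:
        raise ValueError("need at least the root age")
    # Oldest node (the root) first, so events run forward in time.
    ages = sorted(node_ages, reverse=True)
    crown_age = ages[0]
    # Time elapsed since the root split for each branching event.
    times = [crown_age - age for age in ages]
    # The root split yields 2 lineages; each later split adds one.
    counts = [i + 2 for i in range(len(ages))]
    return times, counts


# Hypothetical 4-tip tree with internal node ages 3.0 (root), 2.0 and 1.0.
times, counts = lineages_through_time([3.0, 1.0, 2.0])
print(times, counts)
```

A vector of this kind would still need to be padded or rescaled to a fixed length before being passed to a CNN or RNN; how that is done is a design choice not specified in the abstract.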

Publisher

Cold Spring Harbor Laboratory

Cited by 8 articles.

1. Recent evolutionary origin and localized diversity hotspots of mammalian coronaviruses;eLife;2024-08-28

2. Performance and Robustness of Parameter Estimation from Phylogenetic Trees Using Neural Networks;2024-08-06

3. Recent evolutionary origin and localized diversity hotspots of mammalian coronaviruses;2024-07-24

4. Applications of machine learning in phylogenetics;Molecular Phylogenetics and Evolution;2024-07