A Comparison of Deep Learning Architectures for Inferring Parameters of Diversification Models from Extant Phylogenies

Author:

Lajaaiti IsmaëlORCID,Lambert SophiaORCID,Voznica JakubORCID,Morlon HélèneORCID,Hartig FlorianORCID

Abstract

AbstractTo infer the processes that gave rise to past speciation and extinction rates across taxa, space and time, we often formulate hypotheses in the form of stochastic diversification models and estimate their parameters from extant phylogenies using Maximum Likelihood or Bayesian inference. Unfortunately, however, likelihoods can easily become intractable, limiting our ability to consider more complicated diversification processes. Recently, it has been proposed that deep learning (DL) could be used in this case as a likelihood-free inference technique. Here, we explore this idea in more detail, with a particular focus on understanding the ideal network architecture and data representation for using DL in phylogenetic inference. We evaluate the performance of different neural network architectures (DNN, CNN, RNN, GNN) and phylogeny representations (summary statistics, Lineage Through Time or LTT, phylogeny encoding and phylogeny graph) for inferring rates of the Constant Rate Birth-Death (CRBD) and the Binary State Speciation and Extinction (BISSE) models. We find that deep learning methods can reach similar or even higher accuracy than Maximum Likelihood Estimation, provided that network architectures and phylogeny representations are appropriately tuned to the respective model. For example, for the CRBD model we find that CNNs and RNNs fed with LTTs outperform other combinations of network architecture and phylogeny representation, presumably because the LTT is a sufficient and therefore less redundant statistic for homogenous BD models. For the more complex BiSSE model, however, it was necessary to feed the network with both topology and tip states information to reach acceptable performance. Overall, our results suggest that deep learning provides a promising alternative for phylogenetic inference, but that data representation and architecture have strong effects on the inferential performance.

Publisher

Cold Spring Harbor Laboratory

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3