Bayesian Phylogenetic Analysis on Multi-Core Compute Architectures: Implementation and Evaluation of BEAGLE in RevBayes With MPI

Author:

Smith Killian12,Ayres Daniel3,Neumaier René1,Wörheide Gert12,Höhna Sebastian12ORCID

Affiliation:

1. GeoBio-Center, Ludwig-Maximilians-Universität München , Richard-Wagner Straße 10, 80333 Munich , Germany

2. Department of Earth and Environmental Sciences, Paleontology & Geobiology, Ludwig-Maximilians-Universität München , Richard-Wagner Straße 10, 80333 Munich , Germany

3. Center for Bioinformatics and Computational Biology, University of Maryland , College Park, MD 20742 , USA

Abstract

Abstract Phylogenies are central to many research areas in biology and commonly estimated using likelihood-based methods. Unfortunately, any likelihood-based method, including Bayesian inference, can be restrictively slow for large datasets—with many taxa and/or many sites in the sequence alignment—or complex substitutions models. The primary limiting factor when using large datasets and/or complex models in probabilistic phylogenetic analyses is the likelihood calculation, which dominates the total computation time. To address this bottleneck, we incorporated the high-performance phylogenetic library BEAGLE into RevBayes, which enables multi-threading on multi-core CPUs and GPUs, as well as hardware specific vectorized instructions for faster likelihood calculations. Our new implementation of RevBayes+BEAGLE retains the flexibility and dynamic nature that users expect from vanilla RevBayes. In addition, we implemented native parallelization within RevBayes without an external library using the message passing interface (MPI); RevBayes+MPI. We evaluated our new implementation of RevBayes+BEAGLE using multi-threading on CPUs and 2 different powerful GPUs (NVidia Titan V and NVIDIA A100) against our native implementation of RevBayes+MPI. We found good improvements in speedup when multiple cores were used, with up to 20-fold speedup when using multiple CPU cores and over 90-fold speedup when using multiple GPU cores. The improvement depended on the data type used, DNA or amino acids, and the size of the alignment, but less on the size of the tree. We additionally investigated the cost of rescaling partial likelihoods to avoid numerical underflow and showed that unnecessarily frequent and inefficient rescaling can increase runtimes up to 4-fold. Finally, we presented and compared a new approach to store partial likelihoods on branches instead of nodes that can speed up computations up to 1.7 times but comes at twice the memory requirements.

Funder

Deutsche Forschungsgemeinschaft

Publisher

Oxford University Press (OUP)

Reference45 articles.

1. BEAGLE 3: improved performance, scaling, and usability for a high-performance computing library for statistical phylogenetics;Ayres;Syst. Biol,2019

2. Beagle: an application programming interface and high-performance computing library for statistical phylogenetics;Ayres;Syst. Biol,2012

3. Markov-modulated continuous-time Markov chains to identify site-and branch-specific evolutionary variation in BEAST;Baele;Syst. Biol,2021

4. Accuracy and performance of single versus double precision arithmetics for maximum likelihood phylogeny reconstruction;Berger,2010

5. Polymorphism-aware estimation of species trees and evolutionary forces from genomic sequences with RevBayes. Meth;Borges;Ecol. Evol,2022

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. CMAPLE: Efficient Phylogenetic Inference in the Pandemic Era;Molecular Biology and Evolution;2024-06-27

2. Sequential Bayesian Phylogenetic Inference;Systematic Biology;2024-05-21

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3