Phylogenetic diversity statistics for all clades in a phylogeny-Reference-Cited by-同舟云学术

Phylogenetic diversity statistics for all clades in a phylogeny

Published:2023-06-01 Issue:Supplement_1 Volume:39 Page:i177-i184
ISSN:1367-4803
Container-title:Bioinformatics
language:en
Short-container-title:

Author:

Grover Siddhant¹,Markin Alexey²,Anderson Tavis K²^ORCID,Eulenstein Oliver¹

Affiliation:

1. Department of Computer Science, Iowa State University , Ames, IA 50010, United States

2. Virus and Prion Research Unit, National Animal Disease Center, USDA-ARS , Ames, IA 50010, United States

Abstract

Abstract The classic quantitative measure of phylogenetic diversity (PD) has been used to address problems in conservation biology, microbial ecology, and evolutionary biology. PD is the minimum total length of the branches in a phylogeny required to cover a specified set of taxa on the phylogeny. A general goal in the application of PD has been identifying a set of taxa of size k that maximize PD on a given phylogeny; this has been mirrored in active research to develop efficient algorithms for the problem. Other descriptive statistics, such as the minimum PD, average PD, and standard deviation of PD, can provide invaluable insight into the distribution of PD across a phylogeny (relative to a fixed value of k). However, there has been limited or no research on computing these statistics, especially when required for each clade in a phylogeny, enabling direct comparisons of PD between clades. We introduce efficient algorithms for computing PD and the associated descriptive statistics for a given phylogeny and each of its clades. In simulation studies, we demonstrate the ability of our algorithms to analyze large-scale phylogenies with applications in ecology and evolutionary biology. The software is available at https://github.com/flu-crew/PD_stats.

Funder

Department of Agriculture

Agricultural Research Service

National Institute of Allergy and Infectious Diseases

National Institutes of Health

Department of Health and Human Services

USDA Agricultural Research Service

Publisher

Oxford University Press (OUP)

Subject

Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability

Link

https://academic.oup.com/bioinformatics/article-pdf/39/Supplement_1/i177/50741760/btad263.pdf

Reference26 articles.

1. Has the Earth’s sixth mass extinction already arrived?;Barnosky;Nature,2011

2. Budgeted nature reserve selection with diversity feature loss and arbitrary split systems;Bordewich;J Math Biol,2012

3. Phylogenetic diversity metrics for ecological communities: integrating species richness, abundance and evolutionary history;Cadotte;Ecol Lett,2010

4. Accelerated modern human–induced species losses: entering the sixth mass extinction;Ceballos;Sci Adv,2015

5. Biopython: freely available Python tools for computational molecular biology and bioinformatics;Cock;Bioinformatics,2009

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Development of MetaXplore: An Interactive Tool for Targeted Metagenomic Analysis;Current Issues in Molecular Biology;2024-05-15