Impact of Phylogenetic Tree Completeness and Mis-specification of Sampling Fractions on Trait Dependent Diversification Models

Author:

Mynard Poppy1ORCID,Algar Adam C2,Lancaster Lesley T1,Bocedi Greta1,Fahri Fahri3,Gubry-Rangin Cécile1,Lupiyaningdyah Pungki4,Nangoy Meis5,Osborne Owen G6ORCID,Papadopulos Alexander S T6,Sudiana I Made7,Juliandi Berry8,Travis Justin M J1,Herrera-Alsina Leonel1ORCID

Affiliation:

1. School of Biological Sciences, University of Aberdeen , Aberdeen , UK

2. Department of Biology, Lakehead University , Thunder Bay, ON , Canada

3. Department of Biology, Tadulako University , Palu , Indonesia

4. Museum Zoologicum Bogoriense, Research Center for Biology, Indonesian Institute of Sciences , Cibinong 16911 , Indonesia

5. Faculty of Animal Husbandry, Sam Ratulangi University , Kampus Bahu Street, Manado 95115 , Indonesia

6. School of Natural Sciences, Bangor University , Environment Centre Wales, Deiniol Road, Bangor LL57 2UW , UK

7. Microbial Ecology Research Group, Research Center for Biology, Indonesian Institute of Sciences , Cibinong 19611 , Indonesia

8. Department of Biology, Faculty of Mathematics and Natural Sciences, IPB University , Bogor, 16680 , Indonesia

Abstract

Abstract Understanding the origins of diversity and the factors that drive some clades to be more diverse than others are important issues in evolutionary biology. Sophisticated SSE (state-dependent speciation and extinction) models provide insights into the association between diversification rates and the evolution of a trait. The empirical data used in SSE models and other methods is normally imperfect, yet little is known about how this can affect these models. Here, we evaluate the impact of common phylogenetic issues on inferences drawn from SSE models. Using simulated phylogenetic trees and trait information, we fitted SSE models to determine the effects of sampling fraction (phylogenetic tree completeness) and sampling fraction mis-specification on model selection and parameter estimation (speciation, extinction, and transition rates) under two sampling regimes (random and taxonomically biased). As expected, we found that both model selection and parameter estimate accuracies are reduced at lower sampling fractions (i.e., low tree completeness). Furthermore, when sampling of the tree is imbalanced across sub-clades and tree completeness is ≤ 60%, rates of false positives increase and parameter estimates are less accurate, compared to when sampling is random. Thus, when applying SSE methods to empirical datasets, there are increased risks of false inferences of trait dependent diversification when some sub-clades are heavily under-sampled. Mis-specifying the sampling fraction severely affected the accuracy of parameter estimates: parameter values were over-estimated when the sampling fraction was specified as lower than its true value, and under-estimated when the sampling fraction was specified as higher than its true value. Our results suggest that it is better to cautiously under-estimate sampling efforts, as false positives increased when the sampling fraction was over-estimated. We encourage SSE studies where the sampling fraction can be reasonably estimated and provide recommended best practices for SSE modeling. [Trait dependent diversification; SSE models; phylogenetic tree completeness; sampling fraction.]

Funder

Newton Fund

Publisher

Oxford University Press (OUP)

Subject

Genetics,Ecology, Evolution, Behavior and Systematics

Reference71 articles.

1. Remarkable archaeal diversity detected in a Yellowstone National Park hot spring environment (archaebacteria/phylogeny/thermophfly/molecular ecology);Barns;Proc. Nati. Acad. Sci. USA,1994

2. The problem with clade-specific sampling fractions;Beaulieu,2020

3. Detecting hidden diversification shifts in models of trait-dependent speciation and extinction;Beaulieu;Syst. Biol,2016

4. Smoothing parameter selection in nonparametric regression using an improved kullback information criterion. Proceedings—8th International Symposium on Signal Processing and its Applications, ISSPA;Bekara,2005

5. On statistical tests of phylogenetic tree imbalance: The Sackin and other indices revisited;Blum;Math. Biosci,2005

Cited by 5 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3