Redefining Possible: Combining Phylogenomic and Supersparse Data in Frogs

Author:

Portik Daniel M12ORCID,Streicher Jeffrey W3ORCID,Blackburn David C4ORCID,Moen Daniel S5ORCID,Hutter Carl R6ORCID,Wiens John J1ORCID

Affiliation:

1. Department of Ecology and Evolutionary Biology, University of Arizona , Tucson, AZ

2. Department of Herpetology, California Academy of Sciences , USA

3. Department of Life Sciences, The Natural History Museum , London , United Kingdom

4. Department of Natural History, Florida Museum of Natural History, University of Florida , Gainesville, FL

5. Department of Integrative Biology, 501 Life Sciences West, Oklahoma State University , Stillwater, OK

6. Museum of Natural Science and Department of Biological Sciences, Louisiana State University , Baton Rouge, LA

Abstract

Abstract The data available for reconstructing molecular phylogenies have become wildly disparate. Phylogenomic studies can generate data for thousands of genetic markers for dozens of species, but for hundreds of other taxa, data may be available from only a few genes. Can these two types of data be integrated to combine the advantages of both, addressing the relationships of hundreds of species with thousands of genes? Here, we show that this is possible, using data from frogs. We generated a phylogenomic data set for 138 ingroup species and 3,784 nuclear markers (ultraconserved elements [UCEs]), including new UCE data from 70 species. We also assembled a supermatrix data set, including data from 97% of frog genera (441 total), with 1–307 genes per taxon. We then produced a combined phylogenomic–supermatrix data set (a “gigamatrix”) containing 441 ingroup taxa and 4,091 markers but with 86% missing data overall. Likelihood analysis of the gigamatrix yielded a generally well-supported tree among families, largely consistent with trees from the phylogenomic data alone. All terminal taxa were placed in the expected families, even though 42.5% of these taxa each had >99.5% missing data and 70.2% had >90% missing data. Our results show that missing data need not be an impediment to successfully combining very large phylogenomic and supermatrix data sets, and they open the door to new studies that simultaneously maximize sampling of genes and taxa.

Publisher

Oxford University Press (OUP)

Subject

Genetics,Molecular Biology,Ecology, Evolution, Behavior and Systematics

Reference64 articles.

1. Pruning rogue taxa improves phylogenetic accuracy: an efficient algorithm and webservice;Aberer;Syst Biol,2013

2. A species-level timeline of mammalian evolution integrating phylogenomic data;Álvarez-Carretero;Nature,2021

3. Toward a self-updating platform for estimating rates of speciation and migration, ages, and relationships of taxa;Antonelli;Syst Biol,2017

4. Phylogeny of terraranan frogs based on 2,665 loci and impacts of missing data on phylogenomic analyses;Barrientos;System Biodivers,2021

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3