hybpiper-rbgv and yang-and-smith-rbgv: Containerization and additional options for assembly and paralog detection in target enrichment data-Reference-Cited by-同舟云学术

hybpiper-rbgv and yang-and-smith-rbgv: Containerization and additional options for assembly and paralog detection in target enrichment data

Published:2021-11-10 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Jackson Chris^ORCID,McLay Todd^ORCID,Schmidt-Lebuhn Alexander N.^ORCID

Abstract

ABSTRACTPREMISEThe HybPiper pipeline has become one of the most widely used tools for the assembly of target enrichment (sequence capture) data for phylogenomic analysis. Between the production of locus sequences and phylogenetic analysis, the identification of paralogs is a critical step ensuring accurate inference of evolutionary relationships. Algorithmic approaches using gene tree topologies for the inference of ortholog groups are computationally efficient and broadly applicable to non-model organisms, especially in the absence of a known species tree. Unfortunately, software compatibility issues, unfamiliarity with relevant programming languages, and the complexity involved in running numerous subsequent analysis steps continue to limit the broad uptake of these approaches and constrain their application in practice.METHODS AND RESULTSWe updated the scripts constituting HybPiper and a pipeline for the inference of ortholog groups (“Yang and Smith”) to provide novel options for the treatment of supercontigs, remove bugs, and seamlessly use the outputs of the former as inputs for the latter. The pipelines were containerised using Singularity and implemented via two Nextflow pipelines for easier deployment and to vastly reduce the number of commands required for their use. We tested the pipelines with several datasets, one of which is presented for demonstration.CONCLUSIONShybpiper-rbgv and yang-and-smith-rbgv provide easy installation, user-friendly experience, and robust results to the phylogenetic community. They are presently used as the analysis pipeline of the Australian Angiosperm Tree of Life project. The pipelines are available at https://github.com/chrisjackson-pellicle.

Publisher

Cold Spring Harbor Laboratory

Reference22 articles.

1. Standardized benchmarking in the quest for orthologs;Nature Methods,2016

2. Baker, W. J. , P. Bailey , V. Barber , A. Barker , S. Bellot , D. Bishop , L. R. Botigué , et al. 2021. A Comprehensive Phylogenomic Platform for Exploring the Angiosperm Tree of Life. Systematic Biology.

3. Ultraconserved Elements in the Human Genome

4. Trimmomatic: a flexible trimmer for Illumina sequence data

5. Breinholt, J. W. , S. B. Carey , G. P. Tiley , E. C. Davis , L. Endara , S. F. McDaniel , L. G. Neves , et al. 2020. A target enrichment probe set for resolving the flagellate plant tree of life. bioRxiv: 2020.05.29.124081.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Transfer of Cotula alpina to the genus Leptinella (Asteraceae: Anthemideae);Australian Systematic Botany;2024-01-12

2. Genetic data confirm the presence of Senecio madagascariensis in New Zealand;New Zealand Journal of Botany;2022-11-23

3. Sequence capture data support the taxonomy of;Australian Systematic Botany;2022-08-25