<tt>kakapo</tt>: easy extraction and annotation of genes from raw RNA-seq reads-Reference-Cited by-同舟云学术

kakapo: easy extraction and annotation of genes from raw RNA-seq reads

Published:2023-11-27 Issue: Volume:11 Page:e16456
ISSN:2167-8359
Container-title:PeerJ
language:en
Short-container-title:

Author:

Ramanauskas Karolis¹,Igić Boris¹

Affiliation:

1. Department of Biological Sciences, University of Illinois at Chicago, Chicago, IL, United States of America

Abstract

kakapo (kākāpō) is a Python-based pipeline that allows users to extract and assemble one or more specified genes or gene families. It flexibly uses original RNA-seq read or GenBank SRA accession inputs without performing global assembly of entire transcriptomes or metatranscriptomes. The pipeline identifies open reading frames in the assembled gene transcripts and annotates them. It optionally filters raw reads for ribosomal, plastid, and mitochondrial reads, or reads belonging to non-target organisms (e.g., viral, bacterial, human). kakapo can be employed for targeted assembly, to extract arbitrary loci, such as those commonly used for phylogenetic inference in systematics or candidate genes and gene families in phylogenomic and metagenomic studies. We provide example applications and discuss how its use can offset the declining value of GenBank’s single-gene databases and help assemble datasets for a variety of phylogenetic analyses.

Funder

National Science Foundation

Publisher

PeerJ

Subject

General Agricultural and Biological Sciences,General Biochemistry, Genetics and Molecular Biology,General Medicine,General Neuroscience

Link

https://peerj.com/articles/16456.pdf

Reference33 articles.

1. Basic local alignment search tool;Altschul;Journal of Molecular Biology,1990

2. The Pfam protein families database;Bateman;Nucleic Acids Research,2002

3. Trimmomatic: a flexible trimmer for Illumina sequence data;Bolger;Bioinformatics,2014

4. Uncovering novel MHC alleles from RNA-Seq data: expanding the spectrum of MHC class I alleles in sheep;Buitkamp;BMC Genomic Data,2023

5. Gastrogenomic delights: a movable feast;Eisen;Nature Medicine,1997

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Necrotizing Toxin PromotesPseudomonas syringaeInfection Across Evolutionarily Divergent Plant Lineages;2024-07-19

2. Transcriptome data from silica-preserved leaf tissue reveal gene flow patterns in a Caribbean bromeliad;Annals of Botany;2024-01-05