Bcheck: a wrapper tool for detecting RNase P RNA genes
-
Published:2010-07-13
Issue:1
Volume:11
Page:
-
ISSN:1471-2164
-
Container-title:BMC Genomics
-
language:en
-
Short-container-title:BMC Genomics
Author:
Yusuf Dilmurat,Marz Manja,Stadler Peter F,Hofacker Ivo L
Abstract
Abstract
Background
Effective bioinformatics solutions are needed to tackle challenges posed by industrial-scale genome annotation. We present , a wrapper tool which predicts RNase P RNA genes by combining the speed of pattern matching and sensitivity of covariance models. The core of is a library of subfamily specific descriptor models and covariance models.
Results
Scanning all microbial genomes in GenBank identifies RNase P RNA genes in 98% of 1024 microbial chromosomal sequences within just 4 hours on single CPU. Comparing to existing annotations found in 387 of the GenBank files, predictions have more intact structure and are automatically classified by subfamily membership. For eukaryotic chromosomes could identify the known RNase P RNA genes in 84 out of 85 metazoan genomes and 19 out of 21 fungi genomes. predicted 37 novel eukaryotic RNase P RNA genes, 32 of which are from fungi. Gene duplication events are observed in at least 20 metazoan organisms. Scanning of meta-genomic data from the Global Ocean Sampling Expedition, comprising over 10 million sample sequences (18 Gigabases), predicted 2909 unique genes, 98% of which fall into ancestral bacteria A type of RNase P RNA and 66% of which have no close homolog to known prokaryotic RNase P RNA.
Conclusions
The combination of efficient filtering by means of a descriptor-based search and subsequent construction of a high-quality gene model by means of a covariance model provides an efficient method for the detection of RNase P RNA genes in large-scale sequencing data.
is implemented as webserver and can also be downloaded for local use from http://rna.tbi.univie.ac.at/bcheck
Publisher
Springer Science and Business Media LLC
Subject
Genetics,Biotechnology
Reference48 articles.
1. Rusch DB, Halpern AL, Sutton G, Heidelberg KB, Williamson S, Yooseph S, Wu D, Eisen JA, Hoffman JM, Remington K, Beeson K, Tran B, Smith H, Baden-Tillson H, Stewart C, Thorpe J, Freeman J, Andrews-Pfannkoch C, Venter JE, Li K, Kravitz S, Heidelberg JF, Utterback T, Rogers YH, Falcón LI, Souza V, Bonilla-Rosso G, Eguiarte LE, Karl DM, Sathyendranath S, Platt T, Bermingham E, Gallardo V, Tamayo-Castillo G, Ferrari MR, Strausberg RL, Nealson K, Friedman R, Frazier M, Venter JC: The Sorcerer II Global Ocean Sampling expedition: northwest Atlantic through eastern tropical Pacific. PLoS Biol. 2007, 5: e77-10.1371/journal.pbio.0050077. 2. Rust AG, Mongin E, Birney E: Genome annotation techniques: new approaches and challenges. Drug Discov Today. 2002, 7: S70-76. 10.1016/S1359-6446(02)02289-4. 3. Eddy SR: Non-coding RNA genes and the modern RNA world. Nat Rev Genet. 2001, 2: 919-929. 10.1038/35103511. 4. The Athanasius F Bompfünewerer RNA Consortium, Backofen R, Flamm C, Fried C, Fritzsch G, Hackermüller J, Hertel J, Hofacker IL, Missal K, Axel Prohaska Mosig, Rose D, Stadler PF, Tanzer A, Washietl S, Sebastian W: RNAs Everywhere: Genome-Wide Annotation of Structured RNAs. J Exp Zool B: Mol Dev Evol. 2007, 308B: 1-25. 10.1002/jez.b.21130. 5. Hertel J, de Jong D, Marz M, Rose D, Tafer H, Tanzer A, Schierwater B, Stadler PF: Non-Coding RNA Annotation of the Genome of Trichoplax adhaerens. Nucleic Acids Res. 2009, 37: 1602-1615. 10.1093/nar/gkn1084.
Cited by
24 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|