LocARNAscan: Incorporating thermodynamic stability in sequence and structure-based RNA homology search-Reference-Cited by-同舟云学术

LocARNAscan: Incorporating thermodynamic stability in sequence and structure-based RNA homology search

Published:2013-04-20 Issue:1 Volume:8 Page:
ISSN:1748-7188
Container-title:Algorithms for Molecular Biology
language:en
Short-container-title:Algorithms Mol Biol

Author:

Will Sebastian,Siebauer Michael F,Heyne Steffen,Engelhardt Jan,Stadler Peter F,Reiche Kristin,Backofen Rolf

Abstract

Abstract Background The search for distant homologs has become an import issue in genome annotation. A particular difficulty is posed by divergent homologs that have lost recognizable sequence similarity. This same problem also arises in the recognition of novel members of large classes of RNAs such as snoRNAs or microRNAs that consist of families unrelated by common descent. Current homology search tools for structured RNAs are either based entirely on sequence similarity (such as or ) or combine sequence and secondary structure. The most prominent example of the latter class of tools is . Alternatives are descriptor-based methods. In most practical applications published to-date, however, the information contained in covariance models or manually prescribed search patterns is dominated by sequence information. Here we ask two related questions: (1) Is secondary structure alone informative for homology search and the detection of novel members of RNA classes? (2) To what extent is the thermodynamic propensity of the target sequence to fold into the correct secondary structure helpful for this task? Results Sequence-structure alignment can be used as an alternative search strategy. In this scenario, the query consists of a base pairing probability matrix, which can be derived either from a single sequence or from a multiple alignment representing a set of known representatives. Sequence information can be optionally added to the query. The target sequence is pre-processed to obtain local base pairing probabilities. As a search engine we devised a semi-global scanning variant of ’s algorithm for sequence-structure alignment. The tool is optimized for speed and low memory consumption. In benchmarking experiments on artificial data we observe that the inclusion of thermodynamic stability is helpful, albeit only in a regime of extremely low sequence information in the query. We observe, furthermore, that the sensitivity is bounded in particular by the limited accuracy of the predicted local structures of the target sequence. Conclusions Although we demonstrate that a purely structure-based homology search is feasible in principle, it is unlikely to outperform tools such as in most application scenarios, where a substantial amount of sequence information is typically available. The approach will profit, however, from high throughput methods to determine RNA secondary structure. In transcriptome-wide applications, such methods will provide accurate structure annotations on the target side. Availability Source code of the free software 1.0 and supplementary data are available athttp://www.bioinf.uni-leipzig.de/Software/LocARNAscan.

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Computational Theory and Mathematics,Molecular Biology,Structural Biology

Link

https://link.springer.com/content/pdf/10.1186/1748-7188-8-14.pdf

Reference50 articles.

1. Berretta J, Morillon A: Pervasive transcription constitutes a new level of eukaryotic genome regulation. EMBO Rep. 2009, 10: 973-982.

2. Ponjavic J, Ponting CP, Lunter G: Functionality or transcriptional noise? Evidence for selection within long noncoding RNAs. Genome Res. 2007, 17: 556-565.

3. Pheasant M, Mattick JS: Raising the estimate of functional human sequences. Genome Res. 2007, 17: 1245-1253.

4. Ponting CP, Hardison RC: What fraction of the human genome is functional?. Genome Res. 2011, 21: 1769-1776.

5. Menzel P, Gorodkin J, Stadler PF: The tedious task of finding homologous non-coding RNA genes. RNA. 2009, 15: 2075-2082.

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A database of flavivirus RNA structures with a search algorithm for pseudoknots and triple base interactions;Bioinformatics;2020-08-31

2. GraphClust2: Annotation and discovery of structured RNAs with scalable and accessible integrative clustering;GigaScience;2019-12-01

3. Empowering the annotation and discovery of structured RNAs with scalable and accessible integrative clustering;2019-02-20

4. Partially Local Multi-way Alignments;Mathematics in Computer Science;2018-03-19

5. PATTERNA: transcriptome-wide search for functional RNA elements via structural data signatures;Genome Biology;2018-03-01