Author:
Andronescu Mirela S.,Pop Cristina,Condon Anne E.
Abstract
Accurate prediction of RNA pseudoknotted secondary structures from the base sequence is a challenging computational problem. Since prediction algorithms rely on thermodynamic energy models to identify low-energy structures, prediction accuracy relies in large part on the quality of free energy change parameters. In this work, we use our earlier constraint generation and Boltzmann likelihood parameter estimation methods to obtain new energy parameters for two energy models for secondary structures with pseudoknots, namely, the Dirks–Pierce (DP) and the Cao–Chen (CC) models. To train our parameters, and also to test their accuracy, we create a large data set of both pseudoknotted and pseudoknot-free secondary structures. In addition to structural data our training data set also includes thermodynamic data, for which experimentally determined free energy changes are available for sequences and their reference structures. When incorporated into the HotKnots prediction algorithm, our new parameters result in significantly improved secondary structure prediction on our test data set. Specifically, the prediction accuracy when using our new parameters improves from 68% to 79% for the DP model, and from 70% to 77% for the CC model.
Publisher
Cold Spring Harbor Laboratory
Reference36 articles.
1. Asymmetry in RNA pseudoknots: observation and theory
2. The tmRDB and SRPDB resources
3. Andronescu M (2003) “Algorithms for predicting the secondary structure of pairs and combinatorial sets of nucleic acid strands.”. MS thesis, (University of British Columbia, Vancouver, Canada).
4. Andronescu MS (2008) “Computational approaches for RNA energy parameter estimation.”. PhD thesis, (University of British Columbia, Vancouver, Canada).
5. Efficient parameter estimation for RNA secondary structure prediction
Cited by
68 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献