Author:
Marashi Sayed-Amir,Eslahchi Changiz,Pezeshk Hamid,Sadeghi Mehdi
Abstract
Abstract
Background
gene identification in genomic DNA sequences by computational methods has become an important task in bioinformatics and computational gene prediction tools are now essential components of every genome sequencing project. Prediction of splice sites is a key step of all gene structural prediction algorithms.
Results
we sought the role of mRNA secondary structures and their information contents for five vertebrate and plant splice site datasets. We selected 900-nucleotide sequences centered at each (real or decoy) donor and acceptor sites, and predicted their corresponding RNA structures by Vienna software. Then, based on whether the nucleotide is in a stem or not, the conventional four-letter nucleotide alphabet was translated into an eight-letter alphabet. Zero-, first- and second-order Markov models were selected as the signal detection methods. It is shown that applying the eight-letter alphabet compared to the four-letter alphabet considerably increases the accuracy of both donor and acceptor site predictions in case of higher order Markov models.
Conclusion
Our results imply that RNA structure contains important data and future gene prediction programs can take advantage of such information.
Publisher
Springer Science and Business Media LLC
Subject
Applied Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Structural Biology
Reference19 articles.
1. Mathé C, Sagot MF, Schiex T, Rouzé P: Current methods of gene prediction, their strengths and weaknesses. Nucleic Acids Res 2002, 30: 4103–4117. 10.1093/nar/gkf543
2. Brent MR, Guigó R: Recent advances in gene structure prediction. Curr Opin Struct Biol 2004, 14: 264–272. 10.1016/j.sbi.2004.05.007
3. Staley JP, Guthrie C: Mechanical devices in the spliceosome: Clocks, motors, springs and things. Cell 1998, 92: 315–326. 10.1016/S0092-8674(00)80925-3
4. Buratti E, Baralle FE: Influence of RNA secondary structure on the pre-mRNA splicing process. Mol Cell Biol 2004, 24: 10505–10514. 10.1128/MCB.24.24.10505-10514.2004
5. Patterson DJ, Yasuhara K, Ruzzo WL: Pre-mRNA secondary structure prediction aids splice site prediction. Pac Symp Biocomput 2002, 7: 223–234.
Cited by
13 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献