Knotify: An Efficient Parallel Platform for RNA Pseudoknot Prediction Using Syntactic Pattern Recognition

Published:2022-02-02 Issue:1 Volume:5 Page:14
ISSN:2409-9279
Container-title:Methods and Protocols
language:en
Short-container-title:MPs

Author:

Andrikos Christos, Makris Evangelos, Kolaitis Angelos, Rassias Georgios, Pavlatos Christos, Tsanakas Panayiotis

Abstract

Obtaining valuable clues about noncoding RNA (ribonucleic acid) subsequences remains a significant challenge, given that most of the human genome is transcribed into noncoding RNA whose biological functions are unknown. Capturing these clues relies on accurate “base pairing” prediction, also known as “RNA secondary structure prediction”. With COVID-19 considered a severe global threat, the single-stranded SARS-CoV-2 virus underlines the importance of establishing an efficient RNA analysis toolkit. This work aims to contribute to that goal by introducing a novel system that predicts RNA secondary structure patterns (i.e., RNA pseudoknots) by leveraging syntactic pattern-recognition strategies. Focusing on pseudoknot prediction, we formalize RNA secondary structure prediction as primarily a parsing problem and secondarily an optimization problem. The proposed methodology addresses the prediction of first-order (H-type) pseudoknots. We introduce a context-free grammar (CFG) with enough expressive power to recognize potential pseudoknot patterns. An alternative methodology for detecting possible pseudoknots, using a brute-force algorithm, is also implemented. Any input sequence may exhibit multiple potential folding patterns, so a strict methodology is required to determine the single biologically realistic one. To resolve this ambiguity efficiently, we devised a novel heuristic built on the widely accepted notion of free-energy minimization, which uses each pattern’s context to identify the most prominent pseudoknot pattern. The overall process has polynomial-time complexity, and its parallel implementation improves performance in proportion to the deployed hardware. The proposed methodology predicts the core stems of the RNA pseudoknots in the test dataset with a 76.4% recall ratio. It achieved an F1-score of 0.774 and an MCC of 0.543 in discovering all the stems of an RNA sequence, outperforming the particular task. Measurements were taken on a dataset of 262 RNA sequences, establishing a performance speedup of 1.31, 3.45, and 7.76 relative to three well-known platforms. The implementation source code is publicly available in the Knotify GitHub repository.
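The recall, F1-score, and MCC figures quoted in the abstract are standard base-pair classification metrics. The following is a minimal sketch of how such metrics can be computed from a reference and a predicted structure written in extended dot-bracket notation, where `()` and `[]` mark the two crossing stems of an H-type pseudoknot. It is not the authors' evaluation code: the function names `parse_pairs` and `score` are hypothetical, and counting every unordered pair of positions as a candidate when deriving true negatives is an assumption made here for illustration.

```python
import math


def parse_pairs(structure):
    """Return the set of base pairs (i, j) in an extended dot-bracket string.

    Round and square brackets are matched independently, so crossing
    (pseudoknotted) stems such as "((..[[..))..]]" are handled.
    """
    closer_to_opener = {")": "(", "]": "["}
    stacks = {"(": [], "[": []}
    pairs = set()
    for j, ch in enumerate(structure):
        if ch in stacks:
            stacks[ch].append(j)
        elif ch in closer_to_opener:
            i = stacks[closer_to_opener[ch]].pop()
            pairs.add((i, j))
    return pairs


def score(reference, predicted):
    """Compute precision, recall, F1 and MCC over predicted base pairs."""
    n = len(reference)
    ref, pred = parse_pairs(reference), parse_pairs(predicted)
    tp = len(ref & pred)   # pairs present in both structures
    fp = len(pred - ref)   # predicted pairs absent from the reference
    fn = len(ref - pred)   # reference pairs that were missed
    # Assumption: every unordered pair of positions is a candidate pair.
    tn = n * (n - 1) // 2 - tp - fp - fn
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"tp": tp, "fp": fp, "fn": fn, "tn": tn,
            "precision": precision, "recall": recall, "f1": f1, "mcc": mcc}


if __name__ == "__main__":
    # Toy structures, not taken from the paper's dataset.
    reference = "((..[[..))..]]"
    predicted = "((..[...))..]."
    for name, value in score(reference, predicted).items():
        print(f"{name}: {value}")
```

Other conventions for the true-negative count exist, and the MCC value depends on which one is chosen, so numbers from this sketch are not directly comparable with those reported in the abstract.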

Publisher

MDPI AG

Subject

Biochemistry, Genetics and Molecular Biology (miscellaneous); Structural Biology; Biotechnology

Link

https://www.mdpi.com/2409-9279/5/1/14/pdf

References: 91 articles.

1. https://bit.ly/dataset_pseudobase_knotify

2. Knotty: efficient and accurate prediction of complex RNA pseudoknot structures

3. IPknot: fast and accurate prediction of RNA secondary structures with pseudoknots using integer programming

4. The Noncoding RNA Revolution—Trashing Old Rules to Forge New Ones

5. Let Me Count the Ways: Mechanisms of Gene Regulation by miRNAs and siRNAs

Cited by 6 articles.

1. Knotify_V2.0: Deciphering RNA Secondary Structures with H-Type Pseudoknots and Hairpin Loops; Genes; 2024-05-23

2. Navigating the Multiverse: A Hitchhiker’s Guide to Selecting Harmonisation Methods for Multimodal Biomedical Data; 2024-03-22

3. Syntactic Pattern Recognition for the Prediction of L-Type Pseudoknots in RNA; Applied Sciences; 2023-04-21

4. Knotify+: Toward the Prediction of RNA H-Type Pseudoknots, Including Bulges and Internal Loops; Biomolecules; 2023-02-06

5. Computational tools to study RNA-protein complexes; Frontiers in Molecular Biosciences; 2022-10-07