An overview on nucleic-acid G-quadruplex prediction: from rule-based methods to deep neural networks

Published:2023-07 Issue:4 Volume:24
ISSN:1467-5463
Container-title:Briefings in Bioinformatics
language:en

Author:

Elimelech-Zohar Karin¹, Orenstein Yaron¹,²

Affiliation:

1. Department of Computer Science, Bar-Ilan University, Ramat Gan, 5290002, Israel

2. The Mina & Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, 5290002, Israel

Abstract

Nucleic-acid G-quadruplexes (G4s) play vital roles in many cellular processes. Due to their importance, researchers have developed experimental assays to measure nucleic-acid G4s in high throughput. The generated high-throughput datasets gave rise to unique opportunities to develop machine-learning-based methods, and in particular deep neural networks, to predict G4s in any given nucleic-acid sequence and any species. In this paper, we review the success stories of deep-neural-network applications for G4 prediction. We first cover the experimental technologies that generated the most comprehensive nucleic-acid G4 high-throughput datasets in recent years. We then review classic rule-based methods for G4 prediction. We proceed by reviewing the major machine-learning and deep-neural-network applications to nucleic-acid G4 datasets and report a novel comparison between them. Next, we present the interpretability techniques used on the trained neural networks to learn key molecular principles underlying nucleic-acid G4 folding. As a new result, we calculate the overlap between measured DNA and RNA G4s and compare the performance of DNA- and RNA-G4 predictors on RNA- and DNA-G4 datasets, respectively, to demonstrate the potential of transfer learning from DNA G4s to RNA G4s. Last, we conclude with open questions in the field of nucleic-acid G4 prediction and computational modeling.

Funder

Israel Science Foundation

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology, Information Systems

Link

https://academic.oup.com/bib/article-pdf/24/4/bbad252/50916906/bbad252.pdf

References: 58 articles.

1. Genome-wide analysis of RNA secondary structure;Bevilacqua;Annu Rev Genet,2016

2. Formation of parallel four-stranded complexes by guanine-rich motifs in DNA and its implications for meiosis;Sen;Nature,1988

3. G-quadruplexes: prediction, characterization, and biological application;Kwok;Trends Biotechnol,2017

4. Detecting RNA G-quadruplexes (rG4s) in the transcriptome;Kwok;Cold Spring Harb Perspect Biol,2018

Cited by 6 articles.

1. Insights into computer-aided G-quadruplex prediction in the digital age;Medicinal Chemistry Research;2024-08-28

2. Special Issue “Bioinformatics of Unusual DNA and RNA Structures”;International Journal of Molecular Sciences;2024-05-10

3. Prediction of DNA i-motifs via machine learning;Nucleic Acids Research;2024-02-14

4. Prediction of DNA i-Motifs Via Machine Learning;2023-12-12

5. EndoQuad: a comprehensive genome-wide experimentally validated endogenous G-quadruplex database;Nucleic Acids Research;2023-10-30