Prediction of RNA binding sites in proteins from amino acid sequence-Reference-Cited by-同舟云学术

Prediction of RNA binding sites in proteins from amino acid sequence

Published:2006-06-21 Issue:8 Volume:12 Page:1450-1462
ISSN:1355-8382
Container-title:RNA
language:en
Short-container-title:RNA

Author:

Terribilini Michael,Lee Jae-Hyung,Yan Changhui,Jernigan Robert L.,Honavar Vasant,Dobbs Drena

Abstract

RNA–protein interactions are vitally important in a wide range of biological processes, including regulation of gene expression, protein synthesis, and replication and assembly of many viruses. We have developed a computational tool for predicting which amino acids of an RNA binding protein participate in RNA–protein interactions, using only the protein sequence as input. RNABindR was developed using machine learning on a validated nonredundant data set of interfaces from known RNA–protein complexes in the Protein Data Bank. It generates a classifier that captures primary sequence signals sufficient for predicting which amino acids in a given protein are located in the RNA–protein interface. In leave-one-out cross-validation experiments, RNABindR identifies interface residues with >85% overall accuracy. It can be calibrated by the user to obtain either high specificity or high sensitivity for interface residues. RNABindR, implementing a Naive Bayes classifier, performs as well as a more complex neural network classifier (to our knowledge, the only previously published sequence-based method for RNA binding site prediction) and offers the advantages of speed, simplicity and interpretability of results. RNABindR predictions on the human telomerase protein hTERT are in good agreement with experimental data. The availability of computational tools for predicting which residues in an RNA binding protein are likely to contact RNA should facilitate design of experiments to directly test RNA binding function and contribute to our understanding of the diversity, mechanisms, and regulation of RNA–protein complexes in biological systems. (RNABindR is available as a Web tool from http://bindr.gdcb.iastate.edu.)

Publisher

Cold Spring Harbor Laboratory

Subject

Molecular Biology

Reference58 articles.

1. Structure-based analysis of protein-RNA interactions using the program ENTANGLE

2. The Structure and Function of Telomerase Reverse Transcriptase

3. Functional Regions of Human Telomerase Reverse Transcriptase and Human Telomerase RNA Required for Telomerase Activity and RNA-Protein Interactions

4. Assessing the accuracy of prediction algorithms for classification: an overview

5. The Protein Data Bank

Cited by 136 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A high-throughput search for intracellular factors that affect RNA folding identifiesE. coliproteins PepA and YagL as RNA chaperones that promote RNA remodeling;2024-06-04

2. Predicting nuclear G-quadruplex RNA-binding proteins with roles in transcription and phase separation;Nature Communications;2024-03-22

3. Wild lime psyllid Leuronota fagarae Burckhardt (Hemiptera: Psylloidea) picorna-like virus full genome annotation and classification;Journal of Invertebrate Pathology;2023-11

4. Identification of RNA Oligonucleotide and Protein Interactions Using Term Frequency Inverse Document Frequency and Random Forest;Oligonucleotides - Overview and Applications;2023-03-29

5. Noncoding RNAs and RNA-binding proteins: emerging governors of liver physiology and metabolic diseases;American Journal of Physiology-Cell Physiology;2022-10-01