pyRBDome: A comprehensive computational platform for enhancing and interpreting RNA-binding proteome data-Reference-Cited by-同舟云学术

pyRBDome: A comprehensive computational platform for enhancing and interpreting RNA-binding proteome data

Published:2023-12-08 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Chu Liang-Cui,Christopoulou Niki,McCaughan Hugh,Winterbourne Sophie^ORCID,Cazzola Davide^ORCID,Wang Shichao,Litvin Ulad,Brunon Salomé,Harker Patrick J.B.^ORCID,McNae Iain^ORCID,Granneman Sander^ORCID

Abstract

AbstractHigh-throughput proteomics approaches have revolutionised the identification of RNA-binding proteins (RBPome) and RNA-binding sequences (RBDome) across organisms. Yet the extent of noise, including false-positives, associated with these methodologies, is difficult to quantify as experimental approaches for validating the results are generally low throughput. To address this, we introduce pyRBDome, a pipeline for enhancing RNA-binding proteome datain silico. It aligns the experimental results with RNA-binding site (RBS) predictions from distinct machine learning tools and integrates high-resolution structural data when available. Its statistical evaluation of RBDome data enables quick identification of likely genuine RNA-binders in experimental datasets. Furthermore, by leveraging the pyRBDome results, we have enhanced the sensitivity and specificity of RBS detection through training new ensemble machine learning models. pyRBDome analysis of a human RBDome dataset, compared with known structural data, revealed that while UV cross-linked amino acids were more likely to contain predicted RBSs, they infrequently bind RNA in high-resolution structures. This discrepancy underscores the limitations of structural data as benchmarks, positioning pyRBDome as a valuable alternative for increasing confidence in RBDome datasets.

Publisher

Cold Spring Harbor Laboratory

Reference62 articles.

1. PLIP 2021: expanding the scope of the protein–ligand interaction profiler to DNA and RNA

2. Akiba T , Sano S , Yanase T , Ohta T & Koyama M (2019) Optuna: A Next-generation Hyperparameter Optimization Framework. (http://arxiv.org/abs/1907.10902)

3. Silica-based solid-phase extraction of cross-linked nucleic acid–bound proteins

4. Photoactivatable ribonucleosides mark base-specific RNA-binding sites;Nat Commun,2021

5. Chemical RNA digestion enables robust RNA-binding site mapping at single amino acid resolution;Nature Structural & Molecular Biology,2020