Multifaceted protein–protein interaction prediction based on Siamese residual RCNN-Reference-Cited by-同舟云学术

Multifaceted protein–protein interaction prediction based on Siamese residual RCNN

Published:2019-07 Issue:14 Volume:35 Page:i305-i314
ISSN:1367-4803
Container-title:Bioinformatics
language:en
Short-container-title:

Author:

Chen Muhao¹,Ju Chelsea J -T¹,Zhou Guangyu¹,Chen Xuelu¹,Zhang Tianran²,Chang Kai-Wei¹,Zaniolo Carlo¹,Wang Wei¹

Affiliation:

1. Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, USA

2. Department of Bioengineering, University of California, Los Angeles, Los Angeles, CA, USA

Abstract

AbstractMotivationSequence-based protein–protein interaction (PPI) prediction represents a fundamental computational biology problem. To address this problem, extensive research efforts have been made to extract predefined features from the sequences. Based on these features, statistical algorithms are learned to classify the PPIs. However, such explicit features are usually costly to extract, and typically have limited coverage on the PPI information.ResultsWe present an end-to-end framework, PIPR (Protein–Protein Interaction Prediction Based on Siamese Residual RCNN), for PPI predictions using only the protein sequences. PIPR incorporates a deep residual recurrent convolutional neural network in the Siamese architecture, which leverages both robust local features and contextualized information, which are significant for capturing the mutual influence of proteins sequences. PIPR relieves the data pre-processing efforts that are required by other systems, and generalizes well to different application scenarios. Experimental evaluations show that PIPR outperforms various state-of-the-art systems on the binary PPI prediction problem. Moreover, it shows a promising performance on more challenging problems of interaction type prediction and binding affinity estimation, where existing approaches fall short.Availability and implementationThe implementation is available at https://github.com/muhaochen/seq_ppi.git.Supplementary informationSupplementary data are available at Bioinformatics online.

Funder

National Institutes of Health

National Science Foundation

Publisher

Oxford University Press (OUP)

Subject

Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability

Link

http://academic.oup.com/bioinformatics/article-pdf/35/14/i305/29098736/btz328.pdf

Reference67 articles.

1. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs;Altschul;Nucleic Acids Res,1997

2. Google’s AI tool deepvariant promises significantly fewer genome errors;Anderson;Clinical OMICs,2018

3. Controlling the false discovery rate: a practical and powerful approach to multiple testing;Benjamini;J. R. Stat. Soc. Series B (Methodol.),1995

4. The protein data bank;Berman;Nucleic Acids Res,2000

5. Neural article pair modeling for Wikipedia sub-article matching;Chen;ECML-PKDD,2018

Cited by 215 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An Extended Feature Representation Technique for Predicting Sequenced-based Host-pathogen Protein-protein Interaction;Current Bioinformatics;2025-01

2. SpatialPPI: Three-dimensional space protein-protein interaction prediction with AlphaFold Multimer;Computational and Structural Biotechnology Journal;2024-12

3. Funnel graph neural networks with multi-granularity cascaded fusing for protein–protein interaction prediction;Expert Systems with Applications;2024-12

4. BioPrediction-RPI: Democratizing the prediction of interaction between non-coding RNA and protein with end-to-end machine learning;Computational and Structural Biotechnology Journal;2024-12

5. DeepPepPI: A deep cross-dependent framework with information sharing mechanism for predicting plant peptide-protein interactions;Expert Systems with Applications;2024-10