Benchmark Evaluation of Protein–Protein Interaction Prediction Algorithms

Published:2021-12-22 Issue:1 Volume:27 Page:41
ISSN:1420-3049
Container-title:Molecules
language:en

Author:

Dunham Brandan, Ganapathiraju Madhavi K.

Abstract

Protein–protein interactions (PPIs) perform various functions and regulate processes throughout cells. Knowledge of the full network of PPIs is vital to biomedical research, but most PPIs are still unknown. As it is infeasible to discover all of them experimentally due to technical and resource limitations, computational prediction of PPIs is essential, and the performance of algorithms must be assessed accurately before further application or translation. However, many published methods compose their evaluation datasets incorrectly, using a higher proportion of positive-class data than occurs naturally, which leads to exaggerated performance. We re-implemented various published algorithms and evaluated them on datasets with realistic data compositions and found that their performance is overstated in the original publications, with several methods outperformed by our control models built on ‘illogical’ and random-number features. We conclude that these methods are influenced by an over-characterization of some proteins in the literature and by the scale-free nature of the PPI network, and that they fail when tested on all possible protein pairs. Additionally, we found that sequence-only algorithms performed worse than those that employ functional and expression features. We present a benchmark evaluation of many published algorithms for PPI prediction. The source code of our implementations and the benchmark datasets created here are made available as open source.
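
The abstract's point about dataset composition can be illustrated with a small calculation. The sketch below is not the authors' code; it assumes a hypothetical classifier with a fixed true-positive rate and false-positive rate, and compares its expected precision on a balanced test set with its precision on a set where positives are rare. The 1-in-1000 prevalence and the rate values are illustrative assumptions, not figures from the paper.

```python
def precision_at_prevalence(tpr, fpr, prevalence):
    """Expected precision for fixed TPR/FPR at a given positive-class fraction."""
    true_pos = tpr * prevalence            # fraction of all pairs that are true positives
    false_pos = fpr * (1.0 - prevalence)   # fraction of all pairs that are false positives
    return true_pos / (true_pos + false_pos)


# Hypothetical classifier: 90% sensitivity, 5% false-positive rate (assumed values).
balanced = precision_at_prevalence(0.90, 0.05, 0.5)    # 1:1 positives to negatives
skewed = precision_at_prevalence(0.90, 0.05, 0.001)    # assumed 1 positive per 1000 pairs

print(f"precision on balanced set: {balanced:.3f}")
print(f"precision on skewed set:   {skewed:.3f}")
```

Under these assumptions the same classifier looks far more precise on the balanced set than on the skewed one, which is the kind of inflation the abstract attributes to evaluation sets with too many positives.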

Funder

United States National Library of Medicine

Publisher

MDPI AG

Subject

Chemistry (miscellaneous), Analytical Chemistry, Organic Chemistry, Physical and Theoretical Chemistry, Molecular Medicine, Drug Discovery, Pharmaceutical Science

Link

https://www.mdpi.com/1420-3049/27/1/41/pdf

References: 72 articles.

1. Analysis of Protein–Protein Interaction by Co-IP in Human Cells;Tang,2018

2. Revealing protein-protein interactions at the transcriptome scale by sequencing

3. Where Have All the Interactions Gone? Estimating the Coverage of Two-Hybrid Protein Interaction Maps

4. A reference map of the human binary protein interactome

5. Towards reproducibility in large-scale analysis of protein–protein interactions

Cited by 31 articles.

1. SpatialPPI: Three-dimensional space protein-protein interaction prediction with AlphaFold Multimer;Computational and Structural Biotechnology Journal;2024-12

2. Heterogeneous network approaches to protein pathway prediction;Computational and Structural Biotechnology Journal;2024-12

3. Guiding questions to avoid data leakage in biological machine learning applications;Nature Methods;2024-08

4. INTREPPPID—an orthologue-informed quintuplet network for cross-species prediction of protein–protein interaction;Briefings in Bioinformatics;2024-07-25

5. Co-training based prediction of multi-label protein–protein interactions;Computers in Biology and Medicine;2024-07