ProtFinder: finding subcellular locations of proteins using protein interaction networks

Author:

Grover AayushORCID,Gatto LaurentORCID

Abstract

AbstractProtein subcellular localization prediction plays a crucial role in improving our understanding of different diseases and consequently assists in building drug targeting and drug development pipelines. Proteins are known to co-exist at multiple subcellular locations which make the task of prediction extremely challenging. A protein interaction network is a graph that captures interactions between different proteins. It is safe to assume that if two proteins are interacting, they must share some subcellular locations. With this regard, we propose ProtFinder – the first deep learning-based model that exclusively relies on protein interaction networks to predict the multiple subcellular locations of proteins. We also integrate biological priors like the cellular component of Gene Ontology to make ProtFinder a more biology-aware intelligent system. ProtFinder is trained and tested using the STRING and BioPlex databases whereas the annotations of proteins are obtained from the Human Protein Atlas. Our model obtained an AUC-ROC score of 90.00% and an MCC score of 83.42% on a held-out set of proteins. We also apply ProtFinder to annotate proteins that currently do not have confident location annotations. We observe that ProtFinder is able to confirm some of these unreliable location annotations, while in some cases complementing the existing databases with novel location annotations. The source code for ProtFinder is available at https://github.com/UCLouvain-CBIO/ProtFinder.

Publisher

Cold Spring Harbor Laboratory

Reference37 articles.

1. Bruce Alberts , Alexander Johnson , Julian Lewis , Martin Raff , Keith Roberts , and Peter Walter . Analyzing protein structure and function. In Molecular Biology of the Cell. 4th edition. Garland Science, 2002.

2. Hpslpred: an ensemble multi-label classifier for human protein subcellular location prediction with imbalanced source;Proteomics,2017

3. Genetic programming for creating Chou’s pseudo amino acid based features for submitochondria localization

4. Recent progress in protein subcellular location prediction

5. Xiaoyong Pan , Lei Chen , Min Liu , Tao Huang , and Yu-Dong Cai . Predicting protein subcellular location using learned distributed representations from a protein-protein network. BioRxiv, page 768739, 2019.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3