Identifying function modules from protein–protein interaction networks based on Szemerédi’s Regularity Lemma

Author:

He Changxiang1ORCID,Li Die1ORCID,Li Yan1ORCID,Yang Peisheng1ORCID,Zhang Qingqian1ORCID,Zhong Wen1ORCID,Shan Haiying2ORCID,Dai Hao3ORCID,Chen LuoNan3ORCID

Affiliation:

1. College of Science, University of Shanghai for Science and Technology, Jungong Road, Shanghai 200093, P. R. China

2. School of Mathematical Science, Tongji University, Shanghai 200092, P. R. China

3. Key Laboratory of Systems Biology, Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai 200031, People’s Republic of China

Abstract

Szemerédi’s Regularity Lemma (SRL) is a crucial tool in the analysis of large graphs, having made significant contributions in the proof of some sensational results in mathematics. Traditional methods for studying proteins in Protein–Protein Interaction (PPI) networks typically only extract the first-order or second-order neighbor information of proteins, ignoring the potential third-order or higher-order neighbor information between proteins, which may hide certain relationships between proteins. To explore more in-depth insights for PPI networks, we take into account the fourth-order neighbor information of proteins and reconstruct the network in this paper, naming it the weighted dense PPI network. We then partition it using SRL, which primarily utilizes the structural information and corresponds to a unique partition of the original network. Bioinformatics analyses such as those for pathway enrichment analysis and multiple sequence alignment show that our method can classify interacting protein pairs, grouping proteins with functional association, disease association, and sequence similarity together. Overall, this paper has three essential contributions: (1) we present a new model to overcome the astronomically large demand of vertices in applying SRL, and achieve protein classification; (2) we reconstruct a weighted dense PPI network which can make SRL work and mine potential interactions more efficiently; and (3) proteins in the same class partitioned by our method not only have sequence similarity, but also have functional associations.

Funder

Innovative Research Group Project of the National Natural Science Foundation of China

Strategic Priority Research Program of the Chinese Academy of Sciences

National Natural Science Foundation of China

Publisher

World Scientific Pub Co Pte Ltd

Subject

Applied Mathematics,Modeling and Simulation

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3