Graph representation learning in bioinformatics: trends, methods and applications

Author:

Yi Hai-Cheng12,You Zhu-Hong3,Huang De-Shuang4,Kwoh Chee Keong5

Affiliation:

1. Chinese Academy of Sciences, Xinjiang Technical Institute of Physics and Chemistry, Urumqi 830011, China

2. University of Chinese Academy of Sciences, Beijing 100049, China

3. School of Computer Science, Northwestern Polytechnical University, Xi'an 710129, China

4. Institute of Machine Learning and Systems Biology, School of Electronics and Information Engineering, Tongji University, Shanghai 201804, China

5. School of Computer Science and Engineering, Nanyang Technological University, 50 Nanyang Avenue, Singapore

Abstract

Abstract Graph is a natural data structure for describing complex systems, which contains a set of objects and relationships. Ubiquitous real-life biomedical problems can be modeled as graph analytics tasks. Machine learning, especially deep learning, succeeds in vast bioinformatics scenarios with data represented in Euclidean domain. However, rich relational information between biological elements is retained in the non-Euclidean biomedical graphs, which is not learning friendly to classic machine learning methods. Graph representation learning aims to embed graph into a low-dimensional space while preserving graph topology and node properties. It bridges biomedical graphs and modern machine learning methods and has recently raised widespread interest in both machine learning and bioinformatics communities. In this work, we summarize the advances of graph representation learning and its representative applications in bioinformatics. To provide a comprehensive and structured analysis and perspective, we first categorize and analyze both graph embedding methods (homogeneous graph embedding, heterogeneous graph embedding, attribute graph embedding) and graph neural networks. Furthermore, we summarize their representative applications from molecular level to genomics, pharmaceutical and healthcare systems level. Moreover, we provide open resource platforms and libraries for implementing these graph representation learning methods and discuss the challenges and opportunities of graph representation learning in bioinformatics. This work provides a comprehensive survey of emerging graph representation learning algorithms and their applications in bioinformatics. It is anticipated that it could bring valuable insights for researchers to contribute their knowledge to graph representation learning and future-oriented bioinformatics studies.

Funder

National Natural Science Foundation of China

National Outstanding Youth Science Foundation

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology,Information Systems

Reference166 articles.

1. Visualizing social networks;Freeman;J Soc Struct,2000

2. Creating an academic landscape of sustainability science: an analysis of the citation network;Kajikawa;Sustain Sci,2007

3. The small world of human language;RFI;Proc R Soc Lond Series B Biol Sci,2001

4. Intrinsic noise in gene regulatory networks;Thattai;Proc Natl Acad Sci,2001

5. Network visualization and analysis of gene expression data using BioLayout Express3D;Theocharidis;Nat Protoc,2009

Cited by 102 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3