Heterogeneous graph embedding model for predicting interactions between TF and target gene
Author:
Huang Yu-An1ORCID,
Pan Gui-Qing1,
Wang Jia1,
Li Jian-Qiang1,
Chen Jie1,
Wu Yang-Han1
Affiliation:
1. College of Computer Science and Software Engineering, Shenzhen University , Shenzhen, China
Abstract
Abstract
Motivation
Identifying the target genes of transcription factors (TFs) is of great significance for biomedical researches. However, using biological experiments to identify TF–target gene interactions is still time consuming, expensive and limited to small scale. Existing computational methods for predicting underlying genes for TF to target is mainly proposed for their binding sites rather than the direct interaction. To bridge this gap, we in this work proposed a deep learning prediction model, named HGETGI, to identify the new TF–target gene interaction. Specifically, the proposed HGETGI model learns the patterns of the known interaction between TF and target gene complemented with their involvement in different human disease mechanisms. It performs prediction based on random walk for meta-path sampling and node embedding in a skip-gram manner.
Results
We evaluated the prediction performance of the proposed method on a real dataset and the experimental results show that it can achieve the average area under the curve of 0.8519 ± 0.0731 in fivefold cross validation. Besides, we conducted case studies on the prediction of two important kinds of TF, NFKB1 and TP53. As a result, 33 and 32 in the top-40 ranking lists of NFKB1 and TP53 were successfully confirmed by looking up another public database (hTftarget). It is envisioned that the proposed HGETGI method is feasible and effective for predicting TF–target gene interactions on a large scale.
Availability and implementation
The source code and dataset are available at https://github.com/PGTSING/HGETGI.
Supplementary information
Supplementary data are available at Bioinformatics online.
Funder
National Key R&D Program of China
National Natural Science Foundation of China
Guangdong Basic and Applied Basic Research Foundation
Guangdong “Pearl River Talent Recruitment Program
Shenzhen Science and Technology Innovation Commission-Stable Support Program (General Program
Shenzhen Science and Technology Innovation Commission
Publisher
Oxford University Press (OUP)
Subject
Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献