Affiliation:
1. School of Medical Information Engineering, Guangdong Pharmaceutical University, Guangzhou, China
2. Guangdong
Provincial Traditional Chinese Medicine Precision Medicine Big Data Engineering Technology Research Center,
Guangzhou, China
Abstract
Background:
In this study, we aimed to develop a new end-to-end learning model
called Graph-Drug-Target Interaction (DTI), which integrates various types of information in the
heterogeneous network data, and to explore automatic learning of the topology-maintaining representations
of drugs and targets, thereby effectively contributing to the prediction of DTI. Precise
predictions of DTI can guide drug discovery and development. Most machine learning algorithms
integrate multiple data sources and combine them with common embedding methods. However,
the relationship between the drugs and target proteins is not well reported. Although some existing
studies have used heterogeneous network graphs for DTI prediction, there are many limitations
in the neighborhood information between the nodes in the heterogeneous network graphs.
We studied the drug-drug interaction (DDI) and DTI from DrugBank Version 3.0, protein–protein
interaction (PPI) from the human protein reference database Release 9, drug structure similarity
from Morgan fingerprints of radius 2 and calculated by RDKit, and protein sequence similarity
from Smith-Waterman score.
Method:
Our study consists of three major components. First, various drugs and target proteins
were integrated, and a heterogeneous network was established based on a series of data
sets. Second, the graph neural networks-inspired graph auto-encoding method was used to extract
high-order structural information from the heterogeneous networks, thereby revealing the
description of nodes (drugs and proteins) and their topological neighbors. Finally, potential
DTI prediction was made, and the obtained samples were sent to the classifier for secondary
classification.
Results:
The performance of Graph-DTI and all baseline methods was evaluated using the sums
of the area under the precision-recall curve (AUPR) and the area under the receiver operating
characteristic curve (AUC). The results indicated that Graph-DTI outperformed the baseline
methods in both performance results.
Conclusion:
Compared with other baseline DTI prediction methods, the results showed that
Graph-DTI had better prediction performance. Additionally, in this study, we effectively classified
drugs corresponding to different targets and vice versa. The above findings showed that
Graph-DTI provided a powerful tool for drug research, development, and repositioning. Graph-
DTI can serve as a drug development and repositioning tool more effectively than previous studies
that did not use heterogeneous network graph embedding.
Publisher
Bentham Science Publishers Ltd.
Subject
Drug Discovery,Molecular Medicine,General Medicine