Abstract
Accumulating evidence shows that drug-target interactions (DTIs) play a crucial role in genomic drug discovery. Although experimental techniques have made great progress, identifying DTIs in the laboratory remains time-consuming and expensive. In silico models are therefore needed to supplement biological experiments by predicting potential DTIs. In this work, we design a new model that predicts DTIs by combining chemical sub-structures with protein evolutionary information. Specifically, we first use the Position-Specific Scoring Matrix (PSSM) to convert each protein sequence into a numerical descriptor containing biological evolutionary information. We then apply the Discrete Cosine Transform (DCT) to extract hidden features from the PSSM and integrate them with the chemical sub-structure descriptor of the drug. Finally, we use a Rotation Forest (RF) classifier to predict whether the drug and the target protein interact. In 5-fold cross-validation (CV) experiments, the average accuracy of the proposed model on the benchmark datasets of Enzymes, Ion Channels, GPCRs and Nuclear Receptors reached 0.9140, 0.8919, 0.8724 and 0.8111, respectively. To evaluate the model more fully, we compared it with alternative feature extraction methods, alternative classifiers, and other state-of-the-art models. We also carried out case studies, in which 8 of the 10 drug-target pairs with the highest prediction scores were confirmed by related databases. These results indicate that the proposed model predicts DTIs well and can provide reliable candidates for biological experiments.
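The feature-construction step described above (PSSM, then DCT, then concatenation with the sub-structure descriptor) can be illustrated with a minimal sketch. Everything concrete in it is an assumption made for illustration, since the abstract does not give these details: the function names, the toy 3 x 4 matrix standing in for a real PSSM (which has one row per residue and 20 columns), the 5-bit fingerprint, and the choice to keep a small top-left block of low-frequency DCT coefficients. The Rotation Forest classifier and the cross-validation are not shown.

```python
import math


def dct_1d(x):
    """Orthonormal DCT-II of a sequence of numbers."""
    n = len(x)
    out = []
    for k in range(n):
        s = sum(x[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n))
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(scale * s)
    return out


def dct_2d(matrix):
    """Separable 2D DCT-II: transform every row, then every column."""
    rows = [dct_1d(r) for r in matrix]
    cols = [dct_1d(list(c)) for c in zip(*rows)]
    # Transpose back so the result has the same shape as the input.
    return [list(r) for r in zip(*cols)]


def pssm_dct_features(pssm, keep_rows, keep_cols):
    """Flatten the low-frequency (top-left) block of DCT coefficients.

    How many coefficients to keep is an assumption of this sketch.
    """
    coeffs = dct_2d(pssm)
    return [coeffs[r][c] for r in range(keep_rows) for c in range(keep_cols)]


def build_pair_vector(fingerprint_bits, pssm, keep_rows=2, keep_cols=3):
    """Concatenate a drug fingerprint with the protein's DCT features."""
    drug_part = [float(b) for b in fingerprint_bits]
    return drug_part + pssm_dct_features(pssm, keep_rows, keep_cols)


# Hypothetical toy inputs, not data from the paper.
toy_pssm = [[1, -2, 0, 3], [0, 1, -1, 2], [2, 0, 1, -3]]
toy_fingerprint = [1, 0, 1, 1, 0]

vector = build_pair_vector(toy_fingerprint, toy_pssm)
print(len(vector))
```

Keeping only the top-left block gives every protein a feature vector of the same length regardless of sequence length, provided the PSSM has at least `keep_rows` rows. The resulting vector for each drug-target pair would then be passed to a classifier.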
Funder
National Natural Science Foundation of China
Publisher
Springer Science and Business Media LLC
Cited by
12 articles.