Deep learning-driven prediction of drug mechanism of action from large-scale chemical-genetic interaction profiles-Reference-Cited by-同舟云学术

Deep learning-driven prediction of drug mechanism of action from large-scale chemical-genetic interaction profiles

Published:2022-03-12 Issue:1 Volume:14 Page:
ISSN:1758-2946
Container-title:Journal of Cheminformatics
language:en
Short-container-title:J Cheminform

Author:

Liu Chengyou,Hogan Andrew M.,Sturm Hunter,Khan Mohd Wasif,Islam Md. Mohaiminul,Rahman A. S. M. Zisanur,Davis Rebecca,Cardona Silvia T.,Hu Pingzhao^ORCID

Abstract

Abstract Motivation Chemical–genetic interaction profiling is a genetic approach that quantifies the susceptibility of a set of mutants depleted in specific gene product(s) to a set of chemical compounds. With the recent advances in artificial intelligence, chemical–genetic interaction profiles (CGIPs) can be leveraged to predict mechanism of action of compounds. This can be achieved by using machine learning, where the data from a CGIP is fed into the machine learning platform along with the chemical descriptors to develop a chemogenetically trained model. As small molecules can be considered non-structural data, graph convolutional neural networks, which can learn from the chemical structures directly, can be used to successfully predict molecular properties. Clustering analysis, on the other hand, is a critical approach to get insights into the underlying biological relationships between the gene products in the high-dimensional chemical-genetic data. Methods and results In this study, we proposed a comprehensive framework based on the large-scale chemical-genetics dataset built in Mycobacterium tuberculosis for predicting CGIPs using graph-based deep learning models. Our approach is structured into three parts. First, by matching M. tuberculosis genes with homologous genes in Escherichia coli (E. coli) according to their gene products, we grouped the genes into clusters with distinct biological functions. Second, we employed a directed message passing neural network to predict growth inhibition against M. tuberculosis gene clusters using a collection of 50,000 chemicals with the profile. We compared the performance of different baseline models and implemented multi-label tasks in binary classification frameworks. Lastly, we applied the trained model to an externally curated drug set that had experimental results against M. tuberculosis genes to examine the effectiveness of our method. Overall, we demonstrate that our approach effectively created M. tuberculosis gene clusters, and the trained classifier is able to predict activity against essential M. tuberculosis targets with high accuracy. Conclusion This work provides an analytical framework for modeling large-scale chemical-genetic datasets for predicting CGIPs and generating hypothesis about mechanism of action of novel drugs. In addition, this work highlights the importance of graph-based deep neural networks in drug discovery.

Funder

Canadian Institutes of Health Research

Cystic Fibrosis Canada

Publisher

Springer Science and Business Media LLC

Subject

Library and Information Sciences,Computer Graphics and Computer-Aided Design,Physical and Theoretical Chemistry,Computer Science Applications

Link

https://link.springer.com/content/pdf/10.1186/s13321-022-00596-6.pdf

Reference55 articles.

1. Cacace E, Kritikos G, Typas A (2017) Chemical genetics in drug discovery. Curr Opin Syst Biol 4:35–42. https://doi.org/10.1016/j.coisb.2017.05.020

2. DeVito JA, Mills JA, Liu VG et al (2002) An array of target-specific screening strains for antibacterial discovery. Nat Biotechnol 20:478–483. https://doi.org/10.1038/nbt0502-478