Abstract
Most drugs are small molecules, with their activities typically arising from interactions with protein targets. Accurate predictions of these interactions could greatly accelerate pharmaceutical research. Current machine learning models designed for this task have a limited ability to generalize beyond the proteins used for training. This limitation is likely due to a lack of information exchange between the protein and the small molecule during the generation of the required numerical representations. Here, we introduce ProSmith, a machine learning framework that employs a multimodal Transformer Network to simultaneously process protein amino acid sequences and small molecule strings in the same input. This approach facilitates the exchange of all relevant information between the two types of molecules during the computation of their numerical representations, allowing the model to account for their structural and functional interactions. Our final model combines gradient boosting predictions based on the resulting multimodal Transformer Network with independent predictions based on separate deep learning representations of the proteins and small molecules. The resulting predictions outperform all previous models for predicting drug-target interactions, and the model demonstrates unprecedented generalization capabilities to unseen proteins. We further show that the superior performance of ProSmith is not limited to drug-target interaction predictions, but also leads to improvements in other protein-small molecule interaction prediction tasks, namely the prediction of Michaelis constants (K_M) of enzyme-substrate pairs and the identification of potential substrates for enzymes. The Python code provided can be used to easily implement and improve machine learning predictions of interactions between proteins and arbitrary drug candidates or other small molecules.
Publisher
Cold Spring Harbor Laboratory