Affiliation:
1. Istanbul Technical University
Abstract
Abstract
Virtual screening (VS) is one of the well-established approaches in drug discovery which speeds up the search for a bioactive molecule and, reduces costs and efforts associated with experiments. VS helps to narrow down the search space of chemical space and allows selecting fewer and more probable candidate compounds for experimental testing. Docking calculations are one of the commonly used and highly appreciated structure-based drug discovery methods. Databases for chemical structures of small molecules have been growing rapidly. However, at the moment virtual screening of large libraries via docking is not very common. In this work, we aim to accelerate docking studies by predicting docking scores without explicitly performing docking calculations. We experimented with an attention based long short-term memory (LSTM) neural network for an efficient prediction of docking scores as well as other machine learning models such as XGBoost. By using docking scores of a small number of ligands we trained our models and predicted docking scores of a few million molecules. Specifically, we tested our approaches seven datasets that were produced in-house drug discovery studies. In one of the targets, by training only 7000 molecules we predicted docking scores for 3 million molecules with R2 (coefficient of determination) of 0.84. We designed the system with ease of use in mind. All the user needs to provide is a csv file containing smiles and their respective docking scores, the system then outputs a model that the user can use for the prediction of docking score for a new molecule.
Publisher
Research Square Platform LLC
Reference20 articles.
1. Lean-docking: exploiting ligands’ predicted docking scores to accelerate molecular docking;Berenger F;J Chem Inf Model,2021
2. Chen T, Guestrin C (2016), August Xgboost: A scalable tree boosting system. In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining (pp. 785–794)
3. Progressive docking: a hybrid QSAR/docking approach for accelerating in silico high throughput screening;Cherkasov A;J Med Chem,2006
4. Predicting ligand binding modes from neural networks trained on protein–ligand interaction fingerprints;Chupakhin V;J Chem Inf Model,2013
5. Bahdanau D, Cho K, Bengio Y (2014) Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473