Author:
Viswanathan Karthik,Goel Manan,Laghuvarapu Siddhartha,Varma Girish,Priyakumar U. Deva
Abstract
AbstractThe discovery of potential therapeutic agents for life-threatening diseases has become a significant problem. There is a requirement for fast and accurate methods to identify drug-like molecules that can be used as potential candidates for novel targets. Existing techniques like high-throughput screening and virtual screening are time-consuming and inefficient. Traditional molecule generation pipelines are more efficient than virtual screening but use time-consuming docking software. Such docking functions can be emulated using Machine Learning models with comparable accuracy and faster execution times. However, we find that when pre-trained machine learning models are employed in generative pipelines as oracles, they suffer from model degradation in areas where data is scarce. In this study, we propose an active learning-based model that can be added as a supplement to enhanced molecule generation architectures. The proposed method uses uncertainty sampling on the molecules created by the generator model and dynamically learns as the generator samples molecules from different regions of the chemical space. The proposed framework can generate molecules with high binding affinity with $$\sim$$
∼
a 70% improvement in runtime compared to the baseline model by labeling only $$\sim$$
∼
30% of molecules compared to the baseline oracle.
Funder
Kohli Center on Intelligent Systems
IHub-Data, IIIT-Hyderabad
Department of Science and Technology, Ministry of Science and Technology, India
Publisher
Springer Science and Business Media LLC
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献