Abstract
AbstractIn computer-aided drug discovery, quantitative structure activity relation models are trained to predict biological activity from chemical structure. Despite the recent success of applying graph neural network to this task, important chemical information such as molecular chirality is ignored. To fill this crucial gap, we proposeMolecular-KernelGraphNeuralNetwork (MolKGNN) for molecular representation learning, which features SE(3)-/conformation invariance, chiralityawareness, and interpretability. For our MolKGNN, we first design a molecular graph convolution to capture the chemical pattern by comparing the atom’s similarity with the learnable molecular kernels. Furthermore, we propagate the similarity score to capture the higher-order chemical pattern. To assess the method, we conduct a comprehensive evaluation with nine well-curated datasets spanning numerous important drug targets that feature realistic high class imbalance and it demonstrates the superiority of MolKGNN over other GNNs in CADD. Meanwhile, the learned kernels identify patterns that agree with domain knowledge, confirming the pragmatic interpretability of this approach. Our codes are publicly available athttps://github.com/meilerlab/MolKGNN.
Publisher
Cold Spring Harbor Laboratory
Reference54 articles.
1. Adams, K. ; Pattanaik, L. ; and Coley, C. W. 2021. Learning 3D Representations of Molecular Chirality with Invariance to Bond Rotations. arXiv preprint arXiv:2110.04383.
2. Geometric deep learning on molecular representations;Nature Machine Intelligence,2021
3. Baell, J. B. ; and Holloway, G. A. 2010. New substructure filters for removal of pan assay interference compounds (PAINS) from screening libraries and for their exclusion in bioassays. Journal of medicinal chemistry, 53(7).
4. Integration of virtual and high-throughput screening
5. Brown, B. ; Vu, O. ; Geanes, A. R. ; Kothiwale, S. ; Butkiewicz, M. ; Lowe, E. W. ; Mueller, R. ; Pape, R. ; Mendenhall, J. ; and Meiler, J. 2022. Introduction to the Bio-Chemical Library (BCL): An application-based open-source toolkit for integrated cheminformatics and machine learning in computer-aided drug discovery. Frontiers in pharmacology, 341.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献