Abstract
AbstractGraph Neural Networks (GNNs) excel in compound property and activity prediction, but the choice of molecular graph representations significantly influences model learning and interpretation. While atom-level molecular graphs resemble natural topology, they overlook key substructures or functional groups and their interpretation partially aligns with chemical intuition. Recent research suggests alternative representations using reduced molecular graphs to integrate higher-level chemical information and leverages both representations for model. However, there is a lack of studies about applicability and impact of different molecular graphs on model learning and interpretation. Here, we introduce MMGX (Multiple Molecular Graph eXplainable discovery), investigating the effects of multiple molecular graphs, including Atom, Pharmacophore, JunctionTree, and FunctionalGroup, on model learning and interpretation with various perspectives. Our findings indicate that multiple graphs relatively improve model performance, but in varying degrees depending on datasets. Interpretation from multiple graphs in different views provides more comprehensive features and potential substructures consistent with background knowledge. These results help to understand model decisions and offer valuable insights for subsequent tasks. The concept of multiple molecular graph representations and diverse interpretation perspectives has broad applicability across tasks, architectures, and explanation techniques, enhancing model learning and interpretation for relevant applications in drug discovery.
Funder
MEXT | Japan Science and Technology Agency
MEXT | Japan Society for the Promotion of Science
Japan Agency for Medical Research and Development
Publisher
Springer Science and Business Media LLC
Reference67 articles.
1. Dara, S., Dhamercherla, S., Jadav, S. S., Babu, C. M. & jawed Ahsan, M. Machine Learning in Drug Discovery: A Review. Artif. Intell. Rev. 55, 1947–1999 (2021).
2. Askr, H. et al. Deep Learning in Drug Discovery: An Integrative Review and Future Challenges. Artif. Intell. Rev. 56, 5975—6037 (2023).
3. Lavecchia, A. Deep Learning in Drug Discovery: Opportunities, Challenges and Future Prospects. Drug Discov. Today 24, 2017–2032 (2019).
4. Irwin, R., Dimitriadis, S., He, J. & Bjerrum, E. J. Chemformer: a pre-trained transformer for computational chemistry. Mach. Learn.: Sci. Technol. 3, 015022 (2022).
5. Wen, N. et al. A fingerprints based molecular property prediction method using the BERT model. J. Cheminform. 14, 71 (2022).
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献