Mole-BERT: Rethinking Pre-training Graph Neural Networks for Molecules

Published: 2023-04-13

Author:

Xia Jun¹, Zhao Chengshuai², Hu Bozhen³, Gao Zhangyang³, Tan Cheng³, Liu Yue³, Li Siyuan³, Li Stan Z.¹

Affiliation:

1. Westlake University

2. University of California, Irvine

3. Westlake University

Abstract

Recent years have witnessed the prosperity of pre-training graph neural networks (GNNs) for molecules. Typically, atom types as node attributes are randomly masked and GNNs are then trained to predict the masked types, as in AttrMask (Hu et al., 2020), following the Masked Language Modeling (MLM) task of BERT (Devlin et al., 2019). However, unlike MLM, where the vocabulary is large, AttrMask pre-training does not learn informative molecular representations because the atom "vocabulary" is small and unbalanced. To address this problem, we propose a variant of VQ-VAE (van den Oord et al., 2017) as a context-aware tokenizer that encodes atom attributes into chemically meaningful discrete codes. This enlarges the atom vocabulary and mitigates the quantitative divergence between dominant atoms (e.g., carbon) and rare atoms (e.g., phosphorus). With the enlarged atom "vocabulary", we propose a novel node-level pre-training task, dubbed Masked Atoms Modeling (MAM), which randomly masks some discrete codes and then pre-trains GNNs to predict them. MAM also mitigates another issue of AttrMask, namely negative transfer, and it can be easily combined with various pre-training tasks to improve their performance. Furthermore, we propose triplet masked contrastive learning (TMCL) for graph-level pre-training to model the heterogeneous semantic similarity between molecules for effective molecule retrieval. MAM and TMCL constitute a novel pre-training framework, Mole-BERT, which can match or outperform state-of-the-art methods in a fully data-driven manner. We release the code at https://github.com/junxia97/Mole-BERT.
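
The two node-level steps described in the abstract, tokenizing atoms into discrete codes and then masking some codes for prediction, can be illustrated with a minimal sketch. This is not the authors' implementation (see the released repository for that). It assumes a nearest-neighbour codebook lookup, which is the standard quantization step in VQ-VAE, and uses a toy 2-D codebook, hand-written embeddings, and hypothetical helper names (`quantize`, `mask_tokens`, `MASK_ID`) chosen only for illustration. The abstract does not specify the encoder, codebook size, or masking rate.

```python
import random

def quantize(embeddings, codebook):
    """Map each embedding to the index of its nearest codebook vector (squared L2)."""
    tokens = []
    for vec in embeddings:
        dists = [sum((v - c) ** 2 for v, c in zip(vec, code)) for code in codebook]
        tokens.append(dists.index(min(dists)))
    return tokens

def mask_tokens(tokens, mask_rate, mask_id, rng):
    """Hide a random subset of tokens; return corrupted inputs and prediction targets."""
    n_mask = max(1, round(mask_rate * len(tokens)))
    positions = sorted(rng.sample(range(len(tokens)), n_mask))
    corrupted = list(tokens)
    for p in positions:
        corrupted[p] = mask_id
    # A model would be trained to recover targets[p] at each masked position p.
    targets = {p: tokens[p] for p in positions}
    return corrupted, targets

# Toy stand-ins (assumptions): in practice the embeddings would come from a
# context-aware encoder and the codebook would be learned.
codebook = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
embeddings = [[0.1, 0.1], [0.9, 0.2], [0.1, 0.8], [0.95, 0.9], [0.2, 0.0]]

tokens = quantize(embeddings, codebook)
MASK_ID = len(codebook)  # reserve one extra id for the mask symbol
corrupted, targets = mask_tokens(tokens, 0.4, MASK_ID, random.Random(0))
print(tokens, corrupted, targets)
```

Because the code assigned to an atom depends on its embedding rather than only on its element, two atoms of the same element can in principle receive different codes, which is how a tokenizer of this kind can enlarge the effective vocabulary.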

Publisher

American Chemical Society (ACS)

Link

https://chemrxiv.org/engage/api-gateway/chemrxiv/assets/orp/resource/item/64361823a41dec1a56e75135/original/mole-bert-rethinking-pre-training-graph-neural-networks-for-molecules.pdf

Cited by 27 articles.

1. Molecular representation contrastive learning via transformer embedding to graph neural networks; Applied Soft Computing; 2024-10

2. MaskMol: Knowledge-guided Molecular Image Pre-Training Framework for Activity Cliffs with Pixel Masking; 2024-09-09

3. Machine learning for predicting protein properties: A comprehensive review; Neurocomputing; 2024-09

4. A bioactivity foundation model using pairwise meta-learning; Nature Machine Intelligence; 2024-08-14

5. Attribute-guided prototype network for few-shot molecular property prediction; Briefings in Bioinformatics; 2024-07-25