Transfer learning using attentions across atomic systems with graph neural networks (TAAG)-Reference-Cited by-同舟云学术

Transfer learning using attentions across atomic systems with graph neural networks (TAAG)

Published:2022-05-14 Issue:18 Volume:156 Page:184702
ISSN:0021-9606
Container-title:The Journal of Chemical Physics
language:en
Short-container-title:J. Chem. Phys.

Author:

Kolluru Adeesh¹^ORCID,Shoghi Nima²,Shuaibi Muhammed¹,Goyal Siddharth²,Das Abhishek²^ORCID,Zitnick C. Lawrence²,Ulissi Zachary¹^ORCID

Affiliation:

1. Department of Chemical Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA

2. Meta AI Research, Menlo Park, California 94025, USA

Abstract

Recent advances in Graph Neural Networks (GNNs) have transformed the space of molecular and catalyst discovery. Despite the fact that the underlying physics across these domains remain the same, most prior work has focused on building domain-specific models either in small molecules or in materials. However, building large datasets across all domains is computationally expensive; therefore, the use of transfer learning (TL) to generalize to different domains is a promising but under-explored approach to this problem. To evaluate this hypothesis, we use a model that is pretrained on the Open Catalyst Dataset (OC20), and we study the model’s behavior when fine-tuned for a set of different datasets and tasks. This includes MD17, the *CO adsorbate dataset, and OC20 across different tasks. Through extensive TL experiments, we demonstrate that the initial layers of GNNs learn a more basic representation that is consistent across domains, whereas the final layers learn more task-specific features. Moreover, these well-known strategies show significant improvement over the non-pretrained models for in-domain tasks with improvements of 53% and 17% for the *CO dataset and across the Open Catalyst Project (OCP) task, respectively. TL approaches result in up to 4× speedup in model training depending on the target data and task. However, these do not perform well for the MD17 dataset, resulting in worse performance than the non-pretrained model for few molecules. Based on these observations, we propose transfer learning using attentions across atomic systems with graph Neural Networks (TAAG), an attention-based approach that adapts to prioritize and transfer important features from the interaction layers of GNNs. The proposed method outperforms the best TL approach for out-of-domain datasets, such as MD17, and gives a mean improvement of 6% over a model trained from scratch.

Publisher

AIP Publishing

Subject

Physical and Theoretical Chemistry,General Physics and Astronomy

Link

https://aip.scitation.org/doi/pdf/10.1063/5.0088019

Reference60 articles.

1. Machine learning for molecular and materials science

2. Retrospective on a decade of machine learning for chemical discovery

3. Machine Learning in Drug Discovery

4. Applications of machine learning in drug discovery and development

5. How AI for Synthesis Can Help Tackle Challenges in Molecular Discovery

Cited by 11 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Divide-and-conquer potentials enable scalable and accurate predictions of forces and energies in atomistic systems;Digital Discovery;2024

2. Applying Large Graph Neural Networks to Predict Transition Metal Complex Energies Using the tmQM_wB97MV Data Set;Journal of Chemical Information and Modeling;2023-12-04

3. Representations of Materials for Machine Learning;Annual Review of Materials Research;2023-07-03

4. Prediction of normal boiling point and critical temperature of refrigerants by graph neural network and transfer learning;International Journal of Refrigeration;2023-07

5. A generalized machine learning framework for brittle crack problems using transfer learning and graph neural networks;Mechanics of Materials;2023-06