Chemical transformer compression for accelerating both training and inference of molecular modeling

Authors:

Yu Yi, Börjesson Karl

Abstract

Transformer models have been developed for molecular science and show excellent performance in applications such as quantitative structure-activity relationship (QSAR) modeling and virtual screening (VS). Compared with other types of models, however, they are large and require voluminous training data, which results in high hardware requirements to shorten training and inference time. In this work, cross-layer parameter sharing (CLPS) and knowledge distillation (KD) are used to reduce the size of transformers in molecular science. Both methods not only match the QSAR predictive performance of the original BERT model but are also more parameter efficient. Furthermore, by integrating CLPS and KD into a two-state chemical network, we introduce a new deep lite chemical transformer model, DeLiCaTe. DeLiCaTe trains and infers 4× faster, owing to a 10-fold and 3-fold reduction in the number of parameters and layers, respectively. Meanwhile, the integrated model achieves comparable performance in QSAR and VS because it captures both general-domain (basic structure) and task-specific (specific property prediction) knowledge. Moreover, we anticipate that the model compression strategy provides a pathway to effective generative transformer models for organic drug and material design.
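The abstract names two compression techniques, cross-layer parameter sharing and knowledge distillation. The PyTorch sketch below illustrates the general form of each idea; it is not the DeLiCaTe implementation, and the model dimensions, temperature, and loss weighting are illustrative assumptions.

```python
# Minimal sketch (not the authors' code) of the two ideas named in the abstract:
# cross-layer parameter sharing (one transformer layer reused at every depth)
# and a soft-label knowledge-distillation loss. Hyperparameters are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F


class SharedLayerEncoder(nn.Module):
    """Encoder whose single layer is reused n_layers times (parameter sharing)."""

    def __init__(self, d_model=256, nhead=8, n_layers=4):
        super().__init__()
        self.layer = nn.TransformerEncoderLayer(
            d_model, nhead, dim_feedforward=4 * d_model, batch_first=True
        )
        self.n_layers = n_layers

    def forward(self, x):
        for _ in range(self.n_layers):  # same weights applied at every depth
            x = self.layer(x)
        return x


def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=2.0, alpha=0.5):
    """Blend a softened teacher-matching loss with the ordinary task loss
    (classification-style task loss assumed here for illustration)."""
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * temperature ** 2
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard
```

In this sketch the student reuses one layer's weights across depth, which is what drives the parameter reduction, while the distillation term transfers the teacher's knowledge so predictive performance is retained.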

Funder

European Research Council

Publisher

IOP Publishing

Subject

Artificial Intelligence, Human-Computer Interaction, Software
