Leveraging large language models for predictive chemistry-Reference-Cited by-同舟云学术

Leveraging large language models for predictive chemistry

Published:2024-02-06 Issue:2 Volume:6 Page:161-169
ISSN:2522-5839
Container-title:Nature Machine Intelligence
language:en
Short-container-title:Nat Mach Intell

Author:

Jablonka Kevin Maik,Schwaller Philippe^ORCID,Ortega-Guerrero Andres^ORCID,Smit Berend^ORCID

Abstract

AbstractMachine learning has transformed many fields and has recently found applications in chemistry and materials science. The small datasets commonly found in chemistry sparked the development of sophisticated machine learning approaches that incorporate chemical knowledge for each application and, therefore, require specialized expertise to develop. Here we show that GPT-3, a large language model trained on vast amounts of text extracted from the Internet, can easily be adapted to solve various tasks in chemistry and materials science by fine-tuning it to answer chemical questions in natural language with the correct answer. We compared this approach with dedicated machine learning models for many applications spanning the properties of molecules and materials to the yield of chemical reactions. Surprisingly, our fine-tuned version of GPT-3 can perform comparably to or even outperform conventional machine learning techniques, in particular in the low-data limit. In addition, we can perform inverse design by simply inverting the questions. The ease of use and high performance, especially for small datasets, can impact the fundamental approach to using machine learning in the chemical and material sciences. In addition to a literature search, querying a pre-trained large language model might become a routine way to bootstrap a project by leveraging the collective knowledge encoded in these foundation models, or to provide a baseline for predictive tasks.

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s42256-023-00788-1.pdf

Reference88 articles.

1. Bommasani, R. et al. On the opportunities and risks of foundation models. Preprint at https://arxiv.org/abs/2108.07258 (2021).

2. Vaswani, A. et al. Attention is all you need. Adv. Neural Inf. Process. Syst. https://proceedings.neurips.cc/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf (2017).

3. Chowdhery, A. et al. PaLM: scaling language modeling with pathways. J. Mach. Learn. Res. 24, 1–113 (2023).

4. Hoffmann, J. et al. An empirical analysis of compute-optimal large language model training. Adv. Neural Inf. Process. Syst. 35, 30016–30030 (2022).

5. Brown, T. et al. Language models are few-shot learners. Adv. Neural Inf. Process. Syst. 33, 1877–1901 (2020).

Cited by 34 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Catalysing (organo-)catalysis: Trends in the application of machine learning to enantioselective organocatalysis;Beilstein Journal of Organic Chemistry;2024-09-10

2. Kinase Drug Discovery: Impact of Open Science and Artificial Intelligence;Molecular Pharmaceutics;2024-09-06

3. Large Language Models, scientific knowledge and factuality: A framework to streamline human expert evaluation;Journal of Biomedical Informatics;2024-09

4. Recent Advances and Prospects in High‐Performance Bio‐Based Phthalonitrile Resins;Advanced Functional Materials;2024-08-28

5. Generative artificial intelligence performs rudimentary structural biology modeling;Scientific Reports;2024-08-21