1. Devlin, J., Chang, M.-W., Lee, K. & Toutanova, K. BERT: pre-training of deep bidirectional transformers for language understanding. In Proc. Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (eds Burstein, J. et al.) 4171–4186 (Association for Computational Linguistics, 2019).
2. Brown, T. et al. Language models are few-shot learners. Adv. Neural Inf. Process. Syst. 33, 1877–1901 (2020).
3. Bommasani, R. et al. On the opportunities and risks of foundation models. Preprint at https://arxiv.org/abs/2108.07258 (2021).
4. Chowdhery, A. et al. PaLM: scaling language modeling with pathways. J. Mach. Learn. Res. 24, 1–113 (2023).
5. Bubeck, S. et al. Sparks of artificial general intelligence: early experiments with GPT-4. Preprint at https://arxiv.org/abs/2303.12712 (2023).