Broadly applicable and accurate protein design by integrating structure prediction networks and diffusion generative models-Reference-Cited by-同舟云学术

Broadly applicable and accurate protein design by integrating structure prediction networks and diffusion generative models

Published:2022-12-10 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Watson Joseph L.^ORCID,Juergens David^ORCID,Bennett Nathaniel R.^ORCID,Trippe Brian L.^ORCID,Yim Jason^ORCID,Eisenach Helen E.^ORCID,Ahern Woody,Borst Andrew J.^ORCID,Ragotte Robert J.^ORCID,Milles Lukas F.^ORCID,Wicky Basile I. M.^ORCID,Hanikel Nikita^ORCID,Pellock Samuel J.,Courbet Alexis^ORCID,Sheffler William,Wang Jue^ORCID,Venkatesh Preetham,Sappington Isaac^ORCID,Torres Susana Vázquez,Lauko Anna^ORCID,De Bortoli Valentin^ORCID,Mathieu Emile,Barzilay Regina,Jaakkola Tommi S.^ORCID,DiMaio Frank^ORCID,Baek Minkyung^ORCID,Baker David^ORCID

Abstract

AbstractThere has been considerable recent progress in designing new proteins using deep learning methods1–9. Despite this progress, a general deep learning framework for protein design that enables solution of a wide range of design challenges, includingde novobinder design and design of higher order symmetric architectures, has yet to be described. Diffusion models10,11have had considerable success in image and language generative modeling but limited success when applied to protein modeling, likely due to the complexity of protein backbone geometry and sequence-structure relationships. Here we show that by fine tuning the RoseTTAFold structure prediction network on protein structure denoising tasks, we obtain a generative model of protein backbones that achieves outstanding performance on unconditional and topology-constrained protein monomer design, protein binder design, symmetric oligomer design, enzyme active site scaffolding, and symmetric motif scaffolding for therapeutic and metal-binding protein design. We demonstrate the power and generality of the method, called RoseTTAFold Diffusion (RFdiffusion), by experimentally characterizing the structures and functions of hundreds of new designs. In a manner analogous to networks which produce images from user-specified inputs, RFdiffusionenables the design of diverse, complex, functional proteins from simple molecular specifications.

Publisher

Cold Spring Harbor Laboratory

Reference52 articles.

1. Robust deep learning–based protein sequence design using ProteinMPNN

2. ProtGPT2 is a deep unsupervised language model for protein design

3. Large-scale design and refinement of stable proteins using sequence-only models;PLOS ONE,2022

4. Scaffolding protein functional sites using deep learning

Cited by 83 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Annealed fractional Lévy–Itō diffusion models for protein generation;Computational and Structural Biotechnology Journal;2024-12

2. Auditing and instructing text-to-image generation models on fairness;AI and Ethics;2024-08-01

3. Accurate prediction of CDR-H3 loop structures of antibodies with deep learning;eLife;2024-06-26

4. An all-atom protein generative model;Proceedings of the National Academy of Sciences;2024-06-25

5. H3-OPT: Accurate prediction of CDR-H3 loop structures of antibodies with deep learning;2024-05-31