Author:
Krishna Rohith,Wang Jue,Ahern Woody,Sturmfels Pascal,Venkatesh Preetham,Kalvet Indrek,Lee Gyu Rie,Morey-Burrows Felix S,Anishchenko Ivan,Humphreys Ian R,McHugh Ryan,Vafeados Dionne,Li Xinting,Sutherland George A,Hitchcock Andrew,Hunter C Neil,Baek Minkyung,DiMaio Frank,Baker David
Abstract
AbstractAlthough AlphaFold2 (AF2) and RoseTTAFold (RF) have transformed structural biology by enabling high-accuracy protein structure modeling, they are unable to model covalent modifications or interactions with small molecules and other non-protein molecules that can play key roles in biological function. Here, we describe RoseTTAFold All-Atom (RFAA), a deep network capable of modeling full biological assemblies containing proteins, nucleic acids, small molecules, metals, and covalent modifications given the sequences of the polymers and the atomic bonded geometry of the small molecules and covalent modifications. Following training on structures of full biological assemblies in the Protein Data Bank (PDB), RFAA has comparable protein structure prediction accuracy to AF2, excellent performance in CAMEO for flexible backbone small molecule docking, and reasonable prediction accuracy for protein covalent modifications and assemblies of proteins with multiple nucleic acid chains and small molecules which, to our knowledge, no existing method can model simultaneously. By fine-tuning on diffusive denoising tasks, we develop RFdiffusion All-Atom (RFdiffusionAA), which generates binding pockets by directly building protein structures around small molecules and other non-protein molecules. Starting from random distributions of amino acid residues surrounding target small molecules, we design and experimentally validate proteins that bind the cardiac disease therapeutic digoxigenin, the enzymatic cofactor heme, and optically active bilin molecules with potential for expanding the range of wavelengths captured by photosynthesis. We anticipate that RFAA and RFdiffusionAA will be widely useful for modeling and designing complex biomolecular systems.
Publisher
Cold Spring Harbor Laboratory
Reference59 articles.
1. Highly accurate protein structure prediction with AlphaFold
2. Accurate prediction of protein structures and interactions using a three-track neural network
3. Glide: A New Approach for Rapid, Accurate Docking and Scoring. 1. Method and Assessment of Docking Accuracy
4. AlphaFill: enriching AlphaFold models with ligands and cofactors;Nat. Methods,2023
5. G. Corso , H. Stärk , B. Jing , R. Barzilay , T. Jaakkola , DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking. arXiv [q-bio.BM] (2022), (available at http://arxiv.org/abs/2210.01776).
Cited by
45 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献