Blind Assessment of Monomeric AlphaFold2 Protein Structure Models with Experimental NMR Data
Author:
Li Ethan H.ORCID, Spaman LauraORCID, Tejero RobertoORCID, Huang Yuanpeng JanetORCID, Ramelot Theresa A.ORCID, Fraga Keith J.ORCID, Prestegard James H.ORCID, Kennedy Michael A.ORCID, Montelione Gaetano T.ORCID
Abstract
AbstractRecent advances in molecular modeling of protein structures are changing the field of structural biology.AlphaFold-2(AF2), an AI system developed by DeepMind, Inc., utilizes attention-based deep learning to predict models of protein structures with high accuracy relative to structures determined by X-ray crystallography and cryo-electron microscopy (cryoEM). Comparing AF2 models to structures determined using solution NMR data, both high similarities and distinct differences have been observed. Since AF2 was trained on X-ray crystal and cryoEM structures, we assessed how accurately AF2 can model small, monomeric, solution protein NMR structures which (i) were not used in the AF2 training data set, and (ii) did not have homologous structures in the Protein Data Bank at the time of AF2 training. We identified nine open source protein NMR data sets for such “blind” targets, including chemical shift, raw NMR FID data, NOESY peak lists, and (for 1 case)15N-1H residual dipolar coupling data. For these nine small (70 - 108 residues) monomeric proteins, we generated AF2 prediction models and assessed how well these models fit to these experimental NMR data, using several well-established NMR structure validation tools. In most of these cases, the AF2 models fit the NMR data nearly as well, or sometimes better than, the corresponding NMR structure models previously deposited in the Protein Data Bank. These results provide benchmark NMR data for assessing new NMR data analysis and protein structure prediction methods. They also document the potential for using AF2 as a guiding tool in protein NMR data analysis, and more generally for hypothesis generation in structural biology research.HighlightsAF2 models assessed against NMR data for 9 monomeric proteins not used in training.AF2 models fit NMR data almost as well as the experimentally-determined structures.RPF-DP, PSVS, andPDBStatsoftware provide structure quality and RDC assessment.RPF-DPanalysis using AF2 models suggests multiple conformational states.
Publisher
Cold Spring Harbor Laboratory
Reference62 articles.
1. Critical assessment of methods of protein structure prediction (CASP)-Round XIV;Proteins,2021 2. J. Jumper , R. Evans , A. Pritzel , T. Green , M. Figurnov , O. Ronneberger , K. Tunyasuvunakool , R. Bates , A. Žídek , A. Potapenko , A. Bridgland , C. Meyer , S.A.A. Kohl , A.J. Ballard , A. Cowie , B. Romera-Paredes , S. Nikolov , R. Jain , J. Adler , T. Back , S. Petersen , D. Reiman , E. Clancy , M. Zielinski , M. Steinegger , M. Pacholska , T. Berghammer , S. Bodenstein , D. Silver , O. Vinyals , A.W. Senior , K. Kavukcuoglu , P. Kohli , D. Hassabis , Highly accurate protein structure prediction with AlphaFold, Nature, (2021). 3. Protein structure predictions to atomic accuracy with AlphaFold 4. M. Baek , F. DiMaio , I. Anishchenko , J. Dauparas , S. Ovchinnikov , G.R. Lee , J. Wang , Q. Cong , L.N. Kinch , R.D. Schaeffer , C. Millan , H. Park , C. Adams , C.R. Glassman , A. DeGiovanni , J.H. Pereira , A.V. Rodrigues , A.A. van Dijk , A.C. Ebrecht , D.J. Opperman , T. Sagmeister , C. Buhlheller , T. Pavkov-Keller , M.K. Rathinaswamy , U. Dalwadi , C.K. Yip , J.E. Burke , K.C. Garcia , N.V. Grishin , P.D. Adams , R.J. Read , D. Baker , Accurate prediction of protein structures and interactions using a three-track neural network, Science, (2021). 5. The impact of AlphaFold2 one year on;Nat Methods,2022
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|