Affiliation:
1. Department of Statistics, University of Oxford , Oxford OX1 3PB, UK
Abstract
Abstract
Summary
Motivation. Predicting the native state of a protein has long been considered a gateway problem for understanding protein folding. Recent advances in structural modeling driven by deep learning have achieved unprecedented success at predicting a protein’s crystal structure, but it is not clear if these models are learning the physics of how proteins dynamically fold into their equilibrium structure or are just accurate knowledge-based predictors of the final state.
Results. In this work, we compare the pathways generated by state-of-the-art protein structure prediction methods to experimental data about protein folding pathways. The methods considered were AlphaFold 2, RoseTTAFold, trRosetta, RaptorX, DMPfold, EVfold, SAINT2 and Rosetta. We find evidence that their simulated dynamics capture some information about the folding pathway, but their predictive ability is worse than a trivial classifier using sequence-agnostic features like chain length. The folding trajectories produced are also uncorrelated with experimental observables such as intermediate structures and the folding rate constant. These results suggest that recent advances in structure prediction do not yet provide an enhanced understanding of protein folding.
Availability. The data underlying this article are available in GitHub at https://github.com/oxpig/structure-vs-folding/
Supplementary information
Supplementary data are available at Bioinformatics online.
Funder
UK's Engineering and Physical Sciences Research Council
Publisher
Oxford University Press (OUP)
Subject
Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability
Reference56 articles.
1. Phenix: a comprehensive python-based system for macromolecular structure solution;Adams;Acta Crystallogr. Sect. D Biol. Crystallogr,2010
2. The Rosetta all-atom energy function for macromolecular modeling and design;Alford;J. Chem. Theory Comput,2017
3. Accurate prediction of protein structures and interactions using a three-track network;Baek;Science, 373, 6557, 871–876,2021
4. The protein data bank;Berman;Nucleic Acids Res,2000
5. Native contacts determine protein folding mechanisms in atomistic simulations;Best;Proc. Natl. Acad. Sci. USA,2013
Cited by
56 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献