Direct Generation of Protein Conformational Ensembles via Machine Learning

Published: 2022-06-19

Author:

Janson Giacomo, Valdes-Garcia Gilberto, Heo Lim, Feig Michael

Abstract

Dynamics and conformational sampling are essential for linking protein structure to biological function. Because protein dynamics are challenging to probe experimentally, computer simulations are widely used to describe them, but at significant computational cost that continues to limit the systems that can be studied. Here, we demonstrate that a machine learning model can be trained on simulation data to directly generate physically realistic conformational ensembles of proteins without the need for any sampling and at negligible computational cost. As a proof of principle, a generative adversarial network based on a transformer architecture with self-attention was trained on coarse-grained simulations of intrinsically disordered peptides. The resulting model, idpGAN, can predict sequence-dependent ensembles for any sequence, demonstrating that transferability can be achieved beyond the limited training data. idpGAN was also retrained on atomistic simulation data to show that the approach can be extended in principle to higher-resolution conformational ensemble generation.

Publisher

Cold Spring Harbor Laboratory

References: 53 articles.

1. Moving beyond static snapshots: Protein dynamics and the Protein Data Bank

2. How directed evolution reshapes the energy landscape in an enzyme to boost catalysis

3. Advanced Methods for Accessing Protein Shape-Shifting Present New Therapeutic Opportunities

4. Intrinsic dynamics of an enzyme underlies catalysis

5. Gupta, A. et al. Experimental techniques to study protein dynamics and conformations, in: Advances in Protein Molecular and Structural Biology Methods (eds Timir Tripathi & Vikash Kumar Dubey) 181–197 (Academic Press, 2022).

Cited by 2 articles.

1. WASCO: A Wasserstein-based Statistical Tool to Compare Conformational Ensembles of Intrinsically Disordered Proteins; Journal of Molecular Biology; 2023-07

2. Clustering Heterogeneous Conformational Ensembles of Intrinsically Disordered Proteins with t-Distributed Stochastic Neighbor Embedding; Journal of Chemical Theory and Computation; 2023-06-20