Protein Ensemble Generation through Variational Autoencoder Latent Space Sampling-Reference-Cited by-同舟云学术

Protein Ensemble Generation through Variational Autoencoder Latent Space Sampling

Published:2023-08-01 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Mansoor Sanaa^ORCID,Baek Minkyung^ORCID,Park Hahnbeom^ORCID,Lee Gyu Rie^ORCID,Baker David

Abstract

AbstractMapping the ensemble of protein conformations that contribute to function and can be targeted by small molecule drugs remains an outstanding challenge. Here we explore the use of soft-introspective variational autoencoders for reducing the challenge of dimensionality in the protein structure ensemble generation problem. We convert high-dimensional protein structural data into a continuous, low-dimensional representation, carry out search in this space guided by a structure quality metric, then use RoseTTAFold to generate 3D structures. We use this approach to generate ensembles for the cancer relevant protein K-Ras, training the VAE on a subset of the available K-Ras crystal structures and MD simulation snapshots, and assessing the extent of sampling close to crystal structures withheld from training. We find that our latent space sampling procedure rapidly generates ensembles with high structural quality and is able to sample within 1 angstrom of held out crystal structures, with a consistency higher than MD simulation or AlphaFold2 prediction. The sampled structures sufficiently recapitulate the cryptic pockets in the held-out K-Ras structures to allow for small molecule docking.

Publisher

Cold Spring Harbor Laboratory

Reference28 articles.

1. Anand, N. , & Huang, P. S. (2018). Generative modeling for protein structures. Advances in Neural Information Processing Systems.

2. Baek, M. , DiMaio, F. , Anishchenko, I. , Dauparas, J. , Ovchinnikov, S. , Lee, G. R. , Wang, J. , Cong, Q. , Kinch, L. N. , Schaeffer, R. D. , Millán, C. , Park, H. , Adams, C. , Glassman, C. R. , DeGiovanni, A. , Pereira, J. H. , Rodrigues, A. V , van Dijk, A. A. , Ebrecht, A. C. , … Baker, D. (2021). Accurate prediction of protein structures and interactions using a three-track neural network. Science, eabj8754. https://doi.org/10.1126/science.abj8754.

3. Exploring the structural origins of cryptic sites on proteins

4. CryptoSite: Expanding the Druggable Proteome by Characterization and Prediction of Cryptic Binding Sites

5. Daniel, T. , & Tamar, A. (2020). Soft-IntroVAE: Analyzing and Improving the Introspective Variational Autoencoder. http://arxiv.org/abs/2012.13253.

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Assessing AF2’s ability to predict structural ensembles of proteins;2024-04-17

2. Exploring the conformational ensembles of protein-protein complex with transformer-based generative model;2024-02-26

3. Interpretable Atomistic Prediction and Functional Analysis of Conformational Ensembles and Allosteric States in Protein Kinases Using AlphaFold2 Adaptation with Randomized Sequence Scanning and Local Frustration Profiling;2024-02-20

4. Diffusion in a quantized vector space generates non-idealized protein structures and predicts conformational distributions;2023-11-18