Abstract
AbstractMapping the ensemble of protein conformations that contribute to function and can be targeted by small molecule drugs remains an outstanding challenge. Here we explore the use of soft-introspective variational autoencoders for reducing the challenge of dimensionality in the protein structure ensemble generation problem. We convert high-dimensional protein structural data into a continuous, low-dimensional representation, carry out search in this space guided by a structure quality metric, then use RoseTTAFold to generate 3D structures. We use this approach to generate ensembles for the cancer relevant protein K-Ras, training the VAE on a subset of the available K-Ras crystal structures and MD simulation snapshots, and assessing the extent of sampling close to crystal structures withheld from training. We find that our latent space sampling procedure rapidly generates ensembles with high structural quality and is able to sample within 1 angstrom of held out crystal structures, with a consistency higher than MD simulation or AlphaFold2 prediction. The sampled structures sufficiently recapitulate the cryptic pockets in the held-out K-Ras structures to allow for small molecule docking.
Publisher
Cold Spring Harbor Laboratory
Reference28 articles.
1. Anand, N. , & Huang, P. S. (2018). Generative modeling for protein structures. Advances in Neural Information Processing Systems.
2. Baek, M. , DiMaio, F. , Anishchenko, I. , Dauparas, J. , Ovchinnikov, S. , Lee, G. R. , Wang, J. , Cong, Q. , Kinch, L. N. , Schaeffer, R. D. , Millán, C. , Park, H. , Adams, C. , Glassman, C. R. , DeGiovanni, A. , Pereira, J. H. , Rodrigues, A. V , van Dijk, A. A. , Ebrecht, A. C. , … Baker, D. (2021). Accurate prediction of protein structures and interactions using a three-track neural network. Science, eabj8754. https://doi.org/10.1126/science.abj8754.
3. Exploring the structural origins of cryptic sites on proteins
4. CryptoSite: Expanding the Druggable Proteome by Characterization and Prediction of Cryptic Binding Sites
5. Daniel, T. , & Tamar, A. (2020). Soft-IntroVAE: Analyzing and Improving the Introspective Variational Autoencoder. http://arxiv.org/abs/2012.13253.