A probabilistic view of protein stability, conformational specificity, and design-Reference-Cited by-同舟云学术

A probabilistic view of protein stability, conformational specificity, and design

Published:2023-09-19 Issue:1 Volume:13 Page:
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Stern Jacob A.,Free Tyler J.,Stern Kimberlee L.,Gardiner Spencer,Dalley Nicholas A.,Bundy Bradley C.,Price Joshua L.,Wingate David,Della Corte Dennis

Abstract

AbstractVarious approaches have used neural networks as probabilistic models for the design of protein sequences. These "inverse folding" models employ different objective functions, which come with trade-offs that have not been assessed in detail before. This study introduces probabilistic definitions of protein stability and conformational specificity and demonstrates the relationship between these chemical properties and the

$$p(\text {structure}|\text {seq})$$

p ( structure | seq ) Boltzmann probability objective. This links the Boltzmann probability objective function to experimentally verifiable outcomes. We propose a novel sequence decoding algorithm, referred to as “BayesDesign”, that leverages Bayes’ Rule to maximize the

$$p(\text {structure}|\text {seq})$$

p ( structure | seq ) objective instead of the

$$p(\text {seq}|\text {structure})$$

p ( seq | structure ) objective common in inverse folding models. The efficacy of BayesDesign is evaluated in the context of two protein model systems, the NanoLuc enzyme and the WW structural motif. Both BayesDesign and the baseline ProteinMPNN algorithm increase the thermostability of NanoLuc and increase the conformational specificity of WW. The possible sources of error in the model are analyzed.

Publisher

Springer Science and Business Media LLC

Subject

Multidisciplinary

Link

https://www.nature.com/articles/s41598-023-42032-1.pdf

Reference40 articles.

1. Defresne, M., Barbe, S. & Schiex, T. Protein design with deep learning. Int. J. Mol. Sci. 22(21) (2021).

2. Koga, N. et al. Principles for designing ideal protein structures. Nature 491, 222–227 (2012).

3. Coates, T. L. et al. Current computational methods for enzyme design. Mod. Phys. Lett. B 35, 2150155–574 (2021).

4. Norn, C. et al. Protein sequence design by conformational landscape optimization. Proc. Natl. Acad. Sci. 118(11), e2017228118 (2021).

5. Simons, K. T., Kooperberg, C., Huang, E. & Baker, D. Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and bayesian scoring functions11edited by f. e. cohen. J. Mol. Biol. 268(1), 209–225 (1997).

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Protein Design Using Structure-Prediction Networks: AlphaFold and RoseTTAFold as Protein Structure Foundation Models;Cold Spring Harbor Perspectives in Biology;2024-03-04