Modeling sequence-space exploration and emergence of epistatic signals in protein evolution

Published: 2021-06-05

Author:

Bisardi Matteo, Rodriguez-Rivas Juan, Zamponi Francesco, Weigt Martin

Abstract

During their evolution, proteins explore sequence space via an interplay between random mutations and phenotypic selection. Here we build upon recent progress in reconstructing data-driven fitness landscapes for families of homologous proteins to propose stochastic models of experimental protein evolution. These models quantitatively predict important features of experimentally evolved sequence libraries, such as fitness distributions and position-specific mutational spectra. They also allow us to efficiently simulate sequence libraries for a vast array of combinations of experimental parameters, such as sequence divergence, selection strength and library size. We showcase the potential of the approach by re-analyzing two recent experiments that determine protein structure from signals of epistasis emerging in experimental sequence libraries. To be detectable, these signals require sufficiently large and sufficiently diverged libraries. Our modeling framework offers a quantitative explanation for the variable success of recently published experiments. Furthermore, we can forecast the outcome of time- and resource-intensive evolution experiments, thereby opening a way to computationally optimize experimental protocols.

Publisher

Cold Spring Harbor Laboratory

Cited by 6 articles.

1. Tuned Fitness Landscapes for Benchmarking Model-Guided Protein Design; 2022-10-30

2. Epistasis Creates Invariant Sites and Modulates the Rate of Molecular Evolution; Molecular Biology and Evolution; 2022-05-01

3. Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes; 2022-01-23

4. Learning the local landscape of protein structures with convolutional neural networks; Journal of Biological Physics; 2021-11-09

5. Efficient generative modeling of protein sequences using simple autoregressive models; Nature Communications; 2021-10-04