Affiliation:
1. Department of Chemistry, Boston University, Boston, MA 02215
Abstract
Enhancing protein thermal stability is important for biomedical and industrial applications as well as in the research laboratory. Here, we describe a simple machine-learning method which identifies amino acid substitutions that contribute to thermal stability based on comparison of the amino acid sequences of homologous proteins derived from bacteria that grow at different temperatures. A key feature of the method is that it compares the sequences based not simply on the amino acid identity, but rather on the structural and physicochemical properties of the side chain. The method accurately identified stabilizing substitutions in three well-studied systems and was validated prospectively by experimentally testing predicted stabilizing substitutions in a polyamine oxidase. In each case, the method outperformed the widely used bioinformatic consensus approach. The method can also provide insight into fundamental aspects of protein structure, for example, by identifying how many sequence positions in a given protein are relevant to temperature adaptation.
Publisher
Proceedings of the National Academy of Sciences