Affiliation:
1. School of Science, School of Big Data, Zhejiang University of Science and Technology, Hangzhou 310008, China
Abstract
Artificial intelligence technologies such as machine learning have been applied to protein engineering, with unique advantages in protein structure, function prediction, catalytic activity, and other issues in recent years. Screening better mutants is still a bottleneck in protein engineering. In this paper, a new sequence-activity relationship method was analyzed for its application in improving the thermal stability of Aspergillus terreus (R)-ω-selective amine transaminase. The experimental data from 6 single-point mutated enzymes were used as a learning dataset to build models and predict the thermostability of 26 mutants. Based on digital signal processing (DSP), this method digitized the amino acid sequence of proteins by fast Fourier transform (FFT) and then established the best model applying partial least squares regression (PLSR) to screen out all possible mutants, especially those with high performance. In protein engineering, the innovative sequence activity relationship (ISAR) method can make a reasonable prediction using limited experimental data and significantly reduce the experimental cost. The half-life (
) of (R)-ω-transaminase was fitted with the amino acid sequence by the ISAR algorithm, resulting in an
of 0.8929 and a cvRMSE of 4.89. At the same time, the mutants with higher
than the existing ones were predicted, laying the groundwork for better (R)-ω-transaminase in the later stage. The ISAR algorithm is expected to provide a new technique for protein evolution and screening.
Funder
Natural Science Foundation of Zhejiang Province
Subject
General Immunology and Microbiology,General Biochemistry, Genetics and Molecular Biology,General Medicine
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献