Author:
Golinski Alexander W.,Mischler Katelynn M.,Laxminarayan Sidharth,Neurock Nicole,Fossing Matthew,Pichman Hannah,Martiniani Stefano,Hackel Benjamin J.
Abstract
AbstractProteins require high developability - quantified by expression, solubility, and stability - for robust utility as therapeutics, diagnostics, and in other biotechnological applications. Measuring traditional developability metrics is low-throughput in nature, often slowing the developmental pipeline. We evaluated the ability of three high-throughput developability assays to predict the bacterial recombinant expression of paratope variants of the protein scaffold Gp2. Enabled by a phenotype/genotype linkage, assay performance for 105 variants was calculated via deep sequencing of populations sorted by proxied developability. We trained a random forest model that predicts expression from assay performance that is 35% closer to the experimental variance and trains 80% more efficiently than a model predicting from sequence information alone. Utilizing the predicted expression, we performed a sitewise analysis and predicted mutations consistent with enhanced developability. The validated assays offer the ability to identify developable proteins at unprecedented scales, reducing the bottleneck of protein commercialization.
Publisher
Cold Spring Harbor Laboratory
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献