Author:
Houmani Moustafa,Peterkin Finlay,Antoun Gerard,Fischer Louis,Hammi Anissa
Abstract
AbstractModern protein engineering is powered by sequence-function data-sets. We have developed parSEQ, a platform that maximizes the capture of these protein sequence-function data-sets through a sequence-first-screen-later approach. parSEQ relies on Next-Generation Sequencing (NGS) to reverse the conventional screen-first-sequence-later workflow. This allows for the high-throughput retrieval of variant sequences from DNA pools, ensuring that every screened variant’s functional data is paired with its sequence data. This report details parSEQ’s methodology and its integration into various protein engineering workflows. Through several case studies, we illustrate parSEQ’s broad applicability. These case studies describe the use of parSEQ for high-throughput variant DNA-template retrieval, expression-ready bacterial clone isolation, sourcing DNA for denovo designed proteins, and preparing targeted mutational libraries. Our findings suggest that parSEQ’s approach to capturing sequence-function data-sets can advance protein engineering efforts, particularly in the age of artificial intelligence and machine learning.
Publisher
Cold Spring Harbor Laboratory
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献