Meta Learning Improves Robustness and Performance in Machine Learning-Guided Protein Engineering

Published: 2023-01-30

Author:

Minot Mason, Reddy Sai T.

Abstract

Machine learning-guided protein engineering continues to progress rapidly; however, collecting large, well-labeled data sets remains time- and resource-intensive. Directed evolution and protein engineering studies often require extensive experimental processes to eliminate noise and fully label high-throughput protein sequence-function data. Meta learning methods established in other fields (e.g., computer vision and natural language processing) have proven effective in learning from noisy data, given the availability of a small data set with trusted labels, and thus could be applied to protein engineering. Here, we generate yeast display antibody mutagenesis libraries and screen them for target antigen binding followed by deep sequencing. Meta learning approaches are able to learn under high synthetic and experimental noise as well as in under-labeled data settings, typically outperforming baselines significantly and often requiring a fraction of the training data. Thus, we demonstrate that meta learning may expedite and improve machine learning-guided protein engineering.

Availability and implementation: The code used in this study is publicly available at https://github.com/LSSI-ETH/meta-learning-for-protein-engineering.

Publisher

Cold Spring Harbor Laboratory

References: 51 articles.

1. Accurate prediction of protein structures and interactions using a three-track neural network

2. Learning from positive and unlabeled data: a survey;Mach. Learn.;2020

3. An improved yeast transformation method for the generation of very large human antibody libraries

4. Low-N protein engineering with data-efficient deep learning

5. Yeast surface display for screening combinatorial polypeptide libraries

Cited by 4 articles.

1. Addressing epistasis in the design of protein function;Proceedings of the National Academy of Sciences;2024-08-12

2. Best practices for machine learning in antibody discovery and development;Drug Discovery Today;2024-07

3. Opportunities and Challenges for Machine Learning-Assisted Enzyme Engineering;ACS Central Science;2024-02-05

4. Machine Learning-Guided Protein Engineering;ACS Catalysis;2023-10-13