Author:
Clark Thomas,Subramanian Vidya,Jayaraman Akila,Fitzpatrick Emmett,Gopal Ranjani,Pentakota Niharika,Rurak Troy,Anand Shweta,Viglione Alexander,Raman Rahul,Tharakaraman Kannan,Sasisekharan Ram
Abstract
AbstractThe application of Machine Learning (ML) tools to engineer novel antibodies having predictable functional properties is gaining prominence. Herein, we present a platform that employs an ML-guided optimization of the complementarity-determining region (CDR) together with a CDR framework (FR) shuffling method to engineer affinity-enhanced and clinically developable monoclonal antibodies (mAbs) from a limited experimental screen space (order of 10^2 designs) using only two experimental iterations. Although high-complexity deep learning models like graph neural networks (GNNs) and large language models (LLMs) have shown success on protein folding with large dataset sizes, the small and biased nature of the publicly available antibody-antigen interaction datasets is not sufficient to capture the diversity of mutations virtually screened using these models in an affinity enhancement campaign. To address this key gap, we introduced inductive biases learned from extensive domain knowledge on protein-protein interactions through feature engineering and selected model hyper parameters to reduce overfitting of the limited interaction datasets. Notably we show that this platform performs better than GNNs and LLMs on an in-house validation dataset that is enriched in diverse CDR mutations that go beyond alanine-scanning. To illustrate the broad applicability of this platform, we successfully solved a challenging problem of redesigning two different anti-SARS-COV-2 mAbs to enhance affinity (up to 2 orders of magnitude) and neutralizing potency against the dynamically evolving SARS-COV-2 Omicron variants.
Publisher
Cold Spring Harbor Laboratory
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献