In silico proof of principle of machine learning-based antibody design at unconstrained scale

Published: 2021-07-09

Author:

Akbar Rahmad, Robert Philippe A., Weber Cédric R., Widrich Michael, Frank Robert, Pavlović Milena, Scheffer Lonneke, Chernigovskaya Maria, Snapkov Igor, Slabodkin Andrei, Mehta Brij Bhushan, Miho Enkelejda, Lund-Johansen Fridtjof, Andersen Jan Terje, Hochreiter Sepp, Haff Ingrid Hobæk, Klambauer Günter, Sandve Geir Kjetil, Greiff Victor

Abstract

Generative machine learning (ML) has been postulated to be a major driver in the computational design of antigen-specific monoclonal antibodies (mAb). However, efforts to confirm this hypothesis have been hindered by the infeasibility of testing arbitrarily large numbers of antibody sequences for their most critical design parameters: paratope, epitope, affinity, and developability. To address this challenge, we leveraged a lattice-based antibody-antigen binding simulation framework, which incorporates a wide range of physiological antibody binding parameters. The simulation framework both enables the computation of antibody-antigen 3D structures and functions as an oracle for unrestricted prospective evaluation of the antigen specificity of ML-generated antibody sequences. We found that a deep generative model, trained exclusively on antibody sequence (1D) data, can be used to design native-like conformational (3D) epitope-specific antibodies, matching or exceeding the training dataset in affinity and developability variety. Furthermore, we show that transfer learning enables the generation of high-affinity antibody sequences from low-N training data. Finally, we validated that the antibody design insight gained from simulated antibody-antigen binding data is applicable to experimental real-world data. Our work establishes a priori feasibility and the theoretical foundation of high-throughput ML-based mAb design.

Highlights

A large-scale dataset of 70M synthetic antibody-antigen complexes [3 orders of magnitude larger than the current state of the art] that reflect biological complexity allows the prospective evaluation of antibody generative deep learning

Combination of generative learning, synthetic antibody-antigen binding data, and prospective evaluation shows that deep learning-driven antibody design and discovery at an unconstrained level is feasible

Transfer learning (low-N learning) coupled to generative learning shows that antibody-binding rules may be transferred across unrelated antibody-antigen complexes

Experimental validation of antibody-design conclusions drawn from deep learning on synthetic antibody-antigen binding data

Graphical abstract

We leverage large synthetic ground-truth data to demonstrate (A, B) the unconstrained deep generative learning-based generation of native-like antibody sequences and (C) the prospective evaluation of conformational (3D) affinity, paratope-epitope pairs, and developability. (D) Finally, we show increased generation quality of low-N-based machine learning models via transfer learning.

Publisher

Cold Spring Harbor Laboratory

References: 70 articles.

1. Development of therapeutic antibodies for the treatment of diseases

2. A human monoclonal antibody blocking SARS-CoV-2 infection

3. The growth and potential of human antiviral monoclonal antibody therapeutics

4. Research and Development on Therapeutic Agents and Vaccines for COVID-19 and Related Human Coronavirus Diseases; ACS Cent Sci, 2020

5. I. Torjesen, Drug development: the journey of a medicine from lab to shelf. Pharm. J. (2015) (available at https://www.pharmaceutical-journal.com/publications/tomorrows-pharmacist/drug-development-the-journey-of-a-medicine-from-lab-to-shelf/20068196.article?firstPass=false).

Cited by 13 articles.

1. Enhancing viscosity control in antibody formulations: A framework for the biophysical screening of mutations targeting solvent-accessible hydrophobic and electrostatic patches; 2024-03-12

2. Leveraging Artificial Intelligence to Expedite Antibody Design and Enhance Antibody–Antigen Interactions; Bioengineering; 2024-02-15

3. The dengue-specific immune response and antibody identification with machine learning; npj Vaccines; 2024-01-20

4. Staying Ahead of the Game: How SARS-CoV-2 has Accelerated the Application of Machine Learning in Pandemic Management; BioDrugs; 2023-07-18

5. Deep mutational learning predicts ACE2 binding and antibody escape to combinatorial mutations in the SARS-CoV-2 receptor-binding domain; Cell; 2022-10