Aligning protein generative models with experimental fitness via Direct Preference Optimization

Published: 2024-05-21

Author:

Widatalla Talal, Rafailov Rafael, Hie Brian

Abstract

Generative models trained on unlabeled protein datasets have demonstrated a remarkable ability to predict some biological functions without any task-specific training data. However, this capability does not extend to all relevant functions and, in many cases, the unsupervised model still underperforms task-specific, supervised baselines. We hypothesize that this is due to a fundamental “alignment gap” in which the rules learned during unsupervised training are not guaranteed to be related to the function of interest. Here, we demonstrate how to provide protein generative models with useful task-specific information without losing the rich, general knowledge learned during pretraining. Using an optimization task called Direct Preference Optimization (DPO), we align a structure-conditioned language model to generate stable protein sequences by encouraging the model to prefer stabilizing over destabilizing variants given a protein backbone structure. Our resulting model, ProteinDPO, is the first structure-conditioned language model preference-optimized to experimental data. ProteinDPO achieves competitive stability prediction and consistently outperforms both unsupervised and finetuned versions of the model. Notably, the aligned model also performs well in domains beyond its training data to enable absolute stability prediction of large proteins and binding affinity prediction of multi-chain complexes, while also enabling single-step stabilization of diverse backbones. These results indicate that ProteinDPO has learned generalizable information from its biophysical alignment data.
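
As a rough illustration of the preference objective mentioned in the abstract, the sketch below computes the standard pairwise DPO loss for a single (stabilizing, destabilizing) variant pair. It assumes that each sequence's log-likelihood given the backbone is already available from both the model being trained and a frozen reference copy. The function name, argument names, and the default `beta` are hypothetical, and the abstract does not say whether the authors use this exact form or a variant of it.

```python
import math


def dpo_pair_loss(policy_logp_w, policy_logp_l, ref_logp_w, ref_logp_l, beta=0.1):
    """Standard pairwise DPO loss for one preference pair.

    policy_logp_w / policy_logp_l: log-likelihood of the preferred (e.g. stabilizing)
        and dispreferred (e.g. destabilizing) sequence under the model being trained.
    ref_logp_w / ref_logp_l: the same quantities under the frozen reference model.
    beta: strength of the implicit penalty for drifting from the reference
        (hypothetical default, not taken from the paper).
    """
    # How much more the trained model favors the preferred sequence
    # than the reference model does, scaled by beta.
    margin = beta * ((policy_logp_w - ref_logp_w) - (policy_logp_l - ref_logp_l))
    # -log(sigmoid(margin)), written as softplus(-margin) for numerical stability.
    if margin > 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# Before any training the policy equals the reference, so the margin is zero.
untrained = dpo_pair_loss(-10.0, -12.0, -10.0, -12.0)
# After shifting probability toward the preferred sequence, the loss drops.
shifted = dpo_pair_loss(-9.0, -13.0, -10.0, -12.0)
```

Because the loss depends only on log-likelihood ratios against the reference model, minimizing it raises the relative probability of the preferred sequence while discouraging large departures from the pretrained distribution, which matches the abstract's stated goal of adding task-specific information without discarding pretrained knowledge.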

Publisher

Cold Spring Harbor Laboratory
