TopoDiff: Improving Protein Backbone Generation with Topology-aware Latent Encoding

Published: 2023-12-14

Author:

Zhang Yuyang, Ma Zinnia, Gong Haipeng

Abstract

The de novo design of protein structures is an intriguing research topic in the field of protein engineering. Recent breakthroughs in diffusion-based generative models have demonstrated substantial promise in tackling this task, notably in the generation of diverse and realistic protein structures. While existing models predominantly focus on unconditional generation or fine-grained conditioning at the residue level, the holistic, top-down approaches to control the overall topological arrangements are still insufficiently explored. In response, we introduce TopoDiff, a diffusion-based framework augmented by a global-structure encoding module, which is capable of unsupervisedly learning a compact latent representation of natural protein topologies with interpretable characteristics and simultaneously harnessing this learned information for controllable protein structure generation. We also propose a novel metric specifically designed to assess the coverage of sampled proteins with respect to the natural protein space. In comparative analyses with existing models, our generative model not only demonstrates comparable performance on established metrics but also exhibits better coverage across the recognized topology landscape. In summary, TopoDiff emerges as a novel solution towards enhancing the controllability and comprehensiveness of de novo protein structure generation, presenting new possibilities for innovative applications in protein engineering and beyond.

Publisher

Cold Spring Harbor Laboratory
