SPIN-CGNN: Improved fixed backbone protein design with contact map-based graph construction and contact graph neural network-Reference-Cited by-同舟云学术

SPIN-CGNN: Improved fixed backbone protein design with contact map-based graph construction and contact graph neural network

Published:2023-07-07 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Zhang Xing^ORCID,Yin Hongmei,Ling Fei,Zhan Jian,Zhou Yaoqi^ORCID

Abstract

AbstractRecent advances in deep learning have significantly improved the ability to infer protein sequences directly from protein structures for the fix-backbone design. The methods have evolved from the early use of multi-layer perceptrons to convolutional neural networks, transformer, and graph neural networks (GNN). However, the conventional approach of constructing K-nearest-neighbors (KNN) graph for GNN has limited the utilization of edge information, which plays a critical role in network performance. Here we introduced SPIN-CGNN based on protein contact maps for nearest neighbors. Together with auxiliary edge updates and selective kernels, we found that SPIN-CGNN provided a comparable performance in refolding ability by AlphaFold2 to the current state-of-the-art techniques but a significant improvement over them in term of sequence recovery, perplexity, deviation from amino-acid compositions of native sequences, conservation of hydrophobic positions, and low complexity regions, according to the test by unseen structures and “hallucinated” structures. Results suggest that low complexity regions in the sequences designed by deep learning techniques remain to be improved, when compared to the native sequences.

Publisher

Cold Spring Harbor Laboratory

Reference44 articles.

1. Protein-Structure Prediction by Recombination of Fragments

2. Energy functions in de novo protein design: current challenges and future prospects;Annual review of biophysics,2013

3. Advances in protein structure prediction and design

4. Energy Functions for Protein Design: Adjustment with Protein–Protein Complex Affinities, Models for the Unfolded State, and Negative Design of Solubility and Specificity

5. Protein design with a comprehensive statistical energy function and boosted by experimental selection for foldability;Nature communications,2014