Author:
Mao Weian,Zhu Muzhi,Chen Hao,Shen Chunhua
Abstract
AbstractProteins serve as the foundation of life. Most diseases and challenges in life sciences are intimately linked to protein structures. In this paper, we propose a novel vector field network (VFN) for modeling protein structure. Unlike previous methods that extract geometric information relying heavily on hand-crafted features, VFN establishes a new geometric representation paradigm through a novel vector field operator. This vector field operator can not only eliminate the reliance on hand-crafted features, but also capture the implicit geometric relationships between residues. Thus, it enables VFN to have better generalizability and flexibility. We evaluate VFN on the protein inverse folding task. Experiment results show that VFN can significantly improve the performance of the state-of-the-art method, PiFold, by 3.0% (51.7%vs. 54.7%) in terms of the sequence recovery score, and outperform the recent solid baseline, Protein MPNN, by 8.7% (46.0%vs. 54.7%). Furthermore, we scale up VFN with all known protein structure data. Finally, the model achieves a recovery score of57.1%, pushing the accuracy to the next level.
Publisher
Cold Spring Harbor Laboratory
Reference51 articles.
1. Deep learning using rectified linear units (ReLU);arXiv preprint,2018
2. Kappel, Kalli. The Rosetta all-atom energy function for macromolecular modeling and design;Computational and Structural Biotechnology Journal,2017
3. Protein sequence design with a learned potential;Nature Communications,2022
4. Bera, Asim K. De novo protein design by deep network hallucination;Nature,2021
5. Stephen K. Burley , Helen M. Berman , Gerard J. Kleywegt , John L. Markley , Haruki Nakamura , and Sameer Velankar . Protein data bank (pdb): the single global macromolecular structure archive. Protein Crystallography: Methods and Protocols, pages 627–641, 2017.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献