Interpreting Molecular Dynamics Forces as Deep Learning Gradients Improves Quality Of Predicted Protein Structures-Reference-Cited by-同舟云学术

Interpreting Molecular Dynamics Forces as Deep Learning Gradients Improves Quality Of Predicted Protein Structures

Published:2023-10-05 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

King Jonathan Edward^ORCID,Koes David Ryan^ORCID

Abstract

ABSTRACTProtein structure predictions from deep learning models like AlphaFold2, despite their remarkable accuracy, are likely insufficient for direct use in downstream tasks like molecular docking. The functionality of such models could be improved with a combination of increased accuracy and physical intuition. We propose a new method to train deep learning protein structure prediction models using molecular dynamics force fields to work toward these goals. Our custom PyTorch loss function, OpenMM-Loss, represents the potential energy of a predicted structure. OpenMM-Loss can be applied to any all-atom representation of protein structure capable of mapping into our software package, SidechainNet. We demonstrate our method’s efficacy by finetuning OpenFold. We show that subsequently predicted protein structures, both before and after a relaxation procedure, exhibit comparable accuracy while displaying lower potential energy and improved structural quality as assessed by MolProbity metrics.SIGNIFICANCEWe propose a novel framework to directly incorporate forces from molecular dynamics as gradients for training deep learning models like AlphaFold2. We implement our method as a PyTorch loss function, OpenMM-Loss, which frames the potential energy of predicted protein structures as a minimization objective. When applied to OpenFold, our method demonstrates improved structural quality, lower potential energy, and comparable accuracy relative to the OpenFold baseline. Our framework may enhance the ability of deep learning models to recapitulate fundamental biophysical principles, reducing the number of structural irregularities in their predictions and paving the way for more effective downstream applications.

Publisher

Cold Spring Harbor Laboratory

Reference31 articles.

1. Highly accurate protein structure prediction with AlphaFold

2. High-resolutionde novostructure prediction from primary sequence

3. Accurate prediction of protein structures and interactions using a three-track neural network

4. Evolutionary-scale prediction of atomic-level protein structure with a language model

5. The Protein Data Bank