Affiliation:
1. College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
2. Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109-2218, USA
Abstract
Abstract
Motivation
With the great progress of deep learning-based inter-residue contact/distance prediction, the discrete space formed by fragment assembly cannot satisfy the distance constraint well. Thus, the optimal solution of the continuous space may not be achieved. Designing an effective closed-loop continuous dihedral angle optimization strategy that complements the discrete fragment assembly is crucial to improve the performance of the distance-assisted fragment assembly method.
Results
In this article, we proposed a de novo protein structure prediction method called IPTDFold based on closed-loop iterative partition sampling, topology adjustment and residue-level distance deviation optimization. First, local dihedral angle crossover and mutation operators are designed to explore the conformational space extensively and achieve information exchange between the conformations in the population. Then, the dihedral angle rotation model of loop region with partial inter-residue distance constraints is constructed, and the rotation angle satisfying the constraints is obtained by differential evolution algorithm, so as to adjust the spatial position relationship between the secondary structures. Finally, the residue distance deviation is evaluated according to the difference between the conformation and the predicted distance, and the dihedral angle of the residue is optimized with biased probability. The final model is generated by iterating the above three steps. IPTDFold is tested on 462 benchmark proteins, 24 FM targets of CASP13 and 20 FM targets of CASP14. Results show that IPTDFold is significantly superior to the distance-assisted fragment assembly method Rosetta_D (Rosetta with distance). In particular, the prediction accuracy of IPTDFold does not decrease as the length of the protein increases. When using the same FastRelax protocol, the prediction accuracy of IPTDFold is significantly superior to that of trRosetta without orientation constraints, and is equivalent to that of the full version of trRosetta.
Availabilityand implementation
The source code and executable are freely available at https://github.com/iobio-zjut/IPTDFold.
Supplementary information
Supplementary data are available at Bioinformatics online.
Funder
National Nature Science Foundation of China
Key Project of Zhejiang Provincial Natural Science Foundation of China
National Key Research and Development Program of China
Publisher
Oxford University Press (OUP)
Subject
Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability
Reference52 articles.
1. CONFOLD: residue–residue contact-guided ab initio protein folding;Adhikari;Proteins Struct. Funct. Bioinf,2015
2. CONFOLD2: improved contact-driven ab initio protein structure modeling;Adhikari;BMC Bioinformatics,2018
3. AlphaFold at CASP13;AlQuraishi;Bioinformatics,2019
4. Protein structure prediction guided by predicted inter-residue geometries;Anishchenko;Fourteenth Meeting of Critical Assessment of Techniques for Protein Structure Prediction,2020
5. Development of a new physics-based internal coordinate mechanics force field and its application to protein loop modeling;Arnautova;Proteins Struct. Funct. Bioinf,2011
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献