DeepFold: enhancing protein structure prediction through optimized loss functions, improved template features, and re-optimized energy function

Author:

Lee Jae-Won12,Won Jong-Hyun12,Jeon Seonggwang12,Choo Yujin23,Yeon Yubin12,Oh Jin-Seon23,Kim Minsoo4,Kim SeonHwa5,Joung InSuk6,Jang Cheongjae7,Lee Sung Jong8,Kim Tae Hyun1,Jin Kyong Hwan5,Song Giltae9ORCID,Kim Eun-Sol1,Yoo Jejoong4,Paek Eunok1ORCID,Noh Yung-Kyun110,Joo Keehyoung2ORCID

Affiliation:

1. Department of Computer Science, Hanyang University , Seoul 04763, Korea

2. Center for Advanced Computation, Korea Institute for Advanced Study , Seoul 02455, Korea

3. Department of Artificial intelligence, Hanyang University, Seoul 04763, Korea

4. Department of Physics, Sungkyunkwan University , Suwon 16419, Korea

5. School of Electrical Engineering, Korea University , Seoul 02841, Korea

6. Standigm Inc. , Seoul 06234, Korea

7. Artificial Intelligence Institute, Hanyang University , Seoul 04763, Korea

8. Basic Science Research Institute, Changwon National University , Changwon 51140, Korea

9. School of Computer Science and Engineering, Pusan National University , Busan 46241, Korea

10. School of Computational Sciences, Korea Institute for Advanced Study , Seoul 02455, Korea

Abstract

Abstract Motivation Predicting protein structures with high accuracy is a critical challenge for the broad community of life sciences and industry. Despite progress made by deep neural networks like AlphaFold2, there is a need for further improvements in the quality of detailed structures, such as side-chains, along with protein backbone structures. Results Building upon the successes of AlphaFold2, the modifications we made include changing the losses of side-chain torsion angles and frame aligned point error, adding loss functions for side chain confidence and secondary structure prediction, and replacing template feature generation with a new alignment method based on conditional random fields. We also performed re-optimization by conformational space annealing using a molecular mechanics energy function which integrates the potential energies obtained from distogram and side-chain prediction. In the CASP15 blind test for single protein and domain modeling (109 domains), DeepFold ranked fourth among 132 groups with improvements in the details of the structure in terms of backbone, side-chain, and Molprobity. In terms of protein backbone accuracy, DeepFold achieved a median GDT-TS score of 88.64 compared with 85.88 of AlphaFold2. For TBM-easy/hard targets, DeepFold ranked at the top based on Z-scores for GDT-TS. This shows its practical value to the structural biology community, which demands highly accurate structures. In addition, a thorough analysis of 55 domains from 39 targets with publicly available structures indicates that DeepFold shows superior side-chain accuracy and Molprobity scores among the top-performing groups. Availability and implementation DeepFold tools are open-source software available at https://github.com/newtonjoo/deepfold.

Funder

Institute of Information & communications Technology Planning & Evaluation

Korea government

National Research Foundation of Korea

Ministry of Science and ICT

Publisher

Oxford University Press (OUP)

Subject

Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3