ZoomQA: residue-level protein model accuracy estimation with machine learning on sequential and 3D structural features

Author:

Hippe Kyle1, Lilley Cade1, William Berkenpas Joshua1, Chandana Pocha Ciri2, Kishaba Kiyomi1, Ding Hui3, Hou Jie2, Si Dong4, Cao Renzhi1

Affiliation:

1. Department of Computer Science, Pacific Lutheran University, Tacoma, WA 98447, USA

2. Saint Louis University, USA

3. Center for Informational Biology at University of Electronic Science and Technology of China

4. University of Washington Bothell, USA

Abstract

Motivation: The Estimation of Model Accuracy problem is a cornerstone problem in bioinformatics. As of CASP14, there are 79 global QA methods but only 39 residue-level QA methods, very few of which work on protein complexes. Here, we introduce ZoomQA, a novel single-model method for assessing the accuracy of a tertiary protein structure or complex prediction at the residue level, a task with many applications, such as drug discovery. ZoomQA differs from other methods by considering how the chemical and physical features of a fragment structure (the portion of a protein within a radius $r$ of the target amino acid) change as the radius of contact increases. Fourteen physical and chemical properties of amino acids are used to build a comprehensive representation of every residue within a protein and to grade its placement within the protein as a whole. Moreover, we show the potential of ZoomQA to identify problematic regions of the SARS-CoV-2 protein complex.

Results: We benchmark ZoomQA on CASP14, where it outperforms other state-of-the-art local QA methods and rivals state-of-the-art QA methods in global prediction metrics. Our experiment demonstrates the efficacy of these new features and shows that our method matches the performance of other state-of-the-art methods without homology searching against databases or PSSM matrices.

Availability: http://zoomQA.renzhitech.com
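The "zoom" idea described in the abstract, tracking how a fragment's properties change as the radius $r$ around a target residue grows, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `zoom_profile`, the use of one coordinate per residue, the single toy property, and the chosen radii are all assumptions made for illustration only.

```python
import math

def zoom_profile(coords, values, target, radii):
    """Mean of `values` over residues within each radius of residue `target`.

    Hypothetical helper: `coords` holds one 3D point per residue and
    `values` holds one scalar property per residue (e.g. a hydrophobicity
    score). The target residue is always included (distance 0).
    """
    center = coords[target]
    dists = [math.dist(center, c) for c in coords]
    profile = []
    for r in radii:
        inside = [v for d, v in zip(dists, values) if d <= r]
        profile.append(sum(inside) / len(inside))
    return profile

# Toy example (assumed data): four residues on a line, 4 angstroms apart.
coords = [(0.0, 0.0, 0.0), (4.0, 0.0, 0.0), (8.0, 0.0, 0.0), (12.0, 0.0, 0.0)]
prop = [1.0, 3.0, 5.0, 7.0]

profile = zoom_profile(coords, prop, target=0, radii=[5.0, 9.0, 13.0])
# Change in the fragment's mean property from one radius to the next.
deltas = [b - a for a, b in zip(profile, profile[1:])]
print(profile, deltas)
```

In a setup like this, one such profile per property (the abstract mentions fourteen) could be concatenated into a per-residue feature vector; how ZoomQA actually encodes and combines these features is not specified in the abstract.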

Funder

Natural Sciences Undergraduate Research Program at Pacific Lutheran University

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology, Information Systems


Cited by 7 articles.
