Abstract
AbstractThe Protein Data Bank (PDB) undergoes an exponential expansion in terms of the number of macromolecular structures deposited every year. A pivotal question is how this rapid growth of structural information improves the quality of three-dimensional models constructed by contemporary bioinformatics approaches. To address this problem, we performed a retrospective analysis of the structural coverage of a representative set of proteins using remote homology detected by COMPASS and HHpred. We show that the number of proteins whose structures can be confidently predicted increased during a 9-year period between 2005 and 2014 on account of the PDB growth alone. Nevertheless, this encouraging trend slowed down noticeably around the year 2008 and has yielded insignificant improvements ever since. At the current pace, it is unlikely that the protein structure prediction problem will be solved in the near future using existing template-based modeling techniques. Therefore, further advances in experimental structure determination, qualitatively better approaches in fold recognition, and more accurate template-free structure prediction methods are desperately needed.
Subject
Health Informatics,Biochemistry, Genetics and Molecular Biology (miscellaneous),Medicine (miscellaneous),General Computer Science
Reference80 articles.
1. de AG From local structure to a global framework : recognition of protein folds;Joseph;J Soc Interface,2014
2. Protein structure prediction when is it useful;Zhang;Curr Biol,2009
3. Protein fold recognition using sequence profiles and its application in structural genomics Protein;Koonin;Adv Chem,2000
4. Structural keeping up with expanding knowledge of the protein universe;Grabowski;genomics Curr Biol,2007
5. The protein structure prediction problem could be solved using the current PDB;Zhang;library Proc Natl Acad Sci USA,2005
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献