DoBo: Protein domain boundary prediction by integrating evolutionary signals and machine learning-Reference-Cited by-同舟云学术

DoBo: Protein domain boundary prediction by integrating evolutionary signals and machine learning

Published:2011-02-01 Issue:1 Volume:12 Page:
ISSN:1471-2105
Container-title:BMC Bioinformatics
language:en
Short-container-title:BMC Bioinformatics

Author:

Eickholt Jesse,Deng Xin,Cheng Jianlin

Abstract

Abstract Background Accurate identification of protein domain boundaries is useful for protein structure determination and prediction. However, predicting protein domain boundaries from a sequence is still very challenging and largely unsolved. Results We developed a new method to integrate the classification power of machine learning with evolutionary signals embedded in protein families in order to improve protein domain boundary prediction. The method first extracts putative domain boundary signals from a multiple sequence alignment between a query sequence and its homologs. The putative sites are then classified and scored by support vector machines in conjunction with input features such as sequence profiles, secondary structures, solvent accessibilities around the sites and their positions. The method was evaluated on a domain benchmark by 10-fold cross-validation and 60% of true domain boundaries can be recalled at a precision of 60%. The trade-off between the precision and recall can be adjusted according to specific needs by using different decision thresholds on the domain boundary scores assigned by the support vector machines. Conclusions The good prediction accuracy and the flexibility of selecting domain boundary sites at different precision and recall values make our method a useful tool for protein structure determination and modelling. The method is available at http://sysbio.rnet.missouri.edu/dobo/.

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Structural Biology

Link

https://link.springer.com/content/pdf/10.1186/1471-2105-12-43.pdf

Reference49 articles.

1. Wetlaufer DB: Nucleation, rapid folding, and globular intrachain regions in proteins. Proc Natl Acad Sci USA 1973, 70: 697–701. 10.1073/pnas.70.3.697

2. Ponting CP, Russell RR: The natural history of protein domains. Annu Rev Biophys Biomol Struct 2002, 31: 45–71. 10.1146/annurev.biophys.31.082901.134314

3. Folkers GE, van Buuren BN, Kaptein R: Expression screening, protein purification and NMR analysis of human protein domains for structural genomics. J Struct Funct Genomics 2004, 5: 119–131. 10.1023/B:JSFG.0000029200.66197.0c

4. Hondoh T, Kato A, Yokoyama S, Kuroda Y: Computer-aided NMR assay for detecting natively folded structural domains. Protein Sci 2006, 15: 871–883. 10.1110/ps.051880406

5. Kim DE, Chivian D, Malmstrom L, Baker D: Automated prediction of domain boundaries in CASP6 targets using Ginzu and RosettaDOM. Proteins 2005, 61(Suppl 7):193–200. 10.1002/prot.20737

Cited by 52 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. In-silico screening of missense nsSNPs in Delta-opioid receptor protein and their restoring tendency on MCRT interaction; focusing on dynamic nature;International Journal of Biological Macromolecules;2024-08

2. DomBpred: Protein Domain Boundary Prediction Based on Domain-Residue Clustering Using Inter-Residue Distance;IEEE/ACM Transactions on Computational Biology and Bioinformatics;2023-03-01

3. BERTDOM: PROTEIN DOMAIN BOUNDARY PREDICTION USING BERT;COMPUT INFORM;2023

4. I-TASSER-MTD: a deep-learning-based platform for multi-domain protein structure and function prediction;Nature Protocols;2022-08-05

5. Multi-head attention-based U-Nets for predicting protein domain boundaries using 1D sequence features and 2D distance maps;BMC Bioinformatics;2022-07-19