In-depth performance evaluation of PFP and ESG sequence-based function prediction methods in CAFA 2011 experiment

Published: 2013-02 Issue: S3 Volume: 14
ISSN: 1471-2105
Container-title: BMC Bioinformatics
Language: en
Short-container-title: BMC Bioinformatics

Author:

Chitale Meghana, Khan Ishita K, Kihara Daisuke

Abstract

Background: Many Automatic Function Prediction (AFP) methods have been developed to cope with the rapid growth in the number of gene sequences available from high-throughput sequencing experiments. To support the development of AFP methods, it is essential to have community-wide experiments that evaluate the performance of existing methods. Critical Assessment of Function Annotation (CAFA) is one such community experiment. The CAFA meeting was held as a Special Interest Group (SIG) meeting at the Intelligent Systems for Molecular Biology (ISMB) conference in 2011. Here, we perform a detailed analysis of two sequence-based function prediction methods developed in our lab, PFP and ESG, using the predictions submitted to CAFA.

Results: We evaluate PFP and ESG using four different measures, in comparison with BLAST, Prior, and GOtcha. In addition to the predictions submitted to CAFA, we further investigate the performance of a different scoring function for ranking PFP predictions, as well as PFP/ESG predictions enriched with Priors, which simply adds frequently occurring Gene Ontology terms to the predictions. The prediction accuracy of each method was also evaluated separately for different functional categories. Successful and unsuccessful predictions by PFP and ESG are also discussed in comparison with BLAST.

Conclusion: The in-depth analysis discussed here will complement the overall assessment by the CAFA organizers. Since PFP and ESG are based on sequence database search results, our analyses are not only useful for PFP and ESG users but will also shed light on the relationship between the sequence similarity space and the functions that can be inferred from sequences.

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics, Computer Science Applications, Molecular Biology, Biochemistry, Structural Biology

Link

https://link.springer.com/content/pdf/10.1186/1471-2105-14-S3-S2.pdf

Reference: 54 articles.

1. Kanehisa M, Goto S: KEGG: Kyoto encyclopedia of genes and genomes. Nucleic acids research. 2000, 28: 27-30. 10.1093/nar/28.1.27.

2. Bujnicki JM: Prediction of protein structures, functions, and interactions. 2009, Wiley Online Library

3. Chitale M, Kihara D: Computational protein function prediction: Framework and challenges. Protein function prediction for omics era. Edited by: Kihara D. Springer Verlag. 2011, 1-17.

4. Eisenberg D, Marcotte EM, Xenarios I, Yeates TO: Protein function in the post-genomic era. Nature. 2000, 405: 823-826. 10.1038/35015694.

5. Friedberg I: Automated protein function prediction--the genomic challenge. Briefings in bioinformatics. 2006, 7: 225-242. 10.1093/bib/bbl004.

Cited by 4 articles.

1. Phylo-PFP: improved automated protein function prediction using phylogenetic distance of distantly related sequences;Bioinformatics;2018-08-25

2. Using PFP and ESG Protein Function Prediction Web Servers;Methods in Molecular Biology;2017

3. Integrated protein function prediction by mining function associations, sequences, and protein–protein and gene–gene interaction networks;Methods;2016-01

4. The PFP and ESG protein function prediction methods in 2014: effect of database updates and ensemble approaches;GigaScience;2015-09-14