A generic method for assignment of reliability scores applied to solvent accessibility predictions-Reference-Cited by-同舟云学术

A generic method for assignment of reliability scores applied to solvent accessibility predictions

Published:2009-07-31 Issue:1 Volume:9 Page:
ISSN:1472-6807
Container-title:BMC Structural Biology
language:en
Short-container-title:BMC Struct Biol

Author:

Petersen Bent,Petersen Thomas Nordahl,Andersen Pernille,Nielsen Morten,Lundegaard Claus

Abstract

Abstract Background Estimation of the reliability of specific real value predictions is nontrivial and the efficacy of this is often questionable. It is important to know if you can trust a given prediction and therefore the best methods associate a prediction with a reliability score or index. For discrete qualitative predictions, the reliability is conventionally estimated as the difference between output scores of selected classes. Such an approach is not feasible for methods that predict a biological feature as a single real value rather than a classification. As a solution to this challenge, we have implemented a method that predicts the relative surface accessibility of an amino acid and simultaneously predicts the reliability for each prediction, in the form of a Z-score. Results An ensemble of artificial neural networks has been trained on a set of experimentally solved protein structures to predict the relative exposure of the amino acids. The method assigns a reliability score to each surface accessibility prediction as an inherent part of the training process. This is in contrast to the most commonly used procedures where reliabilities are obtained by post-processing the output. Conclusion The performance of the neural networks was evaluated on a commonly used set of sequences known as the CB513 set. An overall Pearson's correlation coefficient of 0.72 was obtained, which is comparable to the performance of the currently best public available method, Real-SPINE. Both methods associate a reliability score with the individual predictions. However, our implementation of reliability scores in the form of a Z-score is shown to be the more informative measure for discriminating good predictions from bad ones in the entire range from completely buried to fully exposed amino acids. This is evident when comparing the Pearson's correlation coefficient for the upper 20% of predictions sorted according to reliability. For this subset, values of 0.79 and 0.74 are obtained using our and the compared method, respectively. This tendency is true for any selected subset.

Publisher

Springer Science and Business Media LLC

Subject

Structural Biology

Link

https://link.springer.com/content/pdf/10.1186/1472-6807-9-51.pdf

Reference37 articles.

1. Lundegaard C, Lund O, Kesmir C, Brunak S, Nielsen M: Modeling the adaptive immune system: predictions and simulations. Bioinformatics 2007, 23(24):3265–3275. 10.1093/bioinformatics/btm471

2. Rost B: PHD: predicting one-dimensional protein structure by profile-based neural networks. Methods Enzymol 1996, 266: 525–539. full_text

3. Connolly M: Analytical molecular surface calculation. Journal of Applied Crystallography 1983, 16(5):548–558. 10.1107/S0021889883010985

4. Chothia C: The nature of the accessible and buried surfaces in proteins. J Mol Biol 1976, 105(1):1–12. 10.1016/0022-2836(76)90191-1

5. Ahmad S, Gromiha MM, Sarai A: Real value prediction of solvent accessibility from amino acid sequence. Proteins 2003, 50(4):629–635. 10.1002/prot.10328

Cited by 550 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Centrosomal Protein 55 Regulates Chromosomal Instability in Cancer Cells by Controlling Microtubule Dynamics;Cells;2024-08-20

2. Zika virus non-structural proteins B-cell epitope mapping in mother-newborn immune interaction;2024-06-18

3. Reliability evaluation of individual predictions: a data-centric approach;The VLDB Journal;2024-05-30

4. Systematic Investigation of the Trafficking of Glycoproteins on the Cell Surface;Molecular & Cellular Proteomics;2024-05

5. Nitrate transporter protein NPF5.12 and major latex-like protein MLP6 are important defense factors against Verticillium longisporum;Journal of Experimental Botany;2024-04-26