Structural footprinting in protein structure comparison: the impact of structural fragments-Reference-Cited by-同舟云学术

Structural footprinting in protein structure comparison: the impact of structural fragments

Published:2007-08-09 Issue:1 Volume:7 Page:
ISSN:1472-6807
Container-title:BMC Structural Biology
language:en
Short-container-title:BMC Struct Biol

Author:

Zotenko Elena,Islamaj Dogan Rezarta,Wilbur W John,O'Leary Dianne P,Przytycka Teresa M

Abstract

Abstract Background One approach for speeding-up protein structure comparison is the projection approach, where a protein structure is mapped to a high-dimensional vector and structural similarity is approximated by distance between the corresponding vectors. Structural footprinting methods are projection methods that employ the same general technique to produce the mapping: first select a representative set of structural fragments as models and then map a protein structure to a vector in which each dimension corresponds to a particular model and "counts" the number of times the model appears in the structure. The main difference between any two structural footprinting methods is in the set of models they use; in fact a large number of methods can be generated by varying the type of structural fragments used and the amount of detail in their representation. How do these choices affect the ability of the method to detect various types of structural similarity? Results To answer this question we benchmarked three structural footprinting methods that vary significantly in their selection of models against the CATH database. In the first set of experiments we compared the methods' ability to detect structural similarity characteristic of evolutionarily related structures, i.e., structures within the same CATH superfamily. In the second set of experiments we tested the methods' agreement with the boundaries imposed by classification groups at the Class, Architecture, and Fold levels of the CATH hierarchy. Conclusion In both experiments we found that the method which uses secondary structure information has the best performance on average, but no one method performs consistently the best across all groups at a given classification level. We also found that combining the methods' outputs significantly improves the performance. Moreover, our new techniques to measure and visualize the methods' agreement with the CATH hierarchy, including the threshholded affinity graph, are useful beyond this work. In particular, they can be used to expose a similar composition of different classification groups in terms of structural fragments used by the method and thus provide an alternative demonstration of the continuous nature of the protein structure universe.

Publisher

Springer Science and Business Media LLC

Subject

Structural Biology

Link

https://link.springer.com/content/pdf/10.1186/1472-6807-7-53.pdf

Reference28 articles.

1. Zotenko E, O'Leary D, Przytycka T: Secondary structure spatial conformation footprint: a novel method for fast protein structure comparison and classification. BMC Struct Biol 2006, 6: 12. 10.1186/1472-6807-6-12

2. Holm L, Sander C: Protein structure comparison by alignment of distance matrices. J Mol Biol 1993, 233: 123–138. 10.1006/jmbi.1993.1489

3. Rogen P, Fain B: Automatic classification of protein structure by using Gauss integrals. Proc Natl Acad Sci USA 2003, 100: 119–124. 10.1073/pnas.2636460100

4. Bostick D, Shen M, Vaisman I: A simple topological representation of protein structure: implications for new, fast, and robust structural classification. Proteins 2004, 56(3):487–501. 10.1002/prot.20146

5. Choi I, Kwon J, Kim S: Local feature frequency profile: a method to measure structural similarity in proteins. Proc Natl Acad Sci USA 2004, 101: 3797–3802. 10.1073/pnas.0308656100

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Motifs and structural blocks retrieval by GHT;The European Physical Journal Plus;2014-06

2. A Method of Protein Model Classification and Retrieval Using Bag-of-Visual-Features;Computational and Mathematical Methods in Medicine;2014

3. Assessment of CASP10 contact-assisted predictions;Proteins: Structure, Function, and Bioinformatics;2013-10-17

4. Protein motifs retrieval by SS terns occurrences;Pattern Recognition Letters;2013-04

5. Structural Blocks Retrieval in Macromolecules: Saliency and Precision Aspects;New Trends in Image Analysis and Processing – ICIAP 2013;2013