Analysis of substructural variation in families of enzymatic proteins with applications to protein function prediction

Published:2010-05-11 Issue:1 Volume:11 Page:
ISSN:1471-2105
Container-title:BMC Bioinformatics
language:en
Short-container-title:BMC Bioinformatics

Author:

Bryant Drew H, Moll Mark, Chen Brian Y, Fofanov Viacheslav Y, Kavraki Lydia E

Abstract

Background: Structural variations caused by a wide range of physico-chemical and biological sources directly influence the function of a protein. For enzymatic proteins, the structure and chemistry of the catalytic binding site residues can be loosely defined as a substructure of the protein. Comparative analysis of drug-receptor substructures across and within species has been used for lead evaluation. Substructure-level similarity between the binding sites of functionally similar proteins has also been used to identify instances of convergent evolution among proteins. In functionally homologous protein families, shared chemistry and geometry at catalytic sites provide a common, local point of comparison among proteins that may differ significantly at the sequence, fold, or domain topology levels.

Results: This paper describes two key results that can be used separately or in combination for protein function analysis. The Family-wise Analysis of SubStructural Templates (FASST) method uses all-against-all substructure comparison to determine Substructural Clusters (SCs). SCs characterize the binding site substructural variation within a protein family. In this paper we focus on examples of automatically determined SCs that can be linked to phylogenetic distance between family members, segregation by conformation, and organization by homology among convergent protein lineages. The Motif Ensemble Statistical Hypothesis (MESH) framework constructs a representative motif for each protein cluster among the SCs determined by FASST to build motif ensembles that are shown through a series of function prediction experiments to improve the function prediction power of existing motifs.

Conclusions: FASST contributes a critical feedback and assessment step to existing binding site substructure identification methods and can be used for the thorough investigation of structure-function relationships. The application of MESH allows for an automated, statistically rigorous procedure for incorporating structural variation data into protein function prediction pipelines. Our work provides an unbiased, automated assessment of the structural variability of identified binding site substructures among protein structure families and a technique for exploring the relation of substructural variation to protein function. As available proteomic data continues to expand, the techniques proposed will be indispensable for the large-scale analysis and interpretation of structural data.
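The abstract names the two steps (all-against-all substructure comparison to form clusters, then one representative per cluster) but gives no algorithmic detail. The following is only a minimal sketch of that general idea, not the FASST or MESH procedure itself. Everything in it is an assumption made for illustration: the toy coordinates, the use of plain RMSD without superposition as the distance, the single-linkage threshold clustering, and the choice of the medoid as the cluster representative.

```python
import math
from itertools import combinations

def rmsd(a, b):
    """RMSD between two equal-length point lists (assumes a fixed
    point-to-point correspondence and no superposition step)."""
    return math.sqrt(sum(math.dist(p, q) ** 2 for p, q in zip(a, b)) / len(a))

def all_against_all(sites):
    """Pairwise distance table keyed by (name_i, name_j), both orders."""
    d = {(n, n): 0.0 for n in sites}
    for m, n in combinations(sites, 2):
        d[(m, n)] = d[(n, m)] = rmsd(sites[m], sites[n])
    return d

def cluster(sites, d, threshold):
    """Single-linkage clustering: join any two sites closer than threshold."""
    parent = {n: n for n in sites}

    def find(n):
        # Union-find root lookup with path halving.
        while parent[n] != n:
            parent[n] = parent[parent[n]]
            n = parent[n]
        return n

    for m, n in combinations(sites, 2):
        if d[(m, n)] < threshold:
            parent[find(m)] = find(n)
    groups = {}
    for n in sites:
        groups.setdefault(find(n), []).append(n)
    return sorted(sorted(g) for g in groups.values())

def medoid(members, d):
    """Member with the smallest summed distance to the rest of its cluster."""
    return min(members, key=lambda m: sum(d[(m, n)] for n in members))

# Hypothetical three-point "binding site" substructures (toy data).
sites = {
    "A1": [(0, 0, 0), (1, 0, 0), (0, 1, 0)],
    "A2": [(0.1, 0, 0), (1.1, 0, 0), (0, 1.1, 0)],
    "A3": [(0, 0.1, 0), (1, 0.1, 0), (0.1, 1, 0)],
    "B1": [(0, 0, 0), (3, 0, 0), (0, 3, 0)],
    "B2": [(0, 0, 0.2), (3, 0, 0.2), (0, 3.1, 0)],
}

distances = all_against_all(sites)
clusters = cluster(sites, distances, threshold=0.5)
representatives = [medoid(c, distances) for c in clusters]
print(clusters, representatives)
```

A real pipeline would replace the toy distance with a proper substructure comparison (including optimal superposition and chemical labels) and would choose the clustering method and threshold on statistical grounds; none of those choices can be inferred from the abstract alone.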

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics, Computer Science Applications, Molecular Biology, Biochemistry, Structural Biology

Link

https://link.springer.com/content/pdf/10.1186/1471-2105-11-242.pdf

References: 80 articles.

1. Meng EC, Polacco BJ, Babbitt PC: Superfamily active site templates. Proteins 2004, 55(4):962–976. 10.1002/prot.20099

2. Pegg SCH, Brown SD, Ojha S, Seffernick J, Meng EC, Morris JH, Chang PJ, Huang CC, Ferrin TE, Babbitt PC: Leveraging enzyme structure-function relationships for functional inference and experimental design: the structure-function linkage database. Biochemistry 2006, 45(8):2545–2555. 10.1021/bi052101l

3. Rognan D: Chemogenomic approaches to rational drug design. British Journal of Pharmacology 2007, 152: 38–52. 10.1038/sj.bjp.0707307

4. Klabunde T: Chemogenomic approaches to drug discovery: similar receptors bind similar ligands. British Journal of Pharmacology 2007, 152: 5–7. 10.1038/sj.bjp.0707308

5. Hendrickson W: Impact of structures from the Protein Structure Initiative. Structure 2007, 15(12):1528–1529. 10.1016/j.str.2007.11.006

Cited by 12 articles.

1. CrossDome: an interactive R package to predict cross-reactivity risk using immunopeptidomics databases;Frontiers in Immunology;2023-06-12

2. Large-Scale Structure-Based Screening of Potential T Cell Cross-Reactivities Involving Peptide-Targets From BCG Vaccine and SARS-CoV-2;Frontiers in Immunology;2022-01-13

3. Explaining Small Molecule Binding Specificity with Volumetric Representations of Protein Binding Sites;Algorithms and Methods in Structural Bioinformatics;2022

4. FunHoP: Enhanced Visualization and Analysis of Functionally Homologous Proteins in Complex Metabolic Networks;Genomics, Proteomics & Bioinformatics;2021-03

5. Interpreting T-Cell Cross-reactivity through Structure: Implications for TCR-Based Cancer Immunotherapy;Frontiers in Immunology;2017-10-04