Detection of discriminative sequence patterns in the neighborhood of proline cis peptide bonds and their functional annotation

Published:2009-04-20 Issue:1 Volume:10 Page:
ISSN:1471-2105
Container-title:BMC Bioinformatics
language:en
Short-container-title:BMC Bioinformatics

Author:

Exarchos Konstantinos P, Exarchos Themis P, Papaloukas Costas, Troganis Anastassios N, Fotiadis Dimitrios I

Abstract

Background: Polypeptides are composed of amino acids covalently linked by peptide bonds. The majority of peptide bonds in proteins occur in the trans conformation. Despite their infrequent occurrence, cis peptide bonds play a key role in protein structure and function, as well as in many significant biological processes.

Results: We perform a systematic analysis of regions in protein sequences that contain a proline cis peptide bond in order to discover non-random associations between the primary sequence and the nature of proline cis/trans isomerization. For this purpose, an efficient pattern discovery algorithm is employed, which discovers regular expression-type patterns that are overrepresented (i.e. appear frequently repeated) in a set of sequences. Four types of pattern discovery are performed: i) exact pattern discovery, ii) pattern discovery using a chemical equivalency set, iii) pattern discovery using a structural equivalency set and iv) pattern discovery using certain amino acids' physicochemical properties. The extracted patterns are carefully validated using a specially implemented scoring function and a significance measure (i.e. log-probability estimate) indicative of their specificity. The score threshold is 0.90 for the first three types of pattern discovery and 0.80 for the last type. Regarding the significance measure, all patterns yielded values in the range [-9, -31], which ensures that the derived patterns are highly unlikely to have emerged by chance. Most of the highest-scoring patterns are consistent with previous investigations concerning the neighborhood of cis proline peptide bonds, and many new ones are identified. Finally, the extracted patterns are systematically compared against the PROSITE database, in order to gain insight into the functional implications of cis prolyl bonds.

Conclusion: Cis patterns with matches in the PROSITE database fell mostly into two main functional clusters: family signatures and protein signatures. However, a considerable propensity was also observed for targeting signals, active and phosphorylation sites, as well as domain signatures.

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics, Computer Science Applications, Molecular Biology, Biochemistry, Structural Biology

Link

https://link.springer.com/content/pdf/10.1186/1471-2105-10-113.pdf

References: 32 articles.

1. Stewart DE, Sarkar A, Wampler JE: Occurrence and role of cis peptide bonds in protein structures. Journal of molecular biology 1990, 214(1):253–260. 10.1016/0022-2836(90)90159-J

2. Weiss MS, Jabs A, Hilgenfeld R: Peptide bonds revisited. Nature structural biology 1998, 5(8):676. 10.1038/1368

3. Lu KP, Finn G, Lee TH, Nicholson LK: Prolyl cis-trans isomerization as a molecular timer. Nature chemical biology 2007, 3(10):619–629. 10.1038/nchembio.2007.35

4. Lorenzen S, Peters B, Goede A, Preissner R, Frommel C: Conservation of cis prolyl bonds in proteins during evolution. Proteins 2005, 58(3):589–595. 10.1002/prot.20342

5. Pal D, Chakrabarti P: Cis peptide bonds in proteins: residues involved, their conformations, interactions and locations. Journal of molecular biology 1999, 294(1):271–288. 10.1006/jmbi.1999.3217

Cited by 16 articles.

1. Conformation- and phosphorylation-dependent electron tunnelling across self-assembled monolayers of tau peptides;Journal of Colloid and Interface Science;2022-01

2. Getting to Know Your Neighbor: Protein Structure Prediction Comes of Age with Contextual Machine Learning;Journal of Computational Biology;2020-05-01

3. Complete genome sequence of human T-cell lymphotropic type 1 from patients with different clinical profiles, including infective dermatitis;Infection, Genetics and Evolution;2020-04

4. Subcellular localization of mutated β‐catenins with different incidences of cis‐peptide bonds at the Xaa246‐P247 site in HepG2 cells;The FASEB Journal;2019-02-26

5. Detecting Proline and Non-Proline Cis Isomers in Protein Structures from Sequences Using Deep Residual Ensemble Learning;Journal of Chemical Information and Modeling;2018-08-17