PASS2-Reference-Cited by-同舟云学术

PASS2

Published:2011-10 Issue:4 Volume:2 Page:53-66
ISSN:1947-9115
Container-title:International Journal of Knowledge Discovery in Bioinformatics
language:en
Short-container-title:

Author:

Kanagarajadurai Karuppiah¹,Kalaimathy Singaravelu²,Nagarajan Paramasivam³,Sowdhamini Ramanathan⁴

Affiliation:

1. Madurai Kamaraj University, India

2. Biotechnologisches Zentrum, Germany

3. Max Planck Institute for Developmental Biology, Germany

4. National Centre for Biological Sciences, India

Abstract

A detailed comparison of protein domains that belong to families and superfamilies shows that structure is better conserved than sequence during evolutionary divergence. Sequence alignments, guided by structural features, permit a better sampling of the protein sequence space and effective construction of libraries for fold recognition. Sequence alignments are useful evolutionary models in defining structure-function relationships for protein superfamilies. The PASS2 database, maintained by the authors, presents alignments of proteins related at the superfamily level and characterised by low sequence similarity. The number of new superfamilies increased to 47% compared with the previous PASS2 version, which shows the crucial importance of updating the PASS2 database. In the current release of the PASS2 database, they align protein superfamilies using a structural alignment protocol. The authors also introduce two alignment assessment methods that depend on the average structural deviations of domains and the extent of conserved secondary structures. They also integrate new and important structural and sequence features at the superfamily level into the database. These features are conserved-unconserved blocks in proteins, spatial distribution of sequences using principal component analysis and a statistical view for each superfamily. The authors suggest that highly structurally deviant superfamily members could be removed as outliers, so that such extreme distant relationships will not obscure the alignment. They report a nearly-automated, updated version of the superfamily alignment database, consisting of 1776 superfamilies and 9536 protein domains, that is in direct correspondence with the SCOP (1.73) database.

Publisher

IGI Global

Reference34 articles.

1. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs

2. Data growth and its impact on the SCOP database: new developments

3. Protein Structure Prediction and Structural Genomics

4. Hidden Markov models of biological primary sequence information.

5. Berman, H. M., Bhat, T. N., Bourne, P. E., Feng, Z., Gilliland, G., Weissig, H., … Westbrook, J. (2000). The Protein Data Bank and the challenge of structural genomics. Nature Structural Biology, 7(S), 957-959.

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. PASS2.7: a database containing structure-based sequence alignments and associated features of protein domain superfamilies from SCOPe;Database;2022-01-01

2. PASS2 version 6: a database of structure-based sequence alignments of protein domain superfamilies in accordance with SCOPe;Database;2019-01-01

3. PASS2 database for the structure-based sequence alignment of distantly related SCOP domain superfamilies: update to version 5 and added features;Nucleic Acids Research;2015-11-08

4. Rebelling for a Reason: Protein Structural “Outliers”;PLoS ONE;2013-09-20