Protein secondary structure prediction for a single-sequence using hidden semi-Markov models-Reference-Cited by-同舟云学术

Protein secondary structure prediction for a single-sequence using hidden semi-Markov models

Published:2006-03-30 Issue:1 Volume:7 Page:
ISSN:1471-2105
Container-title:BMC Bioinformatics
language:en
Short-container-title:BMC Bioinformatics

Author:

Aydin Zafer,Altunbasak Yucel,Borodovsky Mark

Abstract

Abstract Background The accuracy of protein secondary structure prediction has been improving steadily towards the 88% estimated theoretical limit. There are two types of prediction algorithms: Single-sequence prediction algorithms imply that information about other (homologous) proteins is not available, while algorithms of the second type imply that information about homologous proteins is available, and use it intensively. The single-sequence algorithms could make an important contribution to studies of proteins with no detected homologs, however the accuracy of protein secondary structure prediction from a single-sequence is not as high as when the additional evolutionary information is present. Results In this paper, we further refine and extend the hidden semi-Markov model (HSMM) initially considered in the BSPSS algorithm. We introduce an improved residue dependency model by considering the patterns of statistically significant amino acid correlation at structural segment borders. We also derive models that specialize on different sections of the dependency structure and incorporate them into HSMM. In addition, we implement an iterative training method to refine estimates of HSMM parameters. The three-state-per-residue accuracy and other accuracy measures of the new method, IPSSP, are shown to be comparable or better than ones for BSPSS as well as for PSIPRED, tested under the single-sequence condition. Conclusions We have shown that new dependency models and training methods bring further improvements to single-sequence protein secondary structure prediction. The results are obtained under cross-validation conditions using a dataset with no pair of sequences having significant sequence similarity. As new sequences are added to the database it is possible to augment the dependency structure and obtain even higher accuracy. Current and future advances should contribute to the improvement of function prediction for orphan proteins inscrutable to current similarity search methods.

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Structural Biology

Link

https://link.springer.com/content/pdf/10.1186/1471-2105-7-178.pdf

Reference67 articles.

1. Jones DT: Protein secondary structure prediction based on position-specific scoring matrices. J Mol Biol 1999, 292: 195–202. 10.1006/jmbi.1999.3091

2. Raghava GPS: APSSP2: Protein secondary structure prediction using nearest neighbor and neural network approach. CASP4 2000, 75–76.

3. Pollastri G, Przybylski D, Rost B, Baldi P: Improving the Prediction of Protein Secondary Structure in Three and Eight Classes using Recurrent Neural Networks and Profiles. Proteins 2002, 47: 228–235. 10.1002/prot.10082

4. Cuff JA, Barton GJ: Application of multiple sequence alignment profiles to improve protein secondary structure prediction. Proteins 2000, 40: 502–511. 10.1002/1097-0134(20000815)40:3<502::AID-PROT170>3.0.CO;2-Q

5. Meiler J, Mueller M, Zeidler A, Schmaeschke F: Generation and evaluation of dimension-reduced amino acid parameter representations by artificial neural networks. J Mol Model 2001, 7: 360–369. 10.1007/s008940100038

Cited by 76 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. SERT-StructNet: Protein secondary structure prediction method based on multi-factor hybrid deep model;Computational and Structural Biotechnology Journal;2024-12

2. Transformer Encoder with Protein Language Model for Protein Secondary Structure Prediction;Engineering, Technology & Applied Science Research;2024-04-02

3. Protein encoder: An autoencoder-based ensemble feature selection scheme to predict protein secondary structure;Expert Systems with Applications;2023-03

4. Different methods, techniques and their limitations in protein structure prediction: A review;Progress in Biophysics and Molecular Biology;2022-09

5. 1–4D Protein Structures Prediction Using Machine Learning and Deep Learning from Amino Acid Sequences;Proceedings of the Third International Conference on Information Management and Machine Intelligence;2022-08-04