Using Hidden Markov Models to Align Multiple Sequences

Published:2009-07 Issue:7 Volume:2009 Page:pdb.top41
ISSN:1940-3402
Container-title:Cold Spring Harbor Protocols
language:en
Short-container-title:Cold Spring Harb Protoc

Author:

David W. Mount

Abstract

INTRODUCTION

A hidden Markov model (HMM) is a probabilistic model of a multiple sequence alignment (msa) of proteins. In the model, each column of symbols in the alignment is represented by a frequency distribution of the symbols (called a “state”), and insertions and deletions are represented by other states. To match a given sequence, one moves through the model along a particular path from state to state in a Markov chain (i.e., the next move is chosen at random and depends only on the current state). At each state, the next matching symbol is chosen, and two values are recorded: the probability (frequency) of that symbol in the state and the probability of reaching that state from the previous one (the transition probability). The state and transition probabilities along the path are multiplied to obtain the probability of the given sequence. The model is “hidden” because the value of a specific state is not known; it is represented instead by a probability distribution over all possible values. This article discusses the advantages and disadvantages of HMMs in msa and presents algorithms for calculating an HMM and the conditions for producing the best HMM.
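The multiplication of state and transition probabilities described in the abstract can be illustrated with a small sketch. Everything in it is hypothetical and chosen only for illustration: the two-column nucleotide-style alphabet, the state names (BEGIN, M1, I1, M2, END), the probability values, and the function name path_probability do not come from the article. Deletion states are left out to keep the example short.

```python
# Toy profile HMM (all values are illustrative assumptions, not from the article).
# Each emitting state holds a frequency distribution over symbols.
EMISSIONS = {
    "M1": {"A": 0.8, "G": 0.2},                          # match state, column 1
    "M2": {"C": 0.6, "T": 0.4},                          # match state, column 2
    "I1": {"A": 0.25, "C": 0.25, "G": 0.25, "T": 0.25},  # insert state
}

# Transition probabilities between states; outgoing values sum to 1 per state.
TRANSITIONS = {
    ("BEGIN", "M1"): 1.0,
    ("M1", "M2"): 0.8,
    ("M1", "I1"): 0.2,
    ("I1", "I1"): 0.3,
    ("I1", "M2"): 0.7,
    ("M2", "END"): 1.0,
}


def path_probability(sequence, path):
    """Multiply transition and emission probabilities along one path.

    `path` lists one emitting state per symbol of `sequence`.
    A transition or emission missing from the tables counts as probability 0.
    """
    if len(sequence) != len(path):
        raise ValueError("path must name one emitting state per symbol")
    prob = 1.0
    previous = "BEGIN"
    for symbol, state in zip(sequence, path):
        prob *= TRANSITIONS.get((previous, state), 0.0)  # move to the state
        prob *= EMISSIONS[state].get(symbol, 0.0)        # emit the symbol
        previous = state
    prob *= TRANSITIONS.get((previous, "END"), 0.0)      # leave the model
    return prob


# Two symbols matched to the two columns.
p_match = path_probability("AC", ["M1", "M2"])
# Same columns with one extra symbol absorbed by the insert state.
p_insert = path_probability("AGC", ["M1", "I1", "M2"])
print(p_match, p_insert)
```

This sketch scores a sequence along one chosen path only. Summing over all possible paths, or searching for the most probable one, requires the dynamic programming algorithms covered in the article itself.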

Publisher

Cold Spring Harbor Laboratory

Subject

General Biochemistry, Genetics and Molecular Biology

Cited by 15 articles.

1. Amplification refractory mutation system based real-time PCR (ARMS-qPCR) for rapid resistance characterization of Tribolium castaneum to phosphine;Pesticide Biochemistry and Physiology;2022-10

2. In silico testing of flavonoids as potential inhibitors of protease and helicase domains of dengue and Zika viruses;PeerJ;2022-08-04

3. Improving the Annotation of the Venom Gland Transcriptome of Pamphobeteus verdolaga, Prospecting Novel Bioactive Peptides;Toxins;2022-06-15

4. Genome-Wide Identification and Transcriptional Analysis of Arabidopsis DUF506 Gene Family;International Journal of Molecular Sciences;2021-10-23

5. Protein Analysis: From Sequence to Structure;Advances in Bioinformatics;2021