Abstract
INTRODUCTIONIt is difficult to find a global optimal alignment of more than two sequences (and, especially, more than three) that includes matches, mismatches, and gaps and that takes into account the degree of variation in all of the sequences at the same time. Thus, approximate methods are used, such as progressive global alignment, iterative global alignment, alignments based on locally conserved patterns found in the same order in the sequences, statistical methods that generate probabilistic models of the sequences, and multiple sequence alignments produced by graph-based methods. When 10 or more sequences are being compared, it is common to begin by determining sequence similarities between all pairs of sequences in the set. A variety of methods are then available to cluster the sequences into the most related groups or into a phylogenetic tree. This article discusses several of these methods and provides data that compare their utility under various conditions.
Publisher
Cold Spring Harbor Laboratory
Subject
General Biochemistry, Genetics and Molecular Biology
Reference27 articles.
1. The AMPS package for multiple protein sequence alignment. Computer analysis of sequence data. Part II;Barton;Methods Mol Biol,1994
2. Analysis of conserved domains and sequence motifs in cellular regulatory proteins and locus control regions using software tools for multiple alignment and visualization;Boguski;New Biol,1992
3. Comparative analysis of seven multiple protein sequence alignment servers: clues to enhance reliability of predictions
4. Multiple sequence alignment with hierarchical clustering
5. [21] Progressive alignment of amino acid sequences and construction of phylogenetic trees from them
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献