Affiliation:
1. School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
2. School of Life Science and Technology, Harbin Institute of Technology, Harbin, China
3. School of Computer Science, Northwestern Polytechnical University, Xi’an, China
Abstract
Background:
Multiple Sequence Alignment (MSA) is a fundamental task in bioinformatics
and is required for many biological analysis tasks. The more accurate the alignments are, the
more credible the downstream analyses. Most protein MSA algorithms realign an alignment to
refine it by dividing it into two groups horizontally and then realign the two groups. However, this
strategy does not consider that different regions of the sequences have different conservation; this
property may lead to incorrect residue-residue or residue-gap pairs, which cannot be corrected by
this strategy.
Objective:
In this article, our motivation is to develop a novel refinement method based on splitting-
splicing vertically.
Method:
Here, we present a novel refinement method based on splitting-splicing vertically, called
SpliVert. For an alignment, we split it vertically into 3 parts, remove the gap characters in the middle,
realign the middle part alone, and splice the realigned middle parts with the other two initial
pieces to obtain a refined alignment. In the realign procedure of our method, the aligner will only
focus on a certain part, ignoring the disturbance of the other parts, which could help fix the incorrect
pairs.
Results:
We tested our refinement strategy for 2 leading MSA tools on 3 standard benchmarks,
according to the commonly used average SP (and TC) score. The results show that given appropriate
proportions to split the initial alignment, the average scores are increased comparably or slightly
after using our method. We also compared the alignments refined by our method with alignments
directly refined by the original alignment tools. The results suggest that using our SpliVert method
to refine alignments can also outperform direct use of the original alignment tools.
Conclusion:
The results reveal that splitting vertically and realigning part of the alignment is a
good strategy for the refinement of protein multiple sequence alignments.
Funder
National Key R&D Program of China
Natural Science Foundation of Heilongjiang Province
National Nature Science Foundation of China
Fundamental Research Funds for the Central Universities
Publisher
Bentham Science Publishers Ltd.
Subject
Biochemistry,General Medicine,Structural Biology
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献