Affiliation:
1. Department of Computer Science, University of Connecticut, United States
Abstract
Motifs are biologically significant patterns found in DNA/protein sequences. Given a set of biological sequences, the problem of identifying the motifs is very challenging. This problem has been well studied in computational biology. Identifying motifs through experimental processes is extremely expensive and time consuming. This is one of the factors influencing computational biologists to come up with novel computational methods to predict motifs. Several motif models have been proposed in the literature and for each model numerous algorithms have been devised. Three popular motif models are (l, d)-motif search or Planted Motif Search (PMS), Simple Motif Search (SMS), and Edit-distance based Motif Search (EMS). For PMS and SMS several algorithms have been proposed and implemented. On the other hand, even though some algorithms exist in the literature for the problem of EMS, no implementations of these algorithms are known. This is mainly because the proposed algorithms are complex. In this paper we present an elegant algorithm for EMS. We have implemented this algorithm and compared it against 14 other algorithms in terms of sensitivity and specificity. Our experimental results indicate that the new algorithm is very competitive in practice.
Publisher
World Scientific Pub Co Pte Lt
Subject
Computer Science (miscellaneous)
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献