Affiliation:
1. Max Planck Institute für Informatik and Institute of Computer Science, University of Wrocław, Wrocław, Poland
Abstract
In this article, a fully compressed pattern matching problem is studied. The compression is represented by straight-line programs (SLPs)—that is, context-free grammars generating exactly one string; the term
fully
means that both the pattern
and
the text are given in the compressed form. The problem is approached using a recently developed technique of local recompression: the SLPs are refactored so that substrings of the pattern and text are encoded in both SLPs in the same way. To this end, the SLPs are locally decompressed and then recompressed in a uniform way.
This technique yields an
O
((
n
+
m
) log
M
) algorithm for compressed pattern matching, assuming that
M
fits in
O
(1) machine words, where
n
(
m
) is the size of the compressed representation of the text (pattern, respectively), and
M
is the size of the decompressed pattern. If only
m
+
n
fits in
O
(1) machine words, the running time increases to
O
((
n
+
m
) log
M
log (
n
+
m
)). The previous best algorithm due to Lifshits has
O
(
n
2
m
) running time.
Funder
National Science Centre (NCN) SONATA 1
Publisher
Association for Computing Machinery (ACM)
Subject
Mathematics (miscellaneous)
Reference30 articles.
1. The Smallest Grammar Problem
2. Lecture Notes in Computer Science;Gąsieniec Leszek
3. Lecture Notes in Computer Science;Gąsieniec Leszek
Cited by
24 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献