Affiliation:
1. Tel Aviv University, Tel Aviv, Israel
Abstract
In this work, we study the problem of testing subsequence-freeness. For a given subsequence (word)
w
=
w
1
…
w
k
, a sequence (text)
T
=
t
1
…
t
n
is said to contain
w
if there exist indices 1 ≤
i
1
< … <
i
k
≤ n such that
t
ij
=
w
j
for every 1 ≤
j
≤
k
. Otherwise,
T
is
w
-free. While a large majority of the research in property testing deals with algorithms that perform queries, here we consider
sample-based
testing (with one-sided error). In the “standard” sample-based model (i.e., under the uniform distribution), the algorithm is given samples (
i
,
t
i
) where
i
is distributed uniformly independently at random. The algorithm should distinguish between the case that
T
is
w
-free, and the case that
T
is ε-far from being
w
-free (i.e., more than an ε-fraction of its symbols should be modified so as to make it
w
-free). Freitag, Price, and Swartworth (Proceedings of RANDOM, 2017) showed that
O
((
k
2
log
k
)ε) samples suffice for this testing task. We obtain the following results.
–
The number of samples sufficient for one-sided error sample-based testing (under the uniform distribution) is
O
(
k
ε). This upper bound builds on a characterization that we present for the distance of a text
T
from
w
-freeness in terms of the maximum number of copies of
w
in
T
, where these copies should obey certain restrictions.
–
We prove a matching lower bound, which holds for
every
word
w
. This implies that the above upper bound is tight.
–
The same upper bound holds in the more general
distribution-free
sample-based model. In this model, the algorithm receives samples (
i
,
t
i
) where
i
is distributed according to an
arbitrary
distribution
p
(and the distance from
w
-freeness is measured with respect to
p
).
We highlight the fact that while we require that the testing algorithm work for every distribution and when only provided with samples, the complexity we get matches a known lower bound for a special case of the seemingly easier problem of testing subsequence-freeness with one-sided error under the uniform distribution and with queries (Canonne et al.,
Theory of Computing
, 2019).
Funder
Israel Science Foundation
Publisher
Association for Computing Machinery (ACM)
Subject
Computational Theory and Mathematics,Theoretical Computer Science
Reference40 articles.
1. On Active and Passive Testing
2. Noga Alon Michael Krivelevich Ilan Newman and Mario Szegedy. 2001. Regular languages are testable with a constant number of queries. SIAM Journal on Computing 30 6 (2001) 1842–1862.
3. Active Property Testing
4. Gabriel Bathie and Tatiana Starikovskaya. 2021. Property testing of regular languages with applications to streaming property testing of visibly pushdown languages. In Proceedings of the 48th International Colloquium Automata, Languages and Programming. 119:1–119:17.
5. Omri Ben-Eliezer. 2019. Testing local properties of arrays. In Proceedings of the 10th Innovations in Theoretical Computer Science conference (ITCS). 11:1–11:20.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献