Affiliation:
1. Simon Fraser University, Burnaby, BC, Canada
2. Google Research, New York, NY
Abstract
In this work, we are interested in periodic trends in long data streams in the presence of computational constraints. To this end; we present algorithms for discovering periodic trends in the combinatorial property testing model in a data stream
S
of length
n
using
o
(
n
) samples and space.
In accordance with the property testing model, we first explore the notion of being “close” to periodic by defining three different notions of self-distance through relaxing different notions of exact periodicity. An input
S
is then called approximately periodic if it exhibits a small self-distance (with respect to any one self-distance defined). We show that even though the different definitions of exact periodicity are equivalent, the resulting definitions of self-distance and approximate periodicity are not; we also show that these self-distances are constant approximations of each other. Afterwards, we present algorithms which distinguish between the two cases where
S
is exactly periodic and
S
is far from periodic with only a constant probability of error.
Our algorithms sample only
O
(√
n
log
2
n
) (or
O
(√
n
log
4
n
), depending on the self-distance) positions and use as much space. They can also find, using
o
(
n
) samples and space, the largest/smallest period, and/or all of the approximate periods of
S
. These algorithms can also be viewed as working on streaming inputs where each data item is seen once and in order, storing only a sublinear (
O
(√
n
log
2
n
) or
O
(√
n
log
4
n
)) size sample from which periodicities are identified.
Publisher
Association for Computing Machinery (ACM)
Subject
Mathematics (miscellaneous)
Reference12 articles.
1. A sublinear algorithm for weakly approximating edit distance
2. Das G. and Gunopoulos D. 2000. Time series similarity measures. http://www.acm.org/sigs/sigkdd/kdd2000/Tutorial-Das.htm. 10.1145/349093.349108 Das G. and Gunopoulos D. 2000. Time series similarity measures. http://www.acm.org/sigs/sigkdd/kdd2000/Tutorial-Das.htm. 10.1145/349093.349108
3. Uniqueness theorems for periodic functions
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Periodicity in Data Streams with Wildcards;Theory of Computing Systems;2019-12-20
2. Periodicity in Data Streams with Wildcards;Computer Science – Theory and Applications;2018
3. Periodicity and Cyclic Shifts via Linear Sketches;Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques;2011