Fast Summarization of Long Time Series with Graphics Processor-Reference-Cited by-同舟云学术

Fast Summarization of Long Time Series with Graphics Processor

Published:2022-05-23 Issue:10 Volume:10 Page:1781
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Zymbler Mikhail^ORCID,Goglachev Andrey

Abstract

Summarization of a long time series often occurs in analytical applications related to decision-making, modeling, planning, and so on. Informally, summarization aims at discovering a small-sized set of typical patterns (subsequences) to briefly represent the long time series. Apparent approaches to summarization like motifs, shapelets, cluster centroids, and so on, either require training data or do not provide an analyst with information regarding the fraction of the time series that a typical subsequence found corresponds to. Recently introduced, the time series snippet concept overcomes the above-mentioned limitations. A snippet is a subsequence that is similar to many other subsequences of the time series with respect to a specially defined similarity measure based on the Euclidean distance. However, the original Snippet-Finder algorithm has cubic time complexity concerning the lengths of the time series and the snippet. In this article, we propose the PSF (Parallel Snippet-Finder) algorithm that accelerates the original snippet discovery schema with GPU and ensures acceptable performance over very long time series. As opposed to the original algorithm, PSF splits the calculation of the similarity of all the time series subsequences to a snippet into several steps, each of which is performed in parallel. Experimental evaluation over real-world time series shows that PSF outruns both the original algorithm and a straightforward parallelization.

Funder

Russian Foundation for Basic Research

Ministry of Science and Higher Education of the Russian Federation

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/10/10/1781/pdf

Reference32 articles.

1. Probabilistic discovery of time series motifs

2. Exact Discovery of Time Series Motifs

3. Identifying Representative Trends in Massive Time Series Data Sets Using Sketches;Indyk,2000

4. Matrix Profile I: All Pairs Similarity Joins for Time Series: A Unifying View That Includes Motifs, Discords and Shapelets

5. Matrix Profile XIII: Time Series Snippets: A New Primitive for Time Series Data Mining

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. PaSTiLa: Scalable Parallel Algorithm for Unsupervised Labeling of Long Time Series;Lobachevskii Journal of Mathematics;2024-03

2. High-Performance Time Series Anomaly Discovery on Graphics Processors;Mathematics;2023-07-20

3. HPC Resources of South Ural State University;Communications in Computer and Information Science;2022