Author:
Sun Shiming,Shan Xin,Wei Xueyun,Tai Chunliang,Liu Chao
Abstract
Abstract
The increasing instrumentation of physical and computing processes has given us unprecedented capabilities to collect massive volumes of time series. Power data is a typical kind of time series. Considering that the original time series data has ineluctable limitations such as uneven distribution, non-uniform length, poor sampling rate and noisy, we propose a learning=based similarity join for power data consisting of RNN encoder and matrix model. In addition, we develop the partition techniques by grouping process nodes following the matrix join model, ensuring the accuracy and efficiency of similarity join for data series. We conduct experiments on real data-set to evaluate the performance of our approach, demonstrating the effectiveness and scalability of our method.
Subject
Computer Science Applications,History,Education
Reference10 articles.
1. Word2vec model analysis for semantic similarities in english words[J];Jatnika;Procedia Computer Science,2019
2. Time adaptive optimal transport: A framework of time series similarity measure[J];Zhang;IEEE Access,2020
3. Parallel time series join using spark[J];Rong;Concurrency and Computation: Practice and Experience,2020