Author:
Bünger Dominik,Gondos Miriam,Peroche Lucile,Stoll Martin
Abstract
Time series data play an important role in many applications and their analysis reveals crucial information for understanding the underlying processes. Among the many time series learning tasks of great importance, we here focus on semi-supervised learning based on a graph representation of the data. Two main aspects are studied in this paper. Namely, suitable distance measures to evaluate the similarities between different time series, and the choice of learning method to make predictions based on a given number of pre-labeled data points. However, the relationship between the two aspects has never been studied systematically in the context of graph-based learning. We describe four different distance measures, including (Soft) DTW and MPDist, a distance measure based on the Matrix Profile, as well as four successful semi-supervised learning methods, including the recently introduced graph Allen–Cahn method and Graph Convolutional Neural Network method. We provide results for the novel combination of these distance measures with both the Allen-Cahn method and the GCN algorithm for binary semi-supervised learning tasks for various time-series data sets. In our findings we compare the chosen graph-based methods using all distance measures and observe that the results vary strongly with respect to the accuracy. We then observe that no clear best combination to employ in all cases is found. Our study provides a reproducible framework for future work in the direction of semi-supervised learning for time series with a focus on graph representations.
Funder
Technische Universität Chemnitz
Subject
Applied Mathematics,Statistics and Probability
Reference64 articles.
1. A review on time series data mining;Fu;Eng Appl Artif Intell,2011
2. Social big data: recent achievements and new challenges;Bello-Orgaz;Inform Fusion,2016
3. Data mining for the internet of things: literature review and challenges;Chen;Int J Distribut Sensor Netw,2015
4. Generic and scalable framework for automated time-series anomaly detection;Laptev;Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,2015
5. Probabilistic discovery of time series motifs;Chiu;Proceedings of the ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,2003
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献