ClaSP: parameter-free time series segmentation

Author:

Ermshaus ArikORCID,Schäfer Patrick,Leser Ulf

Abstract

AbstractThe study of natural and human-made processes often results in long sequences of temporally-ordered values, aka time series (TS). Such processes often consist of multiple states, e.g. operating modes of a machine, such that state changes in the observed processes result in changes in the distribution of shape of the measured values. Time series segmentation (TSS) tries to find such changes in TS post-hoc to deduce changes in the data-generating process. TSS is typically approached as an unsupervised learning problem aiming at the identification of segments distinguishable by some statistical property. Current algorithms for TSS require domain-dependent hyper-parameters to be set by the user, make assumptions about the TS value distribution or the types of detectable changes which limits their applicability. Common hyper-parameters are the measure of segment homogeneity and the number of change points, which are particularly hard to tune for each data set. We present ClaSP, a novel, highly accurate, hyper-parameter-free and domain-agnostic method for TSS. ClaSP hierarchically splits a TS into two parts. A change point is determined by training a binary TS classifier for each possible split point and selecting the one split that is best at identifying subsequences to be from either of the partitions. ClaSP learns its main two model-parameters from the data using two novel bespoke algorithms. In our experimental evaluation using a benchmark of 107 data sets, we show that ClaSP outperforms the state of the art in terms of accuracy and is fast and scalable. Furthermore, we highlight properties of ClaSP using several real-world case studies.

Funder

Humboldt-Universität zu Berlin

Publisher

Springer Science and Business Media LLC

Subject

Computer Networks and Communications,Computer Science Applications,Information Systems

Reference54 articles.

1. Adams RP, MacKay DJ (2007) Bayesian online changepoint detection. arXiv preprint arXiv:0710.3742

2. Aminikhanghahi S, Cook DJ (2017) A survey of methods for time series change point detection. KAIS 51(2):339–367

3. Bagnall A, Lines J, Bostrom A et al (2016) The great time series classification bake off: an experimental evaluation of recently proposed algorithms. Extended Version. DMKD, pp 1–55

4. Baños O, Tóth MA, Damas M et al (2014) Dealing with the effects of sensor displacement in wearable activity recognition. Sensors 14(9995–10):023

5. Bosc M, Heitz F, Armspach JP et al (2003) Automatic change detection in multimodal serial mri: application to multiple sclerosis lesion evolution. NeuroImage 20(2):643–656

Cited by 10 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. E2Usd: Efficient-yet-effective Unsupervised State Detection for Multivariate Time Series;Proceedings of the ACM Web Conference 2024;2024-05-13

2. Raising the ClaSS of Streaming Time Series Segmentation;Proceedings of the VLDB Endowment;2024-04

3. PaSTiLa: Scalable Parallel Algorithm for Unsupervised Labeling of Long Time Series;Lobachevskii Journal of Mathematics;2024-03

4. Learning and Generalizing Tasks on Humanoid Robots with an Automatic Multisensory Segmentation Method;2024 IEEE/SICE International Symposium on System Integration (SII);2024-01-08

5. Deep Learning-based Precision Diagnosis of Lung Diseases on the Internet of Medical Things (IoMT);Proceedings of the Bulgarian Academy of Sciences;2023-10-31

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3