Author:
Middlehurst Matthew,Schäfer Patrick,Bagnall Anthony
Abstract
AbstractIn 2017, a research paper (Bagnall et al. Data Mining and Knowledge Discovery 31(3):606-660. 2017) compared 18 Time Series Classification (TSC) algorithms on 85 datasets from the University of California, Riverside (UCR) archive. This study, commonly referred to as a ‘bake off’, identified that only nine algorithms performed significantly better than the Dynamic Time Warping (DTW) and Rotation Forest benchmarks that were used. The study categorised each algorithm by the type of feature they extract from time series data, forming a taxonomy of five main algorithm types. This categorisation of algorithms alongside the provision of code and accessible results for reproducibility has helped fuel an increase in popularity of the TSC field. Over six years have passed since this bake off, the UCR archive has expanded to 112 datasets and there have been a large number of new algorithms proposed. We revisit the bake off, seeing how each of the proposed categories have advanced since the original publication, and evaluate the performance of newer algorithms against the previous best-of-category using an expanded UCR archive. We extend the taxonomy to include three new categories to reflect recent developments. Alongside the originally proposed distance, interval, shapelet, dictionary and hybrid based algorithms, we compare newer convolution and feature based algorithms as well as deep learning approaches. We introduce 30 classification datasets either recently donated to the archive or reformatted to the TSC format, and use these to further evaluate the best performing algorithm from each category. Overall, we find that two recently proposed algorithms, MultiROCKET+Hydra (Dempster et al. 2022) and HIVE-COTEv2 (Middlehurst et al. Mach Learn 110:3211-3243. 2021), perform significantly better than other approaches on both the current and new TSC problems.
Publisher
Springer Science and Business Media LLC
Reference95 articles.
1. Abanda A, Mori U, Lozano J (2019) A review on distance based time series classification. Data Mining and Knowledge Discovery 33(2):378–412
2. Bagnall A, Lines J, Hills J et al (2015) Time-series classification with COTE: The collective of transformation-based ensembles. IEEE Trans Knowl Data Eng 27:2522–2535
3. Bagnall A, Lines J, Bostrom A et al (2017) The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances. Data Mining and Knowledge Discovery 31(3):606–660
4. Bagnall A, Bostrom A, Cawley G et al (2018) Is rotation forest the best classifier for problems with continuous features? ArXiv e-prints arXiv:1809.06705
5. Bagnall A, Flynn M, Large J et al (2020) On the usage and performance of HIVE-COTE v1.0. In: proceedings of the 5th Workshop on Advanced Analytics and Learning on Temporal Data
Cited by
14 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献