Author:
Kim Siwon,Choi Kukjin,Choi Hyun-Soo,Lee Byunghan,Yoon Sungroh
Abstract
In recent years, proposed studies on time-series anomaly detection (TAD) report high F1 scores on benchmark TAD datasets, giving the impression of clear improvements in TAD. However, most studies apply a peculiar evaluation protocol called point adjustment (PA) before scoring. In this paper, we theoretically and experimentally reveal that the PA protocol has a great possibility of overestimating the detection performance; even a random anomaly score can easily turn into a state-of-the-art TAD method. Therefore, the comparison of TAD methods after applying the PA protocol can lead to misguided rankings. Furthermore, we question the potential of existing TAD methods by showing that an untrained model obtains comparable detection performance to the existing methods even when PA is forbidden. Based on our findings, we propose a new baseline and an evaluation protocol. We expect that our study will help a rigorous evaluation of TAD and lead to further improvement in future researches.
Publisher
Association for the Advancement of Artificial Intelligence (AAAI)
Cited by
45 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Deep Learning for Time Series Anomaly Detection: A Survey;ACM Computing Surveys;2024-08-30
2. PATE: Proximity-Aware Time Series Anomaly Evaluation;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24
3. SensitiveHUE: Multivariate Time Series Anomaly Detection by Enhancing the Sensitivity to Normal Patterns;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24
4. CutAddPaste:
Time Series Anomaly Detection by Exploiting Abnormal Knowledge;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24
5. ICS Anomaly Detection Based on Sensor Patterns and Actuator Rules in Spatiotemporal Dependency;IEEE Transactions on Industrial Informatics;2024-08