On repairing timestamps for regular interval time series

Author:

Fang Chenguang¹, Song Shaoxu¹, Mei Yinan¹

Affiliation:

1. BNRist, Tsinghua University

Abstract

Time series data often have regular time intervals, e.g., sensor data collected at a pre-specified frequency in IoT scenarios, air quality data regularly recorded by outdoor monitors, and GPS signals periodically received from multiple satellites. However, due to issues such as transmission latency, device failure, and repeated requests, timestamps can be dirty and lead to irregular time intervals. Amending the irregular time intervals has clear benefits: it improves data quality and also enables more accurate applications such as frequency-domain analysis and more effective compression in storage. The timestamp repairing problem is challenging, however, because many interacting factors must be determined, including the time interval, the start timestamp, the series length, and the matching between the time series before and after repairing. Our contributions in this paper are (1) formalizing the timestamp repairing problem for regular interval time series to minimize the cost w.r.t. move, insert and delete operations; (2) devising an exact approach with advanced pruning strategies based on lower bounds of repairing; (3) proposing an approximation based on bi-directional dynamic programming. The experimental results demonstrate the superiority of our proposal in both timestamp repair accuracy and the aforesaid applications. Remarkably, the repair results can be used to evaluate time series data quality measures. Both the repair and measure functions have been implemented in an open-source time series database, Apache IoTDB.
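To make the cost model in contribution (1) concrete, the following is a minimal sketch, not the paper's algorithm. It assumes the start timestamp, interval, and series length are already fixed (the paper treats these as unknowns to be determined), and it assumes a move costs the absolute timestamp shift while each insert or delete costs a constant. The function name `repair_cost` and the parameter `indel_cost` are hypothetical, chosen for illustration only.

```python
from typing import List


def repair_cost(observed: List[int], start: int, interval: int,
                length: int, indel_cost: int) -> int:
    """Minimum cost of aligning `observed` (sorted timestamps) to the
    regular grid start, start + interval, ..., using move, insert and
    delete operations. Assumed cost model, not the paper's definition:
    move = |observed - target|, insert = delete = indel_cost."""
    targets = [start + j * interval for j in range(length)]
    m, n = len(observed), length

    # dp[i][j]: minimum cost to align the first i observed timestamps
    # with the first j grid timestamps, preserving order.
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        dp[i][0] = i * indel_cost  # delete every observed timestamp
    for j in range(1, n + 1):
        dp[0][j] = j * indel_cost  # insert every grid timestamp

    for i in range(1, m + 1):
        for j in range(1, n + 1):
            move = dp[i - 1][j - 1] + abs(observed[i - 1] - targets[j - 1])
            delete = dp[i - 1][j] + indel_cost
            insert = dp[i][j - 1] + indel_cost
            dp[i][j] = min(move, delete, insert)
    return dp[m][n]


if __name__ == "__main__":
    # A delayed point (21) and a repeated request (second 10).
    print(repair_cost([0, 10, 10, 21, 30], 0, 10, 4, 5))
```

Under these assumptions the matching between the series before and after repairing is an order-preserving alignment, so the table resembles an edit-distance computation. Searching over candidate intervals, start timestamps, and lengths would wrap this routine in an outer loop, which is where the pruning and approximation strategies described in the abstract would apply.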

Publisher

Association for Computing Machinery (ACM)

Subject

General Earth and Planetary Sciences; Water Science and Technology; Geography, Planning and Development

Cited by 9 articles.

1. Optimizing Time Series Queries with Versions;Proceedings of the ACM on Management of Data;2024-05-29

2. Time Series Representation for Visualization in Apache IoTDB;Proceedings of the ACM on Management of Data;2024-03-12

3. IoTDQ: An Industrial IoT Data Analysis Library for Apache IoTDB;Big Data Mining and Analytics;2024-03

4. Learning Autoregressive Model in LSM-Tree based Store;Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2023-08-04

5. TsQuality: Measuring Time Series Data Quality in Apache IoTDB;Proceedings of the VLDB Endowment;2023-08
