Abstract
AbstractData collected from the environment in computer engineering may include missing values due to various factors, such as lost readings from sensors caused by communication errors or power outages. Missing data can result in inaccurate analysis or even false alarms. It is therefore essential to identify missing values and correct them as accurately as possible to ensure the integrity of the analysis and the effectiveness of any decision-making based on the data. This paper presents a new approach, the Gap Imputing Algorithm (GMA), for imputing missing values in time series data. The Gap Imputing Algorithm (GMA) identifies sequences of missing values and determines the periodic time of the time series. Then, it searches for the most similar subsequence from historical data. Unlike previous work, GMA supports any type of time series and is resilient to consecutively missing values with different gaps distances. The experimental findings, which were based on both real-world and benchmark datasets, demonstrate that the GMA framework proposed in this study outperforms other methods in terms of accuracy. Specifically, our proposed method achieves an accuracy score that is 5 to 20% higher than that of other methods. Furthermore, the GMA framework is well suited to handling missing gaps with larger distances, and it produces more accurate imputations, particularly for datasets with strong periodic patterns.
Funder
Tanta University
Faculty of Engineering, Tanta University
Publisher
Springer Science and Business Media LLC
Subject
General Earth and Planetary Sciences,General Engineering,General Environmental Science
Reference37 articles.
1. Zheng Y, Capra L, Wolfson O, Yang H (2014) Urban computing: concepts, methodologies, and applications. ACM Trans Intell Syst Technol 38
2. Yi X, Zheng Y, Zhang J, Li T (2015) ST-MVL: filling missing values in geo-sensory time series data. In: Conference on artificial intelligence
3. José Cambronero JK (2017) Query optimization for dynamic imputation. The VLDB Endowment, 10
4. Liao W, Bak-Jensen B, Pillai JR, Yang D, Wang Y (2021) Data-driven missing data imputation for wind farms using context encoder. J Mod Power Syst Clean Energy 10(4):964–976
5. Little RJ (1992) Regression with missing X’s: a review. J Am Stat Assoc 87(420):1227–1237
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献