Affiliation:
1. State Key Laboratory of Public Big Data, College of Computer Science and Technology, Guizhou University, Guiyang 550025, China
Abstract
Large-scale and high-dimensional time series data are widely generated in modern applications such as intelligent transportation and environmental monitoring. However, such data contains much noise, outliers, and missing values due to interference during measurement or transmission. Directly forecasting such types of data (i.e., anomalous data) can be extremely challenging. The traditional method to deal with anomalies is to cut out the time series with anomalous value entries or replace the data. Both methods may lose important knowledge from the original data. In this paper, we propose a multidimensional time series forecasting framework that can better handle anomalous values: the robust temporal nonnegative matrix factorization forecasting model (RTNMFFM) for multi-dimensional time series. RTNMFFM integrates the autoregressive regularizer into nonnegative matrix factorization (NMF) with the application of the L2,1 norm in NMF. This approach improves robustness and alleviates overfitting compared to standard methods. In addition, to improve the accuracy of model forecasts on severely missing data, we propose a periodic smoothing penalty that keeps the sparse time slices as close as possible to the time slice with high confidence. Finally, we train the model using the alternating gradient descent algorithm. Numerous experiments demonstrate that RTNMFFM provides better robustness and better prediction accuracy.
Funder
Science and Technology Support Plan Project of Guizhou
Reference45 articles.
1. A methodology for energy multivariate time series forecasting in smart buildings based on feature selection;Energy Build.,2019
2. A survey on architecture, protocols and challenges in IoT;Sobin;Wirel. Pers. Commun.,2020
3. Yu, H.-F., Rao, N., and Dhillon, I.S. (2024, January 18). Temporal regularized matrix factorization for high-dimensional time series prediction. In Advances in Neural Information Processing Systems, 29. Available online: https://www.cs.utexas.edu/~rofuyu/papers/tr-mf-nips.pdf.
4. Structured nonnegative matrix factorization for traffic flow estimation of large cloud networks;Atif;Comput. Netw.,2021
5. Bourakna, A.E.Y., Chung, M.K., and Ombao, H. (2022). Topological data analysis for multivariate time series data. arXiv.