Author:
Yan Chenge,Zhu Ziwen,Hong Yuning
Abstract
Aiming at the problem of time series data regression prediction, this paper proposed a methodology for time series prediction based on data augmentation and cosine similarity weighted model average in the case where the predictor and response variable is a time series and the continuous scalar respectively. The constructed method deals with the problems of high dimension and noise of time series through B-spline basis expansion in which Blending algorithm was used to enhance the correlation information between B-spline basis coefficients and response variables, so as to further reduce the influence of noise on prediction models. Next, the cosine similarity model average is used to capture the unknown latent model structure between characteristic and response variables to improve the prediction accuracy of the model. The proposed method can effectively balance the bias and variance of the prediction model. In addition, the regression method in the technique is model-free. The analysis of actual data shows that the proposed method has certain advantages compared with those existing. Eventually, the method can be extended to forecasting applications in the fields of stock price prediction, social science, medicine and so on.
Publisher
Darcy & Roy Press Co. Ltd.
Reference27 articles.
1. Zeger, S. L., Irizarry, R., & Peng, R. D. (2006). On time series analysis of public health and biomedical data. Annu. Rev. Public Health, 27, 57-79.
2. Cowpertwait, P. S., & Metcalfe, A. V. (2009). Introductory time series with R. Springer Science & Business Media.
3. George. C. (2015). Time Series: ARIMA Methods, International Encyclopedia of the Social & Behavioral Sciences, 316-321.
4. Zhang, W. (2021). The representation of high dimensional time series method and classification algorithms (Ph.D. Dissertation, Beijing jiaotong university). https://kns.cnki.net/KCMS/detail/detail.aspx?dbname=CDFDLAST2022&filenam e=1021867900.nh
5. Sewell, M. (2007). Principal component analysis.