Can transformers transform financial forecasting?
-
Published:2024-06-20
Issue:
Volume:
Page:
-
ISSN:2044-1398
-
Container-title:China Finance Review International
-
language:en
-
Short-container-title:CFRI
Author:
Souto Hugo GobatoORCID, Moradi AmirORCID
Abstract
PurposeThis study aims to critically evaluate the competitiveness of Transformer-based models in financial forecasting, specifically in the context of stock realized volatility forecasting. It seeks to challenge and extend upon the assertions of Zeng et al. (2023) regarding the purported limitations of these models in handling temporal information in financial time series.Design/methodology/approachEmploying a robust methodological framework, the study systematically compares a range of Transformer models, including first-generation and advanced iterations like Informer, Autoformer, and PatchTST, against benchmark models (HAR, NBEATSx, NHITS, and TimesNet). The evaluation encompasses 80 different stocks, four error metrics, four statistical tests, and three robustness tests designed to reflect diverse market conditions and data availability scenarios.FindingsThe research uncovers that while first-generation Transformer models, like TFT, underperform in financial forecasting, second-generation models like Informer, Autoformer, and PatchTST demonstrate remarkable efficacy, especially in scenarios characterized by limited historical data and market volatility. The study also highlights the nuanced performance of these models across different forecasting horizons and error metrics, showcasing their potential as robust tools in financial forecasting, which contradicts the findings of Zeng et al. (2023)Originality/valueThis paper contributes to the financial forecasting literature by providing a comprehensive analysis of the applicability of Transformer-based models in this domain. It offers new insights into the capabilities of these models, especially their adaptability to different market conditions and forecasting requirements, challenging the existing skepticism created by Zeng et al. (2023) about their utility in financial forecasting.
Reference92 articles.
1. A hybrid artificial neural network-gjr modeling approach to forecasting currency exchange rate volatility;Neurocomputing,2019 2. Asymptotic theory of certain ‘goodness of fit’ criteria based on stochastic processes;The Annals of Mathematical Statistics,1952 3. Anil, C., Wu, Y., Andreassen, A., Lewkowycz, A., Misra, V., Ramasesh, V., Slone, A., Gur-Ari, G., Dyer, E. and Neyshabur, B. (2022), “Exploring length generalization in large language models”, in Koyejo, S., Mohamed, S., Agarwal, A., Belgrave, D., Cho, K. and Oh, A. (Eds), Advances in Neural Information Processing Systems, Curran Associates, Vol. 35, pp. 38546-38556, available at: https://proceedings.neurips.cc/paper\text{\_}files/paper/2022/file/fb7451e43f9c1c35b774bcfad7a5714b-Paper-Conference.pdf 4. Lassoing the har model: a model selection perspective on realized volatility dynamics;Econometric Reviews,2015 5. Flexible har model for realized volatility;Studies in Nonlinear Dynamics and Econometrics,2018
|
|