Affiliation:
1. Department of Data Analysis and Modeling, VTB Bank, Moscow, Russia
2. Laboratory for Social and Cognitive Informatics, National Research University Higher School of Economics, St. Petersburg, Russia
Abstract
Open text data, such as financial news, are thought to be able to affect or to describe stock market behavior, however, there are no widely accepted algorithms for extracting the relationship between stock quotes time series and fast-growing textual representation of economic information. The field remains challenging and understudied. In particular, topic modeling as a powerful tool for interpretable dimensionality reduction has been hardly ever used for such tasks. We present a topic modeling framework for assessing the relationship between financial news stream and stock prices in order to maximize trader’s gain. To do so, we use a dataset of economic news sections of three Russian national media sources (Kommersant, Vedomosti, and RIA Novosti) containing 197,678 economic articles. They are used to predict 39 time series of the most liquid Russian stocks collected over eight years, from 2013 to 2021. Our approach shows the ability to detect significant return-predictive signals and outperforms 26 existing models in terms of Sharpe ratio and annual return of simple long strategy. In particular, it shows a significant Granger causal relationship for more than 70% of portfolio stocks. Furthermore, the approach produces highly interpretable results, requires no domain-specific dictionaries, and, unlike most existing industrial solutions, can be calibrated for individual time series. This makes it directly usable for trading strategies and analytical tasks. Finally, since topic modeling shows its efficiency for most European languages, our approach is expected to be transferrable to European stock markets as well.
Funder
Basic Research Program at the National Research University Higher School of Economics in 2022
Reference40 articles.
1. Clustering approach to stock market prediction;Babu;International Journal of Advanced Networking and Applications,2011
2. Discovering public sentiment in social media for predicting stock movement of publicly listed companies;Bing;Information Systems,2017
3. Dynamic topic models;Blei,2006
4. Latent dirichlet allocation;Blei;Journal of Machine Learning Research,2003
5. AffectiveSpace 2: enabling affective intuition for concept-level sentiment analysis;Cambria,2015