A Multi-Scaling Reinforcement Learning Trading System Based on Multi-Scaling Convolutional Neural Networks-Reference-Cited by-同舟云学术

A Multi-Scaling Reinforcement Learning Trading System Based on Multi-Scaling Convolutional Neural Networks

Published:2023-05-27 Issue:11 Volume:11 Page:2467
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Huang Yuling¹^ORCID,Cui Kai¹,Song Yunlin²,Chen Zongren¹

Affiliation:

1. School of Computer Science and Engineering, Macau University of Science and Technology, Macao, China

2. Department of Engineering Science, Faculty of Innovation Engineering, Macau University of Science and Technology, Macao, China

Abstract

Advancements in machine learning have led to an increased interest in applying deep reinforcement learning techniques to investment decision-making problems. Despite this, existing approaches often rely solely on single-scaling daily data, neglecting the importance of multi-scaling information, such as weekly or monthly data, in decision-making processes. To address this limitation, a multi-scaling convolutional neural network for reinforcement learning-based stock trading, termed multi-scaling convolutional neural network SARSA (state, action, reward, state, action), is proposed. Our method utilizes a multi-scaling convolutional neural network to obtain multi-scaling features of daily and weekly financial data automatically. This involves using a convolutional neural network with several filter sizes to perform a multi-scaling extraction of temporal features. Multiple-scaling feature mining allows agents to operate over longer time scaling, identifying low stock positions on the weekly line and avoiding daily fluctuations during continuous declines. This mimics the human approach of considering information at varying temporal and spatial scaling during stock trading. We further enhance the network’s robustness by adding an average pooling layer to the backbone convolutional neural network, reducing overfitting. State, action, reward, state, action, as an on-policy reinforcement learning method, generates dynamic trading strategies that combine multi-scaling information across different time scaling, while avoiding dangerous strategies. We evaluate the effectiveness of our proposed method on four real-world datasets (Dow Jones, NASDAQ, General Electric, and AAPLE) spanning from 1 January 2007 to 31 December 2020, and demonstrate its superior profits compared to several baseline methods. In addition, we perform various comparative and ablation tests in order to demonstrate the superiority of the proposed network architecture. Through these experiments, our proposed multi-scaling module yields better results compared to the single-scaling module.

Funder

Faculty Research Grants, Macau University of Science and Technology

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/11/11/2467/pdf

Reference53 articles.

1. Poterba, J.M., and Summers, L.H. (1988). Mean Reversion in Stock Prices: Evidence and Implications, Social Science Electronic Publishing.

2. Moody, J.E., and Saffell, M.J. (December, January 30). Reinforcement learning for trading. Proceedings of the NIPS’98: 11th International Conference on Neural Information Processing Systems, Denver, CO, USA.

3. Neuneier, R. (December, January 30). Enhancing Q-learning for optimal asset allocation. Proceedings of the NIPS’98: 11th International Conference on Neural Information Processing Systems, Denver, CO, USA.

4. Corazza, M., and Sangalli, A. (2015). Q-Learning and SARSA: A Comparison between Two Intelligent Stochastic Control Approaches for Financial Trading. SSRN Electron. J.

5. Yan, C., Mabu, S., and Hirasawa, K. (2007, January 25–28). Genetic network programming with sarsa learning and its application to creating stock trading rules. Proceedings of the 2007 IEEE Congress on Evolutionary Computation, Singapore.

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Improving algorithmic trading consistency via human alignment and imitation learning;Expert Systems with Applications;2024-11

2. A novel deep reinforcement learning framework with BiLSTM-Attention networks for algorithmic trading;Expert Systems with Applications;2024-04

3. Research into Developing a Data Mining-Based Quantitative Trading System for Use with Software and High Performance Computers;2023 International Conference on Industrial IoT, Big Data and Supply Chain (IIoTBDSC);2023-09-22

4. DADE-DQN: Dual Action and Dual Environment Deep Q-Network for Enhancing Stock Trading Strategy;Mathematics;2023-08-22