Reinforcement Learning for Optimizing Can-Order Policy with the Rolling Horizon Method

Published:2023-07-07 Issue:7 Volume:11 Page:350
ISSN:2079-8954
Container-title:Systems
language:en
Short-container-title:Systems

Author:

Noh Jiseong¹

Affiliation:

1. Center for Creative Convergence Education, Hanyang University, ERICA Campus, Ansan 15588, Republic of Korea

Abstract

This study presents a novel approach that combines a mixed-integer linear programming (MILP) model for periodic inventory management with reinforcement learning (RL) algorithms. The rolling horizon method (RHM) is a multi-period optimization approach that is re-applied as new market information becomes available. A limitation of the RHM is that its prediction horizon is not easy to determine; to overcome this, a dynamic RHM is developed in which RL algorithms optimize the prediction horizon. The state vector consisted of the order-up-to level, real demand, total cost, holding cost, and backorder cost, whereas the action included the prediction horizon and the forecast demand for the next time step. The performance of the proposed model was validated through two experiments covering stable and uncertain demand patterns. The results showed the effectiveness of the proposed approach in inventory management, particularly when the proximal policy optimization (PPO) algorithm was used for training rather than other RL algorithms. This study represents an advance in both the theoretical and practical aspects of multi-item inventory management.
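
The state and action described in the abstract can be pictured with a minimal Python sketch. This is an illustration only, not the paper's implementation: the class and function names, the cost coefficients `h` and `b`, and the simple order-up-to rule that stands in for the MILP solve are all assumptions introduced here.

```python
from dataclasses import dataclass
from typing import Callable, Sequence, Tuple


@dataclass
class State:
    # Fields mirror the state vector listed in the abstract.
    order_up_to_level: float
    real_demand: float
    total_cost: float
    holding_cost: float
    backorder_cost: float


@dataclass
class Action:
    # Fields mirror the action listed in the abstract.
    prediction_horizon: int
    forecast_demand: float


def step(inventory: float, real_demand: float, action: Action,
         h: float = 1.0, b: float = 5.0) -> Tuple[float, State]:
    """Advance one period (hypothetical helper).

    The order-up-to rule below is a placeholder for the MILP solve;
    h and b are assumed unit holding and backorder costs.
    """
    # Assumed planning rule: cover forecast demand over the chosen horizon.
    order_up_to = action.forecast_demand * action.prediction_horizon
    order_qty = max(0.0, order_up_to - inventory)
    # Negative inventory represents backordered demand.
    inventory = inventory + order_qty - real_demand
    holding = h * max(inventory, 0.0)
    backorder = b * max(-inventory, 0.0)
    state = State(order_up_to, real_demand, holding + backorder, holding, backorder)
    return inventory, state


def run(demands: Sequence[float], policy: Callable[[State], Action],
        inventory: float = 0.0) -> float:
    """Roll forward one period at a time, letting the policy pick each action."""
    state = State(0.0, 0.0, 0.0, 0.0, 0.0)
    total = 0.0
    for demand in demands:
        action = policy(state)  # an RL agent would sit here
        inventory, state = step(inventory, demand, action)
        total += state.total_cost
    return total


def fixed_policy(state: State) -> Action:
    # Stand-in for a trained agent: always plan one period ahead.
    return Action(prediction_horizon=1, forecast_demand=10.0)


print(run([10.0, 10.0, 10.0], fixed_policy))
```

In a setup like the one the abstract describes, `fixed_policy` would be replaced by a trained agent (for example, one trained with PPO), and the negative of `total_cost` would be a natural reward signal, though that reward choice is also an assumption here.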

Funder

Hanyang University

Publisher

MDPI AG

Subject

Information Systems and Management, Computer Networks and Communications, Modeling and Simulation, Control and Systems Engineering, Software

Link

https://www.mdpi.com/2079-8954/11/7/350/pdf

References (17 articles)

1. Marilú Destino, J.F., Müllerklein, D., and Trautwein, V. (2023, May 12). To Improve Your Supply Chain, Modernize Your Supply-Chain IT. Available online: https://www.mckinsey.com/capabilities/operations/our-insights/to-improve-your-supply-chain-modernize-your-supply-chain-it.

2. AmazonWebServices (2023, May 12). Predicting The Future of Demand: How Amazon Is Reinventing Forecasting with Machine Learning. Available online: https://www.forbes.com/sites/amazonwebservices/2021/12/03/predicting-the-future-of-demand-how-amazon-is-reinventing-forecasting-with-machine-learning/.

3. Review and analysis of artificial intelligence methods for demand forecasting in supply chain management;Mediavilla;Procedia CIRP,2022

4. Deep reinforcement learning for inventory control: A roadmap;Boute;Eur. J. Oper. Res.,2022

5. Kempf, K.G. (2004, June 30–July 2). Control-oriented approaches to supply chain management in semiconductor manufacturing. Proceedings of the 2004 American Control Conference, Boston, MA, USA.

Cited by 2 articles.

1. Data-Driven Algorithms for Two-Location Inventory Systems;Systems;2024-04-29

2. Industry 4.0;Advances in Logistics, Operations, and Management Science;2024-01-19