Correction to deep reinforcement learning‐based ordering mechanism for performance optimization in <scp>multi‐echelon</scp> supply chains-Reference-Cited by-同舟云学术

Correction to deep reinforcement learning‐based ordering mechanism for performance optimization in multi‐echelon supply chains

Published:2023-12-28 Issue: Volume: Page:
ISSN:1524-1904
Container-title:Applied Stochastic Models in Business and Industry
language:en
Short-container-title:Appl Stoch Models Bus & Ind

Author:

Kurian Dony S.¹^ORCID,Pillai V. Madhusudanan¹^ORCID

Affiliation:

1. Department of Mechanical Engineering National Institute of Technology Calicut Kozhikode India

Abstract

AbstractThis paper addresses and acknowledges the valuable feedback provided by Dr. Deniz Preil in response to the recent study conducted by Kurian et al which investigates the application of proximal policy optimization (PPO) to determine dynamic ordering policies within multi‐echelon supply chains. The first comment raised by Dr. Preil motivated an examination of the training and evaluation procedures in Experiments 2, 3, and 4. The Experiments 2 and 3 were reworked to address this, allowing the seed to vary for every training iteration, resulting in refined outcomes while there was no need of reworking of Experiment 4. The second comment focused on the benchmarking strategies involving the 1‐1 policy and the order‐up‐to (OUT) policy, clarifying the distinctions between the two policies and justifying the use of the 1‐1 policy for benchmarking in Experiment 4. The implementation of the widely accepted OUT policy was explained, highlighting the meaningful rationale behind its use. These discussions aim to enhance the methodology employed by Kurian et al and strengthen the implications of the findings within the domain of supply chain ordering management.

Publisher

Wiley

Subject

Management Science and Operations Research,General Business, Management and Accounting,Modeling and Simulation

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1002/asmb.2838

Reference13 articles.

1. Deep reinforcement learning‐based ordering mechanism for performance optimization in multi‐echelon supply chains

2. Artificial intelligence-based inventory management: a Monte Carlo tree search approach

3. Computers play the beer game: can artificial agents manage supply chains?

4. Inventory performance of some supply chain inventory policies under impulse demands

5. Minimizing the bullwhip effect in a supply chain using genetic algorithms