Adaptive Supply Chain: Demand–Supply Synchronization Using Deep Reinforcement Learning-Reference-Cited by-同舟云学术

Adaptive Supply Chain: Demand–Supply Synchronization Using Deep Reinforcement Learning

Published:2021-08-15 Issue:8 Volume:14 Page:240
ISSN:1999-4893
Container-title:Algorithms
language:en
Short-container-title:Algorithms

Author:

Kegenbekov Zhandos,Jackson Ilya

Abstract

Adaptive and highly synchronized supply chains can avoid a cascading rise-and-fall inventory dynamic and mitigate ripple effects caused by operational failures. This paper aims to demonstrate how a deep reinforcement learning agent based on the proximal policy optimization algorithm can synchronize inbound and outbound flows and support business continuity operating in the stochastic and nonstationary environment if end-to-end visibility is provided. The deep reinforcement learning agent is built upon the Proximal Policy Optimization algorithm, which does not require hardcoded action space and exhaustive hyperparameter tuning. These features, complimented with a straightforward supply chain environment, give rise to a general and task unspecific approach to adaptive control in multi-echelon supply chains. The proposed approach is compared with the base-stock policy, a well-known method in classic operations research and inventory control theory. The base-stock policy is prevalent in continuous-review inventory systems. The paper concludes with the statement that the proposed solution can perform adaptive control in complex supply chains. The paper also postulates fully fledged supply chain digital twins as a necessary infrastructural condition for scalable real-world applications.

Publisher

MDPI AG

Subject

Computational Mathematics,Computational Theory and Mathematics,Numerical Analysis,Theoretical Computer Science

Link

https://www.mdpi.com/1999-4893/14/8/240/pdf

Reference34 articles.

1. Deep Learning;Goodfellow,2016

2. Comprehensive Review of Deep Reinforcement Learning Methods and Applications in Economics

3. Deep traffic: Crowdsourced hyperparameter tuning of deep reinforcement learning systems for multi-agent dense traffic navigation;Fridman;arXiv,2018

Cited by 31 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A comprehensive review of model compression techniques in machine learning;Applied Intelligence;2024-09-02

2. Benefits, challenges, and limitations of inventory control using machine learning algorithms: literature review;OPSEARCH;2024-08-15

3. Optimisation of recovery policies in the era of supply chain disruptions: a system dynamics and reinforcement learning approach;International Journal of Production Research;2024-08-06

4. Incorporating supply and production digital twins to mitigate demand disruptions in multi-echelon networks;International Journal of Production Economics;2024-07

5. Research on Optimization Strategies for Closed-Loop Supply Chain Management Based on Deep Learning Technology;International Journal of Information Systems and Supply Chain Management;2024-04-02