REINFORCEMENT LEARNING IN SUPPLY CHAINS

Author:

VALLURI ANNAPURNA (1), NORTH MICHAEL J. (2), MACAL CHARLES M. (2)

Affiliation:

1. Wharton School of Business, University of Pennsylvania, 1150 Steinberg Hall-Dietrich Hall, Philadelphia, PA 19104, USA

2. Argonne National Laboratory, 9700 S. Cass Avenue, Argonne, IL 60439, USA

Abstract

Effective management of supply chains creates value and can strategically position companies. In practice, human beings have been found to be both surprisingly successful and disappointingly inept at managing supply chains. The related fields of cognitive psychology and artificial intelligence have postulated a variety of potential mechanisms to explain this behavior. One of the leading candidates is reinforcement learning. This paper applies agent-based modeling to investigate the comparative behavioral consequences of three simple reinforcement learning algorithms in a multi-stage supply chain. For the first time, our findings show that the specific algorithm that is employed can have dramatic effects on the results obtained. Reinforcement learning is found to be valuable in multi-stage supply chains with several learning agents, as independent agents can learn to coordinate their behavior. However, learning in multi-stage supply chains using these postulated approaches from cognitive psychology and artificial intelligence takes extremely long time periods to achieve stability, which raises questions about their ability to explain behavior in real supply chains. The fact that it takes thousands of periods for agents to learn in this simple multi-agent setting provides new evidence that real-world decision makers are unlikely to be using strict reinforcement learning in practice.

Publisher

World Scientific Pub Co Pte Lt

Subject

Computer Networks and Communications, General Medicine

Cited by 25 articles.

1. Coupling simulation and machine learning for predictive analytics in supply chain management;International Journal of Production Research;2024-04-28

2. The Viability of Supply Chains with Interpretable Learning Systems: The Case of COVID-19 Vaccine Deliveries;Global Journal of Flexible Systems Management;2023-09-27

3. Source of Intelligent Manufacturing and Industrial Big Data;2023 International Seminar on Computer Science and Engineering Technology (SCSET);2023-04

4. A review on reinforcement learning algorithms and applications in supply chain management;International Journal of Production Research;2022-11-03

5. Deep reinforcement learning‐based ordering mechanism for performance optimization in multi‐echelon supply chains;Applied Stochastic Models in Business and Industry;2022-10-18
