AI vs. Human Buyers: A Study of Alibaba’s Inventory Replenishment System

Published:2023-09
Issue:5
Volume:53
Page:372-387
ISSN:2644-0865
Container-title:INFORMS Journal on Applied Analytics
language:en

Author:

Liu Jiaxi¹, Lin Shuyi¹, Xin Linwei², Zhang Yidong¹

Affiliation:

1. Alibaba Group, Hangzhou, Zhejiang 311100, China;

2. Booth School of Business, University of Chicago, Chicago, Illinois 60637

Abstract

Inventory management is one of the most important components of Alibaba’s business. Traditionally, human buyers make replenishment decisions: although artificial intelligence (AI) algorithms make recommendations, human buyers can choose to ignore these recommendations and make their own decisions. The company has been exploring a new replenishment system in which algorithmic recommendations are final. The algorithms combine state-of-the-art deep reinforcement learning techniques with the framework of fictitious play. By learning the supplier’s behavior, we are able to address the important issues of lead time and fill rate on order quantity, which have been ignored in the extant literature of stochastic inventory control. We present evidence that our algorithms outperform human buyers in terms of reducing out-of-stock rates and inventory levels. More interestingly, we have seen additional benefits amid the pandemic. Over the last two years, cities in China partially and intermittently locked down to mitigate COVID-19 outbreaks. We have observed panic buying from human buyers during lockdowns, leading to the bullwhip effect. By contrast, panic buying and the bullwhip effect can be mitigated using our algorithms due to their ability to recognize changes in the supplier’s behavior during lockdowns.

History: This paper has been accepted for the INFORMS Journal on Applied Analytics Special Issue—2022 Daniel H. Wagner Prize for Excellence in the Practice of Advanced Analytics and Operations Research.

Publisher

Institute for Operations Research and the Management Sciences (INFORMS)

Link

https://pubsonline.informs.org/doi/pdf/10.1287/inte.2023.1160


Cited by 2 articles.

1. VC Theory for Inventory Policies. SSRN Electronic Journal, 2024.

2. Multi-Agent Deep Reinforcement Learning for Decentralized Proactive Transshipment. SSRN Electronic Journal, 2023.