Affiliation:
1. Sauder School of Business, The University of British Columbia, Vancouver, BC, Canada
Abstract
We consider a periodic-review, single-product, multi-echelon inventory problem with instantaneous replenishment. In each period, the decision-maker makes ordering decisions for all echelons. Any unsatisfied demand is back-ordered, and any excess inventory is carried over to the next period. In contrast to the classic inventory literature, we assume that the demand distribution is not known a priori; the decision-maker only observes demand realizations over the planning horizon. We propose a nonparametric algorithm that generates a sequence of adaptive ordering decisions based on the stochastic gradient descent method. We compare the [Formula: see text]-period cost of our algorithm to that of the clairvoyant, who knows the underlying demand distribution in advance, and we prove that the expected [Formula: see text]-period regret is at most [Formula: see text], matching a lower bound for this problem.
Funder
Natural Sciences and Engineering Research Council of Canada
Cited by
1 article.