Sarsa(Λ)-Based Logistics Planning Approximated by Value Function with Policy Iteration

Published: 2015-12
Issue: 4
Volume: 9
Pages: 449-466
ISSN: 1748-3026
Container-title: Journal of Algorithms & Computational Technology
Language: en
Short-container-title: Journal of Algorithms & Computational Technology

Author:

Tang Yu¹

Affiliation:

1. Taizhou University, Taizhou, Jiangsu, 225300, China

Abstract

The logistics planning problem has been studied extensively. However, as stochastic events on the road become more frequent, a growing number of stochastic factors must be taken into account. This paper uses a dynamic approach to solve the logistics planning problem in its common form with stochastic demand, within the reinforcement learning framework, which can optimize a policy in unknown environments and under uncertainty. We use a clustering method to extract states as the main features for the basis functions, in order to address the curse of dimensionality caused by the stochastic setting. We also propose an approximation approach in which policy iteration is constrained by the goal of minimizing the temporal-difference (TD) error, so as to approximate the stochastic cases of the real world. The resulting approximation parameters are then used as input to the proposed Sarsa(Λ)-based logistics planning algorithm, which determines the policy and actions in response to real-world stochastic events. Benchmark experiments show that the proposed algorithm achieves improvements in almost all test cases.
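For readers unfamiliar with the technique named in the abstract, the following is a minimal sketch of generic Sarsa(Λ) (conventionally written Sarsa(λ)) with a linear value function over radial basis features. It is not the paper's algorithm: the cluster centres are assumed to be given rather than learned, the policy-iteration approximation step is not reproduced, and the one-dimensional corridor environment, all function names, and all parameter values are hypothetical choices made for illustration only.

```python
import math
import random

def rbf_features(state, centres, width=0.5):
    """Radial basis features centred on (assumed precomputed) cluster centres."""
    return [math.exp(-((state - c) ** 2) / (2 * width ** 2)) for c in centres]

def q_value(weights, feats, action):
    """Linear action value: dot product of the action's weights and the features."""
    return sum(w * f for w, f in zip(weights[action], feats))

def epsilon_greedy(weights, feats, n_actions, epsilon, rng):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(n_actions)
    values = [q_value(weights, feats, a) for a in range(n_actions)]
    return values.index(max(values))

def corridor_step(state, action, rng, goal=4, slip=0.1):
    """Hypothetical toy environment: positions 0..goal, action 1 moves right,
    action 0 moves left, and a move fails with probability `slip`."""
    if rng.random() < slip:
        next_state = state
    else:
        next_state = min(goal, max(0, state + (1 if action == 1 else -1)))
    return next_state, -1.0, next_state == goal

def sarsa_lambda(env_step, start_state, centres, n_actions, episodes=500,
                 alpha=0.1, gamma=0.95, lam=0.8, epsilon=0.1,
                 max_steps=100, seed=0):
    """Sarsa(lambda) with accumulating eligibility traces and linear features."""
    rng = random.Random(seed)
    k = len(centres)
    weights = [[0.0] * k for _ in range(n_actions)]
    for _ in range(episodes):
        traces = [[0.0] * k for _ in range(n_actions)]
        state = start_state
        feats = rbf_features(state, centres)
        action = epsilon_greedy(weights, feats, n_actions, epsilon, rng)
        for _ in range(max_steps):
            next_state, reward, done = env_step(state, action, rng)
            # Decay every trace, then reinforce the trace of the action taken.
            for a in range(n_actions):
                for i in range(k):
                    traces[a][i] *= gamma * lam
            for i in range(k):
                traces[action][i] += feats[i]
            # On-policy TD target uses the action actually chosen next.
            target = reward
            if not done:
                next_feats = rbf_features(next_state, centres)
                next_action = epsilon_greedy(weights, next_feats,
                                             n_actions, epsilon, rng)
                target += gamma * q_value(weights, next_feats, next_action)
            delta = target - q_value(weights, feats, action)
            # Move all weights along their traces in proportion to the TD error.
            for a in range(n_actions):
                for i in range(k):
                    weights[a][i] += alpha * delta * traces[a][i]
            if done:
                break
            state, feats, action = next_state, next_feats, next_action
    return weights
```

Under these assumptions, calling `sarsa_lambda(corridor_step, 0, [0.0, 1.0, 2.0, 3.0, 4.0], n_actions=2)` returns one weight vector per action, and `q_value` can then be used to compare actions in a given state. Placing one basis function per cluster centre keeps the number of learned parameters tied to the number of clusters rather than to the size of the state space, which is the dimensionality argument the abstract makes.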

Publisher

SAGE Publications

Link

http://journals.sagepub.com/doi/pdf/10.1260/1748-3018.9.4.449

References (16 articles)

1. Emergency Logistics Planning in Natural Disasters

2. A Cost-Shaping Linear Program for Average-Cost Approximate Dynamic Programming with Performance Guarantees

3. Comparing neuro-dynamic programming algorithms for the vehicle routing problem with stochastic demands

4. Approximate Dynamic Programming

5. Dynamic Programming Approximations for a Stochastic Inventory Routing Problem

Cited by 2 articles

1. Measurement of Logistics Radiation Range and Improvement of Logistics Radiation Ability of City Clusters. Discrete Dynamics in Nature and Society, 2022-07-16.

2. UAV Track Planning Algorithm Based on Graph Attention Network and Deep Q Network. Lecture Notes in Computer Science, 2021.