Solving the Vehicle Routing Problem with Stochastic Travel Cost Using Deep Reinforcement Learning-Reference-Cited by-同舟云学术

Solving the Vehicle Routing Problem with Stochastic Travel Cost Using Deep Reinforcement Learning

Published:2024-08-15 Issue:16 Volume:13 Page:3242
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Cai Hao¹,Xu Peng¹,Tang Xifeng¹,Lin Gan¹

Affiliation:

1. College of Civil and Transportation Engineering, Hohai University, Xikang Road, Nanjing 210024, China

Abstract

The Vehicle Routing Problem (VRP) is a classic combinatorial optimization problem commonly encountered in the fields of transportation and logistics. This paper focuses on a variant of the VRP, namely the Vehicle Routing Problem with Stochastic Travel Cost (VRP-STC). In VRP-STC, the introduction of stochastic travel costs increases the complexity of the problem, rendering traditional algorithms unsuitable for solving it. In this paper, the GAT-AM model combining Graph Attention Networks (GAT) and multi-head Attention Mechanism (AM) is employed. The GAT-AM model uses an encoder–decoder architecture and employs a deep reinforcement learning algorithm. The GAT in the encoder learns feature representations of nodes in different subspaces, while the decoder uses multi-head AM to construct policies through both greedy and sampling decoding methods. This increases solution diversity, thereby finding high-quality solutions. The REINFORCE with Rollout Baseline algorithm is used to train the learnable parameters within the neural network. Test results show that the advantages of GAT-AM become greater as problem complexity increases, with the optimal solution generally unattainable through traditional algorithms within an acceptable timeframe.

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9292/13/16/3242/pdf

Reference43 articles.

1. Learning combinatorial optimization algorithms over graphs;Khalil;Adv. Neural Inf. Process. Syst.,2017

2. Machine learning for combinatorial optimization: A methodological tour d’horizon;Bengio;Eur. J. Oper. Res.,2021

3. Glorot, X., and Bengio, Y. (2010, January 13–15). Understanding the difficulty of training deep feedforward neural networks. Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Sardinia, Italy.

4. Reinforcement Learning: An Introduction;Sutton;IEEE Trans. Neural Netw.,1998

5. Mastering the game of Go without human knowledge;Silver;Nature,2017