Coordinated multi‐agent hierarchical deep reinforcement learning to solve multi‐trip vehicle routing problems with soft time windows-Reference-Cited by-同舟云学术

Coordinated multi‐agent hierarchical deep reinforcement learning to solve multi‐trip vehicle routing problems with soft time windows

Published:2023-06-20 Issue:10 Volume:17 Page:2034-2051
ISSN:1751-956X
Container-title:IET Intelligent Transport Systems
language:en
Short-container-title:IET Intelligent Trans Sys

Author:

Zhang Zixian¹,Qi Geqi¹,Guan Wei¹^ORCID

Affiliation:

1. Key Laboratory of Transport Industry of Big Data Application Technologies for Comprehensive Transport Ministry of Transport Beijing China

Abstract

AbstractVehicle Routing Problem (VRP) is a widespread problem in the transportation field, which challenges the intelligent level of vehicle decisions. Multi‐Trip Vehicle Routing Problem with Time Windows (MTVRPTW), as a further evolved problem of VRP considering multiple departures from one depot and temporal constraint of visiting nodes, has developed into one of the critical issues in the scheduling of logistics, bus transit, railway, and aviation. Traditionally, MTVRPTW is solved by the heuristic algorithm, which is generally time‐consuming and of non‐steady results. Reinforcement learning (RL) and multi‐agent framework have become popular in solving VRP to get better performance. However, the lack of variant dimensions in searching space and knowledge exchange between agents inhibit the further improvement of algorithms. Therefore, a Coordinated Multi‐agent Hierarchical Deep Reinforcement Learning (CMA‐HDRL) method is proposed in this study to enhance the overall solution quality and convergence rate by constructing a three‐layered structure (time, communication, and global layers), which is particularly designed to handle the state space explosion and improve the collaboration between agents. The results show that the proposed method can significantly outperform the general genetic algorithm (GA), RL, multi‐agent algorithm, and hierarchical algorithm, not only from the effectiveness on the cost consisting of travel time and penalty time but also from the operation robustness.

Funder

Fundamental Research Funds for the Central Universities

Publisher

Institution of Engineering and Technology (IET)

Subject

Law,Mechanical Engineering,General Environmental Science,Transportation

Reference38 articles.

1. Vehicle routing problems for city logistics

2. A robust multi-trip vehicle routing problem of perishable products with intermediate depots and time windows

3. A Benders decomposition-based heuristic for a production and outbound distribution scheduling problem with strict delivery constraints

4. Bi-Objective Vehicle Routing for Hazardous Materials Transportation With No Vehicles Travelling in Echelon

5. Optimizing Multi-Terminal Customized Bus Service With Mixed Fleet

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Exploring the role of blockchain technology, warehouse automation, smart routing, and cloud computing in logistics performance;Production & Manufacturing Research;2024-08-21

2. A Novel Deep Reinforcement Learning Approach for Real-Time Gate Assignment;2024

3. Vehicle Routing Problem Solving Using Reinforcement Learning;2023 26th International Conference on Computer and Information Technology (ICCIT);2023-12-13