Scheduling of AGVs in Automated Container Terminal Based on the Deep Deterministic Policy Gradient (DDPG) Using the Convolutional Neural Network (CNN)

Published:2021-12-16 Issue:12 Volume:9 Page:1439
ISSN:2077-1312
Container-title:Journal of Marine Science and Engineering
language:en
Short-container-title:JMSE

Author:

Chen Chun, Hu Zhi-Hua, Wang Lei

Abstract

To improve the horizontal transportation efficiency of Automated Guided Vehicles (AGVs) at a terminal, it is necessary to coordinate, in time and space, the loading and unloading equipment and the transportation equipment during operation, and to reduce task completion time. Traditional scheduling methods have limited dynamic response capability and are not suited to dynamic terminal operating environments. This paper therefore discusses how to use delivery task information and AGV spatiotemporal information to schedule AGVs dynamically, with the aim of minimizing task delay time and AGV travel time, and proposes a deep reinforcement learning framework. The framework combines the real-time response and flexibility of the Convolutional Neural Network (CNN) and the Deep Deterministic Policy Gradient (DDPG) algorithm, and can dynamically adjust AGV scheduling strategies according to the input spatiotemporal state information. First, the AGV scheduling process is defined as a Markov decision process: the system's spatiotemporal state information is analyzed in detail, and assignment heuristic rules and a reward reshaping mechanism are introduced to decouple the model from the AGV dynamic scheduling problem. Then, a multi-channel matrix is built to characterize the spatiotemporal state information, the CNN is used to generalize and approximate the action value functions of different states, and the DDPG algorithm is used to find the best AGV-container matching at the decision stage. The proposed model and algorithm framework are applied to experiments with different cases, and their scheduling performance is compared with that of the adaptive genetic algorithm (AGA) and the rolling horizon approach (RHPA).
The results show that, compared with a single scheduling rule, the proposed algorithm improves the average task completion time, task delay time, AGV travel time, and task delay rate by 15.63%, 56.16%, 16.36%, and 30.22%, respectively; compared with AGA and RHPA, it reduces task completion time by approximately 3.10% and 2.40%, respectively.
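
The abstract describes encoding the spatiotemporal state as a multi-channel matrix and letting the agent choose among assignment heuristic rules. The sketch below illustrates one possible reading of that idea in plain Python; it is not the authors' implementation. The channel layout, the `AGV` and `Task` fields, the two rules, and the mapping from a continuous action in [-1, 1] to a rule index are all assumptions made for illustration, and the CNN and DDPG networks themselves are omitted.

```python
from dataclasses import dataclass


@dataclass
class AGV:
    row: int
    col: int
    free_at: float  # hypothetical field: time at which the AGV becomes idle


@dataclass
class Task:
    row: int
    col: int
    due: float  # hypothetical field: latest start time without delay


def build_state(agvs, tasks, rows, cols, now):
    """Return a 3-channel grid: AGV count, remaining AGV busy time, task urgency."""
    state = [[[0.0] * cols for _ in range(rows)] for _ in range(3)]
    for a in agvs:
        state[0][a.row][a.col] += 1.0
        state[1][a.row][a.col] += max(0.0, a.free_at - now)
    for t in tasks:
        # Urgency grows as slack shrinks; overdue tasks saturate at 1.0.
        slack = max(0.0, t.due - now)
        urgency = 1.0 / (1.0 + slack)
        state[2][t.row][t.col] = max(state[2][t.row][t.col], urgency)
    return state


def nearest_agv(agvs, task):
    """Assumed rule 0: index of the AGV with the smallest Manhattan distance."""
    return min(
        range(len(agvs)),
        key=lambda i: abs(agvs[i].row - task.row) + abs(agvs[i].col - task.col),
    )


def earliest_free_agv(agvs, task):
    """Assumed rule 1: index of the AGV that becomes idle first."""
    return min(range(len(agvs)), key=lambda i: agvs[i].free_at)


RULES = [nearest_agv, earliest_free_agv]


def select_rule(action):
    """Map a continuous actor output in [-1, 1] to an index into RULES."""
    clipped = max(-1.0, min(1.0, action))
    idx = int((clipped + 1.0) / 2.0 * len(RULES))
    return min(idx, len(RULES) - 1)


if __name__ == "__main__":
    agvs = [AGV(0, 0, free_at=5.0), AGV(2, 2, free_at=0.0)]
    task = Task(0, 1, due=10.0)
    state = build_state(agvs, [task], rows=3, cols=3, now=0.0)
    rule = RULES[select_rule(0.3)]
    print(len(state), rule.__name__, rule(agvs, task))
```

In a full pipeline, the grid from `build_state` would be the CNN input, and the actor's continuous output would pass through something like `select_rule` before an AGV is matched to a container. Discretizing the action this way is one common workaround for using DDPG with a finite rule set; the abstract does not say which mapping the paper uses.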

Publisher

MDPI AG

Subject

Ocean Engineering, Water Science and Technology, Civil and Structural Engineering

Link

https://www.mdpi.com/2077-1312/9/12/1439/pdf

Reference: 33 articles.

1. Scheduling of container-handling equipment during the loading process at an automated container terminal

2. An Ant Colony Algorithm (ACA) for solving the new integrated model of job shop scheduling and conflict-free routing of AGVs

3. Integrated scheduling optimization of U-shaped automated container terminal under loading and unloading mode

4. Hybrid Scheduling for Multi-Equipment at U-Shape Trafficked Automated Terminal Based on Chaos Particle Swarm Optimization

5. A Look-Ahead Dispatching Method for Automated Guided Vehicles in Automated Port Container Terminals

Cited by 21 articles.

1. Optimization for multi-resource integrated scheduling in the automated container terminal with a parallel layout considering energy-saving;Advanced Engineering Informatics;2024-10

2. Collaborative dynamic scheduling in a self-organizing manufacturing system using multi-agent reinforcement learning;Advanced Engineering Informatics;2024-10

3. Real-time AGV scheduling optimisation method with deep reinforcement learning for energy-efficiency in the container terminal yard;International Journal of Production Research;2024-03-21

4. Digital Twins in the Context of Seaports and Terminal Facilities;Flexible Services and Manufacturing Journal;2024-01-13

5. Load balancing of multi-AGV road network based on improved Q-learning algorithm and macroscopic fundamental diagram;Complex & Intelligent Systems;2024-01-10