Intelligent Scheduling Method for Bulk Cargo Terminal Loading Process Based on Deep Reinforcement Learning-Reference-Cited by-同舟云学术

Intelligent Scheduling Method for Bulk Cargo Terminal Loading Process Based on Deep Reinforcement Learning

Published:2022-04-27 Issue:9 Volume:11 Page:1390
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Li Changan,Wu Sirui,Li Zhan^ORCID,Zhang Yuxiao,Zhang Lijie,Gomes Luis^ORCID

Abstract

Sea freight is one of the most important ways for the transportation and distribution of coal and other bulk cargo. This paper proposes a method for optimizing the scheduling efficiency of the bulk cargo loading process based on deep reinforcement learning. The process includes a large number of states and possible choices that need to be taken into account, which are currently performed by skillful scheduling engineers on site. In terms of modeling, we extracted important information based on actual working data of the terminal to form the state space of the model. The yard information and the demand information of the ship are also considered. The scheduling output of each convey path from the yard to the cabin is the action of the agent. To avoid conflicts of occupying one machine at same time, certain restrictions are placed on whether the action can be executed. Based on Double DQN, an improved deep reinforcement learning method is proposed with a fully connected network structure and selected action sets according to the value of the network and the occupancy status of environment. To make the network converge more quickly, an improved new epsilon-greedy exploration strategy is also proposed, which uses different exploration rates for completely random selection and feasible random selection of actions. After training, an improved scheduling result is obtained when the tasks arrive randomly and the yard state is random. An important contribution of this paper is to integrate the useful features of the working time of the bulk cargo terminal into a state set, divide the scheduling process into discrete actions, and then reduce the scheduling problem into simple inputs and outputs. Another major contribution of this article is the design of a reinforcement learning algorithm for the bulk cargo terminal scheduling problem, and the training efficiency of the proposed algorithm is improved, which provides a practical example for solving bulk cargo terminal scheduling problems using reinforcement learning.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/11/9/1390/pdf

Reference28 articles.

1. Research on Intelligent Optimization of Bulk Cargo Terminal Control System

2. A Machine Learning-based system for berth scheduling at bulk terminals

3. Modeling yard crane operators as reinforcement learning agents

4. The Berth Allocation Problem with Service Time and Delay Time Objectives

5. Integrated Berth Allocation and Quay Crane Assignment Problem: Set partitioning models and computational results

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An improved deep reinforcement learning approach: A case study for optimisation of berth and yard scheduling for bulk cargo terminal;Advances in Production Engineering & Management;2023-09-30

2. Research on Intelligent Dynamic Scheduling Algorithm for Automated Guided Vehicles in Container Terminal Based on Deep Reinforcement Learning;2023 IEEE International Conference on Mechatronics and Automation (ICMA);2023-08-06