Path Planning Algorithm for Unmanned Surface Vessel Based on Multiobjective Reinforcement Learning-Reference-Cited by-同舟云学术

Path Planning Algorithm for Unmanned Surface Vessel Based on Multiobjective Reinforcement Learning

Published:2023-02-15 Issue: Volume:2023 Page:1-14
ISSN:1687-5273
Container-title:Computational Intelligence and Neuroscience
language:en
Short-container-title:Computational Intelligence and Neuroscience

Author:

Yang Caipei¹,Zhao Yingqi¹,Cai Xuan²,Wei Wei²,Feng Xingxing²,Zhou Kaibo¹^ORCID

Affiliation:

1. MOE Key Laboratory of Image Information Processing and Intelligent Control, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China

2. Wuhan Second Ship Design and Research Institute, Wuhan 430205, China

Abstract

It is challenging to perform path planning tasks in complex marine environments as the unmanned surface vessel approaches the goal while avoiding obstacles. However, the conflict between the two subtarget tasks of obstacle avoidance and goal approaching makes the path planning difficult. Thus, a path planning method for unmanned surface vessel based on multiobjective reinforcement learning is proposed under the complex environment with high randomness and multiple dynamic obstacles. Firstly, the path planning scene is set as the main scene, and the two subtarget scenes including obstacle avoidance and goal approaching are divided from it. The action selection strategy in each subtarget scene is trained through the double deep Q-network with prioritized experience replay. A multiobjective reinforcement learning framework based on ensemble learning is further designed for policy integration in the main scene. Finally, by selecting the strategy from subtarget scenes in the designed framework, an optimized action selection strategy is trained and used for the action decision of the agent in the main scene. Compared with traditional value-based reinforcement learning methods, the proposed method achieves a 93% success rate in path planning in simulation scenes. Furthermore, the average length of the paths planned by the proposed method is 3.28% and 1.97% shorter than that of PER-DDQN and dueling DQN, respectively.

Funder

Marine Defense Technology Innovation Center Innovation Fund

Publisher

Hindawi Limited

Subject

General Mathematics,General Medicine,General Neuroscience,General Computer Science

Link

http://downloads.hindawi.com/journals/cin/2023/2146314.pdf

Reference36 articles.

1. Applications of marine robotic vehicles

2. Self-Adaptive Dynamic Obstacle Avoidance and Path Planning for USV Under Complex Maritime Environment

3. Path planning algorithm for unmanned surface vehicle formations in a practical maritime environment

4. Path Planning of Coastal Ships Based on Optimized DQN Reward Function

5. A constrained A* approach towards optimal path planning for an unmanned surface vehicle in a maritime environment containing dynamic obstacles and ocean currents

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Quantification of the head-on situation under Rule 14 of COLREGs with modeling of ships;Ocean & Coastal Management;2024-09

2. Application of combined SWOT and AHP (A’WOT): A case study for maritime autonomous surface ships;Turkish Journal of Maritime and Marine Sciences;2023-12-01