Intelligent Smart Marine Autonomous Surface Ship Decision System Based on Improved PPO Algorithm

Published:2022-07-31 Issue:15 Volume:22 Page:5732
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Guan Wei, Cui Zhewen, Zhang Xianku

Abstract

With the development of artificial intelligence technology, the behavior decision-making of an intelligent smart marine autonomous surface ship (SMASS) has become particularly important. This research proposed a local path planning and behavior decision-making approach based on improved Proximal Policy Optimization (PPO), which could drive an unmanned SMASS to the target without requiring any human experience. In addition, generalized advantage estimation was added to the loss function of the PPO algorithm, which allowed the baselines in PPO to be self-adjusted. First, the SMASS was modeled with the Nomoto model in a simulated waterway. Then, distances, obstacles, and prohibited areas were regularized as rewards or punishments, which were used to judge the performance and manipulation decisions of the vessel. Subsequently, improved PPO was introduced to learn the action–reward model, and the trained neural network model was used to manipulate the SMASS’s movement. To achieve higher reward values, the SMASS could find an appropriate path or navigation strategy by itself. After a sufficient number of rounds of training, a convincing path and manipulation strategies would likely be produced. Compared with existing methods, the proposed approach is more effective in self-learning and continuous optimization and thus closer to human manipulation.
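
The abstract names two building blocks, a Nomoto ship model and generalized advantage estimation (GAE), without giving details. The following is a minimal sketch of both, not the authors' implementation. It assumes the first-order Nomoto form T * dr/dt + r = K * delta with a forward-Euler step; the function names, the gain K, the time constant T, the step size dt, and the default gamma and lam values are all hypothetical choices made for illustration.

```python
def nomoto_step(heading, yaw_rate, rudder, K=0.5, T=10.0, dt=0.1):
    """One forward-Euler step of an assumed first-order Nomoto model.

    T * d(yaw_rate)/dt + yaw_rate = K * rudder, and d(heading)/dt = yaw_rate.
    K, T and dt are illustrative values, not taken from the paper.
    """
    next_heading = heading + dt * yaw_rate
    next_yaw_rate = yaw_rate + dt * (K * rudder - yaw_rate) / T
    return next_heading, next_yaw_rate


def gae_advantages(rewards, values, gamma=0.99, lam=0.95):
    """Generalized advantage estimates for one trajectory.

    rewards has length N; values has length N + 1, where the last entry
    is the bootstrap value of the state reached after the final step.
    """
    if len(values) != len(rewards) + 1:
        raise ValueError("values must have one more entry than rewards")
    advantages = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step reuses the discounted sum of later residuals.
    for t in reversed(range(len(rewards))):
        td_residual = rewards[t] + gamma * values[t + 1] - values[t]
        running = td_residual + gamma * lam * running
        advantages[t] = running
    return advantages
```

In a PPO update, advantages of this kind would typically weight the clipped policy-ratio term of the loss, with lam trading bias against variance in the estimate.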

Funder

Dalian Innovation Team Support Plan in the Key Research Field

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/22/15/5732/pdf

References: 32 articles.

1. Actor-Network Theory as a Framework to Analyse Technology Acceptance Model’s External Variables: The Case of Autonomous Vehicles;Seuwou,2017

2. Avalon

3. A Path-Planning Strategy for Unmanned Surface Vehicles Based on an Adaptive Hybrid Dynamic Stepsize and Target Attractive Force-RRT Algorithm

4. Self-Adaptive Dynamic Obstacle Avoidance and Path Planning for USV Under Complex Maritime Environment

5. The Obstacle Avoidance Planning of USV Based on Improved Artificial Potential Field;Xie;Proceedings of the IEEE International Conference on Information and Automation (ICIA),2014

Cited by 15 articles.

1. A Novel Dynamically Adjusted Entropy Algorithm for Collision Avoidance in Autonomous Ships Based on Deep Reinforcement Learning;Journal of Marine Science and Engineering;2024-09-05

2. Collision avoidance decision-making strategy for multiple USVs based on Deep Reinforcement Learning algorithm;Ocean Engineering;2024-09

3. Study on Improving the Navigational Safety Evaluation Methodology based on Autonomous Operation Technology;Journal of the Korean Society of Marine Environment and Safety;2024-02-29

4. Computational Intelligence Supporting the Safe Control of Autonomous Multi-Objects;Electronics;2024-02-16

5. Intelligent decision-making system for multiple marine autonomous surface ships based on deep reinforcement learning;Robotics and Autonomous Systems;2024-02