Research on Obstacle Avoidance Planning for UUV Based on A3C Algorithm

Published:2023-12-26 Issue:1 Volume:12 Page:63
ISSN:2077-1312
Container-title:Journal of Marine Science and Engineering
language:en
Short-container-title:JMSE

Author:

Wang Hongjian¹, Gao Wei¹, Wang Zhao¹, Zhang Kai¹, Ren Jingfei¹, Deng Lihui¹˒², He Shanshan¹

Affiliation:

1. College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, China

2. Tianjin Navigation and Instrument Institute, Tianjin 300130, China

Abstract

Deep reinforcement learning is an artificial intelligence technology that combines deep learning with reinforcement learning and has been widely applied in many fields. As a deep reinforcement learning algorithm, A3C (Asynchronous Advantage Actor-Critic) makes effective use of computing resources and improves training efficiency by training Actor-Critic agents asynchronously in multiple threads. Inspired by the strong performance of A3C, this paper applies it to the UUV (Unmanned Underwater Vehicle) collision avoidance planning problem in unknown environments. The resulting collision avoidance planning algorithm can plan in real time while ensuring a shorter path length, and its output actions satisfy the kinematic constraints of UUVs. For the UUV collision avoidance planning problem, this paper designs the state space, action space, and reward function. Simulation results show that the A3C collision avoidance planning algorithm can guide a UUV around obstacles to the preset target point. The planned path satisfies the heading constraints of the UUV, and the planning time is short enough to meet the requirements of real-time planning.
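
The "advantage" in A3C can be illustrated with a short sketch. The Python below is not the authors' implementation; it shows only the discounted n-step return and advantage calculation that A3C workers commonly perform on a short rollout before sending gradients to the shared network. The function name, argument names, and the discount factor `gamma` are assumptions made for illustration, and the paper's own state space, action space, and reward function are not reproduced here.

```python
from typing import List, Tuple


def n_step_returns_and_advantages(
    rewards: List[float],
    values: List[float],
    bootstrap_value: float,
    gamma: float = 0.99,  # assumed discount factor, not taken from the paper
) -> Tuple[List[float], List[float]]:
    """Compute discounted n-step returns and advantages for one rollout segment.

    rewards[t] is the reward received after the action at step t,
    values[t] is the critic's estimate V(s_t), and bootstrap_value is the
    critic's estimate for the state after the last step (0.0 if the
    episode ended there).
    """
    if len(rewards) != len(values):
        raise ValueError("rewards and values must have the same length")

    returns = [0.0] * len(rewards)
    running = bootstrap_value
    # Walk backwards so each return reuses the one computed after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running

    # Advantage: how much better the observed return was than the critic expected.
    advantages = [ret - val for ret, val in zip(returns, values)]
    return returns, advantages
```

In a typical A3C setup, the advantages weight the actor's policy-gradient term and the returns serve as regression targets for the critic; each worker thread would run this on its own rollout independently of the others.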

Funder

National Science and Technology Innovation Special Zone Project

National Key Laboratory of Underwater Robot Technology Fund

a special program to guide high-level scientific research

Publisher

MDPI AG

Subject

Ocean Engineering, Water Science and Technology, Civil and Structural Engineering

Link

https://www.mdpi.com/2077-1312/12/1/63/pdf

Reference: 32 articles.

1. Bio-Inspired Neural Network-Based Optimal Path Planning for UUVs Under the Effect of Ocean Currents;Zhu;IEEE Trans. Intell. Veh.,2021

2. Yue, Y., Hao, W., Guanjie, H., and Yao, Y. (2023, January 7–9). UUV Target Tracking Path Planning Algorithm Based on Deep Reinforcement Learning. Proceedings of the 2023 8th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS), Xi’an, China.

3. Path Planning Technologies for Autonomous Underwater Vehicles-A Review;Li;IEEE Access,2019

4. Cai, Y., Zhang, E., Qi, Y., and Lu, L. (2022, January 28–30). A Review of Research on the Application of Deep Reinforcement Learning in Unmanned Aerial Vehicle Resource Allocation and Trajectory Planning. Proceedings of the 2022 4th International Conference on Machine Learning, Big Data and Business Intelligence (MLBDBI), Shanghai, China.

5. Deep reinforcement learning based mobile robot navigation: A review;Zhu;Tsinghua Sci. Technol.,2021

Cited by 1 article.

1. A Comparative Analysis of Computational Intelligence Methods for Autonomous Navigation of Smart Ships;Electronics;2024-04-04