Abstract
Because the motion model is unknown and the underwater environment is complex, target tracking for autonomous underwater vehicles (AUVs) has become a major difficulty for model-based controllers. The AUV target tracking task is therefore modeled as a Markov decision process (MDP) with unknown state transition probabilities. Based on the actor–critic framework and the experience replay technique, a model-free reinforcement learning algorithm is proposed to realize dynamic target tracking for AUVs. To improve the performance of the algorithm, an adaptive experience replay scheme is further proposed. Specifically, the proposed algorithm uses an experience replay buffer to store and shuffle the samples, which breaks their temporal correlation so that time-series samples can be used to train the neural network. Sample priority is then assigned according to the temporal difference error, and adaptive parameters are introduced into the priority calculation, thus improving the experience replay rules. The results confirm that the proposed algorithm learns quickly and stably when tracking dynamic targets in various motion states. The results also demonstrate good control performance in terms of both stability and computational complexity, indicating the effectiveness of the proposed algorithm in target tracking tasks.
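The replay mechanism described in the abstract can be illustrated with a minimal sketch of proportional prioritized experience replay, in which a sample's chance of being drawn grows with the magnitude of its temporal difference error. The abstract does not specify the paper's adaptive parameters, so this is not the authors' scheme: the class `PrioritizedReplayBuffer`, the exponent `alpha`, the offset `eps`, and the linear schedule `linear_alpha` are all hypothetical names and choices made for illustration only.

```python
import random


class PrioritizedReplayBuffer:
    """Fixed-size buffer that samples transitions in proportion to |TD error|.

    Illustrative only: alpha and eps are assumed parameters, not the paper's.
    """

    def __init__(self, capacity, alpha=0.6, eps=1e-3):
        self.capacity = capacity
        self.alpha = alpha  # 0 gives uniform sampling; larger values favor high-error samples
        self.eps = eps      # small offset added to every stored error magnitude
        self.samples = []
        self.raw_priorities = []  # |TD error| + eps, stored without the exponent
        self.next_slot = 0

    def add(self, transition, td_error):
        raw = abs(td_error) + self.eps
        if len(self.samples) < self.capacity:
            self.samples.append(transition)
            self.raw_priorities.append(raw)
        else:
            # Buffer is full: overwrite the oldest entry.
            self.samples[self.next_slot] = transition
            self.raw_priorities[self.next_slot] = raw
        self.next_slot = (self.next_slot + 1) % self.capacity

    def weights(self):
        # The exponent is applied at sampling time, so changing alpha
        # takes effect immediately for every stored sample.
        return [p ** self.alpha for p in self.raw_priorities]

    def sample(self, batch_size, rng=random):
        # Draw with replacement; this also shuffles the time-ordered samples.
        indices = rng.choices(range(len(self.samples)), weights=self.weights(), k=batch_size)
        return indices, [self.samples[i] for i in indices]

    def update(self, indices, td_errors):
        # Refresh priorities after the critic recomputes the TD errors.
        for i, err in zip(indices, td_errors):
            self.raw_priorities[i] = abs(err) + self.eps


def linear_alpha(step, total_steps, start=0.6, end=0.0):
    """Hypothetical schedule: move alpha linearly from start to end over training."""
    frac = min(max(step / total_steps, 0.0), 1.0)
    return start + frac * (end - start)
```

In a training loop, one would typically call `add` after each environment step, set `buffer.alpha = linear_alpha(step, total_steps)` before drawing a batch with `sample`, and pass the critic's fresh TD errors to `update`. Storing the raw error magnitudes and applying the exponent only when sampling is a design choice that lets the adaptive parameter change without rewriting the buffer.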
Funder
Opening Research Fund of National Engineering Laboratory for Test and Experiment Technology of Marine Engineering Equipment
Subject
Ocean Engineering, Water Science and Technology, Civil and Structural Engineering
Cited by
7 articles.