Affiliation:
1. School of Mechanical Engineering and Automation, Harbin Institute of Technology (Shenzhen), Shenzhen 518055, China
2. Department of Civil Engineering, University of Hong Kong, Hong Kong, China
Abstract
Autonomous Underwater Vehicles (AUVs) and underwater vehicle-manipulator systems often have large model uncertainties arising from degraded or damaged thrusters, varying payloads, disturbances from currents, etc. Additional constraints, such as input dead zones and saturations, make feedback controllers difficult to tune online. Model-free Reinforcement Learning (RL) has been applied to AUV control, but most results have been validated only through numerical simulations. The trained controllers often perform unsatisfactorily on real AUVs because the distribution of the AUV dynamics in numerical simulations is mismatched with that of the real vehicle. This paper presents model-free RL via Data-informed Domain Randomization (DDR) for controlling AUVs, in which the mismatch between trajectory data from numerical simulations and from the real AUV is minimized by adjusting the parameters of the simulated AUVs. The DDR strategy extends existing adaptive domain randomization techniques by aggregating an input network that learns mappings between control signals across domains, enabling the controller to adapt to sudden changes in dynamics. The proposed RL via DDR was tested on AUV pose regulation through extensive numerical simulations and experiments in a lab tank equipped with an underwater positioning system. The results demonstrate the effectiveness of RL-DDR for transferring trained controllers to AUVs with different dynamics.
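The core data-informed step described above (adjusting simulator parameters so that simulated trajectories match logged real-vehicle trajectories) can be illustrated with a minimal sketch. This is not the paper's implementation; it assumes a toy one-dimensional surge model with hypothetical drag and thruster-gain parameters, and fits them by grid search over a mean-squared trajectory error:

```python
import numpy as np


def simulate(theta, controls, x0=0.0, dt=0.1):
    """Toy 1D surge dynamics: x' = -drag * x + gain * u (hypothetical model)."""
    drag, gain = theta
    x, traj = x0, [x0]
    for u in controls:
        x = x + dt * (-drag * x + gain * u)  # forward-Euler step
        traj.append(x)
    return np.array(traj)


def fit_sim_params(real_traj, controls, candidates):
    """Pick the simulator parameters minimizing sim-vs-real trajectory mismatch."""
    best_theta, best_err = None, np.inf
    for theta in candidates:
        err = np.mean((simulate(theta, controls) - real_traj) ** 2)
        if err < best_err:
            best_theta, best_err = theta, err
    return best_theta, best_err


# Stand-in for logged real-AUV data: a rollout under "true" (unknown) parameters.
rng = np.random.default_rng(0)
true_theta = (0.8, 1.5)
controls = rng.uniform(-1.0, 1.0, size=50)
real_traj = simulate(true_theta, controls)

# Grid of candidate (drag, gain) pairs to randomize/search over.
candidates = [(d, g)
              for d in np.linspace(0.1, 1.5, 15)
              for g in np.linspace(0.5, 2.5, 21)]
theta_hat, err = fit_sim_params(real_traj, controls, candidates)
```

In the paper's setting the search would run over the full AUV dynamics parameters and the RL policy would then be trained on the calibrated, randomized simulator; the grid search here merely stands in for that calibration step.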
Funder
National Natural Science Foundation of China
Shenzhen Science and Technology Innovation Foundation
Subject
Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science
Cited by
1 article.