A Self-Adaptive Double Q-Backstepping Trajectory Tracking Control Approach Based on Reinforcement Learning for Mobile Robots-Reference-Cited by-同舟云学术

A Self-Adaptive Double Q-Backstepping Trajectory Tracking Control Approach Based on Reinforcement Learning for Mobile Robots

Published:2023-08-14 Issue:8 Volume:12 Page:326
ISSN:2076-0825
Container-title:Actuators
language:en
Short-container-title:Actuators

Author:

He Naifeng¹,Yang Zhong¹,Fan Xiaoliang²,Wu Jiying¹^ORCID,Sui Yaoyu¹,Zhang Qiuyan³

Affiliation:

1. College of Automation Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China

2. State Key Laboratory of Robotics, Shenyang Institute of Automation Chinese Academy of Sciences, Shenyang 110017, China

3. Electric Power Research Institute of Guizhou Power Grid Co., Ltd., Guiyang 550002, China

Abstract

When a mobile robot inspects tasks with complex requirements indoors, the traditional backstepping method cannot guarantee the accuracy of the trajectory, leading to problems such as the instrument not being inside the image and focus failure when the robot grabs the image with high zoom. In order to solve this problem, this paper proposes an adaptive backstepping method based on double Q-learning for tracking and controlling the trajectory of mobile robots. We design the incremental model-free algorithm of Double-Q learning, which can quickly learn to rectify the trajectory tracking controller gain online. For the controller gain rectification problem in non-uniform state space exploration, we propose an incremental active learning exploration algorithm that incorporates memory playback as well as experience playback mechanisms to achieve online fast learning and controller gain rectification for agents. To verify the feasibility of the algorithm, we perform algorithm verification on different types of trajectories in Gazebo and physical platforms. The results show that the adaptive trajectory tracking control algorithm can be used to rectify the mobile robot trajectory tracking controller’s gain. Compared with the Backstepping-Fractional-Older PID controller and Fuzzy-Backstepping controller, Double Q-backstepping has better robustness, generalization, real-time, and stronger anti-disturbance capability.

Funder

Guizhou Provincial Science and Technology Projects

research and application of intelligent system for data collection, transmission and repair of training sites

Publisher

MDPI AG

Subject

Control and Optimization,Control and Systems Engineering

Link

https://www.mdpi.com/2076-0825/12/8/326/pdf

Reference50 articles.

1. Multi-objective approach for robot motion planning in search tasks;Jeddisaravi;Appl. Intell.,2016

2. Intelligent trajectory planner and generalised proportional integral control for two carts equipped with a red-green-blue depth sensor on a circular rail;Panduro;Integr. Comput. Eng.,2020

3. Robust output feedback control for the trajectory tracking of robotic wheelchairs;Chocoteco;Robotica,2014

4. Vaidyanathan, S., and Azar, A.T. (2018). Backstepping Control of Nonlinear Dynamical Systems, Elsevier.

5. Zheng, F., and Gao, W. (2011, January 25–28). Adaptive integral backstepping control of a Micro-Quadrotor. Proceedings of the International Conference on Intelligent Control & Information Processing, Harbin, China.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An Overview of Model-Free Adaptive Control for the Wheeled Mobile Robot;World Electric Vehicle Journal;2024-08-29

2. A Supervised Reinforcement Learning Algorithm for Controlling Drone Hovering;Drones;2024-02-20

3. Indirect Adaptive Control Using Neural Network and Discrete Extended Kalman Filter for Wheeled Mobile Robot;Actuators;2024-01-30