A Self-Adaptive Double Q-Backstepping Trajectory Tracking Control Approach Based on Reinforcement Learning for Mobile Robots

Author:

He Naifeng1,Yang Zhong1,Fan Xiaoliang2,Wu Jiying1ORCID,Sui Yaoyu1,Zhang Qiuyan3

Affiliation:

1. College of Automation Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China

2. State Key Laboratory of Robotics, Shenyang Institute of Automation Chinese Academy of Sciences, Shenyang 110017, China

3. Electric Power Research Institute of Guizhou Power Grid Co., Ltd., Guiyang 550002, China

Abstract

When a mobile robot inspects tasks with complex requirements indoors, the traditional backstepping method cannot guarantee the accuracy of the trajectory, leading to problems such as the instrument not being inside the image and focus failure when the robot grabs the image with high zoom. In order to solve this problem, this paper proposes an adaptive backstepping method based on double Q-learning for tracking and controlling the trajectory of mobile robots. We design the incremental model-free algorithm of Double-Q learning, which can quickly learn to rectify the trajectory tracking controller gain online. For the controller gain rectification problem in non-uniform state space exploration, we propose an incremental active learning exploration algorithm that incorporates memory playback as well as experience playback mechanisms to achieve online fast learning and controller gain rectification for agents. To verify the feasibility of the algorithm, we perform algorithm verification on different types of trajectories in Gazebo and physical platforms. The results show that the adaptive trajectory tracking control algorithm can be used to rectify the mobile robot trajectory tracking controller’s gain. Compared with the Backstepping-Fractional-Older PID controller and Fuzzy-Backstepping controller, Double Q-backstepping has better robustness, generalization, real-time, and stronger anti-disturbance capability.

Funder

Guizhou Provincial Science and Technology Projects

research and application of intelligent system for data collection, transmission and repair of training sites

Publisher

MDPI AG

Subject

Control and Optimization,Control and Systems Engineering

Reference50 articles.

1. Multi-objective approach for robot motion planning in search tasks;Jeddisaravi;Appl. Intell.,2016

2. Intelligent trajectory planner and generalised proportional integral control for two carts equipped with a red-green-blue depth sensor on a circular rail;Panduro;Integr. Comput. Eng.,2020

3. Robust output feedback control for the trajectory tracking of robotic wheelchairs;Chocoteco;Robotica,2014

4. Vaidyanathan, S., and Azar, A.T. (2018). Backstepping Control of Nonlinear Dynamical Systems, Elsevier.

5. Zheng, F., and Gao, W. (2011, January 25–28). Adaptive integral backstepping control of a Micro-Quadrotor. Proceedings of the International Conference on Intelligent Control & Information Processing, Harbin, China.

Cited by 3 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3