Affiliation:
1. National University of Sciences and Technology (NUST)
2. Horizon Tech Pvt Limited, National University of Sciences and Technology (NUST)
Abstract
Abstract
To ensure the steady navigation for robot stable controls are one of the basic requirements. Control values selection is highly environment dependent. To ensure reusability of control parameter system needs to generalize over the environment. Adding adaptability in robots to perform effectively in the environments with no prior knowledge reinforcement leaning is a promising approach. However, tuning hyper parameters and attaining correlation between state space and reward function to train a stable reinforcement learning agent is a challenge. In this paper we designed a continuous reward function to minimizing the sparsity and stabilizes the policy convergence, to attain control generalization for differential drive robot. We Implemented Twin Delayed Deep Deterministic Policy Gradient on Open-AI Gym Race Car. System was trained to achieve smart primitive control policy, moving forward in the direction of goal by maintaining an appropriate distance from walls to avoid collisions. Resulting policy was tested on unseen environments including dynamic goal environment, boundary free environment and continuous path environment on which it outperformed Deep Deterministic Policy Gradient.
Publisher
Research Square Platform LLC
Reference17 articles.
1. Cooper, S., Di Fava, A., Vivas, C., Marchionni, L., & Ferro, F. (2020). “ARI: The Social Assistive Robot and Companion,” 29th IEEE Int. Conf. Robot Hum. Interact. Commun. RO-MAN 2020, pp. 745–751, doi: 10.1109/RO-MAN47096.2020.9223470.
2. A review of mobile robots: Concepts, methods, theoretical framework, and applications;Rubio F;Int J Adv Robot Syst,2019
3. Peters, D. N. J. (2011). “Model learning for robot control: a survey,” pp.319–340, doi: 10.1007/s10339-011-0404-1.
4. Reinforcement Learning versus Conventional Control for Controlling a Planar Bi-rotor Platform with Tail Appendage;Ugurlu HI;J Intell Robot Syst Theory Appl,2021
5. MIT Cheetah 3: Design and Control of a Robust, Dynamic Quadruped Robot;Bledt G;IEEE Int Conf Intell Robot Syst,2018