Abstract
The Deep Deterministic Policy Gradient (DDPG) algorithm is a reinforcement learning algorithm that combines Q-learning with a policy. Nevertheless, this algorithm generates failures that are not well understood. Rather than looking for those errors, this study presents a way to evaluate the suitability of the results obtained. Using the purpose of autonomous vehicle navigation, the DDPG algorithm is applied, obtaining an agent capable of generating trajectories. This agent is evaluated in terms of stability through the Lyapunov function, verifying if the proposed navigation objectives are achieved. The reward function of the DDPG is used because it is unknown if the neural networks of the actor and the critic are correctly trained. Two agents are obtained, and a comparison is performed between them in terms of stability, demonstrating that the Lyapunov function can be used as an evaluation method for agents obtained by the DDPG algorithm. Verifying the stability at a fixed future horizon, it is possible to determine whether the obtained agent is valid and can be used as a vehicle controller, so a task-satisfaction assessment can be performed. Furthermore, the proposed analysis is an indication of which parts of the navigation area are insufficient in training terms.
Subject
General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)
Reference43 articles.
1. Xie, L., Scheifele, C., Xu, W., and Stol, K.A. (2015, January 6–8). Heavy-duty omni-directional Mecanum-wheeled robot for autonomous navigation: System development and simulation realization. Proceedings of the 015 IEEE International Conference on Mechatronics (ICM), Nagoya, Japan.
2. Piemngam, K., Nilkhamhang, I., and Bunnun, P. (2019, January 16–18). Development of Autonomous Mobile Robot Platform with Mecanum Wheels. Proceedings of the 2019 First International Symposium on Instrumentation, Control, Artificial Intelligence, and Robotics (ICA-SYMP), Bangkok, Thailand.
3. Inertial navigation system for an automatic guided vehicle with Mecanum wheels;Kim;Int. J. Precis. Eng. Manuf.,2012
4. Li, Y., Dai, S., Shi, Y., Zhao, L., and Ding, M. (2019). Navigation Simulation of a Mecanum Wheel Mobile Robot Based on an Improved A* Algorithm in Unity3D. Sensors, 19.
5. Path Planning for Smart Car Based on Dijkstra Algorithm and Dynamic Window Approach;Liu;Wirel. Commun. Mob. Comput.,2021
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献