Affiliation:
1. Department of Electronic Engineering, Tor Vergata University of Rome, Via del Politecnico 1, 00133 Rome, Italy
Abstract
This research explores the use of Q-Learning for real-time swarm (Q-RTS) multi-agent reinforcement learning (MARL) algorithm for robotic applications. This study investigates the efficacy of Q-RTS in the reducing convergence time to a satisfactory movement policy through the successful implementation of four and eight trained agents. Q-RTS has been shown to significantly reduce search time in terms of training iterations, from almost a million iterations with one agent to 650,000 iterations with four agents and 500,000 iterations with eight agents. The scalability of the algorithm was addressed by testing it on several agents’ configurations. A central focus was placed on the design of a sophisticated reward function, considering various postures of the agents and their critical role in optimizing the Q-learning algorithm. Additionally, this study delved into the robustness of trained agents, revealing their ability to adapt to dynamic environmental changes. The findings have broad implications for improving the efficiency and adaptability of robotic systems in various applications such as IoT and embedded systems. The algorithm was tested and implemented using the Georgia Tech Robotarium platform, showing its feasibility for the above-mentioned applications.
Reference26 articles.
1. Deep learning in neural networks: An overview;Schmidhuber;Neural Netw.,2015
2. Fault diagnosis of actuator damage in UAVs using embedded recorded data and stacked machine learning models;Jaber;J. Supercomput.,2024
3. Approximated computing for low power neural networks;Cardarilli;Telkomnika Telecommun. Comput. Electron. Control,2019
4. Simonetta, A., Paoletti, M.C., and Nakajima, T. (2023, January 4). The SQuaRE Series as a Guarantee of Ethics in the Results of AI systems. Proceedings of the 11th International Workshop on Quantitative Approaches to Software Quality, Seoul, Republic of Korea.
5. Jaber, A.A., and Bicker, R. (2014, January 28–30). The optimum selection of wavelet transform parameters for the purpose of fault detection in an industrial robot. Proceedings of the 2014 IEEE International Conference on Control System, Computing and Engineering (ICCSCE 2014), Penang, Malaysia.