Optimized-Weighted-Speedy Q-Learning Algorithm for Multi-UGV in Static Environment Path Planning under Anti-Collision Cooperation Mechanism-Reference-Cited by-同舟云学术

Optimized-Weighted-Speedy Q-Learning Algorithm for Multi-UGV in Static Environment Path Planning under Anti-Collision Cooperation Mechanism

Published:2023-05-27 Issue:11 Volume:11 Page:2476
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Cao Yuanying¹^ORCID,Fang Xi¹

Affiliation:

1. School of Science, Wuhan University of Technology, Wuhan 430070, China

Abstract

With the accelerated development of smart cities, the concept of a “smart industrial park” in which unmanned ground vehicles (UGVs) have wide application has entered the industrial field of vision. When faced with multiple tasks and heterogeneous tasks, the task execution efficiency of a single UGV is inefficient, thus the task planning research under multi-UGV cooperation has become more urgent. In this paper, under the anti-collision cooperation mechanism for multi-UGV path planning, an improved algorithm with optimized-weighted-speedy Q-learning (OWS Q-learning) is proposed. The slow convergence speed of the Q-learning algorithm is overcome to a certain extent by changing the update mode of the Q function. By improving the selection mode of learning rate and the selection strategy of action, the relationship between exploration and utilization is balanced, and the learning efficiency of multi-agent in complex environments is improved. The simulation experiments in static environment show that the designed anti-collision coordination mechanism effectively solves the coordination problem of multiple UGVs in the same scenario. In the same experimental scenario, compared with the Q-learning algorithm and other reinforcement learning algorithms, only the OWS Q-learning algorithm achieves the convergence effect, and the OWS Q-learning algorithm has the shortest collision-free path for UGVS and the least time to complete the planning. Compared with the Q-learning algorithm, the calculation time of the OWS Q-learning algorithm in the three experimental scenarios is improved by 53.93%, 67.21%, and 53.53%, respectively. This effectively improves the intelligent development of UGV in smart parks.

Funder

Equipment Pre-Research Ministry of Education Joint Fund

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/11/11/2476/pdf

Reference62 articles.

1. The fourth industrial revolution and the age of intelligence;Chu;China’s Ind. Informatiz.,2022

2. Vision-aware air-ground cooperative target localization for UAV and UGV;Bao;Aerosp. Sci. Technol.,2022

3. Lin, S., Liu, A., Wang, J., and Kong, X. (2022). A review of path-planning approaches for multiple mobile robots. Machines, 10.

4. Ravankar, A., Ravankar, A.A., Kobayashi, Y., and Emaru, T. (2017). Symbiotic navigation in multi-robot systems with remote obstacle knowledge sharing. Sensors, 17.

5. Modified continuous ant colony optimisation for multiple unmanned ground vehicle path planning;Liu;Expert Syst. Appl.,2022

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Deep reinforcement learning and ant colony optimization supporting multi‐UGV path planning and task assignment in 3D environments;IET Intelligent Transport Systems;2024-07-10

2. Multi-Vehicle Collaborative Planning Technology under Automatic Driving;Sustainability;2024-05-28

3. Mobile Robot Path Planning Based on Kinematically Constrained A-Star Algorithm and DWA Fusion Algorithm;Mathematics;2023-11-05

4. Survey of Methods Applied in Cooperative Motion Planning of Multiple Robots;Motion Planning for Dynamic Agents;2023-08-24