Pursuit Path Planning for Multiple Unmanned Ground Vehicles Based on Deep Reinforcement Learning

Author:

Guo Hongda1,Xu Youchun1,Ma Yulin2,Xu Shucai3,Li Zhixiong4ORCID

Affiliation:

1. Army Military Transportation University, Tianjin 300161, China

2. School of Mechanical Engineering, Anhui Polytechnic University, Wuhu 241000, China

3. State Key Laboratory of Automotive Safety and Energy, Tsinghua University, Beijing 100084, China

4. Suzhou Automotive Research Institute, Tsinghua University, Suzhou 125000, China

Abstract

Path planning plays a crucial role in the execution of pursuit tasks for multiple unmanned ground vehicles (multi-UGVs). Although existing popular path-planning methods can achieve the pursuit goals, they suffer from some drawbacks such as long computation time and excessive path inflection points. To address these issues, this paper combines gradient descent and deep reinforcement learning (DRL) to solve the problem of excessive path inflection points from a path-smoothing perspective. In addition, the prioritized experience replay (PER) method is incorporated to enhance the learning efficiency of DRL. By doing so, the proposed model integrates PER, gradient descent, and a multiple-agent double deep Q-learning network (PER-GDMADDQN) to enable the path planning and obstacle avoidance capabilities of multi-UGVs. Experimental results demonstrate that the proposed PER-GDMADDQN yields superior performance in the pursuit problem of multi-UGVs, where the training speed and smoothness of the proposed method outperform other popular algorithms. As a result, the proposed method enables satisfactory path planning for multi-UGVs.

Funder

Chinese People's Liberation Army

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Reference41 articles.

1. Fashey, H.K.M., and Miller, M.J. (2017). Unmanned Systems Integrated Roadmap Fiscal Year 2017–2042, United States Department of Defense.

2. Multi-UAV finite-time ring formation control considering internal collision avoidance;Liu;J. Mech. Eng.,2022

3. Improved RRT*Algorithm for Automatic Charging Robot Obstacle Avoidance Path Planning in Complex Environments;Xu;Comput. Model. Eng. Sci.,2023

4. Jiang, W., Huang, R., and Zhao, Y. (2021, January 3–4). Research on cooperative capture method of USVs. Proceedings of the 9th Academic Conference Professional, Beijing, China.

5. The hybrid path planning algorithm based on improved A* and artificial potential field for unmanned surface vehicle formations;Sang;Ocean Eng.,2021

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3