T3OMVP: A Transformer-Based Time and Team Reinforcement Learning Scheme for Observation-Constrained Multi-Vehicle Pursuit in Urban Area

Published: 2022-04-22
Journal: Electronics, Volume 11, Issue 9, Page 1339
ISSN: 2079-9292
Language: en

Author:

Yuan Zheng, Wu Tianhao, Wang Qinwen, Yang Yiying, Li Lei, Zhang Lin

Abstract

Smart Internet of Vehicles (IoV) combined with Artificial Intelligence (AI) will contribute to vehicle decision-making in the Intelligent Transportation System (ITS). Multi-vehicle pursuit (MVP) games, in which multiple vehicles cooperate to capture mobile targets, are becoming an active research topic. Although there has been progress on MVP in open-space environments, urban areas pose additional challenges: complicated road structures and restricted moving space. This paper defines an observation-constrained MVP (OMVP) problem and proposes a transformer-based time and team reinforcement learning scheme (T3OMVP) to address it. First, a new multi-vehicle pursuit model is constructed based on Decentralized Partially Observable Markov Decision Processes (Dec-POMDPs) to instantiate the problem. Second, QMIX is redefined for the OMVP problem by leveraging a transformer-based observation sequence and combining the vehicles' observations to reduce the influence of constrained observations. Third, a simulated urban environment is built to verify the proposed scheme. Extensive experimental results show that T3OMVP improves on state-of-the-art QMIX approaches by 9.66% to 106.25%, from simple to difficult scenarios.
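For readers unfamiliar with QMIX, the baseline named in the abstract, the following is a minimal sketch of its core idea: per-agent Q-values are combined into a team value by a mixing network whose weights are generated from the global state and forced to be non-negative, so the team value never decreases when any single agent's Q-value increases. This is not the authors' T3OMVP implementation. The transformer-based observation encoding is omitted, the hypernetworks are reduced to single random linear layers, and the names `MonotonicMixer`, `mix`, `linear`, and `elu` are hypothetical, chosen only for illustration.

```python
import math
import random


def elu(x):
    """ELU activation (monotonically increasing)."""
    return x if x > 0 else math.exp(x) - 1.0


def linear(weights, vec):
    """Bias-free linear map; `weights` is a list of rows."""
    return [sum(w * v for w, v in zip(row, vec)) for row in weights]


class MonotonicMixer:
    """Toy QMIX-style mixer (illustrative only, untrained random weights)."""

    def __init__(self, n_agents, state_dim, hidden=4, seed=0):
        rng = random.Random(seed)

        def rand(rows, cols):
            return [[rng.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]

        self.n_agents, self.hidden = n_agents, hidden
        # Hypernetworks: map the global state to the mixer's weights and biases.
        # Simplified to single linear layers for brevity (an assumption).
        self.hyper_w1 = rand(n_agents * hidden, state_dim)
        self.hyper_b1 = rand(hidden, state_dim)
        self.hyper_w2 = rand(hidden, state_dim)
        self.hyper_b2 = rand(1, state_dim)

    def mix(self, agent_qs, state):
        """Combine per-agent Q-values into a single team value Q_tot."""
        # abs() keeps mixing weights non-negative, which makes Q_tot
        # monotonically non-decreasing in every agent's Q-value.
        w1 = [abs(x) for x in linear(self.hyper_w1, state)]
        b1 = linear(self.hyper_b1, state)
        w2 = [abs(x) for x in linear(self.hyper_w2, state)]
        b2 = linear(self.hyper_b2, state)[0]
        hidden = [
            elu(sum(agent_qs[i] * w1[i * self.hidden + j]
                    for i in range(self.n_agents)) + b1[j])
            for j in range(self.hidden)
        ]
        return sum(h * w for h, w in zip(hidden, w2)) + b2


if __name__ == "__main__":
    mixer = MonotonicMixer(n_agents=3, state_dim=5)
    state = [0.5, -0.2, 0.1, 0.9, -0.7]
    low = mixer.mix([0.1, 0.2, 0.3], state)
    high = mixer.mix([0.1, 0.9, 0.3], state)  # only agent 1's Q-value raised
    print(high >= low)
```

The monotonicity constraint is what lets each vehicle pick its own greedy action from local observations while remaining consistent with the team-level value.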

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/11/9/1339/pdf

References: 22 articles.

1. On the Design of Mutual Authentication and Key Agreement Protocol in Internet of Vehicles-Enabled Intelligent Transportation System

2. Internet of Vehicles: Architecture, Protocols, and Security

3. A Traffic-Aware Federated Imitation Learning Framework for Motion Control at Unsignalized Intersections with Internet of Vehicles

4. Cooperative Multiagent Deep Deterministic Policy Gradient (CoMADDPG) for Intelligent Connected Transportation with Unsignalized Intersection

Cited by 9 articles.

1. A survey on collaborative hunting with robotic swarm: Key technologies and application scenarios;Neurocomputing;2024-09

2. Progression Cognition Reinforcement Learning With Prioritized Experience for Multi-Vehicle Pursuit;IEEE Transactions on Intelligent Transportation Systems;2024-08

3. Transformer in reinforcement learning for decision-making: a survey;Frontiers of Information Technology & Electronic Engineering;2024-06

4. Pursuit Path Planning for Multiple Unmanned Ground Vehicles Based on Deep Reinforcement Learning;Electronics;2023-11-23

5. The Internet of Vehicles and Sustainability—Reflections on Environmental, Social, and Corporate Governance;Energies;2023-04-02