Phased Feature Extraction Network for Vehicle Search Tasks Based on Cross-Camera for Vehicle–Road Collaborative Perception
Author:
Wang Hai 1, Niu Yaqing 1, Chen Long 2, Li Yicheng 2, Luo Tong 3
Affiliation:
1. School of Automotive and Traffic Engineering, Jiangsu University, Zhenjiang 212013, China
2. Automotive Engineering Research Institute, Jiangsu University, Zhenjiang 212013, China
3. School of Automobile and Traffic Engineering, Jiangsu University of Technology, Changzhou 213001, China
Abstract
The objective of vehicle search is to locate and identify vehicles in uncropped, real-world images; it combines two tasks: vehicle detection and re-identification (Re-ID). As an emerging research topic, vehicle search plays a significant role in cooperative vehicle–road perception for autonomous driving and has become a trend in the development of intelligent driving. However, no suitable dataset exists for this task. We therefore build on the Tsinghua University DAIR-V2X dataset to create the first cross-camera vehicle search dataset, DAIR-V2XSearch, which combines vehicle-side and roadside cameras in real-world scenes. Existing search networks are designed primarily for the pedestrian search problem; because the task scenarios differ, the network must be redesigned to handle the large viewpoint differences that arise in vehicle search. A phased feature extraction network (PFE-Net) is proposed as a solution to the cross-camera vehicle search problem. First, the anchor-free YOLOX framework is chosen as the backbone, which improves detection performance and also eliminates the ambiguity of multiple anchor boxes corresponding to a single vehicle ID in the Re-ID branch. Second, for the vehicle Re-ID branch, a camera grouping module is proposed to address abrupt viewpoint changes and shooting disparities across cameras. Finally, a cross-level feature fusion module is designed to strengthen the model's ability to extract subtle vehicle features and to improve Re-ID precision. Experiments demonstrate that the proposed PFE-Net achieves the highest precision on the DAIR-V2XSearch dataset.
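The camera grouping and cross-level fusion ideas in the abstract can be illustrated with a minimal sketch. This is an assumption-laden toy, not the paper's implementation: PFE-Net's modules are learned network components, whereas the helpers below (`camera_group_normalize`, `cross_level_fuse` are hypothetical names) show only the underlying intuition — suppressing per-camera appearance shifts before cross-camera matching, and combining fine-grained shallow features with semantic deep features.

```python
import numpy as np

def camera_group_normalize(features, camera_ids):
    """Center and L2-normalize Re-ID embeddings within each camera group.

    Subtracting a per-camera mean is one simple way to reduce
    camera-specific shifts (viewpoint, exposure) before comparing
    embeddings across cameras. Illustrative only.
    """
    features = np.asarray(features, dtype=np.float64)
    camera_ids = np.asarray(camera_ids)
    out = np.empty_like(features)
    for cam in np.unique(camera_ids):
        mask = camera_ids == cam
        group = features[mask]
        centered = group - group.mean(axis=0, keepdims=True)
        norms = np.linalg.norm(centered, axis=1, keepdims=True)
        out[mask] = centered / np.maximum(norms, 1e-12)  # avoid divide-by-zero
    return out

def cross_level_fuse(shallow, deep):
    """Fuse pooled features from two network levels by concatenation.

    Shallow features retain subtle detail; deep features carry semantics.
    Concatenation is the simplest cross-level fusion scheme.
    """
    return np.concatenate([np.asarray(shallow), np.asarray(deep)], axis=-1)

# Toy usage: four detections from two cameras, 2-D embeddings.
feats = [[1.0, 0.0], [3.0, 0.0], [0.0, 2.0], [0.0, 4.0]]
cams = [0, 0, 1, 1]
normalized = camera_group_normalize(feats, cams)
fused = cross_level_fuse(np.ones(3), np.zeros(2))
```

After normalization, embeddings from different cameras live on a comparable unit scale, so a cosine-similarity gallery search is not dominated by per-camera bias.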
Funder
National Natural Science Foundation of China; Key Research and Development Program of Jiangsu Province
Subject
Electrical and Electronic Engineering, Biochemistry, Instrumentation, Atomic and Molecular Physics and Optics, Analytical Chemistry