Three-Dimensional Vehicle Detection and Pose Estimation in Monocular Images for Smart Infrastructures-Reference-Cited by-同舟云学术

Three-Dimensional Vehicle Detection and Pose Estimation in Monocular Images for Smart Infrastructures

Published:2024-06-29 Issue:13 Volume:12 Page:2027
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Borau Bernad Javier¹^ORCID,Ramajo-Ballester Álvaro¹^ORCID,Armingol Moreno José María¹^ORCID

Affiliation:

1. Intelligent Systems Lab, Universidad Carlos III de Madrid, 28911 Leganés, Spain

Abstract

Over the last decades, the idea of smart cities has evolved from a visionary concept of the future into a concrete reality. However, the vision of smart cities has not been fully realized within our society, partly due to the challenges encountered in contemporary data collection systems. Despite these obstacles, advancements in deep learning and computer vision have propelled the development of highly accurate detection algorithms capable of obtaining 3D data from image sources. Nevertheless, this approach has predominantly centered on data extraction from a vehicle’s perspective, bypassing the advantages of using infrastructure-mounted cameras for performing 3D pose estimation of vehicles in urban environments. This paper focuses on leveraging 3D pose estimation from this alternative perspective, benefiting from the enhanced field of view that infrastructure-based cameras provide, avoiding occlusions, and obtaining more information from the objects’ sizes, leading to better results and more accurate predictions compared to models trained on a vehicle’s viewpoint. Therefore, this research proposes a new path for exploration, supporting the integration of monocular infrastructure-based data collection systems into smart city development.

Publisher

MDPI AG

Link

https://www.mdpi.com/2227-7390/12/13/2027/pdf

Reference34 articles.

1. 3D Detection and Pose Estimation of Vehicle in Cooperative Vehicle Infrastructure System;Guo;IEEE Sens. J.,2021

2. Zimmer, W., Birkner, J., Brucker, M., Tung Nguyen, H., Petrovski, S., Wang, B., and Knoll, A.C. (2023, January 4–7). InfraDet3D: Multi-Modal 3D Object Detection based on Roadside Infrastructure Camera and LiDAR Sensors. Proceedings of the 2023 IEEE Intelligent Vehicles Symposium (IV), Anchorage, AK, USA.

3. Yu, H., Luo, Y., Shu, M., Huo, Y., Yang, Z., Shi, Y., Guo, Z., Li, H., Hu, X., and Yuan, J. (2022, January 18–24). DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA.

4. Ye, X., Shu, M., Li, H., Shi, Y., Li, Y., Wang, G., Tan, X., and Ding, E. (2022, January 18–24). Rope3D: The Roadside Perception Dataset for Autonomous Driving and Monocular 3D Object Detection Task. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA.

5. Creß, C., Zimmer, W., Strand, L., Fortkord, M., Dai, S., Lakshminarasimhan, V., and Knoll, A. (2022, January 4–9). A9-Dataset: Multi-Sensor Infrastructure-Based Dataset for Mobility Research. Proceedings of the 2022 IEEE Intelligent Vehicles Symposium (IV), Aachen, Germany.