Abstract
The concept of searching and localizing vehicles from live traffic videos based on descriptive textual input has yet to be explored in the scholarly literature. Endowing Intelligent Transportation Systems (ITS) with such a capability could help solve crimes on roadways. One major impediment to the advancement of fine-grain vehicle recognition models is the lack of video testbench datasets with annotated ground truth data. Additionally, to the best of our knowledge, no metrics currently exist for evaluating the robustness and performance efficiency of a vehicle recognition model on live videos and even less so for vehicle search and localization models. In this paper, we address these challenges by proposing V-Localize, a novel artificial intelligence framework for vehicle search and continuous localization captured from live traffic videos based on input textual descriptions. An efficient hashgraph algorithm is introduced to compute valid target information from textual input. This work further introduces two novel datasets to advance AI research in these challenging areas. These datasets include (a) the most diverse and large-scale Vehicle Color Recognition (VCoR) dataset with 15 color classes—twice as many as the number of color classes in the largest existing such dataset—to facilitate finer-grain recognition with color information; and (b) a Vehicle Recognition in Video (VRiV) dataset, a first of its kind video testbench dataset for evaluating the performance of vehicle recognition models in live videos rather than still image data. The VRiV dataset will open new avenues for AI researchers to investigate innovative approaches that were previously intractable due to the lack of annotated traffic vehicle recognition video testbench dataset. Finally, to address the gap in the field, five novel metrics are introduced in this paper for adequately accessing the performance of vehicle recognition models in live videos. Ultimately, the proposed metrics could also prove intuitively effective at quantitative model evaluation in other video recognition applications. T One major advantage of the proposed vehicle search and continuous localization framework is that it could be integrated in ITS software solution to aid law enforcement, especially in critical cases such as of amber alerts or hit-and-run incidents.
Funder
Federal Highway Administration
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. CarAI: Car Inspection with Artificial Intelligence;Proceedings of the 2024 International Conference on Multimedia Retrieval;2024-05-30
2. Multi-Task YOLO for Vehicle Colour Recognition and Automatic License Plate Recognition;2024 IEEE International Conference on Evolving and Adaptive Intelligent Systems (EAIS);2024-05-23
3. Two-Wheeler Classification using Deep Learning (YOLOv5);2024 10th International Conference on Communication and Signal Processing (ICCSP);2024-04-12
4. Run-Time Prevention of Software Integration Failures of Machine Learning APIs;Proceedings of the ACM on Programming Languages;2023-10-16
5. Object-Based Vehicle Color Recognition in Uncontrolled Environment;Proceedings of the 2023 6th International Conference on Machine Vision and Applications;2023-03-10