Car Full View Dataset: Fine-Grained Predictions of Car Orientation from Images
-
Published:2023-12-09
Issue:24
Volume:12
Page:4947
-
ISSN:2079-9292
-
Container-title:Electronics
-
language:en
-
Short-container-title:Electronics
Author:
Catruna Andy12ORCID, Betiu Pavel1ORCID, Tertes Emanuel1, Ghita Vladimir23, Radoi Emilian12ORCID, Mocanu Irina12ORCID, Dascalu Mihai124ORCID
Affiliation:
1. Computer Science Department, National University of Science and Technology POLITEHNICA Bucharest, 060042 Bucharest, Romania 2. R&D Department, FORT S.A., 052034 Bucharest, Romania 3. Management Department, National University of Science and Technology POLITEHNICA Bucharest, 060042 Bucharest, Romania 4. Academy of Romanian Scientists, Str. Ilfov, Nr. 3, 050044 Bucharest, Romania
Abstract
The orientation of objects plays an important role in accurate predictions for the tasks of classification, detection, and trajectory estimation. This is especially important in the automotive domain, where estimating an accurate car orientation can significantly impact the effectiveness of the other prediction tasks. This work presents Car Full View (CFV), a novel dataset for car orientation prediction from images obtained by video recording all possible angles of individual vehicles in diverse scenarios. We developed a tool to semi-automatically annotate all the video frames with the respective car angle based on the walking speed of the recorder and manually annotated key angles. The final dataset contains over 23,000 images of individual cars along with fine-grained angle annotations. We study the performance of three state-of-the-art deep learning architectures on this dataset in three different learning settings: classification, regression, and multi-objective. The top result of 3.39° in circular mean absolute error (CMAE) shows that the model accurately predicts car orientations for unseen vehicles and images. Furthermore, we test the trained models on images from two different datasets and show their generalization capability to realistic images. We release the dataset and the best models while publishing a web service to annotate new images.
Funder
Automated car damage detection and cost prediction—InsureAI
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference38 articles.
1. Bengio, Y., Louradour, J., Collobert, R., and Weston, J. (2009, January 14–18). Curriculum learning. Proceedings of the 26th Annual International Conference on Machine Learning, Montreal, QC, Canada. 2. He, K., Zhang, X., Ren, S., and Sun, J. (July, January 26). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA. 3. Liu, Z., Mao, H., Wu, C.Y., Feichtenhofer, C., Darrell, T., and Xie, S. (2022, January 18–24). A convnet for the 2020s. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA. 4. Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., and Guo, B. (2021, January 11–17). Swin transformer: Hierarchical vision transformer using shifted windows. Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, QC, Canada. 5. Azimi, S.M., Bahmanyar, R., Henry, C., and Kurz, F. (2021, January 10–15). Eagle: Large-scale vehicle detection dataset in real-world scenarios using aerial imagery. Proceedings of the 2020 25th International Conference on Pattern Recognition (ICPR), Milan, Italy.
|
|