Affiliation:
1. School of Electronics Engineering, Kyungpook National University, Daegu 41566, Republic of Korea
2. School of Electronic and Electrical Engineering, Kyungpook National University, Daegu 41566, Republic of Korea
Abstract
Object detection is a task that performs position identification and label classification of objects in images or videos. The information obtained through this process plays an essential role in various tasks in the field of computer vision. In object detection, the data utilized for training and validation typically originate from public datasets that are well-balanced in terms of the number of objects ascribed to each class in an image. However, in real-world scenarios, handling datasets with much greater class imbalance, i.e., very different numbers of objects for each class, is much more common, and this imbalance may reduce the performance of object detection when predicting unseen test images. In our study, thus, we propose a method that evenly distributes the classes in an image for training and validation, solving the class imbalance problem in object detection. Our proposed method aims to maintain a uniform class distribution through multi-label stratification. We tested our proposed method not only on public datasets that typically exhibit balanced class distribution but also on private datasets that may have imbalanced class distribution. We found that our proposed method was more effective on datasets containing severe imbalance and less data. Our findings indicate that the proposed method can be effectively used on datasets with substantially imbalanced class distribution.
Funder
National Research Foundation of Korea
Subject
General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)
Reference29 articles.
1. Deep learning for computer vision: A brief review;Voulodimos;Comput. Intell. Neurosci.,2018
2. Application of deep learning for object detection;Pathak;Procedia Comput. Sci.,2018
3. Object detection in 20 years: A survey;Zou;Proc. IEEE,2023
4. Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27–30). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.
5. A review and comparative study on probabilistic object detection in autonomous driving;Feng;IEEE Trans. Intell. Transp. Syst.,2021