Author:
Lahgazi M. J., ,Argoul P.,Hakim A., ,
Abstract
Pedestrian segmentation is a critical task in computer vision, but it can be challenging for segmentation models to accurately classify pedestrians in images with challenging backgrounds and luminosity changes, as well as occlusions. This challenge is further compounded for compressed models that were designed to deal with the high computational demands of deep neural networks. To address these challenges, we propose a novel approach that integrates a region proposal-based framework into the segmentation process. To evaluate the performance of the proposed framework, we conduct experiments on the PASCAL VOC dataset, which presents challenging backgrounds. We use two different segmentation models, UNet and SqueezeUNet, to evaluate the impact of region proposals on segmentation performance. Our experiments show that the incorporation of region proposals significantly improves segmentation accuracy and reduces false positive pixels in the background, leading to better overall performance. Specifically, the SqueezeUNet model achieves a mean Intersection over Union (mIoU) of 0.682, which is a 12% improvement over the baseline SqueezeUNet model without region proposals. Similarly, the UNet model achieves a mIoU of 0.678, which is a 13% improvement over the baseline UNet model without region proposals.
Publisher
Lviv Polytechnic National University
Subject
Computational Theory and Mathematics,Computational Mathematics
Reference41 articles.
1. Minaee S., Boykov Y. Y., Porikli F., Plaza A. J., Kehtarnavaz N., Terzopoulos D. Image segmentation using deep learning: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence. 44 (7), 3523-3542 (2021).
2. Hearst M. A., Dumais S. T., Osuna E., Platt J., Scholkopf B. Support vector machines. IEEE Intelligent Systems and their Applications. 13 (4), 18-28 (1998).
3. Lahgazi M. J., Hakim A., Argoul P. An adaptive wavelet shrinkage based accumulative frame differencing model for motion segmentation. Mathematical Modeling and Computing. 10 (1), 159-170 (2023).
4. Histograms of oriented gradients for human detection;N.;2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 05),2005
5. Ashok V., Balakumaran T., Gowrishankar C., Vennila I. L. A., Nirmal Kumar A. The Fast Haar Wavelet Transform for Signal & Image Processing. International Journal of Computer Science and Information Security. 7 (2010).