Author:
Abdul Ridha Shyaa Tahreer,Hashim Ahmed A.
Abstract
The ability to accurately recognize and count persons is crucial in many real-world applications, including surveillance, security, and crowd management, making it one of computer vision’s most fundamental tasks. You Only Look Once (YOLO) is one of the most effective deep learning models for object identification and counting in recent years. This research seeks to learn more about the YOLOv8 algorithm for precisely counting people in still photos and moving videos. The YOLO method has been at the forefront of computer vision due to its ability to recognize things in real time. People in a crowd typically overlap and block one other, and perspective effects can result in enormous changes in human size, shape, and appearance in the image, all of which make accurate headcounts challenging.The YOLO methodology and its adaptation for population census are the subject of this research. Results from experiments support the usefulness of the proposed approach. Surveillance, crowd control, traffic monitoring, retail analytics, event management, and urban planning are just some of the potential uses highlighted by the findings of this study. Mean Average Precision (MAP) numbers demonstrate that the identification procedure was successful, and the counting process was accurate to within 100%.
Reference27 articles.
1. Mundhenk T. N., Konjevod G., Sakla W. A., and Boakye K., “A large contextual dataset for classification, detection and counting of cars with deep learning,” in Computer Vision-ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part III 14, Springer, 2016, pp. 785–800.
2. Lempitsky V. and Zisserman A., “Learning To Count Objects in Images.”
3. Ma Z., Yu L., and Chan A. B., “Small Instance Detection by Integer Programming on Object Density Maps.”
4. Ma Z., Wei X., Hong X., and Gong Y., “Bayesian Loss for Crowd Count Estimation with Point Supervision.” [Online]. Available: https://github.com/ZhihengCV/
5. Zhang A. et al., “Relational Attention Network for Crowd Counting.”