Abstract
This work aims to engineer a robust system capable of real-time detection, accurately discerning individuals who are either adhering to or neglecting face mask mandates, across a diverse range of scenarios encompassing images, videos, and live camera streams. This study improved the architecture of YOLOv8n for face mask detection by building a new two-modification version of YOLOv8n model to improve feature extraction and prediction network for YOLOv8n. In proposed YOLOv8n-v1, the integration of a residual Network backbone into the YOLOv8n architecture by replacing the first two layers of YOLOv8n with ResNet_Stem and ResNet_Block modules to improve the model’s ability for feature extraction and replace Spatial Pyramid Pooling Fast (SPPF) module with Spatial Pyramid Pooling-Cross Stage Partial (SPPCSP) modules which combine SPP and CSP to create a network that is both effective and efficient. The proposed YOLOv8n-v2 is built by integration Ghostconv and ResNet_Downsampling modules into the proposed YOLOv8n-v1 backbone. All models have been tested and evaluated on two datasets. The first one is MJFR dataset, which contains 23,621 images, and collected by the authors of this paper from four distinct datasets, all of which were used for facemask detection purposes. The second one is MSFM object detection dataset has been collected from groups of videos in real life and images based on the curriculum learning technology. The model’s performance is assessed by using the following metrics: mean average precision (mAP50), mAP50-95, recall (R) and precision (P). It has been concluded that both versions of proposed YOLOv8n outperform the original model in terms of accuracy for both datasets. Finally, the system was successfully implemented in one of the medical clinics affiliated with a medical complex, where the results of its application showed high efficiency in various aspects of work, and it effectively contributed to improving the public health and safety.
Reference29 articles.
1. Ahuja A. S. et al.: Artificial intelligence in ophthalmology: A multidisciplinary approach. Integrative Medicine Research 11(4), 2022, 100888.
2. Al-Shamdeen M. J., Younis A. N., Younis H. A.: Metaheuristic algorithm for capital letters images recognition. Computer Science 16(2), 2020, 577–588.
3. Bhujel S., Shakya S.: Rice Leaf Diseases Classification Using Discriminative Fine Tuning and CLR on EfficientNet. Journal of Soft Computing Paradigm 4(3), 2022, 172–187.
4. Chabi Adjobo E. et al.: Automatic Localization of Five Relevant Dermoscopic Structures Based on YOLOv8 for Diagnosis Improvement. Journal of Imaging 9(7), 2023, 148.
5. Deng J. et al.: Retinaface: Single-stage dense face localisation in the wild. arXiv preprint arXiv: 1905.00641, 2019.