A Face Detector with Adaptive Feature Fusion in Classroom Environment
-
Published:2023-04-06
Issue:7
Volume:12
Page:1738
-
ISSN:2079-9292
-
Container-title:Electronics
-
language:en
-
Short-container-title:Electronics
Author:
Sun Cheng1, Wen Pei2, Zhang Shiwen2ORCID, Wu Xingjin2, Zhang Jin23ORCID, Gong Hongfang4
Affiliation:
1. School of Mathematics and Statistics, Hunan Normal University, Changsha 410081, China 2. College of Information Science and Engineering, Hunan Normal University, Changsha 410081, China 3. School of Computer and Communication Engineering, Changsha University of Science and Technology, Changsha 410114, China 4. School of Mathematics and Statistics, Changsha University of Science and Technology, Changsha 410114, China
Abstract
Face detection in the classroom environment is the basis for student face recognition, sensorless attendance, and concentration analysis. Due to equipment, lighting, and the uncontrollability of students in an unconstrained environment, images include many moving faces, occluded faces, and extremely small faces in a classroom environment. Since the image sent to the detector will be resized to a smaller size, the face information extracted by the detector is very limited. This seriously affects the accuracy of face detection. Therefore, this paper proposes an adaptive fusion-based YOLOv5 method for face detection in classroom environments. First, a very small face detection layer in YOLOv5 is added to enhance the YOLOv5 baseline, and an adaptive fusion backbone network based on multi-scale features is proposed, which has the ability to feature fusion and rich feature information. Second, the adaptive spatial feature fusion strategy is applied to the network, considering the face location information and semantic information. Finally, a face dataset Classroom-Face in the classroom environment is creatively proposed, and it is verified with our method. The experimental results show that, compared with YOLOv5 or other traditional algorithms, our algorithm portrays better performance in WIDER-FACE Dataset and Classroom-Face dataset.
Funder
Natural Science Foundation of Hunan Province Open Research Project of the State Key Laboratory of Industrial Control Technology National Defense Science and Technology Key Laboratory Fund Project National Natural Science Foundation of China
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference36 articles.
1. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks;Ren;IEEE Trans. Pattern Anal. Mach. Intell.,2017 2. Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (July, January 26). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA. 3. Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., and Berg, A.C. (2016, January 11–14). Ssd: Single shot multibox detector. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands. 4. Redmon, J., and Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv. 5. Jain, V., and Learned-Miller, E. (2010). FDDB: A Benchmark for Face Detection in Unconstrained Settings, Bepress. Available online: https://works.bepress.com/erik_learned_miller/55/.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|