A Study of Classroom Behavior Recognition Incorporating Super-Resolution and Target Detection-Reference-Cited by-同舟云学术

A Study of Classroom Behavior Recognition Incorporating Super-Resolution and Target Detection

Published:2024-08-30 Issue:17 Volume:24 Page:5640
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Zhang Xiaoli¹,Nie Jialei²,Wei Shoulin¹^ORCID,Zhu Guifu³,Dai Wei¹^ORCID,Yang Can²

Affiliation:

1. Key Laboratory of Computer Science, Kunming University of Science and Technology, Kunming 650500, China

2. School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China

3. Informationization Construction Management Center, Kunming University of Science and Technology, Kunming 650500, China

Abstract

With the development of educational technology, machine learning and deep learning provide technical support for traditional classroom observation assessment. However, in real classroom scenarios, the technique faces challenges such as lack of clarity of raw images, complexity of datasets, multi-target detection errors, and complexity of character interactions. Based on the above problems, a student classroom behavior recognition network incorporating super-resolution and target detection is proposed. To cope with the problem of unclear original images in the classroom scenario, SRGAN (Super Resolution Generative Adversarial Network for Images) is used to improve the image resolution and thus the recognition accuracy. To address the dataset complexity and multi-targeting problems, feature extraction is optimized, and multi-scale feature recognition is enhanced by introducing AKConv and LASK attention mechanisms into the Backbone module of the YOLOv8s algorithm. To improve the character interaction complexity problem, the CBAM attention mechanism is integrated to enhance the recognition of important feature channels and spatial regions. Experiments show that it can detect six behaviors of students—raising their hands, reading, writing, playing on their cell phones, looking down, and leaning on the table—in high-definition images. And the accuracy and robustness of this network is verified. Compared with small-object detection algorithms such as Faster R-CNN, YOLOv5, and YOLOv8s, this network demonstrates good detection performance on low-resolution small objects, complex datasets with numerous targets, occlusion, and overlapping students.

Funder

Yunnan Provincial Department of Education Science Research Fund Project

National Natural Science Foundation of China

Publisher

MDPI AG

Link

https://www.mdpi.com/1424-8220/24/17/5640/pdf

Reference32 articles.

1. Simulation of classroom student behavior recognition based on PSO-kNN algorithm and emotional image processing;Wu;J. Intell. Fuzzy Syst.,2021

2. Recognition of classroom learning behaviors based on the fusion of human pose estimation and object detection;Wang;J. East China Norm. Univ. (Nat. Sci.),2022

3. Chen, G., Ji, J., and Huang, C. (2022, January 15–17). Student classroom behavior recognition based on openpose and deep learning. Proceedings of the 2022 7th International Conference on Intelligent Computing and Signal Processing (ICSP), Xi’an, China.

4. Fu, R., Wu, T., Luo, Z., Duan, F., Qiao, X., and Guo, P. (2019, January 14–19). Learning behavior analysis in classroom based on deep learning. Proceedings of the 2019 Tenth International Conference on Intelligent Control and Information Processing (ICICIP), Marrakesh, Morocco.

5. Kolesnikov, A., Kuznetsova, A., Lampert, C., and Ferrari, V. (2019, January 27–28). Detecting visual relationships using box attention. Proceedings of the Proceedings of the IEEE/CVF international conference on computer vision workshops, Seoul, Republic of Korea.