Abstract
Student classroom behavior recognition is a core research direction in intelligent education systems. Real-time analysis of students' learning states and behavioral features through classroom monitoring provides quantitative support for teaching evaluation, classroom management, and personalized instruction, offering significant value for data-driven educational decision-making. To address the issues of low detection accuracy and severe occlusion in classroom behavior detection, this article proposes an improved YOLOv11n-based algorithm named YOLOv11-GLIDE. The model introduces a Channel Prior Convolutional Attention (CPCA) mechanism to integrate global and local feature information, enhancing feature extraction and detection performance. A scale-based dynamic loss (SD Loss) is designed to adaptively adjust the loss weights according to object scale, improving regression stability and detection accuracy. In addition, Sparse Depthwise Convolution (SPD-Conv) replaces traditional down-sampling to reduce fine-grained feature loss and computational cost. Experimental results on the SCB-Dataset3 demonstrate that YOLOv11-GLIDE achieves an excellent balance between accuracy and lightweight design. Compared with the baseline YOLOv11n, mAP@0.5 and mAP@0.5-0.95 increase by 2.5% and 7.6%, while Parameters and GFLOPS are reduced by 9.4% and 11.1%, respectively. The detection speed reaches 127.9 FPS, meeting the practical requirements of embedded classroom monitoring systems for accurate and efficient student behavior recognition.