Abstract
In the Yangquan coal mining region, China, muffled blasting sounds commonly occur in mine surrounding rocks resulting from instantaneous energy release following the elastic deformation of overlying brittle rock layers; they are related to fracture development. Although these events rarely cause immediate hazards, their acoustic signatures contain critical information about cumulative rock damage. Currently, conventional monitoring of muffled blasting sounds and surrounding rock stability relies on microseismic systems and on-site sampling techniques. However, these methods exhibit low identification efficiency for muffled blasting events, poor real-time performance, and strong subjectivity arising from manual signal interpretation and empirical threshold setting. This article proposes retentive depthwise gated network (RDGNet). By combining retentive network sequence modeling, depthwise separable convolution, and a gated fusion mechanism, RDGNet enables multimodal feature extraction and the fusion of acoustic emission sequences and audio Mel spectrograms, supporting real-time muffled blasting sound recognition and lithology classification. Results confirm model robustness under noisy and multisource mixed-signal conditions (overall accuracy: 92.12%, area under the curve: 0.985, and Macro F1: 0.931). This work provides an efficient approach for intelligent monitoring of coal mine rock stability and can be extended to safety assessments in underground engineering, advancing the mining industry toward preventive management.