Enhancing Object Detection in Underground Mines: UCM-Net and Self-Supervised Pre-Training

增强地下矿井中的目标检测:UCM-Net 和自监督预训练

阅读:1

Abstract

Accurate real-time monitoring of underground conditions in coal mines is crucial for effective production management. However, limited computational resources and complex environmental conditions in mine shafts significantly impact the recognition and computational capabilities of detection models. This study utilizes a comprehensive dataset containing 117,887 images from five common underground mining tasks: mine personnel detection, large coal lump identification, conveyor chain monitoring, miner behavior recognition, and hydraulic support shield inspection. We propose the ESFENet backbone network, incorporating a Global Response Normalization (GRN) module to enhance feature capture stability while employing depthwise separable convolutions and HGRNBlock modules to reduce parameter volume and computational complexity. Building upon this foundation, we propose UCM-Net, a detection model based on the YOLO architecture. Furthermore, a self-supervised pre-training method is introduced to generate mine-specific pre-trained weights, providing the model with more semantic features. We propose utilizing the combined backbone and neck portions of the detection model as the encoder of an image-masking pre-training structure to strengthen feature acquisition and improve the performance of small models in self-supervised learning. Experimental results demonstrate that UCM-Net outperforms both baseline models and the state-of-the-art YOLOv12 model in terms of accuracy and parameter efficiency across the five mine datasets. The proposed architecture achieves 21.5% parameter reduction and 14.8% computational load decrease compared to baseline models while showing notable performance improvements of 1.3% (mAP(50:95)) and 0.8% (mAP(50)) in miner behavior recognition. The self-supervised pre-training framework effectively enhances training efficiency, enabling UCM-Net to attain an average mAP(50) of 94.4% across all five datasets. The research outcomes can provide key technical support for coal mine safety monitoring and offer valuable technological insights for the public safety sector.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。