Abstract
Most coal mining enterprises in China have established and use safety production information systems for hazard identification and management, but related accident hazard data have not been fully utilized. This study is based on the classification standards defined by the "coal mine major accident hazard determination standards" implemented by the Ministry of Emergency Management in 2021. We constructed a classification system including 15 major hazard categories and 79 minor hazard categories, which served as sample labels for major coal mine accident hazards. The hybrid convolutional neural network (CNN)-transformer model was used to perform hierarchical text classification on the coal mine major accident hazard data, with the bidirectional encoder representations from transformers (BERT) model used as a baseline for comparison. The results show that in the major hazard category classification experiments, the hybrid CNN-transformer model outperformed the BERT model by 3% points in terms of accuracy, recall, and F1 score. In the minor hazard category classification experiments, the hybrid CNN-transformer model achieved a maximum classification performance of 98%, generally exceeding the BERT model. The coal mine accident hazard classification algorithm based on the hybrid CNN-transformer model demonstrates significant classification effectiveness, providing efficient and rapid input support for coal mine major accident hazard identification systems. Compared to existing BERT models, the hybrid CNN-transformer model significantly improves classification accuracy and training efficiency by combining the extraction of local and global features, exhibiting higher stability and classification performance. Compared with the baseline BERT model, the proposed hybrid CNN-transformer achieves up to 3-4% higher F1 while converging faster and using fewer computational resources. Deployed in a cooperating minesafety platform, it has already reduced manual hazard-triage workload by ~ 30% in daytoday operations.