Abstract
Aiming to address insufficient feature extraction, vanishing gradients, and low prediction accuracy in tool wear prediction, this paper proposes a hybrid deep neural network based on a Convolutional Neural Network (CNN), Residual Network (ResNet) residual connections, the Convolutional Block Attention Module (CBAM), and a Bidirectional Gated Recurrent Unit (BiGRU). First, a 34-dimensional multi-domain feature set covering the time domain, frequency domain, and time-frequency domain is constructed, and multi-sensor signals are standardized using z-score normalization. A CNN-BiGRU backbone is then established, where ResNet-style residual connections are introduced to alleviate training degradation and mitigate vanishing-gradient issues in deep networks. Meanwhile, CBAM is integrated into the feature extraction module to adaptively reweight informative features in both channel and spatial dimensions. In addition, a BiGRU layer is embedded for temporal modeling to capture bidirectional dependencies throughout the wear evolution process. Finally, a fully connected layer is used as a regressor to map high-dimensional representations to tool wear values. Experiments on the PHM2010 dataset demonstrate that the proposed hybrid architecture is more stable and achieves better predictive performance than several mainstream deep learning baselines. Systematic ablation studies further quantify the contribution of each component: compared with the baseline CNN model, the mean absolute error (MAE) is reduced by 47.5%, the root mean square error (RMSE) is reduced by 68.5%, and the coefficient of determination (R(2)) increases by 14.5%, enabling accurate tool wear prediction.