4.1. Predictive Model Performance Analysis
4.1.1. Prediction Effect Analysis
- (1)
Prediction Effect of Single Support Resistance Value
As shown in the
Figure 6 below, the Support No. 100 resistance value prediction effect indicates that the predicted values maintain an overall consistent trend with the actual values. During the stages where the resistance fluctuates steadily, the predicted curve closely follows the variation characteristics of the actual resistance with minimal deviation. Crucially, when the resistance exhibits a sharp, sudden change, the predicted curve can also capture the trend of this resistance surge rapidly.
Figure 7 shows the relative deviation box plot of the resistance prediction value of support No. 100. The predicted value fluctuates continuously around the actual value. The maximum positive deviation observed is higher than the actual value, while the minimum negative deviation is lower than the actual value. Most of the predicted values are concentrated in the range of ±1% to ±2% of the corresponding actual value. In addition, the average value of the predicted value is the actual value of the corresponding time step. This shows that the predicted value is very close to the actual value, and the prediction Accuracy is high.
- (2)
Prediction Effect Across Multiple Hydraulic Supports
The
Figure 8 below illustrates the prediction performance of the resistance values across multiple hydraulic supports. The predicted and actual curves for all supports exhibit a high degree of consistency in their overall trend. During phases of stable resistance fluctuation, the predicted values closely track the dynamic characteristics of the actual values. Crucially, even when facing complex operating conditions, such as sudden resistance changes, the predicted curves can still capture the critical trends of these resistance variations.
The
Figure 9 presented displays the box plot and distribution map of the predicted pressure values for multiple supports. By comparing this box plot with the one for the single-support resistance value prediction, it can be concluded that the proximity between the predicted and actual values is a universal characteristic. While the fluctuation range of the predicted values varies slightly across different supports, all predictions are concentrated within two of the actual values.
Specifically, the mean predicted values for supports No. 85, No. 100, and No. 170 are 99.85%, 99.844%, and 99.827% of their respective actual values. Since the mean predicted values are all greater than the actual support pressure values, this further confirms the high Accuracy of the support pressure prediction model.
4.1.2. Analysis of Prediction Accuracy of Different Algorithms
The proposed GRU-AM model combines the advantages of the GRU’s lightweight temporal modeling with the key feature-enhancing capabilities of the AM. This fusion enables the model to adaptively assign higher weight to abnormally sensitive information.
Table 4 and
Figure 10 show the prediction performance of different algorithms and histograms of different models, respectively, for a 1 min time step.
As shown in
Table 4, in the 1 min single-step prediction scenario, performance metrics demonstrate the superiority of the GRU-AM model: root mean square error (RMSE): the GRU-AM model (1.8901 kN) outperforms the LSTM (1.9314 kN) and CNN (1.9816 kN). Mean absolute error (MAE): the GRU-AM model (0.6074 kN) performs significantly worse than the LSTM (0.6766 kN) and CNN (0.7734 kN). Pearson correlation coefficient (PCC): the GRU-AM model (0.9701) outperforms the LSTM (0.9506) and CNN (0.9202). This indicates that the GRU-AM model best captures the changing trends of support and resistance levels.
Overall, in the 1 min single-step forecasting task, the GRU-AM model incorporating the AM outperforms the LSTM and CNN models in terms of error control and trend capture, making it highly suitable for accurate short-term support and resistance forecasts.
Step size is a core hyperparameter in time-series models, directly impacting training efficiency and generalization performance. A step size that is too short can easily lead to overfitting, while a step size that is too long can easily lead to underfitting and increase computational cost. Taking into account computational cost and research focus, this study set the step size gradient to 5, 10, and 15 min, focusing on the changes in model prediction Accuracy under the influence of different step sizes.
Table 5 shows the prediction results of three models (GRU-AM, LSTM, and CNN) at step sizes of 5, 10, and 15 min.
Figure 11 illustrates the models’ prediction performance at different step lengths (5, 10, and 15 min). A consistent trend is observed: the fitting performance of all three models—GRU-AM, LSTM, and CNN—declines as the step length increases. This is evidenced by the gradual increase in the error metrics (ERMSE, EMAE, and EMAPE) and the continuous decrease in the Pearson correlation coefficient.
The GRU-AM model consistently demonstrates the best performance across all step lengths. Its error metrics are significantly lower than the LSTM and CNN models, and the GRU-AM model maintains a higher value.
The prediction quality decreases notably when the step length is extended to 15 min compared to 5 and 10 min, with the values for all three models dropping below 0.9. For the longer step length of 10 min, the average decrease in the value across all models, relative to the 1 min step, is. The GRU-AM model shows the most significant decrease in value in this specific case. The average value decreases when the step length is set to 15 min. Crucially, the GRU-AM model exhibits the most minor decrease in value at this extended step length.
The GRU-AM model is superior to the LSTM and CNN models across all tested step lengths. Furthermore, the GRU-AM model shows a clear advantage in long-step prediction, demonstrating greater resilience and less affected by the increase in step length.
4.2. Effectiveness Assessment of Abnormal Pressure on the Working Face
- (1)
Accuracy of abnormal detection using a single stent
To analyze the effectiveness of the GRU-AM model in strata pressure anomaly identification, the model’s risk assessment was calculated for Support No. 100 over future time horizons of 1, 5, 10, and 15 min. These results were then compared against a benchmark of manual identification, using the Comprehensive Risk Score intervals defined in
Table 2. The results of this comparison are presented below:
Table 6 and
Table 7 present the strata pressure anomaly identification results for Support No. 100 over different future time horizons and the evaluation metrics across various step lengths, respectively.
As shown in
Table 7, the model’s accuracy gradually decreases as the step length increases. At step lengths of 1 and 5 min, the risk assessment model exhibits only a small number of misclassifications between the normal- and low-risk categories. However, the identification of medium- and high-risk anomalies remains accurate. At a step length of 10 min, misclassifications increase, with fewer errors observed in the normal-, low-, and medium-risk categories.
To more clearly illustrate the change in model identification rate as the step length increases, the table data were plotted into a bar chart (
Figure 12) showing the state identification rate for the support number across 58 periodic cycles. Different colors in the figure represent different step lengths. As the step length increases, the F1 scores for normal, low, medium, and high risk and overall Accuracy all show a clear downward trend.
Notably, at a step length of 15 min, the high-risk F1 score drops below 1 for the first time. The medium-risk F1 score and the overall Accuracy also show a significant decrease. Therefore, the optimal prediction step length for the model should be set to less than 10 min.
- (2)
Multi-branch prediction accuracy
By comparing
Figure 13a with
Figure 13b, it is clear that the overall distribution trend of stress is similar between the two figures. High resistance corresponds to red and yellow colors, while low resistance corresponds to green. Crucially, the location and morphology of the high-resistance areas show a high degree of matching. This indicates that the model can capture the spatial distribution characteristics of the support resistance effectively and provides good prediction results for the resistance variation patterns in different areas of the working face.
Table 8 shows the area proportions of different stress regions (resistance levels) in the cloud map. The area difference in the green (low resistance) region is less than 3%. However, the area differences in the yellow (medium resistance) and red (high resistance) regions are 10.05% and 20.65%, respectively.
The model predicts that the area proportions of medium- and high-resistance regions are higher than the actual values. Possible reasons for this discrepancy include the following: model reliance on historical trends: the forecasting model can only predict future values by learning from the trend characteristics of past actual values. Trend adaptation lag: when a new trend emerges, the forecasting model must first capture the change in actual values before adjusting the forecast for the next time step. This inherent lag in adapting to new trends may contribute to the overestimation of the predicted area proportions of medium- and high-resistance regions.
Table 9 and
Table 10 present the strata pressure anomaly identification results for multiple supports across different future time horizons and the evaluation metrics for the various step lengths across multiple supports, respectively. The anomaly identification results for 1, 5, 10, and 15 min into the future show that the model exhibits good recognition performance across all four temporal dimensions.
1 min Prediction: The model accurately identified 1366 cases of the normal category (with only five false judgments as low risk) and 49 cases of the low-risk category (with 12 false judgments as normal). Crucially, there were no misjudgments in the medium- and high-risk categories. The overall Accuracy rate was 0.9741.
5 min Prediction: Performance remained nearly identical to the 1 min scenario. The model accurately identified 1366 normal cases (5 false judgments as low risk) and 49 low-risk cases (12 false judgments as normal, plus one false judgment as low risk). Again, there were no misjudgments in the medium- and high-risk categories. The overall Accuracy is 0.9346, which remains high.
10 min Prediction: Overall Accuracy decreased slightly to 0.9195.
15 min Prediction: Overall Accuracy was 0.8951, below 0.9.
These results demonstrate the model’s strong capability for short-term anomaly identification (1 and 5 min), with a predictable decline in performance as the prediction horizon extends to 10 and 15 min.
Figure 14 illustrates all supports’ risk state identification rate across different step lengths. At the 10 min prediction step length, the model demonstrated good recognition performance for all support states: the identification rate for the normal state reached 92.8%. The identification rates for low-, medium-, and high-risk states were 90.7%, 91.6%, and 91.6%, respectively. The overall target identification rate was 91.3%.
This confirms that the model performs well in strata pressure anomaly identification across multiple supports, effectively distinguishing between risk levels and the normal state.
4.3. Recognition Effect of Pressing from Different Mine Workfaces
- (1)
Mine pressure prediction effect
Table 11 shows the application results of the GRU-AM prediction model in a key mine in Baoji City, Shaanxi Province (focused on rock burst prevention). The mine has similar geological conditions. The model effectively identified rock burst events in the working face of the mine.
The model performed best with a 1 min step size: the RMSE was only 1.8901 KN, the EMAPE reached 12.4476%, and the Pearson correlation coefficient reached 0.9701. When the step size was increased to 15 min, the RMSE increased to 3.5065 KN, the EMAPE increased to 31.9872%, and the R value decreased to 0.8837. The model demonstrated stable performance when applied to mines with similar geological conditions.
Table 12 shows the distribution of absolute errors in mine pressure prediction for similar geological working faces. With a prediction step size of 1 min, the prediction errors are mainly concentrated between 0.5 and 1 kN, accounting for 44.69% of the samples. Samples with errors less than 1 kN account for 57.01% of the total, exceeding half of the total samples, while samples with errors greater than 2 kN account for only 9.57%. Overall, the absolute error control of this mine pressure prediction model is good, and it achieves high prediction accuracy on similar geological working faces.
- (2)
Incoming pressure recognition prediction effect
Figure 15 displays the bar chart showing the strata pressure anomaly identification rate for the working face at the mine in Baoji City, Shaanxi Province. As illustrated in
Figure 15, at a 10 min prediction step length, the model demonstrated strong performance across all risk states: the identification rate for the normal state across all supports was 91.1%. The identification rates for low-, medium-, and high-risk states reached 88.9%, 89.8%, and 89.6%, respectively. The overall target identification rate was 89.52%. These results confirm that the model also exhibits good anomaly identification effectiveness in mines with similar geological conditions. This suggests that the model is highly adaptable and capable of effectively distinguishing between different risk levels and the normal state across different mine working faces.
As shown in
Table 13, which details the actual strata pressure anomaly identification results for the working face at the mine in Baoji City, Shaanxi Province, the model demonstrated the ability to effectively identify low-, medium-, and high-risk levels across multiple working cycles from 2 July to 11 July 2025. This outcome highlights the model’s robust capability to identify strata pressure anomalies in this mine working face.
4.4. Algorithmic Limitations
Model performance is highly reliant on high-quality, continuously sampled pressure sequences. If sensors experience discontinuities or outliers, the attention weights may become distorted, necessitating additional preprocessing or missing value imputation strategies.
The perfect F1 score (F1 = 1) for high-risk identification primarily stems from the small sample proportion of high-risk events, coupled with their salient and singular patterns (terminal cycle resistance greater than 45.84 MPa, and the root mean square deviation of the terminal cycle resistance greater than 7.31 MPa), which objectively reduces the classification difficulty. Future work is planned to further validate the model’s robustness and generalization performance on rare events through strategies like time-forward rolling resampling and artificial injection of anomalies.
The acknowledged limitation of this purely data-driven model lies in its heavy reliance on the quality and quantity of historical sensor data, which can result in lower interpretability compared to physics-based models. Furthermore, the model’s generalizability across various geological and operational conditions requires further rigorous testing. Future research should focus on developing hybrid models that combine the predictive power of deep learning with the physical constraints of the strata–support interaction.