1. Introduction
Shield machines are widely used in subway construction, underwater highways, high-speed railways, and similar infrastructure projects because of their high degree of automation and strong resistance to interference [1,2]. During shield tunneling, factors such as uneven geological conditions and improper control operations can cause the shield machine to deviate from the designed tunnel axis (DTA), resulting in attitude deviation. This deviation leads to significant engineering problems, including misaligned segment assembly and cracking of the tunnel structure [3,4]. Current control strategies for shield tunneling machine (STM) deviation are inherently delayed: operators’ corrective adjustments typically occur only after attitude deviation has already appeared [5,6]. Accurate advance prediction of shield attitude would therefore be a major advance in shield tunneling technology.
To address this challenge, researchers have conducted extensive studies on shield attitude prediction methods. As demonstrated in
Table 1, these research approaches span from traditional experimental modeling (incorporating numerical analysis and theoretical calculations) to advanced recurrent neural network (RNN) techniques, all designed to extract comprehensive feature information from historical shield attitude data, identify underlying nonlinear relationships, and achieve precise shield attitude prediction [
7,
8]. Traditional modeling approaches typically utilize mechanics-based principles and field-measured parameters (thrust, torque, etc.) to establish theoretical frameworks, employing statistical (autoregressive) modeling methods for parameter optimization [
9,
10]. Sramoon et al. [
11] investigated shield behavior simulation during tunneling in sandy gravel layers using earth-pressure-balanced shields, demonstrating that factors such as ground loosening and wire brush deformation must be considered to accurately capture actual shield behavior and align with field observations. Shen et al. [
12] developed an enhanced calculation methodology for shield attitude prediction by modeling shield–soil interaction through equivalent springs and ground reaction curves, subsequently applying this approach to a metro project for pitch and yaw angle prediction and validation against real-time monitoring data. While mechanism-based approaches enhance understanding of shield tunneling behavior, statistical and experimental modeling methods cannot adequately account for operational condition variability, are computationally intensive, and demonstrate limited generalization capability [
13].
Recent developments in data-driven methodologies have demonstrated remarkable capability in extracting complex feature representations from large-scale operational datasets for shield attitude prediction [
20]. For example, Liu et al. [
14] introduced a BWO-CNN-LSTM-GRU framework achieving 3 mm prediction deviation through systematic hyperparameter optimization, demonstrating the effectiveness of meta-heuristic algorithms in hybrid architectures. Zhou et al. [
4] proposed a novel WT-CNN-LSTM hybrid architecture that integrates wavelet transform preprocessing with convolutional feature extraction and long short-term memory temporal modeling to predict critical shield parameters, including pitch angle, rolling angle, and both vertical and horizontal deviations, achieving superior prediction accuracy through enhanced feature representation. Zhen et al. [
17] developed an explainable framework combining enhanced attention Informer (EAMInfor) with DeepLIFT, addressing the “black box” limitation of machine learning models. Their analysis of Xiamen Metro Line 3 revealed that push, thrust, and earth chamber pressure were the most significant features, with variations in importance exhibiting substantial differences across geological conditions. Wang et al. [
16] established an innovative CNN-GRU fusion architecture that substantially improved prediction accuracy and model robustness in multi-phase shield attitude estimation tasks. Their comprehensive sensitivity analysis, incorporating both first- and second-order derivatives, revealed significant heterogeneity in the contribution weights of historical values across different temporal horizons. These investigations predominantly conceptualize shield machine attitude prediction as a temporal sequence forecasting problem, consequently driving extensive adoption of recurrent architectures, including RNN, LSTM, and GRU variants that excel in capturing temporal dependencies. Relative to conventional RNNs, LSTM and GRU architectures effectively mitigate gradient explosion and vanishing gradient phenomena through sophisticated gating mechanisms, enabling the robust learning of intricate long-term dependencies in sequential data, thereby establishing their prominence as fundamental architectural components [
21].
Notably, in the authors’ previously proposed FTA-N-GRU model, the method employs feature and temporal attention to adaptively weight input parameters and model output dependencies. While effective, attention-based approaches primarily capture correlations within individual timestamps and encounter difficulties in two critical challenges: (1) modeling spatial–temporal relationships across different timestamps, and (2) handling the irregular sampling patterns that are commonly present in shield tunneling data. This study addresses these limitations through a fundamentally different approach: spatial–temporal graph modeling with time decay functions. Unlike attention mechanisms that weight features within individual timestamps, our graph-based approach explicitly constructs and models relationships among all features across all timestamps, enabling comprehensive spatial–temporal dependency learning.
Contemporary shield attitude prediction methodologies predominantly employ RNN-based architectures to model temporal dependencies within sequential operational data [
22]. However, shield operational datasets intrinsically exhibit complex spatial–temporal (ST) interdependencies characterized by both temporal correlations among operational parameters and spatial correlations between heterogeneous parameters within individual timestamps. Suboptimal modeling strategies can substantially degrade the predictive performance of data-driven approaches in shield attitude forecasting applications [
17]. Moreover, existing methodologies encounter significant challenges in data acquisition and preprocessing protocols. Irregular temporal sampling of attitude measurements can introduce systematic bias and spurious correlations into time series models, particularly compromising predictive accuracy during periods of rapid attitude variation. Under such conditions, models may erroneously approximate future attitudes using historical trajectory patterns, resulting in critically low prediction confidence and fundamentally limiting the practical applicability of data-driven methodologies [
23,
24].
To systematically address these fundamental limitations, this investigation introduces a novel ST-GC-GRU methodology, with primary contributions delineated as follows.
Temporal decomposition for robustness to sudden attitude changes: A recursive quadratic decomposition framework is introduced to separate attitude signals into trend and residual components, enhancing the model’s capability to handle drastic attitude variations during geological transitions and under irregular sampling conditions.
Fully connected spatial–temporal graph with time decay matrix: A novel graph structure is proposed where all features across multiple timestamps are fully connected, with a time decay matrix jointly modeling spatial correlations (between heterogeneous parameters) and temporal dependencies (across different time steps) based on actual temporal distances.
Comprehensive performance superiority on real project data: Extensive experiments on the Bangladesh Karnaphuli River Tunnel Project demonstrate that the proposed model consistently outperforms seven baseline methods (CNN, CNN-GRU, GRU, LSTM, TCN, RF, XGBoost) across all four deviation indices (HDH, HDT, VDH, VDT).
Compared to attention-based methods such as Feature Extraction and Attention-based Machine Learning (FEK-AML) that primarily focus on feature importance within individual timestamps, the proposed ST-GC-GRU addresses their limitations in modeling spatial–temporal relationships across different timestamps and handling drastically changing attitude data under variable working conditions. By employing spatial–temporal graph modeling with time decay functions that explicitly construct relationships among all features across all timestamps, the method enables enhanced robustness under diverse operational scenarios, including those with irregular sampling patterns and geological transitions.
The remainder of this manuscript is structured as follows:
Section 2 presents a comprehensive exposition of the proposed methodology, including theoretical foundations and architectural design principles.
Section 3 demonstrates the efficacy and superiority of the developed model through rigorous evaluation using real-world case studies and comparative analysis with state-of-the-art baseline methods.
Section 4 provides an in-depth discussion of the experimental findings, model performance characteristics, and practical implications of the proposed approach.
Section 5 concludes the investigation with a comprehensive summary of key findings and delineates promising directions for future research endeavors.
3. Case Study
3.1. Case Background and Data Processing
The Karnaphuli River Underwater Tunnel in Bangladesh, known within Bangladesh as the “National Father Tunnel”, is the first international long, large-diameter tunnel project undertaken by the China Communications Construction Company (CCCC). Located at the mouth of the Karnaphuli River on the outskirts of Chittagong, the tunnel is 2450 m long for a single tube and roughly 4900 m for the dual track, linking the eastern and western banks of the river. The project site overview is depicted in Figure 8.
The project is located in the alluvial delta region at the river’s estuary, characterized by Quaternary alluvial deposits. The stratigraphy of the west bank and the riverbed features interbedded clay and sand layers of Holocene alluvial origin within the Quaternary system. This stratigraphy is predominantly sand layers interspersed with thin clay layers and vice versa. The clayey soils are mostly in a soft, plastic state with poor geotechnical properties. On the eastern bank, the surface layer comprises alternating cohesive clay and sandy soil layers, primarily dominated by sandy soil. Beneath this surface layer, there are Quaternary alluvial deposits with alternating layers of cohesive clay and sandy soil. The cohesive clay is mainly hard-state silty clay, occasionally exhibiting a semi-indurated state. As shown in
Figure 9, when the shield machine passes through the fine sand layer, there is a complex geological stratum known as the “upper soft and lower hard” composite stratum, which easily causes misalignment of the shield machine, making it necessary to have stringent control over the shield machine’s attitude during construction.
The shield machine is equipped with numerous sensors that continuously monitor its tunneling operation in real-time. All monitoring data are stored in the data acquisition system. The data collected in the early stage include a total of 1224 rings, which are recorded at an excavation interval of 20 mm. Each ring comprises over 1700 features and more than 100 sampling points, resulting in a total data volume of approximately 130,000 samples.
The collected parameters can be summarized into four categories: mechanical operation parameters (thrust, cutter head rotation speed, cutter head torque, jack stroke, etc.), geological condition parameters (surrounding earth pressure), mud system parameters (mud delivery pressure, mud flow rate, etc.), and running status values.
The mechanical operating parameters reflect the current tunneling state of the shield machine. The shield machine operators can adjust these parameters to achieve attitude control. Of course, the raw data also includes many items related to electrical equipment and chemical gas concentrations, which are not listed here in detail. The geological condition parameters reflect the interaction forces between the cutter head and the complex strata during the tunneling process of the shield machine. The generation and distribution of the pressures are intimately related to geological conditions, construction depth, groundwater level, and the operation of the shield machine. Various soil layers (such as sand, clay, and rock) exhibit distinct densities and strengths, leading to different pressure distributions. The parameters of the mud system mainly reflect the impact of mud pressure and balance on the stability and direction control of the shield machine. These parameters can indirectly indicate the distribution of mud pressure and flow in the shield machine. By monitoring and adjusting these parameters, the stability and safety of the tunneling process can be ensured.
Through feature selection and related reference, the key parameters selected are: thrust of the tunnel boring machine (A), cutter head rotation speed (B), cutter head torque (C), penetration (D), surrounding earth pressure (E–H), jack stroke difference (I, J), mud flow and pressure (K–N), the air bubble chamber pressure (O), and the horizontal and vertical posture deviation of the shield machine (P–S). To evaluate the performance of the proposed method, this study randomly selected 150 rings (from ring 670 to ring 819), totaling 14,689 samples, as the dataset. This data volume aligns with the average amount in existing studies on tunnel boring machine attitude prediction problems (generally around 14,000 samples in total).
Table 3 provides a summary description of all selected data.
The selected data must be prepared for model training and evaluation through data cleaning, filtering, normalization, and reconstruction. First, invalid records, such as shield machine downtime and null values, are removed. Outliers are identified with the 3-sigma rule and replaced by the mean of the adjacent data points. The outlier-free data are then passed through a Butterworth filter for denoising. Next, min–max normalization rescales the data to between zero and one. Finally, the sliding window method reformats the time series as a supervised learning problem.
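The preparation steps can be sketched as follows. This is a minimal illustration using only the Python standard library, not the authors' implementation: the function names are hypothetical, the Butterworth denoising step is omitted (it would normally come from a signal-processing library), and the one-step-ahead target in the windowing function is an assumption.

```python
from statistics import mean, pstdev


def replace_outliers_3sigma(values):
    """Replace points beyond mean +/- 3*std with the mean of adjacent valid points."""
    mu, sigma = mean(values), pstdev(values)
    is_outlier = [abs(v - mu) > 3 * sigma for v in values]
    cleaned = list(values)
    for i, flagged in enumerate(is_outlier):
        if not flagged:
            continue
        neighbours = [values[j] for j in (i - 1, i + 1)
                      if 0 <= j < len(values) and not is_outlier[j]]
        # Assumption: fall back to the global mean when no valid neighbour exists.
        cleaned[i] = mean(neighbours) if neighbours else mu
    return cleaned


def min_max_normalize(values):
    """Rescale values linearly to the interval [0, 1]."""
    lo, hi = min(values), max(values)
    span = hi - lo
    if span == 0:
        return [0.0 for _ in values]
    return [(v - lo) / span for v in values]


def sliding_windows(series, window):
    """Pair each run of `window` consecutive values with the value that follows it."""
    count = len(series) - window
    inputs = [series[i:i + window] for i in range(count)]
    targets = [series[i + window] for i in range(count)]
    return inputs, targets


# Synthetic signal with one spike at index 15.
raw = [float(i % 5) for i in range(30)]
raw[15] = 100.0
cleaned = replace_outliers_3sigma(raw)
scaled = min_max_normalize(cleaned)
inputs, targets = sliding_windows(scaled, window=3)
```

Each entry of `inputs` holds three consecutive normalized values and the matching entry of `targets` holds the next value, which is the supervised form that the sliding window step produces.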
Detailed statistics for all 18 features are provided in
Table 4. The selected segment encompasses diverse geological transitions to ensure comprehensive model evaluation.
3.2. Training Details and Implementation
The processed data are split by position in the sequence: the first 70% is used for model training, the segment from 70% to 80% is used as a validation set to monitor potential overfitting, and the remaining 20% is used to test the model’s performance. The model input includes parameters A to O over the historical window ending at time t, together with one of the historical attitude parameters (P–S) over the same window. To determine the appropriate window length, this study examined the influence of various window size settings on model prediction performance.
Table 5 and
Table 6 present the model evaluation results for each window length setting, using the average MAE and RMSE as the evaluation metrics. The results indicate that a window length of three is the most appropriate choice, yielding the best performance in terms of average MAE and RMSE. When the window length is less than three, the model’s prediction accuracy is significantly compromised. When the length exceeds three, the accuracy improvement brought by the window length becomes very minimal. Considering the data volume and model runtime, the window length is set to three.
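The 70/10/20 partition described above can be illustrated with a short sketch. It assumes the samples are kept in their original order and cut by position, which is how the "70% to 80%" wording reads; the function name is hypothetical. Integer percentages are used so that the cut points are not affected by floating-point rounding.

```python
def split_by_position(samples, train_pct=70, val_pct=10):
    """Split an ordered sequence into train, validation, and test parts."""
    n = len(samples)
    train_end = n * train_pct // 100
    val_end = n * (train_pct + val_pct) // 100
    return samples[:train_end], samples[train_end:val_end], samples[val_end:]


# 14,689 is the number of samples reported for rings 670 to 819.
indices = list(range(14689))
train, val, test = split_by_position(indices)
sizes = (len(train), len(val), len(test))
```

With the reported 14,689 samples this gives 10,282 training, 1,469 validation, and 2,938 test samples.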
The proposed ST-GC-GRU model includes one ST-GCN layer and two GRU layers. Because the model hyperparameters are crucial for prediction performance, they are chosen through a comparative analysis of different configurations. This study considers key candidates such as the number of GRU layers and the number of neurons in the ST-GCN and GRU layers. The number of ST-GCN layers is set to one, with candidate neuron counts of [16, 32, 64]. The GRU can have two to three layers, with candidate neuron counts of [64, 128, 256]. A total of six combinations covering all candidate parameters are selected, with detailed parameter choices and the average MSE results for each output listed in Table 7. According to the parameter sensitivity analysis, combination 2 outperformed the other combinations. Therefore, the number of mapping neurons in the ST-GCN layer of the proposed ST-GC-GRU model is set to 32, and the number of neurons in each of the two GRU layers is set to 128. The optimizer is Adam, and the loss function is MSE. A summary of the model structure is shown in Table 8.
Figure 10 shows the loss function curves for both training and validation datasets across four attitude parameters. It is evident that the training and validation losses rapidly decrease within the first 10 epochs, subsequently stabilizing at a low level within 100 epochs, indicating that the training has been completed.
3.3. Analysis of Results
To assess the accuracy of the proposed model, this study evaluates its prediction performance on the test set.
Figure 11,
Figure 12,
Figure 13 and
Figure 14 show the predicted values and actual values for the corresponding four attitude parameters. To quantify prediction performance, the study employs MAE and RMSE as evaluation metrics.
Table 9 summarizes the prediction results of the proposed model.
The four output line charts indicate that the predicted values align closely with the actual values and follow consistent trends. This demonstrates that the model proposed in this study can achieve high-precision prediction for all four attitude parameters. The model’s performance on the MAE and RMSE metrics is particularly noteworthy, with the average MAE and RMSE across HDH, HDT, VDH, and VDT being as low as 0.3804 and 0.4991, respectively. The overall MAE does not exceed 0.58. Regarding RMSE, which is more sensitive to outliers, the overall accuracy is also high, confirming that the model is well suited to shield attitude prediction tasks. However, the prediction results for VDH are relatively poor, with a worse RMSE than the other three attitude parameters. A possible explanation is that the VDH data contain more sudden changes, while the predicted values are smoother and more continuous than the actual values. Conversely, for VDT, which exhibits smoother data, the prediction performance is excellent, as can be seen from the distribution of the test set.
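For reference, the two metrics can be computed as in the minimal sketch below, which uses the standard definitions of MAE and RMSE. The toy values are illustrative only and are not taken from the project data.

```python
import math


def mae(actual, predicted):
    """Mean absolute error."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean squared error."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))


# Three small errors and one large error.
actual = [0.0, 0.0, 0.0, 0.0]
predicted = [1.0, 1.0, 1.0, 3.0]
toy_mae = mae(actual, predicted)
toy_rmse = rmse(actual, predicted)
```

In this toy case the single large error raises the RMSE above the MAE, which is the sense in which RMSE is more sensitive to sudden changes such as those in the VDH data.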
Table 10 presents the denormalized prediction errors on the test set. For the tolerance threshold of 50 mm, while HDT shows an MAE of 52.15 mm, this reflects the parameter’s large variation range (126 mm, from −89 mm to +37 mm) in the mixed geological conditions of this project. The key insight is that the normalized MAE of 0.41 indicates that the model captures 59% of the variation pattern, leaving only 41% as prediction error relative to the total variability. Similarly, for VDH with a 76 mm range, the normalized MAE of 0.57 shows that the model explains 43% of the variation. For practical application, these results demonstrate that the model effectively tracks attitude trends and provides reliable early warnings before deviations approach critical thresholds, enabling operators to implement timely corrective actions. The particularly strong performance on HDH (1.80 mm MAE, 3.6% of tolerance) and VDT (21.36 mm MAE, 42.7% of tolerance) further validates the model’s practical utility.
The denormalized error metrics demonstrate that the proposed model achieves prediction accuracy that is well-suited for practical construction control. For HDH and VDT, the MAE values remain within 43% of allowable tolerances, enabling operators to implement timely corrective measures before critical deviations occur. While HDT shows larger absolute errors due to its substantial variation range in mixed geology, the model’s ability to capture 59% of variation patterns provides reliable trend tracking for proactive attitude control. These performance levels are adequate for real-time shield guidance systems, where early warning capability is more critical than absolute prediction precision.
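The arithmetic linking the normalized and denormalized errors can be checked with a few lines of code. The sketch assumes min–max scaling, so that an error in normalized units converts to millimetres by multiplying by the parameter's value range; the function names are hypothetical, and the inputs are the figures quoted above.

```python
TOLERANCE_MM = 50.0  # tolerance threshold quoted in the text


def denormalize_error(normalized_error, value_range_mm):
    """Convert a min-max-normalized error back to millimetres."""
    return normalized_error * value_range_mm


def share_of_tolerance(mae_mm, tolerance_mm=TOLERANCE_MM):
    """Express an MAE in millimetres as a percentage of the tolerance."""
    return 100.0 * mae_mm / tolerance_mm


hdt_range_mm = 37.0 - (-89.0)  # 126 mm, from -89 mm to +37 mm
hdt_mae_mm = denormalize_error(0.41, hdt_range_mm)
vdh_mae_mm = denormalize_error(0.57, 76.0)
hdh_share = share_of_tolerance(1.80)
vdt_share = share_of_tolerance(21.36)
```

The two shares reproduce the reported 3.6% and 42.7%. The recomputed MAE values (about 51.7 mm for HDT and 43.3 mm for VDH) are slightly below the reported 52.15 mm and 43.55 mm, presumably because the normalized values quoted in the text are rounded to two decimals.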
As shown in
Figure 15, the VDH parameter exhibits noticeably larger prediction errors (MAE: 43.55 mm, normalized MAE: 0.57) compared to other attitude parameters. Segment-wise analysis reveals that this degraded performance is primarily concentrated in high-deviation ranges (|VDH| > 40 mm), where the RMSE increases to 2.95 mm compared to 1.05 mm in low-deviation segments (|VDH| < 20 mm), representing a 180% increase. This performance degradation in high-change segments can be attributed to several key factors: (1) data sparsity in extreme deviation scenarios limits the model’s exposure to critical cases despite comprising 41.6% of samples, as these events exhibit substantially higher variability; (2) increased geological complexity during large deviations, where sudden transitions (rock–soil interfaces, water-bearing zones) introduce highly nonlinear shield–geology interactions not fully captured by current features; (3) operating mode shifts, as operators implement aggressive corrective measures during severe deviations, introducing additional system dynamics that increase prediction complexity. To address these limitations, future improvements should focus on implementing attention mechanisms to dynamically emphasize geological transition indicators during high-deviation scenarios, developing residual correction modules trained specifically on extreme deviation subsets with customized loss functions, and incorporating multi-horizon loss functions that assign higher penalties to large deviations, thereby improving model sensitivity to critical control situations.
In addition to evaluating the model prediction accuracy, the time cost of model training is also a crucial evaluation criterion. Model parameters are not consistently fixed and require adjustments according to different application scenarios and shield machines in practical engineering applications. Therefore, updating model parameters is essential and must be emphasized in the engineering application of data-driven models. Referring to
Table 11, this study summarizes the training time costs for various time window lengths. It is noteworthy that the training time fully meets the needs of practical engineering. Using 100 epochs as a standard, completing the model training takes only about 3 min. This implies that if it is necessary for the model to adapt to a new engineering environment and update its parameters with new data, it only needs about three minutes. As the number of input model parameters increases, the time required for model training also increases. Given that the test output time of the models in this study is relatively fast, typically within 1 s, it is not discussed further in this study.
4. Discussion
4.1. Comparison with Other Models
To further verify the superior performance of the proposed method, this study uses the same dataset to train and evaluate several commonly used methods. This section consists of two parts: ablation experiments and model comparison. The ablation experiments verify the improvement introduced by two strategies, the time decay function and the time decomposition method, by removing each from the proposed model separately and analyzing the resulting predictions.
Table 12 and
Table 13 list the MAE and RMSE values of the two ablation experiments. The tables include six models. GCN-GRU is an ablation model with the time decay function removed: the decay function is simply set to a constant value of 1, which reduces it to a standard GCN structure. The TS_GRU model is a GRU model that uses the time decomposition method; compared to TS_GRU, the plain GRU model therefore lacks the trend term and the residual term in its input. The GRU model is a standard two-layer GRU network whose hidden-layer neuron count is set to the same value as in the proposed model, ensuring the validity of the comparison. TS_LSTM is an LSTM model that uses the time decomposition method. Both LSTM and GRU are commonly used time series models, so both are included in the ablation analysis to verify the generalizability of the decomposition strategy. It should be emphasized that the number of hidden-layer neurons in the LSTM is set to the same value as in the GRU, because this part mainly examines whether the added strategies improve predictive performance. Therefore, the dataset, hyperparameters, and number of hidden layers are kept consistent with the proposed model.
The ablation results show that the MAE and RMSE values of the proposed model are markedly lower than those of the other five methods for all four attitude parameters. Specifically, compared to the standard GCN-GRU, the average MAE and RMSE of the proposed model are reduced by 18.61% and 17.89%, respectively. Compared to the GRU model without any improvement measures, the MAE and RMSE of the proposed model are reduced by 52.44% and 57.07%, respectively, indicating a significant enhancement. For the time decomposition representation, this study used GRU and LSTM as baseline models and added the attitude decomposition features to the model input. The TS_GRU and TS_LSTM results show that decomposition reduces the MAE and RMSE of the GRU model by an average of 16.69% and 19.67%, respectively, and those of the LSTM model by an average of 11.98% and 18.05%. It is worth noting that RMSE, which is sensitive to outliers, decreases more markedly, suggesting that temporal decomposition helps, to a certain extent, in handling highly variable attitude data.
In addition to the above ablation models, this study also tests some popular deep learning and machine learning models, namely CNN, CNN-GRU, GRU, LSTM, Temporal Convolutional Network (TCN), Random Forest (RF), and Extreme Gradient Boosting (XGBoost). Among them, CNN, CNN-GRU, and LSTM are the typically utilized models for shield attitude prediction. The TCN model, proposed in recent years for time series data, enhances the network’s receptive field through dilated and causal convolutions, thereby improving prediction performance. Related literature has confirmed that it has achieved better results than LSTM and GRU in many prediction tasks [
47]. The RF model is an ensemble method based on Bagging, with each base learner computed in parallel. RF improves the accuracy of decision trees (DT) by introducing random attribute selection during the training process of the decision tree [
48]. The XGBoost model, on the other hand, is an efficient gradient-boosting decision tree algorithm, which improves the original GBDT by using the ensemble concept of Boosting to integrate multiple weak learners into a strong learner through certain methods [
49]. To ensure the validity of the experiment, the hyperparameter settings of the models used in these comparative methods are consistent with the proposed model. For the CNN model, three convolutional layers, two pooling layers, and two fully connected layers are set. The CNN-GRU model introduces two layers of GRU after the CNN layers, with 128 neurons in the GRU hidden layer. The GRU and LSTM settings are the same as in the ablation experiments. The TCN model is configured with four convolutional layers, all with 128 output channels, and two fully connected layers for output. The activation function, loss function, optimizer, and evaluation metrics remain identical to those employed in the preceding experiment.
Table 14 and
Table 15 show the specific experimental results.
From the comparison results, it is evident that the proposed model significantly outperforms other models according to overall prediction accuracy. When compared to the popular CNN-GRU model, the proposed model reduces the average MAE and RMSE by 41.88% and 43.13%, respectively. The prediction performance of the two machine learning models is notably inferior, which is likely due to the substantial amount of large-variation data in VDH, making the machine learning models less effective in fitting abnormal data. It should be mentioned that machine learning models are highly constrained by the distribution of training data in regression tasks, and if the data range exceeds that of the training data, their prediction performance may be severely limited. Compared to the RF model based on Bagging integration, the proposed model reduces the average MAE and RMSE by 74.42% and 75.50%, respectively. The improvements are even more pronounced when compared to the XGBoost model based on Boosting integration. This study visualizes the prediction details of four attitude parameters, with
Figure 16,
Figure 17,
Figure 18 and
Figure 19 illustrating the prediction results of each model.
To comprehensively evaluate the computational efficiency of different models,
Table 16 presents a detailed comparison of model complexity and inference performance. All inference time measurements were conducted on the hardware configuration described in
Section 2.4 (Intel Core i7 12700H processor with NVIDIA GeForce RTX 3060 GPU) using a batch size of 32, with results averaged over 1000 prediction iterations to ensure statistical reliability.
For tree-based models (RF, XGBoost), the parameter count represents the total number of leaf nodes across all trees in the ensemble. Inference times for these models include the overhead of ensemble prediction across all trees.
As detailed in
Table 16, the proposed ST-GC-GRU model contains 177,923 trainable parameters, representing only a 1.2% increase over baseline GRU (175,745 parameters) while achieving 47.3% lower MAE, demonstrating superior parameter efficiency. The model achieves an average inference time of 4.50 ms per prediction step on the specified hardware (
Section 2.4), representing merely 0.45% of typical TBM control cycles (1–2 s) and enabling seamless real-time deployment. Compared to baseline architectures, the proposed model delivers comparable or better computational efficiency (4.50 ms vs. GRU: 4.44 ms, CNN-GRU: 4.51 ms, LSTM: 6.50 ms, TCN: 5.12 ms) while substantially outperforming all methods in prediction accuracy, establishing practical viability for industrial shield tunneling control systems.
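The efficiency figures above can be reproduced with a short sketch. The timing helper shows one way to average latency over repeated calls, as in the 1000-iteration protocol. Its name, the warm-up phase, and the use of wall-clock timing are assumptions, and on a GPU the device would also need to be synchronized before the clock is read. The two arithmetic helpers simply recompute the quoted percentages.

```python
import time


def mean_latency_ms(predict, batch, iterations=1000, warmup=10):
    """Average wall-clock time of predict(batch), in milliseconds."""
    for _ in range(warmup):  # warm-up calls are excluded from the timing
        predict(batch)
    start = time.perf_counter()
    for _ in range(iterations):
        predict(batch)
    return (time.perf_counter() - start) * 1000.0 / iterations


def percent_increase(new, baseline):
    """Relative increase of `new` over `baseline`, in percent."""
    return 100.0 * (new - baseline) / baseline


def share_of_cycle_percent(latency_ms, cycle_s):
    """Latency as a percentage of a control cycle given in seconds."""
    return 100.0 * latency_ms / (cycle_s * 1000.0)


param_overhead = percent_increase(177_923, 175_745)
share_1s = share_of_cycle_percent(4.50, 1.0)
share_2s = share_of_cycle_percent(4.50, 2.0)
```

The parameter overhead comes out at roughly 1.2%, and the 4.50 ms latency corresponds to 0.45% of a 1 s control cycle, or about half that share of a 2 s cycle.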
4.2. The Analysis of the Time Decay Function
The primary distinction between the proposed model and a traditional GCN is the introduction of temporal information decay. This concept, which is applicable in many domains, attenuates the influence of previously memorized content according to elapsed time: the longer the elapsed time, the less impact the earlier information has on the predicted output [47,50]. The same idea applies to shield tunneling. Specifically, future attitude changes of the shield machine depend heavily on its recent historical attitude and current attitude. Typically, the current attitude is the most strongly correlated, whereas data from further in the past have a diminishing influence on future attitude changes, reflecting the decaying relationship of attitude information over time. In this study, the exponential decay function is selected as the model’s decay function. Additionally, constant decay functions [1, a, a²] (a is 0.85 according to tuning) and logarithmic functions, which also possess decay capabilities, are employed in the analysis of prediction effects.
Figure 20 illustrates the geometric interpretation of the three decay functions. The model is retrained and tested with each decay function while all other settings are kept constant. The results, shown in
Table 17, indicate that for all four attitude deviation values, the exponential time decay function yields better RMSE than the other two decay functions. The exponential decay function is therefore adopted as the most appropriate choice.
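To make the comparison concrete, the following minimal sketch builds a decay matrix from each of the three families of decay function. Only the constant decay weights [1, a, a²] with a = 0.85 come from the text; the exponential rate, the logarithmic form, and all function names are assumptions chosen for illustration and may differ from the forms used in the study.

```python
import math

def exponential_decay(lag, rate=0.5):
    # Assumed form exp(-rate * lag); the rate value is hypothetical.
    return math.exp(-rate * lag)

def constant_decay(lag, a=0.85):
    # Weights 1, a, a^2, ... with a = 0.85 as stated in the text.
    return a ** lag

def logarithmic_decay(lag):
    # Assumed form 1 / (1 + ln(1 + lag)); equals 1 at lag 0 and decays slowly.
    return 1.0 / (1.0 + math.log1p(lag))

def decay_matrix(num_steps, decay_fn):
    # Entry (i, j) weights the link between time steps i and j by their lag.
    return [[decay_fn(abs(i - j)) for j in range(num_steps)]
            for i in range(num_steps)]

for name, fn in [("exponential", exponential_decay),
                 ("constant", constant_decay),
                 ("logarithmic", logarithmic_decay)]:
    first_row = decay_matrix(3, fn)[0]
    print(name, [round(w, 4) for w in first_row])
```

Each matrix is symmetric with ones on the diagonal, so a time step always weights itself most strongly and weights more distant steps progressively less.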
Although this method achieves favorable prediction results, there remains room for further investigation. This study assumes a fixed time interval when constructing the decay matrix. In actual engineering applications, however, the time intervals between consecutive data collections during tunneling are inconsistent, so the design of the decay matrix may need to vary accordingly. For instance, if the tunneling speed of the shield machine is relatively high, the difference in attitude between consecutive samples can be more pronounced, leading to weaker relationships between data points. When the tunneling speed is relatively low, the future attitude of the machine may depend more on its current attitude, or even be entirely determined by its current state. Adapting the decay matrix to irregular sampling intervals is therefore a worthwhile direction for future research.
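One possible way to account for inconsistent sampling intervals is to compute the decay weights from the actual time gaps rather than from the step index. The sketch below is an illustration of that idea only, not part of the proposed model; the exponential form, the rate value, the timestamps, and the function name are all hypothetical.

```python
import math

def time_aware_decay_matrix(timestamps, rate=0.1):
    # Entry (i, j) decays with the real elapsed time |t_i - t_j|
    # instead of the index distance |i - j|.
    return [[math.exp(-rate * abs(ti - tj)) for tj in timestamps]
            for ti in timestamps]

# Hypothetical acquisition times in seconds with uneven gaps (10 s, 5 s, 30 s).
timestamps = [0.0, 10.0, 15.0, 45.0]
D = time_aware_decay_matrix(timestamps)

# Adjacent samples separated by a short gap are linked more strongly
# than adjacent samples separated by a long gap.
print(round(D[1][2], 4), round(D[2][3], 4))
```

With a fixed-interval matrix, both adjacent pairs would receive the same weight; here the pair separated by 30 s is down-weighted relative to the pair separated by 5 s.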
4.3. The Analysis of the Temporal Decomposition
This study employs the time decomposition method to decompose the attitude data into long-term trend components and short-term residuals, effectively mitigating the impact of anomalous data on the model’s predictive performance. However, quantitative metrics alone are not fully persuasive. Therefore, data segments with significant changes in attitude are selected, and the prediction results with and without time decomposition are visualized to examine the differences directly, as illustrated in
Figure 21.
The dashed segments in the first three figures show that the shield machine’s attitude fluctuates substantially. The final VDT figure presents only a partial segment because the data varied little throughout the collection period. Examination of the prediction details in the first three figures reveals that predictions made without time decomposition share a characteristic limitation: in regions of significant variation, the predicted values frequently approximate the actual values from the preceding time step, creating an apparent temporal lag in which predictions mirror historical observations. This behavior is consistent with the operational patterns of shield machines observed in engineering practice: when attitude variations remain minimal, the future attitude of the shield machine depends predominantly, or even exclusively, on its current state. However, this prediction pattern only ensures approximate trend alignment with the actual values, and notable prediction errors persist. Incorporating attitude decomposition information substantially mitigates this limitation. As demonstrated in the attitude decomposition analysis presented in
Section 2.3.1, the original data is decomposed into a trend component, representing long-term evolutionary patterns, and a residual component, capturing short-term variations. These components provide the features of the long-term trajectory trend and the short-term attitude fluctuations, respectively. This decomposition enables the model to capture the fundamental trend of the original data during periods of significant attitude change while reducing the adverse impact of abrupt data fluctuations on predictive performance.
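The trend and residual split can be illustrated with a simple trailing moving average. This is a minimal sketch under that assumption; the decomposition actually used in Section 2.3.1 may differ, and the function name, window length, and sample values below are hypothetical.

```python
def decompose(series, window=5):
    # Trend: trailing moving average over at most `window` recent samples,
    # so no future values are used. Residual: what remains after removing it.
    trend = []
    for i in range(len(series)):
        start = max(0, i - window + 1)
        segment = series[start:i + 1]
        trend.append(sum(segment) / len(segment))
    residual = [x - t for x, t in zip(series, trend)]
    return trend, residual

# Hypothetical attitude deviation samples (mm) with one abrupt spike.
deviation = [10.0, 10.5, 11.0, 11.5, 30.0, 12.5, 13.0]
trend, residual = decompose(deviation, window=3)

for x, t, r in zip(deviation, trend, residual):
    print(f"value={x:5.1f}  trend={t:6.2f}  residual={r:6.2f}")
```

In this toy series the spike is absorbed mostly by the residual, while the trend moves more gradually, which is the separation that lets a model treat long-term drift and short-term fluctuation as distinct inputs.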
5. Conclusions
This investigation proposes a novel hybrid methodology integrating spatial–temporal graph networks and attitude decomposition techniques for enhanced shield attitude prediction. The proposed approach effectively models both temporal correlations between sequential timestamps and spatial correlations across different temporal instances within shield tunneling operational datasets. Specifically, the methodology employs time decomposition to augment the temporal representation of shield tunneling attitude dynamics, systematically extracting long-term evolutionary trends and short-term variational patterns from attitude change sequences. Subsequently, a time decay function is implemented to construct a dynamic decay graph that interconnects historical values across all timestamps based on temporal proximity, enabling the comprehensive capture of spatial–temporal dependencies through consideration of feature correlations at different temporal intervals. Comparative analysis demonstrates that the proposed model significantly outperforms existing methodologies in terms of prediction accuracy. The effectiveness of the proposed approach is validated through application to the Karnaphuli River Tunnel Project as a comprehensive case study. The experimental results indicate that: (1) Implementation of the time decay matrix enables gradual attenuation of historical data’s influence on model predictions according to respective temporal relationships, thereby substantially improving predictive accuracy. Ablation studies demonstrate that the proposed model significantly outperforms standard GCN architectures in processing diverse shield attitude datasets. (2) The time decomposition methodology successfully extracts temporal patterns from shield attitude operational data, effectively mitigating the impact of anomalous data perturbations on predictive performance and substantially enhancing attitude prediction accuracy.
Despite achieving superior predictive performance and computational efficiency for shield machine attitude estimation, the ST-GC-GRU model exhibits certain inherent limitations that warrant future investigation. First, the current time-decay matrix assumes fixed temporal intervals between consecutive measurements, which may not fully capture temporal relationships under highly irregular sampling patterns caused by variable excavation speeds or operational interruptions. Future research should explore adaptive decay mechanisms that directly incorporate actual time gaps, such as continuous-time models (Neural ODEs), time-aware attention mechanisms, or event-driven recurrent architectures to enhance robustness across diverse sampling conditions. Second, while temporal decomposition effectively mitigates the impact of abrupt data variations, external factors such as human operational interventions also significantly influence shield machine attitude dynamics. Integrating physics-guided modeling strategies, as demonstrated in physics-informed degradation prediction [
51], would balance data-driven flexibility with physics-based reliability by incorporating established mechanical principles (force equilibrium, geometric constraints) into the ST-GC-GRU framework, improving both interpretability and prediction accuracy in data-scarce scenarios [
52]. Third, this study focuses on single-step-ahead prediction aligned with real-time shield control requirements, where continuous sampling frequency enables iterative predictions to guide real-time corrective actions. Unlike prognostic tasks requiring long-horizon forecasting for maintenance planning [
53], shield attitude prediction serves real-time operational control where immediate accuracy is paramount. Future work could extend the framework to multi-step forecasting for planning applications through: (1) recursive multi-step prediction strategies, (2) uncertainty quantification for confidence-aware planning, and (3) hybrid single-step control with multi-step planning for enhanced decision-making under complex geological conditions.
While this study demonstrates strong predictive performance on the Karnaphuli River Tunnel Project in Bangladesh, which encompasses diverse geological conditions (soft clay, silty sand, and “upper soft and lower hard” composite strata), the generalizability of ST-GC-GRU to other shield tunneling projects, TBM types, and significantly different geological contexts remains to be validated. The segment-wise performance analysis reveals that prediction accuracy varies with geological complexity, particularly in high-deviation scenarios where sudden geological transitions introduce highly nonlinear shield–geology interactions. Future research should systematically evaluate the framework across multiple projects with distinctly different geological conditions; transfer learning or domain adaptation techniques may be necessary to ensure robust performance across diverse tunneling environments without complete model retraining.