4.2.1. Model Evaluation Indicators
To comprehensively evaluate the predictive performance of the model, this study employs three commonly used metrics in time series forecasting to assess the model’s predictive capability. During the forecasting process, assuming there are samples, where represents the actual value of the i-th sample and denotes the predicted value of the i-th sample generated by the model, the following metrics are interpreted and analyzed:
- (1)
Coefficient of Determination (R2)
The coefficient of determination (R
2) describes the goodness-of-fit of the model to the data, with a value range of [−∞, 1]. An R
2 value closer to 1 indicates better predictive performance of the model, while an R
2 value approaching 0 suggests poor predictive capability. The formula is given as follows:
- (2)
Mean Absolute Error (MAE)
The Mean Absolute Error (MAE) calculates the average absolute difference between the predicted values and the actual values. For model evaluation, a smaller MAE indicates better performance, as it reflects closer alignment between predicted and actual data on average. The formula is defined as follows:
- (3)
Mean Absolute Percentage Error (MAPE)
The Mean Absolute Percentage Error (MAPE) quantifies the average percentage difference between predicted and actual values, providing a relative measure of prediction error scaled by the magnitude of the true values. Generally, a lower MAPE indicates better model performance, as it reflects smaller deviations between predictions and ground truth, thereby implying higher predictive accuracy. The formula is expressed as follows:
4.2.2. Experimental Analysis
In this section, comparative experiments were conducted to validate the superiority of the proposed model in sample prediction. The LSTM and RNN models were selected for comparison, with their network parameters consistent with those listed in
Table 2. For different equipment and various dimensions, we trained deep learning networks separately for prediction. Comparative graphs of left mud pump power (representative of dredge pump operations as both left and right pumps exhibit similar operational characteristics), high-pressure flushing power, and total power prediction are presented. As these parameters are critical factors affecting dredging operation efficiency and energy consumption, their prediction comparison plots can effectively demonstrate the performance differences among various models in power prediction, thereby sufficiently illustrating the effectiveness and advantages of the proposed method.
All models employed the Huber loss function as their objective function and were optimized using the Adam algorithm (β1 = 0.9, β2 = 0.999, ε = 10−8) with a learning rate of 0.003. Each model was trained for up to 100 epochs, with early stopping applied if the validation loss did not improve for 10 consecutive epochs, to prevent overfitting. The activation functions were carefully selected for different components: LSTM gates utilized sigmoid functions for forget/input/output gate controls to maintain [0, 1] gating behavior, while tanh activation governed cell state updates to ensure smooth gradient flow. For feature transformation in dense layers, ReLU activation was adopted to enhance nonlinear representation capability, and a linear activation function was applied at the output layer to accommodate the regression task’s continuous value prediction requirements. After completing the model configuration, predictions were conducted for key parameters, including the left mud pump power, high-pressure flushing power, and total generated power. This study adopts a multivariate time-aligned input architecture that precisely captures the dynamic variations in historical power sequences across multiple consecutive time steps through dedicated power channels. This architecture simultaneously integrates processing parameter channels to acquire multidimensional operational parameters (including critical variables such as rotational speed and slurry concentration) within the same temporal window. Furthermore, a specially designed environmental channel incorporates time-delay embedding techniques to handle slowly varying environmental parameters like water depth and soil type. This multi-scale fused input structure not only ensures the real-time responsiveness to high-frequency dynamic parameters but also accounts for the long-term effects of slowly changing environmental factors, thereby providing the model with comprehensive and harmonized multi-source spatiotemporal feature representation.
The predictive performance of the model plays a critical role in both equipment-level and system-level dimensions. At the equipment level, the prediction model analyzes real-time operational status, load characteristics, and environmental parameters of individual devices, providing precise start-stop decision support for power equipment with automatic power reduction capabilities. This ensures optimal operation within the most efficient power range. At the system level, the model generates system-wide power demand predictions based on macro-level data, including grid load distribution, generation output, and network topology. These predictions directly trigger the PMS (Power Management System)’s dynamic power adjustment mechanism. At the equipment level, our analysis focuses on two critical power parameters: (1) the left mud pump power (
Figure 8), which reflects the main excavation energy consumption, and (2) the high-pressure flush power (
Figure 9), representing the cleaning system’s energy demand. At the system level,
Figure 10 presents the total generated power prediction, which integrates all subsystem demands for comprehensive energy management.
As evident from the figures, the RNN demonstrates suboptimal performance in predicting left mud pump power, high-pressure flushing power, and total power. While RNNs can capture basic temporal trends, they exhibit significant limitations in modeling long-term dependencies, often resulting in prediction lag when handling complex nonlinear fluctuations during continuous operations. In contrast, LSTM achieves notably higher prediction accuracy than RNN, generally approximating the variation trends. This improvement stems from two key factors: on the one hand, LSTMs better capture the dynamic power variation patterns during dredging operations under complex working conditions; on the other hand, the incorporation of forget gates, input gates, and output gates addresses the vanishing gradient problem, thereby enhancing long-term sequence dependency learning. However, prediction inaccuracies persist, as exemplified by the failure to accurately predict certain peak values in
Figure 8b and the oversimplified curve representation in
Figure 9b, which inadequately captures detailed power curve characteristics, indicating substantial room for improvement.
By integrating improved dilated convolution with LSTM, the proposed Dilated-LSTM demonstrates superior performance over the other two models in predicting left mud pump power, high-pressure flushing power, and shaft-generated power. The model not only accurately captures the true variation trends of parameters but also achieves excellent fitting between actual and predicted curves. While maintaining parameter efficiency, the Dilated-LSTM effectively captures multi-scale temporal features and mitigates gradient vanishing issues during long-sequence training, thereby enabling more precise data predictions. These results validate the significance of the load forecasting method proposed in this study.
To objectively evaluate the predictive performance of different models, quantitative comparisons were conducted using three metrics: R
2, MAE, and MAPE. The specific numerical results for each prediction model are presented in the
Table 3,
Table 4 and
Table 5.
Experimental results demonstrate that the Dilated LSTM consistently outperforms other models across all prediction tasks. For the left dredge pump power prediction, the Dilated LSTM achieved an R2 of 0.99, surpassing RNN and LSTM by 21% and 3%, respectively. Additionally, it reduced MAE and MAPE by 75.3% and 83.3%, indicating significantly superior error control compared to conventional models. Similarly, in high-pressure flushing power and total generated power prediction, the Dilated LSTM maintained a stable R2 of 0.99, with MAE reductions exceeding 62% and MAPE consistently below 1.07%, further validating its exceptional prediction accuracy. When considering the average performance across all three metrics, the Dilated LSTM improved R2 by 20.7% and 10.3% over RNN and LSTM, respectively, while reducing MAE by 74.8% and 54.4%, and optimizing MAPE by 80.1% and 45.2% on average.
These quantitative results conclusively demonstrate that the Dilated LSTM, through its enhanced ability to capture temporal features, achieves superior stability and adaptability in load forecasting, highlighting its significant potential for engineering applications.
4.2.3. Ablation Study on Dilation Factor Selection
To evaluate the impact of different dilation rate configurations on the performance of the proposed Dilated-LSTM model, we conducted an ablation study by comparing several combinations of dilation factors. The experimental subject is the left mud pump. Specifically, we experimented with the following configurations: Group 1: Single dilation factor (1); Group 2: Small-scale factors (1, 3); Group 3: Proposed (1, 3, 7); Large-scale factors (1, 7, 15).
Each configuration was used to train the model under identical settings, including the same optimizer, learning rate, and number of epochs. The models were evaluated on a validation set using R2, MAE, and MAPE as the primary metrics.
The ablation study results demonstrate the critical impact of dilation factor selection on model performance. As shown in
Table 6, the proposed (1, 3, 7) combination achieves superior performance (R
2 = 0.99, MAE = 8.96 kW, MAPE = 0.95%) compared to other configurations. The single dilation factor (1) (equivalent to standard LSTM) exhibits the weakest performance (R
2 = 0.91, MAE = 18.6 kW), confirming its limitation in capturing long-term dependencies. While the (1, 3) combination shows improved accuracy (R
2 = 0.96), it still underperforms the proposed method by 3% in R
2, suggesting the necessity of incorporating larger dilation scales. Notably, the (1, 7, 15) configuration shows degraded performance (MAE = 10.4 kW) compared to (1, 3, 7), indicating that excessive dilation rates may overlook important local features. These results validate that the (1, 3, 7) combination optimally balances multi-scale temporal feature extraction, where d = 1 captures instantaneous fluctuations, d = 3 models medium-term operational patterns, and d = 7 identifies long-duration load trends characteristic of dredging cycles. The 52% reduction in MAE compared to single-factor LSTM demonstrates the effectiveness of this hierarchical dilation design in addressing TSHD’s complex load prediction challenges.