4.1. Performance Evaluation of the Surrogate Models
In offshore wind turbine systems subjected to complex wind and wave conditions, the mapping from environmental inputs to foundation fatigue response is strongly nonlinear. According to IEC 61400-3-1, wind speed intervals should be smaller than 2 m/s, wave height intervals should be less than 0.5 m, and wave period intervals should be less than 0.5 s for fatigue damage estimation [
32]. Considering the parameter settings in
Section 3.2, at least 2500 finite element simulations under different environmental conditions are required in fatigue damage estimation, resulting in substantial computational effort.
Four surrogate models are introduced to mitigate this limitation: Extreme Gradient Boosting (XGBoost), Random Forest (RF), Support Vector Regression (SVR), and Gaussian Process Regression (GPR). These models are widely used in nonlinear regression problems and have demonstrated strong performance in wind energy applications. For model development, a dataset consisting of 60 environmental conditions is generated as described in
Section 3.2, among which 45 samples are used for training and 15 samples are used for validation. The surrogate models are constructed to establish an efficient predictive relationship between environmental conditions and fatigue response. XGBoost has demonstrated strong performance in nonlinear regression and engineering prediction tasks, particularly for high-dimensional and complex datasets. Each new tree is trained to capture the residuals of the existing ensemble, enabling sequential model construction. The learning process aims to minimize an objective function consisting of a loss term and a regularization term, which jointly control the fitting error and model complexity [
40,
41]. For tree-based learners, the model prediction can be expressed as the summation of multiple regression trees:
where
K indicates the number of trees and
corresponds to a regression tree. In this context, the maximum tree depth determines the complexity of individual trees, while the
K controls the ensemble size and prediction stability. The evaluation adopts the mean absolute percentage error (
MAPE), expressed as:
where
pe denotes the predicted value,
ps denotes the simulated value from FEM.
As shown in
Table 5, the
MAPE of the XGBoost model decreases with an increasing number of trees
K and gradually converges to 33.29% when the number of trees reaches 300. Meanwhile, it is observed that the prediction accuracy is relatively insensitive to the maximum tree depth. This behavior can be attributed to the use of a small learning rate combined with a large number of trees, under which the model effectively behaves as an ensemble of weak learners, thereby improving robustness and reducing the risk of overfitting.
Random Forest (RF) is an ensemble learning method that constructs multiple decision trees through bootstrap sampling and random feature selection [
42]. The final prediction is generated by aggregating the outputs from individual trees. RF has demonstrated strong robustness against overfitting and good performance in high-dimensional and nonlinear problems due to its ensemble and randomization mechanisms [
43,
44,
45].
Table 6 presents the relationship between the
MAPE and the number of trees
K in the RF model, with the maximum tree depth fixed at 16. The
MAPE initially decreases with the increases in the number of trees, indicating improved model stability, and then slightly increases after reaching an optimal value, with the minimum error occurring at 60 trees. This behavior suggests that an excessive number of trees may introduce redundancy without further improving predictive performance.
Table 7 presents the effect of tree depth
Dmax on the prediction accuracy when the
K is fixed at 80. It can be observed that the
MAPE increases as the tree depth increases, and gradually converges when
Dmax reaches 8, achieving a minimum
MAPE of 29.69%. This trend indicates that overly complex trees may lead to overfitting, while a moderate depth provides a better balance between model complexity and generalization ability.
Support Vector Regression (SVR) is a regression variant of Support Vector Machines (SVM). It has demonstrated strong capability in handling nonlinear relationships, particularly in small-sample and high-dimensional problems, and has been widely applied in energy-related prediction tasks [
46,
47,
48]. SVR maps input data into a high-dimensional feature space using kernel functions (in this study, the radial basis function (RBF) kernel is adopted) and constructs an optimal hyperplane for regression [
49]. The model is formulated by minimizing a regularized risk function that balances model flatness and training error, which can be expressed as:
For SVR, the regularization parameter
C controls the trade-off between model complexity and training error, while the kernel coefficient
γ of the RBF kernel determines the influence range of individual training samples [
5]. In addition, the parameter
ε defines the width of the insensitive loss region.
Table 8 presents the relationship between the
MAPE of the SVR model and the penalty coefficient
C when the kernel coefficient
γ = 0.1. The
MAPE is observed to initially decrease and reaches its minimum when
C = 100, indicating an optimal balance between underfitting and overfitting.
Table 9 shows the effect of the kernel coefficient
γ on model performance. The results indicate that the
MAPE increases with increasing
γ, suggesting that overly localized kernel functions may reduce generalization capability. When
C = 100 and
γ = 0.1, the model achieves the best performance with a minimum
MAPE of 7.81%.
Gaussian Process Regression (GPR) has demonstrated strong capability in load and fatigue prediction in OWT due to its probabilistic nature and ability to quantify prediction uncertainty [
50]. It is a non-parametric Bayesian approach that models the relationship between input variables and output responses as a Gaussian process defined by a mean function and a covariance (kernel) function [
51]. The prediction at a new point follows a Gaussian distribution, providing both the mean estimate and the associated uncertainty, which is particularly advantageous for engineering applications involving stochastic environmental conditions.
In this study, a composite kernel function consisting of a constant kernel of RBF and a white noise term is adopted to capture the global trend, nonlinear variations, and noise in the data, respectively, which can be defined as:
For the GPR model, the kernel hyperparameters, including the length-scale , are automatically optimized during training via maximum likelihood estimation. Consequently, the influence of initial parameter selection on model performance is relatively limited, and the model tends to converge toward an optimal configuration. In this study, the GPR model exhibits stable predictive performance, with the MAPE remaining approximately 7.51%.
A comparison of the predictive accuracy of the four surrogate models is shown in
Figure 11. It can be observed that the GPR model achieves the lowest
MAPE (7.51%), indicating the highest prediction accuracy among all models. The SVR model shows comparable performance, with a slightly higher error of 7.82%. In contrast, the RF and XGBoost models exhibit relatively larger prediction errors, suggesting a lower capability in capturing the underlying nonlinear relationships of fatigue damage.
In addition,
Figure 12 and
Figure 13 present the
MAPE and RE values of the four models at 180 and 300 sample points, respectively. It can be observed that the
MAPE of the XGBoost and RF models decreases with increasing sample size, converging to 10.53 and 10.29 at 300 sample points, respectively, indicating their suitability for prediction when sufficient sample points are available. In contrast, the errors of the SVR and GPR models do not decrease with increasing sample size; instead, they achieve the highest accuracy at 60 sample points. This suggests that their representation capacity is limited by the kernel’s ability to capture complex interactions, and more flexible kernels might be needed for larger datasets. In conclusion, GPR is the preferred choice when the available simulation budget is very limited and interpretable uncertainty is desired. For scenarios where large amounts of data can be generated, XGBoost or RF may eventually outperform GPR.
The evaluation employs the root mean square error (
RMSE) and the correlation coefficient (
R) to further evaluate the predictive performance, which are defined as:
RMSE is used to measure the difference between predicted values and observed values;
R serves as a measure of the model’s goodness of fit; a value approaching 1 reflects higher predictive accuracy. The result illustrated in
Figure 14 and
Figure 15 for the testing dataset based on 45 training samples demonstrates that the GPR and SVR models consistently achieve lower
RMSE values and higher correlation coefficients compared to the RF and XGBoost models, indicating superior fitting accuracy and generalization capability.
In contrast, larger prediction errors are observed for the tree-based models, particularly in cases involving complex nonlinear responses. In addition to point-wise error metrics, the total error of short-term fatigue damage
RE for the 15 testing cases is evaluated in
Figure 11. which is defined as the absolute percentage error between the cumulative fatigue damage predicted by the surrogate models and that obtained from FEM:
The study reveals that the RE values are much lower than the MAPE values, as overestimations in some samples and underestimations in others cancel each other out, yielding a small RE. Among the four models, the GPR model achieves the lowest relative error of 0.32%.
Figure 16 compares the predicted and true fatigue damage using the GPR model, with 95% prediction intervals. Most points lie within these intervals, confirming reliable uncertainty quantification. The interval width reflects model confidence: narrower intervals indicate sufficient training data, while wider ones signal greater uncertainty. This highlights GPR’s ability to capture both prediction accuracy and associated uncertainty.
Compared with the 2520 load cases required by the IEC 61400-3-1 standard [
32], only 60 representative environmental conditions need to be simulated in this study, while the remaining cases can be efficiently estimated using surrogate models. The computational cost of surrogate prediction is negligible compared to FEM-based transient simulations.
4.3. Fatigue Damage Behavior of OWT Foundation Under Representative Sea States
To further assess the variation law of fatigue damage in OWT foundation, a representative offshore site is selected, in which the joint probability distribution of wind and wave parameters is summarized in
Table 10 [
52]. Based on this distribution, five representative sea states (Sea State Classes 2–6) are defined as Cases 1–5, corresponding to the most probable operating conditions of the wind turbine.
The GPR model is employed to estimate the short-term fatigue damage of the monopile foundation over a 600 s simulation period. As illustrated in
Figure 18, the fatigue damage exhibits a pronounced circumferential variation, with maximum values occurring at the 0° and 180° positions. This is attributed to the fact that these directions are aligned with the primary bending plane of the structure, where the fore-aft bending moments reach their maximum and dominate fatigue damage accumulation. In contrast, the 90° and 270° positions are primarily subjected to weaker lateral loading, resulting in comparatively lower fatigue damage. Furthermore, for a given circumferential position, fatigue damage increases consistently with wind speed across different sea states, indicating the dominant role of aerodynamic loading in fatigue accumulation.
In conventional fatigue design of OWT, the amplification effects due to simultaneous wind and wave loads are often simplified or overlooked. To quantify this effect, fatigue damage is evaluated under three loading scenarios: wind-only, wave-only, and combined wind–wave loading. Two commonly used superposition methods, linear superposition and the square root of the sum of squares (
SRSS), are adopted for comparison and defined as:
where
D1 and
D2 denote the damage induced by wind and wave loads, respectively, when applied independently.
The GPR model is used to estimate short-term fatigue damage under the most probable sea state (Case 3) over a duration of 600 s. As shown in
Figure 19, wind-induced fatigue damage
D(
wind) significantly exceeds that caused by wave loading
D(
wave) at all circumferential positions. However, when comparing with fully coupled wind–wave simulations, both linear and
SRSS approaches are found to systematically underestimate the total damage. This discrepancy highlights the strong nonlinear interaction between aerodynamic and hydrodynamic loads, which cannot be accurately captured by simple superposition assumptions.
The impact of wind–wave directional misalignment is explored under Case 3 by changing the wave direction and keeping the wind direction unchanged. The resulting fatigue damage distributions at different circumferential positions are shown in
Figure 20. It is observed that the overall damage patterns exhibit directional symmetry, with similar fatigue responses occurring at angle pairs such as (0°, 180°), (90°, 270°), and (45°, 225°). This symmetry arises from the structural geometry and the dominant loading direction of the wind. Since wind-induced loading governs the fatigue response and is fixed along the global X-axis, the variation in wave direction has a relatively limited influence on the global damage distribution pattern.
For a given wind–wave angle, the relative distribution of fatigue damage along the circumference remains consistent, with higher damage occurring at positions more closely aligned with the wind direction. Moreover, when examining a fixed circumferential position under varying wave directions, the fatigue damage is found to depend on the relative alignment between the wave propagation direction and the structural orientation. Specifically, smaller angles between the loading direction and the structural position lead to increased stress response and higher fatigue damage.
These results demonstrate that although wave direction has a secondary effect compared to wind loading, it can still influence local fatigue distribution through directional interaction. Therefore, accurate fatigue assessment of OWT foundations requires consideration of both wind–wave coupling and directional misalignment effects.