1. Introduction
Container ships sailing at sea are exposed to irregular motions characterized by nonlinearity and non-stationarity, induced by extreme weather, wind, waves, and currents, which significantly increase navigational risks [
1]. For example, in late October 1998, a fully loaded post-Panamax C11 container ship “APL CHINA” sailed eastward from Kaohsiung to Seattle, encountered a severe storm in the North Pacific and experienced pronounced parametric roll, which caused extensive cargo collapse and severe economic losses. To mitigate such risks, The International Maritime Organization (IMO) promulgated the second-generation intact stability (SGIS) standards [
2] in 2020, which encompass the primary stability failure modes in waves: parametric roll (PR), pure loss of stability (PL), excessive acceleration (EA), dead ship condition and surf-riding. These standards introduced safety requirements and numerical approaches for the evaluation of dynamic stability in waves: once the loading condition is determined, stability may be assessed either by hydrodynamics-based simplified vulnerability criteria [
3,
4], or by higher-fidelity time-domain numerical simulations [
5,
6,
7,
8].
Viscous-flow computational fluid dynamics (CFD), founded on the continuity equation and the Navier–Stokes equations, is capable of resolving viscous effects on flow structures and velocity distributions with high accuracy, thereby demonstrating significant advantages in simulating complex ship-flow fields. For instance, Liu et al. employed the URANS method to realize, for the first time, full-scale high-accuracy CFD simulations of parametric roll, effectively avoiding model-scale effects [
9]; Huang et al. used CFD to evaluate seakeeping performance in bi-directional beam seas and regular waves [
10]; and Kim et al. conducted CFD-based investigations into ship maneuverability under different current conditions [
11]. These studies collectively confirm the validity and methodological rigor of CFD.
Nevertheless, in engineering applications, relying solely on CFD or other high-fidelity numerical/experimental methods for large-scale condition assessment still presents three major bottlenecks.
- (1)
High computational cost: mesh resolution and free-surface capturing sharply increase computational demand; even under O(102)-core parallel conditions, a single case often requires days to weeks, making rapid iteration and batch evaluations impractical.
- (2)
Modeling and solution complexity: six-degree-of-freedom coupling and turbulence–free-surface interactions substantially increase algorithmic complexity, imposing stringent requirements on hardware and operator expertise.
- (3)
Coverage limitations: phenomena such as parametric roll are highly sensitive to loading parameters such as draft, trim, and vertical center of gravity (KG), while operational loading patterns exhibit high-dimensional, discrete distributions; thus, existing frameworks cannot exhaustively cover all cases, constraining comprehensive design-stage verification and timely operational-stage risk assessment.
Even with potential flow theory, although computational speed improves, the approach remains inadequate for large-scale coverage and near-real-time decision-making, and is still highly labor-intensive.
As the maritime industry progresses toward intelligent and unmanned shipping, safety assessment and route planning require automated, rapid, and robust capabilities. In particular, the strong nonlinearity and high sensitivity of ship dynamic stability to loading, operational, and environmental conditions lead to high-dimensional, large-scale, and complex data structures, where simplified methods such as tabulated interpolation cannot capture the essential dynamics [
7]. This motivates the exploration of data-driven surrogate models that can provide rapid approximations to high-fidelity solutions while maintaining physical consistency.
To meet this demand, existing studies can broadly be categorized into two directions. The first line of research is short-term time-series-oriented, focusing on motion extrapolation from continuous observations and capturing long-range dependencies. For example, Zhang et al. developed a hybrid model that integrated a CNN, an MRNN and IADPSO to predict the ship pitch, roll, and heave motions. The CNN is used to extract the spatial features from the ship data of the time series, and then the MRNN is used to extract the temporal features of the new time series, and finally, the IADPSO algorithm is used to optimize the hyper-parameters of the CNN-MRNN model for accurate prediction [
12]. Li et al. uses a sequence-to-sequence approach for short-term prediction of ship roll motion using a Bidirectional Long Short-Term Memory Network (Bi-LSTM) with teacher forcing [
13]. Li et al. proposed a method combining phase space reconstruction theory with an improved Elman neural network to predict ship roll motion, demonstrating higher accuracy compared to using the Elman network alone [
14]. Li et al. proposed a ship roll motion prediction model based on an NARX neural network with wind speed and direction as external inputs, which demonstrated higher accuracy compared to a BP neural network in experimental validations [
15]. Peña et al. uses two and three hidden layer structure ANN models to realize the time series prediction of the ship’s transverse rocking motion, and through experiments, it is concluded that the accurate prediction can be realized 40 s ahead of time in simple cases and 10 s ahead of time in complex cases [
16]. Fu et al. developed a variable-weight combination model integrating ConvLSTM and XGBoost, which adaptively fuses spatiotemporal features and decision tree predictions via k-NN, achieving superior accuracy in ship motion forecasting compared to individual models [
17]. The second line of research, consistent with the data form used in this paper, does not rely on raw time series but instead employs statistical indicators or database-style features as modeling inputs to improve efficiency and generalization. For example, Xu et al. computed added inertia and damping in real time and used cubic-spline interpolation to build a hydrodynamic database, thereby accelerating heel-response assessment [
18]. Bassam et al. compared different ANN architectures and concluded that a two-hidden-layer network with 100 neurons per layer performed best for ship speed prediction [
19] and Nie et al. combined EMD with SVR, decomposing non-stationary sequences into statistical components and conducting short-term motion prediction, effectively improving accuracy [
20]. These works indicate that when data are organized as statistical indicators or database entries, non-sequential regression and surrogate approaches can achieve reliable accuracy while significantly reducing computational cost.
Building on this foundation, this paper proposes a GRNN-based surrogate model for ship dynamic stability across large condition spaces spanning loading and sea states. To further address the heterogeneity of conditions, K-means clustering is incorporated into the modeling framework. The model is trained on datasets generated by the industry-recognized CCS COMPASS stability module through hydrodynamic numerical simulations, with part of the load condition data reserved as a test set for performance validation. In particular, the surrogate framework is designed to capture three representative failure modes of ship dynamic stability—EA, PL, and PR. The results demonstrate that the proposed method not only preserves trend agreement with numerical results but also substantially improves predictive accuracy, offering an effective solution for rapid stability assessment under complex conditions and providing methodological support for future intelligent maritime safety assurance. In addition, the framework achieves significantly higher computational efficiency than conventional numerical simulations, reducing the time cost from days or weeks to seconds or minutes, which further underscores its practical applicability.
2. A Ship Dynamic Stability Performance Surrogate Model Based on GRNN
2.1. The GRNN Ship Dynamic Stability Performance Surrogate Model
GRNN is a forward propagation radial basis function neural network [
21,
22]. The GRNN has three advantages: (1) based on the radial basis function, it has good nonlinear approximation performance; (2) it does not need backpropagation to find the model parameters, and the convergence speed is fast; and (3) it needs only a fraction of the training samples a backpropagation neural network would need.
According to the principle of GRNN, the ship stability GRNN fast prediction algorithm model can be expressed as follows:
where
Y represents the output result, which is the result of ship dynamic stability performance, expressed as
y1,
y2, and
y3 respectively correspond to the roll amplitude in the ship dynamic stability performance results, lateral acceleration amplitude parallel to the deck direction, and the GZ curves in waves across different wave steepness and encounter headings.
X represents the input of the GRNN model, expressed as
X1, X2..., X10 respectively correspond to the displacement, longitudinal position of the center of gravity, vertical position of the center of gravity, roll moment of inertia, the midship section coefficient, incident wave direction, irregular wave significant wave height, irregular wave average zero-cross period, ship speed, and equivalent regular wave steepness in the typical stowage scheme of the ship.
The ship dynamic stability performance surrogate model based on the GRNN neural network is shown in
Figure 1. It consists of four layers, namely the input layer, pattern layer, summation layer, and output layer.
(1) Input layer. Receive data from learning samples and the number of neurons is equal to the dimension of the input vector, that is, X1, X2…, X10.
(2) Pattern layer. Calculate the value of the Gaussian function for each sample in the training and test samples. The number of neurons in the pattern layer equals the number of training samples. The transfer function of the neurons in the pattern layer is
In the formula, X is the network input vector, Xi is the training sample vector corresponding to the input neuron, and σ is the hyperparameter of the GRNN.
(3) Summation layer. Two types of summing neurons are utilized for summing. One type performs an arithmetic summation of the outputs from all neurons in the pattern layer, where the connection weights between the pattern layer and these neurons are set to 1, and the transfer function is
The other type of summation calculates the weighted summation of all pattern layer neurons, and weighting coefficient
is the
j-th element of the label of the training sample corresponding to the
j-th pattern layer node, and the transfer function is
(4) Output layer. The output layer corresponds to the ship’s dynamic stability performance, such as the GZ curve value in the wave in the PL mode, the acceleration value in the EA mode, and the roll amplitude in the PR mode. These values have the ratio of the two types of node values in the summation layer, namely
A GRNN can be regarded as a kernel regression model, in which each training sample defines a radial basis function. The smoothing parameter governs the bias–variance trade-off. Unlike conventional neural networks, training does not require backpropagation; it only involves storing the samples. The inference complexity is O(N), where N is the number of training samples. Such complexity is acceptable for offline pre-computation or after prototype selection. These characteristics make the GRNN an appealing surrogate modeling approach when both accuracy and simplicity are prioritized.
2.2. The Algorithm Flow of the GRNN Ship Dynamic Stability Performance Surrogate Model
The algorithm of the GRNN ship dynamic stability performance forecasting process is shown in
Figure 2. And the pseudocode of the algorithm is presented below (Algorithm 1).
Algorithm 1: GRNN model |
Input: |
The original data matrix: |
The Euclidean distance: ED |
|
The smoothing parameter: |
The training data: |
The test data: |
Output: |
GRNN model with determined |
1 | Normalize D. |
2 | using K-fold cross-validation |
3 | Repeat |
4 |
For
do |
5 | Compute |
6 |
End for |
7 | Compute |
8 | Compute |
9 | based on the value of RMSE and R2, |
10 | Until that minimizes the RMSE and maximizes R2 |
2.3. Verification for the GRNN Ship Dynamic Stability Performance Surrogate Model
To verify the validity of the model, a large container ship is selected. The main dimensions of the ship are shown in
Table 1, and its ship type is shown in
Figure 3. The dynamic performance characteristic data set (the parameters are shown in
Table 2), and the numerical model results of the dynamic stability performance of the ship are directly calculated by the IMO second-generation intact stability criterion calculation function in the calculation software stability module of the COMPASS project developed by the CCS [
23,
24]. The software has undergone validation through numerical simulations and model testing, and has been utilized by the Chinese delegation for their research in developing the second-generation intact stability criteria in collaboration with the IMO.
The verification of the validity of the GRNN model takes the following steps:
- (1)
Randomly select the data from the data set in
Table 2 to train the GRNN ship dynamic stability performance model to generate the GRNN forecast model. The details are shown in
Table 3.
- (2)
Randomly select one working condition from the remaining working conditions in the data set for prediction.
- (3)
Compare and analyze the accuracy of the prediction results and the calculation results of the numerical model.
Table 3.
Model Training/Testing Parameters.
Table 3.
Model Training/Testing Parameters.
Ship Dynamic Stability Model | Number of Training Data (Piece)/Working Conditions | Number of Testing Data (Piece)/Working Conditions |
---|
EA | 13,158/44 | 306/1 |
PL | 20,000/21 | 1000/1 |
PR | 111,384/29 | 3978/1 |
The working conditions in
Table 2 were randomly split into training and testing sets;
Table 3 summarizes the split.
To accurately measure the prediction accuracy of the prediction model, this paper uses the root mean square error (RMSE) and the coefficient of determination (R
2) to compare the prediction results. It is defined as follows.
where
T represents the number of output data;
is the true value of ship dynamic stability;
is the predicted value of ship dynamic stability;
is the average value of
.
Table 4 reports the prediction accuracy of the GRNN dynamic stability surrogate model for eight randomly selected working conditions. The results demonstrate good agreement with the numerical simulation outcomes, with the average R
2 values reaching 0.996, 0.998, and 0.999 for EA, PL, and PR, respectively. This indicates that the GRNN surrogate can effectively approximate the numerical model. In terms of accuracy, the method achieves high precision for EA and PL, with root mean square errors of about 0.02 and 0.04, respectively. However, there is a notable deviation in the prediction of PR, where the maximum root mean square error reaches 0.2 and the average value is around 0.08, indicating that further refinement is required to ensure reliability in practical applications.
Complementing the accuracy analysis,
Table 5 presents the training and inference times of the GRNN surrogate model for the three stability failure modes across eight randomly selected working conditions. Experiments were performed on a workstation with Intel Core i7-14700HX CPU, 48 GB RAM and NVIDIA GeForce RTX 4070 GPU. The results show that the average one-time training costs for EA, PL, and PR are 3.371, 38.162, and 629.117 s, respectively. The corresponding average inference times per condition are 0.336, 0.852, and 2.245 s. Compared with the median runtime of the COMPASS solver under comparable conditions, the GRNN surrogate achieves several orders of magnitude reduction in computational cost. These findings highlight that, once trained, the surrogate model can rapidly evaluate new loading and sea state conditions with negligible additional burden, thereby providing strong support for large-scale and near real-time stability assessment.
It should be emphasized that the input features used in this study already explicitly incorporate parameters closely related to the ship’s geometrical characteristics, such as displacement, longitudinal and vertical centers of gravity (LCG, VCG), roll moment of inertia (RMI), and midship section coefficient. These quantities are determined by the hull geometry and appendage configuration and play essential roles in dynamic stability behavior. For example, displacement and center-of-gravity positions directly influence buoyancy distribution and trim, while the roll moment of inertia strongly affects the amplitude and period of roll response. Therefore, the dataset effectively captures the geometric attributes of the vessel through its feature representation.
3. An Improved Ship Dynamic Stability Performance Surrogate Model Based on K-Means and GRNN Hybrid Neural Network
The previous experiments indicate that, except for the large mean square error of the PR prediction results, the results of the GRNN ship dynamic stability performance surrogate model are comparable to those based on hydrodynamics under the same type of loading conditions, navigation, and environmental conditions of the ship in waves. The calculation results of the method numerical simulation have a good fit.
However, during actual ship operations, loading, navigation, and environmental conditions vary continuously, and the hydrodynamic characteristics of the same vessel change accordingly.
Figure 4 shows the distribution of the center-of-gravity coordinates across the 49 working conditions considered in the experiments, where variations in the center of gravity is closely related to buoyancy and thus affects the trim. It also illustrates that the loading conditions—particularly displacement and center-of-gravity position—exhibit clear clustering patterns.
The GRNN is a kernel-based model in which each training sample corresponds to a radial basis function. Consequently, the predictive behavior of GRNN is highly sensitive to the spatial distribution of training samples: inference is essentially performed as a normalized weighted average over all stored patterns, and the relative contribution of each sample depends on its kernel similarity to the query point. When the input space exhibits heterogeneous sub-regions, a single global GRNN may struggle to capture cross-cluster nonlinearities.
In the context of ship dynamic stability, the observed clustered patterns in loading conditions (
Figure 4) can be interpreted as arising from underlying physical characteristics such as displacement, center of gravity, and roll inertia. These physical drivers induce a natural clustering structure in the stability responses.
Hypothesis 1. Clustering the input space and training cluster-specific GRNNs is expected to significantly improve prediction accuracy compared with a single global GRNN.
To verify the proposed hypothesis, K-means clustering is employed to partition the input feature space into K relatively homogeneous subsets, followed by the independent training of GRNN models within each subset, as illustrated in
Figure 5. Prior to model comparison, candidate values of K are preliminarily examined, and clustering quality is assessed using the within-cluster sum of squared errors (SSE). The optimal number of clusters is then determined according to the elbow criterion, while the specific analysis of the SSE–K relationship is presented in the experimental section.
The integration of clustering into the surrogate modeling framework is motivated by the fact that classification of training data according to intrinsic characteristics may improve the accuracy of machine learning algorithms. Therefore, a GRNN ship dynamic stability surrogate model hybridized with the K-means method is proposed to enhance the basic GRNN framework. The K-means algorithm is one of the most widely adopted clustering techniques due to its favorable trade-off between accuracy and computational simplicity. Its fundamental principle is to iteratively assign each sample to the nearest cluster center, update the centroid of each cluster as the mean of its assigned samples, and repeat this process until convergence. With advantages such as strong clustering performance, ease of implementation, and computational efficiency, K-means is chosen to classify the ship loading data, effectively partitioning the training samples by loading parameters. Once the training data are classified, independent GRNNs are trained within each cluster under a unified protocol (cluster-wise cross-validation) for feature usage and parameter tuning. A global GRNN trained on all samples is used as a baseline. Finally, predictive performance is evaluated on a held-out test set, with RMSE and R
2 reported for EA, PL, and PR. The algorithmic workflow is summarized in
Figure 5.
5. Conclusions
This study explores the application of a neural network approach to develop a surrogate model for assessing ship dynamic stability, aimed at effectively addressing the limitations in efficiency and implementation cost associated with the traditional CFD-based numerical simulation method. In this paper, the method of using a GRNN to construct a surrogate model of ship dynamic stability is studied. The experimental results show that this method can be used to replace the traditional numerical calculation model. In dynamic changes, the accuracy of the model directly trained by the GRNN method will be affected by the training data. It is not enough to use the coefficient of determination (R
2) as the basis for measuring the quality of the model. Sometimes the coefficient of determination between the prediction result and the numerical calculation result is even above 0.9, but there are still deviations between the actual prediction results and the numerical results (
Figure 7a and
Figure 8a) and even a large deviation (
Figure 9a). Therefore, it cannot be directly used as a surrogate model for numerical calculation models.
For this reason, this paper considers the influence of stowage parameters on the dynamic stability of the ship during its actual operation. Based on the GRNN prediction model, the K-means algorithm is used to cluster the input sample data by constructing a hybrid neural network. In this framework, the clustering step does not merely reorder the input data; instead, each cluster of samples is used to train an independent GRNN model, ensuring that the learned relationships are specialized to homogeneous subsets of loading conditions. Through empirical analysis and comparison, the ship dynamic stability performance prediction model can control the variance within 0.05, which basically agrees with the numerical simulation results based on the hydrodynamic method at 98.9%.
In addition, the results on computational cost demonstrate that both GRNN and the K-means + GRNN framework provide significant time efficiency compared with CFD-based simulations, with training and inference times reduced from days or weeks to seconds or minutes. This confirms that the proposed surrogate modeling approach not only improves predictive accuracy through clustering but also retains the computational efficiency necessary for practical engineering applications.