Ship Model Identification Using Interpretable 4-DOF Maneuverability Models for River Combat Boat

Contreras Montes, Juan; Lovo Ayala, Aldo; Ospino-Balcázar, Daniela; Velasquez Gutierrez, Kevin; Soto Montaño, Carlos; Soto-Diaz, Roosvel; Jiménez-Cabas, Javier; Oñate López, José; Escorcia-Gutierrez, José

doi:10.3390/computation13120296

Open AccessArticle

Ship Model Identification Using Interpretable 4-DOF Maneuverability Models for River Combat Boat

by

Juan Contreras Montes

¹

,

Aldo Lovo Ayala

^1,2

,

Daniela Ospino-Balcázar

²

,

Kevin Velasquez Gutierrez

²

,

Carlos Soto Montaño

^3,4

,

Roosvel Soto-Diaz

^5,*

,

Javier Jiménez-Cabas

⁶

,

José Oñate López

⁴

and

José Escorcia-Gutierrez

^6,*

¹

Escuela Naval de Cadetes “Almirante Padilla”, Armada Nacional de Colombia, Cartagena 131001, Colombia

²

Centro de Desarrollo Tecnológico Naval (CEDNAV), Armada Nacional de Colombia, Cartagena 131001, Colombia

³

Research Center, Corporación Universitaria Reformada, Barranquilla 080016, Colombia

⁴

G5ID Research Group, ONIRIS ID SAS, Barranquilla 080016, Colombia

⁵

Biomedical Engineering Program, Universidad Simón Bolívar, Barranquilla 080002, Colombia

⁶

Department of Computational Science and Electronics, Universidad de la Costa, CUC, Barranquilla 080020, Colombia

^*

Authors to whom correspondence should be addressed.

Computation 2025, 13(12), 296; https://doi.org/10.3390/computation13120296

Submission received: 24 February 2025 / Revised: 8 December 2025 / Accepted: 9 December 2025 / Published: 18 December 2025

(This article belongs to the Section Computational Engineering)

Download

Browse Figures

Versions Notes

Abstract

Ship maneuverability models are typically defined by three degrees of freedom: surge, sway, and yaw. However, patrol vessels operating in riverine environments often exhibit significant roll motion during course changes, necessitating the inclusion of this dynamic. This study develops interpretable machine learning models capable of predicting vessel behavior in four degrees of freedom (4-DoF): surge, sway, yaw, and roll. A dataset of 125 h of simulated maneuvers was employed, including 29 h of out-of-distribution (OOD) conditions to test model generalization. Four models were implemented and compared over a 15-step prediction horizon: linear regression, third-order polynomial regression, a state-space model obtained via the N4SID algorithm, and an AutoRegressive model with eXogenous inputs (ARX). Results demonstrate that all models captured the essential vessel dynamics, with the state-space model achieving the best overall performance (e.g., NMSE = 0.0246 for surge velocity on test data and 0.0499 under OOD conditions). Variable-wise, surge and sway showed the lowest errors, roll rate remained stable, and yaw rate was the most sensitive to distribution shifts. Model-wise, the ARX model achieved the lowest NMSE for surge prediction (0.0149), while regression-based models provided interpretable yet less accurate alternatives. Multi-horizon evaluation (1-, 5-, 15-, and 30-step) under OOD conditions confirmed a consistent monotonic degradation across models. These findings validate the feasibility of using interpretable machine learning models for predictive control, autonomous navigation, and combat scenario simulation in riverine operations.

Keywords:

ship manoeuvring; system identification; machine learning; interpretability; linear regression model; polynomial regression model

1. Introduction

Accurate prediction and control of the trajectory, speed, and acceleration of a ship are crucial for effective maneuvering, especially in the context of the design of autonomous ship navigation systems and collision avoidance strategies [1]. In the literature, there is a wide range of maneuvering models, generally classified into hydrodynamic and mathematical response models. Hydrodynamic models, such as the Abkowitz model [2] and the Mathematical Ship Maneuvering Group (MMG) models [3], focus on capturing the physical dynamics of the vessel. In contrast, response models directly relate the motion of a ship to the actions of steering, with the first- and second-order models of Nomoto [4,5] and the nonlinear Nomoto model [6] being popular choices due to their simplicity, which are often used for the control of the direction of the ship using PID controllers, as they bypass the need to calculate hydrodynamic derivatives.

Hydrodynamic models are typically constructed with three or four degrees of freedom (4-DoF) [7,8,9]. Although some studies suggest the superiority of 4-DoF structures over 1-DoF models [10], this is not a strict rule. Although hydrodynamic models offer high precision, they are complex because of the large number of parameters and nonlinearities involved. Consequently, various methods for parameter estimation, including theoretical calculations (white-box models), captive model testing, and Computational Fluid Dynamics (CFD) [11], have been explored. System identification, in particular, has gained attention with advances in artificial intelligence and autonomous learning algorithms [12,13].

This paper aims to develop two multistep prediction models using system identification: a linear regression model, a third-order polynomial regression model, a state-space model, and an ARX model. The models will be evaluated over a prediction horizon of

h = 15

, with the Normalized Mean Squared Error (NMSE) serving as the primary error metric.

This paper is structured into several key sections. The Section 2 highlights the study’s advances, including the integration of roll motion and the development of two interpretable models. Section 3 reviews existing research on system identification and hybrid modeling approaches. In Section 5, the dataset is described, detailing its division and the modeling techniques applied. Section 6 presents the results of the developed models, including responses and error metrics. Finally, Section 7 summarizes the effectiveness of models in predicting vessel dynamics.

2. Contributions

By incorporating roll motion into traditionally 3-DoF models, research improves understanding of vessel dynamics. Two interpretable models are developed, evaluated, and validated through error metrics and residual analysis, demonstrating their robustness and accuracy in predicting vessel behavior in real-world applications. These contributions offer valuable insights for improving predictive control systems and autonomous navigation.

Unlike traditional 3-DoF approaches, this work incorporates roll dynamics into a 4-DoF framework, enabling a more accurate and realistic representation of patrol vessel behavior during sharp course changes.
Linear regression, third-order polynomial regression, state-space (via N4SID), and ARX models were implemented and systematically compared. This provides a spectrum of approaches ranging from highly interpretable to more data-driven, offering practical options for different control applications.
All models successfully captured the dynamics of the vessel over a 15-step horizon. The state-space model consistently delivered the lowest NMSE and strongest generalization across both validation and out-of-distribution datasets, while the ARX model excelled at forecasting specific variables under complex dynamic conditions.
Although regression-based models showed slightly higher errors, they remain valuable due to their simplicity and transparency, making them suitable for real-time implementation in predictive controllers. In contrast, the state-space and ARX models offer stronger predictive accuracy, especially under unseen operating conditions.
By evaluating the models against 29 h of maneuvers outside the training distribution, the study demonstrates resilience to variations in propulsion and environmental conditions, supporting future deployment in real-world autonomous navigation and combat training scenarios.

3. Related Works

Identification of a system involves developing mathematical models from experimental input-output data to replicate its dynamic behavior. This process includes several stages: optimal experimental design, data preprocessing, selection of the model structure, parameter estimation, and model validation. A key method for generating maneuvering coefficients is the Planar Motion Mechanism (PMM) test [14]. For example, ref. [15] employed a modified regression model using Least Squares Support Vector Machines (LS-SVM) to derive hydrodynamic derivatives for an Abkowitz-type model, incorporating wavelet threshold denoising to filter noise. Model validation included standard tests such as the

20 / 10

zigzag,

15 / 15

zigzag, and 35 turning-circle tests. Similarly, ref. [16] applied system identification to create a 4-DoF maneuvering model for a surface combatant in intact and damaged conditions.

Theoretical models, often called white-box models, provide interpretability by explicitly describing the relationships between variables, but they require a deep understanding of the underlying physics. In contrast, black-box models, such as Artificial Neural Networks (ANNs) and Support Vector Machines (SVMs) [17], offer high accuracy but lack interpretability. Hybrid models attempt to bridge this gap by combining white-box and black-box methods to improve accuracy while retaining some interpretability. For example, the KINN model [18] uses a Deep Neural Network (DNN) to correct the residual error of a theoretical model.

Several examples of hybrid and gray-box models exist in the literature. In [19], an interpretable model based on parameters from PMM tests was developed to predict the maneuvering capabilities of ships with azimuth thrusters. Another example is the gray-box model by [20], which used Ordinary Least Squares (OLS) to estimate hydrodynamic derivatives, with Extended Kalman Filters (EKF) and Rauch-Tung-Striebel (RTS) smoothers to address noise. Multiple models, including linear and modified Abkowitz models, were considered for selection.

Additionally, ref. [21] derived parametric and non-parametric gray-box models and black-box models to predict ship dynamics. These models used a combination of past surge speed, sway speed, yaw, roll, and rudder angle as input to predict future dynamics. Other works, such as [22], have focused on one-step and multistep forecast models for ship dynamics, using prediction horizons ranging from 1 to 60 time steps.

Mathematical models capable of predicting future values play a critical role in the design of predictive controllers. However, models derived solely from simulations can lead to low-precision predictions and unstable controller behavior, underscoring the importance of using real-time experimental data and accounting for external disturbances [23].

4. Methodology

The methodological framework proposed in this research, presented in Figure 1, integrates simulation-based data generation, systematic preprocessing, and machine learning-driven system identification to predict the maneuverability of a 4-DoF river combat patrol vessel. The process begins with the execution of high-fidelity simulation tests that incorporate both ship parameters (propeller shaft speed, propeller azimuth angles) and environmental factors (wind speed and direction, represented through attack angles). These simulations produced 125 h of time-series data, of which 29 h corresponded to out-of-distribution (OOD) scenarios, enabling evaluation of model generalization.

The generated data were then processed, stored, and cleaned and organized, including the identification of missing or duplicated values, normalization of variable ranges, and division into training, testing, validation, and OOD subsets. The exploratory data analysis was then performed to characterize the correlations among the variables and to assess the dynamics of the ship under routine and OOD conditions. This stage laid the foundation for selecting appropriate modeling strategies and highlighted the influence of propulsion and environmental inputs on the vessel’s dynamic response.

Four predictive models were implemented to capture vessel dynamics: (i) Linear regression model; (ii) Third-order polynomial regression model; (iii) AutoRegressive model with eXogenous inputs (ARX), and (iv) State-space model identified through the N4SID algorithm. These models were chosen to provide a spectrum of interpretability and predictive capacity, ranging from simple baseline predictors to more complex dynamic formulations capable of reconstructing internal vessel states. Training and testing of the models were carried out on the prepared datasets, with a prediction horizon fixed at 15 steps ahead.

Model validation was performed using the NMSE as the primary evaluation metric. A comparative analysis was conducted across the training, validation, and OOD datasets, supplemented by time-series plots of predicted versus measured trajectories. This allowed both quantitative and qualitative assessment of model accuracy and robustness. Results confirmed that while regression-based models offered interpretability and computational efficiency, the ARX model excelled in predicting autoregressive dynamics such as surge velocity, and the state-space model achieved the strongest overall generalization across both routine and OOD scenarios.

Finally, the methodological proposal emphasizes the scalability of the developed framework. By integrating system identification with interpretable machine learning, the approach provides a reproducible basis for modeling vessel maneuverability, which can be extended to other motion platforms with varying degrees of freedom. Moreover, the combination of rigorous preprocessing, structured validation, and interpretable modeling ensures applicability not only to predictive control and autonomous navigation, but also to experimental testbeds and combat training simulations in complex riverine environments.

5. Materials and Methods

The methodological framework of this study was designed to integrate data-driven modeling techniques with the principles of system identification to capture the maneuverability characteristics of a river patrol vessel under realistic operating conditions. This section describes the dataset employed, including its composition, preprocessing, and division into training, validation, and OOD subsets. Then it outlines the modeling strategies implemented, namely linear regression, polynomial regression, state-space representation, and ARX models with AR inputs. Each method was selected to provide a balance between interpretability and predictive accuracy, enabling a rigorous evaluation of their capacity to forecast vessel dynamics across multiple degrees of freedom. The subsequent subsections present the dataset structure, the modeling approaches, and the validation procedures used to assess model performance.

5.1. Dataset

The dataset consists of 125 h of simulation data on the motion of a patrol boat designed by Pérez et al. [24] under various sea states. The samples were taken at a rate of 1 Hz, performing random maneuvers in 4-DoF, i.e., surge, sway, yaw, roll. The original boat has been expanded by incorporating two symmetrically placed rudder propellers and wind force simulations following the Isherwood model [25]. Wind-induced waves were generated using the JONSWAP spectrum [26].

It is important to emphasize that the dataset used in this study consists of randomly generated maneuvering sequences rather than standardized International Maritime Organization-IMO (compliant tests such as turning-circle, zigzag, or spiral maneuvers. Because canonical maneuverability indices—such as turning diameter, tactical diameter, advance, transfer, or yaw response times) require specific and repeatable control protocols, these metrics cannot be extracted from the available dataset. The focus of the present work is therefore on evaluating multistep dynamic prediction accuracy under varying propulsion and environmental conditions, rather than on computing classical maneuverability performance indicators.

The dataset is divided into two parts: one corresponds to the routine operations dataset, totaling 96 h, and the other to an OOD dataset, totaling 29 h. The routine operations dataset is divided into three groups with the following percentages: 60-10-30, corresponding to training, test, and validation data. The OOD dataset is only used for testing purposes.

The dataset is used to obtain models that predict several steps ahead using machine learning methods. The following inputs or predictor variables are considered for model generation: speeds of the axes of both propeller helices (n), azimuth angles of the propellers (

δ

: left and right), wind speed (

V_{w}

), and angle of attack (

α_{x}

,

α_{y}

). The output variables include the surge speed (u), the sway speed (v), angular velocity of roll (p), yaw rate (r), and the roll angle (

ϕ

) [27].

It is important to note that the dataset employed in this study consists exclusively of high-fidelity simulated maneuvers. This choice allows controlled exploration of 4-DoF dynamics under a wide range of propulsion and environmental disturbances that are difficult to replicate systematically in field trials. Simulation-based evaluation is therefore used as a foundational step before transitioning to model identification using real ship trials. The use of simulated data therefore serves as an essential foundational stage, allowing the comparative evaluation of interpretable identification models before transitioning to validation with real ship maneuvering experiments.

The correlation matrix between the variables (training data) is shown in Figure 2.

Initially, we examined the distributions of input variables across the training, test, validation, and OOD datasets, as shown in Figure 3.

The value of the variable n in the OOD dataset deviated from the range and median observed in the other datasets (training, test, and validation), representing the primary source of shift in the OOD distribution. A quantitative analysis revealed that shaft speed in the training set spans from 226 to 1612 rpm, whereas the OOD dataset reaches 2233 rpm, with 47.9% of its samples lying outside this interval. In contrast, the ranges of

V_{w}

,

δ_{l}

,

δ_{r}

,

α_{x}

, and

α_{y}

remain almost entirely within the training support, with less than 1% of samples falling outside their respective ranges.

5.2. Modeling

Figure 4 shows the six degrees of freedom of the vessel. The data set used to train the models was collected from random maneuvers in 4 degrees of freedom (surge, sway, roll rate, yaw rate, and roll angle).

Based on the dataset mentioned above, this study obtains four models to predict the surge speed (u), the sway speed (v), roll rate (p), the yaw rate (r), and the roll angle (

ϕ

) variables; using as input variables the speeds of the propeller shaft speed (n), azimuth angles of the propellers (

δ_{l}

and

δ_{r}

), wind speed (

V_{w}

) and the wind angles of attack (

α_{x}

and

α_{y}

). Figure 5 shows the input-output relations.

It should be noted that all experiments in this study were conducted in an offline computational environment using the simulated 4-DoF maneuvering dataset. Real-time execution and hardware-in-the-loop evaluation were beyond the scope of the present work, as the objective here was to establish a controlled comparison of interpretable identification models under routine and OOD operating conditions. Nevertheless, the computational structure of all four models—linear and polynomial regression, ARX, and state-space—is lightweight and compatible with embedded implementation, since each relies on closed-form matrix operations with low execution overhead. These characteristics make them suitable candidates for real-time testing in future hardware-oriented validation stages.

6. Results

This section presents the outcomes of the four predictive models developed for the 4-DoF maneuverability analysis of the patrol vessel. The performance of linear, polynomial, state-space, and ARX models is evaluated using NMSE as the primary metric across training, validation, and OOD datasets. To account for statistical significance, bootstrap resampling with 10 runs was employed to estimate the mean and standard deviation of the NMSE residuals for each dataset. In addition to these quantitative error measures, graphical comparisons of predicted and observed time series are provided to illustrate the models’ ability to capture vessel dynamics under varying operating conditions. The results highlight not only the relative accuracy of each modeling approach but also their robustness and generalization capabilities, which are critical for real-world predictive control and autonomous navigation applications.

The evaluation metrics employed in this study (i.e., NMSE, bootstrap-based variability estimates, and graphical time-series comparisons) align with the primary objective of assessing predictive accuracy and generalization under varying propulsion and environmental inputs. Since the dataset consists of non-standardized maneuvering sequences produced by random excitation, it does not include the structured control protocols required to compute domain-specific maneuverability indices such as turning diameter, tactical diameter, advance, transfer, or yaw response time. For this reason, trajectory-level dynamic prediction metrics were selected as the most appropriate and consistent tools for comparing the four model classes evaluated in this work.

6.1. Linear Regression Model

To establish the linear regression model, the intercept values for each predicted variable were first obtained, as shown in Equation (1). Subsequently, the coefficients associated with the input variables were estimated, and their values are reported in Table 1.

Intercepts = [1.4386 \times 10^{- 1} 7.3234 \times 10^{- 2} 5.2791 \times 10^{- 6} - 3.3553 \times 10^{- 3} 9.1151 \times 10^{- 4}]

(1)

These coefficients, together with the intercepts, define the full mathematical formulation of the linear model expressed in Equation (2), which relates propulsion and environmental input to surge velocity (u), sway velocity (v), roll rate (p), yaw rate (r), and the roll angle (

ϕ

) of the vessel over a 15-step prediction horizon. To assess predictive accuracy, the NMSE was calculated according to Equation (3), which allows for a quantitative evaluation of the performance of the model in the training, validation, and OOD datasets.

[\begin{matrix} u (i + 15) \\ v (i + 15) \\ p (i + 15) \\ r (i + 15) \\ ϕ (i + 15) \end{matrix}] = [\begin{matrix} 0.1439 \\ 0.0732 \\ 0.0000 \\ - 0.0034 \\ 0.0009 \end{matrix}] + [\begin{matrix} 0.0053 & 0.2487 & 0.0104 & - 0.0143 & 0.0507 & 0.0072 \\ - 0.0000 & 0.7750 & 0.8384 & - 0.0011 & - 0.0105 & 0.3964 \\ - 0.0000 & 0.0003 & 0.0004 & 0.0000 & - 0.0000 & - 0.0000 \\ 0.0000 & - 0.0309 & - 0.0336 & 0.0001 & 0.0005 & - 0.0024 \\ - 0.0000 & 0.0119 & 0.0111 & - 0.0004 & - 0.0014 & 0.0410 \end{matrix}] [\begin{matrix} n (i) \\ δ_{l} (i) \\ δ_{r} (i) \\ V_{w} (i) \\ α_{x} (i) \\ α_{y} (i) \end{matrix}]

(2)

The linear regression model obtained a determination coefficient of

R^{2} = 0.605

, indicating that approximately 60% of the variance in the vessel’s dynamic response is explained by the selected input variables. Although this value reflects only moderate predictive ability, it shows the model’s ability to capture the main trends of system dynamics with a relatively simple structure. To provide a more rigorous performance assessment, the NMSE was calculated for each output variable using Equation (3). The resulting NMSE values for all output variables across the training, validation, and OOD datasets are summarized in Table 2, which enables a consistent comparison of predictive accuracy and highlights the model’s ability to generalize beyond the conditions used for parameter estimation. Within the broader research framework, these results highlight the relevance of interpretable models as baseline predictors. Although less precise than nonlinear structures, linear regression offers transparency and computational efficiency. These properties make it suitable for rapid prototyping and real-time monitoring.

N M S E = \frac{1}{max (y) - min (y)} \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}

(3)

where

y_{i}

is the observed value ith,

{\hat{y}}_{i}

is the predicted value corresponding, and n is the number of observations. Figure 6 presents the approximation graphs for each variable in the test and OOD datasets.

Figure 6 illustrates the comparison between the observed and predicted vessel dynamics obtained with the linear regression model for both the test dataset (Figure 6a) and the OOD dataset (Figure 6b). In the test dataset, the predicted trajectories closely follow the measured responses for u, v, p, r, and

ϕ

, indicating an adequate fit under in-distribution conditions.

The residuals obtained with the linear regression model are reported in Table 3, which shows their mean and standard deviation derived from the prediction errors computed independently for each dataset. This data further confirms this behavior, with mean errors remaining very close to zero across all datasets. This finding confirms that the linear approximation does not introduce any relevant systematic bias in the analyzed outputs. In particular, for

u (t)

,

v (t)

,

p (t)

,

r (t)

, and

ϕ (t)

, the mean residuals remain on the order of

10^{- 6}

to

10^{- 3}

in the training, testing, and validation sets, demonstrating satisfactory statistical consistency of the estimator.

Regarding error dispersion, the standard deviation values within the in-distribution datasets (Train, Test, and Validation) remain within narrow ranges, indicating reproducible predictions with low variability under nominal operating conditions. By contrast, evaluation on the out-of-distribution (OOD) dataset reveals a clear increase in variance, most notably for

u (t)

and

v (t)

, reflecting reduced accuracy under extreme maneuvering scenarios. This aligns with the limitations of linear models, where generalization falters amid dominant nonlinear hydrodynamic effects.

6.2. Polynomial Regression Model

The third-degree polynomial regression model was developed to extend the linear formulation by incorporating nonlinear interactions between inputs and outputs, thereby improving predictive accuracy.

Figure 7 further illustrates the performance of the polynomial model for both the test dataset (Figure 7a) and the OOD dataset (Figure 7b). In the test dataset, the predicted trajectories align more closely with the measured responses than those from the linear model, capturing both amplitude and phase variations more accurately. Under OOD conditions, larger deviations are observed, especially in v and r, reflecting the increased difficulty in generalizing polynomial models beyond the training distribution. Nevertheless, the predictions preserve the overall temporal structure of the vessel dynamics, indicating that polynomial regression provides a valuable compromise between enhanced accuracy and interpretability within the broader modeling framework.

As reported in Table 4, this approach achieved lower NMSE values than the linear model in most state variables, particularly for u and p, with test errors of 0.0295 and 0.0363, respectively, demonstrating the benefit of including nonlinear interactions.

The residuals obtained with the polynomial regression model are reported in Table 5, which shows their mean and standard deviation derived from the prediction errors computed independently for each dataset. This data indicates that the model accurately reproduces the mean behavior of all output variables in the in-distribution datasets, as the mean residuals remain very close to zero in the training, testing, and validation sets. However, in the out-of-distribution (OOD) dataset, noticeable biases emerge, particularly for

u (t)

and

v (t)

, where mean residuals reach orders of

10^{- 2}

to

10^{- 1}

. These values are consistent with the previously reported performance metrics. For

u (t)

,

v (t)

,

p (t)

,

r (t)

, and

ϕ (t)

, the mean residuals remain on the order of

10^{- 6}

to

10^{- 3}

for the training, testing, and validation sets, confirming the absence of relevant systematic bias under nominal conditions.

Regarding error dispersion, the standard deviation values within the in-distribution datasets are moderate and comparable among them, indicating a stable predictive behavior under nominal operating conditions. In the out-of-distribution (OOD) dataset, a noticeable increase in variance is observed for all outputs, particularly for

u (t)

and

v (t)

, reflecting an increased sensitivity under extreme maneuvering scenarios. Nevertheless, the growth of uncertainty remains bounded and is lower than that observed for the strictly linear model, highlighting the greater representational capability of the polynomial approximation.

6.3. AutoRegressive Model with Exogenous Inputs

The ARX model is a well-established data-driven technique in system identification, widely used for time-series forecasting and control design. Unlike purely regression-based approaches, the ARX framework explicitly incorporates temporal dependencies by linking the present value of each output variable not only to contemporaneous inputs but also to its past values. This recursive structure enables the model to capture short-term memory effects and dynamic correlations between successive state of the vessel, making it particularly suitable for predicting ship maneuverability under evolving conditions. In the context of this study, the ARX formulation was applied to all five output variables (i.e., u, v, p, r, and

ϕ

) using propulsion and wind-related variables as exogenous inputs. Parameter estimation was performed via least-squares optimization, ensuring computational efficiency while preserving the model’s interpretability.

Equations set (4)–(8) present the detailed ARX structures derived for each output variable, where the autoregressive polynomials

A (z)

and the input-dependent polynomials

B (z)

capture both the inherent dynamics of the system and the effect of external forcing terms. Specifically, u, v, p, r, and

ϕ

in Equations (6)–(10), respectively. Together, these formulations illustrate how the recursive integration of past outputs with exogenous propulsion and wind inputs provides a flexible, yet interpretable framework for multistep prediction of ship maneuverability in 4-DoF.

A (z) u (t) = - A_{i} (z) u_{i} (t) + B (z) w (t)

(4)

where,

\begin{matrix} A (z) & = 1 - 1.587 z^{- 1} + 0.6324 z^{- 2} & B_{1} (z) & = - 0.0002776 z^{- 1} + 0.0005198 z^{- 2} \\ A_{2} (z) & = 1.249 z^{- 1} - 0.9771 z^{- 2} & B_{2} (z) & = 0.01252 z^{- 1} + 0.03409 z^{- 2} \\ A_{3} (z) & = - 1.803 z^{- 1} + 1.345 z^{- 2} & B_{3} (z) & = 0.02626 z^{- 1} + 0.01185 z^{- 2} \\ A_{4} (z) & = - 2.17 z^{- 1} + 6.981 z^{- 2} & B_{4} (z) & = 4.496 \times 10^{- 5} z^{- 1} + 0.001074 z^{- 2} \\ A_{5} (z) & = - 0.3084 z^{- 1} - 1.527 z^{- 2} & B_{5} (z) & = - 0.0213 z^{- 1} + 0.02525 z^{- 2} \\ B_{6} (z) & = - 0.052 z^{- 1} + 0.07131 z^{- 2} \end{matrix}

A (z) v (t) = - A_{i} (z) v_{i} (t) + B (z) h (t)

(5)

where,

\begin{matrix} A (z) & = 1 - 0.5949 z^{- 1} - 0.1277 z^{- 2} & B_{1} (z) & = - 0.0003987 z^{- 1} + 0.000608 z^{- 2} \\ A_{1} (z) & = 0.2569 z^{- 1} - 0.2173 z^{- 2} & B_{2} (z) & = 0.009658 z^{- 1} + 0.036 z^{- 2} \\ A_{3} (z) & = - 1.918 z^{- 1} + 1.472 z^{- 2} & B_{3} (z) & = 0.04115 z^{- 1} - 0.00218 z^{- 2} \\ A_{4} (z) & = - 0.6734 z^{- 1} + 5.584 z^{- 2} & B_{4} (z) & = 0.0002333 z^{- 1} + 0.0009707 z^{- 2} \\ A_{5} (z) & = - 0.5761 z^{- 1} - 1.292 z^{- 2} & B_{5} (z) & = - 0.02411 z^{- 1} + 0.02766 z^{- 2} \\ B_{6} (z) & = - 0.04824 z^{- 1} + 0.06801 z^{- 2} \end{matrix}

A (z) p (t) = - A_{i} (z) p_{i} (t) + B (z) h (t)

(6)

where,

\begin{matrix} A (z) & = 1 - 1.282 z^{- 1} + 0.4066 z^{- 2} & B_{1} (z) & = - 2.133 \times 10^{- 7} z^{- 1} + 2.171 \times 10^{- 6} z^{- 2} \\ A_{1} (z) & = 0.004553 z^{- 1} - 0.00417 z^{- 2} & B_{2} (z) & = - 0.01693 z^{- 1} + 0.01796 z^{- 2} \\ A_{2} (z) & = - 0.005217 z^{- 1} + 0.003012 z^{- 2} & B_{3} (z) & = - 0.01649 z^{- 1} + 0.01746 z^{- 2} \\ A_{4} (z) & = - 0.3249 z^{- 1} + 0.2655 z^{- 2} & B_{4} (z) & = - 4.933 \times 10^{- 6} z^{- 1} - 1.064 \times 10^{- 5} z^{- 2} \\ A_{5} (z) & = 0.6938 z^{- 1} - 0.6597 z^{- 2} & B_{5} (z) & = 0.000293 z^{- 1} - 0.0003505 z^{- 2} \\ B_{6} (z) & = 0.008251 z^{- 1} - 0.007512 z^{- 2} \end{matrix}

A (z) r (t) = - A_{i} (z) r_{i} (t) + B (z) h (t)

(7)

where,

\begin{matrix} A (z) & = 1 - 0.9703 z^{- 1} + 0.007307 z^{- 2} & B_{1} (z) & = - 1.551 \times 10^{- 6} z^{- 1} + 5.051 \times 10^{- 7} z^{- 2} \\ A_{1} (z) & = - 0.005273 z^{- 1} + 0.005065 z^{- 2} & B_{2} (z) & = - 0.01665 z^{- 1} + 0.01568 z^{- 2} \\ A_{2} (z) & = 0.005533 z^{- 1} - 0.004846 z^{- 2} & B_{3} (z) & = - 0.0168 z^{- 1} + 0.01586 z^{- 2} \\ A_{3} (z) & = 0.008095 z^{- 1} + 0.02207 z^{- 2} & B_{4} (z) & = - 1.041 \times 10^{- 5} z^{- 1} + 1.279 \times 10^{- 5} z^{- 2} \\ A_{5} (z) & = - 0.03695 z^{- 1} + 0.03044 z^{- 2} & B_{5} (z) & = 8.925 \times 10^{- 6} z^{- 1} - 1.287 \times 10^{- 5} z^{- 2} \\ B_{6} (z) & = 0.0005932 z^{- 1} - 0.0007016 z^{- 2} \end{matrix}

A (z) ϕ (t) = - A_{i} (z) ϕ_{i} (t) + B (z) h (t)

(8)

where,

\begin{matrix} A (z) & = 1 - 0.5345 z^{- 1} - 0.448 z^{- 2} & B_{1} (z) & = - 7.66 \times 10^{- 7} z^{- 1} + 1.712 \times 10^{- 6} z^{- 2} \\ A_{1} (z) & = 0.001836 z^{- 1} - 0.001651 z^{- 2} & B_{2} (z) & = - 0.007623 z^{- 1} + 0.008068 z^{- 2} \\ A_{2} (z) & = - 0.002185 z^{- 1} + 0.001034 z^{- 2} & B_{3} (z) & = - 0.007554 z^{- 1} + 0.007959 z^{- 2} \\ A_{3} (z) & = - 1.29 z^{- 1} + 0.1862 z^{- 2} & B_{4} (z) & = - 7.721 \times 10^{- 6} z^{- 1} + 1.565 \times 10^{- 5} z^{- 2} \\ A_{4} (z) & = - 0.1579 z^{- 1} + 0.1286 z^{- 2} & B_{5} (z) & = 0.0001594 z^{- 1} - 0.0001876 z^{- 2} \\ B_{6} (z) & = 0.003702 z^{- 1} - 0.003327 z^{- 2} \end{matrix}

The results, summarized in Table 6, indicate that the ARX model achieved robust predictive performance across both the test and OOD datasets. For the test data, the NMSE values were consistently low, with u reaching 0.0149, the best score among all models considered. This highlights the ability of the ARX structure to exploit temporal correlations for improved short-term forecasting. In Figure 8a, the predicted trajectories closely match the measured responses for all variables, confirming the adequacy of the recursive formulation in capturing the dynamics of the vessel during standard maneuvers. Under OOD conditions (Figure 8b), deviations become more noticeable, particularly in v and r, but the model continues to reproduce the principal oscillatory patterns of the motion of the ship, demonstrating resilience to unseen operating scenarios.

The residual statistics of the ARX model, presented in Table 6, exhibit moderate biases and relatively large variances that remain fairly consistent across all data partitions. Such behavior is common in ARX formulations when applied to multivariable systems with significant dynamic coupling, since the model structure does not explicitly encode cross-variable interactions or nonlinear effects. The surge and sway velocities,

u (t)

and

v (t)

, show mean residuals between approximately

0.1

and

0.25

, while the yaw-rate

r (t)

displays a small negative bias; however, these tendencies remain bounded and do not grow in the OOD dataset, indicating stable prediction behavior. The roll rate

p (t)

achieves residual means close to zero, although its variance is still considerable, which is expected for fast angular dynamics under an ARX structure. Overall, the results suggest that while the ARX model captures some aspects of the vessel dynamics, its linear autoregressive structure limits its ability to fully represent the USV’s multivariate, strongly coupled behavior. Such limitations become more apparent when examining the model response in conditions not represented during training. As shown in Figure 8b, although overall trends remain well aligned with the observed dynamics, discrepancies are more pronounced in v and r, where phase lags and amplitude mismatches appear.

From a broader perspective, the results confirm that ARX benefits substantially from its recursive structure, which leverages past states for more accurate short-term forecasting. This advantage is evident in the prediction of surge velocity, where the ARX model achieved the lowest NMSE (0.0149 on test data). However, for variables more strongly influenced by nonlinear hydrodynamic effects, such as r and v, the ARX model exhibits lower precision than the state-space formulation. Despite this, its performance under OOD conditions demonstrates robustness and adaptability, highlighting ARX as a valuable intermediary between simple regression approaches and more complex, higher-dimensional models.

6.4. State-Space Model

The state-space approach offers a mathematically rigorous framework for representing dynamic systems, making it particularly suitable for modeling vessel maneuverability. Unlike regression-based methods, which approximate direct input–output relationships, state-space models reconstruct the system’s internal dynamics using latent state variables. This property is essential when the objective is not only to predict trajectories but also to recreate the underlying dynamic structure of the vessel, thereby enabling scalability to more complex experiments and extended operating scenarios. In this study, the state-space model was identified using the Numerical Subspace System Identification (N4SID) algorithm, which directly derives the state, input, and output matrices (A, B, and C) from the input-output dataset. This methodology ensures that the estimated model closely corresponds to the physics of maneuvering while remaining computationally tractable for predictive applications.

The state of a dynamical system is the smallest set of variables (called state variables) such that knowledge of these variables at

k = k_{0}

, together with knowledge of the input to

k \geq k_{0}

, completely determines the behavior of the system at any

k \geq k_{0}

. Note that the concept of state is not limited to physical systems. It applies to biological, economic, social, and other systems. State-space models use state variables to describe a system using a set of first-order differential Equation (9).

\begin{matrix} x (k + 1) & = A x (k) + B w (k) \\ y (k) & = C x (k) \end{matrix}

(9)

where

x (k)

is the state vector,

w (k)

is the input vector,

y (k)

is the output vector, A is the state matrix, B is the input matrix and C is the output matrix.

A state-space model was obtained using the N4SID algorithm to identify a dynamic system from input-output data [28]. In this case, the obtained state-space model is related in Equation (10).

\begin{matrix} x (k) & = {[\begin{matrix} x_{1} (k) & x_{2} (k) & x_{3} (k) & x_{4} (k) & x_{5} (k) \end{matrix}]}^{T} \\ w (k) & = {[\begin{matrix} n (k) & δ_{l} (k) & δ_{r} (k) & V_{w} (k) & α_{x} (k) & α_{y} (k) \end{matrix}]}^{T} \\ y (k) & = {[\begin{matrix} u (k) & v (k) & p (k) & r (k) & ϕ (k) \end{matrix}]}^{T} \\ A & = [\begin{matrix} 0.9799 & 0.001582 & - 0.0006478 & - 0.0178 & 0.01094 \\ 0.001685 & 0.948 & 0.03944 & - 0.08841 & 0.1862 \\ - 0.003069 & - 0.03623 & 0.9389 & - 0.07913 & - 0.3625 \\ - 0.0153 & - 0.06697 & 0.08193 & 0.8289 & 0.4616 \\ - 0.02451 & - 0.2038 & - 0.02072 & - 0.4083 & 0.6428 \end{matrix}] \\ B & = [\begin{matrix} - 1.084 \times 10^{- 6} & 2.394 \times 10^{- 5} & 0.0001019 & 4.112 \times 10^{- 6} & - 2.463 \times 10^{- 5} & 3.014 \times 10^{- 5} \\ 4.989 \times 10^{- 7} & 7.599 \times 10^{- 6} & - 3.063 \times 10^{- 5} & - 2.627 \times 10^{- 6} & 8.326 \times 10^{- 6} & - 1.093 \times 10^{- 5} \\ - 2.439 \times 10^{- 6} & - 0.000336 & - 0.0001733 & - 6.749 \times 10^{- 6} & - 9.073 \times 10^{- 5} & 6.839 \times 10^{- 5} \\ - 1.949 \times 10^{- 7} & - 0.0001334 & - 0.0001252 & 1.063 \times 10^{- 5} & 1.712 \times 10^{- 5} & - 0.0001021 \\ - 5.242 \times 10^{- 6} & - 0.0004006 & - 7.218 \times 10^{- 5} & - 2.821 \times 10^{- 5} & - 0.0001773 & 0.0001525 \end{matrix}] \\ C & = [\begin{matrix} - 79.62 & 54.99 & - 46.12 & - 45.28 & 12.71 \\ 70.42 & 43.48 & - 41.78 & - 45.31 & 11.69 \\ - 0.09965 & - 0.5377 & 0.4486 & - 1.034 & 2.289 \\ - 1.082 & - 0.2697 & 2.4 & - 0.0448 & - 0.3768 \\ 5.174 & 7.904 & 0.4291 & - 0.3328 & 1.287 \end{matrix}] \end{matrix}

(10)

Therefore, the identified model consists of four state variables (

n = 4

), six input variables (

q = 6

), and five output variables (

p = 5

). It is important to note that the state variables

x (k)

are abstract representations of the underlying dynamics and cannot be directly measured or quantified. Instead, they serve as latent constructs that govern the system’s evolution through their interaction with the input and output matrices. A structural and dynamic analysis of the identified state-space model was performed to verify its suitability for control-oriented applications. The eigenvalue spectrum of the matrix A (

λ_{1, 2} = 0.6911 \pm j 0.4846

,

λ_{3} = 0.9949

,

λ_{4} = 0.9883

,

λ_{5} = 0.9730

), confirming that the discrete-time system is asymptotically stable. The controllability and observability matrices reached full rank (

rank [C] = 5

,

rank [O] = 5

), demonstrating that all vessel states can be influenced by propulsion inputs and reconstructed from measured outputs. These results confirm that the identified model is dynamically consistent and well-posed for control design, closed-loop simulations, and autonomous navigation tasks.

While the stability, controllability, and observability properties confirm that the identified state-space representation is structurally suitable for control-oriented applications, its performance has thus far been assessed only in a simulated environment. Validation with real ship maneuvering data is therefore essential to determine how the model behaves under measurement noise, environmental variability, and unmodeled hydrodynamic effects. This step will be addressed in future work as part of the transition from simulation-based system identification to operational deployment.

Figure 9 presents a comparison between the measured vessel responses and those predicted by the state-space model, illustrating the ability of the formulation to reconstruct and forecast the dynamic behavior of the ship.

Table 7 highlight the strong predictive performance of the state-space formulation. For u, v, and

ϕ

, the NMSE values remained consistently low in the test, validation, and out-of-distribution datasets, demonstrating both accuracy and generalization. Notably, the state-space model achieved the best overall balance among the four methods tested, with NMSE values as low as 0.0246 for u under test conditions and 0.0499 under OOD conditions. The time series comparisons further show that the model accurately reproduces both the amplitude and phase of vessel responses, underscoring its robustness in capturing essential oscillatory and transient behaviors.

In addition, the residual statistics summarized in Table 8 show that the state-space model produces unbiased errors across all datasets, with residual means consistently close to zero for the five output variables. The standard deviations remain small across the Train, Test, and Validation partitions, indicating that the model accurately captures the distribution of the training data. The increase in residual variance observed in the OOD dataset is expected, since these trajectories contain maneuvering conditions not present during identification. Notably, the angular variables

p (t)

,

r (t)

, and

ϕ (t)

exhibit extremely small residual means (

10^{- 3}

–

10^{- 4}

) and low dispersion, confirming that the short-term rotational dynamics are well represented by the identified model. Overall, the results indicate that the state-space representation is unbiased and stable, and that its prediction errors can be interpreted as noise rather than structural model deficiencies.

Beyond predictive accuracy, the state-space formulation provides a flexible structure that can be adapted to different experimental frameworks. By modifying the input and output matrices, the model can be extended to incorporate additional environmental effects, alternative propulsion systems, or sensor configurations. This scalability makes the state-space model a valuable foundation for future experimentation, allowing researchers to simulate a variety of maneuvering conditions and to design advanced predictive controllers grounded in realistic vessel dynamics.

6.5. Temporal Stability Analysis

To evaluate temporal stability, the models were assessed using four prediction horizons: 1-step, 5-step, 15-step, and 30-step. For each horizon h, the regression models and the state-space model were evaluated against shifted targets

y (t + h)

, without modifying the internal structure of the models (Table 9).

Across all models, the multi-step results reveal a consistent degradation in accuracy as the prediction horizon increases, which is expected in dynamical systems where errors propagate through time. Both linear and polynomial regression exhibit a gradual increase in NMSE for u, v, r, and

ϕ

, while maintaining an almost constant error for p, indicating that roll-rate dynamics are simpler and less prone to temporal drift. In contrast, the state-space model demonstrates the best short-horizon performance (1–5 steps), particularly for u and

ϕ

, but its advantage diminishes at longer horizons, where all models converge toward similar error magnitudes. Notably, the NMSE at the 15-step horizon remains within the monotonic trend of intermediate degradation observed between 5 and 30 steps, confirming that

h = 15

constitutes a representative mid-range horizon where temporal stability can be evaluated without reaching the saturation error region observed at 30 steps.

6.6. Statistical Significance

Statistical confidence measures of the performance metrics were employed to quantify the variability and statistical reliability of the NMSE across all datasets used in this work. For this purpose, a bootstrap-based analysis with 100 iterations was conducted.

In each bootstrap iteration, a new model was trained using a resampled version of the training data, while the NMSE was computed independently for the four evaluation datasets. Table 10 and Table 11 report the mean ± one standard deviation of the NMSE over the 100 iterations for all modeling approaches considered in this work.

This bootstrap procedure serves two main purposes:

To quantify the sensitivity of each model to sampling variability, that is, the stability of its parameters within the training distribution.
To reveal the growth of epistemic uncertainty during extrapolation toward previously unseen extreme operating conditions (distribution shift).

Table 10 and Table 11 present the results corresponding to the regression-based models. These models exhibit extremely small standard deviations across all datasets (on the order of

10^{- 5}

–

10^{- 7}

), indicating that both the linear and polynomial models display a highly stable behavior under resampling of the training set. This result suggests that the models are not overly sensitive to fluctuations in the training data and that their predictive performance remains consistent across repeated training instances.

7. Conclusions and Future Research Lines

This study developed and evaluated four interpretable models—linear regression, third-order polynomial regression, state-space, and ARX—to predict the maneuverability of a river patrol vessel in 4-DoF: u, v,

ϕ

, and r. By incorporating roll dynamics into the modeling framework, the research addressed a limitation of conventional 3-DoF approaches and provided a more comprehensive representation of vessel behavior under realistic operating conditions.

Comparative analysis demonstrated that all models successfully captured the main dynamics of the vessel, achieving low NMSE values across the training, validation, and OOD datasets. The state-space model yielded the best overall performance, with NMSE as low as 0.0246 for u under test conditions and 0.0499 for the OOD scenarios, confirming its strong generalizability. The ARX model showed robust performance, achieving the lowest NMSE for u in the test dataset (0.0149) and maintaining stability under OOD conditions, although its accuracy decreased for dynamics v and r. Regression-based models, while less precise, provided transparent formulations with acceptable error levels (e.g., NMSE < 0.10 across most variables), which makes them suitable for real-time monitoring and control prototyping.

From a scientific perspective, these findings confirm the value of interpretable system identification approaches in ship maneuverability. Regression models offer computational efficiency and simplicity, the ARX model highlights the advantages of temporal dependence for forecasting, and the state-space formulation provides a scalable framework for reconstructing internal dynamics and extending the model to more complex experimental conditions. The inclusion of OOD testing further underscores the robustness of the proposed methodologies, demonstrating their applicability in scenarios beyond the training distribution, which is critical for real-world deployment.

A principal limitation of this study is that all models were developed and evaluated exclusively using simulated data generated from a validated 4-DoF maneuvering environment. Although this approach facilitates controlled experimentation, ensures reproducibility, and allows systematic assessment of model performance under routine and extreme (OOD) operating conditions, it does not capture sensor noise, environmental uncertainty, unmodeled hydrodynamics, or operational factors present in real ship trials. As such, empirical validation with full-scale maneuvering data is a necessary next step to assess the robustness and practical reliability of the identified models. Future work will therefore focus on deploying the state-space and ARX formulations on an operational vessel to evaluate their behavior under real-world disturbances and measurement imperfections, enabling refinement and calibration of the models toward field-ready performance.

A second limitation of the present study is that the computational performance of the identified models was not evaluated on a real hardware platform. Although the mathematical formulations used—particularly the ARX and state-space models—are well suited for embedded execution due to their low computational complexity, the manuscript focuses exclusively on offline prediction. As a result, execution times, memory requirements, and real-time stability on embedded marine controllers were not assessed. Future work will therefore include hardware-in-the-loop experiments and benchmarking on representative embedded systems (e.g., ARM-based processors or industrial marine microcomputers) to ensure that the models meet real-time constraints required for onboard navigation and control applications.

A further limitation concerns the absence of maneuverability indices commonly used in naval architecture, such as turning diameter, tactical diameter, advance, transfer, or yaw response time—to assess ship handling performance. These metrics require structured, repeatable control protocols (e.g., IMO-standard turning-circle or zigzag maneuvers), which are not present in the randomly generated excitation sequences that compose the current simulation dataset. For this reason, such indices could not be computed in the present study. Future work will incorporate simulated and full-scale trials specifically designed to reproduce standardized maneuvers, enabling a comprehensive comparison between model predictions and accepted hydrodynamic performance criteria.

In general, this work establishes a foundation for integrating interpretable machine learning with classical system identification to design predictive controllers, autonomous navigation strategies, and combat training simulations in riverine environments. Future research may extend the state-space formulation to hybrid or nonlinear structures, integrate additional environmental disturbances such as currents and shallow-water effects, and validate the models using full-scale experimental data to further enhance their operational reliability.

Future research lines derived from this work may follow several directions. First, extending the current framework to hybrid or nonlinear state-space models would enable integrating hydrodynamic theory with data-driven corrections, thereby improving accuracy while preserving interpretability in complex maneuvering conditions. Second, incorporating additional environmental disturbances, such as river currents, shallow-water effects, and varying payloads, would increase the robustness and realism of the models, enabling more reliable predictions across diverse operating scenarios. Finally, validation through full-scale experimental trials with patrol vessels is essential to confirm the applicability of the proposed models in real-world conditions, providing empirical evidence to refine predictive controllers and autonomous navigation strategies.

Another promising line of research is the generalization of the developed models to motion platforms with varying DoF. By adapting the state-space and ARX formulations, the methodology can be scaled beyond the 4-DoF representation of the patrol vessel to higher-DoF systems, such as 5-DoF or 6-DoF marine vehicles, or even to terrestrial and aerial platforms. This scalability would allow the framework to serve as a unifying approach for modeling and predicting the dynamics of diverse maneuvering systems, thereby broadening its applicability across experimental testbeds, training simulators, and autonomous vehicle design in different domains.

Author Contributions

Conceptualization, J.C.M. and J.E.-G.; Methodology, J.C.M. and J.E.-G.; Software, D.O.-B., K.V.G., C.S.M. and R.S.-D.; Validation, A.L.A., C.S.M., R.S.-D., J.J.-C. and J.E.-G.; Formal analysis, J.C.M. and J.O.L.; Investigation, J.C.M., C.S.M., J.J.-C. and J.E.-G.; Data curation, D.O.-B., K.V.G., J.J.-C. and J.O.L.; Writing—original draft preparation, J.C.M., D.O.-B. and K.V.G.; Writing—review and editing, A.L.A. and J.E.-G.; Visualization, K.V.G. and J.O.L.; Supervision, J.E.-G.; Project administration, A.L.A.; Funding acquisition, A.L.A. and R.S.-D. All authors have read and agreed to the published version of the manuscript.

Funding

This work was possible thanks to financing from the Ministry of Science, Technology, and Innovation of Colombia—Minciencias, within the framework agreement for collaboration N° 877-2017 for the development project N° 1126-1022-82866. We also thank the “Almirante Padilla” Naval School—ENAP, Marine Infantry Training Center. School—EFIM, and to the International Center of Advanced Fluvial Excellence—CIEAF, for their valuable assistance in carrying out the online surveys that contributed to the development of this research.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding authors.

Conflicts of Interest

Author Carlos Soto Montaño and José Oñate López were employed by the company ONIRIS ID SAS. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Lan, J.; Zheng, M.; Chu, X.; Ding, S. Parameter Prediction of the Non-Linear Nomoto Model for Different Ship Loading Conditions Using Support Vector Regression. J. Mar. Sci. Eng. 2023, 11, 903. [Google Scholar] [CrossRef]
Abkowitz, M. Measurement of hydrodynamic characteristics from ship maneuvering trials by system identification. Trans. Soc. Nav. Archit. Mar. Eng. 1980, 88, 283–318. [Google Scholar]
Yasukawa, H.; Yoshimura, Y. Introduction of MMG standard method for ship maneuvering predictions. J. Mar. Sci. Technol. 2015, 20, 37–52. [Google Scholar] [CrossRef]
Moreno-Salinas, D.; Chaos, D.; De la Cruz, J.; Aranda, J. Identification of a Surface Marine Vessel Using LS-SVM. J. Appl. Math. 2013, 2013, 803548. [Google Scholar] [CrossRef]
Carrillo, S.; Contreras, J. Obtaining First and Second Order Nomoto Models of a Fluvial Support Patrol using Identification Techniques. Cienc. Tecnol. Buques 2018, 11, 19–28. [Google Scholar] [CrossRef]
Zhang, X.; Zhao, B.; Zhang, G. Improved parameter identification algorithm for ship model based on nonlinear innovation decorated by sigmoid function. Transp. Saf. Environ. 2021, 3, 114–122. [Google Scholar] [CrossRef]
Fukui, Y.; Yokota, H.; Yano, H.; Kondo, M.; Nakano, T.; Yoshimura, Y. 4-DOF Mathematical Model for Manoeuvring Simulation including Roll Motion. J. Jpn. Soc. Nav. Archit. Ocean Eng. 2016, 24, 167–179. [Google Scholar] [CrossRef]
Zhao, B.; Zhang, X.; Liang, C. A Novel Parameter Identification Algorithm for 3-DOF Ship Maneuvering Modelling Using Nonlinear Multi-Innovation. J. Mar. Sci. Eng. 2022, 10, 581. [Google Scholar] [CrossRef]
Yu, Q.; Yang, Y.; Geng, X.; Jiang, Y.; Li, Y.; Tang, Y. Integrating Computational Fluid Dynamics for Maneuverability Prediction in Dual Full Rotary Propulsion Ships: A 4-DOF Mathematical Model Approach. J. Mar. Sci. Eng. 2024, 12, 762. [Google Scholar] [CrossRef]
Tillig, F.; Ringsberg, J. A 4 DOF simulation model developed for fuel consumption prediction of ships at sea. Ships Offshore Struct. 2018, 14, 112–120. [Google Scholar] [CrossRef]
Zhang, C.; Liu, X.; Wan, D.; Wang, J. Experimental and numerical investigations of advancing speed effects on hydrodynamic derivatives in MMG model, Part I: Xvv, Yv, Nv. Ocean Eng. 2019, 179, 67–75. [Google Scholar] [CrossRef]
Meng, Y.; Zhang, X.; Zhu, J. Parameter identification of ship motion mathematical model based on full-scale trial data. Int. J. Nav. Archit. Ocean Eng. 2022, 14, 100437. [Google Scholar] [CrossRef]
Escorcia-Gutierrez, J.; Gamarra, M.; Beleño, K.; Soto, C.; Mansour, R.F. Intelligent deep learning-enabled autonomous small ship detection and classification model. Comput. Electr. Eng. 2022, 100, 107871. [Google Scholar] [CrossRef]
Zhu, Z.; Kim, B.S.; Wang, S.; Kim, Y. Study on numerical PMM test and its application to KCS hull. Appl. Ocean Res. 2022, 127, 103327. [Google Scholar] [CrossRef]
Hu, Y.; Song, L.; Liu, Z.; Yao, J. Identification of Ship Hydrodynamic Derivatives Based on LS-SVM with Wavelet Threshold Denoising. J. Mar. Sci. Eng. 2021, 9, 1356. [Google Scholar] [CrossRef]
Jeon, M.; Yoon, H.; Park, J.; Rhee, S.; Seo, J. Identification of 4-DoF maneuvering mathematical models for a combatant in intact and damaged conditions. Int. J. Nav. Archit. Ocean Eng. 2022, 14, 100480. [Google Scholar] [CrossRef]
Song, L.; Hao, L.; Tao, H.; Xu, C.; Guo, R.; Li, Y.; Yao, J. Research on Black-Box Modeling Prediction of USV Maneuvering Based on SSA-WLS-SVM. J. Mar. Sci. Eng. 2023, 11, 324. [Google Scholar] [CrossRef]
Chattha, N.; Siddiqui, S.; Malik, M.; Elst, L.; Dengel, A.; Ahmed, S. KINN: Incorporating Expert Knowledge in Neural Networks. arXiv 2019, arXiv:1902.05653. [Google Scholar] [CrossRef]
Wu, T.; Li, R.; Chen, Q.; Pi, G.; Wan, S.; Liu, Q. A Numerical Study on Modeling Ship Maneuvering Performance Using Twin Azimuth Thrusters. J. Mar. Sci. Eng. 2023, 11, 2167. [Google Scholar] [CrossRef]
Alexandersson, M.; Mao, W.; Ringsberg, J. System identification of Vessel Manoeuvring Models. Ocean Eng. 2022, 266, 112940. [Google Scholar] [CrossRef]
Liu, Y.; Xue, Y.; Huang, S.; Xue, G.; Jing, Q. Dynamic Model Identification of Ships and Wave Energy Converters Based on Semi-Conjugate Linear Regression and Noisy Input Gaussian Process. J. Mar. Sci. Eng. 2021, 9, 194. [Google Scholar] [CrossRef]
Baier, A.; Aspandi, D.; Staab, S. ReLiNet: Stable and Explainable Multistep Prediction with Recurrent Linear Parameter Varying Networks. In Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence (IJCAI-23), Macao, China, 19–25 August 2023; pp. 3461–3469. [Google Scholar] [CrossRef]
Miller, A. Ship Model Identification with Genetic Algorithm Tuning. Appl. Sci. 2021, 11, 5504. [Google Scholar] [CrossRef]
Baier, A.; Staab, S. A Simulated 4-DOF Ship Motion Dataset for System Identification under Environmental Disturbances; DaRUS: Sttutgart, Germany, 2022. [Google Scholar] [CrossRef]
Isherwood, R. Wind resistance of merchant ships. Trans. RINA 1973, 115, 327–338. [Google Scholar]
Hasselmann, K.; Barnett, T.P.; Bouws, E.; Carlson, H.; Cartwright, D.E.; Enke, K.; Ewing, J.; Gienapp, A.; Hasselmann, D.; Kruseman, P.; et al. Measurements of wind-wave growth and swell decay during the Joint North Sea Wave Project (JONSWAP). Ergaenzungsheft Dtsch. Hydrogr. Z. Reihe A 1973, 12, 1–95. [Google Scholar]
Frank, D.; Aspandi, D.; Muehlebach, M.; Unger, B.; Staab, S. Robust Recurrent Neural Network to Identify Ship Motion in Open Water with Performance Guarantees—Technical Report. arXiv 2022, arXiv:2212.05781. [Google Scholar]
De Moor, B.; Van Overschee, P.; Favoreel, W. Algorithms for subspace state-space system identification: An overview. In Applied and Computational Control, Signals, and Circuits: Volume 1; Springer: Berlin/Heidelberg, Germany, 1999; pp. 247–311. [Google Scholar]

Figure 1. Methodological framework for 4-DoF ship maneuverability model identification.

Figure 2. Correlation matrix between variables (training data).

Figure 3. Distribution of input data in each group: training, test, validation, and out-of-distribution (OOD).

Figure 4. Six degrees of freedom ship motions.

Figure 5. Input/Output Model.

Figure 6. Linear model validation.

Figure 7. Third-degree polynomial model validation.

Figure 8. ARX model validation.

Figure 9. State-space model validation.

Table 1. Linear model coefficients.

	$n (i)$	$δ_{l} (i)$	$δ_{r} (i)$	$V_{w} (i)$	$α_{x} (i)$	$α_{y} (i)$
$u (i + 15)$	$5.308 \times 10^{- 3}$	$2.4873 \times 10^{- 1}$	$1.0465 \times 10^{- 2}$	$- 1.4306 \times 10^{- 2}$	$5.0742 \times 10^{- 2}$	$7.1792 \times 10^{- 3}$
$v (i + 15)$	$- 7.292 \times 10^{- 5}$	$7.7497 \times 10^{- 1}$	$8.3838 \times 10^{- 1}$	$- 1.1228 \times 10^{- 3}$	$- 1.0514 \times 10^{- 2}$	$3.9640 \times 10^{- 1}$
$p (i + 15)$	$- 1.9237 \times 10^{- 8}$	$3.3852 \times 10^{- 4}$	$3.6734 \times 10^{- 4}$	$1.2846 \times 10^{- 6}$	$- 7.6219 \times 10^{- 5}$	$- 1.3922 \times 10^{- 5}$
$r (i + 15)$	$3.5651 \times 10^{- 6}$	$- 3.0931 \times 10^{- 2}$	$- 3.3648 \times 10^{- 2}$	$- 1.2977 \times 10^{- 4}$	$5.4632 \times 10^{- 4}$	$- 2.3571 \times 10^{- 3}$
$ϕ (i + 15)$	$- 1.2155 \times 10^{- 6}$	$1.1894 \times 10^{- 2}$	$1.1074 \times 10^{- 2}$	$- 4.2644 \times 10^{- 4}$	$- 1.4290 \times 10^{- 3}$	$4.0950 \times 10^{- 2}$

Table 2. NMSE linear model results.

Output	Train	Test	Validation	OOD
$u (t)$	0.0374	0.0371	0.0438	0.0559
$v (t)$	0.0639	0.0672	0.0757	0.0898
$p (t)$	0.0339	0.0269	0.0396	0.0647
$r (t)$	0.0585	0.0626	0.0634	0.1469
$ϕ (t)$	0.0722	0.0703	0.0841	0.0877

Table 3. Residuals mean ± standard deviation for the linear regression model.

Output	Train	Test	Validation	OOD
$u (t)$	$- 0.00013 \pm 0.31115$	$- 0.00655 \pm 0.28534$	$- 0.00380 \pm 0.28716$	$0.01381 \pm 0.51223$
$v (t)$	$- 0.00005 \pm 0.22920$	$0.00070 \pm 0.21353$	$0.02423 \pm 0.20393$	$- 0.05242 \pm 0.44082$
$p (t)$	$- 2 \times 10^{- 6} \pm 0.00664$	$- 0.00001 \pm 0.00605$	$0.00000 \pm 0.00639$	$0.00001 \pm 0.00797$
$r (t)$	$1 \times 10^{- 6} \pm 0.00673$	$0.00003 \pm 0.00719$	$- 0.00048 \pm 0.00691$	$0.00248 \pm 0.02049$
$ϕ (t)$	$0.00002 \pm 0.02171$	$- 0.00128 \pm 0.02116$	$0.00133 \pm 0.02147$	$- 0.00139 \pm 0.04067$

Table 4. NMSE Polynomial Model Results.

Output	Train	Test	Validation	OOD
$u (t)$	0.0310	0.0295	0.0359	0.0591
$v (t)$	0.0476	0.0477	0.0543	0.0958
$p (t)$	0.0339	0.0268	0.0395	0.0644
$r (t)$	0.0463	0.0483	0.0505	0.1554
$ϕ (t)$	0.0358	0.0363	0.0444	0.0614

Table 5. Residuals mean ± standard deviation for the polynomial regression model.

Output	Train	Test	Validation	OOD
$u (t)$	$- 0.00012 \pm 0.25881$	$- 0.00215 \pm 0.22858$	$- 0.00624 \pm 0.23347$	$- 0.16032 \pm 0.50028$
$v (t)$	$- 0.00004 \pm 0.17559$	$0.00696 \pm 0.15539$	$0.00672 \pm 0.15108$	$- 0.01530 \pm 0.48581$
$p (t)$	$- 2.69 \times 10^{- 6} \pm 0.00664$	$- 0.00001 \pm 0.00604$	$- 0.00002 \pm 0.00638$	$0.00001 \pm 0.00794$
$r (t)$	$2.01 \times 10^{- 6} \pm 0.00557$	$0.00001 \pm 0.00580$	$- 0.00020 \pm 0.00571$	$0.00105 \pm 0.02218$
$ϕ (t)$	$0.00001 \pm 0.01104$	$0.00012 \pm 0.01120$	$- 0.00001 \pm 0.01159$	$- 0.00132 \pm 0.02892$

Table 6. Residuals mean ± standard deviation for the ARX model.

Output	Train	Test	Validation	OOD
$u (t)$	$0.2506 \pm 1.0037$	$0.2540 \pm 1.0042$	$0.1873 \pm 0.9955$	$0.1156 \pm 0.9968$
$v (t)$	$0.1081 \pm 1.0045$	$0.2680 \pm 0.9933$	$0.2189 \pm 0.9953$	$0.1843 \pm 1.00245$
$p (t)$	$0.0481 \pm 0.9999$	$0.0323 \pm 0.9942$	$0.0350 \pm 1.0034$	$- 0.0009 \pm 0.9790$
$r (t)$	$- 0.3021 \pm 0.9832$	$- 0.3611 \pm 0.9936$	$- 0.2379 \pm 0.9929$	$- 0.0714 \pm 1.0000$
$ϕ (t)$	$0.2209 \pm 1.0018$	$0.1819 \pm 1.0031$	$0.2053 \pm 0.9980$	$- 0.0233 \pm 1.00548$

Table 7. NMSE state-space model results.

Output	Train	Test	Validation	OOD
$u (t)$	0.0218	0.0246	0.0225	0.0499
$v (t)$	0.0508	0.0473	0.0449	0.0812
$p (t)$	0.0843	0.0875	0.0669	0.0525
$r (t)$	0.0461	0.0396	0.0437	0.0933
$ϕ (t)$	0.0463	0.0388	0.0490	0.0654

Table 8. Residuals mean ± standard deviation for the state space model.

Output	Train	Test	Validation	OOD
$u (t)$	$0.0116 \pm 0.2160$	$- 0.0370 \pm 0.1902$	$- 0.0372 \pm 0.1869$	$- 0.0767 \pm 0.5027$
$v (t)$	$0.0453 \pm 0.1289$	$0.0305 \pm 0.0628$	$0.0356 \pm 0.0688$	$0.0529 \pm 0.4165$
$p (t)$	$0.0001 \pm 0.0067$	$- 0.0002 \pm 0.0043$	$- 0.0001 \pm 0.0057$	$- 3.91 \times 10^{- 5} \pm 0.0085$
$r (t)$	$- 0.0034 \pm 0.0068$	$- 0.0028 \pm 0.0070$	$- 0.0031 \pm 0.0064$	$- 0.0020 \pm 0.0217$
$ϕ (t)$	$- 2.17 \times 10^{- 5} \pm 0.0129$	$- 0.0017 \pm 0.0125$	$- 0.0015 \pm 0.0129$	$- 0.0066 \pm 0.0388$

Table 9. NMSE for multi-step prediction horizons (1, 5, 15, 30 steps) under OOD evaluation.

Model	Horizon	u	v	p	r	$ϕ$
Linear Regression	1	0.0583	0.0900	0.0647	0.1509	0.0876
	5	0.0695	0.0927	0.0648	0.1650	0.0884
	15	0.0992	0.1066	0.0648	0.1868	0.0939
	30	0.1355	0.1259	0.0648	0.1964	0.1066
Polynomial Regression	1	0.0602	0.0990	0.0645	0.1636	0.0624
	5	0.0727	0.1040	0.0646	0.1822	0.0646
	15	0.1036	0.1232	0.0648	0.2084	0.0794
	30	0.1394	0.1443	0.0647	0.2143	0.1071
State-space	1	0.0417	0.0764	0.0983	0.0877	0.0137
	5	0.0456	0.0805	0.0573	0.1408	0.0758
	15	0.0634	0.0970	0.0717	0.1748	0.1050
	30	0.0818	0.1127	0.0802	0.2013	0.1278

Table 10. NMSE mean ± standard deviation for the linear regression model.

Output	Train	Test	Validation	OOD
$u (t)$	0.0376 ± 5.38 $\times 10^{- 7}$	0.0371 ± 5.3 $\times 10^{- 6}$	0.0438 ± 1.1 $\times 10^{- 5}$	0.0559 ± 3.4 $\times 10^{- 5}$
$v (t)$	0.0640 ± 7.25 $\times 10^{- 7}$	0.0673 ± 3.3 $\times 10^{- 5}$	0.0756 ± 5.3 $\times 10^{- 5}$	0.0898 ± 3.4 $\times 10^{- 5}$
$p (t)$	0.0341 ± 3.54 $\times 10^{- 7}$	0.0269 ± 6.31 $\times 10^{- 7}$	0.0396 ± 1.0 $\times 10^{- 6}$	0.0647 ± 6.8 $\times 10^{- 6}$
$r (t)$	0.0588 ± 6.84 $\times 10^{- 7}$	0.0626 ± 2.9 $\times 10^{- 5}$	0.0634 ± 1.8 $\times 10^{- 5}$	0.1469 ± 3.7 $\times 10^{- 5}$
$ϕ (t)$	0.0723 ± 7.94 $\times 10^{- 7}$	0.0703 ± 2.6 $\times 10^{- 5}$	0.0840 ± 6.9 $\times 10^{- 5}$	0.0876 ± 6.4 $\times 10^{- 5}$

Table 11. NMSE mean ± standard deviation for the third-order polynomial regression model.

Output	Train	Test	Validation	OOD
$u (t)$	0.0313 ± 2.05 $\times 10^{- 6}$	0.0296 ± 2.13 $\times 10^{- 5}$	0.0360 ± 2.74 $\times 10^{- 5}$	0.0574 ± 1.0 $\times 10^{- 4}$
$v (t)$	0.0490 ± 2.13 $\times 10^{- 6}$	0.0490 ± 5.01 $\times 10^{- 5}$	0.0559 ± 6.09 $\times 10^{- 5}$	0.0984 ± 3.0 $\times 10^{- 4}$
$p (t)$	0.0341 ± 1.32 $\times 10^{- 6}$	0.0269 ± 3.29 $\times 10^{- 6}$	0.0396 ± 2.65 $\times 10^{- 6}$	0.0645 ± 1.49 $\times 10^{- 5}$
$r (t)$	0.0486 ± 3.74 $\times 10^{- 6}$	0.0505 ± 3.75 $\times 10^{- 5}$	0.0523 ± 4.17 $\times 10^{- 5}$	0.1581 ± 5.0 $\times 10^{- 4}$
$ϕ (t)$	0.0367 ± 1.87 $\times 10^{- 6}$	0.0372 ± 3.08 $\times 10^{- 5}$	0.0455 ± 4.32 $\times 10^{- 5}$	0.0624 ± 1.0 $\times 10^{- 4}$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Contreras Montes, J.; Lovo Ayala, A.; Ospino-Balcázar, D.; Velasquez Gutierrez, K.; Soto Montaño, C.; Soto-Diaz, R.; Jiménez-Cabas, J.; Oñate López, J.; Escorcia-Gutierrez, J. Ship Model Identification Using Interpretable 4-DOF Maneuverability Models for River Combat Boat. Computation 2025, 13, 296. https://doi.org/10.3390/computation13120296

AMA Style

Contreras Montes J, Lovo Ayala A, Ospino-Balcázar D, Velasquez Gutierrez K, Soto Montaño C, Soto-Diaz R, Jiménez-Cabas J, Oñate López J, Escorcia-Gutierrez J. Ship Model Identification Using Interpretable 4-DOF Maneuverability Models for River Combat Boat. Computation. 2025; 13(12):296. https://doi.org/10.3390/computation13120296

Chicago/Turabian Style

Contreras Montes, Juan, Aldo Lovo Ayala, Daniela Ospino-Balcázar, Kevin Velasquez Gutierrez, Carlos Soto Montaño, Roosvel Soto-Diaz, Javier Jiménez-Cabas, José Oñate López, and José Escorcia-Gutierrez. 2025. "Ship Model Identification Using Interpretable 4-DOF Maneuverability Models for River Combat Boat" Computation 13, no. 12: 296. https://doi.org/10.3390/computation13120296

APA Style

Contreras Montes, J., Lovo Ayala, A., Ospino-Balcázar, D., Velasquez Gutierrez, K., Soto Montaño, C., Soto-Diaz, R., Jiménez-Cabas, J., Oñate López, J., & Escorcia-Gutierrez, J. (2025). Ship Model Identification Using Interpretable 4-DOF Maneuverability Models for River Combat Boat. Computation, 13(12), 296. https://doi.org/10.3390/computation13120296

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Ship Model Identification Using Interpretable 4-DOF Maneuverability Models for River Combat Boat

Abstract

1. Introduction

2. Contributions

3. Related Works

4. Methodology

5. Materials and Methods

5.1. Dataset

5.2. Modeling

6. Results

6.1. Linear Regression Model

6.2. Polynomial Regression Model

6.3. AutoRegressive Model with Exogenous Inputs

6.4. State-Space Model

6.5. Temporal Stability Analysis

6.6. Statistical Significance

7. Conclusions and Future Research Lines

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI