System Identification of a Moored ASV with Recessed Moon Pool via Deterministic and Bayesian Hankel-DMDc

Giorgio Palma; Ivan Santic; Andrea Serani; Lorenzo Minno; Matteo Diez

doi:10.3390/jmse13122267

,

and

¹

National Research Council–Institute of Marine Engineering (CNR-INM), Via di Vallerano, 139, 00128 Rome, Italy

²

Codevintec Italiana S.r.l., Viale Lenormant 215-217, 00119 Rome, Italy

^*

Authors to whom correspondence should be addressed.

J. Mar. Sci. Eng.2025, 13(12), 2267;https://doi.org/10.3390/jmse13122267

This article belongs to the Special Issue Machine Learning for Prediction of Ship Motion

Version Notes

Order Reprints

Abstract

This study addresses the system identification of a small autonomous surface vehicle (ASV) under moored conditions using Hankel dynamic mode decomposition with control (HDMDc) and its Bayesian extension (BHDMDc). Experiments were carried out on a Codevintec CK-14e ASV in the CNR-INM towing tank, under both irregular and regular head wave conditions. The ASV under investigation features a recessed moon pool, which induces nonlinear responses due to sloshing, thereby increasing the modeling challenge. Data-driven reduced-order models were built from measurements of vessel motions and mooring loads. The HDMDc framework provided accurate deterministic predictions of vessel dynamics, while the Bayesian formulation enabled uncertainty-aware characterization of the model response by accounting for variability in hyperparameter selection. Validation against experimental data demonstrated that both HDMDc and BHDMDc can predict the vessel’s response under unseen regular and irregular wave excitations. In conclusion, this study shows that HDMDc-based ROMs are a viable data-driven alternative for system identification, demonstrating for the first time their generalization capability for an unseen sea condition different from the training set, achieving high accuracy in reproducing the vessel dynamics.

Keywords:

dynamic mode decomposition; system identification; data-driven modeling; reduced order modeling; machine learning; ship motions; uncertainty quantification

1. Introduction

The maritime and ocean engineering communities have increasingly recognized the importance of developing accurate and computationally efficient models for the prediction and control of marine system dynamics. In particular, the capability to accurately predict ship motions is essential for supporting design and operations. This holds particularly for seakeeping and maneuvering in adverse weather conditions, for which the availability of reliable predictive tools helps develop and operate vessels, ensuring the safety of structures, payload, and crew. In this regard, commercial and military ships must meet the International Maritime Organization (IMO) Guidelines and NATO Standardization Agreements (STANAG), respectively. Both regulatory frameworks emphasize the need for accurate assessments of vessel motions and loads under a wide range of sea conditions, highlighting the importance of robust modeling and simulation tools to support compliance and operational readiness.

The complexity and nonlinear nature of hydrodynamic phenomena involved pose significant challenges to predicting ship responses in waves. Recent works [1,2] demonstrated the ability of computational fluid dynamics (CFD) methods with unsteady Reynolds-averaged Navier–Stokes (URANS) formulations to assess ship performances in waves and extreme sea conditions. Along with the high fidelity of simulations, however, comes their high computational costs. This is particularly true when simulations aim to achieve statistical convergence of relevant quantities of interest, and complex fluid–structure interactions are investigated. Real-time applications, such as control, fault detection, and digital twinning in general, are also limited by the computational resources required from such models.

In this context, data-driven modeling techniques have emerged as powerful alternatives or complements to traditional first-principles approaches. They promise to reduce the computational cost while keeping the fidelity of their estimates comparable to the original data sources. This requires the methods to be properly trained and/or calibrated, which is generally not trivial [3]. Within this framework, system identification provides a structured approach for constructing predictive models able to incorporate key characteristics of the system from high-fidelity numerical tools and/or experiments. Here, we interpret system identification broadly as the development of reduced-order models (ROMs) capable of predicting system responses, independent of whether their parameters correspond directly to physical quantities.

Equation-based reduced order models (ROMs), such as the Maneuvering Modeling Group (MMG) model [4,5], have been developed as physics-based efficient approaches, characterized by fast evaluation, and demonstrated good agreement with experiments and CFD for the maneuvering of displacement ships [6] and twin-hull configurations [7,8] and have been recently studied also for planing hulls [9]. Physics-based models are typically classifiable as white-box (fully equation-based) or grey-box (physiscs-based with data tuning). This characteristic makes their solution highly interpretable, offering insights on the identified system and the involved physical phenomena Despite the promising results, such models typically exhibit a limited adaptability to unmodeled dynamics (white-box) or require a large amount of data from CFD computations or experimental fluid dynamics (EFD) for their training and definition of forcing terms (grey-box).

Data-driven machine learning techniques gained popularity due to their ability to model complex input–output relations in an automated manner directly from data, not requiring the effort of gaining prior knowledge on the system (equation-free approaches). In particular, recurrent neural networks (RNNs) [10] and long short-term memory networks (LSTMs) [3], along with their bidirectional LSTM (BiLSTM) [11,12] variant, have been demonstrated to be effective for building equation-free data-driven models for ship motions and provide multi-step-ahead forecasting of the ship’s motion in several sailing conditions, including calm water and waves [13]. The strength of machine and deep learning methods lies in their ability to capture relevant hidden and nonlinear input–output relations directly from available data, their compactness, and enabling fast evaluation. However, deep learning models typically require large datasets for training (more complex architectures usually require more expensive training) and to generalize effectively. In addition, while powerful, such models are often considered black-box approaches and pose challenges in terms of the physical interpretability of their results.

Among the available methods, the dynamic mode decomposition (DMD) [14,15,16,17] and its methodological variants have recently gained attention due to their ability to extract dominant dynamic features directly from experimental or numerical data, with small or no assumption on the underlying physics, providing interpretable, low-dimensional representations of nonlinear systems. DMD can be classified somewhere between a black-box and a grey-box approach for reduced-order modeling. With the former, DMD shares the data-driven and equation-free structure like other machine learning techniques, such as RNN, LSTM, and BiLSTM. However, DMD-based methods retain a certain level of interpretability thanks to the linear nature of the model. DMD can be considered a method to build a finite-dimensional approximation of the Koopman operator [18], which, in turn, describes a nonlinear dynamical system as a possibly infinite-dimensional linear system [19]. The reduced-order linear model is obtained by DMD from a small set of snapshots of the dynamical system under analysis. The DMD obtains the model with a direct procedure (linear algebra) that, from a machine learning perspective, constitutes the training phase. Its data-driven nature, the fast non-iterative training, and its data-lean characteristics contributed to the popularity of DMD as a reduced-order modeling technique in several fields, such as fluid dynamics and aeroacoustics [14,20,21,22,23], epidemiology [24], neuroscience [25], finance [26].

DMD was applied for the first time to the forecasting of ship dynamics in [27], in which the proof of concept for short-term forecasting of trajectories, motions, and loads of maneuvering ships in waves was given. In [28], the approach was systematically assessed on the same test cases and first extended to the use of an augmented DMD by augmenting the system state with lagged copies of the original states and their derivatives. This approach enabled the modeling of memory effects in the system, improving accuracy over the tested cases compared to the standard formulation. Ref. [29] systematically explored the use of Hankel-DMD (HDMD) for short-term forecasting of ship motions, highlighting its potential for real-time prediction and control applications and digital twinning. One of the first efforts into the development of a DMD-based, data-driven system identification model for ship motions was conducted in [30], using the dynamic mode decomposition with control (DMDc). This incorporates the control variables (e.g., rudder angle) and forcing inputs (e.g., wave elevation) in the system regression, separating their effect from the free evolution of the system. Furthermore, Hankel-DMD with control (HDMDc) was applied to both ship motion and forces prediction, demonstrating the capability of the method to achieve good accuracy without degradation through the prediction time. The HDMDc was then tested on several ship test cases, introducing methodological advancements to face specific challenges, such as the embedding of nonlinear observables in the state and input vectors to address extremely nonlinear responses of planing hulls in slamming [31], and the use of a Tikhonov-regularized least-square formulation for improving the numerical stability of the DMD regression when using noisy experimental data [32].

Quantifying the uncertainty associated with the prediction of a ROM has become increasingly relevant for data-driven modeling, and a key characteristic for its usage in the context of, e.g., multifidelity analysis and optimization. Few approaches for introducing uncertainty quantification in DMD analysis have been presented in the literature so far. Ref. [33] first introduced a probabilistic model by modeling the measurement noise as Gaussian and treating the dynamic modes and eigenvalues as random variables, whose posterior distributions were inferred through Gibbs sampling. Later, ref. [34] presented the bagging optimized-DMD, where Breiman’s statistical resampling strategy was applied to training data, generating ensembles of DMD models and estimating confidence intervals for the extracted modes and eigenvalues. A similar approach was used in [32] and called a frequentist approach: several training signals were used, producing an ensemble of HDMDc models and estimating confidence intervals for time-resolved predictions of ship motions. Refs. [29,30] considered the uncertainty arising from the selection of HDMD and HDMDc hyperparameters: the length of the training sequence and the number of time-lagged copies of the state were separately considered as probabilistic variables with uniform distributions within a suitable range. Monte Carlo sampling was applied to obtain an ensemble of predictions forming a posterior distribution. This concept, referred to as Bayesian, was further developed in [29,30,31,32], where all relevant hyperparameters are considered as stochastic variables at once.

HDMDc and its stochastic extensions enabled the construction of robust, uncertainty-aware ROMs capable of capturing the essential dynamics of marine systems under realistic operating conditions. In practical marine applications, it is crucial that ROMs remain valid across a variety of sea states, wave spectra, and loading conditions. Testing the transferability of DMD-derived models beyond their training datasets, therefore, represents a key step toward their reliable adoption in real-world scenarios. However, despite the growing body of literature on DMD-based modeling in ship dynamics, most studies to date have focused on fixed or well-controlled experimental conditions, with limited exploration of the generalization capabilities of such models when exposed to unseen environments. In [30,31,32], for example, the test set for the DMD-based ROMs was composed from ship’s dynamic responses to unseen forcing wave signals, which, however, represented different realizations of the same sea state, characterized by an identical spectral distribution of wave energy. Consequently, although the wave sequences used for testing were unseen during training, they were statistically consistent with the training conditions, thus not probing the generalization capability of the models to different sea states.

The present work aims to address this gap by investigating specific capability of HDMDc and BHDMDc system identification to generalize beyond the training conditions. A dedicated experimental campaign was performed at the CNR-INM towing tank facility, collecting data from a Codevintec CK-14e autonomous surface vehicle (ASV) subject to irregular and regular head wave conditions. The ASV features a recessed moon pool, which induces significant nonlinearities in the hydrodynamic response, primarily associated with sloshing and piston-like oscillations of the internal free surface. Specifically, the DMD-based ROMs are learned using data from the irregular waves condition and subsequently applied to predict the vessel response in irregular and regular wave conditions. Statistical and probabilistic analyses were employed to quantify prediction accuracy and uncertainty.

The remainder of this paper is organized as follows. Section 2 describes the experimental setup and data acquisition. Section 3 presents the HDMDc and BHDMDc methodologies adopted in the study. Section 6 discusses the identification results and the validation of the models under different sea conditions. Finally, Section 7 summarizes the main findings and outlines perspectives for future research on data-driven modeling of marine systems.

2. Experimental Setup

2.1. Facility and Model

Experimental tests were carried out at the CNR-INM Emilio Castagneto seakeeping basin in Rome. The facility is 220 m long, 9 m wide, and 3.6 m deep. It is equipped with a single-flap wave generator capable of producing both regular and irregular waves: regular waves with wavelengths ranging from 1 to 10 m and corresponding heights from 0.1 to 0.45 m, as well as irregular waves following any desired sea spectrum at the appropriate scale. The plunger flap, whose rotation axis is located at 1.80 m below the calm water level (mid-depth), is electro-hydraulically driven by three pumps with a total power of 38.5 kW. Its deflection angle is controlled in the range ±13 deg through an electronic programming system producing harmonic components, each modulated in both amplitude and frequency (in the range 0.1–1.4 Hz).

A Codevintec CK-14e ASV was used for the present work, see Figure 1a. The vessel is a small marine surface drone, with a carbon fiber and Kevlar hull, 1.40 m in length (

L_{m}

), 0.9 m in beam, and 0.35 m in height (0.45 m including the rollbar). In the configuration used for the present tests, the model weighed 60.7 kg, consisting of 22 kg for the hull, 13.4 kg for the batteries, and the remainder for the payload (i.e., instrumentation and cables).

Figure 1. Picture of the CK-14e ASV (a); the top cover is removed, showing the internal configuration and the location of the moon pool, visible from the lid and its fastening system. Towing tank test rig for CK-14e in moored conditions, the white panel used in tracking is visible on the model top cover (b).

2.2. Moored Tests

During the tests, the CK-14e ASV was moored and encountered irregular and regular head waves, as shown in Table 1 and described below.

Table 1. Test matrix; regular and irregular wave conditions.

A still carriage in the tank was used to position the bow and stern mooring poles for the model, see Figure 1b; the distance between the two poles was 8.22 m, with the bow pole positioned 48.24 m away from the wave generator flap. The model was moored to the poles through elastic mooring lines, characterized by linear elastic constants

k_{bow} = 1.89

N/m and

k_{stern} = 1.33

N/m.

For both regular and irregular wave testing, sea state conditions were selected by considering the CK-14e as a 1:50 scaled model of a 70 m long supply vessel.

2.2.1. Irregular Waves

The first part of the experiments was dedicated to model testing in irregular head waves. The wave generator was operated to produce a linear superposition of

N_{w} = 204

elemental wave components with random phase

φ

and frequencies equally spaced in the interval

0.28

Hz

\leq f \leq 1.13

Hz:

η (t) = \sum_{i = 1}^{N_{w}} A (f_{i}) sin (2 π f_{i} t + φ_{i}) .

(1)

The amplitudes

A (f_{i}) = \sqrt{(4 π E (f_{i}) Δ f_{i})}

were defined to obtain a sea state 7 Pierson-Moskowitz (P-M) spectrum

E (f)

[35] characterized by a significant height of

h_{s} = 0.15

m and peak period

T_{p} = 2.12

s in model scale (

h_{s} = 7.5

m and

T_{p} = 15

s in full scale):

E (f) = \frac{5}{16} h_{s}^{2} T_{p}^{- 4} f^{- 5} exp (- \frac{5}{4} T_{p}^{- 4} f^{- 4}) .

(2)

Figure 2 compares the nominal power spectral density from Equation (2), with the spectrum obtained from the experimental wave, as measured by the Kenek probe in the forward position (see Figure 3b). The EFD power spectral density is obtained using Welch’s method by analyzing approximately 210 s of wave elevation signal per run (10.5 min) in ASV scale, corresponding to about 4455 s in equivalent supply vessel scale (1 h and 14 min). The signal, sampled at 500 Hz, was divided into segments of

2^{12}

samples with an overlap of

2^{7}

samples. Each segment was transformed using a

2^{16}

-point FFT, and the resulting periodograms were averaged to obtain the final spectral estimate. The actual realizations are characterized by an average significant height of

{\hat{h}}_{s} = 0.16

m and average peak period of

{\hat{T}}_{p} = 1.98

s in ASV scale (

{\hat{h}}_{s} = 8

m and

{\hat{T}}_{p} = 14

s in equivalent supply vessel scale).

Figure 2. Comparison between nominal Pierson–Moskowitz from Equation (2) and experimental power spectral density of wave elevation obtained from irregular wave test. Vertical lines show the frequencies of regular waves tests.

Figure 3. Sketch of the wave probes positions (a), and wave probes location from wave generator flap (b).

2.2.2. Regular Waves

In the second part of the experiments, the moored ASV was subject to regular head waves. The nominal wave height h was set to

h = 0.1

m ASV scale (5 m equivalent supply vessel scale). Three different wavelengths

λ_{w} = 1.5 L_{m}

,

2 L_{m}

, and

3 L_{m}

were tested. Actual realizations were characterized by (average ± standard deviation)

\hat{h} = 0.105 \pm 0.01

m,

0.107 \pm 0.01

m, and

0.116 \pm 0.01

m for the three

λ_{w}

, respectively.

2.3. Instrumentation

2.3.1. Wave Elevation Measurements

Two capacitance-type probes by Kenek (model SPH 150) were rigidly mounted to the carriage for measuring wave elevation at the forward mooring pole and the center of gravity of the model at rest. In addition, four ultrasonic wave elevation probes were also mounted, two of them in correspondence with the capacitance-type probes, one in correspondence with the half of the forward mooring line at rest, and one in correspondence with the bow of the model at rest. The distance in meters from the wave generator flap to each probe is reported in Figure 3b. The elevation probes collected data with a sampling frequency of

f_{s w} = 500

Hz. Signals were then filtered by applying a low-pass minimum-order filter with delay compensation and stopband attenuation of 60 dB, with passband frequency

f_{p b} = 10

Hz and stopband frequency

f_{s b} = 10.75

Hz.

2.3.2. Motion Measurements

An SMC-108 motion reference unit (MRU) was installed onboard the CK-14e (see Figure 1a), composed of accelerometers and gyroscopes mounted in all three axes. The sensing data signals were acquired with a sampling frequency

f_{s m} = 50

Hz. Measures were processed online in parallel with a Kalman filter inside the MRU multi-core processor. This provided output data for accelerations

\ddot{x}, \ddot{y}, \ddot{z}

, angular velocities

\dot{ϕ}, \dot{θ}

, heave z, roll

ϕ

, pitch

θ

, and attitude

ψ

. Furthermore, z was computed by integration of

\ddot{z}

into a linear position. The integration was further processed and filtered for an accurate heave measurement.

The set of variables was further extended by post processing the measurements:

\dot{x}

and

\dot{z}

were obtained by integrating the measured accelerations and

\ddot{θ}

were obtained by 4th order central differentiation from the corresponding measured angular velocity. A low-pass minimum-order filter with a stopband attenuation of 60 dB and delay compensation was applied after the integration step, with passband frequency

f_{p b} = 10

Hz and stopband frequency

f_{s b} = 10.75

Hz.

Additionally, a video recording of the moored setup from a side view was taken with a GoPro Hero 7 camera. A dedicated image-processing pipeline was developed to track the motion of the ship model. Each video frame was first converted to the RGB color space and segmented using fixed intensity thresholds to isolate the region corresponding to a white identification panel positioned on top of the model, used as the tracked object. The resulting binary mask was refined through morphological operations (opening, closing, and hole-filling) in order to remove noise and produce a clean and contiguous representation of the target. Connected-component analysis was then performed, and the panel was identified as the largest component. Its geometric properties—including centroid position and orientation angle—were extracted using standard region-based feature analysis. These measurements were stored frame by frame and subsequently low-pass filtered to suppress high-frequency noise and obtain estimations for x, z, and

θ

. The pitch and heave from the image-processing tracking approach were compared with the MRU signals, obtaining a good agreement, validating the procedure and the so-obtained surge measurement.

2.3.3. Mooring Forces

Mooring forces

M_{bow}

and

M_{stern}

were measured by two high-accuracy DDEN in-line submersible load cells by Applied Measurements Ltd., Aldermaston, Reading, UK, aligned with the mooring lines at calm water level. Pretensions have been applied accordingly to the elastic constants of the mooring lines and expected motions of the model in the test area.

2.4. Response Amplitude Operators

Figure 4 shows the response amplitude operators (RAOs) for the ASV’s surge, heave, and pitch evaluated from the irregular and regular wave tests data. No significant resonant behavior is evident in the frequency range considered. The RAOs show an overdamped response for the motions; this may be an indication of the moonpool having a dissipative effect through sloshing phenomena, to be confirmed by further investigations which are, however, beyond the scope of this work. Heave and pitch RAOs suggest the presence of a response small peak around

λ_{w} / L_{p p} = 1.5

, visible in both the irregular and regular response, which is often found in typical seakeeping problems.

Figure 4. Response amplitude operators for surge, heave, and pitch, evaluated from irregular (solid blue line) and regular (black circles) wave responses. The dash-dotted vertical line highlights the peak frequency of the P-M EFD spectrum.

3. HDMDc

DMD [14,36] was originally presented to decompose high-dimensional time-resolved data into a set of spatiotemporal coherent structures, characterized by fixed spatial structures (modes) and associated temporal dynamics, providing a linear reduced-order representation of possibly nonlinear system dynamics. The original DMD characterizes naturally evolving dynamical systems. In contrast, its extension to forced systems, called DMD with control (DMDc) [37], accounts for the influence of forcing inputs in the analysis, helping disambiguate it from the unforced dynamics of the system.

The standard DMD and DMDc formulations approximate the Koopman operator, creating a best-fit linear model that links sequential data snapshots of measurements [14,15,38]. This model provides a locally linear (in time) representation of the dynamics, which, however, is unable to capture many essential features of nonlinear systems. The augmentation of the system state is thus the subject of several DMD algorithmic variants [17,39,40,41] aiming to find a coordinate system (or embedding) that spans a Koopman-invariant subspace, to search for an approximation of the Koopman operator valid also far from fixed points and periodic orbits in a larger space. However, there is no general rule for defining these observables and guaranteeing they will form a closed subspace under the Koopman operator [42].

The HDMD [43] is a specific version of the DMD algorithm developed to deal with the cases of nonlinear systems in which only partial observations are available [17]. Incorporating time-lagged information in the data used to learn the model, HDMD and HDMDc increase the dimensionality of the system. Including time-delayed data in the analysis, the HDMD and its extension to externally forced systems, HDMDc, can extract linear modes and the associated input operator defined in a space of augmented dimensionality. Such modes are capable of reflecting the nonlinearities in the time evolution of the original system through complex relations between present and past states. The state vector is thus augmented by embedding s time-delayed copies of the original variables. The HDMDc involves, in addition, augmenting the input vector with z time-delayed copies of the original forcing inputs. The use of time-delayed copies as additional observables in the DMD has been connected to the Koopman operator as a universal linearizing basis [44], yielding the true Koopman eigenfunctions and eigenvalues in the limit of infinite-time observations [43].

The HDMDc identifies a representation of the dynamics as an externally forced system:

{\hat{x}}_{j + 1} = \hat{A} {\hat{x}}_{j} + \hat{B} {\hat{u}}_{j} .

(3)

The vectors

{\hat{x}}_{j}

and

{\hat{u}}_{j}

are referred to as the extended state and input vectors, respectively, at the time instant j. The vectors

{\hat{x}}_{j}

and

{\hat{u}}_{j}

are obtained starting from the original state and input vectors

x_{j}

and

u_{j}

, respectively:

x_{j} = [\begin{matrix} x_{1} \\ x_{2} \\ \dots \\ x_{N} \end{matrix}] \in R^{N}, u_{j} = [\begin{matrix} u_{1} \\ u_{2} \\ \dots \\ u_{Q} \end{matrix}] \in R^{Q},

(4)

which are augmented by embedding a number s and z of time-lagged (delayed) copies of the original state and input variables, such that:

{\hat{x}}_{j} = [\begin{matrix} x_{j} \\ x_{j - 1} \\ \dots \\ x_{j - s} \end{matrix}] \in R^{N (s + 1)}, {\hat{u}}_{j} = [\begin{matrix} u_{j} \\ u_{j - 1} \\ \dots \\ u_{j - z} \end{matrix}] \in R^{Q (z + 1)} .

(5)

As a consequence, the extended state matrix and the extended system input matrix are defined as

\hat{A} \in R^{N (s + 1) \times N (s + 1)}

and

\hat{B} \in R^{N (s + 1) \times Q (z + 1)}

, respectively.

The procedure to extract the extended matrices from data starts by introducing the vector

{\hat{y}}_{j} \in R^{N (s + 1) + Q (z + 1)}

:

{\hat{y}}_{j} = [\begin{matrix} {\hat{x}}_{j} \\ {\hat{u}}_{j} \end{matrix}],

(6)

such that Equation (3) can be rewritten as follows:

{\hat{x}}_{j + 1} = \hat{G} {\hat{y}}_{j}, with \hat{G} = [\begin{matrix} \hat{A} & \hat{B} \end{matrix}] .

(7)

Data from m snapshots are re-arranged in two augmented data matrices

\hat{Y} \in

R^{[N (s + 1) + Q (z + 1)] \times (m - 1)}

and

{\hat{X}}^{'} \in R^{N (s + 1) \times (m - 1)}

, which are built as follows:

\hat{Y} = [\begin{matrix} X \\ S \\ U \\ Z \end{matrix}], {\hat{X}}^{'} = [\begin{matrix} X^{'} \\ S^{'} \end{matrix}] .

(8)

Specifically the matrices

X \in R^{N \times (m - 1)}

,

X^{'} \in R^{N \times (m - 1)}

, and

U \in R^{Q \times (m - 1)}

contain the state and input snapshots at the m considered time instants:

\begin{matrix} X & = [\begin{matrix} x_{j} & x_{j + 1} & \dots & x_{m - 1} \end{matrix}], \end{matrix}

(9)

\begin{matrix} X^{'} & = [\begin{matrix} x_{j + 1} & x_{j + 2} & \dots & x_{m} \end{matrix}], \end{matrix}

(10)

\begin{matrix} U & = [\begin{matrix} u_{j} & u_{j + 1} & \dots & u_{m - 1} \end{matrix}], \end{matrix}

(11)

while the Hankel matrices

S

,

S^{'}

, and

Z

contain the extended delayed state and input snapshots:

\begin{matrix} S & = [\begin{matrix} x_{j - 1} & x_{j} & \dots & x_{m - 2} \\ x_{j - 2} & x_{j - 1} & \dots & x_{m - 3} \\ ⋮ & ⋮ & ⋮ & ⋮ \\ x_{j - s - 1} & x_{j - s} & \dots & x_{m - s - 1} \end{matrix}], \end{matrix}

(12)

\begin{matrix} S^{'} & = [\begin{matrix} x_{j} & x_{j + 1} & \dots & x_{m - 1} \\ x_{j - 1} & x_{j} & \dots & x_{m - 2} \\ ⋮ & ⋮ & ⋮ & ⋮ \\ x_{j - s} & x_{j - s + 1} & \dots & x_{m - s} \end{matrix}], \end{matrix}

(13)

\begin{matrix} Z & = [\begin{matrix} u_{j - 1} & u_{j} & \dots & u_{m - 2} \\ u_{j - 2} & u_{j - 1} & \dots & u_{m - 3} \\ ⋮ & ⋮ & ⋮ & ⋮ \\ u_{j - z - 1} & u_{j - z} & \dots & u_{m - z - 1} \end{matrix}] . \end{matrix}

(14)

The augmented matrix

\hat{G} = [\hat{A} \hat{B}] \in R^{N (s + 1) \times (N (s + 1) + Q (z + 1))}

is approximated by solving the following regularized least-square minimization:

min_{\hat{A}, \hat{B}} | | {\hat{X}}^{'} - [\hat{A} \hat{B}] \hat{Y} {| |}_{F}^{2} + λ [\hat{A} \hat{B}] {| |}_{F}^{2},

(15)

which solution is given by the following equation:

[\hat{A} \hat{B}] = {\hat{X}}^{'} {\hat{Y}}^{T} {[\hat{Y} {\hat{Y}}^{T} + λ I]}^{- 1} .

(16)

where

λ

is a regularization factor. The above Tikhonov-regularized formulation extends the one presented in [45] to the HDMDc. It is suitable for improving the numerical stability of DMD regression, with its robustness in high-dimensional spaces and accuracy for applications involving noisy data. A similar effect can be pursued with the exact-DMD formulation by SVD rank truncation, see [31], also reducing the model size. The Tikhonov-regularized formulation is here preferred for its more robust behavior to noisy data. Once the matrices

\hat{A}

and

\hat{B}

are obtained, Equation (3) can be used to calculate the time evolution of the augmented state vector

\tilde{\hat{x}} (t)

from an initial condition, where the tilde indicates the HDMDc estimation. By isolating its first N components, the predicted time evolution of the original state variables

\tilde{x} (t)

is extracted.

Uncertainty Quantification in HDMDc Through Ensembling

DMD-based models can be further extended to provide uncertainty estimation of their predictions. To this aim, the ensembling approach is applied, i.e., the combination of predictions coming from different models to obtain a prediction along with its statistics. The Bayesian formulation for HDMDc, which leads to BHDMDc was intrduced in [30]. The driving rationale behind the BHDMDc is the observation highlighted by the authors in previous works [13,28] that the final prediction from DMD-based models can strongly vary for different hyperparameter settings. Furthermore, there is no general rule for determining their optimal values.

The dimensions and the values within matrices

\hat{A}

and

\hat{B}

depend on the four hyperparameters m, s, z, and

λ

. In place of the the first three, we equivalently use their respective time lengths, i.e., the observation time length

l_{t r} = t_{m} - t_{1}

, the maximum delay time in the augmented state

l_{d_{x}} = t_{j - 1} - t_{j - s - 1}

, and the maximum delay time in the augmented input

l_{d_{u}} = t_{j - 1} - t_{j - z - 1}

. The relation between m, s, z, and

l_{t r}

,

l_{d_{x}}

,

l_{d_{u}}

is given by:

m = ⌊\frac{l_{t r}}{Δ t}⌋, s = ⌊\frac{l_{d_{x}}}{Δ t}⌋, z = ⌊\frac{l_{d_{u}}}{Δ t}⌋,

(17)

with

Δ t

the step of the temporal discretization. These dependencies can be denoted as follows:

\begin{matrix} \hat{A} & = \hat{A} (l_{t r}, l_{d_{x}}, l_{d_{u}}, λ), \\ \hat{B} & = \hat{B} (l_{t r}, l_{d_{x}}, l_{d_{u}}, λ) . \end{matrix}

(18)

In BHDMDc, the hyperparameters are considered as stochastic variables with given prior distribution,

p (l_{t r})

,

p (l_{d_{x}})

,

p (l_{d_{u}})

, and

p (λ)

, respectively. Through uncertainty propagation, the solution

\tilde{x} (t)

also depends on

l_{t r}

,

l_{d_{x}}

,

l_{d_{u}}

, and

λ

:

x = x (t, l_{t r}, l_{d_{x}}, l_{d_{u}}, λ),

(19)

and the following are used to define, at a given time t, the expected value (

μ

) of the solution and its standard deviation (

σ

):

μ [x (t)] = \int_{λ^{l}}^{λ^{u}} \int_{l_{d_{u}}^{l}}^{l_{d_{u}}^{u}} \int_{l_{d_{x}}^{l}}^{l_{d_{x}}^{u}} \int_{l_{t r}^{l}}^{l_{t r}^{u}} x (t, l_{t r}, l_{d_{x}}, l_{d_{u}}, λ) p (l_{t r}) p (l_{d_{x}}) p (l_{d_{u}}) p (λ) d l_{t r} d l_{d_{x}} d l_{d_{u}} d λ,

(20)

σ [x (t)] = {\{\int_{λ^{l}}^{λ^{u}} \int_{l_{d_{u}}^{l}}^{l_{d_{u}}^{u}} \int_{l_{d_{x}}^{l}}^{l_{d_{x}}^{u}} \int_{l_{t r}^{l}}^{l_{t r}^{u}} {\{x (t, l_{t r}, l_{d_{x}}, l_{d_{u}}, λ) - μ [x (t)]\}}^{2} p (l_{t r}) p (l_{d_{x}}) p (l_{d_{u}}) p (λ) d l_{t r} d l_{d_{x}} d l_{d_{u}} d λ\}}^{\frac{1}{2}},

(21)

where

l_{t r}^{l}

,

l_{d_{x}}^{l}

,

l_{d_{u}}^{l}, λ^{l}

, and

l_{t r}^{u}

,

l_{d_{x}}^{u}

,

l_{d_{u}}^{u}

,

λ^{u}

are lower and upper bounds and

p (l_{t r})

,

p (l_{d_{x}})

, and

p (l_{d_{u}})

p (λ)

are the given probability density functions for

l_{t r}

,

l_{d_{x}}

,

l_{d_{u}}

, and

λ

.

In practice, a uniform probability density function is assigned to the hyperparameters, and a set of realizations is obtained through a Monte Carlo sampling, obtaining a posterior distribution on the prediction.

We note that the proposed method is termed “Bayesian” in a broader sense than classical Bayesian inference. In particular, posterior distributions are not inferred over the Koopman operator or dynamic mode decomposition matrices themselves. Instead, in BHDMDc, the hyperparameters of the method are treated in a Bayesian manner, assigning them given prior distributions and propagating their uncertainty through Monte Carlo sampling. This results in a predictive distribution for the system variables. While this does not constitute full Bayesian parameter inference, it is consistent with Bayesian principles of uncertainty propagation and model averaging and aligns with how “Bayesian” is used in the related literature for uncertainty-aware reduced-order modeling.

4. Statistical Variables and Performance Metrics

To compare the predictions made by the deterministic and Bayesian models with the ground truth from the experiments, four error indices are employed: the average normalized mean square error (ANRMSE) [13], its time-resolved version (

ε

), the normalized average minimum/maximum absolute error (NAMMAE) [13], and the Jensen-Shannon divergence (JSD) [46].

The ANRMSE quantifies the root mean square error between the predicted values

{\tilde{x}}_{t}

and the measured (reference) values

x_{t}

at different time steps, normalizing the result for each variable by k times the standard deviation of the measured value (

k = 1

in this work) and averaging over the N variables in

x

:

ANRMSE = \frac{1}{N} \sum_{i = 1}^{N} \sqrt{\frac{1}{T {(k S D [x_{i}])}^{2}} \sum_{j = 1}^{T} {({\tilde{x}}_{i} (t_{j}) - x_{i} (t_{j}))}^{2}},

(22)

where

T

is the number of time instants in the considered time window, and

S D [ξ]

indicates the standard deviation of the measured values in the considered time window for the variable

ξ

:

S D [ξ] = \sqrt{\frac{1}{T - 1} \sum_{i = 1}^{T} {(ξ_{i} - E V [ξ])}^{2}}

(23)

and

E V [ξ] = \frac{1}{T} \sum_{i = 1}^{T} ξ_{i}

(24)

For the same time window, the time-resolved variant of the ANRMSE called

ε

is evaluated as the average across the N variables of the time evolution of the square root difference between the reference and predicted signal, normalized by its standard deviation in the considered time interval:

ε (t) = \frac{1}{N} \sum_{i = 1}^{N} \sqrt{\frac{{(\tilde{x_{i}} (t) - x_{i} (t))}^{2}}{{(k S D [x_{i}])}^{2}}}

(25)

This is used to monitor potential trends in the prediction error, i.e., whether the accuracy decreases or increases for longer predictions.

The NAMMAE metric [13] provides an engineering-oriented assessment of prediction accuracy. It measures the absolute difference between the minimum and maximum values of the predicted and measured time series, normalized by k times the standard deviation of the measured values, and averaged over N variables, as follows:

NAMMAE = \frac{1}{2 N} \sum_{i = 1}^{N} \frac{1}{k S D [x_{i}]} (|min ({\tilde{x}}_{i} (t_{j})) - min (x_{i} (t_{j}))| + |max ({\tilde{x}}_{i} (t_{j})) - max (x_{i} (t_{j}))|) j = 1, \dots, T .

(26)

In addition to the direct comparison of DMD-predicted and experimentally measured time histories, calculating the probability density function (PDF) of the variables under prediction is of interest for the application in irregular waves. To statistically assess the PDF estimator obtaining confidence intervals, a moving block bootstrap (MBB) method is applied to time histories from EFD and BHDMDc-based predictions of each variable. For the MBB analysis of a generic variable

ξ

, a single time series of

T

items is obtained for the EFD and the BHDMDc, respectively, by joining all the measured or predicted series. From the single time series, a number

C = T - r + 1

of moving blocks is used, each defining a time series composed by

ξ_{i}

i = c, \dots, c + r - 1

, where c is the block index, and

r = {(2 φ / a)}^{2 / 3} T^{1 / 3}

is an optimal block length [47] with

φ = \frac{T \sum_{i = 1}^{T} [ξ_{i + 1} - EV (ξ)] [ξ_{i} - EV (ξ)]}{(T - 1) \sum_{i = 1}^{T} [(ξ_{i} - EV (ξ))]}

(27)

and

a = (1 - φ) (1 + φ)

. From the original set of C blocks, a number of

C^{'} = T / r

blocks are drawn at random with replacement and concatenated in the order they are picked, forming a new bootstrapped series of size

T

. The PDF of each bootstrapped time series is obtained using kernel density estimation [48] as follows:

PDF (ξ, y) = \frac{1}{T h} \sum_{i = 1}^{T} K (\frac{y - ξ_{i}}{h}) .

(28)

Here, K is a normal kernel function defined as

K (ξ) = \frac{1}{\sqrt{2 π}} exp (- \frac{ξ^{2}}{2}),

(29)

where

h = 1.06 min (S D [ξ], IQR (ξ)) T^{- 1 / 5}

(30)

is the bandwidth [49], and

IQR (ξ)

is the inter-quartile range for the variable

ξ

I Q R [ξ] = q_{ξ} (0.75) - q_{ξ} (0.25)

(31)

with

q_{ξ}

being the quantile function for the variable

ξ

. The expected value and a confidence interval are calculated for the PDF of each variable for the EFD measurements and DMD-based predictions. The quantile function q is evaluated at probabilities

p = 0.025

and

0.975

, defining the lower and upper bounds of the 95% confidence interval of the PDFs as

U_{PDF (ξ, y)} = q_{PDF (ξ, y)} (0.975) - q_{PDF (ξ, y)} (0.025)

.

The so-obtained PDFs of each variable from the different sources are then compared using the JSD [46], defined as follows:

\begin{matrix} {JSD}_{ξ} = JSD [V_{ξ} | | W_{ξ}] = \frac{1}{2} D (V_{ξ} | | M_{ξ}) + \frac{1}{2} D (W_{ξ} | | M_{ξ}), \end{matrix}

(32)

where

M_{ξ} = \frac{1}{2} (V_{ξ} + W_{ξ}), and D (V_{ξ} | | W_{ξ}) = \sum_{y \in χ} V_{ξ} (y) ln (\frac{V_{ξ} (y)}{W_{ξ} (y)}) .

(33)

The Jensen–Shannon divergence (JSD) [50] is a symmetric measure of similarity between two probability distributions V and W, based on the Kullback–Leibler divergence (D) [51]. Here,

V_{ξ}

and

W_{ξ}

are the PDFs of the variable

ξ

from the EFD and HDMDc, respectively. The JSD quantifies the average discrepancy of each distribution with respect to their mixture M, defined over the domain

χ

. It is always finite, bounded as

0 \leq JSD [V | | M] \leq ln 2

.

For each variable, the expected value and the quantile function for

p = 0.025

and

0.975

of JSD are calculated on the PDFs evaluated from the bootstrapped time series, defining the lower and upper bound of the 95% confidence interval

U_{{JSD}_{ξ}} = U_{JSD (V | | W)} = JSD {(V | | W)}_{q = 0.95} - JSD {(V | | W)}_{q = 0.025}

.

5. System Identification Setup

In the present work, no equation-based motion model of the ASV is assumed a priori. The reduced-order model is obtained in a fully data-driven and equation-free manner, where the state-space representation of Equation (3) is identified directly from the experimental measurements through the (Bayesian) Hankel-DMDc (see Box 1 for a summary of the workflow for the system identification using HDMDc). The state vector of the system is defined as

x = {[x, z, θ, \dot{x}, \dot{z}, \dot{θ}, \ddot{x}, \ddot{z}, \ddot{θ}, M_{bow}, M_{stern}]}^{T} .

(34)

The variables in the state vector correspond to the surge, heave, and pitch degrees of freedom, which dominate the symmetric head-sea response of the moored vessel, together with their first and second time derivatives and the measured mooring loads. These quantities reflect the relevant physics of the phenomenon to be predicted by the system identification procedure.

The input vector is composed of observables based on the wave elevations measured by the Kenek probes

η_{f w d}^{k n}

and

η_{c o g}^{k n}

, and the model surge x.

u = {[η_{f w d}, η_{c o g}]}^{T} .

(35)

During the experiments, the relative position between the model and the wave probes was not constant, and large deviations from the rest position occurred in the x-direction. The elastic restoring force from the moorings was not sufficient to counteract the wave forces and keep the surge oscillating around the rest position. Hence, directly using the wave elevation in the input vector as measured by the probes would lead to non-negligible phase error in the prediction. The HDMDc and BHDMDc are, in fact, able to learn a single phase relation between input and output, which, however, would be insufficient due to the dynamic change in the relative position between probes and model. For this reason, the observables

η_{f w d}

and

η_{c o g}

were approximated as delayed signals from

η_{f w d}^{k n}

and

η_{c o g}^{k n}

:

\begin{matrix} η_{f w d} (t) & = η_{f w d}^{k n} (t - τ (t)) \\ η_{c o g} (t) & = η_{c o g}^{k n} (t - τ (t)) \end{matrix}

(36)

where the delay

τ

depends on the surge x and the effective phase velocity c:

τ (t) = \frac{x (t) - x_{0}}{c} .

(37)

In this work, a simple estimation of c is obtained through signal correlation between

η_{f w d}^{k n}

and

η_{c o g}^{k n}

:

c = \frac{x_{c o g}^{k n} - x_{f w d}^{k n}}{τ_{s}}, τ_{s} = \underset{τ_{s}}{argmax} \{corr [η_{f w d} (t), η_{c o g} (t - τ_{s})]\} .

(38)

The cross-correlation operator applied to the measured discrete wave elevation signals results in a time-discrete vector

ρ_{η} [i]

collecting the correlation value varying the number i of shift samples. The temporal shift

τ_{s}

is hence found as follows:

τ_{s} = \frac{i_{m a x}}{f_{s w}},

(39)

where

i_{m a x}

denotes the discrete integer shift between the signals for which the cross-correlation

ρ_{η}

reaches its maximum.

Box 1. System identification using HDMDc workflow

Inputs. Training state and input time series

x (t) \in R^{N}

,

u (t) \in R^{Q}

; training window length

l_{t r}

; delay-embedding length

l_{d}

; Testing input time series

u (t) \in R^{Q}

for the forecast horizon

l_{t e}

.

Preprocessing.
- Identify the training window of length $l_{t r}$ within the reference time series;
- (optional) subsample time series;
- apply z-scoring to $x (t)$ and $u (t)$ .
Build snapshots matrices.
- With $m = ⌊ l_{t r} / Δ t ⌋$ , form $X = [x_{j}, \dots, x_{m - 1}]$ , $X^{'} = [x_{j + 1}, \dots, x_{m}]$ , and $U = [u_{j}, \dots, u_{m - 1}]$ from the training data.
Hankel embedding.
- With $s = ⌊ l_{d x} / Δ t ⌋$ and $z = ⌊ l_{d u} / Δ t ⌋$ , build delayed state and input vectors:
  
  ${\hat{x}}_{j} = {[x_{j}^{T}, x_{j - 1}^{T}, \dots, x_{j - s}^{T}]}^{T}, {\hat{u}}_{j} = {[u_{j}^{T}, u_{j - 1}^{T}, \dots, u_{j - z}^{T}]}^{T};$
- assemble extended snapshot matrices $\hat{X}, {\hat{X}}^{'}$ , and $\hat{U}$ .
HDMDc operator estimation.
- Define $\hat{Y} = [\begin{matrix} \hat{X} \\ \hat{U} \end{matrix}]$ ;
- estimate the augmented operator by solving the Tikhonov-regularized least-square problem:
  
  $[\hat{A} \hat{B}] = {\hat{X}}^{'} {\hat{Y}}^{T} {(\hat{Y} {\hat{Y}}^{T} + λ I)}^{- 1} .$
Forecasting.
- Collect $\hat{u} (t)$ for the test window $l_{t e}$ ;
- for each time step $k \in [0, l_{t e}]$ , predict $\tilde{\hat{x}}$
  
  ${\tilde{\hat{x}}}_{k + 1} = \hat{A} {\tilde{\hat{x}}}_{k} + \hat{B} {\hat{u}}_{k};$
- extract original variables $\tilde{x} (t)$ from the first N components of $\tilde{\hat{x}} (t)$ ;
- (optional) de-standardize using $μ_{t r}, σ_{t r}$ .

Outputs. Deterministic prediction

\tilde{x} (t)

for the horizon

l_{t e}

and identified system matrices

\hat{A}, \hat{B}

.

An average encounter wave period

\hat{T} = 1.64 s

is calculated as the average encounter wave period of the measured

η_{fwd}^{k n}

signal in the three EFD runs with irregular waves:

\hat{T} = \frac{1}{n_{EFD}} \sum_{i = 1}^{n_{EFD}} \frac{N z_{u p_{i}}}{T_{i}} .

(40)

For processing ease with DMD, data were downsampled to 32 time steps per

\hat{T}

,

Δ t = 0.051 s

. This reduced the size of the data matrices to be handled while preserving sufficient temporal resolution to retain the fidelity of the original signal.

Leveraging results from previous studies by the authors [30,31,32], reasonable values for the HDMDc hyperparameters for deterministic analysis are defined:

l_{t r}^{d e t} = 20 \hat{T}

,

l_{d_{x}}^{d e t} = \hat{T}

,

l_{d_{u}}^{d e t} = 5 \hat{T}

and

λ^{d e t} = 100

, so that

m^{d e t} = 640

,

s^{d e t} = 32

, and

z^{d e t} = 160

.

The ranges of variation for the uniform hyperparameter distributions in the Bayesian analysis were obtained considering a ±50% interval centered on the deterministic values, defining the respective priors:

l_{t r}^{b a y}

∼

U (10 \hat{T}, 30 \hat{T})

,

l_{d_{x}}^{b a y}

∼

U (0.5 \hat{T}, 1.51 \hat{T})

,

l_{d_{u}}^{b a y}

∼

U (2.5 \hat{T}, 7.5 \hat{T})

, and

λ^{b a y}

∼

U (50, 150)

. The Bayesian predictions are obtained using 100 Monte Carlo realizations of the hyperparameters. s, z, and m are taken as the nearest integers from the calculated values.

All the analyses are based on normalized data using the Z-score standardization: the mean of the time series of each variable is removed and the amplitude is scaled with the standard deviation evaluated on the training signal.

The data from each experimental run in irregular waves was split into training (the first half) and testing data (the second half). Data from regular wave tests are used completely as testing.

6. Results and Discussion

Results are structured in two parts, namely irregular-to-irregular (Figure 5, Figure 6, Figure 7 and Figure 8 in Section 6.1) and irregular-to-regular (Figure 9, Figure 10, Figure 11 and Figure 12 in Section 6.2), reflecting the two testing conditions.

Figure 5. ANRMSE (a) and NAMMAE (b) box–violin plots. Comparison between deterministic and Bayesian predictions over the test sequences.

Figure 6. Standardized time series prediction by deterministic and Bayesian Hankel-DMDc. Irregular test wave, selected sequence 1.

Figure 7. Standardized time series prediction by deterministic and Bayesian Hankel-DMDc. Irregular test wave, selected sequence 2.

Figure 8. PDF comparison between measured data and Bayesian Hankel-DMDc prediction on bootstrapped sequences for each variable. Shaded areas indicate the 95% confidence interval of the two PDFs.

Figure 9. ANRMSE (a) and NAMMAE (b) box–violin plots. Comparison between deterministic and Bayesian predictions over the regular waves test sequences.

Figure 10. Standardized time series prediction by deterministic and Bayesian Hankel-DMDc. Regular test wave

λ_{w} / L_{m} = 1.5

. Note that

\hat{T}

is evaluated from irregular waves.

Figure 11. Standardized time series prediction by deterministic and Bayesian Hankel-DMDc. Regular test wave

λ_{w} / L_{m} = 2

. Note that

\hat{T}

is evaluated from irregular waves.

Figure 12. Standardized time series prediction by deterministic and Bayesian Hankel-DMDc. Regular test wave

λ_{w} / L_{m} = 3

. Note that

\hat{T}

is evaluated from irregular waves.

In order to statistically assess the performance of HDMDc and BHDMDc, the ANRMSE and NAMMAE were evaluated for several training and testing sequences. In particular, 10 training sequences and 10 test sequences were randomly selected and combined in a full-factorial analysis. The same training and test sequences were used for the deterministic and Bayesian versions of the ROM to guarantee a fair comparison. The length of the test time histories for the irregular waves case was

l_{t e} = 15 \hat{T}

. Due to reduced signal lengths,

l_{t e} = 5 \hat{T}

for regular waves.

For both irregular-to-irregular and irregular-to-regular cases, results are presented as box–violin plots comparing the ANRMSE and NAMMAE for the deterministic and Bayesian ROMs, see Figure 5 and Figure 9, respectively.

The boxes show the first, second (equivalent to the median value), and third quartiles, while the whiskers extend from the box to the farthest data point lying within 1.5 times the interquartile range, defined as the difference between the third and the first quartiles from the box. The density of the data distribution is additionally represented by the violin shape, providing a visual summary of the distribution beyond the quartiles. The values of single realizations, including outliers, are plotted with dots.

In addition, representative test sequences are shown in Figure 6 and Figure 7 for irregular-to-irregular and in Figure 10, Figure 11 and Figure 12 for irregular-to-regular cases. Figures compare the experimental measurements (EFD), the deterministic ROM prediction, and the Bayesian ROM prediction (mean value as solid line and 95% confidence interval as shaded area of the same color).

Finally, the MBB analysis is applied to the irregular-to-irregular case, and PDFs from bootstrapped sequences for EFD and BHDMDc are obtained for each predicted variable, along with their 95% confidence interval, and are shown in Figure 8. The differences between the EFD and BHDMDc distributions are evaluated using JSD, and results are summarized in Table 2.

Table 2. EV and 95% confidence lower bound, upper bound, and interval of JSD of the PDFs of variables in the bootstrapped time series.

6.1. Irregular-to-Irregular

The HDMDc and the BHDMDc ROMs produce an overall reliable and robust prediction for unseen irregular input waves. No trend in

ε (t)

was observed throughout the testing window, suggesting that the same level of accuracy shown in

t \leq 15 \hat{T}

can be achieved for arbitrarily long sequences.

Observing the time-resolved predictions in Figure 6 and Figure 7 evidences that the great majority of the ANRMSE and NAMMAE errors arise from the prediction of the mooring forces. This is also confirmed by Table 2 showing that the JSD for the

M_{b o w}

e

M_{s t e r n}

PDFs is an order of magnitude higher than the other variables. Comparing the box–violin plot in Figure 5, it can be noted that the BHDMDc model reduced both ANRMSE and NAMMAE errors, lowering the average error and also reducing the results dispersion.

The uncertainty in the Bayesian model is very low, as can be observed in Figure 6 and Figure 7. This is consistent with the high accuracy achieved for vessel dynamics predictions and indicates the robustness of the model to hyperparameter changes in the identified ranges (confirming the rule of thumbs for the hyperparameter values identified in the literature [31,32]). However, confidence intervals are not sufficiently extended to cover the ground truth for mooring loads, whose predictions are less accurate and larger uncertainties would have been expected.

The remarkable accuracy obtained for the ship dynamics is also reflected in the MBB analysis, as the PDFs from EFD and BHDMDc data are very close, and their confidence intervals are almost always overlapping. The MBB analysis also confirms a reduced accuracy in the estimation of mooring forces.

The difficulty in obtaining accurate predictions of mooring forces is due to a combination of measurement noise and artifacts arising from the physical behavior of the mooring lines. Specifically, the mooring lines rested at the water surface, alternately emerging or being submerged by waves (particularly the bow mooring line), which induced sudden variations in tension that do not necessarily reflect the system dynamics. In addition, insufficient pretensioning, mainly due to an initial underestimation of the ASV mass and wave forces, caused the lines to be slack at times, further contributing to spurious oscillations in both the mooring lines and the measured loads. These factors complicated the learning of the system response by the DMD-based system identification, with a consequent increased error in the prediction of the mooring forces.

Nevertheless, the DMD-based models capture the low-frequency content of the loads, as can be seen in Figure 6 and Figure 7, achieving a fair estimation of load peaks. Regularization plays a key role, preventing the identification of spurious and unstable dynamics, to which the method is particularly sensitive in noisy data environments.

6.2. Irregular-to-Regular

The ROM learned in irregular wave conditions is tested in regular waves. Figure 10, Figure 11 and Figure 12 show the results of a selected test sequence for

λ_{w} = 1.5 L_{m}

,

2 L_{m}

, and

λ_{w} = 3 L_{m}

, respectively. The DMD-based models achieve remarkable accuracy also in these cases for the variables linked to the ship’s dynamics, with the largest error being in the reproduction of mooring forces. Part of the error clearly arises from an imperfect identification of the load-related subsystem, already noted in the irregular waves data results. In addition, it has been noted that the issue evidenced for the measure of mooring forces was even more prominent in the regular waves case. During the regular wave tests, the model was more effectively pushed downstream by the regular waves from excitations with shorter wavelength, amplifying slackness of the stern line (loads on the stern mooring are consequently better predicted for higher

λ_{w}

).

The box–violin plots showing the statistical analysis of ANRMSE and NAMMAE results are presented in Figure 9. It can be noted that the ANRMSE is slightly higher than for irregular-to-irregular wave predictions; however, the NAMMAE is lower. This is coherent with a more pronounced phase error in the prediction of

M_{bow}

and

M_{stern}

, while the models still provide a reasonable prediction of the loads’ extrema.

The ROMs trained on P-M irregular waves were able to predict vessel responses to regular head waves accurately. On one side, the regular wave frequencies lie in the decreasing side of the spectral peak of the training sea state

f (λ_{w} = 1.5 L_{m}) = 0.86

Hz,

f (λ_{w} = 2 L_{m}) = 0.75

Hz, and

f (λ_{w} = 3 L_{m}) = 0.61

Hz, as can be seen from Figure 2. However, irregular and regular wave excitations have substantial differences in terms of frequency content and phase coherence. The successful prediction of regular-wave responses is, hence, an indication that the models have generalized beyond the specific type of excitation seen during training, at least when the new excitation remains within the spectral band covered during training. This highlights the importance of selecting informative training datasets, i.e., constructing and using training datasets that encompass a sufficiently rich representation of the system dynamics, enabling the development of ROMs capable of generalizing across multiple operating conditions. In this sense, irregular sea states appear promising due to their broadband spectral content, inherently providing a more comprehensive excitation of the system.

7. Conclusions

This work presented the system identification of a small ASV in moored conditions using the HDMDc and its uncertainty-aware extension, BHDMDc. The methods were applied to experimental data collected in the CNR-INM towing tank under both irregular and regular head wave conditions. The ASV under investigation features a recessed moon pool, which induces nonlinear responses due to sloshing, thereby increasing the modeling challenge.

The deterministic HDMDc formulation successfully captured the dominant dynamics of the vessel motions, providing accurate predictions of surge, heave, and pitch across all tested conditions. The prediction of mooring forces was more challenging and exhibited larger errors, mainly due to measurement noise and extremely nonlinear effects related to the partial slackening and intermittent immersion of the mooring lines. Nevertheless, the models were able to reproduce the low-frequency components and general trends of the mooring loads, offering a meaningful approximation of their temporal evolution.

The Bayesian extension introduced uncertainty quantification by propagating the variability of the HDMDc hyperparameters through Monte Carlo sampling. The Bayesian model consistently improved prediction accuracy compared to the deterministic version, also improving the robustness as measured by ANRMSE and NAMMAE.

A key outcome of this study is the generalization capability of HDMDc and BHDMDc models trained exclusively on irregular wave data. The trained models were able to accurately predict the ASV response in regular wave conditions, a different excitation regime although still characterized by frequencies within the spectral range of the training sea state. This highlights the importance of constructing informative training datasets that encompass a sufficiently rich system dynamics. In particular, irregular sea states, due to their broadband spectral content, inherently provide a more comprehensive excitation of the system, enabling the development of ROMs capable of generalizing across multiple operating conditions within the same frequency band. Further extending the approach to include, e.g., different sea states, wave directions, and operating conditions, would represent a key enabling technology for digital twins of marine systems, where reliable real-time predictions and uncertainty quantification are essential for control, monitoring, and decision support applications. In this context, the interpolation of parametric reduced-order models [52,53] may represent a viable strategy to overcome the intrinsic limitations of using a single, albeit informative, training set, by combining multiple locally trained models to construct a global predictive model that retains accuracy across a broader range of operating conditions.

Author Contributions

Conceptualization, G.P. and M.D.; methodology, G.P., A.S. and M.D.; software, G.P.; validation, G.P.; formal analysis, G.P.; investigation, G.P., I.S. and M.D.; resources, I.S., L.M. and M.D.; data curation, G.P. and I.S.; writing—original draft preparation, G.P. and I.S.; writing—review and editing, A.S. and M.D.; visualization, G.P.; supervision, M.D.; project administration, M.D.; funding acquisition, M.D. All authors have read and agreed to the published version of this manuscript.

Funding

This research was funded by the Italian Ministry of University and Research through the National Recovery and Resilience Plan (PNRR), CN00000023–CUP B43C22000440001, “Sustainable Mobility Center” (CNMS), Spoke 3 “Waterways”. The authors are also grateful to the US Office of Naval Research for its support in the methodological development through NICOP Grants N62909-21-1-2042 and N62909-24-1-2102.

Data Availability Statement

The original data presented in this study are openly available at https://github.com/MAORG-CNR-INM/ck14-e_experimental-data/tree/main (accessed on 20 November 2025) and https://zenodo.org/records/17315035 (accessed on 20 November 2025).

Acknowledgments

The authors gratefully acknowledge the staff of the CNR-INM towing tank for their valuable support during the experimental campaign. In particular, we thank Luigi D’Addeo, Stefano Dalla Torre, Matteo Dell’Abate di Fabio, Stefano Giudici, Massimo Guerra, Alessandro Piazza, Roberto Zagaglia, Fabio Passacantilli, and Luigi Fabbri for their assistance in instrumentation calibration, test-rig setup, and execution.

Conflicts of Interest

Author Lorenzo Minno was employed by the company Codevintec S.r.l. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as potential conflicts of interest.

Nomenclature

The following abbreviations and symbols are used in this manuscript:

Abbreviations and symbols
ANRMSE	Average Normalized Root Mean Square Error
ASV	Autonomous Surface Vehicle
BHDMDc	Bayesian Hankel Dynamic Mode Decomposition with Control
BiLSTM	Bidirectional Long short-term memory
CFD	Computational Fluid Dynamics
CNR-INM	Consiglio Nazionale delle Ricerche-Istituto di Ingegneria del Mare
D	Kullback-Lieber Divergence
DMD	Dynamic Mode Decomposition
DMDc	Dynamic Mode Decomposition with Control
EFD	Experimental Fluid Dynamics
EV	Expected Value
HDMD	Hankel Dynamic Mode Decomposition
HDMDc	Hankel Dynamic Mode Decomposition with Control
IMO	International Maritime Organization
IQR	Interquartile range
JSD	Jensen-Shannon Divergence
LSTM	Long short-term memory
MBB	Moving block bootstrap
MMG	Maneuvering Modeling Group
MRU	Motion Reference Unit
NAMMAE	Normalized Average Minimum/Maximum Absolute Error
P-M	Pierson-Moskovitz
PDF	Probability Density Function
RAO	Response Amplitude Operator
RGB	Red Green Blue
RNN	Recurrent Neural Network
ROM	Reduced Order Model
SD	Standard Deviation
STANAG	Guidelines and NATO Standardization Agreements
U	Uncertainty
URANS	Unsteady Raynolds Averaged Navier Stokes
$A$ , $B$	state and input matrices
$\hat{A}$ , $\hat{B}$	Extended state and input matrices in HDMDc
$I$	Identity matrix
$U$ , $Z$	Snapshot matrices for input and shifted input
$X$ , $S$ , $X^{'}$ , $S^{'}$	Snapshot matrices for state and shifted state
${\hat{X}}^{'}$ , $\hat{Y}$ ,	Snapshot matrices for extended state and aggregated state-input
$x$ , $u$	State and input vector
$\hat{x}$ , $\hat{u}$	Extended state and input vector
c	Effective wavephase velocity
f	Frequency
$f_{p b}$ , $f_{s b}$	Passband and stopband frequencies of filters
$f_{s}$	Sampling frequency
h	Wave height
$h_{s}$ , ${\hat{h}}_{s}$	Nominal and experimental significant wave height
$k_{bow}$	Elastic constants of bow mooring line
$k_{stern}$	Elastic constants of stern mooring line
$l_{t e}$	Test signal length in seconds
$l_{d_{u}}$ , $l_{d_{u}}^{\det, bay}$	Maximum input delay in seconds, deterministic (det) and Bayesian (bay) hyperparameter
$l_{d_{x}}$ , $l_{d_{x}}^{\det, bay}$	Maximum state delay in seconds, deterministic (det) and Bayesian (bay) hyperparameter
$l_{t r}$ , $l_{t r}^{\det, bay}$	Training signal length in seconds, deterministic (det) and Bayesian (bay) hyperparameter
$L_{m}$	Model length
$λ$	Regularization parameter in Tikhonov-regularized DMD
	(hyperparameter)
$λ^{\det, bay}$	Regularization parameter in Tikhonov-regularized DMD for
	deterministic (det) or Bayesian (bay) formulations (hyperparameter)
$λ_{w}$	Wave wavelength
m, $m^{\det, bay}$	Number of training snapshots, deterministic (det) and Bayesian (bay) hyperparameter
$M_{bow}$ , $M_{stern}$	Bow and stern mooring loads
$N_{w}$	Number of wave components
N	Number of state variables
$η$	Wave elevation
$η_{bow, fwd}^{us, kn}$	Wave elevation measured by the ultrasonic (us) or Kenek (kn) probes
	at bow or fwd positions
$q_{ξ} (a)$	Quantile function for the generic variable $ξ$ at probability a
Q	Number of input variables
s, $s^{\det, bay}$	Number of added delayed states, deterministic (det) and Bayesian (bay) hyperparameter
t	time
$T_{p}$ , ${\hat{T}}_{p}$	Peak period and average peak period
$\hat{T}$	Average encounter wave period
$v_{g r o u p}$	Group velocity
$v_{p h a s e}$	Phase velocity
x, y, z, $\dot{x}$ , $\dot{y}$ , $\dot{z}$ , $\ddot{x}$ , $\ddot{y}$ , $\ddot{z}$	Surge, sway, andheave, corresponding rates and accelerations
z, $z^{\det, bay}$	Number of added delayed inputs, deterministic (det) and Bayesian (bay) hyperparameter
$ϕ$ , $θ$ , $ψ$ , $\dot{ϕ}$ , $\dot{θ}$ , $\dot{ψ}$ , $\ddot{ϕ}$ , $\ddot{θ}$ , $\ddot{ψ}$	Roll, pitch, and yaw angles, corresponding rates and accelerations
$ξ$	Generic variable

References

Serani, A.; Diez, M.; van Walree, F.; Stern, F. URANS analysis of a free-running destroyer sailing in irregular stern-quartering waves at sea state 7. Ocean Eng. 2021, 237, 109600. [Google Scholar] [CrossRef]
Aram, S.; Mucha, P. CFD validation and analysis of turning maneuvers of a surface combatant in regular waves. Ocean Eng. 2024, 293, 116653. [Google Scholar] [CrossRef]
Xu, W.; Maki, K.J.; Silva, K.M. A data-driven model for nonlinear marine dynamics. Ocean Eng. 2021, 236, 109469. [Google Scholar] [CrossRef]
Yasukawa, H.; Yoshimura, Y. Introduction of MMG standard method for ship maneuvering predictions. J. Mar. Sci. Technol. 2015, 20, 37–52. [Google Scholar] [CrossRef]
Yasukawa, H.; Hirata, N.; Nakayama, Y. High-Speed Ship Maneuverability. J. Ship Res. 2016, 60, 239–258. [Google Scholar] [CrossRef]
Sanada, Y.; Park, S.; Kim, D.H.; Wang, Z.; Stern, F.; Yasukawa, H. Experimental and computational study of hull–propeller–rudder interaction for steady turning circles. Phys. Fluids 2021, 33, 127117. [Google Scholar] [CrossRef]
Pandey, J.; Hasegawa, K. Manoeuvring mathematical model of catamaran wave adaptive modular vessel (WAM-V) using the system identification technique. In Proceedings of the 7th PAAMES and AMEC2016, Hong Kong, China, 13–14 October 2016; Volume 13, p. 14. [Google Scholar]
Pandey, J.; Hasegawa, K. Study on Turning Manoeuvre of Catamaran Surface Vessel with a Combined Experimental and Simulation Method. In Proceedings of the 10th IFAC Conference on Control Applications in Marine Systems CAMS, Trondheim, Norway, 13–16 September 2016; Volume 49, pp. 446–451. [Google Scholar] [CrossRef]
Diez, M.; Wang, Z.; Park, S.; Milano, C.; Stern, F.; Yasukawa, H.; Gunderson, A.; Scherer, J. Multi-Fidelity MMG-Model for Digital Design of High-Speed Small Craft. In Proceedings of the SNAME Chesapeake Power Boat Symposium, Norfolk, VA, USA, 14 October 2024; p. D011S002R003. [Google Scholar] [CrossRef]
D’Agostino, D.; Serani, A.; Stern, F.; Diez, M. Time-series forecasting for ships maneuvering in waves via recurrent-type neural networks. J. Ocean Eng. Mar. Energy 2022, 8, 479–487. [Google Scholar] [CrossRef]
Wang, N.; Kong, X.; Ren, B.; Hao, L.; Han, B. SeaBil: Self-attention-weighted ultrashort-term deep learning prediction of ship maneuvering motion. Ocean Eng. 2023, 287, 115890. [Google Scholar] [CrossRef]
Jiang, Z.; Ma, Y.; Li, W. A Data-Driven Method for Ship Motion Forecast. J. Mar. Sci. Eng. 2024, 12, 291. [Google Scholar] [CrossRef]
Diez, M.; Gaggero, M.; Serani, A. Data-driven forecasting of ship motions in waves using machine learning and dynamic mode decomposition. Int. J. Adapt. Control Signal Process. 2024, 39, 2119–2142. [Google Scholar] [CrossRef]
Schmid, P.J. Dynamic mode decomposition of numerical and experimental data. J. Fluid Mech. 2010, 656, 5–28. [Google Scholar] [CrossRef]
Kutz, J.; Brunton, S.; Brunton, B.; Proctor, J. Dynamic Mode Decomposition: Data-Driven Modeling of Complex Systems; SIAM—Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 2016. [Google Scholar] [CrossRef]
Mezić, I. Koopman Operator, Geometry, and Learning of Dynamical Systems. Not. Am. Math. Soc. 2021, 68, 1087–1105. [Google Scholar] [CrossRef]
Brunton, S.L.; Budišić, M.; Kaiser, E.; Kutz, J.N. Modern Koopman Theory for Dynamical Systems. SIAM Rev. 2022, 64, 229–340. [Google Scholar] [CrossRef]
Koopman, B.O. Hamiltonian Systems and Transformation in Hilbert Space. Proc. Natl. Acad. Sci. USA 1931, 17, 315–318. [Google Scholar] [CrossRef]
Proctor, J.L.; Brunton, S.L.; Kutz, J.N. Generalizing Koopman Theory to Allow for Inputs and Control. SIAM J. Appl. Dyn. Syst. 2018, 17, 909–930. [Google Scholar] [CrossRef]
Rowley, C.W.; Mezić, I.; Bagheri, S.; Schlatter, P.; Hennigson, D.S. Spectral analysis of nonlinear flows. J. Fluid Mech. 2009, 641, 115–127. [Google Scholar] [CrossRef]
Tang, Z.; Jiang, N. Dynamic mode decomposition of hairpin vortices generated by a hemisphere protuberance. Sci. China Phys. Mech. Astron. 2012, 55, 118–124. [Google Scholar] [CrossRef]
Semeraro, O.; Bellani, G.; Lundell, F. Analysis of time-resolved PIV measurements of a confined turbulent jet using POD and Koopman modes. Exp. Fluids 2012, 53, 1203–1220. [Google Scholar] [CrossRef]
Song, G.; Alizard, F.; Robinet, J.C.; Gloerfelt, X. Global and Koopman modes analysis of sound generation in mixing layers. Phys. Fluids 2013, 25, 124101. [Google Scholar] [CrossRef]
Proctor, J.L.; Eckhoff, P.A. Discovering dynamic patterns from infectious disease data using dynamic mode decomposition. Int. Health 2015, 7, 139–145. [Google Scholar] [CrossRef]
Brunton, B.W.; Johnson, L.A.; Ojemann, J.G.; Kutz, J.N. Extracting spatial–temporal coherent patterns in large-scale neural recordings using dynamic mode decomposition. J. Neurosci. Methods 2016, 258, 1–15. [Google Scholar] [CrossRef]
Mann, J.; Kutz, J.N. Dynamic mode decomposition for financial trading strategies. Quant. Financ. 2016, 16, 1643–1655. [Google Scholar] [CrossRef]
Diez, M.; Serani, A.; Campana, E.F.; Stern, F. Time-series forecasting of ships maneuvering in waves via dynamic mode decomposition. J. Ocean Eng. Mar. Energy 2022, 8, 471–478. [Google Scholar] [CrossRef]
Serani, A.; Dragone, P.; Stern, F.; Diez, M. On the use of dynamic mode decomposition for time-series forecasting of ships operating in waves. Ocean Eng. 2023, 267, 113235. [Google Scholar] [CrossRef]
Palma, G.; Serani, A.; Aram, S.; Wundrow, D.W.; Drazen, D.; Diez, M. Bayesian dynamic mode decomposition for real-time ship motion digital twinning. Appl. Ocean Res. 2025, 165, 104863. [Google Scholar] [CrossRef]
Palma, G.; Serani, A.; Aram, S.; Wundrow, D.W.; Drazen, D.; Diez, M. Model-free system identification of surface ships in waves via Hankel dynamic mode decomposition with control. Ocean Eng. 2025, 341, 122539. [Google Scholar] [CrossRef]
Palma, G.; Serani, A.; Diez, M.; Milano, C.; Wang, Z.; Park, S.; Sarigul, D.O.; Stern, F. Bayesian Hankel Extended Dynamic Mode Decomposition for System Identification of High-Speed Planing Hulls. In Proceedings of the International Conference on Fast Sea Technology (FAST), Norfolk, VA, USA, 29 October 2025. [Google Scholar]
Palma, G.; Serani, A.; Diez, M. Data-driven uncertainty-aware seakeeping prediction of the Delft 372 catamaran using ensemble Hankel dynamic mode decomposition. J. Hydrodyn. 2025; under review. [Google Scholar]
Takeishi, N.; Kawahara, Y.; Tabei, Y.; Yairi, T. Bayesian Dynamic Mode Decomposition. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, IJCAI-17, Melbourne, Australia, 19–25 August 2017; pp. 2814–2821. [Google Scholar] [CrossRef]
Sashidhar, D.; Kutz, J.N. Bagging, optimized dynamic mode decomposition for robust, stable forecasting with spatial and temporal uncertainty quantification. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 2022, 380, 20210199. [Google Scholar] [CrossRef] [PubMed]
Carter, D.J.T. Estimation of Wave Spectra from Wave Height and Period; Technical Report I.O.S. 135; Institute of Oceanographic Sciences: Wormley, UK, 1982. [Google Scholar]
Schmid, P.; Sesterhenn, J. Dynamic Mode Decomposition of Numerical and Experimental Data. In Proceedings of the Sixty-First Annual Meeting of the APS Division of Fluid Dynamics, San Antonio, TX, USA, 23–25 November 2008. [Google Scholar]
Proctor, J.L.; Brunton, S.L.; Kutz, J.N. Dynamic mode decomposition with control. SIAM J. Appl. Dyn. Syst. 2016, 15, 142–161. [Google Scholar] [CrossRef]
Mezić, I. On Numerical Approximations of the Koopman Operator. Mathematics 2022, 10, 1180. [Google Scholar] [CrossRef]
Otto, S.E.; Rowley, C.W. Linearly Recurrent Autoencoder Networks for Learning Dynamics. SIAM J. Appl. Dyn. Syst. 2019, 18, 558–593. [Google Scholar] [CrossRef]
Takeishi, N.; Kawahara, Y.; Yairi, T. Learning Koopman invariant subspaces for dynamic mode decomposition. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Red Hook, NY, USA, 4–9 December 2017; NIPS’17. pp. 1130–1140. [Google Scholar]
Lusch, B.; Kutz, J.N.; Brunton, S.L. Deep learning for universal linear embeddings of nonlinear dynamics. Nat. Commun. 2018, 9, 4950. [Google Scholar] [CrossRef]
Brunton, S.L.; Brunton, B.W.; Proctor, J.L.; Kutz, J.N. Koopman Invariant Subspaces and Finite Linear Representations of Nonlinear Dynamical Systems for Control. PLoS ONE 2016, 11, e0150171. [Google Scholar] [CrossRef] [PubMed]
Arbabi, H.; Mezić, I. Ergodic Theory, Dynamic Mode Decomposition, and Computation of Spectral Properties of the Koopman Operator. SIAM J. Appl. Dyn. Syst. 2017, 16, 2096–2126. [Google Scholar] [CrossRef]
Brunton, S.L.; Brunton, B.W.; Proctor, J.L.; Kaiser, E.; Kutz, J.N. Chaos as an intermittently forced linear system. Nat. Commun. 2017, 8, 19. [Google Scholar] [CrossRef]
Xie, X.; Tang, S. Regularized dynamic mode decomposition algorithm for time sequence predictions. Theor. Appl. Mech. Lett. 2024, 14, 100555. [Google Scholar] [CrossRef]
Marlantes, K.E.; Bandyk, P.J.; Maki, K.J. Predicting ship responses in different seaways using a generalizable force correcting machine learning method. Ocean Eng. 2024, 312, 119110. [Google Scholar] [CrossRef]
Carlstein, E. The use of subseries values for estimating the variance of a general statistic from a stationary sequence. Ann. Stat. 1986, 14, 1171–1179. [Google Scholar] [CrossRef]
Miecznikowski, J.C.; Wang, D.; Hutson, A. Bootstrap MISE Estimators to Obtain Bandwidth for Kernel Density Estimation. Commun. Stat. Simul. Comput. 2010, 39, 1455–1469. [Google Scholar] [CrossRef]
Silverman, B.W. Density Estimation for Statistics and Data Analysis; Routledge: London, UK, 2018. [Google Scholar] [CrossRef]
Lin, J. Divergence measures based on the Shannon entropy. IEEE Trans. Inf. Theory 1991, 37, 145–151. [Google Scholar] [CrossRef]
Kullback, S.; Leibler, R.A. On Information and Sufficiency. Ann. Math. Stat. 1951, 22, 79–86. [Google Scholar] [CrossRef]
Amsallem, D.; Farhat, C. Interpolation Method for Adapting Reduced-Order Models and Application to Aeroelasticity. AIAA J. 2008, 46, 1803–1813. [Google Scholar] [CrossRef]
Amsallem, D.; Farhat, C. An Online Method for Interpolating Linear Parametric Reduced-Order Models. SIAM J. Sci. Comput. 2011, 33, 2169–2198. [Google Scholar] [CrossRef]

Figure 1. Picture of the CK-14e ASV (a); the top cover is removed, showing the internal configuration and the location of the moon pool, visible from the lid and its fastening system. Towing tank test rig for CK-14e in moored conditions, the white panel used in tracking is visible on the model top cover (b).

Figure 2. Comparison between nominal Pierson–Moskowitz from Equation (2) and experimental power spectral density of wave elevation obtained from irregular wave test. Vertical lines show the frequencies of regular waves tests.

Figure 3. Sketch of the wave probes positions (a), and wave probes location from wave generator flap (b).

Figure 4. Response amplitude operators for surge, heave, and pitch, evaluated from irregular (solid blue line) and regular (black circles) wave responses. The dash-dotted vertical line highlights the peak frequency of the P-M EFD spectrum.

Figure 5. ANRMSE (a) and NAMMAE (b) box–violin plots. Comparison between deterministic and Bayesian predictions over the test sequences.

Figure 6. Standardized time series prediction by deterministic and Bayesian Hankel-DMDc. Irregular test wave, selected sequence 1.

Figure 7. Standardized time series prediction by deterministic and Bayesian Hankel-DMDc. Irregular test wave, selected sequence 2.

Figure 8. PDF comparison between measured data and Bayesian Hankel-DMDc prediction on bootstrapped sequences for each variable. Shaded areas indicate the 95% confidence interval of the two PDFs.

Figure 9. ANRMSE (a) and NAMMAE (b) box–violin plots. Comparison between deterministic and Bayesian predictions over the regular waves test sequences.

Figure 10. Standardized time series prediction by deterministic and Bayesian Hankel-DMDc. Regular test wave

λ_{w} / L_{m} = 1.5

. Note that

\hat{T}

is evaluated from irregular waves.

Figure 11. Standardized time series prediction by deterministic and Bayesian Hankel-DMDc. Regular test wave

λ_{w} / L_{m} = 2

. Note that

\hat{T}

is evaluated from irregular waves.

Figure 12. Standardized time series prediction by deterministic and Bayesian Hankel-DMDc. Regular test wave

λ_{w} / L_{m} = 3

. Note that

\hat{T}

is evaluated from irregular waves.

Table 1. Test matrix; regular and irregular wave conditions.

$h_{s}$ [m]	$λ_{w} / L_{m}$ [-]	$v_{phase}$ [m/s]	$v_{group}$ [m/s]	# of Performed Tests
0.15	Pierson-Moskowitz spectrum			3
0.10	1.5	1.811	0.905	1
0.10	2	2.090	1.045	1
0.10	3	2.560	1.280	1

Table 2. EV and 95% confidence lower bound, upper bound, and interval of JSD of the PDFs of variables in the bootstrapped time series.

JSD_ξ
Variable	EV	q = 0.025	q = 0.975	U
x	0.0026	0.0010	0.0061	0.0052
z	0.0031	0.0013	0.0061	0.0049
$θ$	0.0035	0.0017	0.0076	0.0059
$\dot{x}$	0.0021	0.0007	0.0063	0.0057
$\dot{z}$	0.0036	0.0015	0.0075	0.0060
$\dot{θ}$	0.0041	0.0011	0.0106	0.0095
$\ddot{x}$	0.0045	0.0028	0.0073	0.0045
$\ddot{z}$	0.0071	0.0030	0.0140	0.0110
$\ddot{θ}$	0.0086	0.0028	0.0181	0.0153
$M_{bow}$	0.0186	0.0099	0.0302	0.0203
$M_{stern}$	0.0274	0.0185	0.0387	0.0202
avg	0.0077	0.0040	0.0139	0.0099

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

System Identification of a Moored ASV with Recessed Moon Pool via Deterministic and Bayesian Hankel-DMDc

Abstract

1. Introduction

2. Experimental Setup

2.1. Facility and Model

2.2. Moored Tests

2.2.1. Irregular Waves

2.2.2. Regular Waves

2.3. Instrumentation

2.3.1. Wave Elevation Measurements

2.3.2. Motion Measurements

2.3.3. Mooring Forces

2.4. Response Amplitude Operators

3. HDMDc

Uncertainty Quantification in HDMDc Through Ensembling

4. Statistical Variables and Performance Metrics

5. System Identification Setup

6. Results and Discussion

6.1. Irregular-to-Irregular

6.2. Irregular-to-Regular

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Nomenclature

References

Article Metrics

Citations

Article Access Statistics