1. Introduction
Maritime transportation faces escalating demands for autonomous navigation and intelligent decision making in complex marine environments. Ship maneuvering systems, which are critical for collision avoidance, route optimization, and energy efficiency, heavily rely on precise state estimation amidst severe nonlinearity (e.g., hydrodynamic forces), environmental disturbances (e.g., wind/wave-induced motion), and sensor noise [
1]. Traditional filtering approaches, such as the RTS smoother [
2], often fail under heavy-tailed noise conditions prevalent in marine sensing (e.g., GPS jamming or sensor saturation), leading to biased trajectory reconstructions and compromised control performance [
3]. For instance, inaccurate state estimation in dynamic positioning systems (DPSs) can increase fuel consumption by up to 15% and elevate accident risks in confined waterways [
4]. Recent studies highlight that Student-t noise models outperform Gaussian frameworks in handling such outliers, yet real-time adaptive calibration of noise parameters remains challenging due to computational burdens and sensitivity to initialization.
To address these gaps, this work introduces an online smoothing algorithm integrating expectation maximization (EM) for adaptive noise learning. Ship motion control is inherently challenging due to sensor noise, which degrades the accuracy of state estimation and control performance [
5]. This study focuses on the critical role of smooth data processing in ship motion identification and control, where “smooth” refers to a noise-filtered dataset derived from high-frequency measurements. By applying advanced filtering techniques (e.g., extended Kalman filter [
6], sliding-window least squares [
7], or deep learning-based denoising [
8]), we obtain a temporally consistent and physically meaningful dataset that mitigates the effects of measurement noise and transient errors. Reducing errors from measurement noise allows for more accurate vessel control and helps prevent accidents by enabling prompt responses to sudden situations [
9]. Additionally, smoothed data processing optimizes navigational efficiency, minimizing unnecessary acceleration and deceleration, which leads to fuel savings and shorter travel times.
Most Kalman smoothers in ship applications focus on estimating the trajectory or ship motion [
10], while this paper uses a smoothing filter for the online collection of ship maneuvering data. When data dropouts and noise occur during collection, filtering alone is not enough to ensure the accuracy of the identification process. The variational Bayesian method proposed in [
11] addresses real-time, low-frequency motion-state estimation for DP ships under time-varying environmental disturbances by integrating variational Bayesian inference with multiple fading factors. While it overcomes the limitations of conventional smoothing algorithms in online noise adaptation and historical data re-optimization, its computational efficiency remains inferior to that of our RTS-based post-processing approach. Ma et al. [
12] focused only on measuring ship heave motion using traditional high-pass filters, while our paper explores a wider range of application scenarios. Smoothing, as an inference problem, requires prediction over the whole dataset, while non-smoothing methods only perform a forward evaluation [
4]. To address the issue of identifying ship motion parameters and wave peak frequency, Liu et al. [
13] developed a filtering-based stochastic gradient algorithm by applying filtering techniques and an auxiliary identification model, achieving limited effectiveness in practical marine environments dominated by non-stationary noise. The authors of [
14] proposed an online motion smoothing method for parameter estimation that preprocesses measurement data with EKF + RTS iterations, producing an initial estimate based on semi-empirical formulas and inverse dynamic regression. Conventional filtering methods may produce abrupt jumps when noise falls outside their expected range [
3], whereas smoothing methods can reconstruct the signal over the entire record of trial data.
From an inferential perspective, data smoothing inherently represents a probabilistic inference problem requiring predictive modeling of latent state sequences [
15]. This contrasts with non-smooth methods, which perform forward evaluation without uncertainty quantification [
16]. Notably, smoothed estimates obtained via the Extended Kalman Filter (EKF) exhibit statistically significant accuracy improvements over conventionally filtered outputs (
p < 0.05). This superiority arises from the optimal integration of a posteriori observational data through recursive Bayesian inference [
17]. Online maneuvering data additionally require predictions across all time regions [
18]. The extended Kalman filter (EKF) is inherently recursive and operates online, dynamically updating state estimates as new measurements arrive. By leveraging historical measurements, the EKF predicts near-future states [
19], making it suitable for real-time applications such as autopilot systems or unmanned surface vehicles (USVs). However, a purely forward filter cannot exploit measurements that arrive later. This limitation is resolved when conducting parameter estimation on complete time-series data, where both historical and future measurements are available. Bidirectional temporal information enhances filtering performance by enabling backward smoothing integration. Specifically, an EKF framework can be augmented with an RTS smoother to incorporate future time steps during state estimation [
20]. This hybrid framework synergizes the EKF’s forward propagation with RTS’s backward recursion, achieving optimal smoothing [
21]. Experiments on synthetic datasets with Student-t distributed noise confirm that the method significantly enhances state identifiability, particularly for marine vessel motion parameters, where traditional filters underperform.
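To make the forward/backward structure concrete, the following minimal Python sketch runs a linear Kalman filter followed by an RTS backward pass on a toy constant-velocity model; the model matrices and noise levels are hypothetical, and the routine is not the robust smoother developed in this paper, but it illustrates how the backward recursion refines the forward estimates.

```python
# Minimal sketch (not the paper's full robust smoother): a linear Kalman
# filter forward pass combined with an RTS backward pass on a toy
# constant-velocity model, illustrating how future measurements refine
# past state estimates. Model matrices and noise levels are illustrative.
import numpy as np

rng = np.random.default_rng(0)
dt = 0.1
F = np.array([[1.0, dt], [0.0, 1.0]])      # state transition (position, velocity)
H = np.array([[1.0, 0.0]])                 # observe position only
Q = 0.01 * np.eye(2)                       # process noise covariance
R = np.array([[0.25]])                     # measurement noise covariance

# Simulate a trajectory and noisy measurements.
K = 200
x = np.zeros(2)
xs, zs = [], []
for _ in range(K):
    x = F @ x + rng.multivariate_normal(np.zeros(2), Q)
    zs.append(H @ x + rng.normal(0.0, np.sqrt(R[0, 0]), 1))
    xs.append(x)

# Forward Kalman filter.
m, P = np.zeros(2), np.eye(2)
ms_f, Ps_f, ms_p, Ps_p = [], [], [], []
for z in zs:
    m_pred, P_pred = F @ m, F @ P @ F.T + Q            # prediction
    S = H @ P_pred @ H.T + R
    Kg = P_pred @ H.T @ np.linalg.inv(S)               # Kalman gain
    m = m_pred + Kg @ (z - H @ m_pred)                 # update
    P = P_pred - Kg @ H @ P_pred
    ms_p.append(m_pred); Ps_p.append(P_pred)
    ms_f.append(m); Ps_f.append(P)

# Backward RTS smoother.
ms_s, Ps_s = [ms_f[-1]], [Ps_f[-1]]
for k in range(K - 2, -1, -1):
    G = Ps_f[k] @ F.T @ np.linalg.inv(Ps_p[k + 1])     # smoothing gain
    m_s = ms_f[k] + G @ (ms_s[0] - ms_p[k + 1])
    P_s = Ps_f[k] + G @ (Ps_s[0] - Ps_p[k + 1]) @ G.T
    ms_s.insert(0, m_s); Ps_s.insert(0, P_s)

rmse_f = np.sqrt(np.mean([(m[0] - x[0])**2 for m, x in zip(ms_f, xs)]))
rmse_s = np.sqrt(np.mean([(m[0] - x[0])**2 for m, x in zip(ms_s, xs)]))
print(f"filtered RMSE: {rmse_f:.3f}, smoothed RMSE: {rmse_s:.3f}")
```

On such synthetic data, the smoothed RMSE is typically lower than the filtered RMSE, which is the effect exploited in the remainder of this paper.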
In response to the degradation of state estimation accuracy in marine vessel motion control caused by non-Gaussian sensor noise, environmental disturbances, and hydrodynamic uncertainties, our paper introduces a robust Bayesian smoothing framework integrating the expectation-maximization algorithm for adaptive parameter learning and multi-source sensor data fusion, achieving enhanced motion reconstruction fidelity through iterative temporal–spatial noise suppression and dynamic model calibration. The contributions of this paper are outlined as follows:
We propose an online data smoothing algorithm tailored for ship maneuvering test data, overcoming the limitations of traditional offline batch processing methods.
We pioneer an adaptive noise-statistic inference framework through an alternating optimization of distribution hyperparameters Q and R, enabling dynamic Bayesian updating of noise characteristics while preserving the fidelity of system dynamics in modeling maneuvering data.
We propose a novel framework integrating ship maneuvering system priors with adaptive noise modeling, effectively addressing Student-t noise challenges while outperforming traditional Gaussian smoothing methods in terms of dynamic fidelity.
2. Problem Formulation
In general, a motion equation may be constructed to describe the motion of a ship. The observation equation depends on the sensors used in the measurement—commonly, GPS, long-range radar, etc.
The transformation relationship between the velocity vectors in the space-fixed coordinate system and the body-fixed coordinate system is presented in Figure 1. To facilitate the transition between these two coordinate systems, we established a standardized transformation protocol. This protocol employs a series of mathematical equations that relate the geodetic coordinates to the body-fixed coordinates, accounting for factors such as the Earth's curvature and local topography. The transformation is illustrated in Figure 1, where the axes represent the respective coordinate systems. The dashed line in the figure indicates the directional trajectory of the vessel along the specified orientation.
The ship's kinematics are typically represented by a discrete-time, nonlinear state–space model to enable real-time processing:
$$\mathbf{x}_k = f(\mathbf{x}_{k-1}) + \mathbf{w}_{k-1}, \qquad \mathbf{z}_k = h(\mathbf{x}_k) + \mathbf{v}_k, \tag{1}$$
where the vector $\mathbf{x}_k$ indicates the real-time status of the ship at moment $k$. Its six elements are the longitudinal and lateral positions, the yaw angle, and the surge velocity, sway velocity, and yaw rate of the ship at time $k$ in the coordinate system.
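As an illustrative and deliberately simplified example of such a state vector and transition function, the sketch below propagates a six-element state with a purely kinematic 3-DOF step; the transition function actually used in this work additionally contains hydrodynamic terms that are not reproduced here.

```python
# Hypothetical sketch of the six-element state and a simple 3-DOF kinematic
# propagation step; the paper's actual transition function f(.) includes
# hydrodynamic terms that are not reproduced here.
import numpy as np

def f_kinematic(x, dt=0.1):
    """x = [x_pos, y_pos, psi, u, v, r]: positions, heading, surge/sway speed, yaw rate."""
    x_pos, y_pos, psi, u, v, r = x
    # Rotate body-fixed velocities (u, v) into the space-fixed frame.
    dx = (u * np.cos(psi) - v * np.sin(psi)) * dt
    dy = (u * np.sin(psi) + v * np.cos(psi)) * dt
    dpsi = r * dt
    # Velocities held constant over one step in this simplified model.
    return np.array([x_pos + dx, y_pos + dy, psi + dpsi, u, v, r])

x0 = np.array([0.0, 0.0, 0.1, 5.0, 0.2, 0.01])   # example initial state
print(f_kinematic(x0))
```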
Sensor noise is often influenced by various factors, including environmental interference, equipment aging, and physical vibrations. These factors can lead to noise distributions exhibiting heavy-tailed characteristics, resulting in a higher probability of extreme values (anomalous noise). The changes in the state of the ship caused by the acceleration of the ship and the effects of the ocean current are represented by the process noise $\mathbf{w}_{k-1}$. In order to solve the model, $\mathbf{w}_{k-1}$ is assumed to follow a zero-mean Gaussian distribution with covariance $Q$. In contrast, the heavy-tailed sensor noise is better captured by the Student-t distribution, whose tails are thicker than those of the Gaussian distribution, making it suitable for modeling noise data that do not conform to a normal distribution [22].
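The following short sketch quantifies this heavy-tail effect by comparing two-sided tail probabilities of a Gaussian and a Student-t distribution; the threshold and the degrees of freedom (ν = 3) are chosen purely for illustration.

```python
# Illustration of the heavy tails motivating the Student-t observation noise:
# probability of a deviation larger than 4 units under a standard Gaussian
# versus a Student-t with few degrees of freedom (nu = 3, illustrative).
from scipy import stats

threshold = 4.0
p_gauss = 2 * stats.norm.sf(threshold)          # two-sided tail probability
p_t = 2 * stats.t.sf(threshold, df=3)
print(f"P(|noise| > {threshold}): Gaussian {p_gauss:.2e}, Student-t(3) {p_t:.2e}")
```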
Here, $\mathbf{z}_k$ is the quantity observed by the sensors (DGPS) at moment $k$, including coordinates and velocities, all of which are expressed in SI units. Considering that sensors are frequently noisy, the observation noise is represented in this model as a random variable with a Student-t distribution; the long tails of the Student-t distribution provide a certain robustness to outliers. $\mathbf{v}_k$ is the observation noise, each element of which is independently Student-t distributed, which can be expressed as $v_{k,i} \sim \mathrm{St}(0, r_i, \nu_i)$. Here, $r_i$ is the precision of the distribution, while $\nu_i$ is the degree of freedom.
where $\mathbf{x}_k$ and $\mathbf{z}_k$ denote the state and observation vector of the system at moment $k$, respectively, and $f(\cdot)$ and $h(\cdot)$ are the nonlinear state-transition and observation functions, respectively. $\mathbf{w}_{k-1}$ is the Gaussian process noise with zero mean, which satisfies the condition $\mathbf{w}_{k-1} \sim \mathcal{N}(\mathbf{0}, Q)$. $\mathbf{v}_k$ is the observation noise of the Student-t distribution, in which each element is independently and identically distributed, that is, $v_{k,i} \sim \mathrm{St}(0, r_i, \nu_i)$. Here, $r_i$ is the distribution precision, and $\nu_i$ is the degree of freedom (DOF). For the sake of brevity, the following notation is introduced:
$\mathbf{x}_{1:K} = \{\mathbf{x}_1, \ldots, \mathbf{x}_K\}$; $\mathbf{z}_{1:K} = \{\mathbf{z}_1, \ldots, \mathbf{z}_K\}$. The state matrix is $\mathbf{X} = [\mathbf{x}_1, \ldots, \mathbf{x}_K]$, and the observation matrix is $\mathbf{Z} = [\mathbf{z}_1, \ldots, \mathbf{z}_K]$. The observation distribution of a given state should be taken into consideration:
The elements of $f(\cdot)$ are functions of the system's dynamic parameters, not instantaneous measurements:
Based on the kinematic relationships in the above formula, its Jacobian matrix ($F_k$) can be derived as follows:
The items are expressed as follows:
By introducing a weight vector $\boldsymbol{\lambda}_k$ that is independent of the state $\mathbf{x}_k$ into the model, the prior distribution can be expressed as follows:
In the equation above, the weight vector $\boldsymbol{\lambda}_k$ and the diagonal weight matrix $\Lambda_k = \mathrm{diag}(\boldsymbol{\lambda}_k)$ are obtained, whose diagonal elements are hidden variables that follow the Gamma distribution. When the weight matrix $\Lambda_k$ and the system state are given, the observation follows a Gaussian distribution whose covariance matrix is $R\,\Lambda_k^{-1}$. The hyper-parameters of the Gamma distribution are determined by the degrees of freedom; both the shape and the rate take the value $\nu_i/2$, that is, $\lambda_{k,i} \sim \mathrm{Gamma}(\nu_i/2, \nu_i/2)$. Such a model means that the hidden variables consist of the system state and the weight vector. When the hidden variables ($\mathbf{x}_k$, $\boldsymbol{\lambda}_k$) are given, the observation conforms to a Gaussian distribution whose covariance varies with the moment; hence, the model can suppress outlying observation noise.
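The scale-mixture construction assumed above can be checked numerically: drawing λ from a Gamma(ν/2, ν/2) distribution and the noise from a Gaussian whose variance is scaled by 1/λ reproduces Student-t samples. The values of ν and σ below are illustrative.

```python
# Numerical check of the Gaussian scale-mixture construction: if
# lambda ~ Gamma(nu/2, rate=nu/2) and e | lambda ~ N(0, sigma^2 / lambda),
# then marginally e follows a Student-t with nu degrees of freedom.
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
nu, sigma, n = 4.0, 1.0, 200_000
lam = rng.gamma(shape=nu / 2, scale=2 / nu, size=n)   # rate nu/2 -> scale 2/nu
e = rng.normal(0.0, sigma / np.sqrt(lam))

# Compare empirical quantiles with the scaled Student-t reference.
qs = [0.90, 0.99, 0.999]
print(np.quantile(e, qs))
print(stats.t.ppf(qs, df=nu, scale=sigma))
```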
If the system state at the previous moment is given, the state-transition equation can forecast the system state at the current moment. This prediction tends to deviate from the actual value; thus, a sensor is needed to measure the system. The prediction is then combined with the measurement to estimate the state of the system at the current moment.
The probability density function of the Student-t distribution tends toward the Gaussian distribution as its degrees of freedom approach infinity. Therefore, it is possible to select an appropriate $\nu$ based on the probability of outliers occurring in the sensor measurements or to estimate it with the parameter learning method described in Section 3.3. Similarly, it is difficult to specify the $Q$ and $R$ parameters by hand in practical applications. Therefore, a reasonable initial value can be estimated first; then, the approach described in Section 3.3 can be used to learn the parameters while smoothing. The estimation of the hidden variables and the learning of the parameters happen concurrently when employing the EM approach.
3. Improved Online Kalman Smoothing Method Using Expectation-Maximization Algorithm
The proposed model is optimized through the expectation-maximization (EM) algorithm. In the E step, the posterior distributions of the hidden variables (the states $\mathbf{x}_{1:K}$ and the weights $\boldsymbol{\lambda}_{1:K}$) are derived, while the M step maximizes the expected complete-data log-likelihood to estimate the system parameters. The iterative loop between the E step and the M step continues until convergence is achieved, completing the estimation of the hidden variables and the learning of the system parameters.
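As a minimal, static analogue of this E/M alternation (not the state-space algorithm derived below), the following sketch estimates a constant signal observed under Student-t noise with a fixed ν: the E step computes the Gamma weight expectations, and the M step re-estimates the location and scale. The data and initialization are synthetic.

```python
# Minimal, static analogue of the EM alternation used in this paper: robust
# estimation of a constant signal observed under Student-t noise (nu fixed).
# The E step computes the Gamma weight expectations; the M step re-estimates
# the location m and the scale r. This is a sketch, not the full state-space EM.
import numpy as np

rng = np.random.default_rng(2)
nu = 3.0
z = 5.0 + rng.standard_t(df=nu, size=500)       # noisy observations of a constant
z[:10] += 40.0                                  # a few gross outliers

m, r = np.mean(z), np.var(z)                    # initialization
for _ in range(100):
    w = (nu + 1.0) / (nu + (z - m) ** 2 / r)    # E step: E[lambda_i | z_i]
    m_new = np.sum(w * z) / np.sum(w)           # M step: weighted location
    r_new = np.mean(w * (z - m_new) ** 2)       #         weighted scale
    converged = abs(m_new - m) < 1e-8
    m, r = m_new, r_new
    if converged:
        break

print(f"robust EM estimate: {m:.3f}  (sample mean: {np.mean(z):.3f})")
```

The outliers inflate the sample mean, while the EM iteration automatically down-weights them through the Gamma weights, which is the same mechanism exploited in the smoother below.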
Given that the hidden variables ($\mathbf{x}_{1:K}$ and $\boldsymbol{\lambda}_{1:K}$) are independent of one another a priori, and letting $i$ denote the iteration step of the algorithm, the complete data have the following posterior distribution:
Applying the Markov property of the system equation [
23], the prior distribution of states can be expanded as follows:
By leveraging the independence of the weight matrices across different time points, as well as the internal components and observations at various moments, the joint distribution can be expressed in the following manner:
The marginal likelihood $p(\mathbf{z}_{1:K})$ depends only on the observations and is therefore identical for any hidden variable; since it acts only as a normalization constant, its effect can be ignored, and the result is expressed as follows:
Taking the distributional features of the hidden variables ($\mathbf{x}_{1:K}$ and $\boldsymbol{\lambda}_{1:K}$) into consideration, it is quite difficult to directly calculate their joint posterior distribution. Instead, an alternating approach is applied to calculate the posteriors of $\mathbf{x}_{1:K}$ and $\boldsymbol{\lambda}_{1:K}$ separately, which simplifies the estimation process without compromising its accuracy.
In Bayesian parameter estimation, the Gamma distribution is the conjugate prior of the precision of a Gaussian likelihood; accordingly, the posterior density of the weight vector retains a Gamma analytical form [24]. That is, $\boldsymbol{\lambda}_k$ follows the Gamma distribution a posteriori, on which the derivation in Section 3.2 is based. The expectations of the weight vector are then obtained from the form of this posterior distribution.
Assume the weights $\boldsymbol{\lambda}_{1:K}$ are given; then, the posterior distribution of the system state $\mathbf{x}_k$ at a given moment is a product of Gaussian distributions. For linear models, Gaussian distributions and their products remain Gaussian. Since model (1) is nonlinear, it can be approximated as a linear model by applying a Taylor expansion and retaining the first-order term. Consequently, the posterior distribution of the state $\mathbf{x}_k$ still complies with a Gaussian distribution; that is, $\mathbf{x}_k$ can be considered Gaussian-distributed, which serves as the foundation for the derivation in Section 3.1.
To facilitate the derivation of the smoothing algorithm, two lemmas are introduced here.
Lemma 1 ([25]). If random variables $x$ and $y$ follow the Gaussian distributions
$$x \sim \mathcal{N}(m, P), \qquad y \mid x \sim \mathcal{N}(Hx + u, R),$$
then the joint distribution of variables $x$ and $y$ and the marginal distribution of $y$ are expressed as follows:
$$\begin{bmatrix} x \\ y \end{bmatrix} \sim \mathcal{N}\!\left( \begin{bmatrix} m \\ Hm + u \end{bmatrix}, \begin{bmatrix} P & PH^{T} \\ HP & HPH^{T} + R \end{bmatrix} \right), \qquad y \sim \mathcal{N}\!\left(Hm + u, \; HPH^{T} + R\right).$$
Lemma 2. If random variables $x$ and $y$ follow the joint Gaussian distribution
$$\begin{bmatrix} x \\ y \end{bmatrix} \sim \mathcal{N}\!\left( \begin{bmatrix} a \\ b \end{bmatrix}, \begin{bmatrix} A & C \\ C^{T} & B \end{bmatrix} \right),$$
then the marginal distributions and the conditional distributions of variables $x$ and $y$ are expressed as follows:
$$x \sim \mathcal{N}(a, A), \qquad y \sim \mathcal{N}(b, B),$$
$$x \mid y \sim \mathcal{N}\!\left(a + C B^{-1}(y - b), \; A - C B^{-1} C^{T}\right), \qquad y \mid x \sim \mathcal{N}\!\left(b + C^{T} A^{-1}(x - a), \; B - C^{T} A^{-1} C\right).$$
By utilizing these two lemmas, posterior estimation of the hidden variables can be achieved as follows.
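The identities in Lemma 1 can also be checked numerically by Monte Carlo sampling; the matrices below are arbitrary example values.

```python
# Monte Carlo check of Lemma 1: with x ~ N(m, P) and y | x ~ N(Hx + u, R),
# the marginal of y should be N(Hm + u, H P H^T + R). Matrices below are
# arbitrary example values.
import numpy as np

rng = np.random.default_rng(3)
m = np.array([1.0, -2.0])
P = np.array([[2.0, 0.5], [0.5, 1.0]])
H = np.array([[1.0, 1.0]])
u = np.array([0.3])
R = np.array([[0.4]])

x = rng.multivariate_normal(m, P, size=200_000)
y = x @ H.T + u + rng.multivariate_normal(np.zeros(1), R, size=200_000)

print("empirical mean / predicted:", y.mean(axis=0), H @ m + u)
print("empirical var  / predicted:", np.cov(y.T), (H @ P @ H.T + R)[0, 0])
```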
3.1. Posterior Distribution of State
3.1.1. Forward Recursion
First, the forward recursion is derived. Considering that the observations $\mathbf{z}_{1:k-1}$ and the hidden variables $\boldsymbol{\lambda}_{1:k-1}$ are given, the joint distribution of the hidden variables $\mathbf{x}_{k-1}$ and $\mathbf{x}_k$ is expressed as follows:
According to the model described in Section 2, the one-step prediction $p(\mathbf{x}_k \mid \mathbf{x}_{k-1})$ is subject to the Gaussian distribution, which can be expressed as $\mathcal{N}\!\left(f(\mathbf{x}_{k-1}), Q\right)$. Since the posterior of $\mathbf{x}_{k-1}$ follows the Gaussian distribution, the mean value of the distribution is represented by $\mathbf{m}_{k-1}$, and the covariance matrix of the distribution is represented by $P_{k-1}$.
The Taylor expansion of the function $f(\cdot)$ at the point $\mathbf{m}_{k-1}$ can be obtained, where the Jacobian matrix $F_{k-1}$ is expressed as follows:
Following these steps, the above joint distribution can be further expressed as follows:
Again, applying Lemma 1, the following can be obtained [
26]:
where the mean value and the covariance matrix can be respectively expressed as follows:
Subsequently, the one-step prediction distribution [
20] can be obtained by Lemma 2:
For further illustration, the mean and covariance matrix of the one-step prediction are denoted by $\mathbf{m}_k^-$ and $P_k^-$, respectively.
Next, considering that the observations $\mathbf{z}_{1:k-1}$ and the hidden variables $\boldsymbol{\lambda}_{1:k}$ are given, the joint distribution of $\mathbf{x}_k$ and $\mathbf{z}_k$ is expressed as follows:
According to the model described in Section 2, the observation $\mathbf{z}_k$ given $\mathbf{x}_k$ and $\boldsymbol{\lambda}_k$ obeys the Gaussian distribution and can be represented by $\mathcal{N}\!\left(h(\mathbf{x}_k), R\,\Lambda_k^{-1}\right)$, while the one-step prediction distribution of $\mathbf{x}_k$ is Gaussian with mean $\mathbf{m}_k^-$ and covariance matrix $P_k^-$, as described above.
The Taylor expansion of the function $h(\cdot)$ at the point $\mathbf{m}_k^-$ can be obtained, where the Jacobian matrix $H_k$ is expressed as follows:
Therefore, combined with Lemma 1, the joint distribution of the variables $\mathbf{x}_k$ and $\mathbf{z}_k$ can be further expressed as follows:
Then, using Lemma 2, it can be determined that the posterior of $\mathbf{x}_k$ obeys the Gaussian distribution $\mathcal{N}(\mathbf{m}_k, P_k)$, whose mean and covariance matrix can be expressed as follows:
This completes the forward recursion of RTS smoothing, which includes one-step prediction and observation correction. The one-step prediction yields the distribution estimate of $\mathbf{x}_k$ given $\mathbf{z}_{1:k-1}$, and the observation correction yields the distribution estimate of $\mathbf{x}_k$ given $\mathbf{z}_{1:k}$.
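A compact sketch of this observation-correction step is given below; it assumes the $R\,\Lambda_k^{-1}$ form of the effective observation covariance from Section 2, and the observation function, Jacobian, and numerical values in the usage example are illustrative only.

```python
# Sketch of the observation-correction step with element-wise Student-t
# weights: given the one-step prediction (m_pred, P_pred), the current weight
# expectations lam rescale the nominal observation covariance R to R / lam,
# following the R * diag(lam)^-1 form assumed in Section 2.
import numpy as np

def ekf_update_weighted(m_pred, P_pred, z, h, H_jac, R_diag, lam):
    """One weighted EKF measurement update; h and H_jac are the observation
    function and its Jacobian, R_diag the nominal diagonal of R, lam the
    current expected weights E[lambda_k]."""
    Hk = H_jac(m_pred)
    R_eff = np.diag(R_diag / lam)                       # inflated covariance for down-weighted elements
    S = Hk @ P_pred @ Hk.T + R_eff                      # innovation covariance
    K = P_pred @ Hk.T @ np.linalg.inv(S)                # Kalman gain
    m = m_pred + K @ (z - h(m_pred))                    # corrected mean
    P = P_pred - K @ Hk @ P_pred                        # corrected covariance
    return m, P

# Tiny usage example with a linear observation of a 2-state system.
h = lambda x: x[:1]
H_jac = lambda x: np.array([[1.0, 0.0]])
m, P = ekf_update_weighted(np.array([0.0, 1.0]), np.eye(2),
                           np.array([5.0]), h, H_jac,
                           R_diag=np.array([0.5]), lam=np.array([0.1]))
print(m, P)
```

With a small weight (here lam = 0.1), the measurement is strongly down-weighted and the correction of the predicted state is correspondingly small.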
3.1.2. Backward Recursion
Turning to the derivation of the backward recursion, first consider that when the observations $\mathbf{z}_{1:k}$ and the hidden variables $\boldsymbol{\lambda}_{1:k}$ are given, the joint distribution of the hidden variables $\mathbf{x}_k$ and $\mathbf{x}_{k+1}$ can be expressed as follows:
According to the model and the filtered posterior of $\mathbf{x}_k$, which is Gaussian with mean $\mathbf{m}_k$ and covariance matrix $P_k$ as described in Section 3.1.1, the above joint distribution can be further represented as follows:
where $F_k$ is the Jacobian matrix of the function $f(\cdot)$ at $\mathbf{m}_k$. Then, applying Lemma 2, the result is expressed as follows:
where the mean value and the covariance matrix can be respectively expressed as follows:
Ultimately, given the full observation sequence $\mathbf{z}_{1:K}$ and the hidden variables $\boldsymbol{\lambda}_{1:K}$, the joint distribution of the hidden variables $\mathbf{x}_k$ and $\mathbf{x}_{k+1}$ is expressed as follows:
The first factor on the right-hand side of the equation is the backward recursion derived in the previous step. The second factor represents the posterior distribution of the system state $\mathbf{x}_{k+1}$ given the observations $\mathbf{z}_{1:K}$ and the hidden variables $\boldsymbol{\lambda}_{1:K}$, that is, the smoothing distribution that the RRTS algorithm needs to solve. As mentioned in Section 3.1.1, this distribution still belongs to the Gaussian family. Suppose the mean value of the distribution is $\mathbf{m}_{k+1}^{s}$ and the covariance matrix is $P_{k+1}^{s}$; then, the above joint distribution can be further expressed as follows:
Let
Then, the smoothed distribution of $\mathbf{x}_k$ is determined.
Next, the following can be inferred by Lemma 2:
This completes the backward recursion of the system state, which includes backward prediction and correction. The calculation depends on the weight vector $\boldsymbol{\lambda}_k$, whose posterior estimation is derived next.
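A minimal sketch of a single backward step is given below; it follows the standard RTS form consistent with the derivation above, and the inputs in the usage example are illustrative.

```python
# Sketch of a single backward-recursion (RTS) step as derived above: the
# smoothing gain propagates the smoothed estimate at k+1 back to time k.
# F_jac is the Jacobian of f evaluated at the filtered mean m_f.
import numpy as np

def rts_backward_step(m_f, P_f, m_pred_next, P_pred_next, m_s_next, P_s_next, F_jac):
    Fk = F_jac(m_f)
    G = P_f @ Fk.T @ np.linalg.inv(P_pred_next)          # smoothing gain
    m_s = m_f + G @ (m_s_next - m_pred_next)             # smoothed mean at time k
    P_s = P_f + G @ (P_s_next - P_pred_next) @ G.T       # smoothed covariance at time k
    return m_s, P_s

# Toy usage with a random-walk model (f = identity, so F = I).
I2 = np.eye(2)
m_s, P_s = rts_backward_step(np.zeros(2), I2, np.zeros(2), 1.5 * I2,
                             np.array([0.3, -0.1]), I2,
                             F_jac=lambda m: I2)
print(m_s)
```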
3.2. Posterior Distribution of Weight Vector
For the derivation of the posterior distribution of the weight vector $\boldsymbol{\lambda}_k$, all terms related to $\boldsymbol{\lambda}_k$ in the posterior distribution need to be taken into consideration:
Utilizing
to express
, the above function can be expressed as follows:
Obviously, the posterior distribution of $\lambda_{k,i}$ still conforms to the Gamma distribution. Thus, it can be found that the posterior distribution of $\lambda_{k,i}$ has the following form:
where $\langle \lambda_{k,i} \rangle$ is the expected value of $\lambda_{k,i}$ under the posterior distribution, and $\lambda_{k,i}$ represents the $i$-th element of the vector $\boldsymbol{\lambda}_k$. The state estimate entering this expression can be taken in two ways: during the forward recursion, the filtered estimate is used, while during the backward recursion, the smoothed estimate is used. Additionally, the expected value of $\ln \lambda_{k,i}$ under the posterior distribution can be calculated in order to learn the $\nu$ parameter, expressed as follows:
where $\psi(\cdot)$ is the digamma function.
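For reference, the Gamma-posterior moments used here follow the standard identities $\mathbb{E}[\lambda] = a/b$ and $\mathbb{E}[\ln\lambda] = \psi(a) - \ln b$ for a Gamma distribution with shape $a$ and rate $b$. The sketch below evaluates them with the shape and rate of the usual Student-t scale-mixture update, which is an assumption consistent with (but not copied from) the exact expressions above.

```python
# Sketch of the weight-posterior expectations used in the E step. For a
# Gamma(a, b) posterior (shape a, rate b) the required moments are
# E[lambda] = a / b and E[ln lambda] = psi(a) - ln(b), with psi the digamma
# function. The particular a and b below follow the standard Student-t
# scale-mixture update (an assumption consistent with Section 2).
import numpy as np
from scipy.special import digamma

def weight_posterior_moments(residual, r, nu):
    """residual: z_i - h_i(m); r: nominal noise scale; nu: degrees of freedom."""
    a = (nu + 1.0) / 2.0
    b = (nu + residual ** 2 / r) / 2.0
    e_lam = a / b                       # E[lambda]
    e_log_lam = digamma(a) - np.log(b)  # E[ln lambda], used when learning nu
    return e_lam, e_log_lam

print(weight_posterior_moments(residual=3.0, r=0.5, nu=4.0))   # large residual -> small weight
print(weight_posterior_moments(residual=0.2, r=0.5, nu=4.0))   # small residual -> weight near 1
```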
Section 3.1 and Section 3.2 formalize the E-step computations within the iterative expectation-maximization paradigm, constituting the computational core for parameter re-estimation and uncertainty quantification in the context of ship maneuvering data analysis. The E step deduces the posterior distributions of the hidden variables $\mathbf{x}_{1:K}$ and $\boldsymbol{\lambda}_{1:K}$ using the existing observation data ($\mathbf{z}_{1:K}$). The expected values of $\mathbf{x}_{1:K}$ and $\boldsymbol{\lambda}_{1:K}$ under the old parameter set ($\theta^{\mathrm{old}}$) are obtained and are further applied to learn the new parameter set in the M step.
3.3. Bayesian Hyperparameter Optimization
The parameters in the EM algorithm are learned by maximizing the function $\mathcal{Q}(\theta, \theta^{\mathrm{old}})$, that is,
$$\theta^{\mathrm{new}} = \arg\max_{\theta} \, \mathcal{Q}(\theta, \theta^{\mathrm{old}}),$$
where $\theta^{\mathrm{old}}$ is the parameter set before learning, that is, the parameter value used in the E step. Here, $\mathcal{Q}(\theta, \theta^{\mathrm{old}})$ is the cost function, which can be calculated as follows:
$$\mathcal{Q}(\theta, \theta^{\mathrm{old}}) = \mathbb{E}_{p(\mathbf{x}_{1:K}, \boldsymbol{\lambda}_{1:K} \mid \mathbf{z}_{1:K}, \theta^{\mathrm{old}})}\!\left[\ln p\!\left(\mathbf{x}_{1:K}, \boldsymbol{\lambda}_{1:K}, \mathbf{z}_{1:K} \mid \theta\right)\right].$$
To learn the initial state mean $\mathbf{m}_0$, the related terms in $\mathcal{Q}(\theta, \theta^{\mathrm{old}})$ need to be considered, which can be expressed as follows:
Here, the expectation is computed under the posterior distribution obtained in the E step, and the constant collects all terms independent of the parameter being estimated. Taking the partial derivative of the above expression with respect to $\mathbf{m}_0$ and setting it to zero, the result is expressed as follows:
Similarly, in order to learn the initial state covariance $P_0$, all the related terms in $\mathcal{Q}(\theta, \theta^{\mathrm{old}})$ need to be considered, expressed as follows:
Taking the partial derivative of the above expression with respect to $P_0$ and setting it to zero, the result is expressed as follows:
In order to learn parameter $Q$, all the related terms in $\mathcal{Q}(\theta, \theta^{\mathrm{old}})$ need to be considered, expressed as follows:
Taking the partial derivative of the above expression with respect to $Q$ and setting it to zero, the result is expressed as follows:
where the required expected values are computed as reported in Section 3.5, in conjunction with the Taylor expansion. The first-order Taylor expansion of $f(\cdot)$ can be expressed as follows:
To learn the parameter matrix $R$, all elements $r_i$ on the diagonal of the matrix are required. All the related terms in $\mathcal{Q}(\theta, \theta^{\mathrm{old}})$ can be expressed as follows:
where the subscript $i$ denotes the $i$-th element of the corresponding vector. Taking the partial derivative of the above expression with respect to $r_i$ and setting it to zero, the result is expressed as follows:
Then, the parameter matrix $R$ can be expressed as follows:
In order to learn the parameter vector $\boldsymbol{\nu}$, the calculation of each element $\nu_i$ is required. All the related terms in $\mathcal{Q}(\theta, \theta^{\mathrm{old}})$ can be represented as follows:
where $\Gamma(\cdot)$ is the gamma function, that is, $\Gamma(x) = \int_0^{\infty} t^{x-1} e^{-t}\, \mathrm{d}t$. Taking the partial derivative of the above expression with respect to $\nu_i$ and setting it to zero, the result is expressed as follows:
where $\psi(\cdot)$ is the digamma function. An estimate of $\nu_i$ can be obtained by solving the above equation numerically, yielding the parameter vector $\boldsymbol{\nu}$.
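Because the stationarity condition for $\nu_i$ involves the digamma function, it has no closed-form solution and must be solved numerically. The sketch below solves the classical form of this condition from the Student-t EM literature with a bracketing root finder; the specific equation form and the synthetic weight expectations are assumptions used for illustration, not a verbatim transcription of the equation above.

```python
# Sketch of the degrees-of-freedom update in the M step. Setting the partial
# derivative of the cost function with respect to nu to zero yields a scalar
# equation in nu involving the digamma function, solved here numerically with
# a bracketing root finder. The expectations e_lam and e_log_lam come from
# the E step (Section 3.2).
import numpy as np
from scipy.special import digamma
from scipy.optimize import brentq

def update_nu(e_lam, e_log_lam):
    """Solve  ln(nu/2) + 1 - psi(nu/2) + mean(E[ln lam] - E[lam]) = 0  for nu."""
    c = np.mean(e_log_lam - e_lam)
    g = lambda nu: np.log(nu / 2.0) + 1.0 - digamma(nu / 2.0) + c
    return brentq(g, 0.1, 200.0)          # bracket chosen to cover typical DOF values

# Example: synthetic weight expectations drawn from Gamma(1.5, rate 1.5),
# which corresponds to nu = 3; the estimate should recover a value near 3.
rng = np.random.default_rng(4)
lam = rng.gamma(shape=1.5, scale=1.0 / 1.5, size=1000)
print(f"estimated nu: {update_nu(lam, np.log(lam)):.2f}")
```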
3.4. Algorithmic Processes
The three sections above describe the process by which the EM algorithm solves the model; the complete EM framework is summarized as follows. The joint distribution $p(\mathbf{x}, \mathbf{z} \mid \theta)$ of the hidden variable $\mathbf{x}$ and the observation variable $\mathbf{z}$ is given, and $\theta$ is the parameter set. The purpose of EM is to maximize the likelihood function by alternately selecting the appropriate posterior distribution and parameter set. For the model in this paper, the steps of the algorithm are as follows:
Select the initial parameter set $\theta^{(0)}$.
E step: The posterior estimates of the hidden variables are calculated by the forward recursion of Equations (34) and (35). Then, the posterior estimates of the hidden variables are calculated by the backward recursion of Equations (44) and (45). Ultimately, the expected values of the hidden variables $\mathbf{x}_k$ and $\boldsymbol{\lambda}_k$ are obtained; the outcomes are the smoothed state estimates and the expected weights, respectively.
M step: Learn the parameter set using Equations (
54), (
56), (
58), (
62) and (
64) as follows:
The convergence criterion is specified, and the difference between two successive parameter estimates is evaluated against it. If the requirement is satisfied, the algorithm ends; if it is not, the algorithm returns to step 2.
The process described above is carried out recursively from the initial moment to the current measurement moment ($K$) in order to estimate the system's hidden variables and to learn the parameter set.
3.5. Supplementary Instructions for Calculations of Expected Values
Additionally, two calculations of expected values are supplemented here. The expected value and covariance of the posterior estimate of the system state at moment $k$ satisfy the following:
Thus, the calculation formula of the expected value is expressed as follows:
Then, the following expectation is calculated:
Considering that the expected value of the state noise is zero,
The expected value and covariance of the posterior estimate of the system state at moment $k-1$ satisfy the following:
The above formulas, (68) and (72), are applied to learn parameter $Q$ as described in Section 3.3.
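For a Gaussian posterior with mean $\mathbf{m}$ and covariance $P$, these expectations reduce to second moments via $\mathbb{E}[\mathbf{x}\mathbf{x}^{T}] = P + \mathbf{m}\mathbf{m}^{T}$; a minimal sketch with example values follows (the cross-moment between adjacent times additionally involves the smoothing gain and is omitted here).

```python
# The expectations needed for the Q update reduce to second moments of the
# smoothed Gaussian posteriors; for a Gaussian with mean m and covariance P,
# E[x x^T] = P + m m^T. Example values below are illustrative.
import numpy as np

m_s = np.array([1.0, 0.5])                       # smoothed mean at time k (example)
P_s = np.array([[0.2, 0.05], [0.05, 0.1]])       # smoothed covariance at time k (example)

E_xxT = P_s + np.outer(m_s, m_s)                 # E[x_k x_k^T] under the smoothed posterior
print(E_xxT)
```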
Analogously, the following formulas can be derived and applied to learn parameter $R$:
where $H_k$ represents the Jacobian matrix of the observation function $h(\cdot)$ evaluated at the smoothed mean, which can be represented as follows:
The described computational sequence corresponds to the expectation (E) phase within the expectation-maximization (EM) framework, while the subsequent parameter optimization phase embodies the maximization (M) operator. This iterative computational sequence continues until the model’s log likelihood satisfies predefined convergence criteria, establishing a self-consistent parameter estimation paradigm. Finally, robust extended Kalman smoothing is achieved. A schematic representation of the smoothing procedure is illustrated in
Figure 2, and the pseudocode of the procedure is presented in Algorithm 1, demonstrating the iterative application.
Algorithm 1 Extended Kalman Filter with RTS Smoothing Algorithm
Initialization
1: Set the initial mean $\mathbf{m}_0$ and covariance $P_0$.
2: for $k = 1$ to $K$ do
 Prediction Step
3: $\mathbf{m}_k^- = f(\mathbf{m}_{k-1})$ (State prediction)
4: $P_k^- = F_{k-1} P_{k-1} F_{k-1}^{T} + Q$ (Covariance prediction)
 Measurement Update
5: $K_k = P_k^- H_k^{T} \left(H_k P_k^- H_k^{T} + R\right)^{-1}$ (Kalman gain)
6: $\mathbf{m}_k = \mathbf{m}_k^- + K_k \left(\mathbf{z}_k - h(\mathbf{m}_k^-)\right)$ (State update)
7: $P_k = \left(I - K_k H_k\right) P_k^-$ (Covariance update)
end for
RTS Smoothing
8: $\mathbf{m}_K^{s} = \mathbf{m}_K$, $P_K^{s} = P_K$ (Final time state)
9: for $k = K - 1$ down to $0$ do
10: $G_k = P_k F_k^{T} \left(P_{k+1}^-\right)^{-1}$ (Smoothing gain)
11: $\mathbf{m}_k^{s} = \mathbf{m}_k + G_k \left(\mathbf{m}_{k+1}^{s} - \mathbf{m}_{k+1}^-\right)$ (Smoothed state)
12: $P_k^{s} = P_k + G_k \left(P_{k+1}^{s} - P_{k+1}^-\right) G_k^{T}$ (Smoothed covariance)
end for
Output: Smoothed state estimates $\{\mathbf{m}_k^{s}, P_k^{s}\}$ for $k = 0, \ldots, K$