Model Adaptive Kalman Filter for Maneuvering Target Tracking Based on Variational Inference

Wang, Junxiang; Wang, Xin; Chen, Yingying; Yan, Mengting; Lan, Hua

doi:10.3390/electronics14101908

Open AccessArticle

Model Adaptive Kalman Filter for Maneuvering Target Tracking Based on Variational Inference

by

Junxiang Wang

^1,2,

Xin Wang

³,

Yingying Chen

³,

Mengting Yan

² and

Hua Lan

^3,*

¹

School of Reliability and Systems Engineering, Beihang University, Beijing 100191, China

²

Beijing Institute of Aerospace Systems Engineering, Beijing 100076, China

³

School of Automation, Northwestern Polytechnical University, Xi’an 710072, China

^*

Author to whom correspondence should be addressed.

Electronics 2025, 14(10), 1908; https://doi.org/10.3390/electronics14101908

Submission received: 6 April 2025 / Revised: 28 April 2025 / Accepted: 6 May 2025 / Published: 8 May 2025

(This article belongs to the Special Issue New Insights in Radar Signal Processing and Target Recognition)

Download

Browse Figures

Versions Notes

Abstract

This study introduces a new variational Bayesian adaptive estimator that enhances traditional interactive multiple model (IMM) frameworks for maneuvering target tracking. Conventional IMM algorithms struggle with rapid maneuvers due to model-switching delays and fixed structures. Our method uses Bayesian inference to update change-point statistics in real-time for quick model switching. Variational Bayesian inference approximates the complex posterior distribution, transforming target state estimation and model identification into an optimization task to maximize the evidence lower bound (ELBO). A closed-loop iterative mechanism jointly optimizes the target state and model posterior. Experiments in six simulated and two real-world scenarios show our method outperforms current algorithms, especially in high maneuverability contexts.

Keywords:

variational Bayesian inference; multi-model estimation; Bayesian online change point detection; maneuvering target tracking

1. Introduction

Target tracking using radar and sonar is crucial in military and civilian applications like missile defense, air traffic control, and maritime surveillance [1]. For non-maneuvering targets, a single motion model with the Kalman Filter (KF) effectively estimates and tracks the target’s state. However, relying on a single model can mismatch the model and actual behavior when tracking maneuvering targets.

Multimodel estimators have become a significant approach to address the challenges associated with model mismatches arising from the maneuverability of targets [2,3]. These methods use hypothetical motion models to simultaneously estimate model probabilities and system states within a Markov framework. The exponential growth of potential hypotheses in multimodel state estimation systems poses a significant computational challenge, making exact Bayesian inference infeasible. Efforts have been made to develop computationally feasible approximations [4,5], primarily by reducing hypothesis space through the approximation of multimodal probability density functions (pdfs). The Generalized Pseudo-Bayesian algorithm replaces the intensive Gaussian mixture with a single Gaussian by using moment matching during updates, albeit with higher computational costs than basic Kalman filtering [6]. For greater efficiency, Blom and Shalom introduced the IMM algorithm with Markov switching coefficients [7]. The IMM method balances computational efficiency with tracking accuracy through model interaction and fusion, and is widely used in target tracking, fault detection, and signal processing [8,9]. Qiu et al. [10] present a centralized fusion algorithm interacting multiple models and the adaptive Kalman filter combining IMM and an adaptive Kalman filter for underwater acoustic sensor networks. Youn et al. [11] introduce an adaptive Kalman filter for estimating measurement loss probability within the IMM framework. Qu et al. [12] explore multi-model estimation with variable structures to handle uncertain model parameters. The IMM algorithm fusing modified input estimation and best linear unbiased estimation filter, and hybrid grid multiple model algorithms are other notable IMM extensions but are computationally intensive, limiting their real-time applications [13,14].

The aforementioned methods conventionally approximate the posterior pdf (mixture Gaussian distribution in linear Gaussian systems) by a single Gaussian distribution with the mean and variances calculated through weighted summation. While this approach provides a reasonable estimate, it inevitably results in some loss of accuracy. When model parameters are uncertain, obtaining the optimal analytical solution for target state estimation becomes challenging, necessitating the use of approximate reasoning techniques. These techniques can be broadly categorized into two groups: sampling-based stochastic approximation and optimization-based deterministic approximation. In particular, sampling-based sequential Monte Carlo (SMC) methods [15] approximate the intractable joint probability density functions via particle propagation [16]. However, their considerable computational burden restricts their applicability to small-scale state estimation problems. In contrast, the optimization-based variational Bayes (VB) approach [17] reformulates posterior inference as an optimization problem, thereby providing an approximate analytical solution.

VB-based methods have attracted significant attention in adaptive Kalman filtering applications due to their efficient computation compared to sampling-based methods. Särkkä and Nummenmaa [18] introduced the first VB-based adaptive Kalman filtering method for joint recursive estimation of dynamic state and the time-varying measurement noise covariance in linear state space models. This work [18] was further extended to nonlinear state space models with unknown measurement noise covariance by combining with nonlinear approximation methods, such as MCMC [19] and the cubature integration rule [20]. By regarding the predicted state covariance as a latent variable, Huang et al. [21,22] presented the VB adaptive Kalman filtering for linear state estimation with measurement noise covariance and process noise covariance. Zhang et al. [23] explored distributed sequential state estimation for discrete time-varying systems with imprecise process noise covariance over binary sensor networks. Ma et al. [24] proposed the VB-based multi-model estimator that jointly infers target state and model identity by adaptively weighting model posteriors. Lan et al. [25] introduced an auxiliary latent variable to separate state and process noise covariance, developing the VB-based adaptive Kalman filtering for nonlinear state estimation with unknown process noise covariance and measurement noise covariance. Recently, Lan et al. [26,27] introduced the conjugate VB-based adaptive Kalman filtering for state estimation and noise identification in both linear and nonlinear dynamic systems. However, it is worth noting that due to the non-convex nature of the variational optimization objective function, the performance of variational adaptive filtering algorithms is heavily influenced by the initial iteration value settings. When targets undergo strong maneuvers, the algorithm is prone to converging to local optima, potentially resulting in degraded tracking performance or even track loss.

Essentially, the non-stationary stochastic process induced by target maneuvers can be viewed as a change point occurring in an otherwise stationary stochastic process. Bayesian online change-point detection (BOCPD) [28,29], an algorithm designed for real-time detection of anomalies in data streams, offers a promising solution for maneuver detection. This method characterizes the probability of a change point occurring by introducing a run length variable and partitions the non-stationary time series data generated by maneuvering targets into non-overlapping stationary sub-sequences based on the posterior probability of the run length.

We propose an adaptive Kalman filter using VB for online change point detection with model selection, referred to as VBOCPDMS, which employs a change point variable modeled by run length within a variational Bayesian framework to jointly estimate latent variables like target state, model identity, and run length. The proposed VBOCPDMS method transforms the complex inference of the posterior probability density function into a Kullback–Leibler (KL) divergence optimization problem. Using variational Bayesian inference, we update change point statistics online, allowing for real-time detection and response to changes in target motion states. An efficient model switching mechanism dynamically adjusts model weights based on the target’s motion state, facilitating the integration of multiple motion models. The effectiveness of the method is validated by simulations and real radar data, demonstrating its superiority over existing methods.

2. Problem Formation

Assuming that a non-stationary time-series data

{y_{1}, y_{2}, \dots, y_{k}}

can be segmented into non-overlapping stationary subsequences delineated by run lengths

r_{k}

. Consequently, each measurement data

y_{k}

follows a probability distribution

p (y_{k} | η_{m_{k}})

according to model

m_{k}

at time k. Specifically, the run length is modeled as a discrete random variable

r_{k} \in [1, 2, \dots, k]

: when a change in the target’s motion pattern occurs, signifying a change point, the run length reduces to 1; otherwise, it increments.

Given a particular run length

r \in r_{k}

, we consider the following discrete-time linear multi-model state-space system:

\begin{matrix} x_{k} & = F_{m, k} (x_{k - 1}) + υ_{m, k} \end{matrix}

(1)

\begin{matrix} y_{k} & = H_{m, k} (x_{k}) + w_{m, k} \end{matrix}

(2)

where the model identity

m \in M

, and

M

represents the domain of possible models. The time index k corresponds to the target state

x_{k} \in R^{n_{x}}

and the observation

y_{k} \in R^{n_{y}}

, where

n_{x}

and

n_{y}

are the state and measurement dimensions, respectively. Given the model m at time k, the state transition function and measurement function are denoted by

F_{m, k}

and

H_{m, k}

, respectively. The process noise vector

υ_{m, k}

and the measurement noise vector

w_{m, k}

are assumed to be mutually independent and follow Gaussian distributions, i.e.,

υ_{m, k} \sim N (0, Q_{m, k})

and

w_{m, k} \sim N (0, R_{m, k})

. The initial state vector

x_{0 | 0}

is assumed to follow a Gaussian distribution. Furthermore,

x_{0}

,

υ_{m, k}

, and

w_{m, k}

are assumed to be mutually uncorrelated.

The discrete model identity variable

m_{k}

is reformulated as a vector of binary indicators

I_{k} = [I_{1, k}, \dots, I_{m, k}, \dots, I_{M, k}]

, where

I_{m, k} \in \{0, 1\}

for

m = 1, 2, \dots, M

, and

\sum_{m = 1}^{M} I_{m, k} = 1

. Consequently, selecting model m at time k implies

I_{m, k} = 1

, while all other elements of

I_{k}

are zero. The model identity vector

I_{k}

can be modeled using a categorical distribution:

\begin{matrix} C a t (I_{k} | μ_{k}) = \prod_{m = 1}^{M} {[μ_{m, k}]}^{I_{m, k}} \end{matrix}

(3)

where the elements of the parameter vector

μ_{k}

are all positive and sum to 1, and

μ_{m, k}

represents the statistical expectation of

I_{m, k}

.

This paper focuses on the multi-model state estimation problem, leveraging different parameters

F_{m, k}, Q_{m, k}

within the model domain to capture the maneuverability of the target. Accordingly, the filtering problem is reformulated as a joint variational posterior inference problem encompassing the run length, model identifier, and target state, namely, the computation of a joint posterior probability density function

p (x_{k}, I_{k}, r_{k} | y_{1 : k})

. Formally, the well-known recursive Bayesian filtering solution consists of the following steps:

Initialization: Set the prior pdf $p (x_{0}, I_{0}, r_{0})$
Time prediction: The predictive pdf $p (x_{k}, I_{k}, r_{k} | y_{1 : k - 1})$ is given by the Chapman–Kolmogorov equation:

$\begin{matrix} p (x_{k}, I_{k}, r_{k} | y_{1 : k - 1}) = & \sum_{r_{k - 1}} \int p (x_{k} | x_{k - 1}, I_{k}) p (I_{k} | I_{k - 1}, r_{k}) p (r_{k} | r_{k - 1}) \\ \times p (x_{k - 1}, I_{k - 1}, r_{k - 1} | y_{1 : k - 1}) d x_{k - 1} d I_{k - 1} \end{matrix}$

(4)
Measurement update: The above predictive PDF is updated by measurement $y_{k}$ :

$\begin{matrix} p (x_{k}, I_{k}, r_{k} | y_{1 : k}) \propto p (y_{k} | x_{k}, I_{k}, r_{k}) \times p (x_{k}, I_{k}, r_{k} | y_{1 : k - 1}) \end{matrix}$

(5)

The recursion equations are complex due to nonlinearity and multi-model estimation. We will solve these using variational Bayesian inference in the next section.

3. The Proposed VBOCPDMS Method

The proposed VBOCPDMS method can be represented by the following Figure 1.

3.1. Model Prior Distributions

The initialization step is to model the prior distribution of latent variables. According to the definition of

I_{k}

in (3), the prior distribution of the system state, i.e., predicted pdf, can be derived as follows:

\begin{matrix} p (x_{k} | y_{1 : k - 1}, I_{k}^{r}, r_{k}) = \prod_{m = 1}^{M} N {(x_{k}^{r} | {\tilde{x}}_{m, k}^{r}, {\tilde{P}}_{m, k}^{r})}^{I_{m, k}^{r}} \end{matrix}

(6)

where

x_{k}^{r}

and

I_{k}^{r}

represent the conditional state and conditional model identity at time k conditioned on the run length

r_{k} = r

, respectively. Meanwhile,

{\tilde{x}}_{m, k}^{r}

and

{\tilde{P}}_{m, k}^{r}

denote the predicted mean and the corresponding covariance matrix of the conditional state for model

m_{k} = m

at time k, respectively. The notation

\tilde{(\cdot)}

signifies a predicted value.

Subsequently, based on the state transition function in (1), we can obtain

\begin{matrix} {\tilde{x}}_{m, k}^{r} & = F_{m, k} {\hat{x}}_{k - 1}^{r} \end{matrix}

(7)

\begin{matrix} {\tilde{P}}_{m, k}^{r} & = F_{m, k} {\hat{P}}_{k - 1}^{r} F_{m, k}^{T} + Q_{m, k} \end{matrix}

(8)

where

{\hat{x}}_{k - 1}^{r}

and

{\hat{P}}_{k - 1}^{r}

denote the estimated mean and the corresponding covariance matrix of the conditional state at time

k - 1

, respectively. The notation

\hat{(\cdot)}

signifies an estimated value.

Q_{m, k}

is the process noise covariance for model

m_{k} = m

.

The prior

p (r_{k} | y_{1 : k - 1})

of the run length

r_{k}

at time k is presented as follows:

\begin{matrix} p (r_{k} | y_{1 : k - 1}) & = \sum_{r_{k - 1}} p (r_{k}, r_{k - 1} | y_{1 : k - 1}) \\ = \sum_{r_{k - 1}} p (r_{k} | r_{k - 1}, y_{1 : k - 1}) p (r_{k - 1} | y_{1 : k - 1}) \\ = \sum_{r_{k - 1}} p (r_{k} | r_{k - 1}) p (r_{k - 1} | y_{1 : k - 1}) \end{matrix}

(9)

where

p (r_{k} | r_{k - 1})

denotes the transition probability of the run length. Since the probability mass of

p (r_{k} | r_{k - 1})

is non-zero only in two distinct scenarios—either the absence of a change point, resulting in the continuation of the run length

r_{k}

, or the occurrence of a change point, truncating

r_{k}

to 1—the transition probability of the run length can be mathematically formulated as follows:

\begin{matrix} P (r_{k} | r_{k - 1}) = \{\begin{matrix} H (r_{k - 1} + 1), & r_{k} = 1 \\ 1 - H (r_{k - 1} + 1), & r_{k} = r_{k - 1} + 1 \end{matrix} \end{matrix}

(10)

where

H (τ)

represents a penalty function, as defined in (11).

\begin{matrix} H (τ) = \frac{P_{gap} (g = τ)}{\sum_{k = τ}^{\infty} P_{gap} (g = k)} \end{matrix}

(11)

In this paper,

P_{gap} (g = τ)

can be formulated as a discrete exponential (or geometric) distribution with a time scale

τ

. Given the memoryless nature of this process, the penalty function can be simplified to

H (τ) = 1 / λ

, where

λ

represents the maneuvering period.

By substituting (10) into (9), the prior

p (r_{k} | y_{1 : k - 1})

is finally obtained:

\begin{matrix} p (r_{k} | y_{1 : k - 1}) = \{\begin{matrix} (1 - H (r_{k - 1} + 1)) p (r_{k - 1} | y_{1 : k - 1}), & r_{k} = r_{k - 1} + 1 \\ \sum_{r_{k - 1}} H (r_{k - 1} + 1) p (r_{k - 1} | y_{1 : k - 1}), & r_{k} = 1 \end{matrix} \end{matrix}

(12)

As

I_{k}

follows a categorical distribution, the prior of the conditional model identity

I_{k}^{r}

at time k is assumed to be,

\begin{matrix} p (I_{k} | r_{k}, y_{1 : k - 1}) & = C a t (I_{k}^{r} | {\tilde{μ}}_{k}^{r}) \end{matrix}

(13)

where

{\tilde{μ}}_{k}^{r}

represents the parameter set of the conditional model identity

I_{k}^{r}

.

In the case where

r_{k} = r_{k - 1} + 1

, it indicates a target undergoing stationary motion. However, for

r_{k} = 1

, this situation indicates the occurrence of a change point. Thus, the prior parameter

{\tilde{μ}}_{k}^{r}

is defined as:

\begin{matrix} {\tilde{μ}}_{k}^{r} = \{\begin{matrix} {\hat{μ}}_{k - 1}^{r}, & r_{k} = r_{k - 1} + 1 \\ {\hat{μ}}_{0}, & r_{k} = 1 \end{matrix} \end{matrix}

(14)

where

{\hat{μ}}_{0}

represents the hyperparameter of

{\tilde{μ}}_{k}^{r}

when a change point occurs.

There are several different choices

(I_{0}, I_{1}, \dots)

for

{\hat{μ}}_{0}

. In this paper, we compute the likelihood of the target state under each choice in (17), where the run length is 1, and select the one that maximizes the likelihood as the initial model weight:

\begin{matrix} {\hat{μ}}_{0}^{*} = p (I_{k}^{1} | y_{1 : k - 1}) = arg max_{{\hat{μ}}_{0}} \{p (y_{k} | y_{1 : k - 1})\} \end{matrix}

(15)

Subsequently, the parameter

{\tilde{μ}}_{k}^{r}

will be updated through the variational posterior, and its number will linearly increase with growth in the run length.

3.2. Update Approximate Posterior Distributions

Utilizing Bayes’ theorem, the posterior distribution

p (r_{k} | y_{1 : k})

can be computed as

\begin{matrix} p (r_{k} | y_{1 : k}) = \frac{p (r_{k} | y_{1 : k - 1}) p (y_{k} | y_{1 : k - 1}) p (y_{1 : k - 1})}{p (y_{1 : k})} \propto p (r_{k} | y_{1 : k - 1}) p (y_{k} | y_{1 : k - 1}) \end{matrix}

(16)

with

\begin{matrix} p (y_{k} | y_{1 : k - 1}) = \int p (y_{k} | x_{k}^{r}, I_{k}^{r}, y_{1 : k - 1}) p (x_{k}^{r} | I_{k}^{r}, y_{1 : k - 1}) p (I_{k}^{r} | y_{1 : k - 1}) d x_{k}^{r} d I_{k}^{r} \end{matrix}

(17)

Conditioned on the run length

r_{k}

, the joint distribution

p (y_{k}, x_{k}, I_{k} | r_{k}, y_{1 : k - 1})

is

\begin{matrix} p (y_{k}, x_{k}, I_{k} | r_{k}, y_{1 : k - 1}) = p (y_{k} | x_{k}^{r}, I_{k}^{r}) p (x_{k}^{r} | y_{1 : k - 1}, I_{k}^{r}) p (I_{k}^{r} | y_{1 : k - 1}, r_{k}) \end{matrix}

(18)

Using the measurement function from (2) and the prior distributions from (6) and (13), the three terms on the right-hand side of (18) are given as follows:

\begin{matrix} p (y_{k} | x_{k}^{r}, I_{k}^{r}) = \prod_{m = 1}^{M} N {(y_{k} | H_{m, k} x_{k}^{r}, r_{m, k})}^{I_{m, k}^{r}} \end{matrix}

(19)

\begin{matrix} p (x_{k}^{r} | y_{1 : k - 1}, I_{k}^{r}) = \prod_{m = 1}^{M} N {(x_{k}^{r} | {\tilde{x}}_{m, k}^{r}, {\tilde{P}}_{m, k}^{r})}^{I_{m, k}^{r}} \end{matrix}

(20)

\begin{matrix} p (I_{k}^{r} | y_{1 : k - 1}, r_{k}) = \prod_{m = 1}^{M} {[{\tilde{μ}}_{m, k}^{r}]}^{I_{m, k}^{r}} \end{matrix}

(21)

In the following, we will derive the posterior distributions for each latent variable related to run length, system state, and the motion model separately.

Derivations of $p (r_{k} | y_{1 : k})$

By incorporating (19) through (21) into (17), the likelihood previously presented in (17) is expressed in a compact form as

\begin{matrix} p (y_{k} | y_{1 : k - 1}) = E_{p (I_{k}^{r} | y_{1 : k - 1})} [\prod_{m = 1}^{M} {Ƶ_{k}}^{I_{m, k}^{r}}] \end{matrix}

(22)

where

E

denotes the expectation operator, and the term

Ƶ_{k}

is denoted as

\begin{matrix} Ƶ_{k} = N (y_{k} | H_{m, k} {\tilde{x}}_{m, k}^{r}, H_{m, k} {\tilde{P}}_{m, k}^{r} H_{m, k}^{T} + R_{m, k}) \end{matrix}

(23)

Due to

I_{m, k}^{r} \in \{0, 1\}

and

\sum_{m = 1}^{M} I_{m, k}^{r} = 1

, it follows that:

\begin{matrix} E_{p (I_{k}^{r} | y_{1 : k - 1})} [\prod_{m = 1}^{M} {Ƶ_{k}}^{I_{m, k}^{r}}] = \sum_{m = 1}^{M} {\tilde{μ}}_{m, k}^{r} Ƶ_{k} \end{matrix}

(24)

Consequently, by substituting (12), and (22) into (16), we can obtain the posterior pdf

p (r_{k} | y_{1 : k})

of the run length.

\begin{matrix} p (r_{k} | y_{1 : k}) \propto \{\begin{matrix} p (y_{k} | y_{1 : k - 1}) (1 - H (r_{k - 1} + 1)) p (r_{k - 1} | y_{1 : k - 1}), & r_{k} = r_{k - 1} + 1 \\ p (y_{k} | y_{1 : k - 1}) \sum_{r_{k - 1}} H (r_{k - 1} + 1) p (r_{k - 1} | y_{1 : k - 1}), & r_{k} = 1 \end{matrix} \end{matrix}

(25)

A change point is formally identified when the absolute difference between successive run length estimates

| r_{k}^{*} - r_{k - 1}^{*} |

exceeds a preset detection threshold

δ

, mathematically expressed as:

\begin{matrix} r_{k}^{*} = arg max_{r} {p (r_{k} | y_{1 : k})} \end{matrix}

(26)

\begin{matrix} | r_{k}^{*} - r_{k - 1}^{*} | > δ \end{matrix}

(27)

To estimate the latent variables

{x_{k}, I_{k}}

, we employ a mean-field variational family that assumes mutual independence among the latent variables. Specifically, each latent variable is governed by a distinct factor within the variational density. Consequently, the variational distribution,

Q_{k} = q (x_{k}, I_{k} | r_{k})

, serves as an approximation to the true posterior distribution

p (x_{k}, I_{k} | r_{k}, y_{1 : k})

through a free-form factorization, which can be formulated as:

\begin{matrix} Q_{k} = q (x_{k} | r_{k}) q (I_{k} | r_{k}) = q (x_{k}; {\hat{x}}_{k}^{r}, {\hat{P}}_{k}^{r}) q (I_{k}; {\hat{μ}}_{k}^{r}) \end{matrix}

(28)

Remark 1.

The mean-field variational family assumes that the latent variables are mutually independent. In highly dynamic or nonlinear systems, this assumption may seem restrictive at first glance. In such systems, the latent variables are likely to be highly correlated. Despite this, mean-field factorization can still be useful. It provides a computationally efficient way to approximate the variational posterior. In some cases, even if the variables are correlated, the mean-field approximation can capture the main characteristics of the distribution. As shown in the reference paper [17], the mean-field approximation can capture any marginal density of the latent variables, which can be sufficient for certain types of analysis. Relevant literature can be found where run-length modeling is also employed to capture maneuvering behavior in highly dynamic scenarios, and mean-field factorization is adopted to perform approximate variational inference in nonlinear systems [27,30].

However, the limitations of the mean-field approximation are also noteworthy. Specifically, it fails to capture dependencies between latent variables, which can be helpful for accurate modeling in highly dynamic or nonlinear systems [17]. To address this issue, structured variational approximations can be considered. As noted in [31], hierarchical variational models (HVMs) and copula variational inference (copula VI) are two approaches that aim to preserve such dependencies. HVMs introduce a prior over the variational parameters and marginalize them out to model latent dependencies, while copula VI leverages a copula distribution to explicitly restore the correlations among latent variables.

With the variational parameter denoted as

λ_{k} = \{{\hat{x}}_{k}^{r}, {\hat{P}}_{k}^{r}, {\hat{μ}}_{k}^{r}\}

, the ELBO according to the standard variational method is given by:

\begin{matrix} B (λ_{k}) & = E_{Q_{k}} [log p (y_{k}, x_{k}, I_{k} | r_{k}, y_{1 : k - 1}) - log q (x_{k}, I_{k} | r_{k})] \end{matrix}

(29)

The optimal variational parameters

λ_{k}^{*}

can be obtained by maximizing the ELBO

B (λ_{k})

as follows:

\begin{matrix} λ_{k}^{*} = \arg max_{λ_{k}} B (λ_{k}) \end{matrix}

(30)

By taking the logarithm of both sides of (18) and subsequently incorporating (19)–(21) into (18), the logarithmic joint distribution

F_{k} = log p (y_{k}, x_{k}, I_{k} | r_{k}, y_{1 : k - 1})

can be decomposed as follows:

\begin{matrix} F_{k} = & log [\prod_{m = 1}^{M} N {(y_{k} | H_{m, k} x_{k}^{r}, R_{m, k})}^{I_{m, k}^{r}}] \\ + log [\prod_{m = 1}^{M} N {(x_{k}^{r} | {\tilde{x}}_{m, k}^{r}, {\tilde{P}}_{m, k}^{r})}^{I_{m, k}^{r}}] + log [\prod_{m = 1}^{M} {({\tilde{μ}}_{m, k}^{r})}^{I_{m, k}^{r}}] \end{matrix}

(31)

Derivations of $q (x_{k}; {\hat{x}}_{k}^{r}, {\hat{P}}_{k}^{r})$

The expression of the state’s expected parameters is as follows:

\begin{matrix} E_{x} = \{E_{q (x_{k}^{r})} [x_{k}^{r}], E_{q (x_{k}^{r})} [x_{k}^{r} {(x_{k}^{r})}^{T}]\} \end{matrix}

(32)

Rewriting the ELBO

B (λ_{k})

as the function of the state’s expected parameters

E_{x}

, and omitting the rest terms that are independent of

x_{k}^{r}

, denoted by

B_{x}

for brevity,

\begin{matrix} B_{x} & = E_{Q_{k}} {log [\prod_{m = 1}^{M} N {(y_{k} | H_{m, k} x_{k}^{r}, R_{m, k})}^{I_{m, k}^{r}}] \\ + log [\prod_{m = 1}^{M} N {(x_{k} | {\tilde{x}}_{m, k}^{r}, {\tilde{P}}_{m, k}^{r})}^{I_{m, k}^{r}}] - log N (x_{k} | {\hat{x}}_{k}^{r}, {\hat{P}}_{k}^{r})} \end{matrix}

(33)

Extending the express and omitting the constant term,

B_{x}

can be further simplified to

\begin{matrix} B_{x} = \frac{1}{2} tr \{[{({\hat{P}}_{k}^{r})}^{- 1} - J_{k}^{1}] E_{q (x_{k}^{r})} [x_{k}^{r} {(x_{k}^{r})}^{T}]\} + tr \{[J_{k}^{2} - {({\hat{P}}_{k}^{r})}^{- 1} {\hat{x}}_{k}^{r}] E_{q (x_{k}^{r})} [x_{k}^{r}]\} \end{matrix}

(34)

with

\begin{matrix} J_{k}^{1} = \sum_{m = 1}^{M} {\hat{μ}}_{m, k}^{r} [{({\tilde{P}}_{m, k}^{r})}^{- 1} + H_{m, k}^{T} R_{m, k}^{- 1} H_{m, k}] \end{matrix}

(35)

\begin{matrix} J_{k}^{2} = \sum_{m = 1}^{M} {\hat{μ}}_{m, k}^{r} [{({\tilde{P}}_{m, k}^{r})}^{- 1} {\tilde{x}}_{m, k}^{r} + H_{m, k}^{T} R_{m, k}^{- 1} y_{k}] \end{matrix}

(36)

By setting the derivative with respect to the expected parameter equal to 0, we have

\begin{matrix} {({\hat{P}}_{k}^{r})}^{- 1} = \sum_{m = 1}^{M} {\hat{μ}}_{m, k}^{r} [{({\tilde{P}}_{m, k}^{r})}^{- 1} + H_{m, k}^{T} R_{m, k}^{- 1} H_{m, k}] \end{matrix}

(37)

\begin{matrix} {({\hat{P}}_{k}^{r})}^{- 1} {\hat{x}}_{k}^{r} = \sum_{m = 1}^{M} {\hat{μ}}_{m, k}^{r} [{({\tilde{P}}_{m, k}^{r})}^{- 1} {\tilde{x}}_{m, k}^{r} + H_{m, k}^{T} R_{m, k}^{- 1} y_{k}] \end{matrix}

(38)

Derivations of $q (I_{k}; {\hat{μ}}_{k}^{r})$

Analogous to the derivation process of

q (x_{k}; {\hat{x}}_{k}^{r}, {\hat{P}}_{k}^{r})

, we reformulate the ELBO

B (λ_{k})

as the function of the expected parameters of the model identity,

E_{q (I_{k}^{r})} [I_{m, k}^{r}]

, and omit the rest terms that are independent of

I_{k}^{r}

, denoted by

B_{I}

for brevity,

\begin{matrix} B_{I} = & E_{Q_{k}} [log \prod_{m = 1}^{M} N {(y_{k} | H_{m, k} x_{k}^{r}, R_{m, k})}^{I_{m, k}^{r}}] \\ + E_{Q_{k}} [log \prod_{m = 1}^{M} N {(x_{k}^{r} | {\tilde{x}}_{m, k}^{r}, {\tilde{P}}_{m, k}^{r})}^{I_{m, k}^{r}}] \\ + E_{Q_{k}} [log \prod_{m = 1}^{M} {({\tilde{μ}}_{m, k}^{r})}^{I_{m, k}^{r}} - log \prod_{m = 1}^{M} {({\hat{μ}}_{m, k}^{r})}^{I_{m, k}^{r}}] \end{matrix}

(39)

Upon further calculation and ignoring the constant term,

B_{I}

can be written as:

\begin{matrix} B_{I} = tr \{\sum_{m = 1}^{M} [(A_{k} + log {\tilde{μ}}_{m, k}^{r} - log {\hat{μ}}_{m, k}^{r}) E_{q (I_{k}^{r})} [I_{m, k}^{r}]]\} \end{matrix}

(40)

By equating the derivative of (40) with respect to

E_{q (I_{k}^{r})} [I_{m, k}^{r}]

to zero, we arrive at:

\begin{matrix} {\hat{μ}}_{m, k}^{r} \propto {\tilde{μ}}_{m, k}^{r} \cdot exp (A_{k}) \end{matrix}

(41)

where the expression for

A_{k}

is presented as follows:

\begin{matrix} A_{k} = & - \frac{1}{2} tr \{R_{m, k}^{- 1} E_{k}^{1}\} - \frac{1}{2} tr \{{({\tilde{P}}_{m, k}^{r})}^{- 1} E_{k}^{2}\} - \frac{1}{2} log |{\tilde{P}}_{m, k}^{r}| \\ - \frac{n_{x}}{2} log (2 π) - \frac{1}{2} log |R_{m, k}| - \frac{n_{y}}{2} log (2 π) \end{matrix}

(42)

with

\begin{matrix} E_{k}^{1} = E_{q (x_{k}^{r})} [(y_{k} - H_{m, k} x_{k}^{r}) {(y_{k} - H_{m, k} x_{k}^{r})}^{T}] \end{matrix}

(43)

\begin{matrix} E_{k}^{2} = E_{q (x_{k}^{r})} [(x_{k}^{r} - {\tilde{x}}_{m, k}^{r}) {(x_{k}^{r} - {\tilde{x}}_{m, k}^{r})}^{T}] \end{matrix}

(44)

The expected expressions involved in (43) and (44) are given as follows:

\begin{matrix} E_{q (x_{k}^{r})} [x_{k}^{r} {(x_{k}^{r})}^{T}] ≜ {\hat{x}}_{k}^{r} {({\hat{x}}_{k}^{r})}^{T} + {\hat{P}}_{k}^{r}, E_{q (x_{k}^{r})} [x_{k}^{r}] ≜ {\hat{x}}_{k}^{r} \end{matrix}

(45)

Based on the preceding derivations, we derive the posterior distribution of the run length

r_{k}

, denoted as

p (r_{k} | y_{1 : k})

via Bayes’ theorem. Subsequently, by applying variational Bayesian methods, we obtain the conditional posterior distributions of

x_{k}

and

I_{k}

, expressed as

q (x_{k} | r_{k})

and

q (I_{k} | r_{k})

, respectively. Following the definition of the mixed posterior distribution, we have:

\begin{matrix} q (I_{k}) = \sum_{r_{k}} q (I_{k} | r_{k}) p (r_{k} | y_{1 : k}) \end{matrix}

(46)

\begin{matrix} q (x_{k}) = \sum_{r_{k}} q (x_{k} | r_{k}) p (r_{k} | y_{1 : k}) \end{matrix}

(47)

To proceed, we utilize conjugate computations and the information filtering method outlined in [32] to derive the update formulas for the posterior distribution parameters of

q (x_{k}; {\hat{x}}_{k}, {\hat{P}}_{k})

and

q (I_{k}; {\hat{μ}}_{k})

, i.e.,

\begin{matrix} {\hat{μ}}_{m, k} & = \sum_{r_{k}} p (r_{k} | y_{1 : k}) {\hat{μ}}_{m, k}^{r} \end{matrix}

(48)

\begin{matrix} {\hat{P}}_{k}^{- 1} & = \sum_{r_{k}} p (r_{k} | y_{1 : k}) {({\hat{P}}_{k}^{r})}^{- 1} \end{matrix}

(49)

\begin{matrix} {\hat{P}}_{k}^{- 1} {\hat{x}}_{k} & = \sum_{r_{k}} p (r_{k} | y_{1 : k}) {({\hat{P}}_{k}^{r})}^{- 1} {\hat{x}}_{k}^{r} \end{matrix}

(50)

The proposed VBOCPDMS algorithm can be summarized in Algorithm 1.

Algorithm 1 VBOCPDMS: variational Bayesian online change-point detection with model selection.

Require:: Measurement $y_{k}$ , approximated posterior pdfs $q (x_{k - 1}^{r})$ , $q (I_{k - 1}^{r})$ , $p (r_{k - 1})$ at last time, iterations $N_{m a x}$ , model domain $M$ , detection threshold $δ_{r}$ , penalty function $H (τ)$ ;
Ensure:: Approximated posterior pdfs $q (x_{k - 1}^{r})$ , $q (I_{k - 1}^{r})$ , $q (x_{k})$ , $q (I_{k})$ and $p (r_{k - 1})$ at current time;
1:: Time prediction:
2:: Calculate prior pdf $p (x_{k}^{r} | y_{1 : k - 1})$ via (6)
3:: Calculate prior pdf $p (I_{k}^{r} | y_{1 : k - 1})$ via (13)
4:: Calculate prior pdf $p (r_{k} | y_{1 : k - 1})$ via (12)
5:: Measurement update:
6:: update run length parameters via (16)
7:: Initialization:
8:: $q^{(0)} (x_{k}^{r}) = p (x_{k}^{r} | y_{1 : k - 1})$ , $q^{(0)} (I_{k}^{r}) = p (I_{k}^{r} | y_{1 : k - 1})$ ;
9:: for all $n = 1 : N_{m a x}$ do
10:: update posterior $q (x_{k - 1}^{r})$ : update variational parameters via (37) and (38)
11:: update posterior $q (I_{k - 1}^{r})$ : update variational parameters via (41)
12:: end for
13:: Hybrid estimation:
14:: Calculate state estimation $\hat{x_{k}}$ and corresponding covariance $\hat{P_{k}}$ via (49) and (50);
15:: New run length initialization:
16:: Calculate the likelihood $p (y_{k} | y_{1 : k - 1})$ and select the ${\hat{μ}}_{0}^{*}$ via (15)
17:: Change point detection:
18:: Calculate the MAP estimate $r_{k}^{*}$ of $r_{k}$ via (27)

Remark 2.

The computational complexity of the proposed algorithm primarily arises from two factors: (1) the increasing number of run-lengths over time, and (2) the iterative optimization required for variational inference. To address the first issue, we adopted the pruning strategy proposed in [29], which effectively controls the growth of the run-lengths. In theory, pruning the number of run-lengths may lead to a decrease in algorithm performance. However, it is important to note that the performance improvement gained from increasing the number of run-lengths is also limited. Experimental analysis shows that when the number of run-lengths is kept around 15, the algorithm strikes a good balance between computational load and filtering accuracy.

To address the second issue, we propose two parameter settings to reduce the number of iterations. The first is the most intuitive approach—setting the maximum iteration

I_{m a x} < 5

. Since the average number of iterations in the experiments is around 5, reducing

I_{m a x}

naturally reduces the computational load, and even with

I_{m a x} = 1

, the algorithm still outperforms the comparison algorithms in terms of accuracy. Alternatively, we can set the

2 a

value in the reinitialization model weights to be close to 1. This approach brings the initial weights closer to the true weights during maneuvers, resulting in an average iteration count of around 2, with minimal loss in filtering accuracy. Therefore, this is the most recommended method for reducing computational burden in engineering applications.

These conclusions are supported by the experiments in Section 4.1.5. Additionally, since the state filtering computations for different run-lengths are independent, parallel computing can be employed to further reduce runtime when hardware resources are sufficient.

4. Experimental Evaluation

We evaluate the proposed VBOCPDMS algorithm against existing methods using six aerial maneuvering simulations and two real scenarios. Performance was measured by tracking accuracy and maneuver detection capability.

4.1. Simulation Scenarios

4.1.1. Scenario Setup

In this study, we employ the following six representative simulation scenarios for high-speed, highly maneuvering aerial targets as proposed in [33]. In all scenarios, the total number of steps is set to 186 with a measurement sampling period of T = 1 s. The measurement noise is defined as

R = diag (10^{4}, 10^{4})

. The target trajectories for the six simulation scenarios are shown in Figure 2, and the details are described as follows.

S1 (shown in Figure 2a): The target is a large aircraft. During the intervals

k \in [60, 79)

and

k \in [111, 130)

, the aircraft executes turning maneuvers with accelerations of

2 g

(with

g

representing gravitational acceleration) and

3 g

, respectively. At all other times, the target maintains uniform linear motion. The change-point instants for this scenario are

[60, 79, 111, 130]

.

S2 (shown in Figure 2b): The target is a small, agile aircraft. During the intervals

k \in [31, 54)

and

k \in [101, 115)

, the aircraft performs a

90^{\circ}

turn with an acceleration of

2.5 g

and a turn with an acceleration of

4 g

, respectively. In the interval

k \in [54, 101)

, the target gradually decelerates, while it maintains a constant speed during the remaining periods. The change-point instants for this scenario are

[31, 54, 101, 115]

.

S3 (shown in Figure 2c): The target is a high-speed, medium-sized bomber. During the intervals

k \in [31, 40)

and

k \in [75, 91)

, the bomber performs a

45^{\circ}

turn with an acceleration of

4 g

and a

90^{\circ}

turn with an acceleration of

4 g

, respectively. Between

k \in [91, 115)

, the target continues turning while gradually decelerating; during all other periods, it maintains uniform linear motion. The change-point instants for this scenario are

[31, 40, 75, 91, 115]

.

S4 (shown in Figure 2d): The target is a high-speed, medium-sized bomber. During the intervals

k \in [31, 40)

and

k \in [72, 91)

, the aircraft executes a

45^{\circ}

turn with an acceleration of

4 g

and a turn with an acceleration of

6 g

, respectively. At other times, the target moves at a constant speed in the horizontal plane. The change-point instants for this scenario are

[31, 40, 72, 82, 91, 136]

.

S5 (shown in Figure 2e): The target is a fighter aircraft whose trajectory comprises three constant turning maneuvers, accompanied by significant acceleration throughout the flight. Specifically, the turning intervals are

k \in [30, 40)

,

k \in [60, 72)

, and

k \in [118, 128)

, with corresponding turning accelerations of

5 g

,

7 g

, and

6 g

, respectively. The change-point instants for this scenario are

[30, 40, 60, 72, 118, 130, 143, 163]

.

S6 (shown in Figure 2f): The target is a fighter aircraft with a trajectory that includes four turning maneuvers. After the second turn, the aircraft reduces its altitude and speed before initiating the third turn; following the third turn, it rapidly accelerates to enter the fourth turn. The turning intervals are

k \in [31, 45)

,

k \in [70, 87)

,

k \in [116, 132)

, and

k \in [151, 160)

, with corresponding turning accelerations of

7 g

,

6 g

,

6 g

, and

7 g

, respectively. The change-point instants for this scenario are

[31, 45, 70, 87, 116, 132, 151, 160]

.

4.1.2. Algorithm Parameter Settings

Considering that the models differ only in the process noise covariance matrix and should cover various motion modes ranging from nearly constant velocity to highly maneuvering behavior, the state dimension is set as

n_{x} = 4

, and the state transition model is based on uniform linear motion. The corresponding model parameters are as follows:

\begin{matrix} F_{k} = I_{2} \otimes [\begin{matrix} 1 & T \\ 0 & 1 \end{matrix}] Q_{k} = σ_{q}^{2} \times I_{2} \otimes [\begin{matrix} T^{4} / 4 & T^{3} / 2 \\ T^{3} / 2 & T^{2} \end{matrix}] \end{matrix}

(51)

Assuming a linear measurement model, the measurement transformation matrix

H_{k}

and measurement noise matrix

R_{k}

are given by:

\begin{matrix} H_{k} = [\begin{matrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \end{matrix}] R_{k} = 10^{4} \times [\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}] \end{matrix}

(52)

Thus, for the relatively weak maneuvering scenarios (S1 and S2), the set of process noise models is defined as

σ_{q} \in {10, 20, 40} / 2

, while for the strongly maneuvering scenarios (S3–S6), the process noise models are set as

σ_{q} \in {10, 20, 40}

.

IMM [7]: This is the standard interacting multiple model filter, whose model weights are initialized using $[1 / 3, 1 / 3, 1 / 3]$ .
IEE [24]: This is an information theoretic IMM, whose model weights are initialized using $[1 / 3, 1 / 3, 1 / 3]$ .
IT-IMM [32]: This is a multi-model filter based on VB, whose model weights are initialized using $[1 / 3, 1 / 3, 1 / 3]$ .
VBOCPDMS: The filter mode weights are initialized using $[1 / 3, 1 / 3, 1 / 3]$ . When a change point is assumed to exist, there are three possible choices for reinitializing the weights: $I_{0} = [0.8, 0.1, 0.1]$ , $I_{1} = [0.1, 0.8, 0.1]$ , and $I_{2} = [0.1, 0.1, 0.8]$ , which respectively indicate that the maneuver at the change point is biased toward a specific mode. The maximum number of iterations is $I_{m a x} = 50$ , the maximum number of retained run lengths is set to $N_{m a x} = 10$ , and the penalty function is defined as $H (τ) = 15 / 186$ .

For IMM, IT-IMM and IEE filters, the mode transition probability matrix

P_{m}

is defined as:

\begin{matrix} P_{m} = [\begin{matrix} 0.9 & 0.05 & 0.05 \\ 0.05 & 0.9 & 0.05 \\ 0.05 & 0.05 & 0.9 \end{matrix}] \end{matrix}

(53)

For the proposed VBOCPDMS algorithm, the iterative process is terminated when the difference between successive state estimates is below a given threshold (e.g., when

| | {\hat{x}}_{k}^{(n + 1)} - {\hat{x}}_{k}^{(n)} | | < 1 e - 6

) or when the maximum number of iterations

I_{m a x} = 50

is reached. In addition, we conduct 100 Monte Carlo runs for each simulation scenario.

4.1.3. Performance Evaluation Metrics

The tracking performance of the target position is evaluated using the root mean square error (RMSE) and the average root mean square error (ARMSE) for the target position. The RMSE and ARMSE are calculated as follows. In this paper, these metrics are adopted to assess tracking accuracy:

\begin{matrix} {RMSE}_{pos} ≜ \sqrt{\frac{1}{N_{t}} \sum_{N_{t}} [{(p_{k}^{x} - p_{k, t}^{x})}^{2} + {(p_{k}^{y} - p_{k, t}^{y})}^{2}]} \end{matrix}

(54a)

\begin{matrix} {ARMSE}_{pos} ≜ \frac{1}{N_{s}} \sum_{N_{s}} {RMSE}_{pos} \end{matrix}

(54b)

Here,

N_{s}

denotes the number of Monte Carlo simulation runs,

N_{t}

represents the total number of simulation steps,

p_{k}^{x}

and

p_{k, t}^{x}

indicate the estimated and true positions in the x-direction at time k, respectively, while

p_{k}^{y}

and

p_{k, t}^{y}

denote the estimated and true positions in the y-direction.

For evaluating the change-point detection capability of the proposed VBOCPDMS algorithm, an improved F1-score is used to quantify detection accuracy. This metric is selected because the exact positions of change-points in real time-series data may be subject to randomness (e.g., process noise, measurement noise), and experts seldom agree on the precise locations of change-points. To address this issue, following the approach in [28], change-point detection is formulated as a classification problem between “change-point” and “non-change-point”. The F1-score is then computed as:

\begin{matrix} F_{1} = \frac{2 P R}{P + R} \end{matrix}

(55)

where

P

denotes precision (the ratio of correctly detected change-points to the total number of detected change-points), and

R

denotes recall (the ratio of correctly detected change-points to the total number of true change-points).

As mentioned earlier, the calculation of the F1-score requires a clear definition of correct change-point detection. Specifically, if the algorithm detects a change-point within a tolerance range

E \geq 0

of a true change-point, then that detection is considered correct. Furthermore, to avoid double counting, only one detection within the tolerance range

E

around a true change-point is counted as a true positive. Formally, let C denote the set of change-points detected by the algorithm, T the set of true change-points, and

D (T, C)

the set of true change-points that have been detected. For each

γ \in D (T, C)

, there exists a

c \in C

such that

|γ - c| \leq E

, and each true change-point

γ

is associated with only one such c. Based on this definition, the precision

P

and recall

R

are computed as:

\begin{matrix} P = \frac{| D (T, C) |}{| C |} R = \frac{| D (T, C) |}{| T |} \end{matrix}

(56)

4.1.4. Results

Figure 3 presents the ARMSE curves of the target position under different scenarios for four filtering algorithms. Gray dashed lines indicate the change points of the target’s acceleration, i.e., the onset of maneuvers. Different filtering algorithms are represented by distinct curve markers: the IMM algorithm is depicted with blue square markers, the IEE algorithm with orange cross markers, the IT-IMM algorithm is depicted with green circle markers, and the proposed VBOCPDMS algorithm with red asterisk markers. Subfigures (a) to (f) correspond to simulation scenarios S1 through S6, respectively.

All algorithms perform similarly in non-maneuvering segments due to their different motion models and optimal mixed estimation via probabilistic weighting. The VBOCPDMS algorithm, however, outperforms IMM, IEE, and IT-IMM, benefiting from variational iterative optimization for better handling of non-maneuvering models.

During the maneuver process, the RMSE of IMM, IEE and IT-IMM all increased significantly, especially after a change point. Among them, the increase of IEE was the largest, followed by that of IT-IMM, and the increase of IMM was relatively small. In contrast, the VBOCPDMS algorithm’s RMSE also rises but declines faster than the baseline algorithms. IMM shows a trade-off, achieving better accuracy in non-maneuvering scenarios at the cost of poorer accuracy during maneuvers. VBOCPDMS achieves good estimation accuracy during maneuvers by adaptively adjusting prior model weights based on run length and switching models promptly. In contrast, IMM, IEE, and IT-IMM use fixed prior weights and transition matrices, causing delays in model switching and competition between models, which degrades performance. VBOCPDMS speeds up parameter estimation by using hard-decision initialization to reduce secondary model weights during maneuvers.

Table 1 shows that the VBOCPDMS filtering algorithm outperforms IMM, IEE, and IT-IMM in filtering accuracy across six simulation scenarios, with average RMSE rankings of VBOCPDMS < IT-IMM < IMM < IEE. VBOCPDMS effectively detects target maneuvering pattern changes through online change-point detection for timely responses. It uses run-length probability weighting for maximum entropy in state estimation, demonstrating superior tracking accuracy in all scenarios.

Figure 4 illustrates the true and estimated trajectories across six simulation scenarios. Our proposed algorithm (red) closely follows the true trajectory (black) during target maneuvers, unlike the IEE algorithm (orange), which deviates significantly. This supports our analysis showing the proposed method’s superior accuracy, especially during abrupt maneuvers where traditional algorithms falter.

Table 2 analyzes the computational costs of various algorithms. The IEE algorithm has the highest efficiency and the shortest running time, followed by the IMM algorithm. The running time of the IT-IMM algorithm is relatively slightly longer. Conversely, the proposed VBOCPDMS algorithm is slower due to iterative computation for estimating posterior states and weights. Meanwhile, it incorporates maneuver detection, requiring an extra discrete variable estimated at each time step, growing with time. Table 3 shows the average iteration count across scenarios of VBOCPDMS, all notably below the maximum limit

I_{m a x} = 50

, indicating good convergence.

To evaluate the effectiveness and accuracy of the VBOCPDMS algorithm in change point detection, Table 4 presents the F1-scores under different scenarios. By analyzing these data, it can be observed that in scenarios with weak target maneuverability (such as S1), the algorithm’s F1-score is relatively low. In contrast, in scenarios with stronger maneuverability or higher maneuvering frequency (such as S6), the F1-score improves significantly. This suggests that the strength and frequency of target maneuvers have a notable impact on the algorithm’s change point detection performance—the stronger the maneuverability, the better the detection effect.

Furthermore, it is worth noting that when a larger tolerance range

E

is selected, the algorithm’s F1-score exhibits an obvious upward trend, reflecting that change point detection has a certain degree of latency. However, this does not indicate that the proposed algorithm struggles with handling target maneuvers. In fact, the VBOCPDMS algorithm inherently possesses the ability to identify maneuver parameters (i.e., estimate run length probability). Under the variational Bayesian joint optimization framework, change point detection and maneuver parameter identification are inherently coupled. If the algorithm can effectively respond to the ongoing maneuvering change point through parameter estimation, it can effectively alleviate the problem of model mismatch caused by model switching delay. Therefore, the accuracy of change point detection is no longer an absolute evaluation criterion, but depends on the actual ability of the algorithm to effectively deal with maneuvering change points.

4.1.5. Robustness Analysis

To verify the robustness of the proposed algorithm under various parameter settings and to further elucidate the roles of these parameters during the filtering process, a series of controlled experiments were conducted in this subsection. Considering that scenarios S1 and S2 involve relatively weak target maneuvers, and that demonstrating robustness under strong maneuvering conditions is more convincing, we selected simulation scenarios S3 through S6 for the robustness analysis. In each scenario, the focus was placed on the variation of a single algorithm parameter.

Table 5 analyzes the impact of varying parameter

H (τ)

on the performance of the VBOCPDMS algorithm. In scenarios S3 and S4, the ARMSE is minimized when the parameter

H (τ)

is set to 10, whereas in scenarios S5 and S6, the minimum ARMSE occurs when the parameter

H (τ)

is set to 15. This is because the parameter

H (τ)

essentially reflects the maneuvering frequency of the target. Targets in scenarios S3 and S4 exhibit relatively lower maneuvering frequencies, corresponding to a parameter value of 10, while targets in scenarios S5 and S6 have higher maneuvering frequencies, corresponding to a parameter value of 15. A properly selected parameter

H (τ)

can effectively capture the target’s maneuvering behavior, thereby achieving the best filtering performance. When the parameter selection is not perfectly aligned with the actual maneuvering frequency, the filtering performance slightly degrades; however, it still outperforms the baseline algorithms overall.

Table 6 analyzes the impact of varying parameter

I_{m a x}

on the performance of the VBOCPDMS algorithm. Given that the average number of iterations required for convergence is approximately five, it can be observed that the minimum ARMSE across all four scenarios is achieved when the maximum allowable iterations are set to either 10 or 50. Moreover, once the number of iterations exceeds five, the filtering performance exhibits negligible further changes, indicating that convergence has been effectively achieved. It is particularly noteworthy that even when the number of iterations is limited to one or two, the proposed algorithm still outperforms the baseline methods. This property provides a valuable means to balance filtering accuracy and computational efficiency in practical engineering applications.

Table 7 analyzes the impact of varying reinitialization model weights on the performance of the VBOCPDMS algorithm. In the “weights” column of the table, the listed values correspond to

2 a

, leading to the reinitialization model weights defined as

I_{0} = [2 a, 0.5 - a, 0.5 - a]

,

I_{1} = [0.5 - a, 2 a, 0.5 - a]

, and

I_{2} = [0.5 - a, 0.5 - a, 2 a]

. It can be observed that when

2 a = 1 / 3

, corresponding to an equal distribution of initialization weights, the filtering performance is significantly degraded. In this case, the initialization weight selection mechanism effectively becomes inactive, resulting in uniformly distributed initial weights. As the initialization weights become more biased toward their respective models, the filtering performance improves. Moreover, as

2 a

approaches 1, the filtering accuracy remains consistently high, while the average number of iterations decreases from approximately seven to about two. This characteristic can be exploited in practical engineering applications to substantially reduce computational burden.

Table 8 analyzes the impact of varying pruning threshold

N_{m a x}

on the performance of the VBOCPDMS algorithm. When the pruning threshold

N_{\max} = 5

, the filtering performance of the VBOCPDMS algorithm is inferior to that of the baseline methods. However, as the pruning threshold increases, the filtering performance gradually improves and eventually converges. This observation indicates that the number of retained run-lengths should be greater than 10 to achieve satisfactory performance. Further increasing the number of run-lengths yields only marginal performance gains while significantly increasing the computational burden, which is therefore not recommended.

4.2. Real Scenarios

In this section, we further evaluate the estimation performance of the proposed algorithm in two sets of real maneuvering target tracking scenarios by using the two-dimensional radar dataset.

4.2.1. Scenario Setup

The targets are detected and tracked using radar. The motion patterns of the two targets are detailed as follows:

R1 (shown in Figure 5a): This target performs a composite maneuver consisting of six segments: uniform motion, left turn, uniform motion, right turn, uniform acceleration, and a figure-eight circling pattern. The true change points in its motion occur at frames [13, 23, 34, 66, 73, 81, 116, 102]. The radar sampling interval is

T = 10 s

, and the sequence comprises a total of 121 frames.

R2 (shown in Figure 5b): It also follows a six-segment composite maneuver: uniform motion, left turn, uniform motion, right turn, uniform motion, and an O-shaped circling pattern. The actual maneuver transition points are located at frames [13, 34, 65, 80, 100]. The radar sampling interval is

T = 10 s

, with a total of 105 frames.

4.2.2. Algorithm Parameter Settings and Performance Evaluation Metrics

Real scenarios use the same matrix forms as simulations. In R1, the process noise model set is

σ_{q} \in {20, 35, 50}

and the change-point detection probability for VBOCPDMS is

49 / NStep

. In R2, the process noise model set is

σ_{q} \in {5, 15, 30}

and the change-point detection probability is

15 / NStep

.

The reinitialization model weights are given by

I_{0} = [0.01, 0.495, 0.495]

,

I_{1} = [0.1, 0.8, 0.1]

and

I_{2} = [0.1, 0.1, 0.8]

; the maximum number of retained run lengths is set to

N_{m a x} = 15

for the two real scenarios. Other parameters remain identical to those in the simulation scenarios, and the performance evaluation metrics are the same.

4.2.3. Results

Figure 6 shows the RMSE curves for target position estimation across different algorithms in scenarios R1 and R2. Table 9 provides the ARMSE for these algorithms in both scenarios, highlighting VBOCPDMS’s superior filtering performance. In scenario R2, RMSE curves are similar across algorithms during non-maneuvering or weakly maneuvering states, but VBOCPDMS stands out significantly once the target starts maneuvering.

Table 10 and Table 11 detail the RMSE values across different filtering algorithms under various conditions. When the target moves uniformly or with slight turns, all methods perform similarly. However, as turns or acceleration increase, performance decreases. The VBOCPDMS algorithm remains more robust in handling maneuvers, surpassing both the IT-IMM, IMM, and IEE algorithms overall. This is because the IMM algorithm depends on the model transition probability matrix for model switching, which can cause delays and errors if the matrix is inaccurate during strong target maneuvers. Conversely, the VBOCPDMS algorithm uses Bayesian online change point detection for model selection without relying on this matrix, allowing for timely reinitialization of model weights and target states. This adaptability enhances the accuracy and stability of the VBOCPDMS algorithm in managing complex maneuvers.

Table 12 shows the computational cost comparison for different algorithms in real scenarios, supporting the previous simulation analysis and confirming the cost characteristics of the VBOCPDMS algorithm. The higher cost is mainly due to online change-point detection and variational iterative optimization.

With an error range of

E = 5

, the F1-scores of VBOCPDMS in scenarios R1 and R2 were

0.4000

and

0.2857

. Bayesian online change point detection is a soft decision-making method based on manually defined rules. Evaluating it with a single metric is insufficient; thus, the filtering results are crucial, with the F1-score as a secondary criterion.

5. Conclusions

This paper presents a novel maneuvering target tracking method, VBOCPDMS, which integrates variational Bayesian inference and run-length modeling to capture maneuver change points in real time. Within the variational framework, the approximate posterior distributions of the motion state, model weights, and run-length are jointly estimated. The introduction of the run-length variable not only enables accurate detection of change points but also facilitates the adaptive reconfiguration of the algorithm’s parameters during abrupt target maneuvers. Unlike traditional IMM algorithms, which assume a first-order Markov process for model transitions, the proposed method explicitly tracks the run-length, modeling the system evolution in a non-Markovian manner. This design allows VBOCPDMS to retain a longer memory of past system dynamics, which is crucial for accurately capturing sudden maneuver onsets and maintaining robust tracking performance under highly dynamic conditions. Extensive experiments were conducted across six simulation scenarios and two real-world scenarios. The results demonstrate that VBOCPDMS consistently outperforms baseline methods such as IMM, IT-IMM, and IEE in terms of RMSE and maneuver-adaptive responsiveness, especially in cases involving high accelerations and abrupt turns. Furthermore, the proposed method exhibits strong robustness to model parameter uncertainties, highlighting its potential for real-time applications, such as radar surveillance. In future work, further research will be devoted to extending the VBOCPDMS framework to accommodate high-dimensional state spaces and multi-target tracking scenarios.

Author Contributions

Conceptualization, J.W. and Y.C.; methodology, X.W. and M.Y.; formal analysis, X.W. and Y.C.; data curation, J.W.; writing—original draft preparation, X.W.; writing—review and editing, H.L. and X.W.; supervision, H.L.; funding acquisition, H.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Science Foundation of China OF FUNDER grant number No. 62371398.

Data Availability Statement

All data included in this study are available upon request by contact with the corresponding author.

Acknowledgments

We wish to thank all the project team members.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

IMM	Interacting Multiple Model
ELBO	Evidence Lower Bound
KF	Kalman Filter
pdfs	Probability Density Functions
SMC	Sequential Monte Carlo
VB	Variational Bayes
BOCPD	Bayesian Online Changepoint Detection
KL	Kullback–Leibler
RMSE	Root Mean Square Error
ARMSE	Average Root Mean Square Error
IEE	Identity Expectation Estimator
ITIMM	Information Theory Interacting Multiple Model
VBOCPDMS	Variational Bayesian Online Change Point Detection with Multiple Models

References

Li, X.R.; Jilkov, V.P. Survey of maneuvering target tracking. Part I: Dynamic models. IEEE Trans. Aerosp. Electron. Syst. 2003, 39, 1333–1363. [Google Scholar]
Li, X.R.; Jilkov, V.P. Survey of maneuvering target tracking. Part V: Multiple-model methods. IEEE Trans. Aerosp. Electron. Syst. 2005, 41, 1255–1321. [Google Scholar]
Punithakumar, K.; Kirubarajan, T.; Sinha, A. Multiple-model probability hypothesis density filter for tracking maneuvering targets. IEEE Trans. Aerosp. Electron. Syst. 2008, 44, 87–98. [Google Scholar] [CrossRef]
Blackman, S.S. Multiple hypothesis tracking for multiple target tracking. IEEE Aerosp. Electron. Syst. Mag. 2004, 19, 5–18. [Google Scholar] [CrossRef]
Guo, Y.; Huang, B. Moving horizon estimation for switching nonlinear systems. Automatica 2013, 49, 3270–3281. [Google Scholar] [CrossRef]
Watanabe, K.; Tzafestas, S.G. Generalized pseudo-Bayes estimation and detection for abruptly changing systems. J. Intell. Robot. Syst. 1993, 7, 95–112. [Google Scholar] [CrossRef]
Blom, H.A.P.; Bar-Shalom, Y. The interacting multiple model algorithm for systems with Markovian switching coefficients. IEEE Trans. Autom. Control 1988, 33, 780–783. [Google Scholar] [CrossRef]
Liu, W.; Zhang, H.; Wang, Z. A novel truncated approximation based algorithm for state estimation of discrete-time Markov jump linear systems. Signal Process. 2011, 91, 702–712. [Google Scholar] [CrossRef]
Pourbabaee, B.; Meskin, N.; Khorasani, K. Sensor fault detection, isolation, and identification using multiple-model-based hybrid Kalman filter for gas turbine engines. IEEE Trans. Control Syst. Technol. 2015, 24, 1184–1200. [Google Scholar] [CrossRef]
Qiu, J.; Xing, Z.; Zhu, C.; Lu, K.; He, J.; Sun, Y.; Yin, L. Centralized fusion based on interacting multiple model and adaptive Kalman filter for target tracking in underwater acoustic sensor networks. IEEE Access 2019, 7, 25948–25958. [Google Scholar] [CrossRef]
Youn, W.; Ko, N.Y.; Gadsden, S.A.; Myung, H. A novel multiple-model adaptive Kalman filter for an unknown measurement loss probability. IEEE Access 2020, 70, 1–11. [Google Scholar] [CrossRef]
Qu, H.; Pang, L.; Li, S. A novel interacting multiple model algorithm. Signal Process. 2009, 89, 2171–2177. [Google Scholar] [CrossRef]
Sheng, H.; Zhao, W.; Wang, J. Interacting multiple model tracking algorithm fusing input estimation and best linear unbiased estimation filter. IET Radar Sonar Navig. 2017, 11, 70–77. [Google Scholar] [CrossRef]
Xu, L.; Li, X.R.; Duan, Z. Hybrid grid multiple-model estimation with application to maneuvering target tracking. IEEE Trans. Aerosp. Electron. Syst. 2016, 52, 122–136. [Google Scholar] [CrossRef]
Arulampalam, M.S.; Maskell, S.; Gordon, N.; Clapp, T. A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking. IEEE Trans. Signal Process. 2002, 50, 174–188. [Google Scholar] [CrossRef]
Özkan, E.; Smídl, V.; Saha, S.; Lundquist, C.; Gustafsson, F. Marginalized adaptive particle filtering for nonlinear models with unknown time-varying noise parameters. Automatica 2013, 49, 1566–1575. [Google Scholar] [CrossRef]
Blei, D.M.; Kucukelbir, A.; McAuliffe, J.D. Variational inference: A review for statisticians. J. Am. Stat. Assoc. 2017, 112, 859–877. [Google Scholar] [CrossRef]
Särkkä, S.; Nummenmaa, A. Recursive noise adaptive Kalman filtering by variational Bayesian approximations. IEEE Trans. Autom. Control 2009, 54, 596–600. [Google Scholar] [CrossRef]
Mbalawata, I.S.; Särkkä, S.; Vihola, M.; Haario, H. Adaptive Metropolis algorithm using variational Bayesian adaptive Kalman filter. Comput. Stat. Data Anal. 2015, 83, 101–115. [Google Scholar] [CrossRef]
Dong, P.; Jing, Z.; Leung, H.; Shen, K. Variational Bayesian adaptive cubature information filter based on Wishart distribution. IEEE Trans. Autom. Control 2017, 62, 6051–6057. [Google Scholar] [CrossRef]
Huang, Y.; Zhang, Y.; Wu, Z.; Li, N.; Chambers, J. A Novel Adaptive Kalman Filter with Inaccurate Process and Measurement Noise Covariance Matrices. IEEE Trans. Autom. Control 2018, 63, 594–601. [Google Scholar] [CrossRef]
Huang, Y.; Zhang, Y.; Shi, P.; Chambers, J. Variational adaptive Kalman filter with Gaussian-inverse-Wishart mixture distribution. IEEE Trans. Autom. Control 2020, 66, 1786–1793. [Google Scholar] [CrossRef]
Zhang, J.; Wei, G.; Ding, D.; Ju, Y. Distributed Sequential State Estimation Over Binary Sensor Networks with Inaccurate Process Noise Covariance: A Variational Bayesian Framework. IEEE Trans. Signal Inf. Process. Netw. 2025, 11, 1–10. [Google Scholar] [CrossRef]
Ma, Y.; Zhao, S.; Huang, B. Multiple-model state estimation based on variational Bayesian inference. IEEE Trans. Autom. Control 2018, 64, 1679–1685. [Google Scholar] [CrossRef]
Lan, H.; Hu, J.; Wang, Z.; Cheng, Q. Variational Nonlinear Kalman Filtering with Unknown Process Noise Covariance. IEEE Trans. Aerosp. Electron. Syst. 2023, 59, 9177–9190. [Google Scholar] [CrossRef]
Lan, H.; Zhao, S.; Hu, J.; Wang, Z.; Fu, J. Joint State Estimation and Noise Identification Based on Variational Optimization. IEEE Trans. Autom. Control 2024, 1–16. [Google Scholar] [CrossRef]
Lan, H.; Zhao, S.; Mao, Y.; Wang, Z.; Cheng, Q.; Liu, Z. Noise Adaptive Kalman Filtering with Stochastic Natural Gradient Variational Inference. IEEE Trans. Aerosp. Electron. Syst. 2025, 1–17. [Google Scholar] [CrossRef]
Van den Burg, G.J.J.; Williams, C.K.I. An evaluation of change point detection algorithms. arXiv 2020, arXiv:2003.06222. [Google Scholar]
Adams, R.P.; MacKay, D.J. Bayesian online changepoint detection. arXiv 2007, arXiv:0710.3742. [Google Scholar]
Hou, X.; Zhao, S.; Hu, J.; Lan, H. Noise-Adaptive State Estimators with Change-Point Detection. Sensors 2024, 24, 4585. [Google Scholar] [CrossRef]
Zhang, C.; Bütepage, J.; Kjellström, H.; Mandt, S. Advances in variational inference. IEEE Trans. Pattern Anal. Mach. Intell. 2018, 41, 2008–2026. [Google Scholar] [CrossRef] [PubMed]
Li, W.; Jia, Y. An information theoretic approach to interacting multiple model estimation. IEEE Trans. Aerosp. Electron. Syst. 2015, 51, 1811–1825. [Google Scholar] [CrossRef]
Blair, W.D.; Watson, G.A.; Kirubarajan, T.; Bar-Shalom, Y. Benchmark for radar allocation and tracking in ECM. IEEE Trans. Aerosp. Electron. Syst. 1998, 34, 1097–1114. [Google Scholar] [CrossRef]

Figure 1. VBOCPDMS method framework diagram.

Figure 2. The target trajectories for six simulation scenarios: (a) Simulation scenario S1. (b) Simulation scenario S2. (c) Simulation scenario S3. (d) Simulation scenario S4. (e) Simulation scenario S5. (f) Simulation scenario S6.

Figure 3. The ARMSE curves of the target positions of different algorithms in six simulation scenarios: (a) ARMSE curves in S1. (b) ARMSE curves in S2. (c) ARMSE curves in S3. (d) ARMSE curves in S4. (e) ARMSE curves in S5. (f) ARMSE curves in S6.

Figure 4. True trajectory vs. inferred trajectory of different algorithms in six simulation scenarios: (a) Trajectory in S1. (b) Trajectory in S2. (c) Trajectory in S3. (d) Trajectory in S4. (e) Trajectory in S5. (f) Trajectory in S6.

Figure 5. Target trajectory of real scenarios: (a) Real scenario R1. (b) Real scenario R2.

Figure 6. The RMSE curves of the target position in real scenarios: (a) RMSE curves in R1. (b) RMSE curves in R2.

Table 1. Average position RMSE (m) of different algorithms in scenarios S1–S6.

Algorithm	S1	S2	S3	S4	S5	S6
IEE	95.587	98.596	94.848	94.196	105.310	108.680
IMM	86.492	87.477	90.105	89.937	96.586	98.331
IT-IMM	85.052	86.078	87.513	87.297	95.88	97.799
VBOCPDMS	81.530	80.024	86.096	85.128	92.754	94.474

Table 2. Average runtime (s) of different algorithms in the simulation scenarios.

Algorithms	S1	S2	S3	S4	S5	S6
IEE	0.0093	0.0091	0.0081	0.0092	0.0077	0.0078
IMM	0.0151	0.0142	0.0132	0.0135	0.0123	0.0124
IT-IMM	0.0227	0.0213	0.0203	0.0215	0.0184	0.0189
VBOCPDMS	0.7830	0.5769	0.5918	0.6351	0.5989	0.6181

Table 3. Average number of iterations of the VBOCPDMS algorithm in the simulation scenarios.

Algorithms	S1	S2	S3	S4	S5	S6
VBOCPDMS	4.3116	4.3032	4.9460	4.8916	5.1058	5.1662

Table 4. F1-scores of the VBOCPDMS algorithm.

Threshold	S1	S2	S3	S4	S5	S6
$E = 5$	0.3468	0.4186	0.4230	0.5799	0.6214	0.6399
$E = 10$	0.6272	0.6552	0.7389	0.8403	0.8484	0.8961

Table 5. The average position RMSE (m) of

H (τ)