1. Introduction
Real-time state estimation of dynamical systems from noisy observations is one of the most fundamental tasks in localization, tracking, and navigation [1]. The Kalman filter (KF) is the optimal state estimator for a linear Gaussian state-space model, but it requires known noise statistics [2]. However, in many practical situations, the noise covariances are only partially known and may change abruptly [3]. The motivating application of this work is maneuvering-target tracking [4], in which real-world targets such as ground vehicles, aircraft, and ballistic missiles can make very sharp, evasive maneuvers to escape radar tracking and lock-on. Parameters such as the process noise covariance, which accounts for unexpected maneuvers, are unknown and change abruptly between motion-mode segments.
The classical approach to state estimation with uncertain parameters is adaptive estimation [5], which infers the model parameters or noise statistics jointly with the dynamic state. Noise-adaptive estimators can be broadly divided into four categories: Bayesian inference, maximum likelihood estimation, covariance matching, and correlation methods [3]. Bayesian inference is the most general approach, and the other methods can often be interpreted as approximations of it [6]. In the Bayesian approach, a noise-adaptive estimator must compute the joint posterior probability density function (PDF) of the dynamic state and the unknown parameters, which is intractable. Three primary families of methods are used to approximate it: multiple model (MM) methods, sequential Monte Carlo (SMC) methods, and variational Bayesian (VB) methods.
MM methods [7] regard the underlying dynamics as a system that switches among a finite number of models representing different noise levels or system structures. Such switching systems contain both continuous noise uncertainties and discrete model uncertainties. MM methods carry out state estimation and model selection recursively and can be divided into static and dynamic MM estimators, depending on whether the model is allowed to switch. By assuming that model switching is a Markov process, the well-known interacting MM (IMM) estimator [8,9] achieves a trade-off between computation and accuracy and has proven promising for tracking highly maneuvering targets [1]. However, the applicability of IMM estimators depends on the completeness of the model set, and they suffer from the curse of dimensionality. Some extended methods can tackle these issues [10].
SMC methods [11] approximate the intractable joint PDF by propagating a set of random particles drawn from a tractable proposal distribution. For adaptive state estimation with static parameters, one solution is based on artificial parameter evolution, which perturbs particles by adding artificial noise to avoid over-diffuse approximations [12]. An alternative approach is based on particle learning [13], which marginalizes the static parameters out of the posterior distribution and can be implemented online by constructing sufficient statistics [14]. Nemeth et al. [15] extended the work of [12] to time-varying parameters by combining SMC approaches with change-point models. Arnold [16] presented two SMC-based adaptive estimators with time-varying parameters using the concept of artificial parameter evolution. SMC methods provide flexible and accurate Bayesian inference for adaptive estimation but are limited to small-scale state estimation problems because of their heavy computational demand. Moreover, their performance depends on the choice of the proposal distribution: a poorly designed proposal often leads to a poor approximation, especially in high-dimensional estimation problems.
VB methods [17] approximate the intractable joint PDF through optimization. VB-based adaptive state estimators have received significant attention because they are computationally more efficient than SMC methods. Most existing VB methods address adaptive state estimation for a linear state-space model with unknown measurement noise covariance (MNC). By modeling the conjugate prior of the state and MNC as a Gaussian inverse-Wishart distribution, the joint posterior PDF is approximated by a factorized free-form distribution (mean-field approximation) and updated via the coordinate ascent method. Särkkä and Nummenmaa [6] presented the first VB-based adaptive Kalman filter with unknown MNC. The extension to unknown process noise covariance (PNC) is not straightforward because the joint prior of the state and PNC is non-conjugate. Huang et al. [18] extended the work of [6] to both unknown MNC and PNC by placing an inverse-Wishart distribution on the predicted state covariance. Ma et al. [19] constructed a conjugate prior for the state and PNC by introducing auxiliary latent variables. Ardeshiri et al. proposed an adaptive smoother with unknown PNC and MNC. Xu et al. [20] proposed adaptive fixed-lag smoothing with unknown MNC. Ma et al. [19] proposed VB-based joint state estimation and model identification for multiple model systems. Zhu et al. [21] proposed an outlier-robust variational Kalman filter by leveraging Student-t noise modeling. Yu and Meng [22] proposed robust Kalman filters with multiplicative noise modeling. Xia et al. [23] proposed an adaptive variational Kalman filter with unknown MNC to solve the calibration problem. Zhu et al. [24,25] proposed variational Kalman filters for unknown, time-varying, and non-stationary heavy-tailed process and measurement noises. Huang et al. [26] proposed an adaptive Kalman filter with a Gaussian inverse-Wishart mixture distribution for unknown MNC. For nonlinear state-space models, directly optimizing the objective with the coordinate ascent method is intractable. The basic idea is then to approximate the intractable expectations of nonlinear expressions, for example with adaptive Metropolis sampling [27] or the cubature integration rule [28]. An alternative approach is to employ stochastic gradient methods [29]. Lan et al. [30] proposed a nonlinear adaptive Kalman filter with unknown PNC based on stochastic search VB, which achieves high estimation accuracy but suffers from slow iteration convergence.
As stated in [26], existing VB-based adaptive Kalman filters (AKFs) are quite sensitive to the initial value of the PNC. This is mainly because the Kullback–Leibler (KL) divergence is generally a nonconvex objective function, and the coordinate ascent method only guarantees convergence to a local optimum, which depends on the initialization. In maneuvering-target tracking, the dimension of the latent variables (target state, PNC, MNC) is generally larger than the dimension of the measurements, so the variational iterations easily converge to poor local optima. As a result, initialization issues hinder the application of existing adaptive Kalman filters to tracking sharply maneuvering targets, where the PNC may change abruptly and accurate prior information is unavailable. Tracking such targets therefore calls for an adaptive initialization strategy.
Motivated by the challenges of tracking sharply maneuvering targets, this paper develops novel variational adaptive state estimators for joint target state and process noise parameter estimation for a class of linear state-space models with abruptly changing parameters. By combining variational inference with change-point detection in an online Bayesian fashion, two adaptive estimators—a change-point-based adaptive Kalman filter (CPAKF) and a change-point-based adaptive Kalman smoother (CPAKS)—are proposed in a recursive detection and estimation process. In each step, the run-length probability of the current maneuver mode is first calculated, and then the joint posterior of the target state and process noise parameter conditioned on the run length is approximated by variational inference. Compared with existing variational noise-adaptive Kalman filters, the proposed methods are robust to initial iterative value settings, improving their ability to track sharply maneuvering targets. Meanwhile, the change-point detection divides the non-stationary time sequence into several stationary segments, allowing for an adaptive sliding length in the CPAKS method. Finally, the superior tracking performance of the proposed methods is verified using both synthetic and real-world datasets of maneuvering-target tracking.
The remainder of this paper is organized as follows. Section 2 describes the problem formulation of adaptive state estimation with unknown process noise covariance. Section 3 presents the proposed VB-based adaptive state estimation with change-point detection. Section 5 and Section 6 provide performance comparisons using simulated and real data, respectively. Finally, Section 7 concludes this paper.
2. Problem Description
Maneuvering-target tracking can be characterized by the following discrete-time linear state-space model:
$$x_k = F_k x_{k-1} + w_k, \qquad z_k = H_k x_k + v_k,$$
where $x_k \in \mathbb{R}^{n_x}$ is the target kinematic state of dimension $n_x$ and $z_k \in \mathbb{R}^{n_z}$ is the sensor measurement of dimension $n_z$. The state transition matrix $F_k$ and measurement matrix $H_k$ are assumed to be known. The process noise vector $w_k$ and measurement noise vector $v_k$ are zero-mean Gaussian with covariance matrices $Q_k$ and $R_k$, respectively. Assume that the initial state satisfies $x_0 \sim \mathcal{N}(\hat{x}_{0|0}, P_{0|0})$ and that the random variables $x_0$, $w_k$, and $v_k$ are mutually independent.
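As an illustration of this model class, the following minimal sketch simulates a one-dimensional nearly-constant-velocity target. The function names (`cv_model`, `simulate`), the white-noise-acceleration form of the process noise covariance, the position-only measurement, and all numeric values are assumptions chosen for the example, not settings taken from this paper.

```python
import numpy as np

def cv_model(dt, q, r):
    """Matrices of a 1-D nearly-constant-velocity model (position, velocity)."""
    F = np.array([[1.0, dt], [0.0, 1.0]])            # state transition
    H = np.array([[1.0, 0.0]])                       # position-only measurement (assumed)
    Q = q * np.array([[dt**3 / 3, dt**2 / 2],
                      [dt**2 / 2, dt]])              # process noise covariance (assumed form)
    R = np.array([[r]])                              # measurement noise covariance
    return F, H, Q, R

def simulate(F, H, Q, R, x0, steps, rng):
    """Draw one state and measurement trajectory from the linear Gaussian model."""
    n_x, n_z = F.shape[0], H.shape[0]
    xs = np.empty((steps, n_x))
    zs = np.empty((steps, n_z))
    x = np.asarray(x0, dtype=float)
    for k in range(steps):
        x = F @ x + rng.multivariate_normal(np.zeros(n_x), Q)      # state equation
        zs[k] = H @ x + rng.multivariate_normal(np.zeros(n_z), R)  # measurement equation
        xs[k] = x
    return xs, zs

rng = np.random.default_rng(0)
F, H, Q, R = cv_model(dt=1.0, q=0.1, r=25.0)
xs, zs = simulate(F, H, Q, R, x0=[0.0, 200.0], steps=33, rng=rng)
print(xs.shape, zs.shape)  # (33, 2) (33, 1)
```

Increasing `q` in this sketch makes the simulated velocity wander more, which is the sense in which the process noise level accounts for unmodeled maneuvers.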
In modern target tracking, non-cooperative targets are capable of making very sharp and evasive maneuvers to avoid being tracked and locked on. Assume that a hostile aircraft initially travels at a constant velocity (CV) of 200 m/s for 33 s, then enters a constant turn (CT) of 10 deg/s for 33 s, and finally accelerates in a straight line at 3 m/s$^2$ (constant acceleration, CA). As shown in the upper panel of Figure 1, when the nominal CV model is used to represent the target motion, the magnitude of the modeling error along the x-axis caused by target maneuvers varies over time and changes significantly when the motion model switches. As shown in the lower panel of Figure 1, the change points divide the target flight into several non-overlapping motion segments. Each segment is described by its run length, i.e., the length of flight time since the last change point [31], and the model parameters within each run-length segment are assumed to be constant or slowly varying. The objective of this paper is to perform adaptive state estimation by continuously adjusting the process noise level.
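To make the notion of run length concrete, the short sketch below converts a list of known change-point indices into the corresponding run-length sequence. The function name `run_lengths` and the convention that the run length restarts at zero at a change point are assumptions made for illustration; in the tracking problem the change points are of course unknown and must be inferred.

```python
def run_lengths(num_steps, change_points):
    """Run length at each step: time elapsed since the most recent change point.

    Assumed convention: the run length is 0 at step 0 and at every change
    point, and grows by 1 at every other step.
    """
    change_set = set(change_points)
    lengths = []
    for k in range(num_steps):
        if k == 0 or k in change_set:
            lengths.append(0)                 # a new segment starts here
        else:
            lengths.append(lengths[-1] + 1)   # the current segment continues
    return lengths

print(run_lengths(8, [3, 6]))  # [0, 1, 2, 0, 1, 2, 0, 1]
```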
Definition 1 ([31]). Define the discrete random variable $r_k$ as the run length at time k. At each time k, the run length $r_k$ has only two possible outcomes: it either continues to grow ($r_k = r_{k-1} + 1$) if no change point occurs or drops to $r_k = 0$ when a change point occurs.
Remark 1. The run-length variable $r_k$ is used to describe changes in the kinematic model or process noise level. Taking Figure 1 as an example, during the CV motion segment from 0 to 39 s, the run length continues to grow ($r_k = r_{k-1} + 1$). At the time when the CV motion becomes CT motion, the run length drops to zero and then begins to grow again ($r_k = r_{k-1} + 1$). Similarly, the run length is reset and begins to grow again at the start of the CA motion segment.
For non-cooperative maneuvering-target tracking, the process noise covariance $Q_k$, which accounts for motion model uncertainty, is commonly unknown and time-varying and should be estimated along with the target state $x_k$. The VB-based AKF approximates the intractable joint posterior PDF $p(x_k, Q_k \mid z_{1:k})$ through optimization, where the posterior of each latent variable ($x_k$ and $Q_k$) is updated iteratively via the coordinate ascent method. Several variational Bayesian adaptive Kalman filters (VBAKFs) have been proposed in [18,26], in which the unknown process noise covariance is assumed to vary slowly. However, these methods are quite sensitive to their initial values because of the local optimality of VB, which limits their ability to track sharply maneuvering targets [26].
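For reference, the coordinate ascent step mentioned above can be written in its generic mean-field form. This is the standard textbook update under assumed notation ($x_k$ for the state, $Q_k$ for the process noise covariance, $z_{1:k}$ for the measurements up to time k, $q(\cdot)$ for the variational factors, and $i$ for the iteration index); it is not a formula taken from the cited filters.

```latex
\begin{align}
q(x_k, Q_k) &= q(x_k)\, q(Q_k), \\
\log q^{(i+1)}(x_k) &= \mathbb{E}_{q^{(i)}(Q_k)}\!\left[\log p(x_k, Q_k, z_{1:k})\right] + \text{const}, \\
\log q^{(i+1)}(Q_k) &= \mathbb{E}_{q^{(i+1)}(x_k)}\!\left[\log p(x_k, Q_k, z_{1:k})\right] + \text{const}.
\end{align}
```

Each update cannot increase the KL divergence, but the fixed point that the iteration reaches depends on the initial factor $q^{(0)}(Q_k)$, which is where the sensitivity to initialization enters.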
To tackle the initialization problem of the variational iterations, we introduce the run length $r_k$ as an auxiliary latent variable to enhance the performance of the AKF. The run length $r_k$ divides the sequence of model parameters $Q_{1:k}$ into non-overlapping segments. Within each segment, the parameters are dependent, whereas parameters in different segments are independent of each other. Therefore, the initial value of $Q_k$ at each iteration is determined by the specific run length $r_k$.
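The objective of the AKF in the presence of unknown $Q_k$ and $r_k$ is to compute the joint posterior PDF $p(x_k, Q_k, r_k \mid z_{1:k})$. Formally, the well-known recursive Bayesian optimal filter consists of the following prediction-update cycle, starting from the prior PDF $p(x_0, Q_0, r_0)$:
- Prediction: Following the Chapman–Kolmogorov equation, the predictive PDF of the joint latent variables is
$$p(x_k, Q_k, r_k \mid z_{1:k-1}) = \sum_{r_{k-1}} \iint p(x_k, Q_k, r_k \mid x_{k-1}, Q_{k-1}, r_{k-1})\, p(x_{k-1}, Q_{k-1}, r_{k-1} \mid z_{1:k-1})\, \mathrm{d}x_{k-1}\, \mathrm{d}Q_{k-1}.$$
- Update: Given the measurement $z_k$, the posterior PDF is updated using Bayes' rule:
$$p(x_k, Q_k, r_k \mid z_{1:k}) = \frac{p(z_k \mid x_k)\, p(x_k, Q_k, r_k \mid z_{1:k-1})}{p(z_k \mid z_{1:k-1})}.$$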
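The discrete part of such a prediction-update cycle can be sketched in the style of Bayesian online change-point detection. The example below is a minimal illustration under stated assumptions: a constant change-point probability per step (`hazard`), and predictive likelihoods of the new measurement that are supplied externally for each run-length hypothesis (`pred_lik`). The function name and both parameters are hypothetical and are not the recursion derived in this paper.

```python
import numpy as np

def update_run_length(prev_post, pred_lik, hazard):
    """One prediction-update step for the run-length posterior.

    prev_post[r]: posterior probability that the previous run length was r.
    pred_lik[r]:  predictive likelihood of the new measurement given that
                  the previous run length was r (assumed to be given).
    hazard:       assumed constant probability of a change point per step.
    """
    prev_post = np.asarray(prev_post, dtype=float)
    pred_lik = np.asarray(pred_lik, dtype=float)
    joint = prev_post * pred_lik                 # weight each hypothesis by the data
    new_post = np.empty(prev_post.size + 1)
    new_post[0] = hazard * joint.sum()           # change point: run length drops to 0
    new_post[1:] = (1.0 - hazard) * joint        # no change point: run length grows by 1
    return new_post / new_post.sum()             # normalize (Bayes' rule)

print(update_run_length([0.5, 0.5], [1.0, 0.0], hazard=0.2))  # [0.2 0.8 0. ]
```

The sum over previous run lengths in the first entry is the discrete counterpart of the Chapman–Kolmogorov integral, and the final normalization plays the role of the evidence term in Bayes' rule.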
Because of the intractable integrations and summations involved in the prediction-update cycle, the general Bayesian filtering solution is not analytically tractable. We solve the above recursive equations approximately using VB, where the intractable posterior PDF $p(x_k, Q_k, r_k \mid z_{1:k})$ is approximated by a variational distribution $q(x_k, Q_k, r_k)$ obtained by minimizing the Kullback–Leibler (KL) divergence between the two. The KL divergence is zero only when the variational distribution equals the exact posterior; within a restricted variational family this is generally not attainable, so the minimizer is the closest member of the family in the KL sense.
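The behavior of the KL divergence as a measure of mismatch can be checked on the simplest case, two univariate Gaussians, for which it has a closed form. The function name `kl_gauss` and the numeric values are illustrative assumptions; the closed-form expression itself is the standard one.

```python
import math

def kl_gauss(m0, s0, m1, s1):
    """KL( N(m0, s0^2) || N(m1, s1^2) ) for univariate Gaussians (s0, s1 > 0)."""
    return math.log(s1 / s0) + (s0**2 + (m0 - m1)**2) / (2.0 * s1**2) - 0.5

print(kl_gauss(0.0, 1.0, 0.0, 1.0))  # 0.0  (identical distributions)
print(kl_gauss(0.0, 1.0, 1.0, 1.0))  # 0.5  (means differ by one standard deviation)
```

The divergence vanishes only when both the mean and the standard deviation agree, and it is not symmetric in its two arguments.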