Parameter Identification of the Discrete-Time Stochastic Systems with Multiplicative and Additive Noises Using the UD-Based State Sensitivity Evaluation

Andrey Tsyganov; Yulia Tsyganova

doi:10.3390/math11244964

and

¹

Department of Mathematics, Physics and Technology Education, Ulyanovsk State University of Education, Ulyanovsk 432071, Russia

²

Department of Mathematics, Information and Aviation Technology, Ulyanovsk State University, Ulyanovsk 432017, Russia

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Mathematics2023, 11(24), 4964;https://doi.org/10.3390/math11244964

This article belongs to the Special Issue New Trends on Identification of Dynamic Systems

Version Notes

Order Reprints

Abstract

The paper proposes a new method for solving the parameter identification problem for a class of discrete-time linear stochastic systems with multiplicative and additive noises using a numerical gradient-based optimization. The constructed method is based on the application of a covariance UD filter for the above systems and an original method for evaluating state sensitivities within the numerically stable, matrix-orthogonal MWGS transformation. In addition to the numerical stability of the proposed algorithm to machine roundoff errors due to the application of the MWGS-UD orthogonalization procedure at each step, the main advantage of the obtained results is the possibility of analytical calculation of derivatives at a given value of the identified parameter without the need to use finite-difference methods. Numerical experiments demonstrate how the obtained results can be applied to solve the parameter identification problem for the considered stochastic system model.

Keywords:

parameter identification; gradient-based optimization; sensitivity evaluation; discrete-time linear stochastic systems; multiplicative and additive noises

MSC:

93E12

1. Introduction

This paper addresses a parameter identification problem for a class of discrete-time linear stochastic systems represented by state-space difference equations with additive and multiplicative noises. A distinctive feature of this class of dynamic systems is multiplicative noise, which can be included in both the state and measurement equations [1]. The reasons for the appearance of multiplicative noise are different; for example, it can be due to linearization, quantization, and modeling errors, or physical phenomena such as fading in communication channels. Most often, systems with additive and multiplicative noises are considered when solving problems related to various kinds of measurement processing.

The parameter identification problem consists of determining the unknown parameters of a mathematical model of the system belonging to a selected class of models using known input and output measurement data [2]. As noted in [3,4], the main approaches to solving parameter identification problems, which remain of current interest today, are subspace identification methods and minimum prediction error (MPE) methods. The first approach is based on the application of projections in Euclidean space, and the second one is based on minimizing the identification criterion depending on the system parameters. The foundation of these approaches was laid in the seminal works [5,6]. A great contribution to the development of MPE methods was made by Lennart Ljung, who defined the basic concepts of MPE methodology [2,4].

Linear discrete-time stochastic systems with associated Kalman-type filtering algorithms have been extensively used in practice. As a rule, application of the filter equations assumes a complete a priori knowledge of the system model parameters, but it is a rare case. The classical way of solving the parameter identification problem is to use adaptive filters where the model parameters are estimated together with the dynamic state [7]. This requires determination of the sensitivities of the system state to unknown parameters, i.e., partial derivatives of the state estimates. Straightforward differentiation of the filter equations is a direct approach to compute the state sensitivities. This leads to a set of filter sensitivity equations. It is well known that, for discrete-time linear stochastic systems with additive noises, a conventional Kalman filter may suffer from numerical instability caused by machine roundoff errors [8]. This is also true for such systems with multiplicative noises [9]. Nevertheless, there are currently many numerically stable modifications of the conventional Kalman filtering algorithm, which are based on various methods for factorizing the estimation error covariance matrices [8,10]. It is worth noting that, for discrete-time stochastic systems with multiplicative noises, such modifications were developed relatively recently (see, for example, [9,11]).

In this paper, we chose a numerically stable UD-based modification of the Kalman-type filtering algorithm [9] for solving the parameter identification problem, since it has the following attractive numerical properties: numerical stability to machine roundoff errors, absence of square root and matrix inversion operations, and the compactness and simplicity of the block matrix array form for implementation on a computer, including parallel computing [10].

Thus, the purpose of our work is to develop a numerically stable UD-based method for solving the parameter identification problem for a class of discrete-time linear stochastic systems with multiplicative and additive noises using a numerical, gradient-based optimization of the identification criterion in the form of a negative logarithmic likelihood function.

We propose the following solution steps to achieve the goal:

Replace the conventional Kalman-type filtering algorithm with a UD-based covariance filtering algorithm that is numerically stable to machine roundoff errors.
Construct a new method for calculating the state sensitivity values in the adaptive UD-based filter.
Apply this method to the gradient-based minimization of the identification criterion.

The paper is organized as follows. Section 2 provides the problem statement of parameter identification for a considered class of stochastic systems. Then, the UD-based covariance filtering algorithm for discrete-time stochastic systems with multiplicative and additive noises is described. Lemma 1 provides the method of obtaining the main results presented in the next section. Section 3 contains the new UD-based state sensitivity evaluation method, which is described in Proposition 1. Proposition 2 explains how the values of the identification criterion and its gradient can be calculated using the suggested algorithm. Section 4 demonstrates how these two methods can be applied for solving the parameter identification problem of the considered stochastic system model. Section 5 concludes the paper.

2. Methods

2.1. The Problem Statement

Consider a discrete-time linear stochastic system with multiplicative and additive noises

\{\begin{matrix} x_{k} & = (F_{k - 1} + {\tilde{F}}_{k - 1} ξ_{k - 1}) x_{k - 1} + G_{k - 1} w_{k - 1}, \\ z_{k} & = (H_{k} + {\tilde{H}}_{k} ζ_{k}) x_{k} + v_{k}, k = 1, 2, \dots, M, \end{matrix}

(1)

where

x_{k} \in R^{n}

is the system state vector;

z_{k} \in R^{m}

is the measurement vector; matrices

F_{k}

,

{\tilde{F}}_{k} \in R^{n \times n}

;

H_{k}

,

{\tilde{H}}_{k} \in R^{m \times n}

;

G \in R^{n \times q}

; M is the number of measurements;

x_{0}

∼

N ({\bar{x}}_{0}, Π_{0})

is the initial state;

ξ_{k} \in R

∼

N (0, σ_{ξ}^{2})

and

w_{k} \in R^{q}

∼

N (0, Q_{k})

are multiplicative and additive noises in the state equation, respectively;

ζ_{k} \in R

∼

N (0, σ_{ζ}^{2})

and

v_{k} \in R^{m}

∼

N (0, R_{k})

are multiplicative and additive noises in the measurement equation, respectively; covariance matrices

Q_{k}

and

R_{k}

of noises

w_{k}

and

v_{k}

, respectively, are positive definite; and all noises and the initial state are mutually independent.

The considered systems find their application in wireless sensor networks, navigation, and other fields. Multiplicative noises are commonly used to account for stochastic uncertainties in system dynamics and measurements. In this paper, we assume that system (1) additionally contains parametric uncertainty, i.e., the matrices defining the equations of system (1) depend on the unknown parameter

θ \in R^{p}

. Consequently, all elements of the system matrices

F_{k}

,

{\tilde{F}}_{k}

,

G_{k}

,

H_{k}

,

{\tilde{H}}_{k}

, noise covariance matrices

Q_{k}

and

R_{k}

, and variances

σ_{ξ}^{2}

,

σ_{ζ}^{2}

, as well as the initial conditions

{\bar{x}}_{0}

and

Π_{0}

, may depend on the unknown parameter

θ

; i.e.,

F_{k} = F_{k} (θ)

,

{\tilde{F}}_{k} = {\tilde{F}}_{k} (θ)

,

G_{k} = G_{k} (θ)

, etc.

Thus, the parameter identification problem of system (1) arises. It is known that the performance of the Kalman filter degrades in the presence of modeling uncertainties. Solving the problem of parameter identification makes it possible to cope with this problem. The well-known approaches involve numerical minimization by

θ

of the identification criterion [12]

{\hat{θ}}_{m i n} = \underset{θ \in D (θ)}{argmin} J (θ; Z_{1}^{M}),

(2)

which implies solving an optimization problem with constraints, where

D (θ)

represents some compact set, such as a segment in the scalar case.

To solve the problem of parameter identification, we chose an identification criterion (2) in the form of the negative logarithmic likelihood function [13]:

J (θ; Z_{1}^{M}) = \frac{M m}{2} ln 2 π + \frac{1}{2} \sum_{k = 1}^{M} \{ln det Σ_{k} (θ) + | | ν_{k} (θ) {| |}_{Σ_{k}^{- 1} (θ)}^{2}\},

(3)

depending on the available measurement information, which includes both measurements

Z_{1}^{M} = {z_{1}, \dots, z_{k}, \dots, z_{M}}

themselves and the observed measurement residuals

ν_{k} (θ) = z_{k} - H_{k} {\hat{x}}_{k} (θ)

, calculated by conventional Kalman-type filtering algorithm (Algorithm 1 in [9]). Here,

| | ν_{k} (θ) {| |}_{Σ_{k}^{- 1} (θ)}^{2} = ν_{k}^{T} (θ) Σ_{k}^{- 1} (θ) ν_{k} (θ)

.

In this paper, our first task is to replace the conventional Kalman-type filtering algorithm with a numerically stable, UD-based covariance filtering algorithm.

2.2. The UD-Based Covariance Filtering Algorithm for Discrete-Time Stochastic Systems with Multiplicative and Additive Noises

Originally, two algorithms based on the

U D U^{T}

decomposition of covariance and information matrices for discrete-time stochastic systems with multiplicative and additive noises were proposed in [9]. The first of these algorithms is the UD-based covariance filter, and the second one is the UD-based information filter. Both of these filters have an extended array form, and their computational schemes allow for updating all required filter quantities with the use of the numerically stable MWGS orthogonalization procedure.

The UD-based implementations imply the decomposition of the error covariance matrix in the form of

P = U_{P} D_{P} U_{P}^{T}

, where

U_{P}

is an upper triangular matrix with 1’s on the main diagonal, and

D_{P}

is a diagonal matrix. To recursively update the resulting UD-factors

U_{P}

and

D_{P}

, we use the modified weighted Gram–Schmidt UD-based (MWGS-UD) orthogonalization [14] as follows: given a pair of the pre-arrays

{A, D_{A}}

, compute a pair of the post-arrays

{U, D_{U}}

by means of the MWGS-UD orthogonalization, i.e.,

A^{T} = U M^{T} and A^{T} D_{A} A = U D_{U} U^{T},

(4)

where

A \in R^{r \times s}

,

r > s

, and

M \in R^{r \times s}

is the MWGS-UD transformation that produces the block upper triangular matrix

U \in R^{s \times s}

. The diagonal matrices

D_{A} \in R^{r \times r}

and

D_{U} \in R^{s \times s}

satisfy

M^{T} D_{A} M = D_{U}

and

D_{A} > 0

(see Lemma VI.4.1 in [14] for an extended explanation).

In this paper, we consider a modification of the UD-based covariance filter [9], whose equations are presented in Algorithm 1. The proof of Algorithm 1 is similar to the proof presented in [9].

Algorithm 1: UD-Based Covariance Filter (UD-CF)

Input:

{\bar{x}}_{0}

,

Π_{0}

.

Initialization

1. Set

{\hat{x}}_{0} = {\bar{x}}_{0}

,

X_{0} = Π_{0} + {\bar{x}}_{0} {\bar{x}}_{0}^{T}

,

Π_{0} = U_{Π_{0}} D_{Π_{0}} U_{Π_{0}}^{T}

,

X_{0} = U_{X_{0}} D_{X_{0}} U_{X_{0}}^{T}

,

U_{P_{0}} = U_{Π_{0}}

,

D_{P_{0}} = D_{Π_{0}}

.

For

k = 1, 2, \dots, M

do

Time Update

2.

{\hat{x}}_{k | k - 1} = F_{k - 1} {\hat{x}}_{k - 1}

.

3.

Q_{k - 1} = U_{Q_{k - 1}} D_{Q_{k - 1}} U_{Q_{k - 1}}^{T}

.

4.

A_{1}^{T} = [\begin{matrix} {\tilde{F}}_{k - 1} U_{X_{k - 1}} & G_{k - 1} U_{Q_{k - 1}} \end{matrix}]

,

D_{A_{1}} = [\begin{matrix} σ_{ξ}^{2} D_{X_{k - 1}} & 0 \\ 0 & D_{Q_{k - 1}} \end{matrix}]

.

Using MWGS-UD

⟨ A_{1}^{T}, D_{A_{1}} ⟩ ⟹ ⟨ U_{1}, D_{U_{1}} ⟩

where

U_{1} = U_{{\tilde{Q}}_{k - 1}}

,

D_{U_{1}} = D_{{\tilde{Q}}_{k - 1}}

.

5.

A_{2}^{T} = [\begin{matrix} F_{k - 1} U_{P_{k - 1}} & U_{{\tilde{Q}}_{k - 1}} \end{matrix}]

,

D_{A_{2}} = [\begin{matrix} D_{P_{k - 1}} & 0 \\ 0 & D_{{\tilde{Q}}_{k - 1}} \end{matrix}]

.

Using MWGS-UD

⟨ A_{2}^{T}, D_{A_{2}} ⟩ ⟹ ⟨ U_{2}, D_{U_{2}} ⟩

where

U_{2} = U_{P_{k | k - 1}}

,

D_{U_{2}} = D_{P_{k | k - 1}}

.

6.

A_{3}^{T} = [\begin{matrix} F_{k - 1} U_{X_{k - 1}} & U_{{\tilde{Q}}_{k - 1}} \end{matrix}]

,

D_{A_{3}} = [\begin{matrix} D_{X_{k - 1}} & 0 \\ 0 & D_{{\tilde{Q}}_{k - 1}} \end{matrix}]

.

Using MWGS-UD

⟨ A_{3}^{T}, D_{A_{3}} ⟩ ⟹ ⟨ U_{3}, D_{U_{3}} ⟩

where

U_{3} = U_{X_{k}}

,

D_{U_{3}} = D_{X_{k}}

.

Measurement Update

7.

R_{k} = U_{R_{k}} D_{R_{k}} U_{R_{k}}^{T}

.

8.

A_{4}^{T} = [\begin{matrix} {\tilde{H}}_{k} U_{X_{k}} & U_{R_{k}} \end{matrix}]

,

D_{A_{4}} = [\begin{matrix} σ_{ζ}^{2} D_{X_{k}} & 0 \\ 0 & D_{R_{k}} \end{matrix}]

.

Using MWGS-UD

⟨ A_{4}^{T}, D_{A_{4}} ⟩ ⟹ ⟨ U_{4}, D_{U_{4}} ⟩

where

U_{4} = U_{{\tilde{R}}_{k}}

,

D_{U_{4}} = D_{{\tilde{R}}_{k}}

.

9.

A_{5}^{T} = [\begin{matrix} U_{P_{k | k - 1}} & 0 \\ H_{k} U_{P_{k | k - 1}} & U_{{\tilde{R}}_{k}} \end{matrix}]

,

D_{A_{5}} = [\begin{matrix} D_{P_{k | k - 1}} & 0 \\ 0 & D_{{\tilde{R}}_{k}} \end{matrix}]

.

Using MWGS-UD

⟨ A_{5}^{T}, D_{A_{5}} ⟩ ⟹ ⟨ U_{5}, D_{U_{5}} ⟩

where

U_{5} = [\begin{matrix} U_{P_{k}} & (K_{k} U_{Σ_{k}}) \\ 0 & U_{Σ_{k}} \end{matrix}]

,

D_{U_{5}} = [\begin{matrix} D_{P_{k}} & 0 \\ 0 & D_{Σ_{k}} \end{matrix}]

.

10.

{\bar{e}}_{k} = U_{Σ_{k}}^{- 1} (z_{k} - H_{k} {\hat{x}}_{k | k - 1})

.

11.

{\hat{x}}_{k} = {\hat{x}}_{k | k - 1} + (K_{k} U_{Σ_{k}}) {\bar{e}}_{k}

.

End For

Output:

{\hat{x}}_{k}

,

P_{k} = U_{P_{k}} D_{P_{k}} U_{P_{k}}^{T}

,

k = 1, 2, \dots, M

.

Remark 1.

Steps 3 and 7 of Algorithm 1 require the application of the

U D U^{T}

modified Cholesky decomposition [15]. Steps 4–6, 8, and 9 require the application of the MWGS-UD orthogonalization to a pair of block matrices

⟨ A_{i}^{T}, D_{A_{i}} ⟩

that produces another pair of block matrices

⟨ U_{i}, D_{U_{i}} ⟩

(i = 1, \dots, 5)

so that the equalities (4) are satisfied.

Let us rewrite the identification criterion (3) in terms of the UD-CF algorithm. Taking into account that

det Σ_{k} (θ) = det D_{Σ_{k} (θ)}

and

| | ν_{k} {(θ) | |}_{Σ_{k}^{- 1} (θ)}^{2} = | | {\bar{e}}_{k} (θ) {| |}_{D_{Σ_{k} (θ)}^{- 1}}^{2}

, we obtain

J_{U D} (θ; Z_{1}^{M}) = \frac{M m}{2} ln 2 π + \frac{1}{2} \sum_{k = 1}^{M} \{ln det D_{Σ_{k} (θ)} + | | {\bar{e}}_{k} (θ) {| |}_{D_{Σ_{k} (θ)}^{- 1}}^{2}\},

(5)

where diagonal matrix

D_{Σ_{k}}

and normalized residual vector

{\bar{e}}_{k}

are calculated in Steps 9 and 10 of Algorithm 1.

2.3. Derivative Evaluation of the MWGS-Based Array of Block Matrices

Solving the parameter identification problem requires minimization of the identification criterion (5) with respect to unknown system parameters. It is often done by using the gradient approach [7], where the computation of

▽_{θ} J_{U D} (θ; Z_{1}^{M})

is necessary. For the discrete-time stochastic system (1), the

J_{U D} (θ; Z_{1}^{M})

and

▽_{θ} J_{U D} (θ; Z_{1}^{M})

evaluation demands an implementation of the UD-CF and of the so-called “differentiated” UD-CF to determine the state sensitivities of the system state to the unknown system parameters [7,16,17]. The

▽_{θ} J_{U D} (θ; Z_{1}^{M})

computation will lead to a set of p filter sensitivity equations for computing

\partial {\hat{x}}_{k} / \partial θ

and a set of p matrix Riccati-type sensitivity equations for computing

\partial P_{k} / \partial θ

.

Such a method of state sensitivity evaluation within the UD-based array covariance filter was proposed in [17] for a class of discrete-time linear stochastic systems with only additive noises. In this paper, we extend this approach to systems with multiplicative and additive noises. Solving the problem of state sensitivity evaluation, we augment the numerical scheme of the UD-CF (Algorithm 1) with a procedure for numerically efficient evaluation of the derivatives of the UD-based filter variables with respect to unknown system parameters.

We use the following basic result of [17] that gives a simple and convenient technique which naturally augments any MWGS-based array of block matrices for computing derivatives of its elements.

Lemma 1

([17]). Let entries of the pre-arrays

A

,

D_{A}

in (4) be known differentiable functions of a parameter θ. Consider the transformation in (4). Given the derivatives of the pre-arrays

A_{θ}^{'}

and

{(D_{U})}_{θ}^{'}

, the following formulas calculate the corresponding derivatives of the post-arrays:

U_{θ}^{'} = U ({\bar{L}}_{0}^{T} + {\bar{U}}_{0} + {\bar{U}}_{2}) D_{U}^{- 1} a n d {(D_{U})}_{θ}^{'} = 2 D_{0} + D_{2},

(6)

where the quantities

{\bar{L}}_{0}

,

D_{0}

, and

{\bar{U}}_{0}

are, respectively, the strictly lower triangular, diagonal, and strictly upper triangular parts of the matrix product

M^{T} D_{A} A_{θ}^{'} U^{- T}

. Additionally,

D_{2}

and

{\bar{U}}_{2}

are the diagonal and strictly upper triangular parts of the product

M^{T} {(D_{A})}_{θ}^{'} M

, respectively.

3. Main Results

One can see from Algorithm 1 that elements

{\hat{x}}_{k}

and the UD factors

U_{P_{k}}

and

D_{P_{k}}

are readily available from this UD-based filter. Hence, our aim is to augment equations of the UD-CF so that the derivatives

\partial {\hat{x}}_{k} (θ) / \partial θ_{i}

and

\partial U_{P_{k} (θ)} / \partial θ_{i}

,

\partial D_{P_{k} (θ)} / \partial θ_{i}

,

i = 1, \dots, p

can be computed using quantities available from this UD-CF algorithm.

3.1. The New UD-Based State Sensitivity Evaluation Method

Now, we are ready to present our new result—the UD-based state sensitivity evaluation method for a class of discrete-time linear stochastic systems with multiplicative and additive noises.

Proposition 1.

Let the elements of matrices defining system (1) be known differentiable functions of a parameter θ. Then, for a given value of parameter θ, the estimates of state vector

x_{k}

, their sensitivity values

\partial {\hat{x}}_{k} / \partial θ

, and the UD factors

U_{P_{k}}

,

D_{P_{k}}

of the error covariance matrices and their sensitivity values

\partial U_{P_{k}} / \partial θ

,

\partial D_{P_{k}} / \partial θ

can be evaluated simultaneously using the subsequent Algorithm 2.

Algorithm 2: State Sensitivity Evaluation within Adaptive UD-CF

Input:

{\bar{x}}_{0}

,

Π_{0}

,

θ = θ_{i}

,

i = 1, \dots, p

.

Initialization

1. Set

{\hat{x}}_{0} = {\bar{x}}_{0}

,

X_{0} = Π_{0} + {\bar{x}}_{0} {\bar{x}}_{0}^{T}

,

Π_{0} = U_{Π_{0}} D_{Π_{0}} U_{Π_{0}}^{T}

,

X_{0} = U_{X_{0}} D_{X_{0}} U_{X_{0}}^{T}

.

{({\hat{x}}_{0})}_{θ}^{'} = \frac{\partial {\hat{x}}_{0}}{\partial θ}

,

{(U_{P_{0}})}_{θ}^{'} = \frac{\partial U_{P_{0}}}{\partial θ}

,

{(D_{P_{0}}^{'})}_{θ} = \frac{\partial D_{P_{0}}}{\partial θ}

,

{(U_{X_{0}})}_{θ}^{'} = \frac{\partial U_{X_{0}}}{\partial θ}

,

{(D_{X_{0}})}_{θ}^{'} = \frac{\partial U_{X_{0}}}{\partial θ}

.

For

k = 1, 2, \dots, M

do

Time Update

2.

{\hat{x}}_{k | k - 1} = F_{k - 1} {\hat{x}}_{k - 1}

,

{({\hat{x}}_{k | k - 1})}_{θ}^{'} = \frac{\partial F_{k - 1}}{\partial θ} {\hat{x}}_{k - 1} + F_{k - 1} {({\hat{x}}_{k - 1})}_{θ}^{'}

.

3.

Q_{k - 1} = U_{Q_{k - 1}} D_{Q_{k - 1}} U_{Q_{k - 1}}^{T}

,

{(U_{Q_{k - 1}})}_{θ}^{'} = \frac{\partial U_{Q_{k - 1}}}{\partial θ}

,

{(D_{Q_{k - 1}})}_{θ}^{'} = \frac{\partial D_{Q_{k - 1}}}{\partial θ}

.

4.

A_{1}^{T} = [\begin{matrix} {\tilde{F}}_{k - 1} U_{X_{k - 1}} & G_{k - 1} U_{Q_{k - 1}} \end{matrix}]

,

{(A_{1}^{T})}_{θ}^{'} = \frac{\partial A_{1}^{T}}{\partial θ}

;

D_{A_{1}} = [\begin{matrix} σ_{ξ}^{2} D_{X_{k - 1}} & 0 \\ 0 & D_{Q_{k - 1}} \end{matrix}]

,

{(D_{A_{1}})}_{θ}^{'} = \frac{\partial D_{A_{1}}}{\partial θ}

.

By Lemma 1

⟨ A_{1}^{T}, {(A_{1}^{T})}_{θ}^{'}, D_{A_{1}}, {(D_{A_{1}})}_{θ}^{'} ⟩ ⟹ ⟨ U_{1}, {(U_{1})}_{θ}^{'}, D_{U_{1}}, {(D_{U_{1}})}_{θ}^{'} ⟩

where

U_{1} = U_{{\tilde{Q}}_{k - 1}}

,

{(U_{1})}_{θ}^{'} = {(U_{{\tilde{Q}}_{k - 1}})}_{θ}^{'}

,

D_{U_{1}} = D_{{\tilde{Q}}_{k - 1}}

,

{(D_{U_{1}})}_{θ}^{'} = {(D_{{\tilde{Q}}_{k - 1}})}_{θ}^{'}

.

5.

A_{2}^{T} = [\begin{matrix} F_{k - 1} U_{P_{k - 1}} & U_{{\tilde{Q}}_{k - 1}} \end{matrix}]

,

{(A_{2}^{T})}_{θ}^{'} = \frac{\partial A_{2}^{T}}{\partial θ}

;

D_{A_{2}} = [\begin{matrix} D_{P_{k - 1}} & 0 \\ 0 & D_{{\tilde{Q}}_{k - 1}} \end{matrix}]

,

{(D_{A_{2}})}_{θ}^{'} = \frac{\partial D_{A_{2}}}{\partial θ}

.

By Lemma 1

⟨ A_{2}^{T}, {(A_{2}^{T})}_{θ}^{'}, D_{A_{2}}, {(D_{A_{2}})}_{θ}^{'} ⟩ ⟹ ⟨ U_{2}, {(U_{2})}_{θ}^{'}, D_{U_{2}}, {(D_{U_{2}})}_{θ}^{'} ⟩

where

U_{2} = U_{P_{k | k - 1}}

,

{(U_{2})}_{θ}^{'} = {(U_{P_{k | k - 1}})}_{θ}^{'}

,

D_{U_{2}} = D_{P_{k | k - 1}}

,

{(D_{U_{2}})}^{'} = {(D_{P_{k | k - 1}})}_{θ}^{'}

.

6.

A_{3}^{T} = [\begin{matrix} F_{k - 1} U_{X_{k - 1}} & U_{{\tilde{Q}}_{k - 1}} \end{matrix}]

,

{(A_{3}^{T})}_{θ}^{'} = \frac{\partial A_{3}^{T}}{\partial θ}

;

D_{A_{3}} = [\begin{matrix} D_{X_{k - 1}} & 0 \\ 0 & D_{{\tilde{Q}}_{k - 1}} \end{matrix}]

,

{(D_{A_{3}})}_{θ}^{'} = \frac{\partial D_{A_{3}}}{\partial θ}

.

By Lemma 1

⟨ A_{3}^{T}, {(A_{3}^{T})}_{θ}^{'}, D_{A_{3}}, {(D_{A_{3}})}_{θ}^{'} ⟩ ⟹ ⟨ U_{3}, {(U_{3})}_{θ}^{'}, D_{U_{3}}, {(D_{U_{3}})}_{θ}^{'} ⟩

where

U_{3} = U_{X_{k}}

,

{(U_{3})}_{θ}^{'} = {(U_{X_{k}})}_{θ}^{'}

,

D_{U_{3}} = D_{X_{k}}

,

{(D_{U_{3}})}_{θ}^{'} = {(D_{X_{k}})}_{θ}^{'}

.

Measurement Update

7.

R_{k} = U_{R_{k}} D_{R_{k}} U_{R_{k}}^{T}

,

{(U_{R_{k}})}_{θ}^{'} = \frac{\partial U_{R_{k}}}{\partial θ}

,

{(D_{R_{k}})}_{θ}^{'} = \frac{\partial D_{R_{k}}}{\partial θ}

.

8.

A_{4}^{T} = [\begin{matrix} {\tilde{H}}_{k} U_{X_{k}} & U_{R_{k}} \end{matrix}]

,

{(A_{4}^{T})}_{θ}^{'} = \frac{\partial A_{4}^{T}}{\partial θ}

;

D_{A_{4}} = [\begin{matrix} σ_{ζ}^{2} D_{X_{k}} & 0 \\ 0 & D_{R_{k}} \end{matrix}]

,

{(D_{A_{4}})}_{θ}^{'} = \frac{\partial D_{A_{4}}}{\partial θ}

.

By Lemma 1

⟨ A_{4}^{T}, {(A_{4}^{T})}_{θ}^{'}, D_{A_{4}}, {(D_{A_{4}})}_{θ}^{'} ⟩ ⟹ ⟨ U_{4}, {(U_{4})}_{θ}^{'}, D_{U_{4}}, {(D_{U_{4}})}_{θ}^{'} ⟩

where

U_{4} = U_{{\tilde{R}}_{k}}, {(U_{4})}_{θ}^{'} = {(U_{{\tilde{R}}_{k}})}_{θ}^{'}

,

D_{U_{4}} = D_{{\tilde{R}}_{k}}

,

{(D_{U_{4}})}_{θ}^{'} = {(D_{{\tilde{R}}_{k}})}_{θ}^{'}

.

9.

A_{5}^{T} = [\begin{matrix} U_{P_{k | k - 1}} & 0 \\ H_{k} U_{P_{k | k - 1}} & U_{{\tilde{R}}_{k}} \end{matrix}]

,

{(A_{5}^{T})}_{θ}^{'} = \frac{\partial A_{5}^{T}}{\partial θ}

;

D_{A_{5}} = [\begin{matrix} D_{P_{k | k - 1}} & 0 \\ 0 & D_{{\tilde{R}}_{k}} \end{matrix}]

,

{(D_{A_{5}})}_{θ}^{'} = \frac{\partial D_{A_{5}}}{\partial θ}

.

By Lemma 1

⟨ A_{5}^{T}, {(A_{5}^{T})}_{θ}^{'}, D_{A_{5}}, {(D_{A_{5}})}_{θ}^{'} ⟩ ⟹ ⟨ U_{5}, {(U_{5})}_{θ}^{'}, D_{U_{5}}, {(D_{U_{5}})}_{θ}^{'} ⟩

where

U_{5} = [\begin{matrix} U_{P_{k}} & (K_{k} U_{Σ_{k}}) \\ 0 & U_{Σ_{k}} \end{matrix}]

,

{(U_{5})}_{θ}^{'} = [\begin{matrix} {(U_{P_{k}})}^{'} & {(K_{k} U_{Σ_{k}})}_{θ}^{'} \\ 0 & {(U_{Σ_{k}})}_{θ}^{'} \end{matrix}]

,

D_{U_{5}} = [\begin{matrix} D_{P_{k}} & 0 \\ 0 & D_{Σ_{k}} \end{matrix}]

,

{(D_{U_{5}})}_{θ}^{'} = [\begin{matrix} {(D_{P_{k}})}_{θ}^{'} & 0 \\ 0 & {(D_{Σ_{k}})}_{θ}^{'} \end{matrix}]

.

10.

{\bar{e}}_{k} = U_{Σ_{k}}^{- 1} (z_{k} - H_{k} {\hat{x}}_{k | k - 1})

,

{({\bar{e}}_{k})}_{θ}^{'} = {(U_{Σ_{k}}^{- 1})}_{θ}^{'} (z_{k} - H_{k} {\hat{x}}_{k | k - 1}) - U_{Σ_{k}}^{- 1} (\frac{\partial H_{k}}{\partial θ} {\hat{x}}_{k | k - 1} + H_{k} {({\hat{x}}_{k | k - 1})}_{θ}^{'})

.

11.

{\hat{x}}_{k} = {\hat{x}}_{k | k - 1} + (K_{k} U_{Σ_{k}}) {\bar{e}}_{k}

,

{({\hat{x}}_{k})}_{θ}^{'} = {({\hat{x}}_{k | k - 1})}_{θ}^{'} + {(K_{k} U_{Σ_{k}})}_{θ}^{'} {\bar{e}}_{k} + (K_{k} U_{Σ_{k}}) {({\bar{e}}_{k})}_{θ}^{'}

.

End For

Output:

{\hat{x}}_{k}

,

\partial {\hat{x}}_{k} / \partial θ

,

U_{P_{k}}

,

\partial U_{P_{k}} / \partial θ

,

D_{P_{k}}

,

\partial D_{P_{k}} / \partial θ

,

k = 1, 2, \dots, M

.

Proof.

Steps 1, 3, and 7 of Algorithm 2 are obtained from the corresponding steps of Algorithm 1 by direct differentiation of the vector

{\hat{x}}_{0}

and the UD factors of the covariance matrices

P_{0}

,

Q_{k - 1}

, and

R_{k}

with respect to parameter

θ

.

Steps 2, 10, and 11 of Algorithm 2 are obtained from the corresponding steps of Algorithm 1 by direct differentiation of the equations with respect to parameter

θ

.

Let us consider the equations that define Steps 4–6, 8, and 9 in Algorithm 2. They have the same form and are obtained using Lemma 1 as follows. At each step, the MWGS-UD procedure orthogonalizes the columns of the block matrix

A_{i}

with respect to the weight matrix

D_{A_{i}}

so that the equalities in (4) are satisfied. To calculate sensitivity values

{(U_{i})}_{θ}^{'}

and

{(D_{U_{i}})}_{θ}^{'}

for a given value of parameter

θ

, it is necessary and sufficient to find the values of the partial derivatives of the elements of the block matrices

A_{i}

and

D_{A_{i}}

, i.e., calculate the matrices

{(A_{i})}_{θ}^{'}

and

{(D_{A_{i}})}_{θ}^{'}

. Lemma 1 with

A : = A_{i}

,

D_{A} : = D_{A_{i}}

, and

U : = U_{i}

,

D_{U} : = D_{U_{i}}

(

i = 1, \dots, 5

) allows for achieving this goal. As a result, we find block matrices

{(U_{i})}_{θ}^{'}

and

{(D_{U_{i}})}_{θ}^{'}

, from which we obtain the required sensitivity values. □

Thus, Proposition 1 gives the method for state sensitivity evaluation using the UD-based covariance filter for discrete-time linear stochastic systems with multiplicative and additive noises.

3.2. The UD-Based Computation of the Identification Criterion and Its Gradient

Algorithm 2 can be considered as an adaptive filter, in which parameter

θ

is adjusted according to the minimum of criterion

J_{U D} (θ; Z_{1}^{M})

. When solving the parameter identification problem using gradient-based algorithms, adaptive filters are used, supplemented by a sensitivity model equations [18] to calculate the gradient of the identification criterion.

Implementation of gradient-based numerical methods for minimizing identification criterion (5) requires calculating the values of its gradient

▽_{θ} J_{U D} (θ; Z_{1}^{M})

.

The conventional gradient-based method has the following iterative form:

θ^{j + 1} = θ^{j} - β_{j} \nabla_{θ} J_{U D} (θ^{j}; Z_{1}^{M}),

(7)

where

θ^{j}

is the parameter vector at the jth iteration. In (7),

\nabla_{θ}

denotes the gradient operator

{[\partial / \partial θ_{1} | \dots | \partial / \partial θ_{p}]}^{T}

(

θ \in R^{p}

), which is applied here to (5) at point

θ = θ^{j}

. Scalar step size parameter

β_{j}

is designed to ensure that

J_{U D} (θ^{j + 1}; Z_{1}^{M}) \leq J_{U D} (θ^{j}; Z_{1}^{M}) + e

, where e is a positive number that can be chosen in a variety of ways [19].

Let us write the equation we could use to evaluate the gradient of the identification criterion (3) in terms of the UD-CF algorithm. Let

θ \in R^{p}

. Then, from (5), we have

\frac{\partial J_{U D} (θ; Z_{1}^{M})}{\partial θ_{i}} = \frac{1}{2} \sum_{k = 1}^{M} \{\frac{\partial [ln (det D_{Σ_{k} (θ)})]}{\partial θ_{i}} + \frac{\partial [{\bar{e}}_{k}^{T} (θ) D_{Σ_{k} (θ)}^{- 1} {\bar{e}}_{k} (θ)]}{\partial θ_{i}}\}, i = 1, \dots, p .

(8)

Taking into account matrix differentiation rules, we rewrite (8) to obtain the expression for

\nabla_{θ} J_{U D} (θ; Z_{1}^{M})

evaluation (

i = 1, \dots, p

):

\frac{\partial J_{U D} (θ; Z_{1}^{M})}{\partial θ_{i}} = \frac{1}{2} \sum_{k = 1}^{M} \{tr \frac{\partial D_{Σ_{k} (θ)}}{\partial θ_{i}} D_{Σ_{k} (θ)}^{- 1} + 2 \frac{\partial {\bar{e}}_{k}^{T} (θ)}{\partial θ_{i}} D_{Σ_{k} (θ)}^{- 1} {\bar{e}}_{k} (θ) - {\bar{e}}_{k}^{T} (θ) D_{Σ_{k (θ)}}^{- 2} \frac{\partial D_{Σ_{k} (θ)}}{\partial θ_{i}} {\bar{e}}_{k} (θ)\} .

(9)

Proposition 2.

Let the elements of matrices defining system (1) be known differentiable functions of a parameter

θ \in R^{p}

. Then, for a given value of parameter θ, the values of identification criterion

J_{U D} (θ; Z_{1}^{M})

and its gradient

▽_{θ} J_{U D} (θ; Z_{1}^{M})

can be evaluated according to Equations (5) and (9) using Algorithm 2.

Proof.

Calculating the values of identification criterion

J_{U D} (θ; Z_{1}^{M})

and its gradient

▽_{θ} J_{U D} (θ; Z_{1}^{M})

according to (5) and (9) requires the values of

{\bar{e}}_{k} (θ)

,

D_{Σ_{k} (θ)}

, and their sensitivities

\partial {\bar{e}}_{k} (θ) / \partial θ_{i}

,

\partial D_{Σ_{k} (θ)} / \partial θ_{i}

for

k = 1, \dots, M

;

i = 1, \dots, p

. The values of these quantities are available in Steps 10 and 11 of Algorithm 2. □

Thus, for a given value of parameter

θ

, Algorithm 2 allows us to obtain all the quantities necessary to calculate the values of the identification criterion and its gradient that are used in the parameter identification gradient-based method.

4. Discussion

As a practical application of the proposed method, let us consider the parameter identification problem for a nearly constant velocity model of the uniform motion augmented with multiplicative noises [11]:

\{\begin{matrix} x_{k} & = ([\begin{matrix} 1 & θ \\ 0 & 1 \end{matrix}] + [\begin{matrix} 0 & 0 \\ 0 & 1 \end{matrix}] ξ_{k - 1}) x_{k - 1} + [\begin{matrix} \frac{θ^{2}}{2} \\ θ \end{matrix}] w_{k - 1}, \\ z_{k} & = ([\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}] + [\begin{matrix} 0 & 0 \\ 0 & 1 \end{matrix}] ζ_{k}) x_{k} + v_{k}, k = 1, \dots, 100, \end{matrix}

(10)

where

x_{k} = {[x_{1}, x_{2}]}_{k}^{T}

,

x_{1} = x

is the coordinate of the object,

x_{2} = v_{x}

is its velocity,

x_{0}

∼

N ({[0, 1]}^{T}, 10 I_{2})

,

w_{k}

∼

N (0, 10^{- 2})

,

v_{k}

∼

N (0, σ^{2} I_{2})

(

σ = 0.1, 0.5, 1.0

),

ξ_{k}

∼

N (0, 10^{- 4})

,

ζ_{k}

∼

N (0, 10^{- 4})

, and

θ

is the model parameter to be identified. Let us put the “true” value of the parameter equal to

θ^{*} = 0.3

.

To demonstrate the validity of the proposed approach, we have conducted numerical experiments in MATLAB, which is a common tool for simulation, optimization, control, and filtering [8,20,21]. We have implemented all necessary functions for simulating system dynamics and measurements, as well as functions for calculating the identification criterion (5) and its gradient (9). Figure 1 shows the identification criterion and its gradient for the considered problem, with noise covariance matrix

R = 0 . 5^{2} I_{2}

averaged over 100 runs.

Figure 1. Identification criterion (a) and its gradient (b).

The following MATLAB functions were used for numerical minimization of the identification criterion: simulannealbnd, ga, and fmincon. The first two functions implement gradient-free metaheuristic algorithms SA (Simulated Annealing) and GA (Genetic Algorithm), respectively. The third function was configured to use two different gradient-based algorithms: interior-point (IP) and trust-region-reflective (TRR). The first algorithm estimates the gradient using finite differences, and the second algorithm uses a user-provided gradient of the objective function.

A series of 100 numerical experiments was conducted for each value of the noise level

σ

. In each experiment, numerical identification of parameter

θ

was performed based on the results of simulated measurements. The solution,

θ^{*}

, was searched on the segment

[0.1; 1]

. The initial point for SA and both gradient-based algorithms was chosen randomly in each experiment.

Table 1 presents the main settings of the optimizers used in the numerical experiments. The remaining settings are taken by default. All experiments were conducted on the following platform: MATLAB R2017a, Windows 11, Intel Core i3-1115G4 CPU @ 3.00 GHz, 8 GB of RAM.

Table 1. Optimizer settings.

Table 2 provides the average number of iterations and running times for all optimizers. It can be seen that, at the chosen settings, gradient-based algorithms run significantly faster than non-gradient algorithms. The TRR algorithm is about 10% faster and requires fewer iterations than the IP algorithm. The SA algorithm works on average two times faster than GA, whose performance largely depends on the population size, which in this series of experiments was taken to be 10. Reducing the population size can proportionally reduce the running time of this algorithm, but it can also lead to a decrease in accuracy.

Table 2. Average number of iterations and time, in seconds.

The results of numerical identification of parameter

θ

are summarized in Table 3. They show that, for the problem under consideration with the selected settings, all algorithms demonstrate approximately the same mean accuracy. RMSE and MAPE values decrease with decreasing noise levels, but for the SA algorithm, they remain slightly larger than for other algorithms.

Table 3. Identification results.

5. Conclusions

In this paper, we have proposed a new method for solving the parameter identification problem for discrete-time linear stochastic systems with multiplicative and additive noises based on the application of the covariance UD-filter and the original method for the state sensitivity evaluation within the numerically stable, matrix-orthogonal MWGS-UD transformation.

The main theoretical results of the paper are the UD-based state sensitivity evaluation method and the method of calculating the values of identification criterion

J_{U D} (θ; Z_{1}^{M})

and its gradient, which are formulated in Propositions 1 and 2, respectively. Both methods use Algorithm 2 to calculate the state vector estimates and their sensitivity values. In addition to the numerical stability of the MWGS orthogonalization procedure to machine roundoff errors, the main advantage of the proposed method is the possibility of analytical calculation of derivative values at a given value of the identified parameter without the need to use finite-difference methods.

Numerical experiments confirm that the obtained results can be used for solving the parameter identification problems of the considered stochastic systems. The gradient-based minimization of the identification criterion that uses the proposed method outperforms both non-gradient algorithms and the algorithm that estimates gradients using finite differences.

Author Contributions

Conceptualization, A.T. and Y.T.; methodology, A.T. and Y.T.; software, A.T.; validation, Y.T.; formal analysis, A.T. and Y.T.; investigation, A.T. and Y.T.; resources, A.T. and Y.T.; data curation, A.T. and Y.T.; writing—original draft preparation, Y.T.; writing—review and editing, A.T. and Y.T.; visualization, A.T. and Y.T. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Russian Science Foundation at Ulyanovsk State University of Education, grant no. 22-21-00387, https://rscf.ru/en/project/22-21-00387/ (accessed on 24 November 2023).

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

MWGS	Modified weighted Gram–Schmidt orthogonalization
MWGS-UD	MWGS based on the UD factorization
KF	Kalman filter
TU	Time update
MU	Measurement update
MPE	Minimum prediction error
GA	Genetic algorithm
SA	Simulated annealing
RMSE	Root mean square error
MAPE	Mean absolute percentage error
IP	Interior point algorithm
TRR	Trust region reflective algorithm

References

Wu, Y.; Zhang, Q.; Shen, Z. Kalman filtering with multiplicative and additive noises. In Proceedings of the 12th World Congress on Intelligent Control and Automation (WCICA 2016), Guilin, China, 12–15 June 2016; pp. 483–487. [Google Scholar]
Ljung, L. System Identification: Theory for the User, 2nd ed.; Prentice Hall PTR: Upper Saddle River, NJ, USA, 1999. [Google Scholar]
Gevers, M. A personal view of the development of system identification: A 30-year journey through an exciting field. IEEE Control Syst. Mag. 2006, 26, 93–105. [Google Scholar]
Ljung, L. Perspectives on system identification. Annu. Control 2010, 34, 1–12. [Google Scholar] [CrossRef]
Åström, K.-J.; Bohlin, T. Numerical Identification of Linear Dynamic Systems from Normal Operating Records. In Proceedings of the Second IFAC Symposium on the Theory of Self-Adaptive Control Systems, Teddington, UK, 14–17 September 1966; pp. 96–111. [Google Scholar]
Ho, B.L.; Kalman, R.E. Effective construction of linear state-variable models from input/output functions. Regelungstechnik 1966, 14, 545–548. [Google Scholar]
Gupta, N.K.; Mehra, R.K. Computational aspects of maximum likelihood estimation and reduction in sensitivity function calculations. IEEE Trans. Autom. Control 1974, AC-19, 774–783. [Google Scholar] [CrossRef]
Grewal, M.S.; Andrews, A.P. Kalman Filtering: Theory and Practice Using MATLAB, 4th ed.; John Wiley & Sons, Inc.: New York, NY, USA, 2015. [Google Scholar]
Tsyganov, A.V.; Tsyganova, J.V.; Kureneva, T.N. UD-based Linear Filtering for Discrete-Time Systems with Multiplicative and Additive Noises. In Proceedings of the 19th European Control Conference, Saint Petersburg, Russia, 12–15 May 2020; pp. 1389–1394. [Google Scholar]
Kailath, T.; Sayed, A.; Hassibi, B. Linear Estimation; Prentice Hall: Upper Saddle River, NJ, USA, 2000. [Google Scholar]
Tsyganov, A.; Tsyganova, Y. SVD-Based Identification of Parameters of the Discrete-Time Stochastic Systems Models with Multiplicative and Additive Noises Using Metaheuristic Optimization. Mathematics 2023, 11, 4292. [Google Scholar] [CrossRef]
Gibbs, B.P. Advanced Kalman Filtering, Least-Squares and Modeling: A Practical Handbook; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2011. [Google Scholar]
Åström, K.-J. Maximum Likelihood and Prediction Error Methods. Automatica 1980, 16, 551–574. [Google Scholar] [CrossRef]
Bierman, G.J. Factorization Methods for Discrete Sequential Estimation; Academic Press: New York, NY, USA, 1977. [Google Scholar]
Golub, G.H.; Van Loan, C.F. Matrix Computations; Johns Hopkins University Press: Baltimore, MD, USA, 1983. [Google Scholar]
Bierman, G.J.; Belzer, M.R.; Vandergraft, J.S.; Porter, D.W. Maximum likelihood estimation using square root information filters. IEEE Trans. Autom. Control 1990, 35, 1293–1298. [Google Scholar] [CrossRef]
Tsyganova, J.V.; Kulikova, M.V. State sensitivity evaluation within UD based array covariance filters. IEEE Trans. Autom. Control 2013, 58, 2944–2950. [Google Scholar] [CrossRef]
Tsypkin, Y.Z. Information Theory of Identification; Fizmatlit: Moscow, Russia, 1995. (In Russian) [Google Scholar]
Nocedal, J.; Wright, S.J. Numerical Optimization; Springer Series in Operations Research and Financial Engineering; Springer Nature: Berlin/Heidelberg, Germany, 2006. [Google Scholar]
Dorf, R.C.; Bishop, R.H. Modern Control Systems, 13th ed.; Pearson: Upper Saddle River, NJ, USA, 2016. [Google Scholar]
Mu, D.; Xu, C.; Liu, Z.; Pang, Y. Further Insight into Bifurcation and Hybrid Control Tactics of a Chlorine Dioxide-Iodine-Malonic Acid Chemical Reaction Model Incorporating Delays. MATCH Commun. Math. Comput. Chem. 2023, 89, 529–566. [Google Scholar] [CrossRef]

Figure 1. Identification criterion (a) and its gradient (b).

Table 1. Optimizer settings.

Optimizer	Settings
SA	‘TimeLimit’ = Inf, ‘MaxFunEvals’ = Inf, ‘MaxIter’ = Inf,
SA	‘StallIterLimit’ = 100, ‘ReannealInterval’ = 100, ‘Display’ = ‘off’
	‘TimeLimit’ = Inf, ‘Generations’ = Inf, ‘StallGenLimit’ = 20,
GA	‘PopulationSize’ = 10, ‘PopInitRange’ = [0.1; 1],
	‘MutationFcn’ = @mutationadaptfeasible, ‘Display’ = ‘off’
FMINCON	‘SpecifyObjectiveGradient’ = false, ‘Algorithm’ = ‘interior-point’,
(IP)	‘MaxFunctionEvaluations’ = Inf, ‘Display’ = ‘off’
FMINCON	‘SpecifyObjectiveGradient’ = true, ‘Algorithm’ = ‘trust-region-reflective’,
(TRR)	‘MaxFunctionEvaluations’ = Inf, ‘Display’ = ‘off’

Table 2. Average number of iterations and time, in seconds.

		SA	GA	FMINCON (IP)	FMINCON (TRR)
$σ = 1.0$	Iterations	187	33	13	7
$σ = 1.0$	Time	2.857	5.044	0.661	0.499
$σ = 0.5$	Iterations	176	36	12	7
$σ = 0.5$	Time	2.739	5.582	0.568	0.509
$σ = 0.1$	Iterations	187	42	12	7
$σ = 0.1$	Time	2.880	6.506	0.607	0.540

Table 3. Identification results.

		SA	GA	FMINCON	FMINCON
				(IP)	(TRR)
	Mean	0.287424	0.287823	0.287803	0.287802
$σ = 1.0$	RMSE	0.029494	0.028942	0.028962	0.028962
	MAPE	7.967221	7.818963	7.827929	7.827960
	Mean	0.299297	0.299802	0.299805	0.299805
$σ = 0.5$	RMSE	0.016056	0.015995	0.015996	0.015996
	MAPE	4.217731	4.183624	4.183879	4.183878
	Mean	0.298175	0.299436	0.299438	0.299438
$σ = 0.1$	RMSE	0.005688	0.003432	0.003429	0.003429
	MAPE	1.419462	0.881047	0.880159	0.880159

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Parameter Identification of the Discrete-Time Stochastic Systems with Multiplicative and Additive Noises Using the UD-Based State Sensitivity Evaluation

Abstract

1. Introduction

2. Methods

2.1. The Problem Statement

2.2. The UD-Based Covariance Filtering Algorithm for Discrete-Time Stochastic Systems with Multiplicative and Additive Noises

2.3. Derivative Evaluation of the MWGS-Based Array of Block Matrices

3. Main Results

3.1. The New UD-Based State Sensitivity Evaluation Method

3.2. The UD-Based Computation of the Identification Criterion and Its Gradient

4. Discussion

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Article Metrics

Citations

Article Access Statistics