Tracking Control for Output Probability Density Function of Stochastic Systems Using FPD Method

Yang, Yi; Zhang, Yong; Zhou, Yuyang

doi:10.3390/e25020186

Open AccessArticle

Tracking Control for Output Probability Density Function of Stochastic Systems Using FPD Method

by

Yi Yang

¹

,

Yong Zhang

¹

and

Yuyang Zhou

^2,*

¹

School of Information Engineering, Inner Mongolia University of Science and Technology, Baotou 014010, China

²

School of Computing Engineering and Built Environment, Edinburgh Napier University, Edinburgh EH10 5DT, UK

^*

Author to whom correspondence should be addressed.

Entropy 2023, 25(2), 186; https://doi.org/10.3390/e25020186

Submission received: 5 December 2022 / Revised: 12 January 2023 / Accepted: 13 January 2023 / Published: 17 January 2023

(This article belongs to the Special Issue Entropy and Stochastic Distribution Optimization for Large-Scale Dynamical Systems)

Download

Browse Figures

Review Reports Versions Notes

Abstract

Output probability density function (PDF) tracking control of stochastic systems has always been a challenging problem in both theoretical development and engineering practice. Focused on this challenge, this work proposes a novel stochastic control framework so that the output PDF can track a given time-varying PDF. Firstly, the output PDF is characterised by the weight dynamics following the B-spline model approximation. As a result, the PDF tracking problem is transferred to a state tracking problem for weight dynamics. In addition, the model error of the weight dynamics is described by the multiplicative noises to more effectively establish its stochastic dynamics. Moreover, to better reflect the practical applications in the real world, the given tracking target is set to be time-varying rather than static. Thus, an extended fully probabilistic design (FPD) is developed based on the conventional FPD to handle multiplicative noises and to track the time-varying references in a superior way. Finally, the proposed control framework is verified by a numerical example, and a comparison simulation with the linear–quadratic regulator (LQR) method is also included to illustrate the superiority of our proposed framework.

Keywords:

tracking control; probability density function; full probability design; B-spline model

1. Introduction

In recent years, there has been a growing interest in stochastic control, as many systems in the real world, such as those in aerospace, chemical, textile machinery and ships, can all be modelled as stochastic dynamic systems [1]. In the study of stochastic system control, for the Gaussian processes, the distribution can be controlled by only controlling its mean and variance [2]. However, many stochastic systems contain non-Gaussian processes, such as the scale distribution of flocculating particles in the paper process [3], the flame shape distribution [4], and the chemical polymerisation reaction molecular weight distribution [5]. For such stochastic systems with non-Gaussian variables, the mean and variance cannot describe the full statistical information, so that traditional methods are no longer productive [6]. In order to solve this problem, Wang proposed a new stochastic control method by introducing the PDF into the control field in 1996 [7,8]. In this method, the B-spline function is applied to model the PDF, which releases the limitation of the Gaussian assumption of the traditional stochastic control methods [9]. Based on that, a series of stochastic control frameworks have been presented in both practical and theoretical disciplines. On the practical side, Li applied the PDF control method to the fibre length distribution in the grinding process to predict the random distribution of the fibre length and achieved a good control effect [3]. Sun applied this method to the flame temperature field, realised the modelling of the flame temperature field and its iterative learning control, and achieved the purpose of improving the control effect [4]. In theoretical terms, a PDF shape control method using the FPK equation for nonlinear stochastic systems with an arbitrary expression was proposed in [10], where the exact stationary solution of the FPK equation was derived and the controller was designed for a non-polynomial nonlinear function. Reference [11] presented the rational square root B-spline model and introduced the concept of the pseudo-weight, which makes the control algorithm almost unconstrained and the system analysis relatively simple. In these studies, most correlation methods ignore the model error of the B-spline function or simply assume the error as steady-state residuals. Thus, the control methods that have been applied in most of these studies are LQR methods. However, for some complicated practical systems, the PDF model error can be involved in a stochastic dynamical manner [12]. Ignoring that would affect the control performance and even lead to divergence. For such complex practical processes, multiplicative noise, which is also known as state-dependent noise, can better characterise the uncertainty of the model parameters and be more in line with the actual systems [13]. In this way, the LQR method is no longer suitable for dealing with noises and uncertainties, which are strongly dependent on the system state. Thus, finding a more advanced control algorithm aiming at multiplicative noise is crucial.

Target tracking has always been a hot topic in both theoretical research and practical applications [14]. Unlike the general tracking problem, for the stochastic systems, it usually requires the system to track a distribution rather than a single value. For the Gaussian processes, the controller can be simply designed to track the mean and variance of the given distribution. For the non-Gaussian processes, there are many algorithms and literature works published under the B-spline model framework proposed by Wang. For instance, Zhou designed the observer to control the observed state to track the actual state of the system [15]. Luan added noise to the model to control the output PDF by the optimal control algorithm [16]. In most of the related studies, they only considered the static tracking targets and neglected the time-varying targets. However, in many industrial plants, in order to meet the field requirements, the tracking target can be time-varying, which increases the difficulties for controller design.

In summary, based on the aforementioned problems, the challenges of stochastic systems include, but are not limited to the following: (1) most existing stochastic control algorithms are limited to the Gaussian assumption and have poor control performance once the systems contain non-Gaussianity or nonlinearity; (2) for the published control frameworks under the B-spline approximation model proposed by Wang [8], even though they have successfully lifted the Gaussian assumption of the traditional stochastic method, they have failed to involve the uncertainties and errors of the model parameters in a stochastic dynamical manner; (3) The majority of stochastic tracking problems consider static references other than the time-varying reference, which brings problems to practical applications. To address these issues, this paper proposes a novel B-spline model-based control framework using the fully probabilistic control design (FPD) [17,18]. In this method, a linear B-spline model is applied to describe the distribution of the system output [11,16]. Besides the B-spline model, there are other approximation models that can also be used to fit the PDF curve such as the RBF model [19], the input and output ARMAX model [20], the neural network PDF model, and so on [21]. For the RBF model, the existing research shows that there are few intermediate parameters in RBF model control, but the dynamical relationship between the input and output cannot be reflected. Furthermore, the model accuracy of the RBF model is relatively poor especially when few basis functions are chosen. Although the input–output ARMAX model can reflect the dynamic relationship between the input and output to some extent, it has difficulty controlling the shape of the system when designing the controller. The neural network PDF model can achieve satisfactory control results; however, for a multi-input multi-output system, the number of basis functions will increase exponentially with the increase of the complexity of the system. Compared with the other types of approximation models mentioned above, the B-spline model is the most widely used and is easier to convert to the state-space dynamical weight model. Furthermore, compared to the other types of B-spline model such as the square root B-spline model [22], rational B-spline model [23], and rational square root B-spline model [24], the linear B-spline model is relatively simple and unique in expression. Based on the B-spline approximation principle [25], this model approximates the system output probability density function by n pre-selected B-spline basis functions. Due to the distribution constraints, only

n - 1

of the n weights in this model are independent of each other [26]. Therefore, the control of the system output PDF can be achieved by controlling

n - 1

independent weights. Thus, the control goal is transferred from altering the shape of the PDF to a desired PDF to controlling the weights of the B-spline model to a pre-selected weight set. Besides, the PDF model error is characterised as multiplicative noise [27,28], indicating that the weight error of the PDF model is involved in a dynamical manner. Considering that, the randomised controller FPD was implemented in this paper to achieve the control goal [29,30]. The FPD is evaluated by minimising the KL-divergence between the distribution of the system dynamics and the desired distribution. To better cope with multiplicative noises, the extended FPD proposed by Zhou [12] was applied in this paper. This extended FPD is strongly intuitively appealing and provides an explicit minimisation strategy that exhibits better convergence, as well as shorter response times when dealing with multiplicative noise [30].

To sum up, under the B-spline framework proposed by Wang, the purpose of this paper is to design a randomised control method so that the weight of the dynamics can track a target time-varying weight, thus realising PDF shape tracking and providing a theoretical basis for PDF tracking for non-Gaussian stochastic systems in the actual industry. The contribution of this paper can be summarised as follows:

1: The linear B-spline model is implemented to characterise the system PDF, thus converting the PDF shape control problem into the weight control problem;
2: The PDF model error is represented as multiplicative noise, indicating the stochasticity and dynamics of the weight error;
3: The dynamics of the weights are characterised by the stochastic state space model, thus providing full stochastic properties;
4: The extended FPD is applied to better cope with the multiplicative noises existing in the dynamics of the weight model. Compared with the conventional FPD [29], the extended FPD can better cope with multiplicative noises by modifying the Riccati equations. Moreover, we improved the extended FPD in [12] so that the system state can track the time-varying targets.

The rest of this paper is organised as follows. Section 2 states a description of the problem. In Section 3, the optimal control law is solved based on the performance metrics and the implementation algorithm is provided. In Section 4, the controller is applied to a numerical example and a comparison simulation with the LQR method is also included. Finally, the conclusion and future work are summarised in Section 5.

2. Problem Description

2.1. PDF Description Based on B-Spline

For the output PDF of the controlled system, if the PDF is obtained by solving partial differential equations, it is very challenging to model by first principles, thus bringing difficulties to obtaining an effective control strategy [31]. To address that, the B-spline model can be applied to fit the PDF curve by the relationship between the weights and basis functions. Assuming that the interval [a, b] is known and the output PDF

γ (y)

is continuous and bounded on the interval [a, b], the output PDF can be represented using n B-spline basis functions as follows:

γ (y) = \sum_{i = 1}^{n} w_{i} B_{i} (y),

(1)

where

w_{i}

,

(i = 1, 2, . . ., n)

is the weight and

B_{i} (y)

is the pre-selected n basis functions. There are different functions that can be chosen as basic functions such as the Gaussian function, radial basis function, and wavelet basis function. From the properties of the B-spline function, it can be used to approximate any continuous function defined on a compact set [25]. A notable advantage of the B-spline method is the exact and smooth surfacewise description of the curve [32]. Compared with the RBF basis function, when the number of basis functions is low, the B-spline basis function is more suitable for curve fitting. This will be proven later by the comparison simulation results in Section 4. Since

γ (y)

is a PDF defined on the interval [a, b], it should satisfy the following constraint:

\int_{a}^{b} γ (y) = 1 .

(2)

As Equation (2) needs to be guaranteed, it follows that only the

n - 1

weights are independent of each other, for which the distribution can be written in the following form:

\begin{matrix} γ (y) = C_{0} (y) x + L (y), \end{matrix}

(3)

\begin{matrix} x = {[w_{1}, w_{2}, . . ., w_{n - 1}]}^{T} \in R^{n - 1 \times 1}, \end{matrix}

(4)

\begin{matrix} L (y) = {(\int_{a}^{b} B_{n} (y) d y)}^{- 1} B_{n} (y) \in R^{1 \times 1}, \end{matrix}

(5)

\begin{matrix} C_{0} (y) = {[\begin{matrix} B_{1} (y) - \frac{B_{n} (y)}{\int_{a}^{b} B_{n} (y) d y} \int_{a}^{b} B_{1} (y) d y \\ B_{2} (y) - \frac{B_{n} (y)}{\int_{a}^{b} B_{n} (y) d y} \int_{a}^{b} B_{2} (y) d y \\ ⋮ \\ B_{n - 1} (y) - \frac{B_{n} (y)}{\int_{a}^{b} B_{n} (y) d y} \int_{a}^{b} B_{n - 1} (y) d y \end{matrix}]}^{T} \in R^{1 \times n - 1}, \end{matrix}

(6)

where

x

is the weight set,

C_{0} (y)

is the basis function vector, and

L (y)

is the basis function scalar. From Equations () and (6), we can see that

C_{0} (y)

and

L (y)

are known once the basis functions are chosen.

According to Equations (1)–(6), we can see that, by using the B-spline model, controlling the output PDF shape can be realised by controlling

n - 1

independent weight vectors [33].

2.2. PDF Tracking Control Problem

Figure 1 demonstrates the tracking diagram of the system B to the system A. The target system A has an unknown structure and can monitor the output in real time, which means that for any k moments, the output PDF distribution

g_{k} (y)

of the system A is known. System B is a known controlled system with control input u and output

γ_{k} (y, u_{k})

. The goal here is to make the output distribution of System B track the output distribution of System A, where

D

characterises the difference between the two distributions. According to Figure 1, the system and tracking control problems mentioned above will be described in detail in the following section.

Consider the stochastic system with output PDF

γ_{k} (y)

, whose dynamics is described by:

γ_{k + 1} (y) = f (γ_{k} (y), u_{k}),

(7)

where the distribution of the system output y is

γ_{k} (y)

and

u_{k}

is the system input.

By applying the B-spline model (3), the output PDF

γ_{k} (y)

of the tracking system B is described as follows:

γ_{k} (y) = C_{0} (y) x_{k} + L (y),

(8)

where

x_{k}

are the weights corresponding to the basis functions.

The tracking target

g_{k} (y)

is a dynamic target PDF that varies dynamically with time in the following form:

g_{k} (y) = C_{0} (y) V_{k} + L (y),

(9)

where

V_{k}

are the pre-set target weights corresponding to each basis function.

Note that the B-spline basis functions need to be selected in advance. Then, the shaping distribution problem of the output PDF

γ (y, u_{k})

on the interval [a, b] can be realised by controlling the weight state

x_{k}

. Based on the B-spline model framework given in [8], the dynamics of the weight states

x_{k}

of the B-spline model-based PDF are described as follows:

x_{k + 1} = G x_{k} + H u_{k} + D x_{k} E_{k},

(10)

where

G \in R^{n - 1 \times n - 1}

,

H \in R^{n - 1 \times 1}

are the corresponding weight system parameters and

E_{k}

represents the state-based model randomness whose distribution is given by

E_{k} ∽ (0, M),

(11)

where M is the variance of

E_{k}

.

The control flow chart is shown above in Figure 2. After pre-selecting the B-spline basis function, the time-varying target weight

V_{k}

in Figure 2 can be obtained according to the target distribution through the B-spline principle. The system input

u_{k}

is obtained by evaluating the target weight and the system weight through the FPD controller, which will be introduced in the next section. The weight

x_{k + 1}

is then updated by B-spline principle modelling and input

u_{k}

. According to the relationship between the weight and the basis function, the output distribution is then obtained. In addition, it should be noted that the model error part

E_{k}

is represented by multiplicative noise, and

D

is the weight matrix with the appropriate dimension. Finally, the weights are iteratively updated according to the model to achieve output distribution control.

Following this sequence, our control goal was to design a randomised controller so that the weight

x_{k}

can follow the target weight

V_{k}

. The PDF shape-tracking control problem is transformed into a weight tracking control problem. The controller design will be introduced in the next section.

3. Control Algorithm

In this section, the FPD control algorithm is introduced to achieve the tracking goals for the weight state. The main reason we chose FPD is that we considered the multiplicative noise in the weight dynamical system. Moreover, FPD not only fully respects the stochastic nature of the system, but also provides a very detailed implementation procedure. The details will be given as follows.

3.1. General Control Solution of FPD

In FPD, the difference between the actual distribution

f (D)

of the system and the target distribution

f^{I} (D)

is measured by the Kullback–Leibler divergence (KLD) [34], where

D = (x_{k}, u_{k})

represents the observation data. The closer the distance between the output distribution and the target distribution, the greater the degree of similarity and, conversely, the smaller the degree of similarity [35]. The formula for the KLD is given by

D (f | | f^{I}) = \int f (D) l n \frac{f (D)}{f^{I} (D)} d D,

(12)

where

D

is the relative entropy, directional divergence, or KLD and

D

has the key property of non-negativity, i.e.,

D (f | | f^{I}) \geq 0

.

Assuming that

u_{k}

is the input to the system and

k \in k_{0}, k_{1}, . . ., k_{n}

, all states

x_{k}

of the system are assumed to be measurable; the joint PDF of the closed-loop system from moment

k_{0}

to moment

k_{n}

is:

f (D) = \prod_{k = k_{0}}^{k_{n}} s (x_{k} | x_{k - 1}, u_{k - 1}) c (u_{k - 1} | x_{k - 1}),

(13)

where

s (x_{k} | x_{k - 1}, u_{k - 1})

denotes the distribution of the system dynamics at moment k and

c (u_{k - 1} | x_{k - 1})

denotes the distribution of the controller at moment k. Similar to the joint PDF system, the target PDF

f^{I} (D)

is given by

f^{I} (D) = \prod_{k = k_{0}}^{k_{n}} s^{I} (x_{k} | x_{k - 1}, u_{k - 1}) c^{I} (u_{k - 1} | x_{k - 1}),

(14)

where

s^{I} (x_{k} | x_{k - 1}, u_{k - 1})

denotes the desired distribution at moment k, while

c^{I} (u_{k - 1} | x_{k - 1})

denotes the desired distribution of the controller at moment k.

The control strategy is to find an optimal randomised controller to bring the distribution of the system dynamics as close as possible to a target distribution, thus minimising the KLD given in Equation (12). The following performance indicators can be established according to Equations (12)–(14):

\begin{matrix} - l n (γ (x_{k - 1})) = min_{\{c {(u_{k - 1} | x_{k - 1})}_{τ > k}^{k_{n}}\}} \sum_{τ = k}^{k_{n}} \int f (D_{τ} | x_{k - 1}) \times \\ l n (\frac{s (x_{τ} | x_{τ - 1}, u_{τ - 1}) c (u_{τ - 1} | x_{τ - 1})}{s^{I} (x_{τ} | x_{τ - 1}, u_{τ - 1}) c^{I} (u_{τ - 1} | x_{τ - 1})}) d (D_{τ}), \end{matrix}

(15)

where, for any

τ \in k_{0}, k_{1}, . . ., k_{n}

, the recursive form of Equation (15) can be obtained according to the dynamic programming method as follows:

\begin{matrix} - l n (γ (x_{k - 1})) & = min_{c (u_{k - 1} | x_{k - 1})} \int s (x_{k} | x_{k - 1}, u_{k - 1}) c (u_{k - 1} | x_{k - 1}) \\ \times [l n (\frac{s (x_{k} | x_{k - 1}, u_{k - 1}) c (u_{k - 1} | x_{k - 1})}{s^{I} (x_{k} | x_{k - 1}, u_{k - 1}) c^{I} (u_{k - 1} | x_{k - 1})} - l n (γ (x_{k}))] d (x_{k}, u_{k - 1}) . \end{matrix}

(16)

Thus, the optimal control law

c^{*} (u_{k - 1} | x_{k - 1})

that minimises the performance index (16) can then be evaluated as follows based on FPD:

c^{*} (u_{k - 1} | x_{k - 1}) = \frac{c^{I} (u_{k - 1} | x_{k - 1}) e x p [- β_{1} (u_{k - 1}, x_{k - 1}) - β_{2} (u_{k - 1}, x_{k - 1})]}{γ (x_{k - 1})},

(17)

where

γ (x_{k - 1}) = \int c^{I} (u_{k - 1} | x_{k - 1}) e x p [- β_{1} (u_{k - 1}, x_{k - 1}) - β_{2} (u_{k - 1}, x_{k - 1})] d u_{k - 1}

,

β_{1} (u_{k - 1}, x_{k - 1}) = \int s (x_{k} | x_{k - 1}, u_{k - 1}) [l n \frac{s (x_{k} | x_{k - 1}, u_{k - 1})}{s^{I} (x_{k} | x_{k - 1}, u_{k - 1})}] d x_{k}

,

β_{2} (u_{k - 1}, x_{k - 1}) = - \int s (x_{k} | x_{k - 1}, u_{k - 1}) l n (γ (x_{k})) d x_{k}

.

The proof of Equation (17) can be found in [12].

3.2. FPD Control Solution for the Weight Dynamic System

To apply the FPD, all the variables will be characterised as stochastic distributions. Based on Equation (10), the weight dynamic distribution is given as follows with

μ_{k}

as the mean and

Z_{k}

as the covariance:

s (x_{k} | x_{k - 1}, u_{k - 1}) ∽ N (μ_{k}, Z_{k}),

(18)

where

\begin{matrix} μ_{k} & = G x_{k - 1} + H u_{k - 1}, \\ Z_{k} & = c o v (x_{k} | x_{k - 1}, u_{k - 1}) \\ = E \{(x_{k} - μ_{k}) {(x_{k} - μ_{k})}^{T}\} \\ = E \{D x_{k - 1} E_{k - 1}^{T} x_{k - 1}^{T} D^{T}\} \\ = D x_{k - 1} M x_{k - 1}^{T} D^{T} . \end{matrix}

The ideal weight distribution is assumed to be

s^{I} (x_{k} | x_{k - 1}, u_{k - 1}) ∽ N (V_{k}, R_{k}),

(19)

where

V_{k}

is the target weight of the system distribution at instant k and

R_{k}

is its ideal covariance. Note that

V_{k}

is dynamic and time-varying, so as to better simulate the real situation.

The distribution of the ideal controller can also be formulated as

c^{I} (u_{k - 1} | x_{k - 1}) ∽ N (θ_{k - 1}, Γ),

(20)

where

Γ

is the ideal covariance of the controller and

θ_{k - 1}

is the ideal mean of the optimal control signal at each moment of the controller, which can be evaluated as follows:

\begin{matrix} θ_{k - 1} & = {(H^{T} H)}^{- 1} H^{T} (V_{k} - G V_{k - 1}) . \end{matrix}

(21)

Therefore, given the system output distribution (18) and the target distribution (19) and (20), following the general PFD scheme Equation (17), the optimal distribution of the tracking controller is given by the following theorem.

Theorem 1.

Under the PDF (18) describing the dynamic weights of the system, the optimal controller distribution for minimising the performance index function (17) is given as follows:

c^{*} (u_{k - 1} | x_{k - 1}) ∽ N ({\bar{u}}_{k - 1}, Γ_{k - 1}),

(22)

where

\begin{matrix} {\bar{u}}_{k - 1} & = - K_{k - 1} x_{k - 1} + d_{k - 1}, \\ K_{k - 1} & = Γ_{k - 1} H^{T} Q_{k} G, \\ Q_{k} & = R_{k}^{- 1} + S_{k}, \\ Γ_{k - 1} & = {(Γ^{- 1} + H^{T} Q_{k} H)}^{- 1}, \\ d_{k - 1} & = Γ_{k - 1} [Γ^{- 1} θ_{k - 1} + H^{T} R_{k}^{- 1} V_{k} - 0.5 H^{T} P_{k}^{T}], \end{matrix}

(23)

and we establish the following performance indicator function:

- l n (γ (x_{k - 1})) = 0.5 x_{k - 1}^{T} S_{k - 1} x_{k - 1} + 0.5 P_{k - 1} x_{k - 1} + 0.5 q_{k - 1},

(24)

where

\begin{matrix} S_{k} = & - G^{T} Q_{k} H Γ_{k - 1} H^{T} Q_{k}^{T} G + G^{T} Q_{k} G + D x_{k - 1} M x_{k - 1}^{T} D^{T}, \\ P_{k - 1} = & (P_{k} - 2 V_{k}^{T} R_{k}^{- 1}) G + 2 θ_{k - 1}^{T} Γ^{- 1} Γ_{k - 1} H^{T} Q_{k} G + 2 V_{k}^{T} R_{k}^{- 1} H Γ_{k - 1} H^{T} Q_{k} G \\ - P_{k} H Γ_{k - 1} H^{T} Q_{k} G, \end{matrix}

(25)

where Equation (24) is the Riccati expression of the performance index,

S_{k}

is the quadratic Riccati equation,

P_{k}

is the linear term of the Riccati equation, and

q_{k - 1}

stands for some normal numbers that do not contribute to the controller, thus omitted here. In addition,

{\bar{u}}_{k - 1}

is the mean of the optimal controller,

Γ_{k - 1}

stands for the covariance, and

K_{k - 1}

is the controller feedback gain. Moreover,

d_{k - 1}

is a linear shift caused by the considered tracking control problem.

Proof.

The proof of this theorem can be evaluated following the same procedure in our previous publication [12], thus omitted here. □

Remark 1.

The FPD algorithm given by Equation (17) is the general solution to the fully probabilistic control design irrespective of the type of distribution describing the system dynamics or whether the system is linear or nonlinear. It can be derived numerically or analytically. The specific solution of the FPD given by Theorem 1 is the analytical solution for this specific case.

The proposed control framework for the B-spline modelled PDF shaping using the FPD method is summarised by Algorithm 1.

Algorithm 1: Tracking control framework with output probability density function.

4. Simulation Result

In this section, the proposed framework is tested on a numerical example to illuminate the effectiveness of the proposed control method. Moreover, to better demonstrate the advantages of the proposed framework, two comparison simulations are also included.

For the discrete system, the following B-spline basis functions were selected to approximate the linear model.

\begin{matrix} B_{1} (y) & = 0.5 (y^{2} + 6 y + 9) I_{1} + (- y^{2} - 3 y - 1.5) I_{2} + 0.5 y^{2} I_{3}, \\ B_{2} (y) & = 0.5 (y^{2} + 4 y + 4) I_{2} + (- y^{2} - y + 0.5) I_{2} + 0.5 (y^{2} - 2 y + 1) I_{4}, \\ B_{3} (y) & = 0.5 (y^{2} + 2 y + 1) I_{3} + (- y^{2} + y + 0.5) I_{4} + 0.5 (y^{2} - 4 y + 4) I_{5}, \\ B_{4} (y) & = 0.5 y^{2} I_{4} + (- y^{2} + 3 y - 1.5) I_{5} + 0.5 (y^{2} - 6 y + 9) I_{6}, \end{matrix}

(26)

where

I_{i} = \{\begin{matrix} 1 & y \in [i - 4, i - 3] \\ 0 & o t h e r s \end{matrix}

,

i = 1, . . ., 6

.

Firstly, the comparison simulation of the approximation between the B-spline model and the RBF model with the following Gaussian basis function is presented to show the advantages of the B-spline model.

\begin{matrix} h_{j} (y) = e x p (- \frac{{(y - c_{j})}^{2}}{b_{j}^{2}}) i = 1, . . ., n_{j}, \end{matrix}

(27)

where j is the order of nodes,

n_{j}

is the total number of nodes of the RBF basis function,

c_{j}

and

b_{j}

are the centre value and width, and variable

y \in [a, b]

. The same as the B-spline model, four basis functions of the RBF model were chosen to fit the target PDF curve. The simulation result is shown in Figure 3 and Figure 4. Figure 3 shows the fitting effect of the B-spline basis function and RBF basis function on the curve, where the red real line is the fitting curve of the B-spline model, the green real line is the fitting curve of the RBF model, and the black dotted line is the target curve. Figure 4 demonstrates the fitting errors of the two models, where the red line is the fitting error of the B-spline model and the green solid line is the fitting error of the RBF model. From Figure 3, it can be seen that, for the same number of basis functions, the B-spline model has a better fitting performance than the RBF model. It can also be proven by Figure 4, which shows clearly that the B-spline model has a much smaller fitting error than the RBF model. Therefore, the result proves that the B-spline model is more suitable for PDF fitting when the number of basis functions is low. As a state space model, the weight dynamic system from the approximation model generally has a lower number of basis functions, which makes the B-spline model more suitable in such cases.

To verify the control effectiveness of the proposed control algorithm, the following example, which uses the same four basis functions (26), is given as below. Due to the existence of the constraint condition (2), only three corresponding weights will be required out of four B-spline basis functions. The fourth weight is linearly related to the first three weights so that the model order is reduced to three. The coefficient matrix of the system model is

\begin{matrix} x_{k + 1} = G x_{k} + H u_{k} + D x_{k} E_{k}, \end{matrix}

(28)

with

G = [\begin{matrix} 0.555 & - 0.098 & - 0.041 \\ - 0.1 & - 0.734 & 0.181 \\ - 0.292 & 0.02 & 0.291 \end{matrix}]

,

H = [\begin{matrix} 0.275 \\ 0.302 \\ 0.302 \end{matrix}]

,

D = [\begin{matrix} 0.41 & 1.66 & 0.51 \\ - 0.11 & 0.215 & 0.16 \\ 0.31 & 0.02 & - 1.005 \end{matrix}]

, where

G

is the state weight matrix,

H

is the control matrix,

D

is the random weighting matrix of the noise term, and

E_{k}

is the Gaussian noise. The target weight is given by

V_{g} = {[\begin{matrix} 0.4229 & 0.1217 & 0.1487 \end{matrix}]}^{T}

for the first 100 steps and

V_{g} = {[\begin{matrix} 0.5908 & 0.1701 & 0.2077 \end{matrix}]}^{T}

for the remaining steps. In addition, the system initial weight state is

x_{0} = {[\begin{matrix} 0.2673 & 0.0969 & 0.1897 \end{matrix}]}^{T}

; the covariance of the noise is subjected to the distribution given by

E_{k} ∽ N (0, 0.004)

; the ideal state covariance

R

was chosen as

R = d i a g [\begin{matrix} 0.4 & 0.004 & 10 \end{matrix}]

.

In order to verify the tracking performance of the algorithm proposed in this paper, the results were compared with the conventional LQR algorithm. The following simulation results can be obtained following Algorithm 1. The state feedback gain matrix

K = [\begin{matrix} - 0.2544 & - 2.2287 & 0.535 \end{matrix}]

is calculated according to Equation (23). The state feedback gain matrix

K_{L} = [\begin{matrix} 0.2814 & - 0.9455 & 0.3609 \end{matrix}]

is calculated according to LQR algorithm. Figure 5 shows the four weights of the FPD algorithm and LQR algorithm and their target reference curves, respectively. More specifically, in Figure 5, the red line is the weight-tracking curve under the FPD algorithm, while the blue line represents the weight-tracking curve under the LQR algorithm. Note that the weights

V_{1}

,

V_{2}

, and

V_{3}

are the controlled states, while the weight

V_{4}

has a linear relationship with the other three weights, which is not included as the controlled state due to the existence of the constraints. Therefore, the weight

V_{4}

is represented by a dotted line. From Figure 5, it can be seen that, even with the effect of the multiplicative noises, both the FPD algorithm and the LQR algorithm can successfully track the given references and manage to keep tracking the new reference values when the target changes at the 100th step. Compared with the LQR algorithm, we can see that the FPD algorithm shows a better tracking effect and smaller tracking error under the effect of multiplicative noise. Figure 6 indicates the system control inputs, where the red line is the control input curve under the FPD algorithm and the blue line is the control input curve under the LQR algorithm. The results of Figure 6 show that, due to the change in the shape of the target, the input of the control target will also change synchronously. Compared with the LQR algorithm, the input of the FPD algorithm contains less randomness under multiplicative noise. Figure 7 shows the iterative process of the two algorithms controlling the output PDF curve. Compared with the results of the system under the action of the two algorithms, the FPD algorithm has a smaller error and a smoother tracking process from the initial curve to the target curve. When the target curve changes, the system control output curve will still track the new target. Therefore, it can be concluded that, compared with the LQR algorithm, the extended FPD algorithm can show a better tracking effect in the dynamic random B-spline function-based weight dynamical system with multiplicative noise and time-varying target.

5. Conclusions

This paper investigated the output distribution shape-tracking problem for a class of stochastic distribution systems under the B-spline model framework. The linear B-spline model was implemented to fit the PDF curve and simplified the PDF shaping problem to a dynamic weight-altering problem. At the same time, the randomness and the model error of the dynamic weight system were characterised as state-dependent noises, which more realistically simulate the actual complex system. In addition, the target distribution shape of the system was changed from a fixed shape to a time-varying shape, which makes the control goal more realistic and convenient to operate in the actual working process. The extended FPD algorithm was then implemented to achieve the time-varying tracking goal under the effect of the multiplicative noises. Moreover, the implementation procedure was provided step by step. Finally, the simulation results were obtained following the implementation procedure. Meanwhile, to make the experimental results more convincing, the conventional LQR algorithm was also added for comparison. As a result, the simulation showed that the proposed control framework can make the dynamic weights track their time-varying targets under the effect of multiplicative noises. Compared with the LQR algorithm, the proposed extended FPD algorithm has smaller tracking errors and stronger robustness to the multiplicative noises in the model. In addition, in order to better illustrate the suitability of the B-spline model selected in this paper, the RBF model was also added for comparison. By analysing the simulation results, it was concluded that the B-spline model has a smaller fitting error than the RBF model when the number of basis functions is small.

Overall, the method proposed in this paper can effectively solve the time-varying distribution shape-tracking problem for a class of stochastic systems. In practice, the application of the B-spline framework can productively model the non-Gaussian PDF, where most of the practical systems contain non-Gaussian variables. Moreover, using stochastic distributions to characterise the dynamics of weights can fully represent the stochastic properties. In addition, the proposed controller structure is simple and easy to follow for practical application. Thus, the proposed control method can further promote the application of PDF control. In future work, we will consider the application of the proposed framework to real-world systems.

Author Contributions

Conceptualisation, Y.Z. (Yong Zhang); methodology, Y.Z. (Yuyang Zhou); validation, Y.Y.; writing—original draft preparation, Y.Y.; resources, Y.Z. (Yong Zhang); writing—review and editing, Y.Z. (Yuyang Zhou); All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China under Grant Number 62263026, the Natural Science Foundation of Inner Mongolia under Grant Number 2020LH06006, the Natural Science Foundation of Inner Mongolia under Grant Number 2022MS06005, and the Baotou medical college young and middle aged backbone support program under Grant Number BYJJ-QNGG2022001.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Ren, M.; Zhang, Q.; Zhang, J. An introductory survey of probability density function control. Syst. Sci. Control Eng. 2019, 7, 158–170. [Google Scholar] [CrossRef]
Filip, I.; Dragan, F.; Szeidert, I.; Albu, A. Minimum-variance control system with variable control penalty factor. Appl. Sci. 2020, 10, 2274. [Google Scholar] [CrossRef]
Li, M.; Zhou, P. Predictive PDF control of output fibre length stochastic distribution in refining process. Acta Autom. Sin. 2019, 45, 1923–1932. [Google Scholar]
Sun, X.; Xun, L.; Wang, H.; Dong, H. Iterative learning control of singular stochastic distribution model of jet flame temperature field. J. Beijing Univ. Technol. 2013, 33, 523–528. [Google Scholar]
Cao, L.; Wu, H. MWD modelling and control for polymerization via B-spline neural network. J. Chem. Ind. Eng. China 2004, 55, 742–746. [Google Scholar]
Zhang, Q.; Zhou, Y. Recent advances in non-Gaussian stochastic systems control theory and its applications. Int. J. Netw. Dyn. Intell. 2022, 1, 111–119. [Google Scholar] [CrossRef]
Wang, H.; Yue, H. Output PDF control of stochastic distribution systems: Modelling control and applications. Control Eng. China 2003, 10, 193–197. [Google Scholar]
Wang, H.; Zhang, J. Bounded stochastic distributions control for pseudo-ARMAX stochastic systems. IEEE Conf. Decis. Control 2001, 46, 486–490. [Google Scholar] [CrossRef]
Wang, H. Bounded Dynamic Stochastic Systems: Modelling and Control, 1st ed.; Springer Science & Business Media: London, UK, 2000; pp. 15–34. [Google Scholar]
Wang, L.; Xie, G.; Qian, F.; Shangguan, A. Developing an innovative method to control the probability density function shape of the state response for nonlinear stochastic systems. Int. J. Robust Nonlinear Control 2021, 31, 7904–7919. [Google Scholar] [CrossRef]
Zhou, J. PDF Control and Its Application in Filtering; Institute of Automation, Chinese Academy of Sciences: Beijing, China, 2005. [Google Scholar]
Zhou, Y.; Herzallah, R.; Zafar, A. Fully probabilistic design for stochastic discrete system with multiplicative noise. In Proceedings of the 2019 IEEE 15th International Conference on Control and Automation (ICCA), Edinburgh, UK, 16–19 July 2019; Volume 19138868, pp. 940–945. [Google Scholar]
Ma, J.; Yang, X.; Sun, S. Distributed fusion estimation for multi-sensor systems with time-correlated multiplicative noises. Acta Autom. Sin. 2021, 47, 1–13. [Google Scholar]
Huang, E.; Cheng, Y.; Hu, W. Tracking control of multi-agent systems based on reset control. Control Eng. China 2022, 1–7. [Google Scholar]
Zhou, J.; Wang, H. Optimal tracking control of the output probability density functions: Square root B-spline model. Control Theory Appl. 2005, 22, 369–376. [Google Scholar]
Luan, X.; Liu, F. Finite time stabilization of output probability density function of stochastic systems. Control Decis. 2009, 24, 1161–1166. [Google Scholar]
Herzallah, R. Fully probabilistic control for stochastic nonlinear control systems with input dependent noise. Neural Netw. 2015, 63, 199–207. [Google Scholar] [CrossRef]
Herzallah, R.; Zhou, Y. A fully probabilistic control framework for stochastic systems with input and state delay. Sci. Rep. 2022, 12, 7812. [Google Scholar] [CrossRef]
Wang, H.; Afshar, P.; Yue, H. ILC-based generalised PI control for output pdf of stochastic systems using LMI and RBF neural networks. In Proceedings of the 45th IEEE Conference on Decision and Control, San Diego, CA, USA, 13–15 December 2006; pp. 5048–5053. [Google Scholar]
Zhou, P.; Liu, J. Data-driven multi-output ARMAX modelling for online estimation of central temperatures for cross temperature measuring in blast furnace ironmaking. Acta Autom. Sin. 2018, 44, 552–561. [Google Scholar]
Wang, H. Multivariable output probability density function control for non-Gaussian stochastic systems using simple MLP neural networks. In Proceedings of the IFAC International Conference on Intelligent Control Systems and Signal Processing, Algarve, Portugal, 8–11 April 2003; pp. 84–89. [Google Scholar]
Kabore, P.; Baki, H.; Wang, H. Linearized controller design for the output pdfs using square root based B-spline models. In Proceedings of the 15th IFAC World Congress, Barcelona, Spain, 21–26 July 2002; pp. 2694–2699. [Google Scholar]
Wang, H.; Yue, H. A rational spline model approximation and control of output probability density functions for dynamic stochastic systems. Trans. Inst. Meas. Control 2003, 25, 93–105. [Google Scholar] [CrossRef]
Zhou, J.; Yue, H.; Wang, H. Shaping of output pdf based on the rational square-root b-spline model. Acta Autom. Sin. 2005, 31, 343–351. [Google Scholar]
Girosi, F.; Poggio, T. Networks and the best approximation property. Biol. Cybern. 1990, 63, 169–176. [Google Scholar] [CrossRef]
Wang, H.; Zhang, J.; Yue, H. Multi-step predictive control of a PDF-shaping problem. Acta Autom. Sin. 2005, 31, 274–279. [Google Scholar]
Chen, M. Studies of Numerically Stable Estimation for Multi-Channel Systems with Multiplicative Noises; Ocean University of China: Shandong, China, 2004. [Google Scholar]
Yin, Y.; Luo, S.; Wan, T. Model-free optimal tracking control for linear discrete-time stochastic systems subject to additive and multiplicative noises. Control Theory Appl. 2022, 39, 1–10. [Google Scholar]
Kárný, M. Towards fully probabilistic control design. Automatica 1996, 32, 1719–1722. [Google Scholar] [CrossRef]
Herzallah, R.; Kárný, M. Fully probabilistic control design in an adaptive critic framework. Neural Netw. 2011, 24, 1128–1135. [Google Scholar] [CrossRef] [PubMed]
Zha, W.; Li, D.; Shen, L.; Zhang, W.; Liu, x. Review of neural network-based methods for solving partial differential equations. Chin. J. Theor. Appl. Mech. 2022, 54, 543–556. [Google Scholar]
Yang, H.; Xu, X. Multi-sensor technology for B-spline modelling and deformation analysis of composite structures. Compos. Struct. 2019, 224, 111000. [Google Scholar] [CrossRef]
Zhang, Y.; Zhou, P.; Lv, D.; Zhang, S.; Cui, G.; Wang, H. Inverse calculation of burden distribution matrix using B-spline model based PDF control in blast furnace burden charging process. IEEE Trans. Ind. Inform. 2023, 19, 317–327. [Google Scholar] [CrossRef]
Kárný, M. Axiomatisation of fully probabilistic design revisited. Syst. Control Lett. 2020, 141, 104719. [Google Scholar] [CrossRef]
Shi, S. Sensing Optimization Design of UAV Electric Actuator Operation State; Harbin Institute of Technology: Harbin, China, 2020. [Google Scholar]

Figure 1. Diagram of the tracking system.

Figure 2. System control structure diagram.

Figure 3. B-spline and RBF basis function fitting curve.

Figure 4. Error of fitting.

Figure 5. Weight-tracking curve.

Figure 6. Control input curve.

Figure 7. System output PDF of 3D drawings.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yang, Y.; Zhang, Y.; Zhou, Y. Tracking Control for Output Probability Density Function of Stochastic Systems Using FPD Method. Entropy 2023, 25, 186. https://doi.org/10.3390/e25020186

AMA Style

Yang Y, Zhang Y, Zhou Y. Tracking Control for Output Probability Density Function of Stochastic Systems Using FPD Method. Entropy. 2023; 25(2):186. https://doi.org/10.3390/e25020186

Chicago/Turabian Style

Yang, Yi, Yong Zhang, and Yuyang Zhou. 2023. "Tracking Control for Output Probability Density Function of Stochastic Systems Using FPD Method" Entropy 25, no. 2: 186. https://doi.org/10.3390/e25020186

APA Style

Yang, Y., Zhang, Y., & Zhou, Y. (2023). Tracking Control for Output Probability Density Function of Stochastic Systems Using FPD Method. Entropy, 25(2), 186. https://doi.org/10.3390/e25020186

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Tracking Control for Output Probability Density Function of Stochastic Systems Using FPD Method

Abstract

1. Introduction

2. Problem Description

2.1. PDF Description Based on B-Spline

2.2. PDF Tracking Control Problem

3. Control Algorithm

3.1. General Control Solution of FPD

3.2. FPD Control Solution for the Weight Dynamic System

4. Simulation Result

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI