1. Introduction
In the last 40 years, special attention was paid to the singular perturbation techniques applied in both analysis and synthesis of control laws with prescribed performance specifications for the regulation of systems whose mathematical models are described by a system of differential equations of high dimension, and also contain a number of small parameters multiplying derivatives of a part of the state variables of the physical phenomenon under discussion.
The large number of differential equations of the mathematical model of a physical process may be caused by the presence of some “parasitic” parameters such as small time constants, resistances, inductances, capacitances, moments of inertia, small masses, etc.
The presence of such small parameters is often a source of stiffness due to the simultaneous occurrence of slow and fast phenomena. It is known that the stiffness can produce ill-conditioning of the numerical computation involved in the process of designing the optimal control. This inconvenience leads to the idea to simplify the mathematical model by neglecting the small parameters occurring in the original model. Besides the stiffness, the necessity of the simplification of the mathematical model by neglecting the small parameters is also imposed by the fact that, in many applications, the values of these parasitic quantities are not exactly known. A fundamental problem is to check if the optimal control design based on the reduced model provides a satisfactory behavior of the full system which contains fast phenomena neglected during the designing process.
Remarkable results were obtained in the problem of the design of some near optimal controllers in the case of some deterministic systems with several time scales. Such results may be found in the monographs [
1,
2,
3,
4]. A common feature of the approaches in these works is the use of the singular perturbations techniques, initially developed in connection with the study of qualitative properties of the solutions of some classes of differential equations starting with the classical work of Tichonov [
5]. The interest for studying different problems regarding the singularly perturbed controlled systems is still increasing. For the reader’s convenience, we refer to the recent papers [
6,
7,
8,
9,
10].
Lately, the interest for studying optimal control problems for stochastic systems modeled by singularly perturbed Itô differential equations also increased. Unlike the deterministic case, where the reduced model is obtained by simply removing the small parameters, in the case of stochastic optimal control problems driven by systems of singularly perturbed Itô differential equations, the definition of the reduced model is not always intuitive and it is strongly dependent upon the intensity of the white noise type perturbations affecting the diffusion part of the fast equations of the mathematical model. Hence, problems related to singularly perturbed stochastic systems could not be viewed as simple extensions of there deterministic counterparts. This makes the study of this class of systems a challenging (and relatively not fully investigated) topic. The main results obtained in this field were published in [
11,
12,
13,
14].
Very few results have been reported in the literature dealing with several fast time scales. We cite here [
15] for the deterministic case and [
16] for the stochastic framework. Pursuing our efforts in the study of singularly perturbed stochastic systems, we consider in this paper a stochastic optimal control problem described by a quadratic performance criterion and a linear controlled system modeled by a system of singularly perturbed Itô differential equations with two fast time scales.
Unlike [
17] in the deterministic case or [
14] in the stochastic case, where the fast time scales have the same order of magnitude, in the present work, we consider the case in which the two fast time scales have different order of magnitude. More precisely, if
are the small parameters associated with the two fast time scales, the ratio
becomes the third small parameter which needs to be considered in the asymptotic analysis performed here. The most part of our study is devoted to the analysis of the asymptotic structure of the stabilizing solution of the algebraic Riccati equation involved in the computation of the optimal control of the optimization problem under consideration. The main tool in the derivation of the asymptotic structure of the stabilizing solution of the algebraic Riccati equation under consideration around the origin
is the implicit functions theorem. To this end, we first investigate the solvability of the system of reduced equations obtained setting
and
in the original algebraic Riccati equation. Unlike the deterministic case, in the stochastic framework considered in this paper, the system of the reduced equations is a system of strongly interconnected Riccati type algebraic equations. For this system of interconnected Riccati type equations we introduce the concept of stabilizing solution and provide a set of necessary and sufficient conditions which guarantee the existence of such a solution. Further, employing the stabilizing solution of the system of the reduced equations and the corresponding stabilizing gain matrices we show that one may apply the implicit functions theorem to obtain the existence, as well as the asymptotic structure of, the stabilizing solution of the algebraic Riccati equation associated with the optimal control problem under consideration. Based on the dominant part independent of the small parameters of the stabilizing gain matrix, we construct a near optimal control whose gain matrices can be computed without the knowledge of the precise values of the small parameters associated with the fast time scales.
The paper is organized as follows:
Section 2 provides the model description and the problem formulation. In
Section 3 we show how the system of reduced Riccati equations, which are not dependent upon the small parameters, can be derived. Also, we introduce the concept of the stabilizing solution for the system of reduced algebraic Riccati equations. Then, we provide conditions which guarantee the existence of this stabilizing solution. In
Section 4, we obtain the existence and the asymptotic structure of the stabilizing solution for the Riccati equation associated with the original linear quadratic control problem. Finally, we show how the asymptotic structure of the stabilizing feedback gain can be used to construct a near optimal control.
2. The Problem
Let us consider the stochastic optimal control problem asking for the minimization of the quadratic functional
along with the trajectories of the controlled system having the state space representation described by the following system of singularly perturbed Itô differential equations
In (
1) and (
2)
is the vector of the control parameters and
is the vector of state parameters,
,
;
,
. In (
1),
,
,
. In (
2),
are small parameters often not exactly known.
In order to make more intuitive the developments in this paper we assume that the small parameters satisfy the assumption:
, where are nondecreasing functions with the properties:
- (i)
if and only if , .
- (ii)
; .
In the sequel, the dependence of upon the parameter will be suppressed.
Remark 1. According to the terminology used in the framework of singularly perturbed differential equations,will be calledslow state variableswhilewill be namedfast state variables.
From the condition imposed to the values of the ratioin, it follows that the statesare faster than. That is why under the assumptionsystem (2) is a controlled system with two fast time scales. In the deterministic framework, the asymptotic structure of the solutions of some systems with several time fast scales was studied in [
18] while in [
19] were derived uniform upper bounds of the block components of the fundamental matrix solution of the systems of linear differential equations with several fast time scales.
In (
2),
is a 1-dimensional standard Wiener process defined on a given probability space
. The consideration of the case with an 1-dimensional standard Wiener process is only to easy the exposition. The extension to the case of a multidimensional Wiener process can be done without difficulty.
Regarding the coefficients of system (
2), we make the following assumption:
are matrix valued functions defined on a neighborhood of the origin .
We set
With these notations (
1) and (
2) may be written in a compact form as:
and
One sees that for each
fixed, the optimal control asking for the minimization of the quadratic cost (
5) in the class of controls that stabilizes system (
6) is a standard stochastic linear quadratic optimal control problem, which was studied starting with [
20].
In [
21,
22] it was shown that a stochastic linear quadratic control problem, with control dependent terms in the diffusion part of the controlled system, is still well possed even if the cost weight matrices of the states and the control are allowed to be indefinite. The optimal control is given by:
where
is the unique stabilizing solution of the algebraic Riccati equation of stochastic control (SARE):
satisfying the sign condition
The condition (
9) supplies the absence of the information regarding the sign of the matrix
R. In [
22], necessary and sufficient conditions that guarantee the existence of the stabilizing solution of a SARE were provided as (
8) satisfying the sign condition (
9) and a procedure for numerical computation of this solution using the so called semidefinite programming (SDP) was proposed. Also, an iterative procedure for numerical computation of the constrained SARE of type (
8) and (
9) was proposed in Section 5.8 from [
23]. Unfortunately, the way in which the small parameters
,
affect the coefficients of SARE (
8) and (
9) may produce ill-conditioning of the numerical computation involved in obtaining the stabilizing solution
of the SARE under consideration. In order to avoid the ill-conditioning of the numerical computations generated by the high difference between the order of magnitude of the coefficients, knowledge of the asymptotic structure of the solution
in a neighborhood of the origin
is useful. As a consequence of such a study, a system of Riccati type equations not depending upon the small parameters
,
, often named a system of reduced algebraic Riccati equations, which allows us to compute the dominant part of the solution
can be displayed.
In the deterministic case, see for example [
1,
2,
3,
4,
24], the system of reduced algebraic Riccati equations is obtained by simply removing all of the small parameters. In the stochastic framework, when the controlled systems are modeled by singularly perturbed Itô differential equations, the definition of the system of reduced algebraic Riccati equations cannot be done by a simple neglection of the small parameters. From [
11] or [
12,
25], one sees that the definition of the system of reduced algebraic Riccati equations is strongly dependent upon the magnitude of the white noise perturbations affecting the equations of the fast variables in the controlled system.
In order to obtain the asymptotic structure with respect to the small parameters
of the stabilizing solution of SARE (
8), we shall use the implicit functions theorem. To this end, we need a rigourous definition of the corresponding system of reduced algebraic Riccati equations (SRARE) and to point out a special kind of solution of this system which helps us to apply the implicit functions theorem. That is why in the next section we shall show how the system of reduced algebraic Riccati equations in the case of SARE (
8) and (
9) can be defined. Next, we shall introduce a concept of stabilizing solution of the obtained SRARE and we shall provide a set of conditions which guarantee the existence of this stabilizing solution of SRARE. In
Section 4, using reasoning based on the implicit functions theorem, we shall obtain the asymptotic structure of the stabilizing solution of SARE (
8) satisfying the sign condition (
9), as well as the asymptotic structure of the corresponding stabilizing feedback gain.
5. Conclusions
The goal of the work has been the derivation of the asymptotic structure of the stabilizing solution of an algebraic Riccati equation arising in connection with a stochastic linear quadratic optimal control problem for a controlled system described by singularly perturbed Itô differential equations with two fast time scales.
The main conclusion of our study is that, in the stochastic case when the controlled system contains state multiplicative and/or control multiplicative white noise perturbations, the reduced system of algebraic Riccati equations cannot be directly obtained by neglecting the small parameters associated with the fast time scales of the controlled system as in the deterministic framework.
In
Section 3 we have shown in detail how the system of reduced algebraic Riccati equations can be defined in the considered stochastic framework. In the second part of
Section 3, we have introduced the concept of a stabilizing solution of SRARE, and we have provided a set of conditions equivalent to the existence of this kind of solution of SRARE which satisfy a prescribed sign condition of type (
44). Employing the stabilizing solution of SRARE, as well as the corresponding stabilizing feedback gains, we have obtained the asymptotic structure of the stabilizing solution of SARE and of the corresponding stabilizing feedback gain. The dominant part of the stabilizing feedback gain was used to construct a near optimal control whose gain matrices do not depend upon the small parameters associated with the fast time scales. The extension of the study to the case of singularly perturbed linear stochastic systems with
N fast time scales, also including more complex systems such as jump Markov perturbations [
27], Levy noise perturbations [
28] and semi-Markov switched systems [
29] remains a challenge for future research.