1. Introduction
Backward stochastic differential equations (BSDEs) play a pivotal role in stochastic control theory and have found extensive applications in finance (see [1,2,3,4]). The linear quadratic (LQ) optimal control problem for BSDEs with deterministic coefficients was first investigated by Lim and Zhou [5]. They derived a complete solution to such backward stochastic linear quadratic (BSLQ) problems by employing forward equations, limit procedures, and completing-the-square techniques. Following this foundational work, numerous extensions have been developed, including BSLQ problems under partial information [6], BSLQ problems with asymmetric information [7], mean-field backward LQ optimal control problems [8], BSLQ problems with stochastic coefficients [9], BSLQ problems with random jumps [10], the turnpike property of BSLQ problems [11], and BSLQ problems with indefinite weighting matrices in the cost functional [12], among others.
In the aforementioned literature, no constraints were imposed on the BSLQ optimal control. In practical applications, however, constraints on states or controls are often unavoidable; examples include the portfolio selection problem with a variance criterion under the restriction of no short selling of stocks [13] and the welfare pension problem [14]. Therefore, studying BSLQ optimal control with state or control constraints is of significant importance. Examples of such constraints include mixed control-state integral quadratic inequality constraints [15,16], conic control constraints over an infinite time horizon [17,18], state switching, stochastic coefficients, and conic control constraints over finite [19] and infinite time domains [20], mixed pointwise state-control linear inequality constraints [21,22], and terminal-state expected inequality constraints [23], to name just a few.
Based on the ideas from the aforementioned literature, and as a companion to [24], which addresses a forward stochastic LQ optimal control problem with a terminal-state expected equality constraint, this paper investigates a backward stochastic LQ optimal control problem with an initial-state expected equality constraint (CBSLQ for short):
Let $T>0$, and consider the underlying filtered probability space $(\Omega,\mathcal{F},\mathbb{F},P)$, on which a one-dimensional standard Wiener process $W$ is defined. Here, $\mathbb{F}=\{\mathcal{F}_t\}_{t\in[0,T]}$ is the natural filtration generated by $W$ (augmented by all the $P$-null sets), and the expectation operator is denoted by $\mathbb{E}$. The state of the controlled backward linear stochastic differential equation depends on a terminal condition and a control $u$, and its performance is measured by a quadratic cost functional. The terminal datum, the control $u$, the coefficient functions, and the operator appearing in the constraint are subject to appropriate assumptions and will be defined later.
Clearly, the completion-of-squares method [24] is not applicable to our CBSLQ problem. However, the maximum principle [8] combined with Lagrange duality theory [24] provides an effective approach. Our main contributions are as follows: (1) the CBSLQ problem is proposed for the first time; (2) under the uniform convexity of the cost functional, an equivalent unconstrained parameterized BSLQ problem is solved explicitly, whose optimal control is given in feedback form and determined by an adjoint SDE, a Riccati-type ODE, a BSDE, and an algebraic equality; (3) the surjectivity of the linear constraint mapping is characterized, with an illustrative example, and the equivalence between the CBSLQ problem and the dual of the equivalent BSLQ problem is established, thereby yielding the optimal control of the CBSLQ problem.
The remainder of this paper is organized as follows. Section 2 introduces basic notation and assumptions. Section 3 establishes the equivalence between the CBSLQ problem and its dual problem under a surjectivity condition. In Section 4, an explicit expression for the optimal control of the unconstrained problem is derived by using the maximum principle. Section 5 provides several equivalent characterizations of the surjectivity condition. A financial application of the CBSLQ problem to portfolio management is presented in Section 6. Concluding remarks are given in Section 7.
2. Preliminaries
Let $\mathbb{R}^{n\times m}$ be the Euclidean space of all $n\times m$ real matrices (written simply as $\mathbb{R}^{n}$ when $m=1$), on which, for any two elements $M$ and $N$, the inner product $\langle M,N\rangle$ is the trace of $MN^{\top}$, with $\top$ denoting matrix transposition; it induces the norm $|M|=\sqrt{\langle M,M\rangle}$. Denote by $\mathbb{S}^{n}$ the space of all symmetric $n\times n$ real matrices. If $M\in\mathbb{S}^{n}$ is positive definite (positive semi-definite), we write $M>0$ ($M\geq 0$). The same symbols $\langle\cdot,\cdot\rangle$ and $|\cdot|$ will be used for the inner product and the induced norm in possibly different Hilbert spaces when no confusion can arise. Given a measure space $(X,\mathcal{M},\mu)$ and a measurable function $f$, its essential supremum is defined as $\operatorname{ess\,sup} f:=\inf\{a\in\mathbb{R}:\mu(\{f>a\})=0\}$. We now introduce some spaces of random vectors and random processes.
To ensure that the CBSLQ problem is solvable, we introduce the following assumptions.
Assumption 1. The coefficients of the state equation are all deterministic and satisfy the following:
Assumption 2. The weighting coefficients in the cost functional are all deterministic and satisfy the following:
Assumption 3. The admissible control set is such that the constraint mapping is surjective, that is,
We say that a functional is strongly convex [25] if there exists a constant such that the strong convexity inequality holds.
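For definiteness, the standard strong convexity inequality from [25] can be written, in generic notation (the functional $f$, the Hilbert space $\mathcal{H}$, and the constant $\mu$ are placeholder symbols of ours, not the paper's notation), as follows.

% Standard strong convexity (cf. [25]); f, \mathcal{H}, and \mu are generic placeholders.
\[
  f\bigl(\theta x + (1-\theta) y\bigr)
  \;\le\;
  \theta f(x) + (1-\theta) f(y)
  \;-\; \tfrac{\mu}{2}\,\theta(1-\theta)\,\|x-y\|^{2}
  \qquad \text{for all } x,y\in\mathcal{H},\ \theta\in[0,1].
\]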
Lemma 1. Suppose that Assumptions 1 and 2 hold. Then, the cost functional J is strongly convex and continuous on the admissible control set. In addition, if Assumption 3 holds, CBSLQ is uniquely solvable on this set.
Proof of Lemma 1. By Assumptions 1 and 2, for each admissible control, the state equation (2) admits a unique solution, and the cost functional J is well defined and continuous. It remains to prove its strong convexity. Fix two admissible controls together with a convex combination of them. Then, by Assumption 1, the solutions to system (2) corresponding to these controls exist uniquely and, by the linearity of (2), combine in the same convex manner. A direct computation using Assumption 2 then yields the strong convexity inequality, which means that J is strongly convex. □
By Assumption 3, the feasible control set is nonempty. It is also a closed convex subset of the admissible control space, since the control system (2) is linear and the initial state constraint is a linear equality constraint. Then, by the standard existence theory of convex optimization (see, for instance, Theorem 2.31 in [26]), CBSLQ is uniquely solvable.
3. Lagrangian Duality
Inspired by the ideas in [24], the Lagrangian dual method can be applied to solve the CBSLQ problem.
Lemma 2. For each multiplier λ, we define the Lagrangian functional M as below, where the state satisfies (2) and J is as defined in (3). If Assumptions 1 and 2 hold, then, for each given λ, the following unconstrained backward stochastic linear quadratic problem with parameter λ (BSLQu for short) is uniquely solvable:
Proof of Lemma 2. Note first that the mapping is well defined by Lemma 1.
Moreover, given λ, for any two admissible controls and any convex combination of them, a direct computation together with Assumption 2 shows that M is strongly convex with respect to the control u. Again, by Theorem 2.31 in [26], the problem BSLQu is uniquely solvable. □
Clearly, under Assumptions 1 and 2 and by (4), the original CBSLQ problem is equivalent to a min–max formulation in terms of M. We now define the dual problem of CBSLQ as (6). In what follows, we prove the strong duality between CBSLQ and its dual problem (6).
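For orientation, the following schematic block records the generic shape of the Lagrangian duality relations used in this section; the symbols $d$, $\Lambda$, and $\mathcal{U}$ are placeholders of ours and do not reproduce the paper's exact definitions in (4)–(6).

% Schematic Lagrangian duality; d, \Lambda, \mathcal{U} are generic placeholders.
\[
  d(\lambda) := \inf_{u \in \mathcal{U}} M(u,\lambda),
  \qquad
  \text{dual problem:}\quad \sup_{\lambda \in \Lambda} d(\lambda),
\]
\[
  \text{weak duality:}\qquad
  \sup_{\lambda \in \Lambda} d(\lambda)
  \;\le\;
  \inf\{\, J(u) : u \in \mathcal{U},\ u \text{ satisfies the constraint} \,\}.
\]
% Theorem 1(i) asserts that, under Assumptions 1-3, this inequality holds with equality.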
Theorem 1. Suppose that Assumptions 1–3 hold, and let the unique optimal solution to CBSLQ be given. Then, the following two assertions hold true.
(i) Strong duality holds between CBSLQ and the dual problem (6), i.e.,
(ii) If the solution to the dual problem (6) is given, then, together with the optimal control of CBSLQ, it forms a saddle point of M, i.e.,
Proof of Theorem 1. Clearly, the first of the two sets involved in the separation argument is convex. In the following, we prove that the second set is also convex. Take two points of this set; then, there exist admissible controls realizing them. By Lemma 1, J is convex. Then, for any convex combination of these two controls, from (2) there is, obviously, a unique corresponding solution, and the states combine according to the linearity of (2). Thus, the associated convex combination of the two points again belongs to the set. Therefore, the set is also convex.
By the optimality of the optimal control, the hypotheses of the separation theorem are satisfied; hence, there exists a nonzero separating vector such that the separation inequality holds, and from it we obtain three consequences. We claim that the first component of this vector is nonzero. Indeed, if it were zero, then, by Assumption 3, the remaining component would also vanish, contradicting the nontriviality of the separating vector. Therefore, the first component is nonzero.
Normalizing by this nonzero component, we find that the Lagrangian value of every admissible control is bounded below by the optimal value of CBSLQ; hence, the dual value is at least the optimal value. Since weak duality gives the reverse inequality, the two values coincide. This proves (i).
We now prove assertion (ii). Let the unique optimal solution to the dual problem (6) (which exists by (i)) be given; then, the corresponding duality identity holds. By the optimality of the primal and dual solutions, the two defining inequalities of a saddle point are satisfied. Thus, the pair is a saddle point of M, and the asserted characterization follows. □
4. Maximum Principle
The efficient method for solving the backward stochastic linear quadratic problem in [8] can be used to solve our BSLQu.
Lemma 3. Suppose that Assumptions 1 and 2 hold. If the optimal triple of BSLQu with parameter λ is given, then the stationarity condition (7) holds, where the adjoint process is the solution to the following SDE:
Proof of Lemma 3. Let the optimal control of BSLQu (4) be given; then, for any admissible perturbation direction v and any perturbation size, the perturbed cost is no smaller than the optimal one, where the perturbed state is the corresponding solution to (2). Then, we obtain
Moreover, with the help of Equation (8) and by applying Itô's formula, the following result is obtained:
Therefore,
Next, set the auxiliary function as above. Then, it is continuously differentiable with respect to the perturbation parameter, and we obtain the corresponding first-order condition. We conclude that (7) holds, since v is arbitrary.
From the above result, we see that if u happens to be an optimal control of BSLQu, then the following FBSDE admits an adapted solution:
and the following stationarity condition holds:
□
We call (8), together with the stationarity condition (11), the optimality system for the optimal control of CBSLQ. Note that the four-step scheme introduced in [27,28] for general FBSDEs provides an efficient technique for solving this special FBSDE (10).
In fact, due to the linearity of BSLQu, there is also a linear relation between the components of the solution to (10), as shown in the following lemma.
Lemma 4. In the FBSDE (10), the following linear relation holds for all times, where the matrix-valued function P satisfies the Riccati-type ODE below and the accompanying process pair satisfies the BSDE below.
Proof of Lemma 4. Assume a relation of this form, where the matrix-valued function is absolutely continuous and a process pair satisfies the following BSDE, for some adapted process.
Applying Itô's formula to Equation (14), together with (15) and (10), we have
Then, we can suppose that the two identifications below hold. Now, from (17), we obtain an expression in which I denotes the identity matrix. Substituting it into (16) yields a relation from which the coefficients can be chosen as stated. Comparing the terminal values on both sides of Equations (10) and (15) determines the terminal conditions. We thus arrive at two equations, the ODE (13) and the BSDE (15). According to Assumptions 1 and 2 and Proposition 4.1 in [5], each of these two equations has a unique solution. Thus, Equation (12) follows. □
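In practice, a Riccati-type ODE such as (13) is typically integrated numerically backward in time when no closed-form solution is available. The sketch below is a minimal illustration of this step only: the symmetric right-hand side written in the comments and the data A, B, Q, R, G are assumptions of ours and are not the actual coefficients of Equation (13).

import numpy as np
from scipy.integrate import solve_ivp

# Illustrative only: a generic symmetric Riccati-type ODE
#   P'(t) = -(A^T P + P A - P B R^{-1} B^T P + Q),  P(T) = G,
# integrated backward in time. A, B, Q, R, G are made-up data and are
# NOT the coefficients of Equation (13) in this paper.
n, m, T = 2, 1, 1.0
A = np.array([[0.0, 1.0], [0.0, 0.0]])
B = np.array([[0.0], [1.0]])
Q = np.eye(n)
R = np.eye(m)
G = np.eye(n)

def riccati_rhs(t, p_flat):
    P = p_flat.reshape(n, n)
    dP = -(A.T @ P + P @ A - P @ B @ np.linalg.solve(R, B.T) @ P + Q)
    return dP.ravel()

# solve_ivp accepts a decreasing time span, so we integrate from t = T down to t = 0.
sol = solve_ivp(riccati_rhs, (T, 0.0), G.ravel(), rtol=1e-8, atol=1e-10)
P0 = sol.y[:, -1].reshape(n, n)
print("P(0) =\n", P0)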
Moreover, we can obtain the optimal cost of BSLQu, as in the following lemma.
Lemma 5. Suppose that Assumptions 1 and 2 hold. Then, the cost for the optimal triple is given by the expression below, where the quantities involved are defined in Lemma 4.
Proof of Lemma 5. Note that, according to formula (4),
On the one hand, according to the three equalities (11), (12), and (18), we obtain a first identity. On the other hand, applying Itô's formula yields a second identity. Combining (19) with (20) then gives (21). Since it follows from (11) and (12) that the optimal control admits the stated representation, substituting this expression into (21) gives the result immediately. □
According to the above lemma, we can construct an optimization problem, namely (22), which can be solved easily.
Lemma 6. Suppose Assumptions 1 and 2 hold. Then, the optimization problem (22) has a unique optimal multiplier, which satisfies the following linear algebraic equations.
Proof of Lemma 6. First, we prove that the coefficient matrix of these equations is positive definite. According to Assumption 3, the matrix N has full rank, and the associated weighting matrix is positive definite; therefore, the coefficient matrix is also positive definite. Obviously, the dual function d defined by the Lagrange multiplier method is concave and differentiable, so its optimal solution satisfies the first-order optimality conditions. Since the coefficient matrix is positive definite, it is invertible, and hence there exists a unique multiplier satisfying the above equations. □
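Computationally, Lemma 6 reduces the determination of the optimal multiplier to a single finite-dimensional linear solve. The following minimal sketch only illustrates that step: the matrix K and the vector c are made-up placeholders standing in for whatever the first-order condition of (22) produces, not the paper's actual data.

import numpy as np

# Illustrative only: maximizing a concave quadratic dual
#   d(lambda) = -0.5 * lambda^T K lambda + c^T lambda + const,
# whose unique maximizer satisfies K lambda = c.
# K and c below are made-up placeholders.
rng = np.random.default_rng(0)
n = 3
M_ = rng.standard_normal((n, n))
K = M_ @ M_.T + n * np.eye(n)   # symmetric positive definite, hence invertible
c = rng.standard_normal(n)

lam_star = np.linalg.solve(K, c)   # unique stationary point
grad = c - K @ lam_star            # gradient of d at lam_star (should vanish)
print("lambda* =", lam_star)
print("||grad d(lambda*)|| =", np.linalg.norm(grad))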
Now, we can state our main theorem about our original problem, the CBSLQ, whose proof is based on Lagrangian duality theory and is similar to that of Theorem 3.2 in [24]. We provide the details in the next section for the readers' convenience.
Theorem 2. Suppose Assumptions 1–3 hold. Then, the optimal control of CBSLQ is given by the feedback formula below, in which the two ingredients are determined, respectively, by the ODE and the BSDE stated there.
Proof of Theorem 2. It follows from Lemmas 1, 2, 3, and 6, together with Theorem 1 in Section 3. □
5. The Characterization of Condition (Assumption 3)
In this section, we investigate equivalent characterizations of Assumption 3. The basic idea comes from the fundamental controllability arguments for deterministic controlled linear systems (see, for instance, [29]). In order to characterize Assumption 3, let us consider a special CBSLQ problem with Assumption 1 holding true:
Define the Lagrangian functional for (23) as follows, for each admissible control and each multiplier.
Then, by Lemmas 2 and 3, for a given multiplier, the associated unconstrained problem admits a unique optimal control, which satisfies the stationarity condition (25), with the adjoint process being the solution to the adjoint equation (26).
By applying
’ s formula together with the Equations (26) and (25), we have
We have the following result.
Lemma 7. If the optimal solution to the problem below is given, then the control defined by (28) is the optimal solution to (23), where the adjoint process is the solution to (26) with the corresponding initial datum.
Proof of Lemma 7. Since this multiplier is the optimal solution, for any multiplier,
Let the state be the solution to the controlled system (2) with the given terminal datum and the control defined by (28). Then, by (28), (29), and Itô's formula, we obtain (30). Due to the arbitrariness of the multiplier, the initial state constraint is satisfied. This proves that the control defined by (28) is a feasible control.
Next, we prove the optimality of this control. Replacing the generic multiplier by the optimal one in (30), we obtain (31). Note that, for any feasible control with a corresponding state satisfying the constraint, Itô's formula yields (32). Combining (31) with (32), we obtain the desired comparison of costs. This proves the optimality of the control defined by (28). □
The following theorem gives a necessary and sufficient condition for the constraint mapping to be a surjection.
Theorem 3. Suppose that Assumptions 1 and 2 hold. Then, the constraint mapping is a surjection if and only if there is a positive constant such that the inequality (33) holds, where the adjoint process is the solution to (26) with the corresponding initial datum.
Proof of Theorem 3. We first prove the sufficiency.
For an arbitrary target value, consider the dual function defined in (27) with b replaced by this target. If there is a constant such that (33) holds, then this dual function is coercive. Since its convexity and continuity are obvious, the dual problem (27), with b replaced by the target, has a solution. Then, by Lemma 7, the control defined by (28), with the multiplier replaced by this solution, is a minimal-norm control driving the expected initial state to the target. By the arbitrariness of the target, the mapping is surjective.
Next, we prove the necessity.
Suppose, by contradiction, that the mapping is surjective but the inequality (33) does not hold. Then, there is a sequence of unit vectors along which the quantity appearing in (33) tends to zero. Without loss of generality, assume that this sequence converges to some unit vector; then, the corresponding limit quantity vanishes. Since the mapping is surjective, for any target, there is a control u attaining it.
Choosing such a control and passing to the limit, we find, by the arbitrariness of the target, that the limit vector must vanish, contradicting the fact that it has unit norm. This completes the proof. □
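The content of Theorem 3 parallels a standard functional-analytic criterion, recorded schematically below for orientation; the operator L, the constant δ, and the dimension k are generic symbols of ours, and the display is not a restatement of (33).

% Generic surjectivity criterion for a bounded linear operator L : U -> R^k
% on a Hilbert space U; L, delta, and k are placeholder symbols.
\[
  L \ \text{is surjective}
  \iff
  \exists\, \delta>0:\quad
  \|L^{*}\lambda\|_{U} \ \ge\ \delta\,|\lambda|
  \quad \text{for all } \lambda\in\mathbb{R}^{k},
\]
% equivalently, \inf_{|\lambda|=1} \|L^{*}\lambda\|_{U} > 0, which is the
% coercivity-type property exploited in the proof above via the compactness
% of the unit sphere.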
6. An Illustrative Example
Here is a simple practical application that supports our theory and demonstrates the practical value of the theoretical results in Sections 3, 4, and 5. We consider a backward portfolio management problem with a constraint on the expectation of the initial state, governed by the following backward stochastic linear equation controlled by u:
Here, the control u is the investment strategy adjustment variable at each time (such as the adjustment of the capital allocation proportion), which is adapted to the filtration generated by the standard Brownian motion W and constrained through the expected value b of the initial investment portfolio; one state process represents the value of the investment portfolio, and the other is the market fluctuation factor (such as the fluctuation of the market index). Among the other parameters, one represents the expected growth rate of the investment portfolio itself, and the other represents the impact coefficient of the investment strategy adjustment on the value of the investment portfolio.
Then, according to the discussions in the above sections, there is a corresponding Riccati ODE for P, together with ODEs for the two auxiliary quantities. Clearly, we have explicit solutions for these, respectively:
Then, we can solve the dual problem, whose unique optimal solution is obtained in closed form. Therefore, the optimal control has the following explicit representation:
Table 1 shows optimization results under different parameter combinations.
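To complement Table 1, the following minimal sketch shows how a scalar Riccati-type ODE of this kind can be integrated backward for several horizons T. Only the growth rate 0.03 and the control coefficient 0.2 are taken from the example; the specific right-hand side, the weights q, r, g, and the reported quantities are illustrative assumptions of ours and do not reproduce the explicit formulas above.

import numpy as np
from scipy.integrate import solve_ivp

# Illustrative only: a scalar Riccati-type ODE in a standard LQ form,
#   P'(t) = -(2*a*P(t) - (b**2 / r) * P(t)**2 + q),   P(T) = g,
# integrated backward in time.  a = 0.03 and b = 0.2 are the example's growth
# rate and control coefficient; q, r, g are made-up weights, and this
# right-hand side is NOT claimed to be the Riccati ODE of this section.
a, b = 0.03, 0.2
q, r, g = 1.0, 1.0, 1.0

def rhs(t, P):
    return -(2.0 * a * P - (b ** 2 / r) * P ** 2 + q)

for T in (1.0, 5.0, 10.0, 20.0):
    sol = solve_ivp(rhs, (T, 0.0), [g], rtol=1e-9, atol=1e-12)
    P0 = sol.y[0, -1]
    print(f"T = {T:5.1f}:  P(0) = {P0:.4f},  feedback gain (b/r)*P(0) = {b * P0 / r:.4f}")

The printed values merely illustrate the qualitative dependence on the horizon discussed in item (i) below; they are not the entries of Table 1.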
Economic Explanation:
(i) Time Effect: As the investment horizon T increases, the optimal control intensity decays at a faster rate, owing to the exponential decay term, which aligns with the practical understanding that long-term investments should reduce trading frequency.
(ii) Risk Budgeting: The exponential decay of the optimal control indicates that the system allocates a decreasing risk budget over time, providing a quantitative tool for dynamic risk control.
(iii) Constraint Effectiveness: By setting the initial expectation b, investors can precisely control the statistical properties of the portfolio value (see Table 1).
(iv) Computational Superiority: The explicit solution avoids iterative numerical optimization, providing a real-time strategy generation method for high-frequency trading scenarios.
(v) Scalability: The model parameters (0.03 growth rate, 0.2 control coefficient) can be extended to time-varying functions, adapting to complex market environments.
The expected value equality constraint indicates that the expected value of the initial investment portfolio is b. It is an initial reference point set by the investor and is used to measure whether the starting point of the investment portfolio meets expectations. The variance of the investment portfolio value reflects the extent to which it deviates from the expected path due to factors such as market fluctuations and uncertainties in investment strategies: the larger the variance, the higher the uncertainty of the portfolio value and the greater the investment risk. Generally, investors hope to minimize this variance at a given expected return level. This leads to solving the following stochastic LQ problem with an initial state constraint:
We show in Figure 1 the efficient frontier of the above portfolio management problem for the chosen parameter values. The efficient frontier illustrates the trade-off between the initial expected return b and the terminal risk (standard deviation), exhibiting a characteristic convex decreasing curve (consistent with the high-risk, high-return principle), which validates the core theoretical proposition of the model: precise risk–return management can be achieved through backward stochastic control. The purple point on the graph represents the maximum return point, while the shaded area denotes the infeasible region, where no strategy can simultaneously deliver higher returns and lower risk.
This example visually confirms three key values of the model through the efficient frontier: (1) the initial constraint b enables precise control of the investment starting point; (2) dynamic strategies can generate Pareto-optimal paths; (3) parameters (e.g., the 0.2 adjustment coefficient) possess clear economic significance. In practical applications, this framework provides quantitative tools for structured product design and dynamic pension fund allocation.