Article

Decentralized Control for Interrelated Systems with Asymmetric Information Architecture

1 School of Automation and Electrical Engineering, Linyi University, Linyi 276000, China
2 College of Shandong Traffic Technician, Linyi 276000, China
* Author to whom correspondence should be addressed.
Electronics 2026, 15(1), 96; https://doi.org/10.3390/electronics15010096
Submission received: 21 November 2025 / Revised: 18 December 2025 / Accepted: 22 December 2025 / Published: 24 December 2025

Abstract

This paper focuses on the finite-horizon optimal state feedback control problem for two-player interconnected systems with asymmetric one-step delayed information. In this information structure, player 1 transmits its state and control input data to the controller of player 2 with a one-step delay, while the controller of player 1 has no access to the real-time or delayed states and control inputs of player 2, resulting in an asymmetric information structure characterized by a one-step delay. A necessary and sufficient condition for the solvability of the finite-horizon optimal decentralized control problem is derived by using Pontryagin's maximum principle, and the solutions to the associated forward and backward stochastic difference equations are obtained. A target tracking system is presented in the numerical examples to verify the proposed algorithm.

1. Introduction

The traditional stochastic optimal control problem was first formulated by [1] in the 1960s, with subsequent in-depth investigations conducted in [2,3,4,5,6,7,8,9,10,11,12,13,14]. Specifically, Ref. [7] proposed a generalized Riccati equation to establish the solvability of the stochastic optimal control problem. For delay-free stochastic systems, Ref. [11] addressed the infinite-horizon optimal control problem where the state and control weighting matrices are indefinite. Regarding stochastic linear systems with time delays, Ref. [14] solved the optimal control and stabilization problem by proposing a novel Riccati-ZXL difference equation. In summary, the traditional stochastic optimal control problem has been well resolved.
With the growing demand from practical applications, such as networked control systems, formation flight, and target tracking, decentralized control has attracted considerable attention [15,16,17,18,19,20]. A key distinction between traditional stochastic optimal control and decentralized control lies in the feedback information (i.e., state or observation information) available to different controllers: while the feedback information is uniform across all controllers in traditional stochastic optimal control, it varies across controllers in decentralized control, leading to asymmetric information. Generally, for traditional stochastic optimal control, linear controllers are globally optimal, with control gains computable via Riccati equations and the feedback expressible as a linear function of the conditional expectation of the state [10]. In contrast, for decentralized control, nonlinear strategies may achieve better performance than linear ones [17], and the optimal decentralized control problem may not be convex [18]. Consequently, the methodologies developed for traditional stochastic optimal control cannot be directly applied to decentralized control scenarios.
Although the optimal decentralized control problem is generally intractable, many studies have been devoted to it; see [18,21,22,23,24,25,26,27] and the references therein. Ref. [22] showed that the optimal control strategies are linear by proposing the partially nested information structure. Ref. [23] introduced a general decentralized model named the partial history sharing structure. Under this structure, assuming that the controllers take certain linear forms, Ref. [24] presented the optimal controllers by virtue of the common information. Refs. [25,26,27,28] extended this work to networked control systems with special structures. By using dynamic programming and common information, Refs. [26,28] solved the optimal control and stabilization problem for systems with a special unreliable communication channel. Under the same special structure, Ref. [27] presented the optimal controllers and stabilization condition with Pontryagin's maximum principle and algebraic Riccati equations. Recently, Ref. [29] extended the above special structure to the general case of a standard observation equation and obtained linear optimal controllers.
However, the decentralized control problem with time delay is not considered in the above references. Meanwhile, in the context of reliable control under component failures, Ref. [30] proposed a guaranteed-cost LQ control scheme based on algebraic Riccati equations for discrete-time systems with actuator failures, ensuring asymptotic stability and a bounded quadratic cost. Ref. [31] developed a reliable LQG control method that addresses sensor failures via coupled Riccati equations and LMIs, guaranteeing robust stability for systems with partial degradation or outages. These works inspire our research to resolve the information asymmetry induced by asymmetric delays while enhancing tolerance to model uncertainties. For an identical combination of delay and sparsity constraints, Ref. [32] presented solutions to the decentralized state feedback control case. Ref. [33] studied applications to heavy-duty vehicle platooning and solved the optimal control problem for chain structures with delayed information. For the finite-horizon case, Ref. [34] presented an optimal state feedback controller for large-scale delayed systems. By requiring the plant dynamics to be lower block triangular, Ref. [35] put forward a two-player decentralized system model with an asymmetric one-step delayed information pattern and, employing the dynamic programming approach, obtained the optimal control strategies for the finite horizon. Ref. [5] studied asymmetric information control involving differently delayed state information, and optimal controllers were derived for the finite-horizon case. Ref. [36] addressed control problem formulations for discrete-time stochastic systems with multiple input channels and input time delays; however, only a single state is involved in [36]. To the best of our knowledge, for general plant dynamics, the optimal control of the asymmetric delayed decentralized problem has not been solved yet.
In this paper, we investigate the finite-horizon optimal state feedback control problem for two-player decentralized interconnected systems with asymmetric one-step delay information—a critical gap in existing research that has not been fully addressed for general coupled dynamics. Unlike previous studies that impose special structural constraints on plant dynamics or assume decoupled control inputs, our work completely removes structural restrictions on system dynamics, enabling the handling of fully coupled state and control input relationships between the two players, which is more aligned with practical interconnected system scenarios such as multi-agent coordination. The main contributions are summarized as follows:
(1) Applying Pontryagin’s maximum principle, we propose an innovative framework specifically designed to derive coupled costate–state backward stochastic difference equations and inter-player equilibrium equations. This framework can explicitly capture the impact of asymmetric delay on information asymmetry, overcoming the limitation of traditional methods that struggle to quantify the correlation between delay and information coupling, and laying a solid theoretical foundation for the subsequent derivation of optimal control strategies.
(2) Building on these equations, we further derive the analytical solution to the forward and backward stochastic difference equations (FBSDEs) and rigorously establish finite-horizon optimal decentralized control strategies through two sets of Riccati equations. We clarify the mild condition for the unique solvability of the strategy, namely the invertibility of relevant matrices, which ensures the theoretical rigor of the method and its feasibility in engineering applications, thereby addressing the issue regarding whether optimal control solutions exist uniquely for coupled systems with asymmetric delays.
(3) A practical unmanned aerial vehicle (UAV) target tracking system is adopted to serve as a case study for confirming the suggested method, fully demonstrating its effectiveness in real-world applications. Experimental results show that the method exhibits stable performance in terms of tracking accuracy and energy efficiency, capable of meeting the dual requirements of practical engineering for control effects and resource consumption, and providing an implementable solution for the optimal control of interrelated systems with asymmetric delays.
The remainder of the paper is organized as follows. Section 2 studies the finite-horizon optimal decentralized control problem. Section 3 presents numerical examples on the target tracking system. Detailed proofs are given in Appendix A.

2. Optimal Control

2.1. Problem Formulation

We study the linear system with a coupled two-player structure:
$$\begin{bmatrix}\psi_1(\varrho+1)\\\psi_2(\varrho+1)\end{bmatrix}=\begin{bmatrix}D_{11}&D_{12}\\D_{21}&D_{22}\end{bmatrix}\begin{bmatrix}\psi_1(\varrho)\\\psi_2(\varrho)\end{bmatrix}+\begin{bmatrix}G_{11}&G_{12}\\G_{21}&G_{22}\end{bmatrix}\begin{bmatrix}u_1(\varrho)\\u_2(\varrho)\end{bmatrix}+\begin{bmatrix}\lambda_1(\varrho)\\\lambda_2(\varrho)\end{bmatrix}, \tag{1}$$
where ψ_1(ϱ), u_1(ϱ) and λ_1(ϱ) are the state, control input and system noise of player 1, respectively, and ψ_2(ϱ), u_2(ϱ) and λ_2(ϱ) are those of player 2. ψ_1(0), ψ_2(0), λ_1(ϱ) and λ_2(ϱ) are mutually independent Gaussian white noises with means (ψ̄_1(0), ψ̄_2(0), 0, 0) and covariances (σ_1, σ_2, Q_{λ1}, Q_{λ2}), respectively. The objective of both players is to minimize the following quadratic cost function associated with system (1):
$$\Omega(\varkappa)=\mathbb{E}\Big\{\sum_{\varrho=0}^{\varkappa}\sum_{i=1}^{2}\big[\psi_i^{\top}(\varrho)Q_{it}\psi_i(\varrho)+u_i^{\top}(\varrho)R_{it}u_i(\varrho)\big]+\sum_{i=1}^{2}\psi_i^{\top}(\varkappa+1)H_i(\varkappa+1)\psi_i(\varkappa+1)\Big\}, \tag{2}$$
where Q i t , R i t and H i ( ϰ + 1 ) are positive semi-definite matrices for i = 1 , 2 .
At each time instant ϱ, the control actions of the two players obey the system's information structure; that is, the available information for u_1(ϱ) is {ψ_1(ϱ), ψ_1(ϱ−1), …, ψ_1(0), u_1(ϱ−1), …, u_1(0)}, so u_1(ϱ) is F_1(ϱ)-measurable. The accessible information for u_2(ϱ) is {ψ_1(ϱ−1), …, ψ_1(0), u_1(ϱ−1), …, u_1(0), ψ_2(ϱ), ψ_2(ϱ−1), …, ψ_2(0), u_2(ϱ−1), …, u_2(0)}, so u_2(ϱ) is F_2(ϱ)-measurable. Obviously, the common information of the two players is F_c(ϱ) = {ψ_1(ϱ−1), …, ψ_1(0), u_1(ϱ−1), …, u_1(0)}. The whole information of the system is denoted by F(ϱ) = {F_1(ϱ), F_2(ϱ)}.
For clarity, (1) and (2) are rewritten as
$$\psi(\varrho+1)=D\psi(\varrho)+\bar{G}_1u_1(\varrho)+\bar{G}_2u_2(\varrho)+e(\varrho), \tag{3}$$
$$\Omega(\varkappa)=\mathbb{E}\Big\{\sum_{\varrho=0}^{\varkappa}\Big[\psi^{\top}(\varrho)Q_t\psi(\varrho)+\sum_{i=1}^{2}u_i^{\top}(\varrho)R_{it}u_i(\varrho)\Big]+\psi^{\top}(\varkappa+1)H(\varkappa+1)\psi(\varkappa+1)\Big\}, \tag{4}$$
where
$$\psi(\varrho)=\begin{bmatrix}\psi_1(\varrho)\\\psi_2(\varrho)\end{bmatrix},\quad D=\begin{bmatrix}D_{11}&D_{12}\\D_{21}&D_{22}\end{bmatrix},\quad \bar{G}_1=\begin{bmatrix}G_{11}\\G_{21}\end{bmatrix},\quad \bar{G}_2=\begin{bmatrix}G_{12}\\G_{22}\end{bmatrix},\quad e(\varrho)=\begin{bmatrix}\lambda_1(\varrho)\\\lambda_2(\varrho)\end{bmatrix},\quad Q_t=\begin{bmatrix}Q_{1t}&0\\0&Q_{2t}\end{bmatrix}.$$
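As a quick sanity check on this stacked form, the matrices of (3) can be assembled from the blocks and iterated numerically. The scalar block values below are hypothetical placeholders of ours, since the paper leaves D_ij and G_ij general.

```python
import numpy as np

# Hypothetical block values for illustration; the paper keeps D_ij, G_ij general.
D11, D12, D21, D22 = 0.9, 0.1, 0.2, 1.0
G11, G12, G21, G22 = 1.0, 0.0, 0.5, 1.0

# Stacked matrices of the reformulated system (3)
D = np.array([[D11, D12], [D21, D22]])
G1bar = np.array([[G11], [G21]])   # \bar{G}_1
G2bar = np.array([[G12], [G22]])   # \bar{G}_2
Qt = np.diag([1.0, 1.0])           # block-diagonal state weight

# One step of (3): psi(k+1) = D psi(k) + G1bar u1(k) + G2bar u2(k) + e(k)
psi = np.array([[1.0], [0.0]])
u1, u2 = np.array([[0.1]]), np.array([[-0.2]])
e = np.zeros((2, 1))               # noise realization set to zero here
psi_next = D @ psi + G1bar @ u1 + G2bar @ u2 + e
print(psi_next.ravel())            # psi_next == [1.0, 0.05]
```

Stacking once and working with (3) rather than (1) keeps all later recursions in ordinary matrix form.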
The issue to be dealt with in this section is presented as follows.
Problem 1.
Find the F 1 ( ϱ ) -measurable u 1 ( ϱ ) and F 2 ( ϱ ) -measurable u 2 ( ϱ ) to minimize (4) subject to (3).

2.2. Strategy to Solve Problem 1

Following the procedure of [14], Pontryagin's maximum principle is applied to (3) and (4) to derive the equations listed below:
$$\eta(\varrho-1)=\mathbb{E}[D^{\top}\eta(\varrho)\,|\,\mathcal{F}(\varrho)]+Q_t\psi(\varrho), \tag{5}$$
$$0=\mathbb{E}[\bar{G}_1^{\top}\eta(\varrho)\,|\,\mathcal{F}_1(\varrho)]+R_{1t}u_1(\varrho), \tag{6}$$
$$0=\mathbb{E}[\bar{G}_2^{\top}\eta(\varrho)\,|\,\mathcal{F}_2(\varrho)]+R_{2t}u_2(\varrho), \tag{7}$$
$$\eta(\varkappa)=H(\varkappa+1)\psi(\varkappa+1), \tag{8}$$
where η ( ϱ ) represents the costate variable.
Noting the available information for the controllers u_1(ϱ) and u_2(ϱ), it can be found that u_1(ϱ) is not F_2(ϱ)-measurable and u_2(ϱ) is not F_1(ϱ)-measurable. Together with the costate Equations (5)–(7), this leads to an unsolvable mathematical problem, since five unknown variables appear in the four Equations (3) and (5)–(7). To this end, in view of the common information of u_1(ϱ) and u_2(ϱ), we define
$$u_{1c}(\varrho)=\mathbb{E}[u_1(\varrho)\,|\,\mathcal{F}_c(\varrho)],\qquad u_{1p}(\varrho)=u_1(\varrho)-u_{1c}(\varrho), \tag{9}$$
$$u_{2c}(\varrho)=\mathbb{E}[u_2(\varrho)\,|\,\mathcal{F}_c(\varrho)],\qquad u_{2p}(\varrho)=u_2(\varrho)-u_{2c}(\varrho). \tag{10}$$
The following relationship can be obtained:
$$\mathbb{E}[u_{1p}(\varrho)\,|\,\mathcal{F}_c(\varrho)]=0,\qquad \mathbb{E}[u_{2p}(\varrho)\,|\,\mathcal{F}_c(\varrho)]=0. \tag{11}$$
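In the jointly Gaussian setting, the conditional expectation in (9) and (10) is a linear projection, so property (11) can be checked by Monte Carlo. The scalar model below, in which a control u depends linearly on the common information z plus independent noise, is our own illustrative assumption, not a construction from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200_000

# Toy Gaussian model: z plays the role of the common information F_c,
# and the control is u = a*z + b*w with w independent of z.
a, b = 0.7, 0.3
z = rng.standard_normal(n)
w = rng.standard_normal(n)
u = a * z + b * w

u_c = a * z        # E[u | z]: linear in z for jointly Gaussian variables
u_p = u - u_c      # "private" part, analogous to u_1p, u_2p

# Property (11): E[u_p | z] = 0, hence u_p has zero mean and is
# uncorrelated with z (both estimates are ~0 up to Monte Carlo error).
print(np.mean(u_p), np.mean(u_p * z))
```

This orthogonality is precisely what decouples the common and private parts of the cost in (13).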
By virtue of (9) and (10), we rewrite (3) and (4) as
$$\psi(\varrho+1)=D\psi(\varrho)+\bar{G}u_c(\varrho)+\bar{G}_1u_{1p}(\varrho)+\bar{G}_2u_{2p}(\varrho)+e(\varrho), \tag{12}$$
$$\Omega(\varkappa)=\mathbb{E}\Big\{\sum_{\varrho=0}^{\varkappa}\Big[\psi^{\top}(\varrho)Q_t\psi(\varrho)+u_c^{\top}(\varrho)R_tu_c(\varrho)+\sum_{i=1}^{2}u_{ip}^{\top}(\varrho)R_{it}u_{ip}(\varrho)\Big]+\psi^{\top}(\varkappa+1)H(\varkappa+1)\psi(\varkappa+1)\Big\}, \tag{13}$$
where
$$\bar{G}=\begin{bmatrix}\bar{G}_1&\bar{G}_2\end{bmatrix},\qquad u_c(\varrho)=\begin{bmatrix}u_{1c}(\varrho)\\u_{2c}(\varrho)\end{bmatrix},\qquad R_t=\begin{bmatrix}R_{1t}&0\\0&R_{2t}\end{bmatrix}.$$
Based on the preceding analysis, we present the following lemma.
Lemma 1.
The costate Equations (5)–(8) can be expressed as
$$\eta(\varrho-1)=\mathbb{E}[D^{\top}\eta(\varrho)\,|\,\mathcal{F}(\varrho)]+Q_t\psi(\varrho), \tag{14}$$
$$0=\mathbb{E}[\bar{G}^{\top}\eta(\varrho)\,|\,\mathcal{F}_c(\varrho)]+R_tu_c(\varrho), \tag{15}$$
$$0=\mathbb{E}[\bar{G}_1^{\top}\eta(\varrho)\,|\,\mathcal{F}_1(\varrho)]-\mathbb{E}[\bar{G}_1^{\top}\eta(\varrho)\,|\,\mathcal{F}_c(\varrho)]+R_{1t}u_{1p}(\varrho), \tag{16}$$
$$0=\mathbb{E}[\bar{G}_2^{\top}\eta(\varrho)\,|\,\mathcal{F}_2(\varrho)]-\mathbb{E}[\bar{G}_2^{\top}\eta(\varrho)\,|\,\mathcal{F}_c(\varrho)]+R_{2t}u_{2p}(\varrho), \tag{17}$$
$$\eta(\varkappa)=H(\varkappa+1)\psi(\varkappa+1). \tag{18}$$
Proof
By virtue of (9) and (10), taking the conditional expectation of both sides of (6) and (7) with respect to F_c(ϱ), we obtain
$$0=\mathbb{E}[\bar{G}_1^{\top}\eta(\varrho)\,|\,\mathcal{F}_c(\varrho)]+R_{1t}u_{1c}(\varrho), \tag{19}$$
$$0=\mathbb{E}[\bar{G}_2^{\top}\eta(\varrho)\,|\,\mathcal{F}_c(\varrho)]+R_{2t}u_{2c}(\varrho). \tag{20}$$
Noting (19) and (20) and observing (12) and (13), (15) can be readily obtained.
Subtracting (19) from (6) and using (9), we have
$$\begin{aligned}0&=\mathbb{E}[\bar{G}_1^{\top}\eta(\varrho)\,|\,\mathcal{F}_1(\varrho)]-\mathbb{E}[\bar{G}_1^{\top}\eta(\varrho)\,|\,\mathcal{F}_c(\varrho)]+R_{1t}u_1(\varrho)-R_{1t}u_{1c}(\varrho)\\&=\mathbb{E}[\bar{G}_1^{\top}\eta(\varrho)\,|\,\mathcal{F}_1(\varrho)]-\mathbb{E}[\bar{G}_1^{\top}\eta(\varrho)\,|\,\mathcal{F}_c(\varrho)]+R_{1t}u_{1p}(\varrho),\end{aligned}$$
which implies that (16) holds. Similarly, subtracting (20) from (7) and applying (10), (17) is valid. This ends the proof. □
It is noted that, through the transformation in Lemma 1 and the definitions (9) and (10), the FBSDEs (12) and (14) can be successfully solved, as four unknown variables exist in the five Equations (12) and (14)–(17).
Now the following Riccati equations are introduced:
$$H_c(\varrho)=D^{\top}H_c(\varrho+1)D-M_c^{\top}(\varrho)\Upsilon_c(\varrho)^{-1}M_c(\varrho)+Q_t, \tag{21}$$
$$H_1(\varrho)=D^{\top}H_c(\varrho+1)D-M_1^{\top}(\varrho)\Upsilon_1(\varrho)^{-1}M_1(\varrho)+Q_t, \tag{22}$$
$$H_2(\varrho)=D^{\top}H_2(\varrho+1)D-M_2^{\top}(\varrho)\Upsilon_2(\varrho)^{-1}M_2(\varrho)+Q_t, \tag{23}$$
where
$$\Upsilon_c(\varrho)=\bar{G}^{\top}H_c(\varrho+1)\bar{G}+R_t,\qquad M_c(\varrho)=\bar{G}^{\top}H_c(\varrho+1)D, \tag{24}$$
$$\Upsilon_1(\varrho)=\bar{G}_1^{\top}H_c(\varrho+1)\bar{G}_1+R_{1t},\qquad M_1(\varrho)=\bar{G}_1^{\top}H_c(\varrho+1)D, \tag{25}$$
$$\Upsilon_2(\varrho)=\bar{G}_2^{\top}H_2(\varrho+1)\bar{G}_2+R_{2t},\qquad M_2(\varrho)=\bar{G}_2^{\top}H_2(\varrho+1)D, \tag{26}$$
with terminal values H_c(ϰ+1) = H_1(ϰ+1) = H_2(ϰ+1) = H(ϰ+1).
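For concreteness, the backward recursions (21)–(26) can be implemented directly. The routine below is a sketch under our own shape conventions (all arguments are 2-D numpy arrays); it returns the gain sequences Υ⁻¹M that appear with a minus sign in the controllers of Theorem 1, and H_1 is carried along because it enters the optimal cost (33). Note that, following (22) and (25), the H_1 recursion is driven by H_c(ϱ+1).

```python
import numpy as np

def riccati_backward(D, G1bar, G2bar, Qt, R1t, R2t, H_term, horizon):
    """Backward Riccati recursions (21)-(26) with terminal values
    H_c = H_1 = H_2 = H(horizon + 1) = H_term.  Returns the solutions at
    time 0 and the gain sequences K = Upsilon^{-1} M for rho = 0..horizon."""
    Gbar = np.hstack([G1bar, G2bar])
    m1, m2 = G1bar.shape[1], G2bar.shape[1]
    Rt = np.block([[R1t, np.zeros((m1, m2))],
                   [np.zeros((m2, m1)), R2t]])
    Hc, H1, H2 = H_term.copy(), H_term.copy(), H_term.copy()
    Kc = [None] * (horizon + 1)
    K1 = [None] * (horizon + 1)
    K2 = [None] * (horizon + 1)
    for k in range(horizon, -1, -1):
        Uc, Mc = Gbar.T @ Hc @ Gbar + Rt, Gbar.T @ Hc @ D       # (24)
        U1, M1 = G1bar.T @ Hc @ G1bar + R1t, G1bar.T @ Hc @ D   # (25)
        U2, M2 = G2bar.T @ H2 @ G2bar + R2t, G2bar.T @ H2 @ D   # (26)
        Kc[k] = np.linalg.solve(Uc, Mc)   # solvable iff Upsilon_c invertible
        K1[k] = np.linalg.solve(U1, M1)
        K2[k] = np.linalg.solve(U2, M2)
        H1 = D.T @ Hc @ D - M1.T @ K1[k] + Qt   # (22): uses H_c(k+1)
        H2 = D.T @ H2 @ D - M2.T @ K2[k] + Qt   # (23)
        Hc = D.T @ Hc @ D - Mc.T @ Kc[k] + Qt   # (21)
    return Hc, H1, H2, Kc, K1, K2
```

Using `np.linalg.solve` instead of forming the inverse both improves conditioning and fails loudly when the invertibility condition of Theorem 1 is violated.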
The main results are presented now.
Theorem 1.
Problem 1 admits a unique solution if and only if Υ_c(ϱ), Υ_1(ϱ) and Υ_2(ϱ) are invertible for ϱ = 0, …, ϰ. The optimal controllers are
$$u_c(\varrho)=-\Upsilon_c(\varrho)^{-1}M_c(\varrho)\hat{\psi}_c(\varrho|\varrho), \tag{27}$$
$$u_{1p}(\varrho)=-\Upsilon_1(\varrho)^{-1}M_1(\varrho)\begin{bmatrix}\tilde{\psi}_1(\varrho|\varrho)\\0\end{bmatrix}, \tag{28}$$
$$u_{2p}(\varrho)=-\Upsilon_2(\varrho)^{-1}M_2(\varrho)\begin{bmatrix}0\\\tilde{\psi}_2(\varrho|\varrho)\end{bmatrix}, \tag{29}$$
where
$$\hat{\psi}_c(\varrho|\varrho)=\mathbb{E}[\psi(\varrho)\,|\,\mathcal{F}_c(\varrho)], \tag{30}$$
$$\tilde{\psi}_1(\varrho|\varrho)=\psi_1(\varrho)-\mathbb{E}[\psi_1(\varrho)\,|\,\mathcal{F}_c(\varrho)]=\psi_1(\varrho)-\hat{\psi}_{1c}(\varrho|\varrho), \tag{31}$$
$$\tilde{\psi}_2(\varrho|\varrho)=\psi_2(\varrho)-\mathbb{E}[\psi_2(\varrho)\,|\,\mathcal{F}_c(\varrho)]=\psi_2(\varrho)-\hat{\psi}_{2c}(\varrho|\varrho). \tag{32}$$
Then, the optimal cost is
$$\Omega^*(\varkappa)=\mathbb{E}\Big[\psi^{\top}(0)H_c(0)\bar{\psi}(0)+\psi^{\top}(0)H_2(0)\begin{bmatrix}0\\\tilde{\psi}_2(0|0)\end{bmatrix}\Big]+\sum_{\varrho=0}^{\varkappa}\mathrm{tr}\Big[\big(H_1(\varrho)+H_1(\varrho+1)\big)\begin{bmatrix}Q_{\lambda_1}&0\\0&0\end{bmatrix}+H_2(\varrho+1)\begin{bmatrix}0&0\\0&Q_{\lambda_2}\end{bmatrix}\Big], \tag{33}$$
where $\bar{\psi}(0)=\begin{bmatrix}\bar{\psi}_1(0)\\\bar{\psi}_2(0)\end{bmatrix}$. Furthermore, the solution of the FBSDEs (12) and (14) is
$$\eta(\varrho-1)=H_c(\varrho)\hat{\psi}_c(\varrho|\varrho)+H_1(\varrho)\begin{bmatrix}\tilde{\psi}_1(\varrho|\varrho)\\0\end{bmatrix}+H_2(\varrho)\begin{bmatrix}0\\\tilde{\psi}_2(\varrho|\varrho)\end{bmatrix}. \tag{34}$$
Proof
Please view Appendix A. □

3. Numerical Examples

Target tracking (TT) systems have drawn significant attention due to their vast applications, such as navigation, collision avoidance, monitoring missions and so on [37].
Consider a simple TT system of two unmanned aerial vehicles (UAVs), as depicted in Figure 1. The two UAVs are named UAV1 and UAV2. In the TT system, UAV1 is the target vehicle with inferior equipment; UAV2, with superior equipment, tracks UAV1 within a safe distance and monitors UAV1's information. UAV1, however, cannot observe the information of UAV2 owing to its poor equipment when the distance between the two UAVs is sufficiently large. To avoid being detected, UAV2 should keep a safe distance from UAV1; thus, when UAV2 monitors the information of UAV1, a delay is unavoidable. The location and velocity of UAV1 and UAV2 are denoted by ϵ_1(ϱ), v_1(ϱ) and ϵ_2(ϱ), v_2(ϱ), respectively (for simplicity, we suppose that the UAVs fly along a straight route, the variables are scalars, and the delay is one step).
Correspondingly, the dynamic equations for UAV1 and UAV2 can be obtained as follows:
$$\begin{aligned}\epsilon_1(\varrho+1)&=d_{11}\epsilon_1(\varrho)+g_{11}v_1(\varrho)+\lambda_1(\varrho),\\ \epsilon_2(\varrho+1)&=\epsilon_2(\varrho)+d_{21}\epsilon_1(\varrho)+g_{22}v_2(\varrho)+g_{21}v_1(\varrho)+\lambda_2(\varrho),\end{aligned}$$
where λ_1(ϱ) and λ_2(ϱ) represent the uncertain flight conditions, and d_11, d_21, g_11, g_21, g_22 are scalar constants. The initial locations ϵ_1(0) and ϵ_2(0) and the noises λ_1(ϱ) and λ_2(ϱ) are mutually independent Gaussian white noises with means (μ_1, μ_2, 0, 0) and covariances (σ_1, σ_2, Q_{λ1}, Q_{λ2}), respectively.
From Figure 1, at time ϱ, UAV2 can observe the locations {ϵ_1(ϱ−1), …, ϵ_1(0)} and velocities {v_1(ϱ−1), …, v_1(0)} of UAV1, as well as its own locations {ϵ_2(ϱ), ϵ_2(ϱ−1), …, ϵ_2(0)} and velocities {v_2(ϱ), v_2(ϱ−1), …, v_2(0)}. Obviously, the communication channel from UAV1 to UAV2 has a one-step delay. UAV1 can only obtain its own locations {ϵ_1(ϱ), ϵ_1(ϱ−1), …, ϵ_1(0)} and velocities {v_1(ϱ), v_1(ϱ−1), …, v_1(0)}. In this scenario, the aim of the TT system is to keep the distance between UAV1 and UAV2 at the safe distance l while keeping the energy costs of UAV1 and UAV2 relatively low. Accordingly, the cost function is of the form
$$\Omega(\varkappa)=\sum_{\varrho=0}^{\varkappa}\mathbb{E}\big\{[\epsilon_2(\varrho)-\epsilon_1(\varrho)-l]^2+R_{1t}v_1^2(\varrho)+R_{2t}v_2^2(\varrho)\big\},$$
where R_1t > 0 and R_2t > 0 are the weighting coefficients of the energy cost for UAV1 and UAV2, respectively.
This problem can be solved by using the results obtained in this paper. Firstly, we define ψ_1(ϱ) = ϵ_1(ϱ), ψ_2(ϱ) = ϵ_2(ϱ) − l, u_1(ϱ) = v_1(ϱ) and u_2(ϱ) = v_2(ϱ). Then, the corresponding linear system and finite-horizon cost function for the TT system can be written as
$$\psi(\varrho+1)=D\psi(\varrho)+\bar{G}_1u_1(\varrho)+\bar{G}_2u_2(\varrho)+e(\varrho),$$
$$\Omega(\varkappa)=\mathbb{E}\Big\{\sum_{\varrho=0}^{\varkappa}\Big[\psi^{\top}(\varrho)Q_t\psi(\varrho)+\sum_{i=1}^{2}R_{it}u_i^2(\varrho)\Big]\Big\},$$
where
$$\psi(\varrho)=\begin{bmatrix}\psi_1(\varrho)\\\psi_2(\varrho)\end{bmatrix},\quad D=\begin{bmatrix}d_{11}&0\\d_{21}&1\end{bmatrix},\quad \bar{G}_1=\begin{bmatrix}g_{11}\\g_{21}\end{bmatrix},\quad \bar{G}_2=\begin{bmatrix}0\\g_{22}\end{bmatrix},\quad e(\varrho)=\begin{bmatrix}\lambda_1(\varrho)\\\lambda_2(\varrho)\end{bmatrix},\quad Q_t=\begin{bmatrix}1&-1\\-1&1\end{bmatrix}.$$
By virtue of Theorem 1, we give the following corollary.
Corollary 1.
The optimal controllers u_1*(ϱ) and u_2*(ϱ) are of the forms
$$u_1^*(\varrho)=-\begin{bmatrix}1&0\end{bmatrix}\Upsilon_c(\varrho)^{-1}M_c(\varrho)\hat{\psi}_c(\varrho|\varrho)-\Upsilon_1(\varrho)^{-1}M_1(\varrho)\begin{bmatrix}\tilde{\psi}_1(\varrho|\varrho)\\0\end{bmatrix},$$
$$u_2^*(\varrho)=-\begin{bmatrix}0&1\end{bmatrix}\Upsilon_c(\varrho)^{-1}M_c(\varrho)\hat{\psi}_c(\varrho|\varrho)-\Upsilon_2(\varrho)^{-1}M_2(\varrho)\begin{bmatrix}0\\\tilde{\psi}_2(\varrho|\varrho)\end{bmatrix},$$
where ψ̂_c(ϱ|ϱ) is the estimate of ψ(ϱ) based on {ψ_1(ϱ−1), …, ψ_1(0)}, ψ̃_1(ϱ|ϱ) = ψ_1(ϱ) − ψ̂_1c(ϱ|ϱ) and ψ̃_2(ϱ|ϱ) = ψ_2(ϱ) − ψ̂_2c(ϱ|ϱ) are the corresponding estimation errors, and Υ_c(ϱ), M_c(ϱ), Υ_1(ϱ), M_1(ϱ), Υ_2(ϱ), M_2(ϱ) are given in (24)–(26).
For simplicity, we set the safe distance l = 3, d_11 = d_21 = g_11 = g_21 = g_22 = 1, μ_1 = μ_2 = 0, σ_1 = σ_2 = Q_{λ1} = Q_{λ2} = 1, R_1t = 1.5, R_2t = 2 and N = 100.
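Under these parameter values, the closed loop of Corollary 1 can be simulated end to end. The script below is a sketch; in particular, the recursion used for the common estimate ψ̂_c is our own construction, valid here because D_12 = G_12 = 0 in this example (player 1's state is unaffected by player 2, so the one-step predictor driven by the delayed ψ_1 and u_1 information is exact), and is not a formula stated in the paper.

```python
import numpy as np

rng = np.random.default_rng(1)

# Example parameters from the text
l = 3.0
d11 = d21 = g11 = g21 = g22 = 1.0
R1t, R2t = 1.5, 2.0
N = 100

D = np.array([[d11, 0.0], [d21, 1.0]])
G1 = np.array([[g11], [g21]])
G2 = np.array([[0.0], [g22]])
Gb = np.hstack([G1, G2])
Rt = np.diag([R1t, R2t])
Qt = np.array([[1.0, -1.0], [-1.0, 1.0]])

# Backward Riccati passes (21), (23)-(26); the example cost has no terminal weight
Hc = np.zeros((2, 2))
H2 = np.zeros((2, 2))
Kc, K1, K2 = [None] * (N + 1), [None] * (N + 1), [None] * (N + 1)
for k in range(N, -1, -1):
    Uc, Mc = Gb.T @ Hc @ Gb + Rt, Gb.T @ Hc @ D
    U1, M1 = G1.T @ Hc @ G1 + np.array([[R1t]]), G1.T @ Hc @ D
    U2, M2 = G2.T @ H2 @ G2 + np.array([[R2t]]), G2.T @ H2 @ D
    Kc[k] = np.linalg.solve(Uc, Mc)
    K1[k] = np.linalg.solve(U1, M1)
    K2[k] = np.linalg.solve(U2, M2)
    Hc = D.T @ Hc @ D - Mc.T @ Kc[k] + Qt
    H2 = D.T @ H2 @ D - M2.T @ K2[k] + Qt

# Forward simulation with the controllers of Corollary 1.
# psi1 = eps1, psi2 = eps2 - l.  The psi_hat recursion below assumes the
# example structure D12 = G12 = 0 (see lead-in).
eps0 = rng.standard_normal(2)              # initial locations, mean (0, 0)
psi = np.array([eps0[0], eps0[1] - l])
psi_hat = np.array([0.0, -l])              # E[psi(0)]; F_c(0) is empty
dist = []
for k in range(N + 1):
    t1, t2 = psi[0] - psi_hat[0], psi[1] - psi_hat[1]       # estimation errors
    uc = -(Kc[k] @ psi_hat)                                 # common part (27)
    u1 = uc[0] - (K1[k] @ np.array([t1, 0.0])).item()       # u1 = u1c + u1p
    u2 = uc[1] - (K2[k] @ np.array([0.0, t2])).item()       # u2 = u2c + u2p
    dist.append(psi[1] + l - psi[0])                        # distance eps2 - eps1
    lam = rng.standard_normal(2)                            # flight-condition noise
    psi_hat = np.array([d11 * psi[0] + g11 * u1,
                        psi_hat[1] + d21 * psi[0] + g21 * u1 + g22 * uc[1]])
    psi = D @ psi + G1[:, 0] * u1 + G2[:, 0] * u2 + lam
print(np.mean(dist[20:]))   # average distance, close to the safe distance l = 3
```

This reproduces the qualitative behavior reported around Figure 2: after a short transient, the inter-UAV distance fluctuates around the safe distance l = 3.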
Figure 2 shows the distances between UAV1 and UAV2. It can be observed that the distances between the two UAVs are around the safe distance l = 3 . Figure 3 indicates the velocities of the two UAVs. It can be seen that the velocities of UAV2 are close to those of UAV1.

4. Conclusions

This paper has addressed the optimal control and stabilization problems for decentralized systems with one-step asymmetric delayed information. Employing Pontryagin's maximum principle, the costate and equilibrium equations are derived. By virtue of these equations, we obtain the solution to the FBSDEs, based on which the finite-horizon optimization problem is solved. In view of the finite-horizon optimal cost, a Lyapunov function is defined and the mean-square stabilization conditions for the system are derived in terms of two algebraic Riccati equations. The proposed algorithm is applied to a target tracking system to verify the effectiveness of the obtained results. Future research will focus on extending the proposed framework to multi-player interconnected systems, relaxing the one-step delay constraint to multi-step delays, investigating infinite-horizon scenarios, weakening the matrix invertibility assumption, and accommodating more general non-Gaussian noise distributions to enhance the method's generality and applicability.

Author Contributions

Conceptualization, Y.W. (Yixing Wang) and X.L.; methodology, Y.W. (Yixing Wang); software, Y.W. (Yirun Wang); validation, Y.W. (Yirun Wang) and B.T.; formal analysis, X.L.; writing—review and editing, X.L. and B.T. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by Youth Innovation Team Program of Shandong Higher Education Institution under grant 2023KJN049.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Proof of Theorem 1

Necessity: Assuming that Problem 1 admits a unique solution, we shall show by mathematical induction that Υ_c(ϱ), Υ_1(ϱ) and Υ_2(ϱ) are invertible for ϱ = 0, …, ϰ and that the optimal controllers are given by (27)–(29). Noting (18) and (30)–(32), (34) holds for ϱ = ϰ + 1. Define
$$\Omega(j)=\mathbb{E}\Big\{\sum_{\varrho=j}^{\varkappa}\Big[\psi^{\top}(\varrho)Q_t\psi(\varrho)+u_c^{\top}(\varrho)R_tu_c(\varrho)+\sum_{i=1}^{2}u_{ip}^{\top}(\varrho)R_{it}u_{ip}(\varrho)\Big]+\psi^{\top}(\varkappa+1)H(\varkappa+1)\psi(\varkappa+1)\Big\}.$$
For ϱ = ϰ, since Υ_c(ϰ), Υ_1(ϰ) and Υ_2(ϰ) do not depend on ψ(ϰ) or e(ϰ), using (12) and setting ψ(ϰ) = 0 and e(ϰ) = 0, together with (24)–(26), we have
$$\begin{aligned}\Omega(\varkappa)&=\mathbb{E}\Big\{\psi^{\top}(\varkappa)Q_t\psi(\varkappa)+u_c^{\top}(\varkappa)R_tu_c(\varkappa)+\sum_{i=1}^{2}u_{ip}^{\top}(\varkappa)R_{it}u_{ip}(\varkappa)+\psi^{\top}(\varkappa+1)H(\varkappa+1)\psi(\varkappa+1)\Big\}\\&=\mathbb{E}\Big\{u_c^{\top}(\varkappa)\big(\bar{G}^{\top}H(\varkappa+1)\bar{G}+R_t\big)u_c(\varkappa)+\sum_{i=1}^{2}u_{ip}^{\top}(\varkappa)\big(R_{it}+\bar{G}_i^{\top}H(\varkappa+1)\bar{G}_i\big)u_{ip}(\varkappa)\Big\}\\&=\mathbb{E}\Big\{u_c^{\top}(\varkappa)\Upsilon_c(\varkappa)u_c(\varkappa)+\sum_{i=1}^{2}u_{ip}^{\top}(\varkappa)\Upsilon_i(\varkappa)u_{ip}(\varkappa)\Big\}.\end{aligned}$$
The uniqueness of u c ( ϰ ) , u 1 p ( ϰ ) and u 2 p ( ϰ ) implies that the quadratic terms of the controllers are positive for any nonzero controllers. It follows that Υ c ( ϰ ) > 0 , Υ 1 ( ϰ ) > 0 and Υ 2 ( ϰ ) > 0 .
Using (11), (12), (15) and (18), we have
$$0=\mathbb{E}[\bar{G}^{\top}H(\varkappa+1)\psi(\varkappa+1)\,|\,\mathcal{F}_c(\varkappa)]+R_tu_c(\varkappa)=\bar{G}^{\top}H(\varkappa+1)\big[D\hat{\psi}_c(\varkappa|\varkappa)+\bar{G}u_c(\varkappa)\big]+R_tu_c(\varkappa).$$
Therefore, the optimal u_c(ϰ) satisfies
$$u_c(\varkappa)=-\Upsilon_c(\varkappa)^{-1}M_c(\varkappa)\hat{\psi}_c(\varkappa|\varkappa). \tag{A1}$$
By virtue of (11), (12), (16), (18) and (31), we obtain
$$\begin{aligned}0&=\mathbb{E}[\bar{G}_1^{\top}H(\varkappa+1)\psi(\varkappa+1)\,|\,\mathcal{F}_1(\varkappa)]-\mathbb{E}[\bar{G}_1^{\top}H(\varkappa+1)\psi(\varkappa+1)\,|\,\mathcal{F}_c(\varkappa)]+R_{1t}u_{1p}(\varkappa)\\
&=\bar{G}_1^{\top}H(\varkappa+1)\begin{bmatrix}D_{11}\psi_1(\varkappa)+D_{12}\hat{\psi}_{2c}(\varkappa|\varkappa)+G_{11}u_1(\varkappa)+G_{12}u_{2c}(\varkappa)\\D_{21}\psi_1(\varkappa)+D_{22}\hat{\psi}_{2c}(\varkappa|\varkappa)+G_{21}u_1(\varkappa)+G_{22}u_{2c}(\varkappa)\end{bmatrix}\\
&\quad-\bar{G}_1^{\top}H(\varkappa+1)\begin{bmatrix}D_{11}\hat{\psi}_{1c}(\varkappa|\varkappa)+D_{12}\hat{\psi}_{2c}(\varkappa|\varkappa)+G_{11}u_{1c}(\varkappa)+G_{12}u_{2c}(\varkappa)\\D_{21}\hat{\psi}_{1c}(\varkappa|\varkappa)+D_{22}\hat{\psi}_{2c}(\varkappa|\varkappa)+G_{21}u_{1c}(\varkappa)+G_{22}u_{2c}(\varkappa)\end{bmatrix}+R_{1t}u_{1p}(\varkappa)\\
&=\bar{G}_1^{\top}H(\varkappa+1)\begin{bmatrix}D_{11}\tilde{\psi}_1(\varkappa|\varkappa)+G_{11}u_{1p}(\varkappa)\\D_{21}\tilde{\psi}_1(\varkappa|\varkappa)+G_{21}u_{1p}(\varkappa)\end{bmatrix}+R_{1t}u_{1p}(\varkappa)\\
&=\bar{G}_1^{\top}H(\varkappa+1)D\begin{bmatrix}\tilde{\psi}_1(\varkappa|\varkappa)\\0\end{bmatrix}+\big(\bar{G}_1^{\top}H(\varkappa+1)\bar{G}_1+R_{1t}\big)u_{1p}(\varkappa).\end{aligned}$$
Thus, the optimal u_1p(ϰ) is
$$u_{1p}(\varkappa)=-\Upsilon_1(\varkappa)^{-1}M_1(\varkappa)\begin{bmatrix}\tilde{\psi}_1(\varkappa|\varkappa)\\0\end{bmatrix}. \tag{A2}$$
By applying (12), (17), (18) and (32), we obtain
$$\begin{aligned}0&=\mathbb{E}[\bar{G}_2^{\top}H(\varkappa+1)\psi(\varkappa+1)\,|\,\mathcal{F}_2(\varkappa)]-\mathbb{E}[\bar{G}_2^{\top}H(\varkappa+1)\psi(\varkappa+1)\,|\,\mathcal{F}_c(\varkappa)]+R_{2t}u_{2p}(\varkappa)\\
&=\bar{G}_2^{\top}H(\varkappa+1)\begin{bmatrix}D_{11}\hat{\psi}_{1c}(\varkappa|\varkappa)+D_{12}\psi_2(\varkappa)+G_{11}u_{1c}(\varkappa)+G_{12}u_2(\varkappa)\\D_{21}\hat{\psi}_{1c}(\varkappa|\varkappa)+D_{22}\psi_2(\varkappa)+G_{21}u_{1c}(\varkappa)+G_{22}u_2(\varkappa)\end{bmatrix}\\
&\quad-\bar{G}_2^{\top}H(\varkappa+1)\begin{bmatrix}D_{11}\hat{\psi}_{1c}(\varkappa|\varkappa)+D_{12}\hat{\psi}_{2c}(\varkappa|\varkappa)+G_{11}u_{1c}(\varkappa)+G_{12}u_{2c}(\varkappa)\\D_{21}\hat{\psi}_{1c}(\varkappa|\varkappa)+D_{22}\hat{\psi}_{2c}(\varkappa|\varkappa)+G_{21}u_{1c}(\varkappa)+G_{22}u_{2c}(\varkappa)\end{bmatrix}+R_{2t}u_{2p}(\varkappa)\\
&=\bar{G}_2^{\top}H(\varkappa+1)\begin{bmatrix}D_{12}\tilde{\psi}_2(\varkappa|\varkappa)+G_{12}u_{2p}(\varkappa)\\D_{22}\tilde{\psi}_2(\varkappa|\varkappa)+G_{22}u_{2p}(\varkappa)\end{bmatrix}+R_{2t}u_{2p}(\varkappa)\\
&=\bar{G}_2^{\top}H(\varkappa+1)D\begin{bmatrix}0\\\tilde{\psi}_2(\varkappa|\varkappa)\end{bmatrix}+\big(\bar{G}_2^{\top}H(\varkappa+1)\bar{G}_2+R_{2t}\big)u_{2p}(\varkappa).\end{aligned}$$
Thus, the optimal u_2p(ϰ) is
$$u_{2p}(\varkappa)=-\Upsilon_2(\varkappa)^{-1}M_2(\varkappa)\begin{bmatrix}0\\\tilde{\psi}_2(\varkappa|\varkappa)\end{bmatrix}. \tag{A3}$$
Using (12), (15), (18) and (A1)–(A3), we obtain
$$\begin{aligned}\eta(\varkappa-1)&=\mathbb{E}[D^{\top}H(\varkappa+1)\psi(\varkappa+1)\,|\,\mathcal{F}(\varkappa)]+Q_t\psi(\varkappa)\\
&=D^{\top}H(\varkappa+1)\big[D\psi(\varkappa)+\bar{G}u_c(\varkappa)+\bar{G}_1u_{1p}(\varkappa)+\bar{G}_2u_{2p}(\varkappa)\big]+Q_t\psi(\varkappa)\\
&=D^{\top}H(\varkappa+1)D\psi(\varkappa)-M_c^{\top}(\varkappa)\Upsilon_c(\varkappa)^{-1}M_c(\varkappa)\hat{\psi}_c(\varkappa|\varkappa)-M_1^{\top}(\varkappa)\Upsilon_1(\varkappa)^{-1}M_1(\varkappa)\begin{bmatrix}\tilde{\psi}_1(\varkappa|\varkappa)\\0\end{bmatrix}\\
&\quad-M_2^{\top}(\varkappa)\Upsilon_2(\varkappa)^{-1}M_2(\varkappa)\begin{bmatrix}0\\\tilde{\psi}_2(\varkappa|\varkappa)\end{bmatrix}+Q_t\psi(\varkappa)\\
&=\big[D^{\top}H(\varkappa+1)D-M_c^{\top}(\varkappa)\Upsilon_c(\varkappa)^{-1}M_c(\varkappa)+Q_t\big]\hat{\psi}_c(\varkappa|\varkappa)+\big[D^{\top}H(\varkappa+1)D-M_1^{\top}(\varkappa)\Upsilon_1(\varkappa)^{-1}M_1(\varkappa)+Q_t\big]\begin{bmatrix}\tilde{\psi}_1(\varkappa|\varkappa)\\0\end{bmatrix}\\
&\quad+\big[D^{\top}H(\varkappa+1)D-M_2^{\top}(\varkappa)\Upsilon_2(\varkappa)^{-1}M_2(\varkappa)+Q_t\big]\begin{bmatrix}0\\\tilde{\psi}_2(\varkappa|\varkappa)\end{bmatrix}\\
&=H_c(\varkappa)\hat{\psi}_c(\varkappa|\varkappa)+H_1(\varkappa)\begin{bmatrix}\tilde{\psi}_1(\varkappa|\varkappa)\\0\end{bmatrix}+H_2(\varkappa)\begin{bmatrix}0\\\tilde{\psi}_2(\varkappa|\varkappa)\end{bmatrix},\end{aligned}$$
which means that (34) holds.
To complete the mathematical induction, for an arbitrary κ satisfying 0 ≤ κ ≤ ϰ, suppose that Υ_c(ϱ), Υ_1(ϱ) and Υ_2(ϱ) are invertible and that the optimal u_c(ϱ), u_1p(ϱ), u_2p(ϱ) and η(ϱ−1) are given by (27)–(29) and (34), respectively, for all ϱ ≥ κ + 1.
By virtue of (12) and (14)–(17), we have
$$\begin{aligned}&\mathbb{E}\big[\psi^{\top}(\varrho)\eta(\varrho-1)-\psi^{\top}(\varrho+1)\eta(\varrho)\big]\\
&=\mathbb{E}\big\{\psi^{\top}(\varrho)\mathbb{E}[D^{\top}\eta(\varrho)\,|\,\mathcal{F}(\varrho)]+\psi^{\top}(\varrho)Q_t\psi(\varrho)-\big[\psi^{\top}(\varrho)D^{\top}+u_c^{\top}(\varrho)\bar{G}^{\top}+u_{1p}^{\top}(\varrho)\bar{G}_1^{\top}+u_{2p}^{\top}(\varrho)\bar{G}_2^{\top}\big]\eta(\varrho)\big\}\\
&=\mathbb{E}\big\{\psi^{\top}(\varrho)Q_t\psi(\varrho)-u_c^{\top}(\varrho)\mathbb{E}[\bar{G}^{\top}\eta(\varrho)\,|\,\mathcal{F}_c(\varrho)]-u_{1p}^{\top}(\varrho)\mathbb{E}[\bar{G}_1^{\top}\eta(\varrho)\,|\,\mathcal{F}_1(\varrho)]-u_{2p}^{\top}(\varrho)\mathbb{E}[\bar{G}_2^{\top}\eta(\varrho)\,|\,\mathcal{F}_2(\varrho)]\big\}\\
&=\mathbb{E}\big[\psi^{\top}(\varrho)Q_t\psi(\varrho)+u_c^{\top}(\varrho)R_tu_c(\varrho)+u_{1p}^{\top}(\varrho)R_{1t}u_{1p}(\varrho)+u_{2p}^{\top}(\varrho)R_{2t}u_{2p}(\varrho)\big].\end{aligned}$$
Summing both sides of the above equation from ϱ = κ + 1 to ϱ = ϰ and using (18), we obtain
$$\mathbb{E}\big[\psi^{\top}(\kappa+1)\eta(\kappa)-\psi^{\top}(\varkappa+1)H(\varkappa+1)\psi(\varkappa+1)\big]=\sum_{\varrho=\kappa+1}^{\varkappa}\mathbb{E}\big[\psi^{\top}(\varrho)Q_t\psi(\varrho)+u_c^{\top}(\varrho)R_tu_c(\varrho)+u_{1p}^{\top}(\varrho)R_{1t}u_{1p}(\varrho)+u_{2p}^{\top}(\varrho)R_{2t}u_{2p}(\varrho)\big].$$
It follows that
$$\begin{aligned}\Omega(\kappa)&=\mathbb{E}\big[\psi^{\top}(\kappa)Q_t\psi(\kappa)+u_c^{\top}(\kappa)R_tu_c(\kappa)+u_{1p}^{\top}(\kappa)R_{1t}u_{1p}(\kappa)+u_{2p}^{\top}(\kappa)R_{2t}u_{2p}(\kappa)\big]\\
&\quad+\sum_{\varrho=\kappa+1}^{\varkappa}\mathbb{E}\big[\psi^{\top}(\varrho)Q_t\psi(\varrho)+u_c^{\top}(\varrho)R_tu_c(\varrho)+u_{1p}^{\top}(\varrho)R_{1t}u_{1p}(\varrho)+u_{2p}^{\top}(\varrho)R_{2t}u_{2p}(\varrho)\big]+\mathbb{E}\big[\psi^{\top}(\varkappa+1)H(\varkappa+1)\psi(\varkappa+1)\big]\\
&=\mathbb{E}\big[\psi^{\top}(\kappa)Q_t\psi(\kappa)+u_c^{\top}(\kappa)R_tu_c(\kappa)+u_{1p}^{\top}(\kappa)R_{1t}u_{1p}(\kappa)+u_{2p}^{\top}(\kappa)R_{2t}u_{2p}(\kappa)\big]+\mathbb{E}\big[\psi^{\top}(\kappa+1)\eta(\kappa)\big]. \end{aligned}\tag{A4}$$
From (34), for ϱ = κ + 1, setting ψ(κ) = 0 and e(κ) = 0, we have
$$\begin{aligned}\eta(\kappa)&=H_c(\kappa+1)\begin{bmatrix}D_{11}\psi_1(\kappa)+G_{11}u_1(\kappa)\\D_{21}\psi_1(\kappa)+D_{22}\hat{\psi}_{2c}(\kappa|\kappa)+G_{21}u_1(\kappa)+G_{22}u_{2c}(\kappa)\end{bmatrix}+H_1(\kappa+1)\begin{bmatrix}\lambda_1(\kappa)\\0\end{bmatrix}\\
&\quad+H_2(\kappa+1)\begin{bmatrix}0\\D_{22}\tilde{\psi}_2(\kappa|\kappa)+G_{22}u_{2p}(\kappa)+\lambda_2(\kappa)\end{bmatrix}\\
&=H_c(\kappa+1)\bar{G}u_c(\kappa)+H_c(\kappa+1)\bar{G}_1u_{1p}(\kappa)+H_2(\kappa+1)\bar{G}_2u_{2p}(\kappa). \end{aligned}\tag{A5}$$
Substituting (A5) into (A4) and setting ψ(κ) = 0 and e(κ) = 0, we obtain
$$\begin{aligned}\Omega(\kappa)&=\mathbb{E}\big\{u_c^{\top}(\kappa)\big[R_t+\bar{G}^{\top}H_c(\kappa+1)\bar{G}\big]u_c(\kappa)+u_{1p}^{\top}(\kappa)\big[R_{1t}+\bar{G}_1^{\top}H_c(\kappa+1)\bar{G}_1\big]u_{1p}(\kappa)\\
&\quad+u_{2p}^{\top}(\kappa)\big[R_{2t}+\bar{G}_2^{\top}H_2(\kappa+1)\bar{G}_2\big]u_{2p}(\kappa)\big\}\\
&=\mathbb{E}\big[u_c^{\top}(\kappa)\Upsilon_c(\kappa)u_c(\kappa)+u_{1p}^{\top}(\kappa)\Upsilon_1(\kappa)u_{1p}(\kappa)+u_{2p}^{\top}(\kappa)\Upsilon_2(\kappa)u_{2p}(\kappa)\big].\end{aligned}$$
Based on the uniqueness of the optimal controllers, it can be seen that Υ_c(κ) > 0, Υ_1(κ) > 0 and Υ_2(κ) > 0. The proofs of the optimality of u_c(κ), u_1p(κ), u_2p(κ) and η(κ−1) are analogous to the above procedure for ϱ = ϰ and are thus omitted here. The proof of necessity is completed.
Sufficiency: Assuming that Υ_c(ϱ), Υ_1(ϱ) and Υ_2(ϱ) are invertible for ϱ = 0, …, ϰ, we will demonstrate that Problem 1 admits a unique solution. Define
$$V(\varrho,\psi(\varrho))=\mathbb{E}\Big\{\psi^{\top}(\varrho)\Big[H_c(\varrho)\hat{\psi}_c(\varrho|\varrho)+H_1(\varrho)\begin{bmatrix}\tilde{\psi}_1(\varrho|\varrho)\\0\end{bmatrix}+H_2(\varrho)\begin{bmatrix}0\\\tilde{\psi}_2(\varrho|\varrho)\end{bmatrix}\Big]\Big\}.$$
By applying (12), (21)–(26) and (30)–(32), we obtain
$$\begin{aligned}&V(\varrho,\psi(\varrho))-V(\varrho+1,\psi(\varrho+1))\\
&=\mathbb{E}\Big\{\psi^{\top}(\varrho)\big[H_c(\varrho)-D^{\top}H_c(\varrho+1)D+M_c^{\top}(\varrho)\Upsilon_c(\varrho)^{-1}M_c(\varrho)\big]\psi(\varrho)-\psi^{\top}(\varrho)M_c^{\top}(\varrho)\Upsilon_c(\varrho)^{-1}M_c(\varrho)\psi(\varrho)\\
&\quad-2\hat{\psi}_c^{\top}(\varrho|\varrho)D^{\top}H_c(\varrho+1)\bar{G}u_c(\varrho)-u_c^{\top}(\varrho)\big[\Upsilon_c(\varrho)-R_t\big]u_c(\varrho)\\
&\quad-2\begin{bmatrix}\tilde{\psi}_1(\varrho|\varrho)\\0\end{bmatrix}^{\top}D^{\top}H_c(\varrho+1)\bar{G}_1u_{1p}(\varrho)-u_{1p}^{\top}(\varrho)\big[\Upsilon_1(\varrho)-R_{1t}\big]u_{1p}(\varrho)\\
&\quad-2\begin{bmatrix}0\\\tilde{\psi}_2(\varrho|\varrho)\end{bmatrix}^{\top}D^{\top}H_2(\varrho+1)\bar{G}_2u_{2p}(\varrho)-u_{2p}^{\top}(\varrho)\big[\Upsilon_2(\varrho)-R_{2t}\big]u_{2p}(\varrho)\\
&\quad-\begin{bmatrix}0\\\tilde{\psi}_2(\varrho|\varrho)\end{bmatrix}^{\top}M_2^{\top}(\varrho)\Upsilon_2(\varrho)^{-1}M_2(\varrho)\begin{bmatrix}0\\\tilde{\psi}_2(\varrho|\varrho)\end{bmatrix}+\begin{bmatrix}\tilde{\psi}_1(\varrho|\varrho)\\\tilde{\psi}_2(\varrho|\varrho)\end{bmatrix}^{\top}M_c^{\top}(\varrho)\Upsilon_c(\varrho)^{-1}M_c(\varrho)\begin{bmatrix}\tilde{\psi}_1(\varrho|\varrho)\\\tilde{\psi}_2(\varrho|\varrho)\end{bmatrix}\\
&\quad-\begin{bmatrix}\tilde{\psi}_1(\varrho|\varrho)\\0\end{bmatrix}^{\top}M_1^{\top}(\varrho)\Upsilon_1(\varrho)^{-1}M_1(\varrho)\begin{bmatrix}\tilde{\psi}_1(\varrho|\varrho)\\0\end{bmatrix}\Big\}\\
&=\mathbb{E}\big[\psi^{\top}(\varrho)Q_t\psi(\varrho)+u_c^{\top}(\varrho)R_tu_c(\varrho)+u_{1p}^{\top}(\varrho)R_{1t}u_{1p}(\varrho)+u_{2p}^{\top}(\varrho)R_{2t}u_{2p}(\varrho)\big]\\
&\quad-\mathbb{E}\Big\{\big[u_c(\varrho)+\Upsilon_c(\varrho)^{-1}M_c(\varrho)\hat{\psi}_c(\varrho|\varrho)\big]^{\top}\Upsilon_c(\varrho)\big[u_c(\varrho)+\Upsilon_c(\varrho)^{-1}M_c(\varrho)\hat{\psi}_c(\varrho|\varrho)\big]\\
&\quad+\Big[u_{1p}(\varrho)+\Upsilon_1(\varrho)^{-1}M_1(\varrho)\begin{bmatrix}\tilde{\psi}_1(\varrho|\varrho)\\0\end{bmatrix}\Big]^{\top}\Upsilon_1(\varrho)\Big[u_{1p}(\varrho)+\Upsilon_1(\varrho)^{-1}M_1(\varrho)\begin{bmatrix}\tilde{\psi}_1(\varrho|\varrho)\\0\end{bmatrix}\Big]\\
&\quad+\Big[u_{2p}(\varrho)+\Upsilon_2(\varrho)^{-1}M_2(\varrho)\begin{bmatrix}0\\\tilde{\psi}_2(\varrho|\varrho)\end{bmatrix}\Big]^{\top}\Upsilon_2(\varrho)\Big[u_{2p}(\varrho)+\Upsilon_2(\varrho)^{-1}M_2(\varrho)\begin{bmatrix}0\\\tilde{\psi}_2(\varrho|\varrho)\end{bmatrix}\Big]\Big\}\\
&\quad-\mathbb{E}\Big\{\begin{bmatrix}\lambda_1(\varrho)\\0\end{bmatrix}^{\top}\big[H_1(\varrho)+H_1(\varrho+1)\big]\begin{bmatrix}\lambda_1(\varrho)\\0\end{bmatrix}+\begin{bmatrix}0\\\lambda_2(\varrho)\end{bmatrix}^{\top}H_2(\varrho+1)\begin{bmatrix}0\\\lambda_2(\varrho)\end{bmatrix}\Big\}.\end{aligned}$$
Summing from ϱ = 0 to ϱ = ϰ on both sides of the above equation, we have
$$\begin{aligned}\Omega(\varkappa)&=\mathbb{E}\Big[\psi^{\top}(0)H_c(0)\bar{\psi}(0)+\psi^{\top}(0)H_2(0)\begin{bmatrix}0\\\tilde{\psi}_2(0|0)\end{bmatrix}\Big]+\sum_{\varrho=0}^{\varkappa}\mathrm{tr}\Big[\big(H_1(\varrho)+H_1(\varrho+1)\big)\begin{bmatrix}Q_{\lambda_1}&0\\0&0\end{bmatrix}+H_2(\varrho+1)\begin{bmatrix}0&0\\0&Q_{\lambda_2}\end{bmatrix}\Big]\\
&\quad+\sum_{\varrho=0}^{\varkappa}\mathbb{E}\Big\{\big[u_c(\varrho)+\Upsilon_c(\varrho)^{-1}M_c(\varrho)\hat{\psi}_c(\varrho|\varrho)\big]^{\top}\Upsilon_c(\varrho)\big[u_c(\varrho)+\Upsilon_c(\varrho)^{-1}M_c(\varrho)\hat{\psi}_c(\varrho|\varrho)\big]\\
&\quad+\Big[u_{1p}(\varrho)+\Upsilon_1(\varrho)^{-1}M_1(\varrho)\begin{bmatrix}\tilde{\psi}_1(\varrho|\varrho)\\0\end{bmatrix}\Big]^{\top}\Upsilon_1(\varrho)\Big[u_{1p}(\varrho)+\Upsilon_1(\varrho)^{-1}M_1(\varrho)\begin{bmatrix}\tilde{\psi}_1(\varrho|\varrho)\\0\end{bmatrix}\Big]\\
&\quad+\Big[u_{2p}(\varrho)+\Upsilon_2(\varrho)^{-1}M_2(\varrho)\begin{bmatrix}0\\\tilde{\psi}_2(\varrho|\varrho)\end{bmatrix}\Big]^{\top}\Upsilon_2(\varrho)\Big[u_{2p}(\varrho)+\Upsilon_2(\varrho)^{-1}M_2(\varrho)\begin{bmatrix}0\\\tilde{\psi}_2(\varrho|\varrho)\end{bmatrix}\Big]\Big\}.\end{aligned}$$
Since Υ_c(ϱ), Υ_1(ϱ) and Υ_2(ϱ) are positive definite, the optimal controllers satisfy (27)–(29) and the optimal performance index is (33). This completes the sufficiency part of the proof.

References

  1. Wonham, W.M. On a matrix Riccati equation of stochastic control. SIAM J. Control 1968, 6, 681–697. [Google Scholar] [CrossRef]
  2. Lu, M.; Liu, S.; Luo, Y.; Qi, Q.; Sun, Y.; Li, J. Modeling and Output Feedback Tracking Control of Self-Developed Autonomous Underwater Vehicle. Unmanned Systems 2025, 13, 1639–1660. [Google Scholar] [CrossRef]
  3. Wang, X.; Tan, C.P.; Wang, Y.; Qi, Q.; Wang, X. Adaptive Interval Observer-Based Fault-Tolerant Control for a 3-DOF Helicopter Without Angular Velocity Measurement. IEEE Trans. Control Syst. Technol. 2025, 33, 2476–2482. [Google Scholar] [CrossRef]
  4. Qi, Q.; Xie, L.; Zhang, H.; Liang, X. Jointly Optimal Local and Remote Controls for Networked Multiple Systems with Multiplicative Noises and Unreliable Uplink Channels. IEEE Trans. Autom. Control 2025, 70, 1054–1067. [Google Scholar] [CrossRef]
  5. Qi, Q.; Xie, L.; Zhang, H. Optimal Control for Stochastic Systems with Multiple Controllers of Different Information Structures. IEEE Trans. Autom. Control 2021, 66, 4160–4175. [Google Scholar] [CrossRef]
  6. Kleinman, D. Optimal stationary control of linear systems with control-dependent noise. IEEE Trans. Autom. Control 1969, 14, 673–677. [Google Scholar] [CrossRef]
  7. Bismut, J.M. Linear quadratic optimal stochastic control with random coefficients. SIAM J. Control 1976, 14, 419–444. [Google Scholar] [CrossRef]
  8. Zou, L.; Wang, Z.; Hu, J.; Dong, H. Ultimately Bounded Filtering Subject to Impulsive Measurement Outliers. IEEE Trans. Autom. Control 2022, 67, 304–319. [Google Scholar] [CrossRef]
  9. Zou, L.; Wang, Z.; Hu, J.; Liu, Y.; Liu, X. Communication-protocol-based analysis and synthesis of networked systems: Progress, prospects and challenges. Int. J. Syst. Sci. 2021, 52, 3013–3034. [Google Scholar] [CrossRef]
  10. Zhang, H.; Xu, J. Control for Itô stochastic systems with input delay. IEEE Trans. Autom. Control 2016, 62, 350–365. [Google Scholar] [CrossRef]
  11. Rami, M.A.; Zhou, X.Y. Linear matrix inequalities, Riccati equations, and indefinite stochastic linear quadratic controls. IEEE Trans. Autom. Control 2000, 45, 1131–1143. [Google Scholar] [CrossRef]
  12. Yong, J.; Zhou, X.Y. Stochastic Controls: Hamiltonian Systems and HJB Equations; Springer: New York, NY, USA, 1999; Volume 43. [Google Scholar]
  13. Sinopoli, B.; Schenato, L.; Franceschetti, M.; Poolla, K.; Jordan, M.I.; Sastry, S.S. Kalman filtering with intermittent observations. IEEE Trans. Autom. Control 2004, 49, 1453–1464. [Google Scholar] [CrossRef]
  14. Zhang, H.; Li, L.; Xu, J.; Fu, M. Linear quadratic regulation and stabilization of discrete-time systems with delay and multiplicative noise. IEEE Trans. Autom. Control 2015, 60, 2599–2613. [Google Scholar] [CrossRef]
  15. Lessard, L.; Lall, S. An algebraic approach to the control of decentralized systems. IEEE Trans. Control Netw. Syst. 2014, 1, 308–317. [Google Scholar] [CrossRef]
  16. Fanti, M.P.; Mangini, A.M.; Pedroncelli, G.; Ukovich, W. A decentralized control strategy for the coordination of AGV systems. Control Eng. Pract. 2018, 70, 86–97. [Google Scholar] [CrossRef]
  17. Lipsa, G.M.; Martins, N.C. Optimal memoryless control in Gaussian noise: A simple counterexample. Automatica 2011, 47, 552–558. [Google Scholar] [CrossRef]
  18. Bamieh, B.; Voulgaris, P.G. A convex characterization of distributed control problems in spatially invariant systems with communication constraints. Syst. Control Lett. 2005, 54, 575–583. [Google Scholar] [CrossRef]
  19. Torabi, A.; Zarei, J.; Razavi-Far, R.; Saif, M. Decentralized Resilient Output-Feedback Control Design for Networked Control Systems Under Denial-of-Service. IEEE Syst. J. 2022, 16, 5620–5629. [Google Scholar] [CrossRef]
  20. He, W.; Li, S.; Ahn, C.K.; Guo, J.; Xiang, Z. Global Decentralized Control of p-Normal Large-Scale Nonlinear Systems Based on Sampled-Data Output Feedback. IEEE Syst. J. 2021, 15, 3540–3548. [Google Scholar] [CrossRef]
  21. Tsitsiklis, J.; Athans, M. On the complexity of decentralized decision making and detection problems. IEEE Trans. Autom. Control 1985, 30, 440–446. [Google Scholar] [CrossRef]
  22. Ho, Y.C.; Chu, K.C. Team decision theory and information structures in optimal control problems–Part I. IEEE Trans. Autom. Control 1972, 17, 15–22. [Google Scholar] [CrossRef]
  23. Nayyar, A.; Mahajan, A.; Teneketzis, D. Decentralized stochastic control with partial history sharing: A common information approach. IEEE Trans. Autom. Control 2013, 58, 1644–1658. [Google Scholar] [CrossRef]
  24. Mahajan, A.; Nayyar, A. Sufficient statistics for linear control strategies in decentralized systems with partial history sharing. IEEE Trans. Autom. Control 2015, 60, 2046–2056. [Google Scholar] [CrossRef]
  25. Liang, X.; Xu, J.; Zhang, H. Optimal control and stabilization for networked control systems with asymmetric information. IEEE Trans. Control Netw. Syst. 2020, 7, 1355–1365. [Google Scholar] [CrossRef]
  26. Ouyang, Y.; Asghari, S.M.; Nayyar, A. Optimal infinite horizon decentralized networked controllers with unreliable communication. IEEE Trans. Autom. Control 2020, 66, 1778–1785. [Google Scholar] [CrossRef]
  27. Liang, X.; Xu, J. Control for networked control systems with remote and local controllers over unreliable communication channel. Automatica 2018, 98, 86–94. [Google Scholar] [CrossRef]
  28. Asghari, S.M.; Ouyang, Y.; Nayyar, A. Optimal local and remote controllers with unreliable uplink channels. IEEE Trans. Autom. Control 2018, 64, 1816–1831. [Google Scholar] [CrossRef]
  29. Liang, X.; Qi, Q.; Zhang, H.; Xie, L. Decentralized control for networked control systems with asymmetric information. IEEE Trans. Autom. Control 2022, 67, 2076–2083. [Google Scholar] [CrossRef]
  30. Yang, G.H.; Wang, J.L.; Soh, Y.C. Reliable LQG control with sensor failures. IEE Proc. Control Theory Appl. 2000, 147, 433–439. [Google Scholar] [CrossRef]
  31. Yang, Y.; Yang, G.H.; Soh, Y.C. Reliable control of discrete-time systems with actuator failure. IEE Proc. Control Theory Appl. 2000, 147, 428–432. [Google Scholar] [CrossRef]
  32. Lamperski, A.; Lessard, L. Optimal decentralized state-feedback control with sparsity and delays. Automatica 2015, 58, 143–151. [Google Scholar] [CrossRef]
  33. Feyzmahdavian, H.R.; Alam, A.; Gattami, A. Optimal distributed controller design with communication delays: Application to vehicle formations. In Proceedings of the 2012 IEEE 51st IEEE Conference on Decision and Control (CDC), Maui, HI, USA, 10–13 December 2012; pp. 2232–2237. [Google Scholar]
  34. Wang, Y.; Xiong, J.; Ho, D.W. Globally optimal state-feedback LQG control for large-scale systems with communication delays and correlated subsystem process noises. IEEE Trans. Autom. Control 2019, 64, 4196–4201. [Google Scholar] [CrossRef]
  35. Nayyar, N.; Kalathil, D.; Jain, R. Optimal decentralized control with asymmetric one-step delayed information sharing. IEEE Trans. Control Netw. Syst. 2016, 5, 653–663. [Google Scholar] [CrossRef]
  36. Wang, H.; Zhang, H.; Li, L.; Fu, M. LQR and stabilization for discrete-time systems with multiplicative noises and input delays. IEEE Trans. Autom. Control 2023, 69, 3515–3530. [Google Scholar] [CrossRef]
  37. Mazor, E.; Averbuch, A.; Bar-Shalom, Y.; Dayan, J. Interacting multiple model methods in target tracking: A survey. IEEE Trans. Aerosp. Electron. Syst. 1998, 34, 103–123. [Google Scholar] [CrossRef]
Figure 1. The TT system of UAV1 and UAV2.
Figure 2. Distance between UAV1 and UAV2.
Figure 3. Velocities of UAV1 and UAV2.

Share and Cite

Wang, Y.; Wang, Y.; Tan, B.; Li, X.; Liang, X. Decentralized Control for Interrelated Systems with Asymmetric Information Architecture. Electronics 2026, 15, 96. https://doi.org/10.3390/electronics15010096