Article

Operator Newton Method for Large-Scale Coupled Riccati Equations Arising from Jump Systems

School of Science, Hunan University of Technology, Zhuzhou 412007, China
* Author to whom correspondence should be addressed.
Axioms 2025, 14(8), 601; https://doi.org/10.3390/axioms14080601
Submission received: 27 June 2025 / Revised: 26 July 2025 / Accepted: 28 July 2025 / Published: 1 August 2025
(This article belongs to the Special Issue Advances in Linear Algebra with Applications, 2nd Edition)

Abstract

We consider a class of coupled discrete-time Riccati equations arising from jump systems. To compute their solutions when the systems reach a steady state, we propose an operator Newton method and establish its quadratic convergence under suitable assumptions. The advantage of the proposed method lies in the fact that its subproblems are solved by the operator Smith method, which allows it to maintain quadratic convergence in both the inner and outer iterations. Moreover, it does not require the constant term matrix of the equation to be invertible, making it more broadly applicable than existing inverse-free iterative methods. For large-scale problems, we develop a low-rank variant by incorporating truncation and compression techniques into the operator Newton framework. A complexity analysis is also provided to assess its scalability. Numerical experiments demonstrate that the presented low-rank operator Newton method is highly effective in approximating solutions to large-scale structured coupled Riccati equations.

1. Introduction

Consider a discrete-time jump control system [1,2,3], modeled by
$$x_{k+1} = A_i x_k + B_i u_k, \quad y_k = C_i x_k, \quad i = 1, \dots, m, \quad k = 1, 2, \dots,$$
where $A_i \in \mathbb{R}^{N \times N}$, $B_i \in \mathbb{R}^{N \times m_b}$, and $C_i \in \mathbb{R}^{m_c \times N}$, with $m_b, m_c \ll N$. To obtain the optimal control of the above system, one has to minimize the quadratic cost function
$$J(x, u) = \sum_{k=1}^{\infty} \left( x_k^\top Q_i x_k + u_k^\top R_i u_k \right),$$
where $Q_i = C_i^\top C_i \geq 0$ and $R_i > 0$ are symmetric positive semi-definite (SPSD) and symmetric positive definite (SPD) matrices, respectively, representing the state and control weighting terms in the cost function. The corresponding optimal feedback control is given by
$$u_k = -\left( R_i + B_i^\top \mathcal{E}_i(X) B_i \right)^{-1} B_i^\top \mathcal{E}_i(X) A_i x_k,$$
where $\mathcal{E}_i(X) = \sum_{j=1}^{m} p_{ij} X_j \in \mathbb{R}^{N \times N}$ is defined as a convex combination of the $X_j$ via the stochastic weights $p_{ij}$. To achieve the optimal control $u_k$, one has to solve the coupled discrete-time algebraic Riccati equation (CDARE)
$$\mathcal{DC}_i(X) = -X_i + A_i^\top \mathcal{E}_i(X) A_i + Q_i - A_i^\top \mathcal{E}_i(X) B_i \left( R_i + B_i^\top \mathcal{E}_i(X) B_i \right)^{-1} B_i^\top \mathcal{E}_i(X) A_i = 0, \qquad (1)$$
for $i = 1, \dots, m$.
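To make the pieces of (1) concrete, the following minimal NumPy sketch (our own illustration, not the authors' code; the function names and the random test data are hypothetical) evaluates the coupling operator $\mathcal{E}_i(X)$ and the residual $\mathcal{DC}_i(X)$ on a small dense toy problem:

    import numpy as np

    def E(i, X, P):
        """Coupling operator E_i(X) = sum_j p_{ij} X_j."""
        return sum(P[i, j] * X[j] for j in range(len(X)))

    def cdare_residual(i, X, A, B, Q, R, P):
        """Residual DC_i(X) of the coupled Riccati equation (1)."""
        Ei = E(i, X, P)
        S = R[i] + B[i].T @ Ei @ B[i]               # small m_b x m_b matrix
        G = np.linalg.solve(S, B[i].T @ Ei @ A[i])  # optimal feedback gain
        return -X[i] + A[i].T @ Ei @ A[i] + Q[i] - A[i].T @ Ei @ B[i] @ G

    # hypothetical toy data: m = 2 modes, N = 4 states, one input per mode
    rng = np.random.default_rng(0)
    m, N = 2, 4
    A = [0.4 * rng.standard_normal((N, N)) for _ in range(m)]
    B = [rng.standard_normal((N, 1)) for _ in range(m)]
    C = [rng.standard_normal((1, N)) for _ in range(m)]
    Q = [c.T @ c for c in C]                    # SPSD, rank one
    R = [np.eye(1) for _ in range(m)]
    P = np.array([[0.3, 0.7], [0.6, 0.4]])      # row-stochastic weights
    X = [np.zeros((N, N)) for _ in range(m)]
    print([float(np.linalg.norm(cdare_residual(i, X, A, B, Q, R, P)))
           for i in range(m)])

At $X = 0$ the residual reduces to $Q_i$, so the printed norms equal $\|Q_i\|_F$.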
Among the various approaches developed for solving the CDARE (1), iterative methods remain prominent. Commonly used iterations typically reformulate the problem either as optimization-based methods [4] or as fixed-point iterations [5,6], drawing on classical linear–quadratic regulator theory. Enhanced variants based on linear matrix inequality (LMI) formulations have also been proposed [7,8]. However, such methods often exhibit slow convergence and limited numerical precision, particularly in large-scale or tightly coupled scenarios. Fixed-point schemes address the CDARE directly by recasting it as a fixed-point problem. Notably, Ivanov [9] introduced two distinct fixed-point iterations, which were later accelerated by extrapolation techniques that replace the previous iterate with information from the current one, thereby markedly improving the convergence rate.
To circumvent the computational cost associated with explicit matrix inversion, inverse-free fixed-point methods have been introduced. These schemes are inspired by the Schulz iteration [10], which recasts matrix inversion as a Newton-type process that replaces inversions with matrix multiplications [11,12,13]. While inverse-free methods demonstrate notable computational efficiency in practice, their theoretical convergence is generally only linear.
To further accelerate convergence, Newton-type methods have garnered attention. For continuous-time coupled Riccati equations, Feng and Chu [14] proposed a Newton-based approach that extends the block-diagonal structure originally developed in [15] and generalizes the pseudo-Newton strategies given in [13]. For the discrete-time case in (1), Newton-type iteration was explored in [16], though the convergence analysis therein is restricted to settings where each weighting matrix Q i is symmetric positive definite. In practical large-scale systems, however, the output matrix C i often has low rank, rendering the associated Q i only positive semi-definite. In such a case, the above Newton variants, including the inverse-free version [17], become inapplicable. For other methods applicable to similar types of equations, readers may also refer to [18,19].
The key to applying Newton's method to the large-scale CDARE (1) lies in efficiently computing the solution of large-scale coupled Stein equations (CSEs). Recently, a novel operator Smith algorithm (OSA) was introduced for solving CSEs with SPSD constant matrices [20]. This method interprets the coupling unknowns as operator-valued expressions and employs a doubling strategy to accelerate the iteration, demonstrating well-behaved numerical performance on large-scale problems. Another approach to handling large-scale CSEs with low-rank structure is to represent the low-rank matrices in the HODLR structured form [21,22]. However, after operations with sparse matrices, the HODLR representation must be recompressed; otherwise the resulting dense matrices are not suited to large-scale computation.
Inspired by the development of OSA in [20], we propose an operator Newton method (ONM) tailored to solving large-scale CDAREs (1). The main contributions are summarized as follows:
  • We develop an operator Newton iteration scheme grounded in the structure of the OSA for solving the CDARE (1) and rigorously establish its convergence as well as its convergence rate. Crucially, unlike existing inverse-free schemes [11,12,13] that require an invertible $Q_i$ to initiate the iteration, our method allows an SPSD initial $Q_i$.
  • A low-rank variant of the operator Newton method is constructed to address the large-scale system. When the matrix Q i admits low-rank representation, the proposed method ensures that the rank of the initial residual remains fixed across iterations, effectively mitigating rank inflation.
  • We employ a doubling-based operator formulation for Newton's subproblem, i.e., the coupled Stein equations, and embed the truncation–compression (TC) technique to control the column growth of the low-rank factors. This enables an efficient low-rank approximation to the solution without compromising numerical stability.
  • We propose a scalable residual evaluation strategy for large-scale CDARE and validate the proposed method on practical problems from engineering applications. Numerical experiments demonstrate that, for a comparable level of residual accuracy, the presented operator Newton method significantly reduces CPU time relative to the standard Newton’s method with the incorporation of the HODLR structure [21,22].
The paper is structured as follows. Section 2 reviews the operator Smith algorithm and presents several lemmas required for constructing the operator Newton method. Section 3 introduces the iterative scheme of the operator Newton method for solving the CDAREs and establishes the corresponding convergence and convergence-rate theorems. Section 4 develops a low-rank variant of the operator Newton method tailored to large-scale problems with low-rank structures; by incorporating the truncation and compression technique, the scheme effectively controls the growth of the iterative matrix sequence, and a detailed analysis of the computational complexity per iteration is also provided. Section 5 demonstrates the effectiveness of the proposed operator Newton method in solving large-scale practical CDAREs.

2. Preliminaries

In this section, we begin by reviewing the iterative framework of the operator Smith algorithm (OSA), which serves as the foundation for solving the subproblems. We then establish several useful lemmas that underpin the convergence analysis of the proposed method.

2.1. Operator Smith Algorithm for Coupled Discrete-Time Stein Equations

The operator Smith algorithm [20] for solving the coupled discrete-time Stein equations
$$X_i - A_i^\top \mathcal{E}_i(X) A_i - W_i = 0, \quad i = 1, \dots, m, \qquad (2)$$
is given by the iterative scheme
$$X_i^{(k+1)} = X_i^{(k)} + \mathcal{F}_i^{(k)}(X^{(k)}), \quad X_i^{(0)} = W_i, \quad k = 0, 1, 2, \dots, \qquad (3)$$
where the operator $\mathcal{F}_i^{(k)}$ at the k-th iteration is defined as
$$\mathcal{F}_i^{(k)}(\cdot) = A_i^\top \mathcal{E}_i\Big( A^\top \mathcal{E}\big( \cdots A^\top \mathcal{E}(\cdot) A \cdots \big) A \Big) A_i := A_i^\top \mathcal{E}_i\Big( (A^\top \mathcal{E})^{2^k - 1}(\cdot) A^{2^k - 1} \Big) A_i,$$
with $2^k - 1$ nested applications, and with $A^\top \mathcal{E}(\cdot) A$ denoting the m-tuple whose i-th component $A_i^\top \mathcal{E}_i(\cdot) A_i = \sum_{j=1}^m p_{ij} A_i^\top (\cdot)_j A_i$ is a linear combination over j. After the k-th iteration, the expression of the iteration matrix is provided by the following theorem.
Theorem 1
([20]). Let $\mathcal{E}_i(W) = \sum_{s=1}^m p_{is} W_s$. Then the k-th iterate $X_i^{(k)}$ produced by (3) admits the representation
$$X_i^{(k)} = W_i + A_i^\top \mathcal{E}_i\Big( \sum_{j=0}^{2^k - 2} (A^\top \mathcal{E})^j (W) A^j \Big) A_i, \quad i = 1, \dots, m, \qquad (4)$$
and the solution to (2) is given by
$$X_i^* = W_i + A_i^\top \mathcal{E}_i\Big( \sum_{j=0}^{\infty} (A^\top \mathcal{E})^j (W) A^j \Big) A_i. \qquad (5)$$
When each A i is d-stable, the following theorem establishes the quadratic convergence.
Theorem 2
([20]). Let $\rho := \max_{1 \le i \le m} \rho(A_i)$ and $p := \max_{1 \le i, j \le m} p_{ij}$ be such that $m p \rho^m < 1$. Then, for any matrix norm, the following bound holds:
$$\| X_i^{(k)} - X_i^* \| \leq \frac{(m p \rho^m)^{2^k}}{1 - m p \rho^m} \| W \|, \qquad (6)$$
where $\| W \| := \max_{1 \le i \le m} \| W_i \|$.
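The doubling mechanism of (3)–(6) can be mimicked on tiny dense problems by vectorizing the m-tuple, so that one Smith doubling step becomes a plain matrix squaring. The sketch below is our own illustration under that assumption (the Kronecker form is viable only for very small N and is not the factored implementation of [20]); all names are hypothetical:

    import numpy as np

    def osa_coupled_stein(A, W, P, tol=1e-13, max_iter=30):
        """Solve X_i - A_i^T E_i(X) A_i = W_i via Smith-type doubling."""
        m, N = len(A), A[0].shape[0]
        # Stacked operator M: block (i, j) realizes p_ij * A_i^T (.)_j A_i
        M = np.zeros((m * N * N, m * N * N))
        for i in range(m):
            K = np.kron(A[i].T, A[i].T)      # vec(A_i^T Y A_i) = K vec(Y)
            for j in range(m):
                M[i*N*N:(i+1)*N*N, j*N*N:(j+1)*N*N] = P[i, j] * K
        x = np.concatenate([Wi.ravel(order='F') for Wi in W])
        Mk = M.copy()                        # holds M^(2^k)
        for _ in range(max_iter):
            dx = Mk @ x                      # stacked F_i^(k)(X^(k))
            x, Mk = x + dx, Mk @ Mk          # doubling: M^(2^k) -> M^(2^(k+1))
            if np.linalg.norm(dx) < tol * np.linalg.norm(x):
                break
        return [x[i*N*N:(i+1)*N*N].reshape(N, N, order='F') for i in range(m)]

    # toy check: the output satisfies X_i - A_i^T E_i(X) A_i = W_i
    rng = np.random.default_rng(1)
    m, N = 2, 3
    A = [0.3 * rng.standard_normal((N, N)) for _ in range(m)]
    Wf = [rng.standard_normal((N, 1)) for _ in range(m)]
    W = [w @ w.T for w in Wf]
    P = np.array([[0.2, 0.8], [0.5, 0.5]])
    X = osa_coupled_stein(A, W, P)
    E = lambda i: sum(P[i, j] * X[j] for j in range(m))
    print(max(np.linalg.norm(X[i] - A[i].T @ E(i) @ A[i] - W[i]) for i in range(m)))

Because the update operator is squared at every step, the number of accumulated powers doubles per iteration, matching the $2^k$ exponent in the bound (6).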

2.2. Some Lemmas

In this subsection, we introduce some lemmas that are useful for establishing the convergence of the operator Newton method.
Lemma 1.
Let $A_i$ be d-stable, and let $\mathcal{L}_i(S) = \sum_{j=1}^m c_{ij} S_j$ be a linear operator with $c_{ij} > 0$. Define
$$V_i = S_i - A_i^\top \mathcal{L}_i(S) A_i. \qquad (7)$$
Then $S_i \geq 0$ if $V_i \geq 0$ for all $i = 1, \dots, m$.
Proof. 
From (7), it follows that
$$S_i = V_i + A_i^\top \mathcal{L}_i\Big( \sum_{j=0}^{\infty} (A^\top \mathcal{L})^j (V) A^j \Big) A_i.$$
Since $c_{ij} > 0$ for all $i, j = 1, \dots, m$, SPSD matrices $V_i$ imply that $S_i \geq 0$.    □
Lemma 2.
Let A and B be SPSD matrices. If $I + AB$ is nonsingular, then
$$\| (I + AB)^{-1} \|_2 \leq 1.$$
Proof. 
Let $y = (I + AB)^{-1} x$ for an arbitrary nonzero vector x. Then $x = y + ABy$, and hence
$$\| x \|_2^2 = y^\top (I + AB)^\top (I + AB) y = y^\top y + y^\top (AB + BA) y + y^\top B A^2 B y \geq \| y \|_2^2.$$
Therefore,
$$\| (I + AB)^{-1} \|_2^2 = \max_{x \neq 0} \frac{\| y \|_2^2}{\| x \|_2^2} \leq 1.$$
   □
Lemma 3.
The Sherman–Morrison–Woodbury formula [23] is
$$\left( A + UCV \right)^{-1} = A^{-1} - A^{-1} U \left( C^{-1} + V A^{-1} U \right)^{-1} V A^{-1},$$
where $A \in \mathbb{R}^{n \times n}$, $U \in \mathbb{R}^{n \times k}$, $C \in \mathbb{R}^{k \times k}$, and $V \in \mathbb{R}^{k \times n}$ are conformable matrices with A and C nonsingular.
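As a quick numerical sanity check of Lemma 3 (our own illustration; the paper uses the formula analytically, e.g., to rewrite the closed-loop matrix $\hat{A}_i$ below), both sides of the identity can be compared in floating point:

    import numpy as np

    rng = np.random.default_rng(2)
    n, k = 6, 2
    A = np.eye(n) + 0.1 * rng.standard_normal((n, n))
    U = rng.standard_normal((n, k))
    C = np.eye(k)
    V = rng.standard_normal((k, n))

    Ai = np.linalg.inv(A)
    lhs = np.linalg.inv(A + U @ C @ V)
    rhs = Ai - Ai @ U @ np.linalg.inv(np.linalg.inv(C) + V @ Ai @ U) @ V @ Ai
    print(np.linalg.norm(lhs - rhs))   # ~1e-15: the two sides agree

The practical payoff is that an n × n inverse is traded for a k × k one when $k \ll n$.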

3. Operator Newton Method

In this section, we first introduce the iterative framework of the operator Newton method (ONM) for solving the CDARE (1), and then analyze its convergence.

3.1. Iteration Format

The Fréchet derivative of the nonlinear operator $\mathcal{DC}_i(X)$ in (1) corresponds to the linear part of the increment $\mathcal{DC}_i(X + H) - \mathcal{DC}_i(X)$ and is given by
$$\mathcal{DC}_i'(X)(H) = -H_i + A_i^\top \left( I + \mathcal{E}_i(X) B_i R_i^{-1} B_i^\top \right)^{-1} \mathcal{E}_i(H) \left( I + B_i R_i^{-1} B_i^\top \mathcal{E}_i(X) \right)^{-1} A_i,$$
assuming all matrix inverses exist.
Given $X_i^{(0)}$, Newton's method based on the above derivative can be written as
$$\mathcal{DC}_i'(X^{(k)}) \left( X^{(k+1)} - X^{(k)} \right) = -\mathcal{DC}_i(X^{(k)}),$$
which yields a sequence of linear subproblems of the form
$$X_i^{(k+1)} - (\hat{A}_i^{(k)})^\top \mathcal{E}_i(X^{(k+1)}) \hat{A}_i^{(k)} = Q_i + L_i^{(k)} R_i (L_i^{(k)})^\top, \quad i = 1, \dots, m, \qquad (8)$$
where
$$\hat{A}_i^{(k)} = A_i - B_i (L_i^{(k)})^\top, \quad L_i^{(k)} = A_i^\top \mathcal{E}_i(X^{(k)}) B_i \left( R_i + B_i^\top \mathcal{E}_i(X^{(k)}) B_i \right)^{-1}, \qquad (9)$$
and
$$\mathcal{E}_i(X^{(k)}) = \sum_{s=1}^m p_{is} X_s^{(k)}.$$
Each subproblem is thus a set of coupled Stein equations (8). When these are solved with the OSA (3), and under the conditions specified in Theorem 3, the convergence of each inner iteration is guaranteed to be quadratic by Theorem 2. The resulting iterative scheme is referred to as the ONM.
Remark 1.
In many large-scale applications, the coefficient matrix A i ( i = 1 , , m ) is large and sparse with dimensions N × N , while the control matrix B i is tall and skinny, meaning it has significantly fewer columns than rows. Consequently, the gain matrix L i ( k ) computed during the ONM also exhibits a tall and skinny structure.
An advantage of the ONM emerges when the right-hand side matrix $Q_i$ in (8) admits a low-rank factorization of the form $Q_i = C_i^\top C_i$, where the number of rows in $C_i$ is far less than N. In such a case, the column space of the right-hand side of (8) remains low-dimensional throughout the iteration, and the generated iterates tend to maintain a low-rank structure. This allows for efficient computation of the approximate solution with significantly reduced storage cost.
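The outer loop of the ONM can be summarized in a compact dense sketch (our own simplified realization of (8)–(9) on toy data; for brevity the subproblem is solved here by a plain Smith fixed-point sweep, whereas the paper uses the doubling OSA, and all names are hypothetical):

    import numpy as np

    def solve_coupled_stein(Ah, RHS, P, tol=1e-14, iters=5000):
        """Fixed-point iteration for X_i = Ah_i^T E_i(X) Ah_i + RHS_i."""
        m = len(Ah)
        X = [r.copy() for r in RHS]
        for _ in range(iters):
            E = [sum(P[i, j] * X[j] for j in range(m)) for i in range(m)]
            Xn = [Ah[i].T @ E[i] @ Ah[i] + RHS[i] for i in range(m)]
            if max(np.linalg.norm(Xn[i] - X[i]) for i in range(m)) < tol:
                return Xn
            X = Xn
        return X

    def onm(A, B, Q, R, P, newton_steps=15, tol=1e-12):
        """Operator Newton iteration: one Stein subproblem (8) per step."""
        m = len(A)
        X = [np.zeros_like(Q[i]) for i in range(m)]
        for _ in range(newton_steps):
            E = [sum(P[i, j] * X[j] for j in range(m)) for i in range(m)]
            L = [A[i].T @ E[i] @ B[i] @
                 np.linalg.inv(R[i] + B[i].T @ E[i] @ B[i]) for i in range(m)]
            Ah = [A[i] - B[i] @ L[i].T for i in range(m)]    # closed loop, (9)
            RHS = [Q[i] + L[i] @ R[i] @ L[i].T for i in range(m)]
            Xn = solve_coupled_stein(Ah, RHS, P)             # subproblem (8)
            if max(np.linalg.norm(Xn[i] - X[i]) for i in range(m)) < tol:
                return Xn
            X = Xn
        return X

On toy data such as that in the first sketch, onm(A, B, Q, R, P) drives the residual $\mathcal{DC}_i(X)$ to roughly machine precision within a handful of outer steps.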

3.2. Convergence and Convergence Rate

The following theorem establishes the monotone convergence of the iteration sequence $\{X_i^{(k)}\}$ generated by the ONM (8) under some mild conditions.
Theorem 3.
Let $Q_i \geq 0$ and $R_i > 0$ for all $i \leq m$. Assume that there exists an SPD matrix $\hat{X}_i$ such that $\mathcal{DC}_i(\hat{X}) \geq 0$ and that the Euclidean norm of $A_i$ is less than one for each i. Then the sequence $\{X_i^{(k)}\}_{k=1}^{\infty}$ generated by (8) satisfies
1. $X_i^{(k)} \geq X_i^{(k+1)} \geq \hat{X}_i$, $k \geq 1$;
2. $\sigma(\hat{A}_i^{(k)}) \subset \mathbb{D}_{<}$, $k \geq 1$.
Consequently, there is a solution $X_i^* \geq 0$ to $\mathcal{DC}_i(X) = 0$ such that $\sigma(\hat{A}_i^*) \subset \mathbb{D}_{\leq}$ for each i, where
$$\hat{A}_i^* = \left( I - B_i \left( R_i + B_i^\top \mathcal{E}_i(X^*) B_i \right)^{-1} B_i^\top \mathcal{E}_i(X^*) \right) A_i.$$
Proof. 
We prove items 1–2 by induction. Let $X_i^{(1)}$ be the solution of
$$X_i^{(1)} - A_i^\top \mathcal{E}_i(X^{(1)}) A_i = Q_i$$
for $i = 1, \dots, m$. As $A_i$ is d-stable and $Q_i \geq 0$, it follows from Lemma 1 that $X_i^{(1)} \geq 0$ and $\mathcal{E}_i(X^{(1)}) \geq 0$. Moreover, a direct calculation shows that
$$\left( X_i^{(1)} - \hat{X}_i \right) - A_i^\top \left( \mathcal{E}_i(X^{(1)}) - \mathcal{E}_i(\hat{X}) \right) A_i = \mathcal{DC}_i(\hat{X}) + \hat{E}_i^\top \hat{R}_i^{-1} \hat{E}_i,$$
where
$$\hat{R}_i = R_i + B_i^\top \mathcal{E}_i(\hat{X}) B_i, \quad \hat{E}_i = B_i^\top \mathcal{E}_i(\hat{X}) A_i.$$
Using the assumption $R_i > 0$ and Lemma 1 again, one has $X_i^{(1)} \geq \hat{X}_i$ for $i = 1, \dots, m$.
The SMW formula (Lemma 3) directly yields
$$\hat{A}_i^{(1)} = A_i - B_i (L_i^{(1)})^\top = \left( I - B_i \left( R_i + B_i^\top \mathcal{E}_i(X^{(1)}) B_i \right)^{-1} B_i^\top \mathcal{E}_i(X^{(1)}) \right) A_i = \left( I + B_i R_i^{-1} B_i^\top \mathcal{E}_i(X^{(1)}) \right)^{-1} A_i$$
with
$$(L_i^{(1)})^\top = \left( R_i + B_i^\top \mathcal{E}_i(X^{(1)}) B_i \right)^{-1} B_i^\top \mathcal{E}_i(X^{(1)}) A_i := (\hat{R}_i^{(1)})^{-1} B_i^\top \mathcal{E}_i(X^{(1)}) A_i.$$
It then follows from Lemma 2 that
$$\rho(\hat{A}_i^{(1)}) \leq \left\| \left( I + B_i R_i^{-1} B_i^\top \mathcal{E}_i(X^{(1)}) \right)^{-1} \right\|_2 \| A_i \|_2 < 1 \qquad (10)$$
for $i = 1, \dots, m$.
Now, with $X_i^{(1)}$, $\hat{A}_i^{(1)}$, and $L_i^{(1)}$ available, we construct sequences $\{X_i^{(l)}\}_{l=1}^{k}$, $\{\hat{A}_i^{(l)}\}_{l=1}^{k}$, and $\{L_i^{(l)}\}_{l=1}^{k}$ satisfying the induction assumptions
$$X_i^{(1)} \geq \cdots \geq X_i^{(k)} \geq \hat{X}_i$$
and
$$\sigma(\hat{A}_i^{(l)}) = \sigma\left( A_i - B_i (L_i^{(l)})^\top \right) \subset \mathbb{D}_{<}$$
for $i \leq m$ and $l \leq k$. Here,
$$(L_i^{(l)})^\top = \left( R_i + B_i^\top \mathcal{E}_i(X^{(l)}) B_i \right)^{-1} B_i^\top \mathcal{E}_i(X^{(l)}) A_i := (\hat{R}_i^{(l)})^{-1} B_i^\top \mathcal{E}_i(X^{(l)}) A_i. \qquad (11)$$
We now show that these assumptions also hold for k + 1.
By using Newton's iteration, one has
$$\begin{aligned}
& \left( X_i^{(k)} - X_i^{(k+1)} \right) - (\hat{A}_i^{(k)})^\top \left( \mathcal{E}_i(X^{(k)}) - \mathcal{E}_i(X^{(k+1)}) \right) \hat{A}_i^{(k)} \\
&\quad = \left( X_i^{(k)} - (\hat{A}_i^{(k)})^\top \mathcal{E}_i(X^{(k)}) \hat{A}_i^{(k)} \right) - \left( X_i^{(k+1)} - (\hat{A}_i^{(k)})^\top \mathcal{E}_i(X^{(k+1)}) \hat{A}_i^{(k)} \right) \\
&\quad = \left( X_i^{(k)} - (\hat{A}_i^{(k-1)})^\top \mathcal{E}_i(X^{(k)}) \hat{A}_i^{(k-1)} \right) + (\hat{A}_i^{(k-1)})^\top \mathcal{E}_i(X^{(k)}) \hat{A}_i^{(k-1)} - (\hat{A}_i^{(k)})^\top \mathcal{E}_i(X^{(k)}) \hat{A}_i^{(k)} - L_i^{(k)} R_i (L_i^{(k)})^\top - Q_i \\
&\quad = L_i^{(k-1)} R_i (L_i^{(k-1)})^\top - L_i^{(k)} R_i (L_i^{(k)})^\top + (\hat{A}_i^{(k-1)})^\top \mathcal{E}_i(X^{(k)}) \hat{A}_i^{(k-1)} - (\hat{A}_i^{(k)})^\top \mathcal{E}_i(X^{(k)}) \hat{A}_i^{(k)}.
\end{aligned}$$
By replacing $\hat{A}_i^{(k-1)}$ and $\hat{A}_i^{(k)}$ with $A_i - B_i (L_i^{(k-1)})^\top$ and $A_i - B_i (L_i^{(k)})^\top$, respectively, and noting $B_i^\top \mathcal{E}_i(X^{(k)}) A_i = \hat{R}_i^{(k)} (L_i^{(k)})^\top$, the above expression becomes
$$\begin{aligned}
& L_i^{(k-1)} \hat{R}_i^{(k)} (L_i^{(k-1)})^\top - L_i^{(k)} \hat{R}_i^{(k)} (L_i^{(k)})^\top - \left( L_i^{(k-1)} - L_i^{(k)} \right) B_i^\top \mathcal{E}_i(X^{(k)}) A_i - A_i^\top \mathcal{E}_i(X^{(k)}) B_i \left( L_i^{(k-1)} - L_i^{(k)} \right)^\top \\
&\quad = L_i^{(k-1)} \hat{R}_i^{(k)} (L_i^{(k-1)})^\top - L_i^{(k)} \hat{R}_i^{(k)} (L_i^{(k)})^\top - \left( L_i^{(k-1)} - L_i^{(k)} \right) \hat{R}_i^{(k)} (L_i^{(k)})^\top - L_i^{(k)} \hat{R}_i^{(k)} \left( L_i^{(k-1)} - L_i^{(k)} \right)^\top \\
&\quad = \left( L_i^{(k-1)} - L_i^{(k)} \right) \hat{R}_i^{(k)} \left( L_i^{(k-1)} - L_i^{(k)} \right)^\top. \qquad (12)
\end{aligned}$$
Since $\hat{A}_i^{(k)}$ is d-stable and $\left( L_i^{(k-1)} - L_i^{(k)} \right) \hat{R}_i^{(k)} \left( L_i^{(k-1)} - L_i^{(k)} \right)^\top \geq 0$ (as $R_i > 0$), it follows from Lemma 1 that $X_i^{(k)} \geq X_i^{(k+1)}$ for $i = 1, \dots, m$.
We next show that $X_i^{(k+1)} \geq \hat{X}_i$ holds for all $i \leq m$. In fact, one has
$$\begin{aligned}
& \hat{X}_i - (\hat{A}_i^{(k+1)})^\top \mathcal{E}_i(\hat{X}) \hat{A}_i^{(k+1)} \\
&\quad = \hat{X}_i - A_i^\top \mathcal{E}_i(\hat{X}) A_i + L_i^{(k+1)} \hat{E}_i + \left( L_i^{(k+1)} \hat{E}_i \right)^\top - L_i^{(k+1)} \left( \hat{R}_i - R_i \right) (L_i^{(k+1)})^\top \\
&\quad = -\mathcal{DC}_i(\hat{X}) + Q_i - \left( \hat{R}_i (L_i^{(k+1)})^\top - \hat{E}_i \right)^\top \hat{R}_i^{-1} \left( \hat{R}_i (L_i^{(k+1)})^\top - \hat{E}_i \right) + L_i^{(k+1)} R_i (L_i^{(k+1)})^\top. \qquad (13)
\end{aligned}$$
On the other hand, an argument almost identical to that for (10) shows that $\rho(\hat{A}_i^{(k+1)}) < 1$ for all $i \leq m$. Now Newton's iteration indicates that
$$\begin{aligned}
& X_i^{(k+1)} - (\hat{A}_i^{(k+1)})^\top \mathcal{E}_i(X^{(k+1)}) \hat{A}_i^{(k+1)} \\
&\quad = X_i^{(k+1)} - \left( \hat{A}_i^{(k)} + B_i \left( L_i^{(k)} - L_i^{(k+1)} \right)^\top \right)^\top \mathcal{E}_i(X^{(k+1)}) \left( \hat{A}_i^{(k)} + B_i \left( L_i^{(k)} - L_i^{(k+1)} \right)^\top \right) \\
&\quad = X_i^{(k+1)} - (\hat{A}_i^{(k)})^\top \mathcal{E}_i(X^{(k+1)}) \hat{A}_i^{(k)} - \left( L_i^{(k)} - L_i^{(k+1)} \right) B_i^\top \mathcal{E}_i(X^{(k+1)}) \hat{A}_i^{(k)} - (\hat{A}_i^{(k)})^\top \mathcal{E}_i(X^{(k+1)}) B_i \left( L_i^{(k)} - L_i^{(k+1)} \right)^\top - \left( L_i^{(k)} - L_i^{(k+1)} \right) B_i^\top \mathcal{E}_i(X^{(k+1)}) B_i \left( L_i^{(k)} - L_i^{(k+1)} \right)^\top \\
&\quad = Q_i + L_i^{(k)} R_i (L_i^{(k)})^\top - \left( L_i^{(k)} - L_i^{(k+1)} \right) B_i^\top \mathcal{E}_i(X^{(k+1)}) \hat{A}_i^{(k)} - (\hat{A}_i^{(k)})^\top \mathcal{E}_i(X^{(k+1)}) B_i \left( L_i^{(k)} - L_i^{(k+1)} \right)^\top - \left( L_i^{(k)} - L_i^{(k+1)} \right) B_i^\top \mathcal{E}_i(X^{(k+1)}) B_i \left( L_i^{(k)} - L_i^{(k+1)} \right)^\top. \qquad (14)
\end{aligned}$$
Subtracting (13) from (14) and collecting terms with the equality $B_i^\top \mathcal{E}_i(X^{(k+1)}) A_i = \hat{R}_i^{(k+1)} (L_i^{(k+1)})^\top$, one has
$$\begin{aligned}
& \left( X_i^{(k+1)} - \hat{X}_i \right) - (\hat{A}_i^{(k+1)})^\top \left( \mathcal{E}_i(X^{(k+1)}) - \mathcal{E}_i(\hat{X}) \right) \hat{A}_i^{(k+1)} \\
&\quad = \mathcal{DC}_i(\hat{X}) + \left( \hat{R}_i (L_i^{(k+1)})^\top - \hat{E}_i \right)^\top \hat{R}_i^{-1} \left( \hat{R}_i (L_i^{(k+1)})^\top - \hat{E}_i \right) + \left( L_i^{(k+1)} - L_i^{(k)} \right) \left( R_i + B_i^\top \mathcal{E}_i(X^{(k+1)}) B_i \right) \left( L_i^{(k+1)} - L_i^{(k)} \right)^\top.
\end{aligned}$$
Since $\mathcal{DC}_i(\hat{X}) \geq 0$ and $X^{(k+1)} \geq 0$ (which follows from Newton's iteration (8) and Lemma 1), Lemma 1 yields
$$X_i^{(k+1)} \geq \hat{X}_i$$
for all $i \leq m$.
We have now obtained a non-increasing sequence $\{X_i^{(k)}\}$ of SPSD matrices bounded below by $\hat{X}_i$. Then
$$X_i^* := \lim_{k \to \infty} X_i^{(k)}$$
exists and is an SPSD matrix satisfying $X_i^* \geq \hat{X}_i$. Moreover, $R_i + B_i^\top X_i^* B_i \geq R_i + B_i^\top \hat{X}_i B_i > 0$. Taking the limit in (8) as $k \to \infty$ and, for brevity, writing
$$L_i^* = A_i^\top \mathcal{E}_i(X^*) B_i \left( R_i + B_i^\top \mathcal{E}_i(X^*) B_i \right)^{-1}, \qquad (15)$$
one has
$$X_i^* - \left( A_i - B_i (L_i^*)^\top \right)^\top \mathcal{E}_i(X^*) \left( A_i - B_i (L_i^*)^\top \right) = L_i^* R_i (L_i^*)^\top + Q_i. \qquad (16)$$
By noting
$$L_i^* \left( R_i + B_i^\top \mathcal{E}_i(X^*) B_i \right) (L_i^*)^\top = L_i^* B_i^\top \mathcal{E}_i(X^*) A_i = A_i^\top \mathcal{E}_i(X^*) B_i (L_i^*)^\top,$$
(16) is equivalent to $\mathcal{DC}_i(X^*) = 0$. Finally, since $\hat{A}_i^{(k)}$ is d-stable for all $i \leq m$, the limit satisfies
$$\sigma(\hat{A}_i^*) = \sigma\left( A_i - B_i (L_i^*)^\top \right) \subset \mathbb{D}_{\leq}.$$
   □
Corollary 1.
Given the assumptions in Theorem 3, the sequence $\{X_i^{(k)}\}_{k=1}^{\infty}$ generated by (8) also satisfies $\mathcal{DC}_i(X^{(k)}) \leq 0$ for all $i \leq m$.
Proof. 
We have shown at the k-th step (see (12)) that
$$\left( X_i^{(k)} - X_i^{(k+1)} \right) - (\hat{A}_i^{(k)})^\top \left( \mathcal{E}_i(X^{(k)}) - \mathcal{E}_i(X^{(k+1)}) \right) \hat{A}_i^{(k)} \geq 0.$$
By using Newton's iteration, this is equivalent to
$$\begin{aligned}
& X_i^{(k)} - \left( A_i - B_i (L_i^{(k)})^\top \right)^\top \mathcal{E}_i(X^{(k)}) \left( A_i - B_i (L_i^{(k)})^\top \right) - L_i^{(k)} R_i (L_i^{(k)})^\top - Q_i \\
&\quad = X_i^{(k)} - A_i^\top \mathcal{E}_i(X^{(k)}) A_i + 2 A_i^\top \mathcal{E}_i(X^{(k)}) B_i (\hat{R}_i^{(k)})^{-1} B_i^\top \mathcal{E}_i(X^{(k)}) A_i - L_i^{(k)} \hat{R}_i^{(k)} (L_i^{(k)})^\top - Q_i \\
&\quad = -\mathcal{DC}_i(X^{(k)}) \geq 0.
\end{aligned}$$
So $\mathcal{DC}_i(X^{(k)}) \leq 0$ for $i \leq m$.    □
The following theorem describes the convergence rate of the operator Newton method.
Theorem 4.
Given the assumptions in Theorem 3, suppose that the sequence $\{X_i^{(k)}\}_{k=0}^{\infty}$ of SPSD matrices generated by (8) converges to the solution $X_i^*$ for $i \leq m$. Let
$$\delta X_i^{(k)} := X_i^{(k)} - X_i^* \quad \text{and} \quad \| \delta X^{(k)} \| := \max_{i \leq m} \| X_i^{(k)} - X_i^* \|.$$
Let $L_i^*$ be defined by (15). If $\hat{A}_i^* = A_i - B_i (L_i^*)^\top$ is d-stable, then there exists a constant $c > 0$ such that
$$\| \delta X^{(k+1)} \| \leq c \| \delta X^{(k)} \|^2.$$
Proof. 
Let
$$\hat{R}_i^* = R_i + B_i^\top \mathcal{E}_i(X^*) B_i, \quad \hat{R}_i^{(k)} = R_i + B_i^\top \mathcal{E}_i(X^{(k)}) B_i,$$
and let $L_i^{(k)}$ and $L_i^*$ be defined in (11) and (15), respectively. Then
$$\begin{aligned}
L_i^{(k)} - L_i^* &= A_i^\top \mathcal{E}_i(X^{(k)}) B_i (\hat{R}_i^{(k)})^{-1} - A_i^\top \mathcal{E}_i(X^*) B_i (\hat{R}_i^*)^{-1} \\
&= A_i^\top \mathcal{E}_i(X^{(k)} - X^*) B_i (\hat{R}_i^{(k)})^{-1} + A_i^\top \mathcal{E}_i(X^*) B_i (\hat{R}_i^*)^{-1} B_i^\top \mathcal{E}_i(X^* - X^{(k)}) B_i (\hat{R}_i^{(k)})^{-1}.
\end{aligned}$$
Since $\rho(A_i) < 1$ and $X_i^{(k)} \geq X_i^*$ for all $i \leq m$ and $k = 1, 2, \dots$, one has
$$(\hat{R}_i^{(k)})^{-1} \leq (\hat{R}_i^*)^{-1},$$
and therefore, for some norm, one has
$$\| L_i^{(k)} - L_i^* \| \leq c_1 \Big\| \sum_{j=1}^m p_{ij} \left( X_j^{(k)} - X_j^* \right) \Big\| \leq c_1 \| \delta X^{(k)} \|, \qquad (17)$$
where the constant
$$c_1 = \| (\hat{R}_i^*)^{-1} \| \cdot \| B_i \| + \| (\hat{R}_i^*)^{-1} \|^2 \cdot \| B_i \|^2 \cdot \| B_i^\top \mathcal{E}_i(X^*) A_i \|.$$
By using Newton's iteration, it follows that
$$\begin{aligned}
& \left( X_i^{(k+1)} - X_i^* \right) - (\hat{A}_i^{(k)})^\top \left( \mathcal{E}_i(X^{(k+1)}) - \mathcal{E}_i(X^*) \right) \hat{A}_i^{(k)} \\
&\quad = \left( X_i^{(k+1)} - (\hat{A}_i^{(k)})^\top \mathcal{E}_i(X^{(k+1)}) \hat{A}_i^{(k)} \right) - \left( X_i^* - (\hat{A}_i^*)^\top \mathcal{E}_i(X^*) \hat{A}_i^* \right) + (\hat{A}_i^{(k)})^\top \mathcal{E}_i(X^*) \hat{A}_i^{(k)} - (\hat{A}_i^*)^\top \mathcal{E}_i(X^*) \hat{A}_i^* \\
&\quad = L_i^{(k)} R_i (L_i^{(k)})^\top - L_i^* R_i (L_i^*)^\top + (\hat{A}_i^{(k)})^\top \mathcal{E}_i(X^*) \hat{A}_i^{(k)} - (\hat{A}_i^*)^\top \mathcal{E}_i(X^*) \hat{A}_i^*. \qquad (18)
\end{aligned}$$
Note that $\hat{A}_i^{(k)} = A_i - B_i (L_i^{(k)})^\top$ and $\hat{A}_i^* = A_i - B_i (L_i^*)^\top$. Then (18) is equivalent to the coupled Stein equations
$$\delta X_i^{(k+1)} - (\hat{A}_i^{(k)})^\top \mathcal{E}_i(\delta X^{(k+1)}) \hat{A}_i^{(k)} = W_i^{(k)},$$
where
$$W_i^{(k)} = \left( L_i^{(k)} - L_i^* \right) \hat{R}_i^* \left( L_i^{(k)} - L_i^* \right)^\top. \qquad (19)$$
With the current $\hat{A}_i^{(k)}$ and $W_i^{(k)}$ at the k-th step, Theorem 1 indicates that the solution to the above equation can be written as
$$\delta X_i^{(k+1)} = W_i^{(k)} + (\hat{A}_i^{(k)})^\top \mathcal{E}_i\Big( \sum_{j=0}^{\infty} \left( (\hat{A}^{(k)})^\top \mathcal{E} \right)^j (W^{(k)}) (\hat{A}^{(k)})^j \Big) \hat{A}_i^{(k)}.$$
Taking norms term by term on the right-hand side, one then has
$$\| \delta X_i^{(k+1)} \| \leq \| W_i^{(k)} \| + \| \hat{A}_i^{(k)} \|^2 \Big( \sum_{j=1}^m p_{ij} \| W_j^{(k)} \| \Big) + \cdots \leq \| W^{(k)} \| \left( 1 + (\rho^{(k)})^2 + (\rho^{(k)})^4 + \cdots \right) = \frac{\| W^{(k)} \|}{1 - (\rho^{(k)})^2},$$
where $\rho^{(k)} = \max_{i \leq m} \| \hat{A}_i^{(k)} \|$ and $\| W^{(k)} \| = \max_{i \leq m} \| W_i^{(k)} \|$. Since $\hat{A}_i^{(k)}$ converges to $\hat{A}_i^*$ and $\hat{A}_i^*$ is d-stable, $\rho^{(k)}$ is uniformly bounded below one. This, together with (17) and (19), indicates that
$$\| \delta X^{(k+1)} \| = \max_{i \leq m} \| \delta X_i^{(k+1)} \| \leq c \| \delta X^{(k)} \|^2,$$
where the constant c does not depend on k.    □

4. Structured ONM for Large-Scale Problems

In many engineering applications, the matrix A i is typically sparse, while the matrix Q i often possesses a low-rank structure. Therefore, in this section, we adapt the ONM (8) to a low-rank framework that is particularly suitable for large-scale problems.

4.1. Structured Iteration Scheme

We first review the low-rank OSA (3) for the coupled discrete-time Stein equations (CDSEs)
$$X_i - A_i^\top \mathcal{E}_i(X) A_i - W_i = 0, \quad i = 1, \dots, m, \qquad (20)$$
where $W_i = L_i^W (L_i^W)^\top$.
Lemma 4
([20]). Let $X_i^{(0)} = W_i = L_i^W (L_i^W)^\top$ with $L_i^W \in \mathbb{R}^{N \times l_i}$ and $l_i \ll N$. The sequences $\mathcal{F}_i^{(k)}(X^{(k)})$ and $X_i^{(k)}$ generated by (3) are factorized in the following format:
$$X_i^{(k)} = L_{i,k}^W K_{i,k}^W (L_{i,k}^W)^\top, \quad \mathcal{F}_i^{(k)}(X^{(k)}) = L_{i,k}^{F_{2^k}} K_{i,k}^{F_{2^k}} (L_{i,k}^{F_{2^k}})^\top, \quad k = 0, 1, 2, \dots,$$
where
$$L_{i,k}^{F_{2^k}} = A_i^\top \left[ L_{1,k}^{F_{2^k - 1}}, \dots, L_{m,k}^{F_{2^k - 1}} \right], \quad K_{i,k}^{F_{2^k}} = p_{i,1} K_{1,k}^{F_{2^k - 1}} \oplus \cdots \oplus p_{i,m} K_{m,k}^{F_{2^k - 1}},$$
$$L_{i,k}^W = \left[ L_{i,k-1}^W, \ L_{i,k-1}^{F_{2^{k-1}}} \right], \quad K_{i,k}^W = K_{i,k-1}^W \oplus K_{i,k-1}^{F_{2^{k-1}}},$$
and
$$L_{i,k}^{F_1} = A_i^\top \left[ L_{1,k}^W, \dots, L_{m,k}^W \right], \quad K_{i,k}^{F_1} = p_{i,1} K_{1,k}^W \oplus \cdots \oplus p_{i,m} K_{m,k}^W, \quad K_{i,0}^W = I, \quad L_{i,0}^W = L_i^W.$$
The above lemma shows that the OSA iteration applied to the CDSEs has a low-rank format whenever $W_i$ is of low-rank form. For the operator Newton method, each iteration step requires solving low-rank coupled Stein equations of the form
$$X_i^{(k+1)} - (\hat{A}_i^{(k)})^\top \mathcal{E}_i(X^{(k+1)}) \hat{A}_i^{(k)} = \bar{Q}_i^{(k)} \bar{K}_i (\bar{Q}_i^{(k)})^\top, \quad i = 1, \dots, m, \qquad (21)$$
where
$$\bar{Q}_i^{(k)} = \left[ L_i^{(k)}, \ C_i^\top \right], \quad \bar{K}_i = R_i \oplus I, \quad \hat{A}_i^{(k)} = A_i - B_i (L_i^{(k)})^\top,$$
and $L_i^{(k)}$ is defined in (9).
We will demonstrate that when $Q_i = C_i^\top C_i$ with $C_i \in \mathbb{R}^{m_c \times N}$ ($m_c \ll N$) and the current iterate $X_i^{(k)}$ in (21) admits a low-rank approximation with accuracy ϵ, then so does the next iterate $X_i^{(k+1)}$.
Specifically, if $X^{(0)} = 0$, Equation (21) reduces to
$$X_i^{(1)} - A_i^\top \mathcal{E}_i(X^{(1)}) A_i = \bar{Q}_i^{(0)} \bar{K}_i (\bar{Q}_i^{(0)})^\top, \quad i = 1, \dots, m,$$
where $\bar{Q}_i^{(0)} = C_i^\top$ and $\bar{K}_i = I$. By Lemma 4, $X_i^{(1)}$ admits an approximate low-rank representation
$$X_i^{(1)} \approx Y_{i,l_1}^{(0)} = L_{i,l_1}^{Q}(0) \, K_{i,l_1}^{Q} \, (L_{i,l_1}^{Q}(0))^\top,$$
with accuracy ϵ (after $l_1$ iterations), where
$$\begin{aligned}
L_{i,l_1}^{Q}(0) &= \left[ L_{i,l_1-1}^{Q}(0), \ L_{i,l_1-1}^{F_{2^{l_1-1}}}(0) \right], \quad K_{i,l_1}^{Q} = K_{i,l_1-1}^{Q} \oplus K_{i,l_1-1}^{F_{2^{l_1-1}}}, \\
L_{i,l_1-1}^{F_{2^{l_1-1}}}(0) &= A_i^\top \left[ L_{1,l_1-1}^{F_{2^{l_1-1}-1}}(0), \dots, L_{m,l_1-1}^{F_{2^{l_1-1}-1}}(0) \right], \quad \dots, \quad L_{i,l_1-1}^{F_1}(0) = A_i^\top \left[ L_{1,l_1-1}^{Q}(0), \dots, L_{m,l_1-1}^{Q}(0) \right], \\
K_{i,l_1-1}^{F_{2^{l_1-1}}} &= p_{i,1} K_{1,l_1-1}^{F_{2^{l_1-1}-1}} \oplus \cdots \oplus p_{i,m} K_{m,l_1-1}^{F_{2^{l_1-1}-1}}, \quad \dots, \quad K_{i,l_1-1}^{F_1} = p_{i,1} K_{1,l_1-1}^{Q} \oplus \cdots \oplus p_{i,m} K_{m,l_1-1}^{Q}, \\
L_{i,0}^{F_1}(0) &= A_i^\top \left[ C_1^\top, \dots, C_m^\top \right], \quad K_{i,0}^{F_1} = p_{i,1} I \oplus \cdots \oplus p_{i,m} I.
\end{aligned}$$
Assume that the k-th iterate $X_i^{(k)}$ is approximated by
$$X_i^{(k)} \approx Y_{i,l_k}^{(k-1)} = L_{i,l_k}^{Q}(k-1) \, K_{i,l_k}^{Q} \, (L_{i,l_k}^{Q}(k-1))^\top, \qquad (22)$$
with tolerance ϵ (after $l_k$ iterations). Then
$$\mathcal{E}_i(X^{(k)}) \approx \mathcal{E}_i(Y_{\cdot,l_k}^{(k-1)}) = p_{i,1} Y_{1,l_k}^{(k-1)} + \cdots + p_{i,m} Y_{m,l_k}^{(k-1)} := L_{l_k}^{Q}(k-1) \, K_{l_k}^{Q} \, (L_{l_k}^{Q}(k-1))^\top,$$
where $L_{l_k}^{Q}(k-1) = \left[ L_{1,l_k}^{Q}(k-1), \dots, L_{m,l_k}^{Q}(k-1) \right]$ and $K_{l_k}^{Q} = p_{i,1} K_{1,l_k}^{Q} \oplus \cdots \oplus p_{i,m} K_{m,l_k}^{Q}$.
Then the matrices $L_i^{(k)}$ and $\hat{A}_i^{(k)}$ in (9), as well as $\bar{Q}_i^{(k)}$ in (21), are approximated by
$$L_i^{(k)a} = A_i^\top L_{l_k}^{Q}(k-1) K_{l_k}^{Q} (L_{l_k}^{Q}(k-1))^\top B_i \left( R_i + B_i^\top L_{l_k}^{Q}(k-1) K_{l_k}^{Q} (L_{l_k}^{Q}(k-1))^\top B_i \right)^{-1},$$
$$\hat{A}_i^{(k)a} = A_i - B_i (L_i^{(k)a})^\top, \quad \bar{Q}_i^{(k)a} = \left[ L_i^{(k)a}, \ C_i^\top \right].$$
We now solve the approximated low-rank coupled Stein equations
$$X_i - \left( A_i - B_i (L_i^{(k)a})^\top \right)^\top \mathcal{E}_i(X) \left( A_i - B_i (L_i^{(k)a})^\top \right) = \bar{Q}_i^{(k)a} \bar{K}_i (\bar{Q}_i^{(k)a})^\top, \quad i = 1, \dots, m, \qquad (23)$$
with $\bar{K}_i = R_i \oplus I$. Applying Lemma 4 again, we approximate $X_i^{(k+1)}$ by
$$Y_{i,l_{k+1}}^{(k)} = L_{i,l_{k+1}}^{Q}(k) \, K_{i,l_{k+1}}^{Q} \, (L_{i,l_{k+1}}^{Q}(k))^\top, \qquad (24)$$
with tolerance ϵ (after $l_{k+1}$ iterations), where
$$\begin{aligned}
L_{i,l_{k+1}}^{Q}(k) &= \left[ L_{i,l_{k+1}-1}^{Q}(k), \ L_{i,l_{k+1}-1}^{F_{2^{l_{k+1}-1}}}(k) \right], \quad K_{i,l_{k+1}}^{Q} = K_{i,l_{k+1}-1}^{Q} \oplus K_{i,l_{k+1}-1}^{F_{2^{l_{k+1}-1}}}, \\
L_{i,l_{k+1}-1}^{F_{2^{l_{k+1}-1}}}(k) &= (\hat{A}_i^{(k)a})^\top \left[ L_{1,l_{k+1}-1}^{F_{2^{l_{k+1}-1}-1}}(k), \dots, L_{m,l_{k+1}-1}^{F_{2^{l_{k+1}-1}-1}}(k) \right], \quad \dots, \\
L_{i,l_{k+1}-1}^{F_1}(k) &= (\hat{A}_i^{(k)a})^\top \left[ L_{1,l_{k+1}-1}^{Q}(k), \dots, L_{m,l_{k+1}-1}^{Q}(k) \right], \quad \dots, \quad L_{i,0}^{F_1}(k) = (\hat{A}_i^{(k)a})^\top \left[ \bar{Q}_1^{(k)a}, \dots, \bar{Q}_m^{(k)a} \right], \qquad (25)
\end{aligned}$$
and
$$\begin{aligned}
K_{i,l_{k+1}-1}^{F_{2^{l_{k+1}-1}}} &= p_{i,1} K_{1,l_{k+1}-1}^{F_{2^{l_{k+1}-1}-1}} \oplus \cdots \oplus p_{i,m} K_{m,l_{k+1}-1}^{F_{2^{l_{k+1}-1}-1}}, \quad \dots, \\
K_{i,l_{k+1}-1}^{F_1} &= p_{i,1} K_{1,l_{k+1}-1}^{Q} \oplus \cdots \oplus p_{i,m} K_{m,l_{k+1}-1}^{Q}, \quad K_{i,0}^{F_1} = p_{i,1} \bar{K}_1 \oplus \cdots \oplus p_{i,m} \bar{K}_m. \qquad (26)
\end{aligned}$$
We can then construct
$$L_i^{(k+1)a} = A_i^\top L_{l_{k+1}}^{Q}(k) K_{l_{k+1}}^{Q} (L_{l_{k+1}}^{Q}(k))^\top B_i \left( R_i + B_i^\top L_{l_{k+1}}^{Q}(k) K_{l_{k+1}}^{Q} (L_{l_{k+1}}^{Q}(k))^\top B_i \right)^{-1}, \qquad (27)$$
$$\hat{A}_i^{(k+1)a} = A_i - B_i (L_i^{(k+1)a})^\top, \quad \bar{Q}_i^{(k+1)a} = \left[ L_i^{(k+1)a}, \ C_i^\top \right], \qquad (28)$$
and resume the iteration (23) successively.
Note that the size of the kernel matrix $\bar{K}_i = R_i \oplus I$ remains invariant throughout the iterations. Consequently, when the column counts of $R_i$ and $C_i^\top$ are small relative to the dimension N, a distinctive feature of the large-scale ONM is that the rank of the approximated solution resets to a small, fixed number after each iteration before growing again. Given the quadratic convergence of the ONM, this property, combined with the truncation and compression discussed later, enables effective control over rank growth and prevents excessive increase in the solution's column dimension during the iterative process. A simplified sketch of these factored objects follows.
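The sketch below is a simplified plain Smith sweep of our own (not the doubling recursion of Lemma 4; SciPy is assumed available and all names are hypothetical). It shows how the right-hand side factors of (21) are assembled and how the factor columns accumulate until truncation becomes necessary:

    import numpy as np
    from scipy.linalg import block_diag

    def rhs_factors(L, C, R):
        """Factor Q_i + L_i R_i L_i^T = Qbar_i Kbar_i Qbar_i^T as in (21)."""
        Qbar = np.hstack([L, C.T])
        Kbar = block_diag(R, np.eye(C.shape[0]))
        return Qbar, Kbar

    def factored_smith_step(Ah, P, LW, KW, Ls, Ks):
        """One factored sweep of X_i = W_i + Ah_i^T E_i(X) Ah_i, where
        W_i = LW_i KW_i LW_i^T and the current X_j = Ls_j Ks_j Ls_j^T."""
        m = len(Ah)
        newL = [np.hstack([LW[i]] + [Ah[i].T @ Ls[j] for j in range(m)])
                for i in range(m)]
        newK = [block_diag(KW[i], *[P[i, j] * Ks[j] for j in range(m)])
                for i in range(m)]
        return newL, newK   # columns grow to cols(LW_i) + sum_j cols(Ls_j)

The geometric column growth visible in factored_smith_step is exactly what the truncation–compression of the following subsections keeps in check, while the kernel of each fresh right-hand side resets to the fixed size of $R_i \oplus I$ at every outer step.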

4.2. Computation of the Residual

Assume that the subproblem (21) in the ONM is solved after $l_{k+1}$ inner iterations, yielding the approximate solution $Y_{i,l_{k+1}}^{(k)}$ in (24) to the coupled Stein equations with error ϵ. Below, we provide a detailed description of the residual computation at the current ONM iteration.
Substituting (24) into (1), the residual of the large-scale equation can be expressed as
$$\begin{aligned}
\mathcal{R}_d(Y_{i,l_{k+1}}^{(k)}) &= -Y_{i,l_{k+1}}^{(k)} + A_i^\top \mathcal{E}_i(Y_{\cdot,l_{k+1}}^{(k)}) A_i + C_i^\top C_i - A_i^\top \mathcal{E}_i(Y_{\cdot,l_{k+1}}^{(k)}) B_i \left( R_i + B_i^\top \mathcal{E}_i(Y_{\cdot,l_{k+1}}^{(k)}) B_i \right)^{-1} B_i^\top \mathcal{E}_i(Y_{\cdot,l_{k+1}}^{(k)}) A_i \\
&= L_i^{R}(k+1) \, K_i^{R}(k+1) \, (L_i^{R}(k+1))^\top, \qquad (29)
\end{aligned}$$
where
$$L_i^{R}(k+1) = \left[ L_{i,l_{k+1}}^{Q}(k), \ A_i^\top L_{1,l_{k+1}}^{Q}(k), \dots, A_i^\top L_{m,l_{k+1}}^{Q}(k), \ C_i^\top \right], \quad K_i^{R}(k+1) = \left( -K_{i,l_{k+1}}^{Q} \right) \oplus \bar{K}_{i,l_{k+1}}^{Q} \oplus I, \qquad (30)$$
and
$$\bar{K}_{i,l_{k+1}}^{Q} = D_i - D_i \begin{bmatrix} (L_{1,l_{k+1}}^{Q}(k))^\top B_i \\ \vdots \\ (L_{m,l_{k+1}}^{Q}(k))^\top B_i \end{bmatrix} \left( R_i + \left[ B_i^\top L_{1,l_{k+1}}^{Q}(k), \dots, B_i^\top L_{m,l_{k+1}}^{Q}(k) \right] D_i \begin{bmatrix} (L_{1,l_{k+1}}^{Q}(k))^\top B_i \\ \vdots \\ (L_{m,l_{k+1}}^{Q}(k))^\top B_i \end{bmatrix} \right)^{-1} \left[ B_i^\top L_{1,l_{k+1}}^{Q}(k), \dots, B_i^\top L_{m,l_{k+1}}^{Q}(k) \right] D_i,$$
with $D_i = p_{i,1} K_{1,l_{k+1}}^{Q} \oplus \cdots \oplus p_{i,m} K_{m,l_{k+1}}^{Q}$.
Clearly, the residual in (29) possesses a low-rank structure. By applying the truncation and compression technique discussed in the next subsection, the computation of the residual can be reduced to a small-scale problem whose size matches that of the kernel matrix  K i R ( k + 1 ) .
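The reduction of the residual computation to the kernel size can be illustrated by a small sketch (our own, with hypothetical names): for a factored matrix $L K L^\top$ with a tall, skinny L, an economy QR factorization $L = QT$ gives $\|L K L^\top\|_2 = \|T K T^\top\|_2$, since Q has orthonormal columns:

    import numpy as np

    def factored_norm(L, K):
        """2-norm of L K L^T computed at the size of the kernel K."""
        Q, T = np.linalg.qr(L, mode='reduced')   # L = Q T with Q^T Q = I
        S = T @ K @ T.T                          # small symmetric matrix
        return np.linalg.norm(S, 2)

    rng = np.random.default_rng(3)
    L = rng.standard_normal((10000, 40))         # tall residual factor
    K = rng.standard_normal((40, 40)); K = (K + K.T) / 2
    print(factored_norm(L, K))                   # no 10000 x 10000 matrix formed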

4.3. Truncation and Compression

The truncation and compression (TC) technique [24,25] is applied at two critical stages in our algorithm: during the iteration process and in the computation of the residual. Specifically, in each ONM iteration we solve the subproblem using the operator Smith algorithm. From the iterative structure in (25), it can be observed that the number of columns in $L_{i,l_k-1}^{F_{2^{l_k-1}}}(k)$ grows approximately as $O(m^{2^k - 1} l_i)$, where $l_i$, a fixed constant independent of k, denotes the initial column count of the factor $\bar{Q}_i^{(k)a}$. To mitigate the resulting growth in computational and memory cost, we apply the TC technique to reduce the column dimension of $L_{i,l_k-1}^{F_{2^{l_k-1}}}(k)$ in (25).
Concretely, we impose a QR decomposition with column pivoting on $L_{i,l_k-1}^{F_{2^{l_k-1}}}(k)$ as
$$L_{i,l_k-1}^{F_{2^{l_k-1}}}(k) \, P_i^{F_{2^{l_k-1}}} = \left[ T_i^{F_{2^{l_k-1}}} \ \tilde{Q}_i^{F_{2^{l_k-1}}} \right] \begin{bmatrix} U_1^{F_{2^{l_k-1}}} & U_2^{F_{2^{l_k-1}}} \\ 0 & \tilde{U}^{F_{2^{l_k-1}}} \end{bmatrix}, \qquad (31)$$
such that $\tilde{U}^{F_{2^{l_k-1}}}$, after $l_k$ iterations, satisfies
$$\| \tilde{U}^{F_{2^{l_k-1}}} \| < u_0^f \tau,$$
where $P_i^{F_{2^{l_k-1}}}$ is the permutation matrix ensuring that the diagonal elements of the triangular factor decrease in absolute value, $u_0^f$ is a constant, and τ is a small tolerance controlling the TC. Denote by $m^{f_{2^{l_k-1}}}$ the column number of $L_{i,l_k-1}^{F_{2^{l_k-1}}}(k)$, bounded above by a given $m_{\max}$. Then it holds that
$$r^{f_{2^{l_k-1}}} := \operatorname{rank}\left( L_{i,l_k-1}^{F_{2^{l_k-1}}}(k) \right) \leq m^{f_{2^{l_k-1}}} \leq m_{\max},$$
with $m_{\max} \ll N$. We then truncate $\tilde{U}^{F_{2^{l_k-1}}}$ in (31) and obtain the approximated factor
$$L_{i,l_k-1}^{F_{2^{l_k-1}}}(k)_t = T_i^{F_{2^{l_k-1}}}. \qquad (32)$$
The correspondingly compressed kernel is
$$K_{i,l_k-1}^{F_{2^{l_k-1}} t} := \left[ U_{i,1}^{F_{2^{l_k-1}}} \ U_{i,2}^{F_{2^{l_k-1}}} \right] (P_i^{F_{2^{l_k-1}}})^\top \, K_{i,l_k-1}^{F_{2^{l_k-1}}} \, P_i^{F_{2^{l_k-1}}} \left[ U_{i,1}^{F_{2^{l_k-1}}} \ U_{i,2}^{F_{2^{l_k-1}}} \right]^\top. \qquad (33)$$
In addition, TC is also employed in the evaluation of the residual (29). Specifically, we implement the QR decomposition with pivoting on $L_i^{R}(k)$ in (30) as
$$L_i^{R}(k) \, P_i^{R}(k) = \left[ T_i^{R}(k) \ \tilde{Q}_i^{R}(k) \right] \begin{bmatrix} U_{i,1}^{R}(k) & U_{i,2}^{R}(k) \\ 0 & \tilde{U}_i^{R}(k) \end{bmatrix}, \qquad (34)$$
such that $\tilde{U}_i^{R}(k)$ satisfies
$$\| \tilde{U}_i^{R}(k) \| < u_0^r \tau,$$
where $P_i^{R}(k)$ is a pivoting matrix and $u_0^r$ is some constant. Then the compressed kernel of the residual is
$$K_i^{R}(k)_t = \left[ U_{i,1}^{R}(k) \ U_{i,2}^{R}(k) \right] (P_i^{R}(k))^\top \, K_i^{R}(k) \, P_i^{R}(k) \left[ U_{i,1}^{R}(k) \ U_{i,2}^{R}(k) \right]^\top, \qquad (35)$$
and the termination condition of the whole algorithm is
$$\mathrm{Rel\_Res} = \max_i \frac{\| K_i^{R}(k)_t \|}{\| K_i^{R}(0)_t \|} \leq \epsilon, \qquad (36)$$
with ϵ being some tolerance.
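A minimal version of the TC step (31)–(35) and of the stopping test (36) might look as follows (our own sketch under the assumption that SciPy's pivoted QR is available; the names and the simple diagonal-based truncation rule are illustrative):

    import numpy as np
    from scipy.linalg import qr

    def truncate_compress(L, K, tol):
        """Return (Lt, Kt) with orthonormal Lt and Lt Kt Lt^T ~= L K L^T."""
        Q, U, piv = qr(L, mode='economic', pivoting=True)  # L[:, piv] = Q U
        d = np.abs(np.diag(U))                 # non-increasing by pivoting
        r = int(np.sum(d > tol * d[0]))        # retained numerical rank
        Pm = np.eye(L.shape[1])[:, piv]        # permutation matrix
        U1 = U[:r, :]                          # the [U_1 U_2] block row
        Kt = U1 @ Pm.T @ K @ Pm @ U1.T         # compressed kernel, cf. (35)
        return Q[:, :r], Kt

    def rel_res(KR_kt, KR_0t):
        """Relative residual (36) computed from compressed kernels only."""
        return max(np.linalg.norm(Kk) / np.linalg.norm(K0)
                   for Kk, K0 in zip(KR_kt, KR_0t))

Because the retained factor has orthonormal columns, the norm of the compressed kernel matches the norm of the full N × N residual, so the test (36) never forms a large matrix.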

4.4. Algorithm and Complexity

The low-rank structured ONM (ONM_lr), enhanced with truncation and compression (TC), is outlined in Algorithm 1 and the corresponding flowchart is given in Figure 1.
Algorithm 1: ONM_lr. Solve large-scale CDAREs with sparse $A_i$ and low-rank $Q_i = C_i^\top C_i$.
Inputs: Sparse matrices $A_i$, low-rank factors $B_i$ and $C_i$ as well as small matrices $R_i$ for $i = 1, \dots, m$, probability matrix $\Pi \in \mathbb{R}^{m \times m}$, truncation tolerance τ, upper bound $m_{\max}$, and the iteration tolerance ϵ.
Outputs: Low-rank matrix $L_i^S$ and the kernel matrix $K_i^S$ with the solution $X_i^* \approx L_i^S K_i^S (L_i^S)^\top$.
1. Set $\bar{Q}_i^{(0)} = C_i^\top$ and $\bar{K}_i = I$ for $i = 1, \dots, m$.
2. For $k = 0, 1, \dots$, until convergence is reached:
3.  Compute $L_{i,l_{k+1}-1}^{F_{2^{l_{k+1}-1}}}(k)$ and $K_{i,l_{k+1}-1}^{F_{2^{l_{k+1}-1}}}$ as in (25) and (26), respectively.
4.  Truncate and compress $L_{i,l_{k+1}-1}^{F_{2^{l_{k+1}-1}}}(k)$ as in (31) with accuracy $u_0^f \tau$.
5.  Construct the compressed $L_{i,l_{k+1}-1}^{F_{2^{l_{k+1}-1}}}(k)_t$ and $K_{i,l_{k+1}-1}^{F_{2^{l_{k+1}-1}} t}$ as in (32) and (33), respectively.
6.  Construct $L_{i,l_{k+1}}^{Q}(k)$ and $K_{i,l_{k+1}}^{Q}$ as in (25).
7.  Compute the residual matrices $L_i^{R}(k)$ and $K_i^{R}(k)$ as in (30).
8.  Truncate and compress $L_i^{R}(k)$ as in (34) with accuracy $u_0^r \tau$.
9.  Construct the compressed residual kernel $K_i^{R}(k)_t$ as in (35).
10.  Evaluate the relative residual Rel_Res in (36).
11.  If Rel_Res < ϵ, break. End.
12.  Construct $L_i^{(k+1)a}$ and $\bar{Q}_i^{(k+1)a}$ as in (27) and (28), respectively. Set $\bar{K}_i = R_i \oplus I$.
13.  $k := k + 1$.
14. End (For)
15. Output $K_i^S := K_{i,l_{k+1}}^{Q}$, $L_i^S := L_{i,l_{k+1}}^{Q}(k)$.
Remark 2.
Note that in lines 3–6 we solve the CDSEs (23) via the operator Smith algorithm with truncation. Assuming the subproblem is solved after $l_{k+1}$ iterations with tolerance ϵ, the obtained approximate solution to the subproblem is (24).
We next analyze the computational complexity of ONM_lr at each iteration. We always assume that the matrix $A_i$ ($i = 1, \dots, m$) is sufficiently sparse. This allows us to count the cost of both the product $A_i U$ and the solution of the equation $A_i X = U$ as within the range of cN floating-point operations (flops), where U is an $N \times m_u$ matrix with $m_u \ll N$ and c is a constant. Additionally, for $i = 1, \dots, m$, the maximal column numbers of the initial matrices $B_i$ and $C_i^\top$, the truncated factors $L_{i,l_j}^{F_{2^{l_j}}}(k)$ and $L_{i,l_j}^{Q}(k)$, and the residual matrix $L_i^{R}(k+1)$ are denoted by $m_b$, $m_c$, $m_{l_j}^{f}(k)$, $m_{l_j}^{q}(k)$, and $m^{r}(k+1)$, respectively. The flops and memory of the k-th iteration are summarized in Table 1.
Remark 3.
1. We assume that $l_{k+1}$ iterations of the OSA are required to solve the subproblem (21) with tolerance ϵ; the complexity of each iteration in solving the subproblem is given in [20], and the resulting complexities of $L_{i,l_{k+1}-1}^{F_{2^{l_{k+1}-1}}}(k)$ and $K_{i,l_{k+1}-1}^{F_{2^{l_{k+1}-1}}}$ in each ONM iteration are listed in the first two rows of Table 1.
2. The QR decomposition and the inverse in (27) used to form $L_i^{(k+1)a}$ are implemented by the Householder method and the LU decomposition, respectively [26].

5. Numerical Examples

In this section, we demonstrate the effectiveness of the proposed ONM_lr algorithm for computing the solution of the large-scale CDARE (1) through examples drawn from [27,28,29,30]. ONM_lr was implemented in MATLAB R2019a on a 64-bit Windows 10 desktop equipped with a 3.0 GHz Intel Core i5 processor (6 cores/6 threads) and 32 GB of RAM; the machine precision is eps ≈ 2.22 × 10^-16. In particular, the HODLR structure [21,22] used in the standard Newton's method was also coded in MATLAB and can be viewed at https://github.com/numpi/hm-toolbox (accessed on 30 July 2025).
The maximum number of columns in the low-rank factors was restricted to $m_{\max} = 1000$, and the truncation–compression (TC) tolerance was chosen as $\tau = 10^{-16}$. The residuals were evaluated as in (36), and the stopping criterion was set to a tolerance of $\epsilon = 10^{-13}$. We did not compare the proposed method with the recently developed inverse-free fixed-point methods [12,13], as those approaches require the initial matrices $Q_i$ to be nonsingular, an assumption that is clearly not satisfied in large-scale engineering problems with low-rank structure.
Example 1.
This example is adapted, with slight modification, from the all-pass single-input single-output (SISO) system originally studied in [29]. In this setting, the controllability and observability Gramians satisfy a quasi-inverse relation, i.e., $W_{c_i} W_{o_i} = \sigma_i I$ for some $\sigma_i > 0$. Consequently, the system exhibits a single Hankel singular value with multiplicity equal to the system order.
The derived system matrices are as follows:
$$A_1 = 0.4 \bar{A}_1 \in \mathbb{R}^{N \times N}, \quad A_2 = 0.5 \bar{A}_2 \in \mathbb{R}^{N \times N}, \quad B_1 = [1, \dots, 0, 0]^\top \in \mathbb{R}^{N \times 1}, \quad B_2 = [0, \dots, 0, 1]^\top \in \mathbb{R}^{N \times 1},$$
$$R_1 = 1, \quad R_2 = 1, \quad C_1 = [1, 0, \dots, 0, 1] \in \mathbb{R}^{1 \times N}, \quad C_2 = [0, 1, 0, \dots, 0, 1, 0] \in \mathbb{R}^{1 \times N},$$
where $\bar{A}_1$ and $\bar{A}_2$ are both tri-diagonal matrices $\operatorname{tridiag}(1, 0, 1)$ but with $\bar{A}_1(1,1) = 0.5$ and $\bar{A}_2(1,1) = 0.8$, respectively. We consider m = 2 and select the probability matrix
$$\Pi = \begin{bmatrix} 0.244 & 0.756 \\ 0.342 & 0.658 \end{bmatrix}.$$
We first compare the performance of the ONM_lr algorithm with the standard Newton's method incorporating the HODLR structure (SN_HODLR) for CDAREs of dimensions 10,000 and 20,000. The results are reported in Table 2, where the columns It., CPU, and Rel_Res list the iteration number, the elapsed CPU time, and the relative residual of the CDARE, respectively, when the algorithm terminates. For N = 10,000, ONM_lr achieves the prescribed residual level in approximately 6.2 s, while SN_HODLR requires about 1320 s to reach termination, roughly 210 times longer than ONM_lr. For N = 20,000, ONM_lr reaches the prescribed residual in about 6.7 s, whereas SN_HODLR runs out of memory during the iterations and fails to complete the computation.
We then assess the performance of the ONM_lr algorithm on larger CDAREs with dimensions N = 50,000, 70,000, 90,000, and 110,000, and summarize the numerical results in Table 3. The quantities $\delta t_k$ and $t_k$ denote the CPU time of the k-th iteration and the cumulative runtime up to iteration k, respectively. The column Rel_Res reports the relative residual of the CDARE at each iteration, while the NC column indicates the maximum number of columns in $L_{i,l_{k+1}}^{Q}(k)$. As shown in Table 3, ONM_lr consistently achieves a residual on the order of $10^{-13}$ after 4 iterations. Moreover, the Rel_Res column clearly demonstrates the algorithm's quadratic convergence rate.
To further illustrate the convergence characteristics of the OSA used to solve each subproblem, we depict its convergence trajectories for various dimensions in Figure 2. In each subplot, the outer iterations 1 through 4 are represented by red, yellow, green, and orange, respectively. Within each color, a gradient from light to dark corresponds to increasing values drawn from the interval (0, 10). The concentric rings indicate logarithmic scales from $10^{-1}$ to $10^{-15}$. The convergence behavior of the OSA is marked by black circles, blue pentagrams, purple stars, and brown diamonds across the four outer iterations. From the number of concentric levels traversed by each marker, it is evident that the OSA achieves near-quadratic convergence across all subproblems. This reinforces the effectiveness of the proposed operator Newton framework in delivering rapid and robust convergence from the inner iterations to the overall CDARE solution.
Example 2.
Consider a structural model of a vertically mounted stand, representative of machinery control systems. This model corresponds to a segment of a machine tool frame, where a series of guide rails is fixed along one surface to facilitate the motion of a tool slide during operation [28,31]. The geometry has been modeled and meshed using ANSYS, and the spatial discretization employs linear Lagrange elements within the finite element framework implemented in FEniCS.
The resulting system matrices are
A 1 = r 1 A R 16 , 626 × 16 , 626 , A 2 = r 2 A R 16 , 626 × 16 , 626 , B 1 = B 2 = B R 16 , 626 × 1 R 1 = R 2 = 1 ,
where the random scalars $r_1, r_2 \in (0, 1)$ are used for certain parameterizations, and the vector B has 392 nonzero elements with a maximum entry of at most 0.00251. Due to the structural similarity between $A_1$ and $A_2$, only the sparsity pattern of $A_1$ is depicted in the left panel of Figure 3. The vectors $C_1, C_2 \in \mathbb{R}^{1 \times 16{,}626}$ are mostly zero, with five nonzero entries located at positions (3341, 6743, 8932, 11,324, 16,563) for $C_1$ and (1046, 2436, 6467, 8423, 12,574) for $C_2$, respectively. Full matrix data are available from [27] and at the MOR Wiki repository (https://morwiki.mpi-magdeburg.mpg.de/morwiki/index.php/Vertical_Stand) (accessed on 30 July 2025). For this example, we set m = 2 and define the mode transition probability matrix as
$$\Pi = \begin{bmatrix} 0.564 & 0.436 \\ 0.785 & 0.215 \end{bmatrix}.$$
We apply the ONM_lr algorithm to solve the resulting CDARE. We omit the comparison with the standard Newton's method incorporating the HODLR structure, as it runs out of memory during the iterations at this problem dimension. Table 4 summarizes the numerical results of ONM_lr. The residuals decrease to $O(10^{-15})$ after only three outer iterations. The columns $t_k$ and $\delta t_k$ report the cumulative and per-iteration CPU times, respectively. The NC column shows that the column dimension of $L_{i,l_{k+1}}^{Q}(k)$ more than doubles in the first iteration but grows more slowly in subsequent iterations. The Rel_Res column confirms that ONM_lr retains quadratic convergence.
To further investigate the convergence of the OSA for the subproblems, we present their convergence histories in the right panel of Figure 3. The red, yellow, and green colors correspond to the first, second, and third subproblem solves, respectively, with increasing intensities representing values in (0, 10). Concentric circles indicate residual magnitudes from $10^{-1}$ to $10^{-18}$. The convergence trajectories are plotted using black circles, blue pentagrams, and purple stars. The number of magnitude levels traversed by these markers provides clear evidence of nearly quadratic convergence for the OSA. This validates the robustness and efficiency of the ONM_lr algorithm in conjunction with the OSA strategy.
Example 3.
Consider a semi-discretized heat transfer model arising from the optimal cooling of steel profiles in automated control systems, as studied in [27]. The dimension of the resulting dynamical system depends on the level of refinement applied to the computational mesh. Spatial discretization is performed using linear Lagrange elements via the ALBERTA-1.2 finite element toolbox [30].
We slightly modify the model matrices as follows:
$$A_1 = r_1 \bar{A} \in \mathbb{R}^{N \times N}, \quad A_2 = r_2 \bar{A} \in \mathbb{R}^{N \times N}, \quad B_1 = B_2 \in \mathbb{R}^{N \times 7}, \quad C_1 = C_2 \in \mathbb{R}^{7 \times N},$$
where $\bar{A} = A / \|A\|$ and $\|A\|$ is estimated by 'normest' in MATLAB. We take $r_1 = 0.98$ and $r_2 = 0.92$ for N = 20,209, and $r_1 = 0.87$ and $r_2 = 0.97$ for N = 79,841; $R_1 = R_2 = I_7$. In this experiment, we take $C_1 = r_3 C$ and $C_2 = r_4 C$ with $r_3$ and $r_4$ being random numbers in (0, 1). The matrices $\bar{A} \in \mathbb{R}^{N \times N}$, $B \in \mathbb{R}^{N \times 7}$, and $C \in \mathbb{R}^{7 \times N}$ can be found at [27] or in the MOR Wiki repository (https://morwiki.mpi-magdeburg.mpg.de/morwiki/index.php/Steel_Profile) (accessed on 30 July 2025). The probability matrix is defined as
$$\Pi = \begin{bmatrix} 0.713 & 0.287 \\ 0.584 & 0.416 \end{bmatrix}.$$
To assess the performance of the proposed ONM_lr algorithm, we solve Equation (1) for two system sizes: N = 20,209 and N = 79,841. Again, we omit the comparison with the standard Newton's method incorporating the HODLR structure, as it runs out of memory during the iterations at these problem dimensions. The numerical results of ONM_lr are reported in Table 5. In both cases, ONM_lr attains a residual norm on the order of $10^{-15}$ within just three outer iterations, with cumulative CPU times of approximately 10.5 s and 19.5 s. The It. column records the number of outer iterations, while the NC column reflects the maximum number of columns in $L_{i,l_{k+1}}^{Q}(k)$ at each step. The modest growth in NC, remaining below a factor of two across iterations, highlights the efficiency of the truncation–compression (TC) strategy employed within both the outer ONM_lr iterations and the inner OSA iterations. Furthermore, the Rel_Res column confirms the quadratic convergence of ONM_lr.
To further visualize the convergence characteristics of the subproblem solver, we depict the residual histories for both problem sizes in Figure 4. Each subplot uses red, yellow, and green markers to represent the first through third subproblem solves. Within each color, darker shades correspond to higher iteration indices. Concentric rings denote residual levels ranging from $10^{-1}$ to $10^{-16}$. The convergence paths of the operator Smith iteration are illustrated using black circles, blue pentagrams, and purple stars. The number of magnitude rings traversed by these markers clearly indicates that each subproblem is solved with nearly quadratic convergence. This further confirms the rapid and robust performance of the ONM_lr algorithm when combined with the OSA subproblem strategy.

6. Conclusions

We have developed an operator Newton method for computing the solutions of a class of coupled discrete-time algebraic Riccati equations (CDAREs) arising from jump systems. The proposed framework leverages an inner operator Smith algorithm to solve the subproblems and, under appropriate assumptions, guarantees quadratic convergence in both the inner and outer iterations. To efficiently address large-scale systems in engineering applications, we further presented a low-rank variant, ONM_lr, which incorporates truncation and compression strategies to control memory and computational costs. Compared with the standard Newton's method incorporating the HODLR structure [21,22] and with the recently developed inverse-free fixed-point iterations [12,13], which require invertible constant matrices and therefore fail to accommodate low-rank structures in large-scale scenarios, the proposed ONM_lr algorithm demonstrates its effectiveness in handling large-scale problems through numerical experiments. However, when the CDARE approaches the critical case, the presented ONM tends to exhibit near-linear convergence, leading to a significant increase in computational time for large-scale problems. Overcoming this limitation is an important direction for further research and is currently under investigation. Another future research avenue is to extend the ONM to coupled continuous-time Riccati equations and other types of equations arising in large-scale stochastic jump systems.

Author Contributions

Conceptualization, B.Y.; methodology, B.Y.; software, Y.L.; validation, N.D.; and formal analysis, N.D. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the NSF of Hunan Province (2023JJ50164, 2023JJ50165, 2026JJ50180), the foundation of the degree and postgraduate education reform project of the Hunan University of Technology (JGYB23009), and the basic education teaching reform research project of Hunan Province (Y2025452).

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

Abbreviations/Notations | Definition
$\mathbb{D}_{<}$ ($\mathbb{D}_{\leq}$) | The inner region of the open (closed) unit disk on the complex plane
$I_m$ | Identity matrix of size m
$\sigma(A)$ | Spectrum of matrix A
$\rho(A)$ | Spectral radius of matrix A
$\|\cdot\|$ | $\infty$-norm of a matrix
$\|\cdot\|_2$ | Euclidean norm of a matrix
$A \oplus B$ | Block diagonal matrix $\begin{bmatrix} A & 0 \\ 0 & B \end{bmatrix}$
$\operatorname{rank}(\cdot)$ | The rank of a matrix
d-stable | The spectral radius of a matrix is less than 1, i.e., $\rho(\cdot) < 1$
SPSD (or $A \geq 0$) | Symmetric and positive semi-definite matrix A
SPD (or $A > 0$) | Symmetric and positive definite matrix A
ONM | Operator Newton Method
OSA | Operator Smith Algorithm
TC | Truncation and compression
ϵ | Tolerance for iteration termination
τ | Tolerance for truncation and compression
$m_{\max}$ | Allowable maximum column number of a matrix in iterations

References

  1. Abou-Kandil, H.; Freiling, G.; Jank, G. On the solution of discrete-time Markovian jump linear quadratic control problems. Automatica 1995, 31, 765–768. [Google Scholar] [CrossRef]
  2. Liu, Y.; Wang, Z.; Lin, X. Non-zero sum Nash game for discrete-time infinite Markov jump stochastic systems with applications. Axioms 2023, 12, 882. [Google Scholar] [CrossRef]
  3. Do Costa, O.L.V.; Marques, R.P.; Fragoso, M.D. Discrete-Time Markov Jump Linear Systems; Springer: London, UK, 2005. [Google Scholar]
  4. Zhou, B.; Duan, G.-R.; Li, Z.-Y. Gradient based iterative algorithm for solving coupled matrix equations. Syst. Control. Lett. 2009, 58, 327–333. [Google Scholar] [CrossRef]
  5. Wang, Q.; Lam, J.; Wei, Y.; Chen, T. Iterative solutions of coupled discrete Markovian jump Lyapunov equations. Comput. Math. Appl. 2008, 55, 843–850. [Google Scholar] [CrossRef]
  6. Wu, A.-G.; Duan, G.-R. New iterative algorithms for solving coupled Markovian jump Lyapunov equations. IEEE Trans. Autom. Control 2015, 60, 289–294. [Google Scholar]
  7. Dragan, V.; Morozan, T.; Stoica, A.M. Robust Control of Discrete-time Linear Stochastic Systems; Springer: Berlin/Heidelberg, Germany, 2010. [Google Scholar]
  8. Ivanov, I.G. Accelerated LMI solvers for the maximal solution to a set of discrete-time algebraic Riccati equations. Appl. Math. E-Notes 2012, 12, 228–238. [Google Scholar]
  9. Ivanov, I.G. A method to solve the discrete-time coupled algebraic Riccati equations. Appl. Math. Comput. 2008, 206, 34–41. [Google Scholar] [CrossRef]
  10. Haghani, F.K.; Soleymani, F. An improved Schulz-type iterative method for matrix inversion with application. Trans. Inst. Meas. Control 2014, 36, 983–991. [Google Scholar] [CrossRef]
  11. Dai, H.; Bai, Z.Z. On eigenvalue bounds and iteration methods for discrete algebraic Riccati equations. J. Comput. Math. 2011, 29, 341–366. [Google Scholar] [CrossRef]
  12. Jiang, K.-W.; Li, Z.; Zhang, Y. An inversion-free iterative algorithm with a scalar tuning parameter for coupled Riccati matrix equations arising in LQ optimal control of Markov jump systems. IEEE Trans. Autom. Control 2024, 70, 1913–1920. [Google Scholar] [CrossRef]
  13. Li, Z.; Zhang, Y.; Wu, A.-G. An inversion-free iterative algorithm for Riccati matrix equations in discrete-time Markov jump systems. IEEE Trans. Autom. Control 2022, 67, 4754–4761. [Google Scholar] [CrossRef]
  14. Feng, T.-T.; Chu, E.K.W. Newton’s method for coupled continuous-time algebraic Riccati equations. J. Appl. Math. Comput. 2024, 70, 1023–1042. [Google Scholar] [CrossRef]
  15. Salama, A.; Gourishankar, V. A computational algorithm for solving a system of coupled algebraic matrix Riccati equations. IEEE Trans. Comput. 1974, C-23, 100–102. [Google Scholar] [CrossRef]
  16. Margenov, S.D.; Vulkov, L.G.; Wasniewski, J. (Eds.) Numerical Analysis and Its Applications. In Proceedings of the 4th International Conference (NAA 2008), Lozenetz, Bulgaria, 16–20 June 2008; Revised Selected Papers; Springer: Berlin/Heidelberg, Germany, 2009; Volume 5434. [Google Scholar]
  17. Wang, L.; Zhu, Y.-L. A new inversion-free iterative algorithm for the discrete algebraic Riccati equation. IMA J. Math. Control Inf. 2024, 41, 149–164. [Google Scholar] [CrossRef]
  18. Jerbi, H.; Alshammari, O.; Aoun, S.B.; Kchaou, M.; Simos, T.E.; Mourtas, S.D.; Katsikis, V.N. Hermitian solutions of the quaternion algebraic Riccati equations through zeroing neural networks with application to quadrotor control. Mathematics 2024, 12, 15. [Google Scholar] [CrossRef]
  19. Escorcia, J.M.; Suazo, E. On blow-up and explicit soliton solutions for coupled variable coefficient nonlinear Schrödinger equations. Mathematics 2024, 12, 2694. [Google Scholar] [CrossRef]
  20. Yu, B.; Dong, N.; Hu, B.-Q. Operator Smith algorithm for coupled Stein equations from jump control systems. Axioms 2024, 13, 249. [Google Scholar] [CrossRef]
  21. Massei, S.; Palitta, D.; Robol, L. Solving rank structured Sylvester and Lyapunov equations. SIAM J. Matrix Anal. Appl. 2018, 39, 1564–1590. [Google Scholar] [CrossRef]
  22. Massei, S.; Robol, L.; Kressner, D. hm-toolbox: Matlab software for HODLR and HSS matrices. SIAM J. Sci. Comput. 2020, 42, C43–C68. [Google Scholar] [CrossRef]
  23. Higham, N.J. Accuracy and Stability of Numerical Algorithms, 2nd ed.; SIAM: Philadelphia, PA, USA, 2002. [Google Scholar]
  24. Yu, B.; Dong, N.; Tang, Q. Factorized squared Smith method for large-scale Stein equations with high-rank terms. Automatica 2023, 154, 111057. [Google Scholar] [CrossRef]
  25. Yu, B.; Fan, H.-Y.; Chu, E.K.-W. Large-scale algebraic Riccati equations with high-rank constant terms. J. Comput. Appl. Math. 2019, 361, 130–143. [Google Scholar] [CrossRef]
  26. Higham, N.J. Functions of Matrices: Theory and Computation; SIAM: Philadelphia, PA, USA, 2008. [Google Scholar]
  27. Korvink, G.; Rudnyi, B. Oberwolfach Benchmark Collection. In Dimension Reduction of Large-Scale Systems; Benner, P., Sorensen, D.C., Mehrmann, V., Eds.; Lecture Notes in Computational Science and Engineering; Springer: Berlin/Heidelberg, Germany, 2005; p. 45. [Google Scholar]
  28. Lang, N. Numerical Methods for Large-Scale Linear Time-Varying Control Systems and related Differential Matrix Equations; Logos-Verlag: Berlin, Germany, 2018. [Google Scholar]
  29. Ober, R.J. Asymptotically Stable All-Pass Transfer Functions: Canonical Form, Parametrization and Realization. IFAC Proc. Vol. 1987, 20, 181–185. [Google Scholar] [CrossRef]
  30. Schmidt, A.; Siebert, K. Design of Adaptive Finite Element Software—The Finite Element Toolbox ALBERTA; Lecture Notes in Computational Science and Engineering; Springer: Berlin/Heidelberg, Germany, 2005; Volume 42. [Google Scholar]
  31. Chahlaoui, Y.; Van Dooren, P. Benchmark examples for model reduction of linear time-invariant dynamical systems, dimension reduction of large-scale systems. In Dimension Reduction of Large-Scale Systems; Springer: Berlin/Heidelberg, Germany, 2005; Volume 45, pp. 379–392. [Google Scholar]
Figure 1. Flowchart of the low-rank structured ONM equipped with truncation and compression.
Figure 2. Residual history of the operator Smith iteration in solving each subproblem of Example 1.
Figure 3. The discretized matrix A and residual history of the OSA in solving each subproblem of Example 2.
Figure 4. Residual history of the OSA in each subproblem of Example 3.
Table 1. Complexity and memory at each iteration of Algorithm ONM_lr.
Items | Flops | Memory
$L_{i,l_{k+1}-1}^{F_{2^{l_{k+1}-1}}}(k)$ | $\sum_{j=1}^{l_{k+1}} c\, m_{j-1}^{q}(k)\, 2^{j-1} (m^{2^{j-1}} + m) N$ | $m^{2^{l_{k+1}-1}} m_{l_{k+1}-1}^{q}(k) N$
$K_{i,l_{k+1}-1}^{F_{2^{l_{k+1}-1}}}$ | $\sum_{j=1}^{l_{k+1}} m\, (m_{j-1}^{q}(k))^2 (1 + m^{2^{j}})(j+1)/2$ | $\left( m^{2^{l_{k+1}-1}} m_{l_{k+1}-1}^{q}(k) \right)^2$
QR of $L_{i,l_{k+1}-1}^{F_{2^{l_{k+1}-1}}}(k)$ | $2 \left( m^{2^{l_{k+1}-1}} m_{l_{k+1}-1}^{q}(k) \right)^2 \left( N - m^{2^{l_k-1}} m_{l_{k+1}-1}^{q}(k)/3 \right)$ | $\left( m_{l_{k+1}}^{f}(k) \right)^2$
Compressed $K_{i,l_{k+1}-1}^{F_{2^{l_{k+1}-1}}}$ | $4 m_{l_{k+1}}^{f}(k) \left( m^{2^{l_{k+1}-1}} m_{l_{k+1}-1}^{q}(k) \right)^2$ | $\left( m_{l_{k+1}}^{f}(k) \right)^2$
$L_{i,l_{k+1}}^{Q}(k)$ | — | $m_{l_{k+1}}^{q}(k) N$
$K_{i,l_{k+1}}^{Q}$ | — | $\left( m_{l_{k+1}}^{q}(k) \right)^2$
$L_i^{R}(k+1)$ | $c\, m\, m_{l_{k+1}}^{q}(k) N$ | $\left[ (1+m) m_{l_{k+1}}^{q}(k) + m_c \right] N$
QR of $L_i^{R}(k+1)$ | $2 \left[ (1+m) m_{l_{k+1}}^{q}(k) + m_c \right]^2 \left( N - \left[ (1+m) m_{l_{k+1}}^{q}(k) + m_c \right]/3 \right)$ | $\left( m^{r}(k+1) \right)^2$
Compressed $K_i^{R}(k+1)$ | $4 m^{r}(k+1) \left[ (1+m) m_{l_{k+1}}^{q}(k) + m_c \right]^2$ | $\left( m^{r}(k+1) \right)^2$
$L_i^{(k+1)a}$ | $c\, m_{l_{k+1}}^{q}(k) N + 2 m_b m_{l_{k+1}}^{q}(k) (2N + m_{l_{k+1}}^{q}(k)) + 14 m_b^2 m_{l_{k+1}}^{q}(k)/3$ | $m_{l_{k+1}}^{q}(k) N$
$\bar{Q}_i^{(k+1)a}$ | — | $\left( m_{l_{k+1}}^{q}(k) + m_c \right) N$
Table 2. Comparison between ONM_lr and SN_HODLR in Example 1.
N | ONM_lr: It. | CPU | Rel_Res | SN_HODLR: It. | CPU | Rel_Res
10,000 | 4 | 6.28 | 1.83 × 10^-13 | 3 | 1319.5 | 1.77 × 10^-14
20,000 | 4 | 6.71 | 1.86 × 10^-13 | — | — | —
Table 3. CPU time and residual in Example 1.
N | It. | δt_k | t_k | Rel_Res | NC
50,000 | 1 | 0.370 | 0.370 | 1.34 × 10^-1 | 63
 | 2 | 1.613 | 1.983 | 3.26 × 10^-2 | 254
 | 3 | 2.367 | 4.350 | 4.59 × 10^-6 | 327
 | 4 | 2.697 | 7.029 | 1.86 × 10^-13 | 391
70,000 | 1 | 0.515 | 0.515 | 1.34 × 10^-1 | 63
 | 2 | 1.732 | 2.246 | 3.37 × 10^-2 | 250
 | 3 | 2.382 | 4.628 | 4.61 × 10^-6 | 320
 | 4 | 3.193 | 7.882 | 1.85 × 10^-13 | 394
90,000 | 1 | 0.648 | 0.648 | 1.34 × 10^-1 | 63
 | 2 | 2.204 | 2.852 | 3.42 × 10^-2 | 261
 | 3 | 4.032 | 6.884 | 4.82 × 10^-6 | 323
 | 4 | 4.721 | 11.606 | 1.85 × 10^-13 | 388
110,000 | 1 | 0.713 | 0.713 | 1.34 × 10^-1 | 63
 | 2 | 2.812 | 3.525 | 3.40 × 10^-2 | 262
 | 3 | 4.141 | 7.667 | 4.51 × 10^-6 | 309
 | 4 | 5.021 | 12.688 | 1.89 × 10^-13 | 391
Table 4. CPU time and residual in Example 2.
N | It. | δt_k | t_k | Rel_Res | NC
16,626 | 1 | 0.036 | 0.036 | 1.74 × 10^-4 | 31
 | 2 | 0.923 | 0.959 | 9.23 × 10^-9 | 68
 | 3 | 1.731 | 2.691 | 2.58 × 10^-15 | 122
Table 5. CPU time and residual in Example 3.
N | It. | δt_k | t_k | Rel_Res | NC
20,209 | 1 | 0.258 | 0.258 | 5.65 × 10^-4 | 48
 | 2 | 2.843 | 3.101 | 1.53 × 10^-9 | 88
 | 3 | 7.472 | 10.574 | 1.69 × 10^-15 | 103
79,841 | 1 | 0.458 | 0.458 | 8.72 × 10^-4 | 48
 | 2 | 5.098 | 5.556 | 2.34 × 10^-8 | 88
 | 3 | 13.912 | 19.468 | 4.25 × 10^-15 | 101