Abstract
Let be a matrix function over the field of complex numbers, where are a family of matrices with variable entries. The purpose of this paper is to propose and investigate the relationships between certain linear matrix functions that regularly appear in matrix theory and its applications. We derive a series of necessary and sufficient conditions for the collections of values of two given matrix functions to be equal through the use of selected formulas and facts regarding ranks, ranges, and generalized inverses of block matrix operations. As applications, we discuss some concrete topics concerning the algebraic connections between the general solutions of a given linear matrix equation and its reduced equations.
Keywords:
block matrix; general solution; generalized inverse; matrix equation; matrix expression; range; rank
MSC:
15A09; 15A10; 15A24
1. Introduction
Throughout this paper, we adopt the following notation: $\mathbb{C}^{m \times n}$ denotes the collection of all $m \times n$ matrices over the field of complex numbers; $A^{T}$ and $A^{*}$ denote the transpose and the conjugate transpose of $A \in \mathbb{C}^{m \times n}$, respectively; $r(A)$ denotes the rank of a matrix $A$; $\mathscr{R}(A)$ denotes the range of a matrix $A$; $I_m$ denotes the identity matrix of order $m$; $[A, B]$ denotes a partitioned matrix consisting of two submatrices $A$ and $B$; the Moore–Penrose generalized inverse of a matrix $A \in \mathbb{C}^{m \times n}$, denoted by $A^{\dagger}$, is defined as the unique matrix $X \in \mathbb{C}^{n \times m}$ that satisfies the following four Penrose equations:
$$(1)\ AXA = A, \quad (2)\ XAX = X, \quad (3)\ (AX)^{*} = AX, \quad (4)\ (XA)^{*} = XA.$$
In addition, we denote by $E_A = I_m - AA^{\dagger}$ and $F_A = I_n - A^{\dagger}A$ the two orthogonal projectors (Hermitian idempotent matrices) induced from $A \in \mathbb{C}^{m \times n}$. For more detailed information regarding the generalized inverses of matrices, we refer the reader to [1,2,3,4]. The Kronecker product of any two matrices $A = (a_{ij}) \in \mathbb{C}^{m \times n}$ and $B \in \mathbb{C}^{p \times q}$ is defined as $A \otimes B = (a_{ij}B) \in \mathbb{C}^{mp \times nq}$. The vectorization operator of a matrix $A = [a_1, \ldots, a_n] \in \mathbb{C}^{m \times n}$ is defined as $\operatorname{vec}(A) = [a_1^{T}, \ldots, a_n^{T}]^{T}$, where $a_i$ denotes the $i$-th column of $A$, $i = 1, \ldots, n$. A well-known property of the vec operator of a triple matrix product is $\operatorname{vec}(AXB) = (B^{T} \otimes A)\operatorname{vec}(X)$; see, e.g., [5,6].
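These definitions can be verified numerically. The following sketch (in Python with NumPy, our own illustration rather than part of the paper) checks the four Penrose equations for a pseudoinverse computed by `numpy.linalg.pinv`, the projector properties of $E_A$ and $F_A$, and the vec identity for a triple product:

```python
import numpy as np

rng = np.random.default_rng(0)

# A complex matrix and its Moore-Penrose inverse, computed by NumPy.
A = rng.standard_normal((4, 3)) + 1j * rng.standard_normal((4, 3))
Ap = np.linalg.pinv(A)

# The four Penrose equations that characterize A^+ uniquely.
assert np.allclose(A @ Ap @ A, A)              # (1) A A+ A = A
assert np.allclose(Ap @ A @ Ap, Ap)            # (2) A+ A A+ = A+
assert np.allclose((A @ Ap).conj().T, A @ Ap)  # (3) (A A+)* = A A+
assert np.allclose((Ap @ A).conj().T, Ap @ A)  # (4) (A+ A)* = A+ A

# The two orthogonal projectors induced from A are Hermitian and idempotent.
E_A = np.eye(4) - A @ Ap
F_A = np.eye(3) - Ap @ A
assert np.allclose(E_A @ E_A, E_A) and np.allclose(E_A.conj().T, E_A)
assert np.allclose(F_A @ F_A, F_A) and np.allclose(F_A.conj().T, F_A)

# vec identity for a triple product: vec(AXB) = (B^T kron A) vec(X),
# where vec stacks columns (column-major order, order='F' in NumPy).
X = rng.standard_normal((3, 5))
B = rng.standard_normal((5, 2))
vec = lambda M: M.reshape(-1, order="F")
assert np.allclose(vec(A @ X @ B), np.kron(B.T, A) @ vec(X))
```

Note that the column-stacking convention for vec must match the Kronecker factor order $B^{T} \otimes A$; a row-major reshape would silently break the identity.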
Consider a matrix function
where are matrices of appropriate sizes with variable entries from the field of complex numbers, and Z denotes the matrix value of the matrix function corresponding to the k variable matrices. Further, we denote the collection of all possible values of the function corresponding to the k variable matrices by
and call it the domain of the matrix function. Given such a matrix function, algebraists would like to know its algebraic properties and behavior and then employ them when solving problems related to matrix functions in theoretical and computational mathematics.
Given such a matrix function, there are some fundamental questions that we may necessarily ask:
- (I)
- When is the matrix value of (2) unique with respect to all the variable matrices?
- (II)
- What is the solvability condition of the matrix equation , and what is the general solution of when it is solvable?
- (III)
- Given two matrix functions and of the same size with the domains and , respectively, what are the necessary and sufficient conditions for the four statements to hold, respectively?
Theoretically speaking, concrete matrix functions can be arbitrarily constructed according to various ordinary algebraic operations of matrices, while algebraists can propose or encounter numerous specified matrix functions when dealing with theoretical and computational problems in matrix analysis and its applications. In comparison, linear matrix functions (LMFs) are a class of simple forms of all matrix functions, and they can be routinely defined according to the additions and multiplications of matrices. Let us just mention here a typical example of LMFs:
where , , and are given, and are variable matrices; . Correspondingly, the domain of the LMF is denoted by
The LMF in (5) includes many simple and well-known matrix expressions of this kind with variable entries as its special cases, such as , , and , as well as various partially specified matrices, such as , , , where the symbol * denotes an unspecified submatrix (cf. [7,8,9,10,11]).
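To illustrate how a partially specified matrix arises as a special case of an LMF of the form (5), here is a small sketch (our own construction, since the displayed expressions above are elided): the $2 \times 2$ pattern with specified diagonal entries and free off-diagonal entries is exactly the image of an LMF whose coefficient matrices are unit column vectors.

```python
import numpy as np

# Unit column vectors e1, e2 of C^2.
e1 = np.array([[1.0], [0.0]])
e2 = np.array([[0.0], [1.0]])

# Fixed part: the specified diagonal entries a = 5 and d = 7.
A = np.diag([5.0, 7.0])

def phi(x1, x2):
    """LMF of form A + B1 X1 C1 + B2 X2 C2 with 1x1 variable matrices.

    B1 = e1, C1 = e2^T places X1 in position (1,2);
    B2 = e2, C2 = e1^T places X2 in position (2,1)."""
    X1 = np.array([[x1]])
    X2 = np.array([[x2]])
    return A + e1 @ X1 @ e2.T + e2 @ X2 @ e1.T

Z = phi(3.0, -2.0)
# The diagonal is fixed, while the off-diagonal entries range freely:
assert np.allclose(Z, np.array([[5.0, 3.0], [-2.0, 7.0]]))
```

The domain of this LMF is precisely the set of matrices matching the pattern [[5, *], [*, 7]], which shows how partially specified matrices fit the framework of (5) and (6).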
Nowadays, we have some powerful algebraic tools and techniques for characterizing relationships between different matrix functions and matrix equations. Among them are the simple but surprisingly effective matrix rank method, as well as the matrix range method and the block matrix representation method. The purpose of this paper is to propose and study some fundamental problems related to the domains of certain specified cases of (5) through the organized employment of some known formulas and facts for ranges and ranks of matrices. As applications, we also discuss the connections among the general solutions of some linear matrix equations and their reduced linear matrix equations. This paper is organized as follows. In Section 2, we give some preliminary results and facts about generalized inverses, rank formulas, and matrix equations. In Section 3, we first present some known and new results and facts about the relationships between two matrix sets generated from (5), and then we discuss the relationships between the general solutions of some well-known linear matrix equations that occur in linear algebra and matrix analysis and their applications. Section 4 discusses the relationships between the general solutions of some basic linear matrix equations and their reduced equations.
2. Notation and Some Preliminary Results
As we know in linear algebra and matrix theory, a matrix is null if and only if its rank is zero. As a direct consequence of this elementary fact, two given matrices A and B of the same size are equal if and only if $r(A - B) = 0$. In view of this fact, we may figure out that if certain nontrivial algebraic formulas for calculating the rank of the difference are obtained, we can utilize them to describe essential links between the two matrices and, especially, to characterize the matrix equality in a convenient manner. A solid underpinning of this proposed method is that we are able to determine or compute the rank of a matrix through various elementary operations of matrices and to obtain analytical formulas for expressing the ranks of matrices in many cases. Recall, in addition, that block matrices and matrix equations are two types of basic conceptual objects in linear algebra. Correspondingly, the matrix rank method (for short, MRM), the block matrix representation method (for short, BMRM), and the matrix equation method (for short, MEM) are three basic and traditional analytic tools that are extensively employed in matrix theory and its applications because they give algebraists the capacity to construct and analyze various complicated matrix expressions and matrix equalities in a subtle and computationally tractable way. On the other hand, it has been realized since the 1960s that generalized inverses of matrices can be adopted to derive numerous exact and analytical expansion formulas for calculating the ranks of block matrices. In the following, we present a series of known equalities and facts about the ranks of matrices and matrix equations.
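As a numerical illustration of the matrix rank method (our own sketch, not taken from the paper), the equality test via the rank of a difference and one classical Marsaglia–Styan rank expansion for a column block matrix, $r[A, C] = r(A) + r(E_A C)$, can be checked as follows:

```python
import numpy as np

rng = np.random.default_rng(1)
r = np.linalg.matrix_rank

# Two matrices of the same size are equal iff the rank of their difference is 0.
A = rng.standard_normal((4, 3))
B = A.copy()
assert r(A - B) == 0          # A equals B
B[0, 0] += 1.0
assert r(A - B) == 1          # a rank-one perturbation, so A differs from B

# A classical rank expansion for a column block matrix (Marsaglia-Styan):
#   r([A, C]) = r(A) + r(E_A C),  where E_A = I - A A^+.
A = rng.standard_normal((5, 2))
C = rng.standard_normal((5, 3))
E_A = np.eye(5) - A @ np.linalg.pinv(A)
assert r(np.hstack([A, C])) == r(A) + r(E_A @ C)
```

The expansion holds for any A and C with the same number of rows; here the generic random instance gives 5 = 2 + 3.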
Lemma 1
([12]). Let and Then,
In particular, the following results hold.
- (a)
- (b)
- (c)
Lemma 2
([13]). Let and denote Then,
In particular, the following three statements are equivalent:
- (a)
- (b)
- (c)
Lemma 3
([14]). Let
be a given linear matrix equation, where and are known matrices, and is an unknown matrix. Then, the following four statements are equivalent:
- (a)
- (b)
- (c)
- (d)
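The displayed equation and the four conditions of Lemma 3 are elided in this version. Assuming the lemma concerns the classical one-sided equation $AX = B$, as in Penrose [14], a minimal numerical sketch of two of the equivalent solvability tests ($AA^{\dagger}B = B$ and $r[A, B] = r(A)$) and of the general solution $X = A^{\dagger}B + (I - A^{\dagger}A)U$ is:

```python
import numpy as np

rng = np.random.default_rng(2)

# A rank-deficient A (5x3, rank 2) and a consistent right-hand side B = A X0.
A = rng.standard_normal((5, 2)) @ rng.standard_normal((2, 3))
X0 = rng.standard_normal((3, 4))
B = A @ X0
Ap = np.linalg.pinv(A)

# Two equivalent solvability tests for A X = B.
assert np.allclose(A @ Ap @ B, B)                                     # A A+ B = B
assert np.linalg.matrix_rank(np.hstack([A, B])) == np.linalg.matrix_rank(A)

# General solution: X = A+ B + (I - A+ A) U for an arbitrary U.
U = rng.standard_normal((3, 4))
X = Ap @ B + (np.eye(3) - Ap @ A) @ U
assert np.allclose(A @ X, B)
```

Since A is rank-deficient, $I - A^{\dagger}A$ is nonzero and different choices of U generate genuinely different solutions, all satisfying the equation.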
Lemma 4
([14]). Let
be a two-sided linear matrix equation, where and are given, and is an unknown matrix.
Then, the following four statements are equivalent:
- (a)
- (b)
- Both and
- (c)
- Both and
- (d)
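The matrices in Lemma 4 are likewise elided here. Assuming, as in Penrose [14], that the two-sided equation is $AXB = C$, the following sketch (our own illustration) checks the classical solvability condition $AA^{\dagger}CB^{\dagger}B = C$ and the general solution $X = A^{\dagger}CB^{\dagger} + U - A^{\dagger}AUBB^{\dagger}$:

```python
import numpy as np

rng = np.random.default_rng(3)

# Rank-deficient A (4x3, rank 2) and B (3x5, rank 2), and consistent C = A X0 B.
A = rng.standard_normal((4, 2)) @ rng.standard_normal((2, 3))
B = rng.standard_normal((3, 2)) @ rng.standard_normal((2, 5))
X0 = rng.standard_normal((3, 3))
C = A @ X0 @ B

Ap, Bp = np.linalg.pinv(A), np.linalg.pinv(B)

# Penrose's solvability test: A A+ C B+ B = C.
assert np.allclose(A @ Ap @ C @ Bp @ B, C)

# General solution: X = A+ C B+ + U - A+ A U B B+, with U arbitrary.
U = rng.standard_normal((3, 3))
X = Ap @ C @ Bp + U - Ap @ A @ U @ B @ Bp
assert np.allclose(A @ X @ B, C)
```

Expanding $AXB$ shows why the family works: the two U-terms cancel after multiplication by A and B, leaving $AA^{\dagger}CB^{\dagger}B = C$.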
Lemma 5
([15]). The linear matrix equation
is solvable for the two unknown matrices and of appropriate sizes if and only if
hold, or equivalently,
holds. In particular, (15) holds for all matrices and if and only if
Lemma 6
([16]). The linear matrix equation
is solvable for the two unknown matrices and of appropriate sizes if and only if the following four matrix rank equalities
hold, or equivalently, the following four matrix equalities
hold, where and
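The four rank equalities of Lemma 6 are elided in this version, but solvability of the equation $AXB + CYD = E$ can always be verified numerically through the vec identity recalled in Section 1: the equation is consistent exactly when $\operatorname{vec}(E)$ lies in the column space of $[B^{T} \otimes A,\ D^{T} \otimes C]$. A small sketch (our own construction, with generic random matrices):

```python
import numpy as np

rng = np.random.default_rng(4)
r = np.linalg.matrix_rank
vec = lambda M: M.reshape(-1, order="F")

# Small coefficient matrices and a consistent right-hand side E.
A, B = rng.standard_normal((3, 2)), rng.standard_normal((2, 4))
C, D = rng.standard_normal((3, 2)), rng.standard_normal((2, 4))
X0, Y0 = rng.standard_normal((2, 2)), rng.standard_normal((2, 2))
E = A @ X0 @ B + C @ Y0 @ D

# AXB + CYD = E  <=>  [B^T kron A, D^T kron C] [vec(X); vec(Y)] = vec(E).
M = np.hstack([np.kron(B.T, A), np.kron(D.T, C)])
assert r(np.column_stack([M, vec(E)])) == r(M)          # consistent

# A generic right-hand side fails the same test.
E_bad = rng.standard_normal((3, 4))
assert r(np.column_stack([M, vec(E_bad)])) == r(M) + 1  # inconsistent
```

This vectorized test is a brute-force check; the point of the rank equalities in Lemmas 5 and 6 is that they express the same condition directly in terms of the small matrices A, B, C, D, and E.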
Lemma 7
([17,18]). Equation (18) holds for all matrices and of appropriate sizes if and only if any one of the following four block matrix equalities
holds.
Lemma 8
([19]). The linear matrix equation
is solvable for the four unknown matrices and of appropriate sizes if and only if the following four matrix rank equalities hold:
3. Main Results
We start by presenting two groups of fundamental results and facts regarding the relationships between two matrix sets generated from the two simplest cases in (5).
Lemma 9
([20]). Assume that two LMFs and their domains are given by
where and are known matrices, and and are variable matrices. Then, we have the following results.
- (a)
- i.e., there exist and such that if and only if
- (b)
- if and only if .
- (c)
- if and only if .
Lemma 10
([20]). Assume that two LMFs and their domains are given by
where and are known matrices, and are variable matrices; Then, we have the following results.
- (a)
- if and only if the following four conditions hold:
- (b)
- if and only if any one of the following three conditions holds:
- (i)
- and
- (ii)
- and
- (iii)
- and
- (c)
- if and only if any one of the following five conditions holds:
- (i)
- and
- (ii)
- and
- (iii)
- and
- (iv)
- and
- (v)
- and
As an extension of these known facts, we proceed to derive the following results and facts about the relationships between the domains of two general matrix functions, which we shall use in the latter part of the article.
Theorem 1.
Assume that two LMFs and their domains are given by
where and are known matrices. Then, we have the following results.
- (a)
- if and only if the following four conditions hold:
- (b)
- if and only if any one of the following four conditions holds:
- (c)
- if and only if any one of the following four groups of conditions holds:
- (d)
- if and only if both (b) and (c) hold.
Proof.
The condition is obviously equivalent to for some variable matrices , , , and . We rewrite this equation as
In this case, applying Lemma 8 to (44) leads to (a).
By (15)–(17) and (44), the condition is equivalent to the fact that
holds for all variable matrices and . Further, by Lemma 7, the matrix equality in (45) holds for all variable matrices and if and only if the following four matrix equalities,
hold, where
hold by Lemma 1(c). Substituting these four matrix rank equalities into (46)–(49) leads to the equivalences of (36)–(39) and (46)–(49), respectively.
Applying (19)–(21) to (44), we see that the condition holds if and only if any one of the following four equations,
holds for all matrices and , where and . By Lemma 7, the matrix equality in (50) holds for all matrices and if and only if any one of the following two conditions holds:
which are further equivalent to
by (7), (9), and Lemma 1(a) and (c), as is required for (40); the matrix equality in (51) holds for all matrices and if and only if any one of the following three conditions holds:
which are further equivalent to
by (7), (9), and Lemma 1(a) and (c), thus establishing (41); the matrix equality in (52) holds for all matrices and if and only if any one of the following three conditions holds:
These three equalities are further equivalent to
by (7), (9), and Lemma 1(a) and (c), as is required for (42); the matrix equality in (53) holds for all matrices and if and only if any one of the following four conditions holds:
which are further equivalent to
by (8), (9), and Lemma 1(b) and (c), thus establishing (43). □
It has been well known since Penrose [14] that general solutions of linear matrix equations can be derived and represented by certain algebraic linear matrix expressions that are composed of the given matrices in the matrix equations and their generalized inverses. In view of this basic fact, we are able to employ the preceding formulas, results, and facts to describe and characterize various possible relationships between solutions of different linear matrix equations.
We remark that there exist many types of linear matrix equations for which we can represent their general solutions in certain explicit linear matrix functions, as given in (54). In this section, we present a selection of results and facts on the relationships between certain linear transformations of the solutions of some fundamental linear matrix equations.
Theorem 2.
Assume that the two linear matrix equations
are solvable for and respectively, where and are given. We also denote
where and are given; Then, we have the following results.
- (a)
- if and only if
- (b)
- if and only if
- (c)
- if and only if
Proof.
By Lemma 3, the general solutions of the two linear matrix equations in (54) can be expressed as
where and are arbitrary matrices. Then, the two sets in (55) can be represented as
Applying Lemma 9(a) to (57), we obtain that if and only if
where by (8),
Substitution of (59) and (60) into (58) yields
thus establishing (a).
Corollary 1.
Assume that and in (54) are solvable for and respectively, and denote
Then, we have the following results.
- (a)
- (b)
- if and only if i.e., and
- (c)
- if and only if i.e., and
Corollary 2.
Let and be given, and suppose that is solvable for In addition, we denote
where and are two given matrices. Then, the following results hold.
- (a)
- always holds.
- (b)
- if and only if
Corollary 3.
Let and be given, and suppose that is solvable for In addition, we denote
where Then, we have the following results.
- (a)
- always holds.
- (b)
- if and only if
4. Relationships between Solutions of Some Linear Matrix Equations and Their Reduced Equations
Let us partition the matrix equation in (11) as
where , , and are unknown matrices with and , . In this case, pre-multiplying (66) by yields the following reduced linear matrix equations:
where . Now, assume that (66) is solvable for X. Then, the equations in (67) are solvable for . Correspondingly, we denote
for the matrix sets composed of the partial solutions of (66) and (67), respectively, , and denote
In this section, we first discuss the relationships between and in (68), , as well as the two sets in (69).
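The partitioned blocks in (66) and (67) are elided in this version. Assuming (66) is a row-partitioned equation of the form $[A_1; A_2]X = [B_1; B_2]$, pre-multiplying by the block selectors $[I, 0]$ and $[0, I]$ yields the reduced equations $A_iX = B_i$. The basic inclusion discussed in this section, namely that every solution of the full equation solves each reduced equation while the converse generally fails, can be illustrated numerically (our own sketch):

```python
import numpy as np

rng = np.random.default_rng(5)

# A row-partitioned consistent system [A1; A2] X = [B1; B2].
A1, A2 = rng.standard_normal((2, 4)), rng.standard_normal((2, 4))
X_true = rng.standard_normal((4, 3))
B1, B2 = A1 @ X_true, A2 @ X_true

A = np.vstack([A1, A2])
B = np.vstack([B1, B2])

# Any solution of the full equation solves both reduced equations A_i X = B_i.
X = np.linalg.pinv(A) @ B      # one particular solution of A X = B
assert np.allclose(A1 @ X, B1) and np.allclose(A2 @ X, B2)

# The converse fails in general: a solution of A1 X = B1 alone need not
# satisfy A2 X = B2, since A1 has a larger null space than A.
F1 = np.eye(4) - np.linalg.pinv(A1) @ A1
X1 = np.linalg.pinv(A1) @ B1 + F1 @ rng.standard_normal((4, 3))
assert np.allclose(A1 @ X1, B1)
assert not np.allclose(A2 @ X1, B2)
```

In set language: the solution set of (66) is contained in the intersection of the solution sets of the reduced equations, and the results below characterize when such inclusions become equalities.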
Theorem 3.
Proof.
Theorem 4.
Assume that the matrix equation in (66) is solvable for and let and be as given in (67) and (69), respectively, for Then, we have the following results.
- (a)
- always holds.
- (b)
- The following four statements are equivalent:
- (i)
- (ii)
- (iii)
- (iv)
Proof.
By Lemma 3, the general solutions of (67) are given by
where are arbitrary, . Substitution of (12) and (71) into (69) yields
Applying Lemma 9(b) to (72) and (73), we see that if and only if
where by (8), we obtain
and
Both (75) and (76) mean that (74) is an identity, thus establishing (a). Substitution of (71) into (66) gives
It is obvious that holds if and only if the matrix equation in (77) holds for all , which, by Lemma 3, is equivalent to
where by (8), we obtain
Substituting it into (78), we see that (78) is equivalent to the rank equality . Combining this fact with (a) leads to the equivalence of (i) and (ii) in Result (b). The equivalences of (ii), (iii), and (iv) in Result (b) follow from Lemma 2. □
Equation (18) is well known in matrix theory and its applications, while the solvability condition and the general solution of this equation were precisely established by implementing certain calculations of the ranks, ranges, and generalized inverses of the given matrices in this matrix equation; see, e.g., [15,16,18,21,22] and the relevant literature quoted there.
It is easy to see that we can construct from (18) some small or transformed linear matrix equations. For instance, pre- and post-multiplying (18) by and , respectively, yields the following four reduced matrix equations:
respectively. Each of (79)–(82) is consistent as well if the matrix equation in (18) is consistent. Concerning the relationships among the solutions of (18) and (79)–(82), we have the following results.
Theorem 5.
Assume that the matrix equation in (18) is solvable for and and we denote by
the collections of all pairs of solutions of (18) and (79)–(82), respectively. Then, we have the following results.
- (a)
- always hold;
- (b)
- if and only if or
- (c)
- if and only if or or and
- (d)
- if and only if or or and
- (e)
- if and only if or
Proof.
Result (a) follows directly from (79)–(82). By Lemma 4, the general solutions of (79)–(82) are given by
respectively, where and are arbitrary matrices; . Substitution of (88)–(91) into (18) gives the following four matrix equations:
respectively. By Lemma 7, the equality in (92) holds for all and if and only if any one of the following four equalities holds:
In addition, it is easy to verify that the ranks of the left-hand sides of (96)–(99) are given by
Combining (96)–(99) with (101)–(104) leads to the equivalence in (b).
By Lemma 7, the equality in (93) holds for all and if and only if any one of the following four equalities holds:
In addition, it is easy to verify that the ranks of the left-hand sides of (105)–(108) are given by
Combining (105)–(108) with (109)–(112) leads to the equivalence in (c). Results (d) and (e) can be established with a similar approach. □
Theorem 6.
Assume that the matrix equation in (18) is solvable, and let
Then, we have the following results.
- (a)
- The matrix set equalities always hold;
- (b)
- always holds.
- (c)
- if and only if
Proof.
By the vec operation of matrices, (18) can be equivalently expressed as
which is a special case of (66). Pre-multiplying (119) with , , yields the following two reduced linear matrix equations:
Now, we denote
Then, we obtain from Theorem 3 that
always hold. On the other hand, it is easy to verify that
and
hold. Thus, the two equations in (120), expressed via the matrix vectorization operations, are equivalent to
respectively. Hence, the two set equalities in (124) are equivalent to the set equalities in (a). Results (b) and (c) follow from applying Theorem 4 to (119). □
5. Conclusions
In the preceding sections, we described and studied some relationships between two different linear matrix functions and presented a series of clear explanations and solutions to some concrete problems in this subject area. The whole work covers some principal cases that are often encountered in the theory of matrix functions and their applications, while the results and facts obtained reveal the inherent properties of some basic linear matrix functions and their connections. Notice that the derivations of the main results are based on various precise algebraic calculations of the ranks of certain block matrices related to the matrix equalities and matrix set inclusions, which substantially avoid certain complicated matrix operations occurring in matrix expressions and matrix equalities. Hence, they clearly demonstrate that the matrix rank method and the block matrix representation method are useful and effective for solving various matrix equality and matrix set inclusion problems. These two fundamental algebraic methods have been recognized as reliable and efficient tools in the descriptions and investigations of many matrix problems in theoretical and computational mathematics.
Finally, we remark that the results and facts in this article offer insights into various intrinsic links among different matrix functions, and we believe that this study will be helpful in further explorations of the connections among domains of general matrix functions under various specified assumptions.
Author Contributions
Methodology, Y.T.; Validation, R.Y.; Formal analysis, Y.T. and R.Y.; Investigation, Y.T. and R.Y.; Writing—original draft, Y.T.; Writing—review and editing, Y.T. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Acknowledgments
The two authors would like to thank the four referees for their helpful comments and suggestions on an earlier version of this article.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Ben-Israel, A.; Greville, T.N.E. Generalized Inverses: Theory and Applications, 2nd ed.; Springer: New York, NY, USA, 2003.
- Bernstein, D.S. Scalar, Vector, and Matrix Mathematics: Theory, Facts, and Formulas–Revised and Expanded Edition; Princeton University Press: Princeton, NJ, USA; Oxford, UK, 2018.
- Campbell, S.L.; Meyer, C.D. Generalized Inverses of Linear Transformations; SIAM: Philadelphia, PA, USA, 2009.
- Rao, C.R.; Mitra, S.K. Generalized Inverse of Matrices and Its Applications; Wiley: New York, NY, USA, 1971.
- Horn, R.; Johnson, C.R. Topics in Matrix Analysis; Cambridge University Press: Cambridge, UK, 1991.
- Steeb, W.-H. Matrix Calculus and Kronecker Product with Applications and C++ Programs; World Scientific Publishing: Singapore, 1997.
- Bakonyi, M.; Woerdeman, H.J. Matrix Completions, Moments, and Sums of Hermitian Squares; Princeton University Press: Princeton, NJ, USA, 2011.
- Dancis, J. Choosing the inertias for completions of certain partially specified matrices. SIAM J. Matrix Anal. Appl. 1993, 14, 813–829.
- Gohberg, I.; Kaashoek, M.A.; Schagen, F.V. Partially Specified Matrices and Operators: Classification, Completion, Applications; Birkhäuser: Basel, Switzerland, 1995.
- Jordán, C.; Torregrosa, J.R.; Urbano, A. On the Jordan form of completions of partial upper triangular matrices. Linear Algebra Appl. 1997, 254, 241–250.
- Krupnik, M. Geometric multiplicities of completions of partial triangular matrices. Linear Algebra Appl. 1995, 220, 215–227.
- Marsaglia, G.; Styan, G.P.H. Equalities and inequalities for ranks of matrices. Linear Multilinear Algebra 1974, 2, 269–292.
- Tian, Y. Formulas for calculating the dimensions of the sums and the intersections of a family of linear subspaces with applications. Contrib. Algebra Geom. 2019, 60, 471–485.
- Penrose, R. A generalized inverse for matrices. Math. Proc. Camb. Philos. Soc. 1955, 51, 406–413.
- Baksalary, J.K.; Kala, R. The matrix equation AXB + CYD = E. Linear Algebra Appl. 1980, 30, 141–147.
- Özgüler, A.B. The matrix equation AXB + CYD = E over a principal ideal domain. SIAM J. Matrix Anal. Appl. 1991, 12, 581–591.
- Jiang, B.; Tian, Y. Necessary and sufficient conditions for nonlinear matrix identities to always hold. Aequ. Math. 2019, 93, 587–600.
- Tian, Y. Upper and lower bounds for ranks of matrix expressions using generalized inverses. Linear Algebra Appl. 2002, 355, 187–214.
- Tian, Y. Solvability of two linear matrix equations. Linear Multilinear Algebra 2000, 48, 123–147.
- Tian, Y. Relations between matrix sets generated from linear matrix expressions and their applications. Comput. Math. Appl. 2011, 61, 1493–1501.
- Tian, Y. Ranks and independence of solutions of the matrix equation AXB + CYD = M. Acta Math. Univ. Comen. 2006, 75, 75–84.
- Xu, G.; Wei, M.; Zheng, D. On solution of matrix equation AXB + CYD = F. Linear Algebra Appl. 1998, 279, 93–109.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).