The Algebraic Theory of Operator Matrix Polynomials with Applications to Aeroelasticity in Flight Dynamics and Control

Bekhiti, Belkacem; Hariche, Kamel; Zaitsev, Vasilii; Duan, Guangren R.; Sharkawy, Abdel-Nasser

doi:10.3390/mca30060131

Open AccessArticle

The Algebraic Theory of Operator Matrix Polynomials with Applications to Aeroelasticity in Flight Dynamics and Control

by

Belkacem Bekhiti

¹

,

Kamel Hariche

²,

Vasilii Zaitsev

³

,

Guangren R. Duan

⁴ and

Abdel-Nasser Sharkawy

^5,6,*

¹

Institute of Aeronautics and Space Studies (IASS), University of Blida 1, Blida 09000, Algeria

²

Institute of Electrical and Electronic Engineering (IGEE), University of Boumerdes, Boumerdes 35000, Algeria

³

Laboratory of Mathematical Control Theory, Udmurt State University, Izhevsk 426034, Russia

⁴

Center for Control Theory and Guidance Technology, Harbin Institute of Technology, Harbin 150001, China

⁵

Mechanical Engineering Department, Faculty of Engineering, Qena University, Qena 83523, Egypt

⁶

Mechanical Engineering Department, College of Engineering, Fahad Bin Sultan University, Tabuk 47721, Saudi Arabia

^*

Author to whom correspondence should be addressed.

Math. Comput. Appl. 2025, 30(6), 131; https://doi.org/10.3390/mca30060131

Submission received: 16 October 2025 / Revised: 25 November 2025 / Accepted: 26 November 2025 / Published: 29 November 2025

Download

Browse Figures

Versions Notes

Abstract

This paper develops an algebraic framework for operator matrix polynomials and demonstrates its application to control-design problems in aeroservoelastic systems. We present constructive spectral-factorization and linearization tools (block spectral divisors, companion forms and realization algorithms) that enable systematic block-pole assignment for large-scale MIMO models. Building on this theory, an adaptive block-pole placement strategy is proposed and cast in a practical implementation that augments a nominal state-feedback law with a compact neural-network compensator (single hidden layer) to handle un-modeled nonlinearities and uncertainty. The method requires state feedback and the system’s nominal model and admits Laplace-domain analysis and straightforward implementation for a two-degree-of-freedom aeroelastic wing with cubic stiffness nonlinearity and Roger aerodynamic lag is validated in MATLAB R2023a. Comprehensive simulations (Runge–Kutta 4) for different excitations and step disturbances demonstrate the approach’s advantages: compared with Eigenstructure assignment, LQR and

H_{2}

-control, the proposed method achieves markedly better robustness and transient performance (e.g., closed-loop

{‖H (i ω)‖}_{2}

≈ 4.64, condition number χ ≈ 11.19, and reduced control efforts μ ≈ 0.41, while delivering faster transients and tighter regulation (rise time ≈ 0.35 s, settling time ≈ 1.10 s, overshoot ≈ 6.2%, steady-state error ≈ 0.9%, disturbance-rejection ≈ 92%). These results confirm that algebraic operator-polynomial techniques, combined with a compact adaptive NN augmentation, provide a well-conditioned, low-effort solution for robust control of aeroelastic systems.

Keywords:

operator matrix polynomials; λ-matrices; spectral divisors; block-pole assignment; matrix fraction description (MFD); neural adaptive control; aeroelasticity; flight dynamics; robust control; aeroservoelastic systems

MSC:

15A21; 34K35; 15A04; 15A22; 34A37; 47L05

1. Introduction

The algebraic theory of operator and matrix polynomials has emerged as a central tool in both pure mathematics and control engineering, providing systematic approaches for modeling, analysis, and synthesis of large-scale multivariable systems. Recent progress has shown its versatility in problems ranging from recursive inversion algorithms for matrix polynomials [1], matrix fraction descriptions in large-scale descriptor systems [2], and spectrum assignment via static output feedback [3], to block-pole placement strategies [4,5] and generalized spectrum assignment for bilinear and time-delay systems [6,7]. These advances build on the foundational contributions of Vardulakis [8], Vayssettes [9], Sugimoto [10], Kurbatov [11], Cohen [12], Hariche and Denman [13], Pereira [14], Chen [15], and Bekhiti [16,17], who developed λ-matrix formulations and algebraic strategies for MIMO system design. Classic works on singular systems [18], matrix polynomials [19], matrix functions [20], and linear system theory [21] established the mathematical background for these developments, later consolidated in doctoral and postdoctoral contributions on λ-matrices and block decomposition [22]. Recent applications further extend into neural adaptive control [23], block-companion forms [24], matrix theory [25], interpolation [26], spectral operator pencils [27,28], and Newton-based algorithms for polynomial equations [29].

Parallel to these algebraic advances, control of aeroelastic and aeroservoelastic systems has attracted significant attention. Neural-network-based identification of nonlinear aeroelastic models [30], robust flutter suppression by

H_{\infty}

control [31], fault-tolerant wing control [32], eigenstructure-based aircraft control [33], and integral LQR schemes for micro aerial vehicles [34] demonstrate the practical relevance of advanced control methods in flight dynamics. Yet, despite extensive theoretical foundations and application-driven advances, a clear gap remains: there is still a lack of unified algebraic frameworks that translate operator polynomial theory into numerically conditioned, practically implementable controllers for aeroelastic systems, while also addressing robustness, nonlinearities, and disturbance rejection in realistic flight scenarios.

This work aims to fill this gap by presenting a rigorous algebraic framework for operator matrix polynomials and applying it to the control of aeroelastic systems in flight dynamics. The main contributions are as follows:

(i): Development of constructive tools for spectral factorization, companion forms, and block-pole assignment within an operator-theoretic setting.
(ii): Proposal of an adaptive block-pole placement strategy enhanced with a compact neural compensator to account for nonlinearities/uncertainties.
(iii): Demonstration of the method’s effectiveness on a nonlinear aeroelastic wing section model, where it outperforms benchmark strategies such as eigenstructure assignment, LQR, and $H_{2}$ -control in terms of robustness, transient performance, and control effort.

The remainder of the paper is structured as follows. Section 2 recalls the fundamentals of matrix algebra and linear vector spaces that underpin the subsequent developments. Section 3 addresses matrix polynomials (λ-matrices) and spectral divisors, while Section 4 presents their standard structures and realization forms. Section 5 is devoted to the determination of operator roots (spectral factors), followed by Section 6, which discusses transformations between solvents and spectral factors. Section 7 introduces matrix fraction description (MFD) realizations and transformations between canonical forms. Section 8 develops the proposed control-design strategies, and Section 9 demonstrates their application to aeroelastic systems in flight dynamics. Finally, Section 10 concludes the paper and outlines perspectives for future research.

2. Fundamentals of Matrix Algebra and Linear Vector Spaces

Let

C

denote the field of complex numbers,

R

represent the field of real numbers,

C^{m \times n}

denote the set

m \times n

matrices over

C

, and

R^{m \times n}

is the set of matrices with real entries. Unless stated otherwise, all matrices will be in

C^{m \times n}

. The column vector in the vector space

C^{n} = C^{1 \times n}

will be denoted as

u, x, e t c .

If

A \in C^{m \times n}

, we use

A^{H}

for the conjugate transpose of

A

. For vectors

x, y \in C^{n}

we employ the usual inner product

〈x, y〉 = y^{⟙} x

; the norm of vector

x \in C^{n}

is the Euclidean norm,

‖x‖ = {〈x, x〉}^{1 / 2}

. For matrices

A \in C^{n \times n}

, we use the operator norm

‖A‖ = \sup \{‖A x‖ : ‖x‖ = 1\}

. A subspace

M

of

C^{n}

is called the invariant subspace or

A

-invariant if

A x \in M

for every

x \in M

. If

M

is a subspace of

C^{n}

,

\dim M

denotes the dimension of

M

. If

A \in C^{m \times n}

, the range (column space) of

A

is denoted by

R (A) = \{y : y = A x\}

, and the null space of

A

, by

N (A) = \{x : A x = 0\}

. Recall that

R (A) + N (A) = C^{n}

and

\dim R (A) + \dim N (A) = n

. Let

M_{1}, \dots, M_{s}

be subspaces of

C^{n}

, the sum of these subspaces

M_{1} + \dots + M_{s} = \{z = x_{1} + \dots + x_{s} : x_{i} \in M_{i}\}

is the subspace. If

M_{i} \cap M_{j} = \{0\}

for

i \neq j

, the subspaces are said to be independent, and the sum is then called a direct sum and we write

⨁_{i = 1}^{s} M_{i} = M_{1} ⨁ \dots ⨁ M_{s}

. Recall that

\dim (⨁_{i = 1}^{s} M_{i}) = ⅀_{i = 1}^{s} (\dim M_{i})

and if

x \in M_{1} ⨁ \dots ⨁ M_{s}

, then there exists a unique

x_{i} \in M_{i}

such that

x = x_{1} + \dots + x_{s}

. A projection is a matrix

P \in C^{n \times n},

such that

P^{2} = P

. It is easily seen that

R (P) + N (P) = C^{n}

. Conversely, if

C^{n} = M ⨁ N,

there exists a unique

P

, such that

R (P) = M

and

N (P) = N

; we denote this projection by

P_{M, N}

, the projection onto

M

along

N

. If

M

is a subspace of

C^{n}

, the orthogonal complement of

M

is

M^{⊥} = \{x \in C^{n} : 〈x, y〉 = 0 for all y \in M\}

.

M^{⊥}

is a subspace and

M ⨁ M^{⊥} = C^{n}

.

P_{M, M^{⊥}}

is denoted by

P_{M}

. If

A \in C^{m \times n}

, there exists a unique matrix

A^{+} \in C^{n \times m},

which satisfies:

A A^{+} A = A, A^{+} A A^{+} = A^{+}

and also

A A^{+} = P_{R (A)}, A^{+} A = P_{R (A^{H})}

. The matrix

A^{+}

is the Moore–Penrose (Generalized) inverse of

A

. If

A x = b

is consistent, then

A^{+} b

is a solution (in fact, min-norm solutions) and all solutions are given by

x = A^{+} b + (I - A^{+} A) h

, where

h

is arbitrary. Also, the Moore–Penrose inverse (or the pseudoinverse) can be defined as follows:

A^{+} = \underset{δ \to 0}{l i m} {(A^{⟙} A + δ^{2} I)}^{- 1} A^{⟙} = \underset{δ \to 0}{l i m} A^{⟙} {(A A^{⟙} + δ^{2} I)}^{- 1}

. We shall often make use of block matrices. In particular, if

A \in C^{n \times n}

is block-diagonal, that is,

A

has blocks

A_{1}, A_{2} \dots, A_{s}

along the main diagonal and zero blocks elsewhere, we write

A = d i a g (A_{1}, \dots, A_{s})

or

A = b l o c k d i a g (A_{1}, \dots, A_{s}) = ⨁_{i = 1}^{s} A_{i}

. The eigenvalues of

A \in C^{n \times n}

are the roots of the polynomial

Δ (λ) = \det (λ I - A)

. The spectrum of

A

is the set of eigenvalues of

A

and is denoted by

σ (A)

. The spectrum radius of a square matrix

A

is

ρ (A) = \sup \{|λ| : λ \in σ (A)\}

. If

λ_{i}

is a root of multiplicity

m_{i}

of

Δ (λ)

, we say that

λ_{i}

is an eigenvalue of

A

of algebraic multiplicity

m_{i}

. The geometric multiplicity of

λ_{i}

is the number of associated independent eigenvectors

m_{g} = n - r a n k (λ I - A) = \dim N (λ I - A)

. If

λ \in σ (A)

has algebraic multiplicity

m

, then

1 < \dim N (λ_{i} I - A) < m_{i}

. Thus, if we denote the geometric multiplicity of

λ_{i}

by

g_{i}

, then we must have

1 < g_{i} < m_{i}

. A matrix

A \in C^{n \times n}

is said to be defective if it has an eigenvalue whose geometric multiplicity is not equal to (i.e., less than) its algebraic multiplicity. Equivalently,

A

is said to be defective if it does not have

n

linearly independent (right or left) eigenvectors [18,20,25].

If

A, B \in C^{n \times n},

we say that

A

is similar to

B

in the case that there exists a nonsingular matrix

T

such that

A = T B T^{- 1}

. Similar matrices represent the same linear operator on

C^{n}

but with respect to different bases. We shall also make use of the fact that every

A \in C^{n \times n}

is similar to a matrix in Jordan canonical form, that is,

A

is similar to

J = b l o c k d i a g (J_{1}, \dots, J_{l})

and

J_{i} = λ_{i} I_{k_{i}} + N_{k_{i}}

with

N_{k_{i}} = ⅀_{j = 1}^{k_{i} - 1} E_{j, j + 1}

, and

E_{j, j + 1}

are the standard basis matrices (matrix unit). There may be more than one such block corresponding to the eigenvalue

λ_{i}

. The numerical range of

A

is

w (A) = \{〈A x, x〉 : ‖x‖ = 1\}

, and the numerical radius of

A

is

r (A) = \sup \{|λ| : λ \in w (A)\}

.

r (A)

is a compact convex set which contains

σ (A)

. In general,

r (A)

may be larger than the convex hull of

σ (A)

. However, it is possible to find an invertible matrix

T

, such that

w (T A T^{- 1})

is as close as desired to the convex hull of

σ (A)

. A Hermitian matrix

A \in C^{n \times n}

is positive semi-definite if

〈A x, x〉 \geq 0

for all

x \in C^{n}

. If

A

is a positive semi-definite, then it has a unique positive semi-definite square root, which we denote as

A^{1 / 2}

, that is,

{(A^{1 / 2})}^{2} = A

. If

E \in C^{n \times n}

is a complex matrix, then the

i n d e x

of

E

, denoted by

i n d (E)

, is the smallest nonnegative integer

k,

such that

r a n k (E^{k}) = r a n k (E^{k + 1})

. For nonsingular matrices,

i n d (E) = 0

. For singular matrices,

i n d (E)

is the smallest positive integer

k

, such that either of the following two statements is true:

R (E^{k}) \cap N (E^{k}) = 0

and

C^{n} = R (E^{k}) ⨁ N (E^{k})

. The matrix

N_{n \times n}

is said to be nilpotent whenever

N^{k} = 0

for some positive integer

k

.

k = i n d (N)

is the smallest positive integer, such that

N^{k} = 0

(some authors refer to

i n d (N)

as the index of nilpotency). If

A

is an

n \times n

singular matrix of index

k

, such that

r a n k (A^{k}) = r

, then there exists a nonsingular matrix

Q,

such that

Q A Q^{- 1} = C_{r \times r} ⨁ N

in which

C_{r \times r}

is nonsingular, and

N

is nilpotent of index k. This last block-diagonal matrix is called a core-nilpotent decomposition of

A

. When

A

is nonsingular,

k = 0

and

r = n

, such that

N

is not present, and then we can set

Q = I

and

C = A

[15]. Inverting the nonsingular core

C_{r \times r}

and neglecting the nilpotent part

N

in the core-nilpotent decomposition produces a natural generalization of matrix inversion. More precisely, if we have, as follows:

A = Q^{- 1} [C_{r \times r} ⨁ N] Q

, then

A^{D} = Q [C_{r \times r}^{- 1} ⨁ N] Q^{- 1}

defines the Drazin inverse of

A

. Even though the components in a core-nilpotent decomposition are not uniquely defined by

A

, it can be proven that

A^{D}

is unique and has the properties:

$A^{D} = A^{- 1}$ , when $A$ is nonsingular (the nilpotent part is not present).
$A^{D} A A^{D} = A^{D}, A A^{D} = A^{D} A a n d A^{k + 1} A^{D} = A^{k}$ , where $k = i n d (A)$ ;
If $A x = b$ in which $b \in R (A^{k})$ then $x = A^{D} b \in R (A^{k})$ is the unique solution;
$A A^{D}$ is the projector onto $R (A^{k})$ along $N (A^{k})$ ;
$[I - A A^{D}]$ is the complementary projector onto $N (A^{k})$ along $R (A^{k})$ ;
$A^{k} (I - A A^{D}) = 0, w i t h k = i n d (A)$ and $A^{k} (I - A A^{D}) \neq 0, f o r k < i n d (A)$ ;
$A^{D} = \underset{α \to 0}{l i m} {[A^{k + 1} + α I]}^{- 1} A^{k}$ , with $k \geq i n d (A)$ .

Theorem 1

([18]). If

A \in C^{n \times n}

and

0

is an eigenvalue of

A

of multiplicity

k_{i}

, then

0

is an eigenvalue of

A^{D}

of multiplicity

k_{i}

. If

λ_{i} \neq 0

is an eigenvalue of

A

of multiplicity

k_{i}

, then

λ^{- 1}

is an eigenvalue of

A^{D}

of multiplicity

k_{i}

. If

A \in C^{n \times n}

, then the Drazin inverse

A^{D}

is a polynomial in

A

of degree

n - 1

or less.

Function of Matrix:

Now, we use the following notations: for a matrix

A \in C^{n \times n}

, let its characteristic polynomial be

c (λ) = ℿ_{i = 1}^{s} {(λ - λ_{i})}^{m_{i}}

where the eigenvalues

λ_{i}

are repeated

m_{i}

and

m_{1} + \dots + m_{s} = n

. Let

v_{i} = i n d (λ_{i} I - A)

and

N_{i} = N ({(λ_{i} I - A)}^{v_{i}})

. We know that

N_{i}

is an invariant subspace for

A

and

\dim N_{i} = m_{i}

. We also know that

E_{i} = I - (λ_{i} I - A) {(λ_{i} I - A)}^{D}

is a projection on

N_{i}

. Since

E_{i}

and

E_{j}

are polynomials in

A

, we have

E_{i} E_{j} = E_{j} E_{i}

[27]. Other properties of

N_{i}

and

E_{i}

are given by Theorem 2.

Theorem 2

([20]). Let

N_{i} = N ({(λ_{i} I - A)}^{v_{i}})

and

E_{i} = I - (λ_{i} I - A) {(λ_{i} I - A)}^{D},

then (1)

N_{i} ⋂ N_{j} = \{\underline{0}\}, E_{i} E_{j} = E_{j} E_{i} = 0 i \neq j

; (2)

C^{n} = ⨁_{i = 1}^{s} N_{i}

and (3)

I = ⅀_{i = 1}^{s} E_{i}

, are true.

The concept of a matrix function generalizes the evaluation of scalar analytic functions at matrices and is standard in the theory of matrix analysis. Theorem 3 summarizes the fundamental results of such a theory (see Higham [20]).

Theorem 3

([18,20]). For any

A \in C^{n \times n}

with spectrum

σ (A)

, let

F (A)

denote the class of all functions f:

C \to C,

which are analytic in some open set containing

σ (A)

. For any scalar function

f \in F (A)

, the corresponding matrix function f(A) is defined by

f (A) = ⅀_{i = 1}^{s} ⅀_{k = 0}^{v_{i} - 1} \{\frac{f^{(k)} (λ_{i})}{k!} {(A - λ_{i} I)}^{k} E_{i}\} = ⅀_{i = 1}^{s} E_{i} [⅀_{k = 0}^{v_{i} - 1} \frac{f^{(k)} (λ_{i})}{k!} {(A - λ_{i} I)}^{k}]

(1)

The Drazin inverse is a matrix function corresponding to the reciprocal

f (z) = 1 / z

, defined on nonzero eigenvalues. The analogous result for Drazin inverse is, as follows:

A^{D} = ⅀_{i = 1}^{s} {E_{i} ⅀}_{k = 0}^{v_{i} - 1} \{\frac{{(- 1)}^{k}}{λ_{i}^{k + 1}} {(A - λ_{i} I_{n})}^{k}\} for all 0 \neq λ_{i} \in σ (A)

(2)

Theorem 4

([22]). For any

A \in C^{n \times n}

with spectrum

σ (A)

, let

f a n d g

be an analytic functions in some open set containing

σ (A)

, then

f (A) = g (A)

if and only if

f^{(k)} (λ_{i}) = g^{(k)} (λ_{i})

, for

k = 0,1, \dots, v_{i} - 1

and

i = 1,2, \dots, s

. In particular, If

r (λ)

is a polynomial, such that

f^{(k)} (λ_{i}) = r^{(k)} (λ_{i})

, for

k = 0,1, \dots, m_{i} - 1,

and

i = 1,2, \dots, s,

then

f (A) = r (A) .

Lemma 1

([19,20,25]). Let

A \in C^{n \times n}

be any arbitrary complex matrix with spectrum

σ (A)

and let

p (λ)

,

q (λ)

and

r (λ)

be analytic at

λ_{i} \in σ (A)

,

λ_{i}, i = 1,2, \dots s \leq n

, then

(i): if $p (λ) = k$ , then $p (A) = k I_{n}$ ;
(ii): if $p (λ) = λ$ , then $p (A) = A$ and $p (A) A = A p (A)$ ;
(iii): if $p (λ) = q (λ) + r (λ)$ , then $p (A) = q (A) + r (A)$ ;
(iv): if $p (λ) = q (λ) r (λ)$ , then $p (A) = q (A) r (A) = r (A) q (A)$ ;
(v): if $p (λ) = r (q (λ))$ is analytic at $q (λ_{i})$ , and $\exists λ_{i} \in σ (A)$ , then $p (A) = r (q (A))$ ;
(vi): $p (I_{m} \otimes A) = I_{m} \otimes p (A)$ , where ⊗ is the Kronecker product;
(vii): $p (A \otimes I_{m}) = p (A) . \otimes I_{m}$ , $p (A^{⟙}) = {\{p (A)\}}^{⟙} and p (X A X^{- 1}) = X p (A) X^{- 1}$ .

In many engineering applications, it becomes advantageous to express matrix functions through contour-integral representations. Recall that if

f (z)

is analytic in and on a simple closed contour

C

, then

\oint_{C} f (ξ) d ξ = 0

. Furthermore, if

λ

lies inside

C

, then

f (λ) = \frac{1}{2 π i} \oint_{C} \frac{f (z)}{z - λ} d z a n d f^{(k)} (λ) = \frac{k!}{2 π i} \oint_{C} \frac{f (z)}{{(z - λ)}^{k + 1}} d z

(3)

A similar approach can be applied to the functions of matrices. For

A \in C^{n \times n}

, the matrix

{(λ I_{n} - A)}^{- 1}

is referred to as the resolvent of

A

; it is analytic for

λ \notin σ (A)

. If the characteristic polynomial of

A

is

c (λ) = ℿ_{i = 1}^{s} {(λ - λ_{i})}^{m_{i}}

, with distinct eigenvalues

λ_{i}

and

m_{1} + \dots + m_{s} = n

, then the resolvent has the spectral form:

{(λ I - A)}^{- 1} = ⅀_{i = 1}^{s} ⅀_{k = 0}^{m_{i} - 1} \frac{{(A - λ_{i} I_{n})}^{k}}{{(λ - λ_{i})}^{k + 1}} E_{i}, λ \notin σ (A)

(4)

where

E_{i} = I - (λ_{i} I - A) \underset{α \to 0}{l i m} \{{[{(λ_{i} I - A)}^{m_{i} + 1} + α I]}^{- 1} {(λ_{i} I - A)}^{m_{i}}\}

is a projection.

Theorem 5

([18]). If

A \in C^{n \times n}

and

f

is analytic function for

|λ| < ζ

and

ρ (A) < ζ

, then the matrix Cauchy integral formula:

f (A) = [\oint_{C} f (λ) {(λ I - A)}^{- 1} d λ] / 2 π i

where

C

is a contour lying in the disk

|λ| < ζ

and enclosing all the eigenvalues of

A

.

Consider a matrix

A \in C^{n \times n}

with spectra

σ (A) = \{λ_{1}, \dots, λ_{s}\}

where

s \leq n

. If a scalar function

P (λ)

is analytic at

λ_{i}, i = 1, \dots s,

then the matrix function

P (A)

is generated by

P (A) = [\oint_{C} P (λ) {(λ I - A)}^{- 1} d λ] / 2 π i,

where

C

is a simple closed contour which encloses

λ_{i}, i = 1, \dots s

. The matrix function described by contour integral has the properties:

Corollary 1

([18,25]). If we let

σ (A) = σ_{1} \cup σ_{2}

where

σ_{1}, σ_{2}

are disjoint sets of eigenvalues and

C_{1}

is a contour enclosing

σ_{1}

while leaving

σ_{2}

outside, then

\frac{1}{2 π i} \oint_{C_{1}} f (λ) {(λ I - A)}^{- 1} d λ = f (A) P = f (A P) P where P is the projection

(5)

P = ⅀_{λ_{i} \in σ_{1}} \{I - (λ_{i} I_{n} - A) {(λ_{i} I_{n} - A)}^{D}\} = \frac{1}{2 π i} \oint_{C_{1}} {(λ I - A)}^{- 1} d λ

(6)

It should be noted that if

A \in C^{n \times n}

and

0 \neq α \in C

, then

{(α A)}^{D} = {α^{- 1} A}^{D}

. If

A \in C^{n \times n}

and

A = T d i a g (A_{1}, A_{2}) T^{- 1}

, where

A_{1}, A_{2}

are square matrices, then Drazin matrix is

A^{D} = T d i a g ({A_{1}}^{D}, {A_{2}}^{D}) T^{- 1}

. If

A B = B A

, then

{(A B)}^{D} = B^{D} A^{D} = A^{D} B^{D}

. Finally, we deduce that

A^{D} = [\oint_{C} {λ^{- 1} (λ I - A)}^{- 1} d λ] / 2 π i

where

C

encloses all the nonzero eigenvalues of

A

[18]. Also, the following statements hold:

(i): ${(A^{⟙})}^{D} = {(A^{D})}^{⟙}$ .
(ii): $R (A^{k}) = R (A^{D}) = R (A A^{D}) = N (I - A A^{D})$ .
(iii): $N (A^{k}) = N (A^{D}) = N (A A^{D}) = R (I - A A^{D})$ .
(iv): $r a n k A^{k} = r a n k A^{D} = r a n k A A^{D}$ .
(v): $A A^{D}$ is the idempotent matrix onto $R (A^{D})$ along $N (A^{D})$ .
(vi): $A^{D} = 0$ if and only if $A$ is nilpotent.
(vii): $A^{D} = \underset{α \to 0}{l i m} {(A^{k + 1} + α I)}^{- 1} A^{k}$ .

3. Matrix Polynomials (λ-Matrices) and Spectral Divisors

By a matrix polynomial

A (λ)

, we mean a matrix of the form

A (λ) = [a_{i j} (λ)]

, where all elements

a_{i j} (λ)

are polynomials in

F [λ]

(i.e., the ring of polynomials in the variable

λ

with coefficients from

F

, typically

F = R

or

F = C

). The set of these matrices will be designated by

F^{m \times m} [λ]

, or symbolized directly by

R^{m \times m} [λ]

(resp.

C^{m \times m} [λ]

), and their subsets containing the constant matrices

A_{i}

, are denoted by

F^{m \times m}

(

R^{m \times m}

or

C^{m \times m}

, respectively). The matrices in

R^{m \times m}

and

R^{m \times m} [λ]

are called real matrices. Then, scalar multiplication, addition, and multiplication of matrix polynomials are the same operations as for general matrices with entries in a commutative ring. An alternative formulation of

A (λ)

(

λ

-matrices) is, as follows:

A (λ) = ⅀_{i = 0}^{l} A_{i} λ^{l - i} = A_{0} λ^{l} + \dots + A_{l - 1} λ + A_{l}

where the coefficients

A_{i}

are constant matrices in

F^{m \times m}

. The matrix

A_{0}

is named the highest coefficient or leading matrix coefficient of the matrix polynomial

A (λ)

. If

A_{0} \neq ○_{m}

(where

○_{m}

is the

m \times m

zero matrix) is true, then the number

l

is called the degree of the matrix polynomial

A (λ)

, and it is designated by

\deg A (λ)

, and the number

m

is called the order of the matrix polynomial

A (λ),

where

λ

is a complex variable. The matrix polynomial

A (λ)

is called monic if leading matrix coefficient

A_{0}

is the identity matrix; comonic if the trailing matrix coefficient

A_{l}

is the identity matrix; regular if

d e t (A (λ))

is not identically zero; unimodular if

d e t (A (λ))

is a nonzero constant; co-regular if the trailing matrix coefficient

A_{l}

is also nonsingular; non-monic if the leading matrix coefficient satisfies

\det (A_{0}) \neq I

. If

\det (A_{0}) = 0

, the polynomial has a singular leading coefficient, which implies the existence of infinite eigenvalues [14,22].

Suppose

B (λ)

is a matrix polynomial of degree

d

with invertible leading coefficient. If there exist matrix polynomials

Q (λ)

and

R (λ)

, with

R (λ) \equiv ○

or

d e g (R) < d

, such that

A (λ) = Q (λ) B (λ) + R (λ)

, then

Q (λ)

is the right quotient of

A (λ)

on division by

B (λ)

, and

R (λ)

is the corresponding right remainder. Similarly, if we have the following decomposition

A (λ) = B (λ) S (λ) + L (λ)

, with

L (λ) \equiv ○

or

d e g (L) < d

, then

S (λ)

and

L (λ)

are the left quotient and left remainder, respectively. If the right remainder is zero, then

Q (λ)

is a right divisor of

A (λ)

; an analogous definition holds for left divisors [12]. Both quotients and remainders are uniquely determined.

Theorem 6

([25]). Let

A (λ) \in R^{m \times m} [λ]

. The right and left remainders of

A (λ)

upon division by

λ I - X

are denoted by

A_{R} (X)

and

A_{L} (X)

, respectively:

A (λ) = B (λ) (λ I - X) + A_{R} (X)

,

A (λ) = (λ I - X) B (λ) + A_{L} (X)

. An

m \times m

matrix

Z \in C^{m \times m}

is called a right solvent of

A (λ)

if

A_{R} (Z) = ○

, and a left solvent if

A_{L} (Z) = ○

.

Corollary 2

([22]). The matrix polynomial

A (λ)

is divisible on the right (respectively, left) by

λ I - Z

with zero remainder if and only if

Z

is a right (respectively, left) solvent of

A (λ)

.

Let

A (λ)

be a matrix polynomial. An

m \times m

matrix

R

is called a right solvent of

A (λ)

with multiplicity

k > 1

if

{(λ I - R)}^{k}

divides

A (λ)

exactly on the right. Similarly, a matrix

L \in R^{m \times m}

is a left solvent of multiplicity

k > 1

if

{(λ I - L)}^{k}

divides

A (λ)

on the left. In these cases,

A (λ) = B (λ) {(λ I - R)}^{k} + A_{R} (R)

,

A (λ) = {(λ I - L)}^{k} C (λ) + A_{L} (L)

, where

A_{R} (R)

and

A_{L} (L)

denote the right and left functional evaluations of

A (λ)

. Moreover,

A_{R} (R) = ○

or

A_{L} (L) = ○

if and only if

R

or

L

is a right or left solvent of

A (λ)

[13].

Definition 1

[14,26]). Let

A (λ) \in C^{m \times m} [λ]

be an

m \times m

matrix polynomial. A constant matrix

R \in C^{m \times m}

is a right solvent of

A (λ)

if

A (R) = ⅀_{i = 0}^{l} A_{i} R^{l - i} = ○

and a matrix

L \in C^{m \times m}

is called a (left) solvent for

A (λ)

if

A (L) = ⅀_{i = 0}^{l} L^{l - i} A_{i} = ○

.

An equivalent representation for

A (R) = ○

(or

A (L) = ○

) that uses the contour integral is as follows:

A (R) = \frac{1}{2 π i} \oint_{Γ} A (λ) {(λ I - R)}^{- 1} d λ = ○ or A (L) = \frac{1}{2 π i} \oint_{Γ} {(λ I - L)}^{- 1} A (λ) d λ = ○

(7)

for any closed contour

Γ \subseteq C

enclosing the spectrum of

R

(or

L

) in its interior. An interesting consequence of matrix complex analysis is the existence of spectral right and spectral left solvent, which can be stated as the following theorem:

Theorem 7

([19,27]). Suppose that

Z

is an operator root (right or left) of the polynomial operator

A (λ)

, with

σ (Z) \subset σ (A)

, and

σ (A) \ σ (Z)

is closed. If

Γ

is a closed curve separating

σ (Z)

from

σ (A) \ σ (Z)

, then

Z

is a spectral root (right or left) of

A (λ),

if, and only if

Z_{R}^{k} = M_{k} {[\frac{1}{2 π i} \oint_{Γ} A^{- 1} (λ) d λ]}^{- 1}; Z_{L}^{k} = {[\frac{1}{2 π i} \oint_{Γ} A^{- 1} (λ) d λ]}^{- 1} M_{k}; M_{k} = [\frac{1}{2 π i} \oint_{Γ} λ^{k} A^{- 1} (λ) d λ]; k = 1, \dots, l - 1

(8)

More generally, let

A (λ)

be a monic matrix polynomial and let

Γ

be a contour consisting of regular points of

A (λ)

that enclose exactly

k m

eigenvalues of

A (λ)

, counted with multiplicities. Then

A (λ)

possesses both a Γ-spectral right divisor and a Γ-spectral left divisor if and only if the following

k m \times k m

matrix

M_{k, k}

is defined by

M_{k, k} = \frac{1}{2 π i} \oint_{Γ} [\begin{matrix} A^{- 1} (λ) \\ ⋮ \\ λ^{k - 1} A^{- 1} (λ) \end{matrix} \begin{matrix} \dots \\ ⋮ \\ \dots \end{matrix} \begin{matrix} λ^{k - 1} A^{- 1} (λ) \\ ⋮ \\ λ^{2 k - 2} A^{- 1} (λ) \end{matrix}] d λ

(9)

is nonsingular. In case that

M_{k, k}

is nonsingular, the Γ-spectral right divisor

A_{1} (λ) = I λ^{k} + A_{11} λ^{k - 1} + \dots + A_{1 k}

(or the Γ-spectral left divisor

A_{2} (λ) = I λ^{k} + A_{21} λ^{k - 1} + \dots + A_{2 k}

) is given by the formula [12,19,25]:

[A_{1 k} \dots A_{12}, A_{11}] = - [\frac{1}{2 π i} \oint_{Γ} [λ^{k} A^{- 1} (λ) \dots λ^{2 k - 1} A^{- 1} (λ)] d λ] M_{k, k}^{- 1}; [\begin{matrix} A_{2 k} \\ ⋮ \\ A_{21} \end{matrix}] = - M_{k, k}^{- 1} [\frac{1}{2 π i} \oint_{Γ} [\begin{matrix} λ^{k} A^{- 1} (λ) \\ \begin{matrix} ⋮ \\ λ^{2 k - 1} A^{- 1} (λ) \end{matrix} \end{matrix}] d λ]

(10)

Now we are going to introduce some definitions, which are results related to the concept of linearization, companion forms of a matrix polynomial. Let

R_{i} \in R^{m \times m}

(for

i = 1, \dots, l

) be square matrices (called right block roots or solvents), such that the right functional evaluation of

A (λ)

by

R_{i}

is identically zero, that is, as follows:

⅀_{k = 0}^{l} A_{k} R_{i}^{l - k} = ○ ⟺ [A_{l} \dots A_{0}] . [c o l {(R_{i}^{k})}_{k = 0}^{l}] = ○ ⟺ [\begin{matrix} ○ & I_{m} & ○ \\ ⋮ & ⋱ \\ ○ & ○ & I_{m} \\ - A_{l} & \dots & - A_{1} \end{matrix}] [\begin{matrix} \begin{matrix} I_{m} \\ R_{i} \end{matrix} \\ ⋮ \\ R_{i}^{l - 1} \end{matrix}] = [\begin{matrix} I_{m} & ○ & ○ \\ ⋱ & ⋮ \\ ○ & I_{m} & ○ \\ ○ & \dots & ○ & A_{0} \end{matrix}] [\begin{matrix} \begin{matrix} I_{m} \\ R_{i} \end{matrix} \\ ⋮ \\ R_{i}^{l - 1} \end{matrix}] R_{i}

(11)

In a compact form (i.e.,

A_{0} = I_{m}

) we have

A_{c} X_{i} = X_{i} R_{i},

where

X_{i} = c o l {(R_{i}^{k})}_{k = 0}^{l - 1}

and

A_{c}

is the first companion matrix. If we define

V_{R} = r o w {(X_{i})}_{i = 1}^{l} = [X_{1} X_{2} \dots X_{l}],

then

A_{c} V_{R} = V_{R} Λ_{R}

with

Λ_{R} = b l k d i a g (R_{1} \dots R_{l})

. The matrix

X \in R^{r \times r}

is a block eigenvalue of order

r

of a matrix

A \in R^{n \times n}

with

n = l r

if there exists a block eigenvector

V \in R^{n \times r}

of full rank, such that

A V = V X

. Moreover, if

A V = V X

, with

V

of full rank, then all the eigenvalues of

X

are eigenvalues of

A

. A matrix

X

has the property that any similar block is also a block eigenvalue, and it is clear that a block eigenvector

V

spans an invariant subspace of

A

, since being of full rank is equivalent to having linearly independent columns [20].

The word “Linearization” to a matrix polynomial, in fact, comes from the linearization of differential equations. Consider the following system of differential equation with constant coefficients

A (λ) x (λ) = f (λ) ⟺ \frac{d^{l} x (t)}{{d t}^{l}} + ⅀_{i = 1}^{l} A_{i} \frac{d^{l - i} x (t)}{{d t}^{l - i}} = f (t)

(12)

where

f (t) \in R^{m \times 1}

is a given forcing function and

x (t) \in R^{m \times 1}

is an unknown vector function called state. We can reduce (12) to a first-order differential equation

A (λ) x (λ) = f (λ) ⟺ (λ I - A_{c}) X (λ) = F (λ)

(13)

where

X (λ) = c o l {(λ^{k} x (λ))}_{k = 0}^{l - 1}

and

F (λ) = {[○ ○ \dots f^{⟙} (λ)]}^{⟙}

. This operation of reducing the

l^{t h}

degree differential equation to a first-order equation is called a linearization. (That is, we increased the dimension of the unknown function, which becomes

n = l m

.)

x (λ) = [I ○ \dots ○] X (λ) and x (λ) = A^{- 1} (λ) f (λ) ⟺ x (λ) = [I ○ \dots ○] {(λ I - A_{c})}^{- 1} F (λ)

We know that

f (λ) = [○ \dots ○ I] F (λ)

so

A^{- 1} (λ) [○ \dots ○ I] = [I ○ \dots ○] {(λ I - A_{c})}^{- 1}

, if we define the following matrices

B_{c} = {[○ \dots ○ I]}^{⟙}

and

C_{c} = [I ○ \dots ○]

we obtain

A^{- 1} (λ) = C_{c} {(λ I - A_{c})}^{- 1} B_{c}

. Since

A (λ)

and

(λ I - A_{c})

have the same spectrum as

\det (A (λ)) = \det (λ I - A_{c})

therefore they are equivalent.

Definition 2.

Let

A (λ) \in C^{m \times m} [λ]

be an

l^{t h}

degree monic matrix polynomial (i.e., with nonsingular leading coefficient). A linear matrix polynomial,

(λ I_{n} - A_{c}) \in C^{n \times n},

is known as a linearization (or a matrix pencil) of

A (λ)

if there exist a two unimodular matrix polynomials

P (λ)

and

Q (λ),

such that

[A (λ) \oplus I_{(n - m)}] = P (λ) (λ I_{n} - A_{c}) Q (λ)

or we say that they are equivalent and we write

[A (λ) \oplus I_{(n - m)}] ~ (λ I_{n} - A_{c})

.

An

m \times m

matrix polynomial

A (λ)

is said to be similar to a second matrix polynomial

B (λ)

of the same order if there exists a unimodular matrix polynomial

T (λ),

such that

A (λ) = T (λ) B (λ) T^{- 1} (λ)

.

Theorem 8

([25]). Two matrix polynomials

B (λ) \in C^{m \times m} [λ]

and

A (λ) \in C^{m \times m} [λ]

, are called similar if and only if the matrix polynomials

λ I_{n} - B_{c}

and

λ I_{n} - A_{c}

are equivalent (i.e.,

B_{c} = P^{- 1} A_{c} P

). Any matrix

A \in C^{n \times n}

is a linearization of

A (λ) = ⅀_{i = 0}^{l} A_{i} λ^{l - i}

if and only if

A

is similar to the first companion matrix

A_{c}

of

A (λ)

, that is,

A = T_{c}^{- 1} A_{c} T_{c}

.

What role do the solvents play in contributing to the solution of the diff-equation?

A_{c} V_{R} = V_{R} Λ_{R} ⟺ {[λ I_{n} - A_{c}]}^{- 1} = V_{R} {[λ I_{n} - Λ_{R}]}^{- 1} V_{R}^{- 1}

If we let

V_{R} = [X_{c 1}, \dots, X_{c l}] = r o w {(X_{c i})}_{i = 1}^{l}

and

V_{R}^{- 1} = {[Y_{c 1}^{⟙}, \dots, Y_{c l}^{⟙}]}^{⟙} = c o l {(Y_{c i})}_{i = 1}^{l},

then

{[λ I_{n} - A_{c}]}^{- 1} = r o w {(X_{c i})}_{i = 1}^{l} . [⨁_{i = 1}^{l} {(λ I_{m} - R_{i})}^{- 1}] . [c o l {(Y_{c i})}_{i = 1}^{l}]

, equivalently.

A_{c} V_{R} = V_{R} Λ_{R} ⟺ {[λ I_{n} - A_{c}]}^{- 1} = ⅀_{i = 1}^{l} X_{c i} {(λ I_{m} - R_{i})}^{- 1} Y_{c i}

(14)

From the above similarity transformation, it is well-known that

A = T_{c}^{- 1} A_{c} T_{c}

and

{[λ I_{n} - A]}^{- 1} = {[λ I_{n} - T_{c}^{- 1} A_{c} T_{c}]}^{- 1} = T_{c}^{- 1} {[λ I_{n} - A_{c}]}^{- 1} T_{c}

Means that

{[λ I_{n} - A]}^{- 1} = T_{c}^{- 1} [⅀_{i = 1}^{l} X_{c i} {(λ I_{m} - R_{i})}^{- 1} Y_{c i}] T_{c} = ⅀_{i = 1}^{l} X_{i} {(λ I_{m} - R_{i})}^{- 1} Y_{i,}

where

X_{i} = T_{c}^{- 1} X_{c i}

and

Y_{i} = Y_{c i} T_{c}

. Using the inverse Laplace transform, we obtain:

e^{A t} = ⅀_{i = 1}^{l} X_{i} e^{R_{i} t} R Y_{i}

. We also know that the homogeneous solution of the differential equation

X^{'} (t) = A X (t)

is

X (t) = e^{A t} X (t_{0}) = e^{A t} X_{0} = [⅀_{i = 1}^{l} X_{i} e^{R_{i} t} Y_{i}] X_{0}

[17].

The standard triples

(C_{c}, A_{c}, B_{c})

corresponding to

A (λ)

will be used extensively throughout the remainder of this paper [12]:

\{\begin{matrix} A^{- 1} (λ) = C_{c 1} {(λ I - A_{c 1})}^{- 1} B_{c 1} with C_{c 1} = [I_{m} ○ \dots ○], A_{c 1} = [\begin{matrix} ○ & I_{m} & ○ \\ ⋮ & ⋱ \\ ○ & ○ & I_{m} \\ - A_{l} & \dots & - A_{1} \end{matrix}]; B_{c 1} = [\begin{matrix} ○ \\ ⋮ \\ ○ \\ I_{m} \end{matrix}] \\ A^{- 1} (λ) = C_{c 2} {(λ I - A_{c 2})}^{- 1} B_{c 2} with C_{c 2} = [○ \dots ○ I_{m}], A_{c 2} = [\begin{matrix} ○ & \dots & ○ & - A_{l} \\ I_{m} & ○ & ⋮ \\ ⋱ & - A_{2} \\ ○ & I_{m} & - A_{1} \end{matrix}]; B_{c 2} = [\begin{matrix} I_{m} \\ ○ \\ ⋮ \\ ○ \end{matrix}] \end{matrix}\}

(15)

The following equality is verified by direct multiplication:

A_{c 2} = T_{1} A_{c 1} T_{1}^{- 1}

or

[\begin{matrix} ○ & \dots & ○ & - A_{l} \\ I_{m} & ○ & ⋮ \\ ⋱ & - A_{2} \\ ○ & I_{m} & - A_{1} \end{matrix}] = [\begin{matrix} A_{l - 1} & \dots & A_{1} & I \\ ⋮ & ⋰ \\ A_{1} & ⋰ \\ I & ○ \end{matrix}] [\begin{matrix} ○ & I_{m} & ○ \\ ⋮ & ⋱ \\ ○ & ○ & I_{m} \\ - A_{l} & \dots & - A_{1} \end{matrix}] {[\begin{matrix} A_{l - 1} & \dots & A_{1} & I \\ ⋮ & ⋰ \\ A_{1} & ⋰ \\ I & ○ \end{matrix}]}^{- 1} with T_{1} = [\begin{matrix} A_{l - 1} & \dots & A_{1} & I \\ ⋮ & ⋰ \\ A_{1} & ⋰ \\ I & ○ \end{matrix}]

Lemma 2

([19,27]). If

\{Z_{1}, \dots, Z_{l}\} \in C^{m \times m}

are operator roots of the polynomial operator

A (λ)

then,

C_{A} \cdot V_{R} = V_{R} Λ_{R}

with

C_{A}

is the companion form matrix corresponding to the pencil

A (λ) .

If the Vandermonde operator

V_{R} = r o w {(c o l {(Z_{i}^{k})}_{k = 0}^{l - 1})}_{i = 1}^{l}

is invertible, then the operators

C_{A}

and

Λ_{R} = b l k d i a g (Z_{1}, \dots, Z_{l})

are similar. If

V_{R}

is invertible, then

σ (A) = ⋃_{k = 1}^{l} σ (Z_{k})

.

Theorem 9

([25]). If

\{Z_{1}, \dots, Z_{l}\} \in C^{m \times m}

is complete set of operator roots of

A (λ)

, then

▪: $V_{R}$ is left-invertible ⟺ $K e r (V_{R}) = \{○\}$ ⟺ $σ (Z_{j}) ⋂ σ (Z_{k}) = \emptyset, (j, k = 1, \dots, l; j \neq k)$ .
▪: $V_{R}$ is right-invertible ⟺ $K e r (V_{R}^{H}) = {○}$ ⟺ $σ (A) = ⋃_{k = 1}^{l} σ (Z_{k})$ .
▪: $V_{R}^{- 1}$ exist ⟺ $K e r (V_{R}) = K e r (V_{R}^{H}) = \{○\}$ ⟺ $σ (A) = ⋃_{k = 1}^{l} σ (Z_{k}); σ (Z_{j}) ⋂ σ (Z_{k}) = \emptyset$ .

What forms can the block Vandermonde matrix

V_{R}

take when we have some repeated solvents (block roots)?

Proposition 1

([13,22]). An

m \times m

square matrix

R

is a right solvent of

A (λ)

with multiplicity

k > 1

if and only if it is a right solvent of each derivative

A^{(i)} (λ)

for

i = 0, 1, 2, \dots, k - 1

. Similarly, an

m \times m

matrix

L

is a left solvent of multiplicity

k > 1

if and only if it is a left solvent of

A^{(i)} (λ)

for

i = 0, 1, 2, \dots, k - 1

, where

A^{(i)} (λ)

denotes the

i^{t h}

derivative of

A (λ)

with respect to

λ

.

Let

A (λ) \in C^{m \times m} [λ]

be an

l^{t h}

degree matrix polynomial, then a matrix polynomial

B (λ) = ⅀_{i = 0}^{k} B_{i} λ^{k - i} \in C^{m \times m} [λ]

(

k < l

) is called the right divisor of

A (λ)

if there exists a

P (λ),

such that

A (λ) = P (λ) B (λ)

. In addition, if

σ [P (λ)] \cap σ [B (λ)] = \emptyset

, then

B (λ)

is called spectral divisor of

A (λ)

[27]. If the linear pencil

(λ I - X)

is a (spectral) right divisor of

A (λ)

, then the matrix

X

is called (spectral) right root of

A (λ)

and satisfies

A_{R} (X) = ○_{m}

. Therefore, if

A_{R} (X) = ○_{m},

then

A (λ) = P (λ) (λ I - X) with P (λ) = ⅀_{i = 1}^{l} P_{i - 1} λ^{l - i} and P_{k} = ⅀_{i = 0}^{k} A_{i} X^{k - i} .

(16)

Definition 3

([12,13,14]). Let

A

be a matrix, and let

\{X_{1}, \dots, X_{r}\}

be a set of block eigenvalues of

A

with

σ (X_{i}) ⋂ σ (X_{j}) = \emptyset

. We say that this set of block eigenvalues is a complete set, if

▪: The union of the eigenvalues of all $X_{i}$ together equal those of $A$ (i.e., $⋃ σ (X_{i}) = σ (A)$ ).
▪: Each eigenvalue appears with the same partial multiplicities in the $X_{i}$ as it does in $A$ .

The set is complete if these blocks capture the entire spectral data of

A

without distortion.

Theorem 10

([26]). A set of block eigenvalues

\{X_{1}, \dots, X_{r}\}

of a matrix

A

, is a complete set if and only if there is a set of corresponding block eigenvectors

\{V_{1}, \dots, V_{r}\}

, such that the matrix

[V_{1}, \dots, V_{r}]

is of full rank, and

A [V_{1}, \dots, V_{r}] = [V_{1}, \dots, V_{r}] . b l k d i a g (X_{1}, \dots, X_{r})

. Moreover, if

R_{1}, \dots, R_{l}

is a complete set of solvents of a companion matrix

A_{c}

then the respective block Vandermonde matrix:

V_{R} = r o w {[c o l {(R_{i}^{k - 1})}_{k = 1}^{l}]}_{i = 1}^{l}

is nonsingular. In addition, if

R_{1}, \dots, R_{s}

is a complete set of solvents of the matrix

A_{c}

with multiplicities,

l_{1}, \dots, l_{s}

(i.e.,

{(λ I - R_{i})}^{l_{i}}

is a right divisor of

A (λ)

and

{(λ I - R_{i})}^{l_{i} + 1}

is not), then

A_{c} = V_{R} J_{R} V_{R}^{- 1}

and the generalized block matrices

V_{R}, J_{R}

are given by

V_{R} = r o w {[r o w {[c o l {[(\begin{matrix} k - 1 \\ j \end{matrix}) R_{i}^{k - j - 1}]}_{k = 1}^{l}]}_{j = 0}^{l_{i} - 1}]}_{i = 1}^{s}; J_{R} = \begin{matrix} \begin{matrix} s \\ \oplus \end{matrix} \\ i = 1 \end{matrix} ([\begin{matrix} R_{i} & I_{m} & \dots & ○ \\ ○ & R_{i} & ⋮ & ○ \\ ⋮ & ⋮ & ⋮ \\ ○ & \dots & R_{i} & I_{m} \\ ○ & \dots & ○ & R_{i} \end{matrix}])

(17)

A problem closely related to that of finding solvents of a matrix polynomial is finding a scalar

λ = p

, such that the lambda-matrix

A (λ)

is singular. Such a scalar is called a latent root of

A (λ)

and vectors

v

and

w

are right and left latent vectors, respectively, if for a latent root

p

,

A (p) v = ○

and

w^{⟙} A (p) = ○

.

Definition 4

([5,22]). Let

A (λ) \in R^{m \times m} [λ]

be matrix polynomial, then we define the zeroes of

\det (A (λ))

to be the latent roots (eigenvalues) of

A (λ)

, and the set of latent roots of

A (λ)

is called the spectrum of

A (λ)

denoted by

σ (A (λ))

. And if a nonzero

v \in R^{m}

is such that

A (λ_{i}) v = ○

, then we say that

v

is a right latent (or eigen) vector of

A (λ)

, and if a nonzero

w^{⟙} \in R^{1 \times m}

is such that

w^{⟙} A (λ_{i}) = ○

or

A^{⟙} (λ_{i}) w = ○

, then

w

is a left latent (or eigen) vector of

A (λ)

.

The relationship between solvents and latent vectors/roots is given by

A (λ_{k}) v_{k} = ○ for k = 1 \dots m ⟺ ⅀_{i = 0}^{l} A_{i} λ_{k}^{l - i} v_{k} = ○ ⟺ ⅀_{i = 0}^{l} A_{i} [v_{1}| v_{2}| \dots| v_{m}] [\begin{matrix} λ_{1}^{l - i} & ○ \\ ⋱ \\ ○ & λ_{m}^{l - i} \end{matrix}] = ○

(18)

If we define

Z = [v_{1}| \dots| v_{m}] . [d i a g (λ_{1} \dots λ_{m})] . {[v_{1}| \dots| v_{m}]}^{- 1},

then

⅀_{i = 0}^{l} A_{i} Z^{l - i} = ○

. Solvents of

A (λ)

can be constructed as

Z = V . [d i a g (μ_{1} \dots μ_{m})] . V^{- 1},

where the matrix

V = [v_{1} v_{2} \dots v_{m}]

and the pairs

{\{μ_{i}, v_{i}\}}_{i = 1}^{m}

are chosen among the pairs

{\{μ_{i}, v_{i}\}}_{i = 1}^{p}

of

A (λ)

.

Theorem 11

([14,17]). If

λ_{k}

is the latent root of

A (λ) \in R^{m \times m} [λ]

with

v_{k}

and

w_{k}

as the right and left latent vectors, respectively, then

λ_{k}

is an eigenvalue of

A_{c}

with

v_{c k} = {c o l (λ_{k}^{i} v_{k})}_{i = 0}^{l - 1}

as the right eigenvector of

A_{c},

and

w_{c k} = {c o l (λ_{k}^{i} w_{k})}_{i = 0}^{l - 1}

is the left eigenvector of

A_{c}

.

Now, we are going to explore the relationship between latent vectors of

A (λ)

and eigenvectors of an arbitrary linearization matrix

A

. Given a matrix

A \in C^{n \times n},

whose eigenvectors are denoted by

x_{k},

let

A_{c} \in C^{n \times n}

be its companion form, that is,

A_{c} = T_{c} A T_{c}^{- 1}

where

T_{c}^{⟙} = [T_{1}^{⟙}, T_{2}^{⟙}, \dots, T_{l}^{⟙}]

and

T_{k} \in C^{m \times n}

. We know this from the above theorem

v_{k} = [I, ○, \dots, ○] v_{c k} = [I, ○, \dots, ○] T_{c} x_{k} = T_{1} x_{k}

. From the theory of control systems, the matrix transformation

T_{1}

is given by

T_{1} = [○, \dots, ○, I] Ω_{c}^{- 1}

and

Ω_{c} = {r o w (A^{i} M)}_{i = 0}^{l - 1} = [M, \dots, A^{l - 1} M],

where

M \in C^{n \times m}

is chosen so that

Ω_{c}

is nonsingular.

v_{k} = T_{1} x_{k} = [○, \dots, ○, I] {[{row (A^{i} M)}_{i = 0}^{l - 1}]}^{- 1} x_{k}

(19)

4. Standard Structures of Matrix Polynomials and Realization

We extend the spectral analysis of matrix polynomials by introducing standard triples (canonical triples) of matrices, which encode all eigenvalues, eigenvectors, and Jordan chains. These triples not only generalize the Jordan normal form to monic matrix polynomials on finite-dimensional spaces but also enable the inverse problem: reconstructing polynomial coefficients from spectral data [2,25].

In the previous development, we have seen that

A^{- 1} (λ) = C_{c} {(λ I - A_{c})}^{- 1} B_{c}

and

A_{c} = V_{R} Λ_{R} V_{R}^{- 1};

therefore,

A^{- 1} (λ) = C_{c} {(λ I - V_{R} Λ_{R} V_{R}^{- 1})}^{- 1} B_{c} = C_{c} {V_{R} (λ I - Λ_{R})}^{- 1} V_{R}^{- 1} B_{c},

which is equivalent to

A^{- 1} (λ) = X_{R} {(λ I - Λ_{R})}^{- 1} Y_{R}

with

X_{R} = [I, ○, \dots, ○] V_{R} = [I, \dots, I]

and

V_{R} Y_{R} = {[○, \dots, ○, I]}^{⟙}

. Now, if we let

S_{i} = [x_{i 1} \dots x_{i m}]

where

x ’ s

are latent vectors corresponding to the solvent

R_{i}

with

R_{i} S_{i} = [R_{i} x_{i 1} \dots R_{i} x_{i m}] = [λ_{i 1} x_{i 1} \dots λ_{i m} x_{i m}] = S_{i} J_{i}

therefore,

R_{i} = S_{i} J_{i} S_{i}^{- 1},

then this leads to

Λ_{R} = S J S^{- 1},

where

S = b l k d i a g (S_{1}, \dots, S_{l})

. Notice that

S = b l k d i a g (S_{1}, \dots, S_{l})

⟹

S^{- 1} = b l k d i a g (S_{1}^{- 1}, \dots, S_{l}^{- 1}),

which implies that

Λ_{R} = b l k d i a g (R_{1}, \dots, R_{l}) = S J S^{- 1}

. Based on this information, we can define the Jordan triple by taking the following similarity transformation

Λ_{R} = S J S^{- 1}

.

A^{- 1} (λ) = X_{R} {(λ I - Λ_{R})}^{- 1} Y_{R} = X_{R} S {(λ I - J)}^{- 1} S^{- 1} Y_{R} = X {(λ I - J)}^{- 1} Y

(20)

where

X = X_{R} S = [I, \dots, I] S = [S_{1}, \dots, S_{l}]

and

Y = S^{- 1} Y_{R} = S^{- 1} V_{R}^{- 1} {[○, \dots, ○, I]}^{T}

implies that

(V_{R} S) Y = {[○, \dots, ○, I]}^{⟙}

.

However, in this situation, we are asked to check that

V_{R} S = {c o l (X J^{i - 1})}_{i = 1}^{l}

. Now, observe that the set of all solvents can be gathered in compact form:

Λ_{R}^{k} S = S J^{k} ⟹ [R_{1}^{k} \dots R_{l}^{k}] S = [I, \dots, I, I] S J^{k} = X J^{k} ⟹ X J^{k} = [R_{1}^{k} \dots R_{l}^{k}] S

(21)

Now, we can explicitly write

[R_{1}^{k} \dots R_{l}^{k}] S = [R_{1}^{k} ξ_{1} \dots R_{l}^{k} ξ_{l}]

; where

ξ_{i}

are appropriate matrices, this means that

X J^{k} = [R_{1}^{k} ξ_{1} \dots R_{l}^{k} ξ_{l}]

, and therefore,

⅀_{i = 0}^{l} A_{i} X J^{l - i} = ○

, which can be written as

A_{0} X J^{l} + \dots + A_{l - 1} X J + A_{l} X = ○ ⟺ [{- r o w (A_{l - i})}_{i = 0}^{l - 1}] . [{c o l (X J^{i - 1})}_{i = 1}^{l}] = A_{0} X J^{l}

(22)

A_{c} [{c o l (X J^{i - 1})}_{i = 1}^{l}] = [{c o l (X J^{i - 1})}_{i = 1}^{l}] J ⟺ A_{c} = Q J Q^{- 1} with Q = {col (X J^{i - 1})}_{i = 1}^{l} and A_{0} = I_{m}

Also, we have

A_{c} = Q J Q^{- 1} = V_{R} (S J S^{- 1}) V_{R}^{- 1} ⟹ Q = V_{R} S = {c o l (X J^{i - 1})}_{i = 1}^{l}

.

4.1. Triples of Matrix Polynomials (λ-Matrices)

A triple of matrices

(X, J, Y)

with

X \in R^{m \times m l}

,

J \in R^{m l \times m l}

and

Y \in R^{m l \times m}

is called a Jordan triple of the monic matrix polynomial

A (λ)

of degree

l

and order

m

if

A^{- 1} (λ) = X {(λ I - J)}^{- 1} Y

. Here,

J

is a block-diagonal matrix formed from Jordan blocks, each corresponding to a particular eigenvalue. Each column of

X

belongs to a Jordan chain associated with the corresponding Jordan block in

J

, and

Y

is a matrix of left latent vectors, which can be computed via:

{col (X J^{i - 1})}_{i = 1}^{l} Y = {[○, \dots, ○, I]}^{⟙}

, [19].

The coefficients of the monic matrix polynomial can be recovered from either the right or left latent structure:

⅀_{i = 0}^{l} A_{i} X J^{l - i} = ○

and

⅀_{i = 0}^{l} J^{l - i} Y A_{i} = ○,

which leads to

[{r o w (A_{l - i})}_{i = 0}^{l - 1}] = - [A_{0} X J^{l}] \cdot {[{c o l (X J^{i - 1})}_{i = 1}^{l}]}^{- 1}; [{c o l (A_{l - i})}_{i = 0}^{l - 1}] = - {[{r o w (J^{i - 1} Y)}_{i = 1}^{l}]}^{- 1} \cdot [J^{l} {Y A}_{0}]

(23)

Definition 5

(standard triple [8]). A set of matrices

(Z, T, W)

is called a standard triple of the monic matrix polynomial

A (λ)

if it is obtained from a Jordan triple

(X, J, Y)

by the following similarity transformation:

Z = X M^{- 1}

,

T = M J M^{- 1}

,

W = M Y

and that

T

is standard form.

Now, if we let

T

be any linearization of the operator polynomial

A (λ)

with invertible leading coefficient, then there exists an invertible matrix

Q

, such that

Q^{- 1} T Q = A_{c}

. We then deduce from the structure of

A_{c}

and the relation

T Q = Q A_{c}

that

Q

must have the form

Q = {c o l (Q_{1} T^{i - 1})}_{i = 1}^{l}

for some operator

Q_{1}

, and that

⅀_{i = 0}^{l} A_{i} Q_{1} T^{l - i} = ○

.

Theorem 12

([19]). Let

A (λ)

be a monic matrix polynomial of degree

l

and order

m

with standard triple

(X, T, Y)

, then

A^{- 1} (λ) = X {(λ I - T)}^{- 1} Y

and

A (λ)

has the representations:

\{\begin{matrix} \begin{matrix} A (λ) = I λ^{l} - X T^{l} (V_{1} + V_{2} λ + \dots V_{l} λ^{l - 1}); [V_{1} \dots V_{l}] = {[{col (X T^{i - 1})}_{i = 1}^{l}]}^{- 1}; V_{i} \in R^{m l \times m} \end{matrix} \\ = I λ^{l} - (W_{1} + W_{2} λ + \dots W_{l} λ^{l - 1}) T^{l} Y; [\begin{matrix} W_{1} \\ \begin{matrix} ⋮ \\ W_{l} \end{matrix} \end{matrix}] = {[{row (T^{i - 1} Y)}_{i = 1}^{l}]}^{- 1}; W_{i} \in R^{m \times m l} \end{matrix}\}

(24)

Proof.

Notice that

A (λ) = I λ^{l} + [A_{l}, \dots, A_{1}] c o l {(I λ^{i - 1})}_{i = 1}^{l}

, and previously, we have seen that if

T

is any linearization of the monic operator polynomial

A (λ)

, then there exist some linear operator

X,

such that

⅀_{i = 0}^{l} A_{i} X T^{l - i} = ○,

which can be written as

[A_{l}, \dots, A_{1}] = - [X T^{l}] \cdot {[{c o l (X T^{i - 1})}_{i = 1}^{l}]}^{- 1}

and define

[V_{1} \dots V_{l}] = {[{c o l (X T^{i - 1})}_{i = 1}^{l}]}^{- 1};

then, we obtain

A (λ) = I λ^{l} - [X T^{l}] \cdot [V_{1} \dots V_{l}] c o l {(I λ^{i - 1})}_{i = 1}^{l} = I λ^{l} - X T^{l} (⅀_{i = 1}^{l} V_{i} λ^{i - 1})

. Following the same procedure, we can prove the rest. □

Theorem 13

([2,25]). If

A_{k} (λ)

are monic matrix polynomials with standard triple

(X_{k}, T_{k}, Y_{k})

for

k = 1,2

, then

A (λ) = A_{2} (λ) A_{1} (λ)

has the following standard triple.

X = [X_{1} ○]; T = [\begin{matrix} T_{1} & Y_{1} X_{2} \\ ○ & T_{2} \end{matrix}]; Y = [\begin{matrix} ○ \\ Y_{2} \end{matrix}]

(25)

Proof.

From the theory of standard triples.

\begin{matrix} A^{- 1} (λ) & = X {(λ I - T)}^{- 1} Y = [X_{1} ○] {[\begin{matrix} {λ I - T}_{1} & - Y_{1} X_{2} \\ ○ & λ I - T_{2} \end{matrix}]}^{- 1} [\begin{matrix} ○ \\ Y_{2} \end{matrix}] \\ = [X_{1} ○] {[\begin{matrix} {({λ I - T}_{1})}^{- 1} & {({λ I - T}_{1})}^{- 1} Y_{1} X_{2} {(λ I - T_{2})}^{- 1} \\ ○ & {(λ I - T_{2})}^{- 1} \end{matrix}]}^{- 1} [\begin{matrix} ○ \\ Y_{2} \end{matrix}] \\ = (X_{1} {(λ I - T_{1})}^{- 1} Y_{1}) (X_{2} {(λ I - T_{2})}^{- 1} Y_{2}) = A_{1}^{- 1} (λ) A_{2}^{- 1} (λ) \end{matrix}

If

A (λ) = A_{2} (λ) A_{1} (λ)

is a particular factorization of the monic matrix polynomial

A (λ)

, such that

σ (A_{1} (λ)) ⋂ σ (A_{2} (λ)) = \emptyset

, then

A_{1} (λ)

and

A_{2} (λ)

are called spectral divisors of

A (λ)

. It follows that, whenever a matrix polynomial has spectral divisors, there exists a similarity transformation that converts its block-companion matrix

A_{c}

into a block-diagonal form [13,26]. □

Remark 1.

If the set of matrices

(X, T, Y)

is a standard (or Jordan) triple, then we call the set

(X, T)

a standard (or Jordan) pair.

4.2. Pairs of Matrix Polynomials (λ-Matrices)

A monic matrix polynomial

A (λ)

is fully characterized by its invariant pairs

(X_{i}, J_{i})

, which generalize eigenpairs through Jordan chains. Since Jordan chains are numerically unstable, invariant pairs provide a more robust framework for spectral analysis and computation. Let

λ_{k} \in C

be an eigenvalue of the regular

m \times m

matrix polynomial

A (λ)

of multiplicity

m_{k}

. Then, we can construct from the spectral data of

A (λ)

a pair of matrices

(X_{k}, J_{k})

with the following properties:

X_{k} \in C^{m \times m_{k}}

,

J_{k} \in C^{m_{k} \times m_{k}}

and

J_{k}

is a Jordan matrix with

λ_{k}

as its only eigenvalue,

⅀_{i = 0}^{l} A_{i} X_{k} J_{k}^{l - i} = ○,

and

r a n k [c o l {(X_{k} J_{k}^{i})}_{i = 0}^{l - 1}] = m_{k}

. We shall say that any pair of matrices

(X_{k}, J_{k})

with these properties is a local Jordan pair of

A (λ)

at

λ_{k}

. If

λ_{1}, \dots, λ_{n}

are all the eigenvalues of

A (λ),

in this way, we obtain

n

local Jordan pairs

(X_{i}, J_{i})

,

i = 1, \dots, n

[2].

A_{0} X J^{l} + \dots + A_{l - 1} X J + A_{l} X = ○ ⟺ [A_{l}, \dots, A_{1}] = - [A_{0} X J^{l}] \cdot {[{c o l (X J^{i - 1})}_{i = 1}^{l}]}^{- 1}

(26)

If the spectrum of a matrix polynomial contains infinity as an eigenvalue, the corresponding Jordan pair is naturally split into two components: the finite Jordan pair

(X_{F}, J_{F})

and the infinite Jordan pair

(X_{\infty}, J_{\infty})

.

Definition 6.

If

(X_{i}, J_{i})

is a finite local Jordan pairs of

A (λ),

then the pair

(X_{F}, J_{F})

of the form

X_{F} = [X_{1}, \dots X_{n}]

and

J_{F} = J_{1} \oplus \dots \oplus J_{n}

is called a finite Jordan pair for

A (λ)

. Now, we define a Jordan pair of order

l

for

A (λ)

as pair

(X, J)

with the following properties:

(1): $X = [X_{F}, X_{\infty}], J = J_{F} \oplus J_{\infty}$ ;
(2): $(X_{F}, J_{F})$ is a finite Jordan pair for $A (λ)$ ;
(3): $(X_{\infty}, J_{\infty})$ is a local Jordan pair for $r e v (A (λ)) = λ^{l} A (λ^{- 1})$ at $λ = 0$ .

It can be easily verified that

A_{0} X_{F} J_{F}^{l} + A_{1} X_{F} J_{F}^{l - 1} + \dots + A_{l} X_{F} = ○ and A_{l} X_{\infty} J_{\infty}^{l} + \dots + A_{1} X_{\infty} J_{\infty} + A_{0} X_{\infty} = ○,

means that

[A_{l}, \dots, A_{1}, A_{0}] Q_{l} = ○

with

Q_{l} = [{c o l (X_{F} J_{F}^{i})}_{i = 0}^{l} {c o l (X_{\infty} J_{\infty}^{l - i})}_{i = 0}^{l}]

.

The pair

(X, T)

will be called a standard pair of order

l

if the condition

\det Q_{l - 1} \neq 0

is satisfied. Its main property is summarized in the following: The admissible pair

(X, T)

is standard if and only if the

r a n k (Q_{k})

is maximal for all

k \geq 1

(also,

Q_{k} = [{c o l (X_{F} J_{F}^{i})}_{i = 0}^{k} {c o l (X_{\infty} J_{\infty}^{k - i})}_{i = 0}^{k}]

is sometimes called controllable matrix).

Lemma 3

([2,25]). If

ν

,

μ

denote the sums of the degrees of the finite and infinite elementary divisors of a general matrix polynomial

A (λ)

, respectively, then

n = μ + ν = l m

.

The finite Jordan pair of

r e v (A (λ))

, associated with the zero structure at

λ = 0

corresponds to the infinite Jordan pair

(X_{\infty}, J_{\infty})

of

A (λ)

. Consequently, the finite and infinite Jordan pairs of

A (λ)

satisfy the following properties:

\{\begin{matrix} Ω_{F} = col {(X_{F} J_{F}^{i - 1})}_{i = 1}^{l} = [\begin{matrix} X_{F} \\ \begin{matrix} X_{F} J_{F} \\ ⋮ \end{matrix} \\ X_{F} J_{F}^{l - 1} \end{matrix}]; \\ ⅀_{i = 0}^{l} A_{i} X_{F} J_{F}^{l - i} = 0 with rank Ω_{F} = ν \end{matrix}\} and \{\begin{matrix} Ω_{\infty} = col {(X_{\infty} J_{\infty}^{l - i})}_{i = 1}^{l} = [\begin{matrix} X_{\infty} J_{\infty}^{l - 1} \\ \begin{matrix} ⋮ \\ X_{\infty} J_{\infty} \end{matrix} \\ X_{\infty} \end{matrix}] \\ ⅀_{i = 0}^{l} A_{i} X_{\infty} J_{\infty}^{i} = 0 with rank Ω_{\infty} = μ \end{matrix}\}

Moreover, the structure of the infinite Jordan pair of

A (λ)

is closely connected (see Vardulakis in [8]) to its Smith–McMillan form at

λ = \infty

. In particular,

J_{\infty} = b l k d i a g \{J_{i}^{\infty}\} \in R^{μ \times μ}; w i t h J_{i}^{\infty} = [\begin{matrix} 0 \\ \begin{matrix} 0 \\ ⋮ \end{matrix} \\ 0 \end{matrix} \begin{matrix} 1 \\ \begin{matrix} 0 \\ ⋱ \end{matrix} \\ \dots \end{matrix} \begin{matrix} 0 \\ \begin{matrix} ⋱ \\ ⋱ \end{matrix} \\ 0 \end{matrix} \begin{matrix} 0 \\ \begin{matrix} ⋮ \\ 1 \end{matrix} \\ 0 \end{matrix}]

Theorem 14

([8,19]). Let

(X_{F}, J_{F})

and

(X_{\infty}, J_{\infty})

be the finite and infinite Jordan pairs of

A (λ)

, with

X_{F} \in R^{m \times ν}

,

J_{F} \in R^{ν \times ν}

,

X_{\infty} \in R^{m \times μ}

,

J_{\infty} \in R^{μ \times μ}

and

μ = l m - ν

. These pairs satisfy the following properties:

$\deg (\det (A (λ))) = ν$ and $\det (λ^{l} A (λ^{- 1}))$ has a zero at $λ = 0$ with multiplicity $μ$ ;
$⅀_{i = 0}^{l} A_{i} X_{F} J_{F}^{l - i} = ○, ⅀_{i = 0}^{l} A_{i} X_{\infty} J_{\infty}^{i} = ○;$
$rank Ω_{F} = ν, a n d rank Ω_{\infty} = μ;$
${[A_{l}^{⟙} \dots A_{1}^{⟙} A_{0}^{⟙}]}^{⟙} \in n u l l (Q^{⟙}), w i t h Q = [Q_{F} Q_{\infty}] = [\begin{matrix} X_{F} \\ \begin{matrix} X_{F} J_{F} \\ ⋮ \end{matrix} \\ X_{F} J_{F}^{l} \end{matrix} \begin{matrix} X_{\infty} J_{\infty}^{l} \\ \begin{matrix} ⋮ \\ X_{\infty} J_{\infty} \end{matrix} \\ X_{\infty} \end{matrix}]$ .

In addition, a realization of

A^{- 1} (λ)

is given by

A^{- 1} (λ) = [X_{F} X_{\infty}] {[\begin{matrix} λ I_{ν} - J_{F} & ○ \\ ○ & λ J_{\infty} - I_{μ} \end{matrix}]}^{- 1} [\begin{matrix} Y_{F} \\ Y_{\infty} \end{matrix}] with Y_{F} \in R^{ν \times m}, Y_{\infty} \in R^{μ \times m} [\begin{matrix} Y_{F} \\ Y_{\infty} \end{matrix}] = {[\begin{matrix} I_{ν} & ○ \\ ○ & J_{\infty}^{l - 1} \end{matrix}] Γ}^{- 1} [\begin{matrix} ○_{m} \\ ⋮ \\ I_{m} \end{matrix}] a n d Γ = [Γ_{1} Γ_{2}] = [\begin{matrix} X_{F} \\ \begin{matrix} ⋮ \\ X_{F} J_{F}^{l - 2} \end{matrix} \\ A_{0} X_{F} J_{F}^{l - 1} \end{matrix}| \begin{matrix} X_{\infty} J_{\infty}^{l - 2} \\ \begin{matrix} ⋮ \\ X_{\infty} \end{matrix} \\ - ⅀_{i = 0}^{l - 1} A_{l - i} X_{\infty} J_{\infty}^{l - 1 - i} \end{matrix}]

A pair of matrices

(X, T)

is called a right admissible pair of order

k

if

X \in C^{m \times k}

and

T \in C^{k \times k}

. Similarly, a pair

(V, Y)

with

Y \in C^{l \times m}

and

V \in C^{l \times l}

is a left admissible pair of order

l

. Here and elsewhere, mmm is fixed, and unless specified otherwise, admissible pairs are assumed to be right admissible. All notions defined below for right admissible pairs can be naturally reformulated for left admissible pairs. Two right admissible pairs

(X_{1}, T_{1})

and

(X_{2}, T_{2})

of the same order

p

are called similar if there exists an invertible

p \times p

matrix

Q

, such that

X_{1} = X_{2} Q

and

T_{1} = Q^{- 1} T_{2} Q

[25]. Let

(X_{1}, T_{1})

,

(X_{2}, T_{2})

be admissible pairs of orders

p_{1}

, and

p_{2}

with

p_{1} > p_{2}

. The pair

(X_{1}, T_{1})

is said to be an extension of

(X_{2}, T_{2})

(equivalently,

(X_{2}, T_{2})

is a restriction of

(X_{1}, T_{1})

) if there exists a full-rank

p_{1} \times p_{2}

matrix

S

, such that

X_{1} S = X_{2}

and

T_{1} S = S T_{2}

. A pair

(X, T)

is called a common restriction of a family of admissible pairs

(X_{j}, T_{j})

(j = 1, \dots, s)

, if each

(X_{j}, T_{j})

is an extension of

(X, T)

. A common restriction

(X_{0}, T_{0})

is called the greatest common restriction if it is an extension of every other common restriction in the family. For a matrix polynomial

A (λ)

, if

(X, T)

and

(T, Y)

are right and left admissible pairs, respectively, then it is evident that

A (X, T) = ⅀_{i = 0}^{l} A_{i} X T^{l - i}

and

A (T, Y) = ⅀_{i = 0}^{l} T^{l - i} Y A_{i}

.

Next, we recall some basic facts from the spectral theory of matrix polynomials. If

A (λ)

is an

m \times m

monic matrix polynomial of degree

l

, a right standard pair

(T, Y)

is an admissible pair of order

l m

, such that the matrix

c o l {(X T^{i - 1})}_{i = 1}^{l}

is nonsingular and

A (X, T) : = ○

. Similarly, a left admissible pair

(T, Y)

of order

l m

with

\det (r o w {(T^{i - 1} Y)}_{i = 1}^{l}) \neq 0

is called a standard pair of

A (λ)

if

A (T, Y) : = ○

[2,19,25].

Another equivalent definition of standard pair is given in the following result:

Lemma 4

([12,19]). The admissible pair

(X, T)

is standard of order

l

for

A (λ)

iff

(X, T)

is standard of order

l

and the equation:

[\begin{matrix} Q_{l - 2} \\ W \end{matrix}] T (λ) = C_{l} (λ, A) Q_{l - 1}

holds, where

$C_{l} (λ, A) = λ I - A_{c}$ is the companion linearization of the matrix polynomial $A (λ)$ ,
$W = [A_{0} X_{1} {T_{1}}^{l - 1}, - ⅀_{k = 0}^{l - 1} A_{l - k} X_{2} {T_{2}}^{l - 1 - k}]$ , $Q_{k} = [{c o l (X_{1} T_{1}^{i})}_{i = 0}^{k} {c o l (X_{2} T_{2}^{k - i})}_{i = 0}^{k}]$ and
$T (λ) = (λ I - T_{1}) \oplus (λ T_{2} - I)$ .

A resolvent form for the regular matrix polynomial

A (λ)

is a representation

A^{- 1} (λ) = C T^{- 1} (λ) B,

where

T (λ)

is a linear matrix polynomial and

B, C

are matrices of appropriate size. As a consequence, any two regular matrix polynomials,

A (λ)

and

B (λ),

have the same standard pair

(X, T)

if and only if there exists a (constant) invertible matrix

K \in C^{m \times m}

, such that

B (λ) = K A (λ)

.

Theorem 15

([2,8]). Let the matrices

X = [X_{1}, X_{2}], T = T_{1} \oplus T_{2}

be a standard pair (finite–infinite) of order

l

for the regular matrix polynomial

A (λ) = ⅀_{i = 0}^{l} A_{i} λ^{l - i} \in R^{m \times m} [λ]

. Then,

A^{- 1} (λ) = C {[(λ I - T_{1}) \oplus (λ T_{2} - I)]}^{- 1} B

where the matrices

C

,

B

and

W

are given by

C = [X_{1}, X_{2} {T_{2}}^{l - 1}], B = {[\begin{matrix} Q_{l - 2} \\ W \end{matrix}]}^{- 1} {[○_{m}, \dots, I_{m}]}^{⟙}, W = [A_{0} X_{1} {T_{1}}^{l - 1}, - ⅀_{k = 0}^{l - 1} A_{l - k} X_{2} {T_{2}}^{l - 1 - k}] .

Proof.

We know that

C_{l}^{- 1} (λ, A) = Q_{l - 1} {[(λ I - T_{1}) \oplus (λ T_{2} - I)]}^{- 1} {[\begin{matrix} Q_{l - 2} \\ W \end{matrix}]}^{- 1}

; therefore,

A^{- 1} (λ) = C_{c} C_{l}^{- 1} (λ, A) B_{c} = [I_{m} \dots ○_{m}] Q_{l - 1} {[\begin{matrix} λ I - T_{1} & ○ \\ ○ & λ T_{2} - I \end{matrix}]}^{- 1} {[\begin{matrix} Q_{l - 2} \\ W \end{matrix}]}^{- 1} [\begin{matrix} \begin{matrix} ○_{m} \\ ⋮ \end{matrix} \\ I_{m} \end{matrix}] = C T^{- 1} (λ) B

□

The next theorem gives the explicit solutions of basic linear systems of differential (and/or difference) equations, using a given standard pair of the characteristic polynomial

A (λ)

. We shall assume throughout that

A (λ) = ⅀_{i = 0}^{l} A_{i} λ^{l - i}

, is a given regular matrix polynomial (

A_{i} \in C^{m \times m}

) with a given standard pair

(X = [X_{1}, X_{2}], T = T_{1} \oplus T_{2})

. It is also assumed that

T_{2}

is a nilpotent matrix, i.e.,

{T_{2}}^{ν} = 0

for some

ν \geq 0

. This is equivalent to stating that

T_{1} \in C^{n \times n}

where

n = \deg (\det (A (λ)))

. This condition can always be achieved by transforming our given standard pair

(X, T)

to a Jordan pair, via simple operations. In most applications, however,

(X, T)

is a Jordan pair to begin with, so that this condition holds.

Theorem 16

([19]). The general solution of the differential equation

⅀_{i = 0}^{l} A_{l - i} u^{(i)} (t) = f (t),

with

t \in R

and for a given smooth differentiable function

f (t)

is, as follows:

u (t) = X_{1} e^{T_{1} t} Z + \int_{0}^{t} X_{1} e^{T_{1} (t - s)} B_{1} f (s) d s - ⅀_{i = 0}^{ν - 1} X_{2} {T_{2}}^{i} B_{2} f^{(i)} (t)

(27)

For a given sequence of vectors

f_{k} \in C^{m}

, the general solution of the difference equation

⅀_{i = 0}^{l} A_{l - i} u_{i + k} = f_{k}, k = 0,1, \dots

is

u_{k} = X_{1} [{T_{1}}^{k} Z + ⅀_{j = 0}^{k - 1} {T_{1}}^{k - 1 - j} B_{1} f_{j}] - ⅀_{j = 0}^{ν - 1} {X_{2} {T_{2}}^{j} B}_{2} f_{j + k}

where

Z \in C^{n}

is an arbitrary vector,

T_{2}^{ν} = 0

and

A_{0} = I

.

4.3. Characterization of Solvents by Invariant Pairs

We now examine the matrix solvent problem as a special case of the invariant pair problem, applying to solvents some of the results previously established for invariant pairs. Let

A (λ) = ⅀_{i = 0}^{l} A_{i} λ^{l - i} \in R^{m \times m} [λ]

be a monic matrix polynomial. The associated right and left matrix difference equations are given by

U_{k} + A_{1} U_{k - 1} + \dots + A_{l} U_{k - l} = ○

,

V_{k} + V_{k - 1} A_{1} + \dots + V_{k - l} A_{l} = ○

where

U_{j} \in C^{m \times m}

,

V_{j} \in C^{m \times m}

,

j = 0,1, 2, \dots .

Theorem 17

([19,25]). Given a matrix polynomial

A (λ)

having

(X, T, Y)

as a standard triple, the general solution of

⅀_{i = 0}^{l} A_{i} U_{k - i} = ○

is, as follows:

U_{k} = X T^{k} C,

where

C \in C^{l m \times m},

and the general solution of

⅀_{i = 0}^{l} {V_{k - i} A}_{i} = ○

is, as follows:

V_{k} = D T^{k} Y,

where

D \in C^{m \times l m}

.

Proof.

By using the definition of a standard pair, the following identity is satisfied:

X T^{l} + A_{1} X T^{l - 1} + \dots + A_{l - 1} X T + A_{l} X = ○_{m \times k} ⟺ [A_{l}, \dots, A_{1}] [{c o l (X T^{i - 1})}_{i = 1}^{l}] = - [A_{0} X T^{l}]

. If we multiply on the right by

T^{k - l} C,

we obtain

X T^{k} C + A_{1} X T^{k - 1} C + \dots + A_{l} X T^{k - l} C = ○

and thus,

U_{k} = X T^{k} C

verifies the equation

⅀_{i = 0}^{l} A_{i} U_{k - i} = ○

. From the other side, the proof of

⅀_{i = 0}^{l} {V_{k - i} A}_{i} = ○

can be derived by using the fact that the standard triple of

{[A (λ)]}^{⟙}

is

(Y^{⟙}, T^{⟙}, X^{⟙})

. □

Corollary 3

([22]). The solution of the difference equation

⅀_{i = 0}^{l} A_{i} U_{k - i} = ○

with the initial conditions:

U_{0} = \dots = U_{l - 2} = ○

,

U_{l - 1} = I_{m}

is given by

U_{k} = X T^{k} Y

. The solution of the difference equation

⅀_{i = 0}^{l} V_{k - i} A_{i} = ○

with the initial conditions:

V_{0} = \dots = V_{l - 2} = ○

, and

V_{l - 1} = I_{m}

is given by

V_{k} = X T^{k} Y

.

Proof.

Using

U_{k} = X T^{k} Y

, we obtain the set of equations

{c o l (X T^{i - 1})}_{i = 1}^{k} Y = {[○_{m} \dots I_{m}]}^{⟙}

. □

Thus, for these initial conditions, the right and left difference equations yield the same solution.

Corollary 4

([22]). If the matrix polynomial

A (λ)

has a complete set of right solvents, then the solution of the matrix sequence

⅀_{i = 0}^{l} A_{i} U_{k - i} = ○

, subject to the initial conditions

U_{l - 1} = I_{m}

and

U_{i} = ○

,

i = 0, \dots, l - 2

, is given by

U_{k} = [R_{1}^{k} \dots R_{l}^{k}] Y = ⅀_{i = 1}^{l} R_{i}^{k} Y_{i}

with

Y = {c o l (Y_{i})}_{i = 1}^{l}

.

Proof.

We know that

A (λ)

admits the standard triple

(X, T, Y)

where

X = [I \dots I]

,

T = Λ_{R} = [\oplus_{i = 1}^{l} R_{i}]

and

Y = V_{R}^{- 1} \cdot {[○ \dots I]}^{⟙} = {c o l (Y_{i})}_{i = 1}^{l}

. Replacing in

U_{k} = X T^{k} Y,

we obtain

U_{k} = [R_{1}^{k} \dots R_{l}^{k}] Y = ⅀_{i = 1}^{l} R_{i}^{k} Y_{i}

. □

Theorem 18

([25]). Let

A (λ) \in C^{m \times m} [λ]

be an

l^{t h}

degree monic λ-matrix and consider an invariant pair

(X, T) \in C^{m \times k} \times C^{k \times k}

of (λ) (sometimes called admissible pairs). If the matrix

X

has size

m \times m

, i.e.,

k = m

, and is invertible, then

R = X T X^{- 1}

satisfies

A (R) = 0

, i.e.,

R

is a matrix solvent of

A (λ)

.

Proof.

As

(X, T)

is an invariant pair of

A (λ)

, we have, as follows:

⅀_{i = 0}^{l} A_{i} X T^{l - i} = ○

. Since

X

is invertible, we can multiply by

X^{- 1}

and obtain

⅀_{i = 0}^{l} A_{i} X T^{l - i} X^{- 1} = ○,

which is equivalent to

⅀_{i = 0}^{l} A_{i} R^{l - i} = ○

. Therefore,

R = X T X^{- 1}

is a matrix solvent of

A (λ)

, with

(X, T) \in C^{m \times m} \times C^{m \times m}

. □

Theorem 19

([19]). Let

A (λ) \in C^{m \times m} [λ]

be an

l^{t h}

degree monic λ-matrix, if

A (λ)

has a complete spectral factorization:

A (λ) = (λ I_{m} - Q_{l}) \dots (λ I_{m} - Q_{2}) (λ I_{m} - Q_{1})

then the pencile

λ I - T

is a linearization of

A (λ)

, that is, as follows:

[λ I - T] ~ [A (λ) \oplus I] o r [\begin{matrix} λ I_{m} - Q_{1} & - I_{m} & ○ \\ ⋱ \\ ⋱ & - I_{m} \\ ○ & λ I_{m} - Q_{l} \end{matrix}] ~ [\begin{matrix} A (λ) & ○ \\ ○ & I \end{matrix}] with T = [\begin{matrix} Q_{1} & I_{m} & ○ \\ ⋱ \\ ⋱ & I_{m} \\ ○ & Q_{l} \end{matrix}]

(28)

5. Determination of Operator Roots (Spectral Factors)

We next review several existing algorithms for factoring a linear term from a given matrix polynomial

A (λ) \in C^{m \times m} [λ]

. As will be shown later, the Q.D. algorithm can be interpreted as a generalization of these approaches. Following this, we introduce a new optimization-based algorithm [35,36,37].

A. Bernoulli’s Method is sufficient to find only the dominant linear spectral factor at each iteration, and in order to find all of them, we use synthetic long division. To convert spectral factors to solvents, we use some algorithmic conversion methods.

Theorem 20

([19,22]). Let A(λ) be a monic matrix polynomial of degree

l

and order

m

. Assume that

A (λ)

has a dominant right solvent

R

and a dominant left solvent

L

. Let the sequence

U_{k}

,

k = 0,1, \dots

be the solution of

⅀_{i = 0}^{l} A_{i} U_{k - i} = ○,

subject to the initial conditions

U_{i}

,

i = 0,1, \dots l - 2

and

U_{l - 1} = I

. Then, the matrix

U_{k}

is not singular for

k

or large enough, and

\underset{k \to \infty}{l i m} U_{k + 1} U_{k}^{- 1} = R

and

\underset{k \to \infty}{l i m} U_{k}^{- 1} U_{k + 1} = L

.

Proof.

We have seen that

U_{k} = ⅀_{i = 1}^{l} R_{i}^{k} Y_{i} = R_{1}^{k} Y_{1} + ⅀_{i = 2}^{l} R_{i}^{k} Y_{i}

, where

R_{1}

is a dominant right solvent

U_{k} = [I + ⅀_{i = 2}^{l} R_{i}^{k} Y_{i} Y_{1}^{- 1} R_{1}^{- k}] R_{1}^{k} Y_{1} = [I + H_{k}] R_{1}^{k} Y_{1}

, and

\underset{k \to \infty}{l i m} ‖H_{k}‖

converge toward zero. Thus, for large enough

k

,

U_{k}

is nonsingular and we can write:

\underset{k \to \infty}{l i m} U_{k + 1} U_{k}^{- 1} = \underset{k \to \infty}{l i m} [I + H_{k + 1}] R_{1}^{k + 1} Y_{1} Y_{1}^{- 1} R_{1}^{- k} {[I + H_{k}]}^{- 1} = R

(29)

If there is some nonsingular

U_{k}

then the Bernoulli method will breakdown. □

Now, consider the generalized Bernoulli’s method (Algorithm 1) with the matrix polynomial as follows:

A (λ) = A_{0} λ^{3} + A_{1} λ^{2} + A_{2} λ + A_{3}

.

Algorithm 1. Generalized Bernoulli’s Method

1 Enter the number of iterations

N

,

I = e y e (2,2);

A_{2} = (1 / 3) [- 13 - 4; 4 - 23];

A_{1} = [4 4; - 4 14];

2

U_{1} = z e r o s (2,2);

U_{2} = e y e (2,2);

% Initialization
3 For

k = 1 : N

4

U_{0} = - (A_{2} ⋆ U_{1} + A_{1} ⋆ U_{2});

%

U_{0} = U_{k}, U_{1} = U_{k - 1}, U_{2} = U_{k - 2}

5

U_{2} = U_{1};

U_{1} = U_{0};

R_{1} = U_{1} ⋆ i n v (U_{2});

% Update

U_{k}

and evaluate

R_{1}

6 End,

R_{1}

B. Block Power Method: The block power method will be used to compute the block eigenvectors and associated block eigenvalues of a companion matrix $A_{c}$ . The block eigenvalue is also the solvent of the characteristic matrix polynomial of $A_{c}$ .

Theorem 21

([22]). Let

A (λ) \in R^{m \times m} [λ]

be a monic matrix polynomial of degree

l

and order

m

. We can associate this

A (λ)

by the recursive equation

⅀_{i = 0}^{l} A_{i} U_{k - i} = ○

. The Bernoulli’s iteration can be written as

X_{k + 1} = A_{c} X_{k}

with

X_{k} = c o l {(U_{k + i})}_{i = 0}^{l - 1}

. The general solution of this matrix difference equation is

X_{k} = A_{c}^{k} X_{0}

with

X_{0} = [○, ○, \dots, I]

. The dominant right solvent

R = \underset{k \to \infty}{l i m} X_{k + 1} (1 : m, :) X_{k}^{- 1} (1 : m, :)

.

C. Matrix Horner’s Method efficiently evaluates an $l$ -degree polynomial using only $l$ multiplications and $l$ additions, by expressing it in nested form via synthetic division. We now give an extended version of this nested scheme to matrix polynomials [28].

Theorem 22.

Let us define the functional

A (λ) \in C^{m \times m} [λ]

to be the matrix polynomial of degree

l

and order

m

over the complex field

A (λ) : C \to C^{m \times m} [λ],

where the matrices

A_{i} \in C^{m \times m}

are some constant matrix coefficients and

λ

is a complex variable, with

A (λ) = ⅀_{i = 0}^{l} A_{i} λ^{l - i}

. Let

A_{R} (X_{k}) = ⅀_{i = 0}^{l} A_{i} X_{k}^{l - i}

,

(k = 0,1, \dots)

be the right functional evaluation of

A (λ)

by the sequence of

m \times m

square matrices

{\{X_{k}\}}_{k = 0}^{\infty}

. The solution

X (k)

of matrix polynomial

A (λ)

converges iteratively to the exact solution if

X (k + 1) = - {(B_{l - 1} (k))}^{- 1} B_{0} A_{l}, B_{0} = A_{0}, k = 1,2, \dots w i t h B_{i} (k) = B_{0} A_{i} + B_{i - 1} (k) X (k), i = 1,2, \dots l - 1

(30)

Proof.

Divide

A (λ)

on the right by

(λ I - X),

we obtain

A (λ) = B (λ) (λ I - X) + A_{R} (X)

(i.e., remainder theorem), which means that

A (λ) = [⅀_{i = 0}^{l - 1} B_{i} λ^{l - i - 1}] (λ I - X) + A_{R} (X)

. If we set

B_{l} = A_{R} (X)

, and we expand this last formula of the functional

A (λ) \in C^{m \times m} [λ],

we obtain:

A (λ) = B_{0} λ^{l} + (B_{0} A_{1} + B_{0} X) λ^{l - 1} + (B_{0} A_{2} + B_{1} X) λ^{l - 2} + \dots + B_{l}

. By identifying the coefficients of different powers, we obtain:

B_{i} (k) = B_{0} A_{i} + B_{i - 1} (k) X (k), i = 1,2, \dots l - 1

. Since

X

is a right operator root of

A (λ)

, this means that

B_{l} = A_{R} (X) = ○,

and from this last equation of

B_{i}

, we can deduce that

B_{l} = B_{0} A_{l} + B_{l - 1} X = ○

; in other words,

X = - B_{l - 1}^{- 1} B_{0} A_{l}

. If we iterate these, we arrive at

X (k + 1) = - {(B_{l - 1} (k))}^{- 1} B_{0} A_{l}

with

B_{i} (k) = B_{0} A_{i} + B_{i - 1} (k) X (k); B_{0} = A_{0}; i = 1,2, \dots l - 1; k = 1,2, \dots

Based on the iterative Horner’s sachem (Algorithm 2), we can redo the process many more times to obtain a solution

S = \underset{k \to \infty}{l i m} X_{k}

, and the theorem is proved. □

Algorithm 2. Block Horner’s Method

1 Specify the number of iterations

N

2 For

k = 0 : N

3 Input the degree and the order

m, l

and the coefficients

A_{i} \in R^{m \times m}

4

X (0) \in R^{m \times m}

is the initial guess; and let

B_{0} = B_{0} (0) = A_{0}

5 For

i = 1 : l - 1

,

B_{i} (k) = B_{0} A_{i} + B_{i - 1} (k) X (k)

;

B_{i - 1} (k) = B_{i} (k)

End
6

X (k + 1) = - {(B_{l - 1} (k))}^{- 1} B_{0} A_{l}

;

X (k + 1) = X (k)

7 End

Now, we introduce a new version of the block Horner’s method, which is an efficient algorithm for the convergence study; after back substitution of the sequence

B_{i} (k) = B_{0} A_{i} + B_{i - 1} (k) X (k),

we obtain the following:

B_{l - 1} (k) = ⅀_{i = 0}^{l - 1} A_{i} {(X (k))}^{l - i - 1} = [A_{R} (X (k)) - A_{l}] {X (k)}^{- 1}

, so, by using the last theorem and substituting

B_{l - 1} (k)

into the equation of

X (k + 1),

we obtain:

X (k + 1) = X (k) {[A_{l} - A_{R} (X (k))]}^{- 1} A_{l}

. The following corollary is an immediate consequence of the above theorem.

Corollary 5

([22]). Let the function

A (λ) \in C^{m \times m} [λ]

be a monic matrix polynomial of degree

l

. Assume that

A (λ)

has an operator root

R \in C^{m \times m}

, let the sequence

{\{X_{k}\}}_{k = 0}^{\infty} \in C^{m \times m}

and

A_{R} (X_{k})

be the right functional evaluation of

A (λ)

, which means that

A_{R} (X_{k}) = ⅀_{i = 0}^{l - 1} A_{i} X_{k}^{l - i}

for

k = 0,1, \dots

If the square matrix

[A_{l} - A_{R} (X_{k})]

is invertible for each given value of

k

, then the sequence of

m \times m

matrices

{\{X_{k}\}}_{k = 0}^{\infty}

converges linearly to the operator root

R \in C^{m \times m}

(i.e.,

R = \underset{k \to \infty}{l i m} X_{k}

) under the condition:

X_{k + 1} = X_{k} \cdot {[A_{l} - A_{R} (X_{k})]}^{- 1} \cdot A_{l}, k = 0, 1, \dots

where

X_{0} = ξ \in C^{m \times m}

is any arbitrary initial guess.

Here is the extended block Horner’s method (Algorithm 3) for any monic matrix polynomial as follows:

A (λ) = A_{0} λ^{l} + A_{1} λ^{l - 1} + \dots + A_{l - 1} λ + A_{l}

.

Algorithm 3. Extended Block Horner’s Method

1 Specify the degree and the order

m, l

2 Input the matrix polynomial coefficients

A_{i} \in R^{m \times m}

3 Provide an initial guess

X_{0} \in R^{m \times m}

;
4 Set a small threshold

η

(initial tolerance) and initialize

k = 0

5 While

δ \geq η

6

A_{R} (X_{k}) = A_{0} {X_{k}}^{l} + A_{1} {X_{k}}^{l - 1} + \dots + A_{l - 1} X_{k} + A_{l}

7

X_{k + 1} = X_{k} {[A_{l} - A_{R} (X_{k})]}^{- 1} A_{l}

8

δ = 100 \times ‖X_{k + 1} - X_{k}‖ / ‖X_{k}‖

9

X_{k} \leftarrow X_{k + 1}

;

k \leftarrow k + 1

10 End

D. Matrix Broyden’s Method: In the iterative form of Newton’s method, we know that $J (x_{k}) ∆ x_{k} = f (x_{k + 1}) - f (x_{k})$ , and from the other side, we know that ${(∆ x_{k})}^{⟙} ∆ x_{k} = {‖∆ x_{k}‖}^{2}$ so $J (x_{k}) ∆ x_{k} = ∆ f (x_{k}),$ or equivalently

$J (x_{k}) = J (x_{k - 1}) + [\frac{∆ f (x_{k}) - J (x_{k - 1}) ∆ x_{k}}{{‖∆ x_{k}‖}^{2}}] {(∆ x_{k})}^{⟙}$

(31)

The computation of

J^{- 1} (x_{k})

occupies a space in memory and this can be avoided if we calculate it iteratively at each step by using the Sherman–Morrison–Woodbury formula

{(A_{k} + u_{k} v_{k}^{⟙})}^{- 1} = A_{k}^{- 1} + [A_{k}^{- 1} u_{k} v_{k}^{⟙} A_{k}^{- 1}] / (1 + v_{k}^{⟙} A_{k}^{- 1} u_{k})

with

(1 + v_{k}^{⟙} A_{k}^{- 1} u_{k}) \neq 0

. Define the variables:

A_{k} = J (x_{k})

,

u_{k} = [(∆ f (x_{k}) - J (x_{k - 1}) ∆ x_{k}) / {‖∆ x_{k}‖}^{2}]

,

v_{k} = ∆ x_{k}

, and

B_{k} = {[J (x_{k})]}^{- 1},

then it can be verified that

B_{k} = B_{k - 1} + [(∆ x_{k} - B_{k - 1} \cdot ∆ f (x_{k})) / ({(∆ x_{k})}^{⟙} \cdot B_{k - 1} \cdot ∆ f (x_{k}))] \cdot {(∆ x_{k})}^{⟙} \cdot B_{k - 1}

(32)

This method converges without any conditions (i.e., symmetry positive-definiteness dominance, etc.). There are no constraints on the values that

x

can take. Algorithm 4 begins at an initial estimate for the optimal value

x_{0}

and proceeds iteratively to obtain a better estimate at each stage.

Algorithm 4. Extended Broyden’s Method

1 Data:

f (x)

,

x_{0}

,

f (x_{0})

and

B_{0}

2 Result:

x_{k}

3 Begin:
4

x_{k + 1} = x_{k} - B_{k} f (x_{k})

5

B_{k + 1} = B_{k} + [\frac{s_{k} - B_{k} \cdot y_{k}}{s_{k}^{⟙} \cdot B_{k} \cdot y_{k}}] s_{k}^{⟙} B_{k}

6

y_{k} = f (x_{k + 1}) - f (x_{k}); s_{k} = x_{k + 1} - x_{k}

7 End

Determine the solution of

F (X) = ○

where

F (X) = X A_{1} - A_{4} X - X A_{2} X + A_{3},

where

X \in R^{2 \times 2}

is the variable matrix to be determined and

A_{i} \in R^{2 \times 2}

are constant matrices. The solution to such a problem is given by the program in Table 1.

E. Matrix Newton’s Method: The matrix $A_{R} (X)$ is the right evaluation of $A (λ)$ at $X$ , and $A (λ)$ is a nonlinear operator that maps the space of square $m \times m$ matrices onto itself. Since the space of complex $m \times m$ square matrices is a Banach space under any matrix norm, we can use powerful results from functional analysis. This space is also a finite-dimensional space, and as such, the equation $A_{R} (X) = ○$ is a set of $m^{2}$ nonlinear equations with $m^{2}$ unknowns. Let $A_{R} (X) = ⅀_{i = 0}^{l} A_{i} X^{l - i} \in C^{m \times m}$ with $(i = 1, \dots, l)$ as a matrix polynomial. We present a Newton method to solve the equation $A_{R} (X) = B$ , and we prove that the algorithm converges quadratically near simple solvents.

Theorem 23

([29]). Let

A (λ) = ⅀_{s = 0}^{l} A_{s} λ^{l - s} \in C^{m \times m} [λ]

in an arbitrary monic

λ

-matrix. If

R \in C^{m \times m}

is a simple solvent of

A (λ)

and if the starting matrix

X_{0}

is sufficiently near

R

, then the iteration:

v e c (X_{i + 1} - X_{i}) = {[⅀_{k = 1}^{l} {(X_{i}^{l - k})}^{⟙} ⨂ ⅀_{s = 0}^{k} {(A_{s} X_{i}^{k - s - 1})}^{⟙}]}^{- 1} [v e c (⅀_{s = 0}^{l} A_{s} λ^{l - s})]

for

i = 1,2, \dots

converges quadratically to

R

. More precisely: if

‖X_{0} - R‖ = ε_{0} < ε

, and for sufficiently small

ε

and

δ

, then we have, as follows:

▪: $\underset{k \to \infty}{l i m} X_{k} = R$ and $‖X_{0} - R‖ = ε_{0} q^{i}$ with $q < 1$ for $i = 1,2, \dots$
▪: $‖X_{i + 1} - R‖ = δ {‖X_{i} - R‖}^{2}$ for $i = 1,2, \dots$

Let us now consider the generalized Newton’s algorithm (Algorithm 5). The considered matrix polynomial is, as follows:

A (λ) = A_{0} λ^{3} + A_{1} λ^{2} + A_{2} λ + A_{3}

.

Algorithm 5. Generalized Newton’s Method

1 Enter the number of iterations

N

and let

X_{0} \in R^{m \times m}

be an arbitrary initial guess.
2 For

k = 1 : N

3

B_{1} = A_{0}

;

B_{2} = A_{0} X_{k} + A_{1}

;

B_{3} = A_{0} X_{k}^{2} + A_{1} X_{k} + A_{2}

;
4

A_{R} = A_{0} X_{k}^{3} + A_{1} X_{k}^{2} + A_{2} X_{k} + A_{3}

;
5

v e c A = [A_{R} (:, 1); A_{R} (:, 2)];

6

G = k r o n ({(X^{2})}^{⟙}, B_{1}) + k r o n (X^{⟙}, B_{2}) + k r o n (I^{⟙}, B_{3})

;
7

v e c X = - (G^{- 1}) v e c A

;
8

d X = [v e c X (1 ∶ 2, ∶), v e c X (3 ∶ 4, ∶)]

;

X_{k + 1} = X_{k} + d X

;
9 End,

X_{k}

F. Matrix Quotient–Difference Method: The matrix quotient–difference (Q.D.) algorithm extends the scalar method originally proposed by Henrici in 1958. Its use for matrix polynomials was first suggested by Hariche in 1987 and later formalized by Bekhiti in 2018 through recurrence relations defining the Q.D tableau, initialized as follows: $Q_{1}^{(0)} = - A_{1} A_{0}^{- 1}, Q_{i}^{(0)} = ○, a n d E_{i - 1}^{(0)} = A_{i} A_{i - 1}^{- 1}, f o r i = 2,3, \dots l$ . These equations yield the first two rows of the Q.D. tableau—one row for the Q’s and one for the $E$ ’s. Using the rhombus rules, the bottom element (referred to as the south element by Henrici [22]) can be computed. This leads to the row generation procedure of the Q.D. algorithm:

$Q_{i}^{(k + 1)} = Q_{i}^{(k)} + E_{i}^{(k)} - E_{i - 1}^{(k + 1)}, E_{i}^{(k + 1)} = Q_{i + 1}^{(k)} E_{i}^{(k)} {[Q_{i}^{(k + 1)}]}^{- 1} a n d E_{0}^{(k)} = E_{l}^{(k)} = ○ f o r \begin{matrix} i = 1, \dots l - 1 \\ k = 1,2, \dots \end{matrix}$

(33)

Here, the

Q_{i}^{(k)}

represent the spectral factors of

A (λ)

. Notably, the Q.D. algorithm computes all spectral factors simultaneously and in dominance order. The row generation variant is chosen here due to its superior numerical stability. Consider a matrix polynomial of second-order and third-degree

A (λ) = A_{0} λ^{3} + A_{1} λ^{2} + A_{2} λ + A_{3}

. We now apply the generalized row generation Q.D. scheme (Algorithm 6) to compute its spectral factors.

Algorithm 6. Generalized Quotient–Difference Method

1 Specify the degree and the order of

A (λ)

:

m = 2, l = 3

2 Input the number of iterations

N = 35

3 Give the matrix polynomial coefficients

A_{i}

4 Let

Q_{1} = [- A_{1} A_{0}^{- 1} ○_{2} ○_{2}]; E_{1} = [○_{2} A_{2} A_{1}^{- 1} A_{3} A_{2}^{- 1} ○_{2}];

n = m ⋆ l

; % initialization
5 For

n = 1 : N

,

E_{2} = []; Q_{2} = [];

6 For

k = 1 : 2 : n

,

q_{2} = [E_{1} (:, k + 2 : k + 3) - E_{1} (:, k : k + 1)] + Q_{1} (:, k : k + 1); Q_{2} = [Q_{2}, q_{2}];

End,

Q_{2};

7 For

k = 1 : 2 : n - 2,

e_{2} = [Q_{2} (:, k + 2 : k + 3)] ⋆ [E_{1} (:, k : k + 1)] ⋆ {[Q_{2} (:, (k : k + 1))]}^{- 1}; E_{2} = [E_{2}, e_{2}];

End
8

E_{2} = [○_{2}, E_{2}, ○_{2}]; Q_{1} = Q_{2}; E_{1} = E_{2};

9 End,

Q_{1};

S_{1} = Q_{1} (:, 1 : 2) S_{2} = Q_{1} (:, 3 : 4) S_{3} = Q_{1} (:, 5 : 6)

It is noteworthy that, since a matrix polynomial admits both left and right evaluations, there are two corresponding Q.D. algorithms: one for right factorization and one for left factorization. Accordingly, we provide two subroutines—QDRF and QDLF—for right and left factorization, respectively. These subroutines (Table 2) are direct implementations of the corresponding formulas.

G. Spectral Factors by Optimization Algorithms: Spectral factorization of matrix polynomials can be formulated as a nonlinear optimization problem. Given a monic polynomial $A (λ) = ⅀_{i = 0}^{l} A_{i} λ^{l - i},$ the objective is to determine a factor $X,$ such that $⅀_{i = 0}^{l} A_{i} X^{l - i} = ○$ or $A (λ) = (λ I - X) B (λ)$ . This is achieved by minimizing the Frobenius-based cost functional $J = {‖⅀_{i = 0}^{l} A_{i} X^{l - i}‖}_{F}^{2},$ which is convex in the least-squares sense. Algorithm 7 initializes with the coefficient matrices $A_{i}$ and iteratively updates $X$ using a gradient-based solver (e.g., fminunc), yielding numerical spectral factors without explicit algebraic decomposition.

Algorithm 7. Spectral Factors by Optimization

1 Function solvepoly
2

m = 4

;

l = 3

;
3 For

i = 1 : l + 1

,

A (:, :, i) = r a n d n (m)

; End % Make some random matrices

A_{i}

4

[x f, f v a l] = f m i n u n c (@ d o C o s t, z e r o s (m))

5 Function

C O S T = d o C o s t (x)

6

C O S T = z e r o s (m, m)

;
7 For

i = 1 : l + 1

,

C O S T = C O S T + A (:, :, i) ⋆ x^{l - i + 1}

; End,

C O S T = s u m ({C O S T (:)}^{. 2})

;
8 End
9 End

6. Transformations Between Solvents and Spectral Factors

The relationships between solvents and spectral factors of a high-degree matrix polynomial are introduced. Various transformations which convert right (left) solvents into spectral factors and vice-versa are also given here in this section. The transformation of right (left) solvent to left (right) solvent is also established.

▪: Right (Left) Solvents to Spectral Factors: Since the diagonal forms of a complete set of solvents $Λ_{R} = V_{R}^{- 1} A_{c} V_{R} = b l k d i a g (R_{1}, \dots, R_{l})$ and those of a complete set of spectral factors $Λ_{Q} = b l k d i a g (Q_{1}, \dots, Q_{l})$ are identical, then they are related by similarity transformations $Λ_{R} = P Λ_{Q} P^{- 1}$ .

Theorem 24

([22]). If we let

{\{R_{i}\}}_{i = 1}^{l}

be a complete set of right solvents of a monic

λ

-matrix

A (λ)

, then

A (λ)

can be factored as follows:

A (λ) = N_{l} (λ) = (λ I - Q_{l}) \dots (λ I - Q_{1})

, and by using the scheme:

Q_{k} = N_{k - 1} (R_{k}) R_{k} {[N_{k - 1} (R_{k})]}^{- 1}

and

N_{k} (λ) = (λ I - Q_{k}) N_{k - 1} (λ),

we can write

N_{k} (R_{j}) = N_{k - 1} (R_{j}) R_{j} - Q_{k} N_{k - 1} (R_{j})

for

k = 1, \dots, l

, and for any

j

with

N_{0} (λ) = I, N_{0} (R_{j}) = I

and

r a n k [N_{k - 1} (R_{j})] = m

for

k = 1,2, \dots, l

. Similarly, the

λ

-matrix can be factored into a product of linear factors as:

A (λ) = M_{l} (λ) = (λ I - Q_{1}) (λ I - Q_{2}) \dots (λ I - Q_{l})

using the following scheme:

Q_{k} = {[M_{k - 1} (L_{k})]}^{- 1} L_{k} M_{k - 1} (L_{k}),

where

M_{k} (λ) = M_{k - 1} (λ) (λ I - Q_{k})

for

k = 1, \dots, l

and for any

j

we write

M_{k} (L_{j}) = L_{j} M_{k - 1} (L_{j}) - M_{k - 1} (L_{j}) Q_{k}

for

k = 1, \dots, l

, with

M_{0} (λ) = I, M_{0} (L_{j}) = I

for any

j

and

r a n k [M_{k - 1} (L_{j})] = m

for

k = 1,2, \dots, l

.

▪: Spectral Factors to Right (Left) Solvents: If a complete set of spectral factors of a λ-matrix is available or can be determined without relying on latent vectors [35,36], they can be systematically transformed into a complete set of right (or left) solvents. The derivation of this transformation is outlined as follows.

Theorem 25

([22]). Given a monic

λ

-matrix

A (λ) \in R^{m \times m}

with all elementary divisors being linear

A (λ) = ℿ_{i = 0}^{l - 1} (λ I - Q_{l - i}),

where

Q_{k} (≜ Q_{l + 1 - k}) k = 1, \dots, l

is a complete set of spectral factors of

A (λ)

, and

σ (Q_{i}) ⋂ σ (Q_{j}) = \emptyset

. For

k = 1, \dots, l

we define a new

λ

-matrice,

N_{k} (λ)

, as follows:

N_{k} (λ) = I_{m} λ^{l - k} + A_{1 k} λ^{l - k - 1} + . . . + A_{(l - k - 1) k} λ + A_{(l - k) k} = {(λ I - Q_{k})}^{- 1} N_{k - 1} (λ)

with

N_{0} = A (λ),

then the transformation matrix

P_{k}

(r a n k (P_{k}) = m)

which transforms the spectral factor

Q_{k} (≜ Q_{l + 1 - k})

into the right solvent

R_{k} (≜ R_{l + 1 - k})

of

A (λ)

can be constructed from a new algorithm as follows:

R_{k} ≜ R_{l + 1 - k} = P_{k} Q_{k} P_{k}^{- 1} k = 1, \dots, l

, where the

m \times m

matrix

P_{k}

can be solved from the equation

: v e c (P_{k}) = {[G_{k} (Q_{k})]}^{- 1} v e c (I_{m})

with

k = 1, \dots, m

and

(r a n k [G_{k} (Q_{k})] = m^{2}),

where

G_{k} (Q_{k})

is defined by

G_{k} (Q_{k}) = ⅀_{i = k}^{l} {(Q_{k}^{l - i})}^{⟙} \otimes A_{(i - k) k}

.

Theorem 25 is more efficiently interpreted as shown in Algorithms 8 and 9.

Algorithm 8.

Q_{i}

to right solvents

R_{i}

1 Given

A_{1}

,

A_{2}

, …,

A_{l}

and

Q_{1}

,

Q_{2}

, …,

Q_{l}

2 For

i = 1 : l

3

N_{i 0} = I_{m} & X_{i} = Q_{l - i + 1}

% flipping the order
4 For

j = 1 : l - i

5

N_{i j} = A_{j} + X_{i} N_{i (j - 1)}

;

G_{i} = ⅀_{j = i}^{l} {[X_{i}^{l - j}]}^{⟙} \otimes N_{i (j - i)}

;
6

A_{j} = N_{i j}

;
7 End,

v e c (P_{i}) = {[G_{i}]}^{- 1} v e c (I_{m})

;

R_{i} = P_{i} X_{i} P_{i}^{- 1}

8 End

Algorithm 9.

Q_{i}

to left solvents

L_{i}

1 Given

A_{1}

,

A_{2}

, …,

A_{l}

and

Q_{1}

,

Q_{2}

, …,

Q_{l}

2 For

i = 1 : l

3

M_{i 0} = I_{m} & X_{i} = Q_{i}

% don’t flip the order
4 For

j = 1 : l - i

5

M_{i j} = A_{j} + M_{i (j - 1)} X_{i}

;

H_{i} = ⅀_{j = i}^{l} {[M_{i (j - i)}]}^{⟙} \otimes (X_{i}^{l - j})

;
6

A_{j} = M_{i j}

;
7 End,

v e c (S_{i}) = {[H_{i}]}^{- 1} v e c (I_{m})

;

L_{i} = S_{i}^{- 1} X_{i} S_{i}

8 End

▪: Transformation between Left and Right Solvents: For design and analysis of large-scale multivariable systems, it is useful to determine a complete set of solvents of the matrix polynomial. Given a $λ$ -matrix $A (λ)$ , if a right solvent R is obtained, then the left solvent of $L$ of $A (λ)$ can be determined algorithmically [37]. Let $A (λ) = (λ I - L) E (λ) = F (λ) (λ I - R) ⟹ (λ I - L) = F (λ) (λ I - R) E^{- 1};$ first of all, we can conclude that there is a similarity transformation between right and left solvents. If we let $λ = 0$ , then $L = F (0) R E^{- 1} (0)$ , but $E (λ)$ and $F (λ)$ are not provided, so, we must think of an algorithmic procedure to find such similarities in between. Let $L_{k} = W_{k}^{- 1} R_{k} W_{k},$ where $r a n k (W_{k}) = m,$ and we want to find a recursive scheme, such that $R_{k} \leftarrow W_{k} \to L_{k}$ . The transformation is shown in Algorithms 10 and 11.

Algorithm 10. right solvents

R_{i}

to left solvents

L_{i}

1 Given

A_{1}

,

A_{2}

, …,

A_{l}

and

R_{1}

,

R_{2}

, …,

R_{l}

2 For

k = 1 : l

,

B_{k 0} = I_{m}

3 For

i = 1 : l - 1

4

B_{k i} = B_{k 0} A_{i} + B_{k (i - 1)} R_{k}

,

⅀_{i = 0}^{l - 1} (R_{k}^{l - 1 - i} W_{k} B_{k i}) = I_{m}

5

v e c (W_{k}) = {[⅀_{i = 0}^{l - 1} (B_{k i}^{⟙} \otimes R_{k}^{l - 1 - i})]}^{- 1} v e c (I_{m})

6 End,

L_{k} = W_{k}^{- 1} R_{k} W_{k}

7 End

Algorithm 11. left solvents

L_{i}

to right solvents

R_{i}

1 Given

A_{1}

,

A_{2}

, …,

A_{l}

and

L_{1}

,

L_{2}

, …,

L_{l}

2 For

k = 1 : l

,

C_{k 0} = I_{m}

3 For

i = 1 : l - 1

4

C_{k i} = A_{i} C_{k 0} + L_{k} C_{k (i - 1)}

,

⅀_{i = 0}^{l - 1} ({C_{k i} V}_{k} L_{k}^{l - 1 - i}) = I_{m}

5

v e c (V_{k}) = {[⅀_{i = 0}^{l - 1} {[L_{k}^{l - 1 - i}]}^{⟙} \otimes C_{k i}]}^{- 1} v e c (I_{m})

6 End,

R_{k} = V_{k} L_{k} V_{k}^{- 1}

7 End

7. MFD Realization and Transformation Between Canonical Forms

A rational matrix function

H (λ)

is the quotient of two matrix polynomials in

λ

. A strictly proper left form is

H (λ) = {[D_{L} (λ)]}^{- 1} N_{L} (λ),

where

D_{L} (λ) = I λ^{l} + ⅀_{k = 1}^{l} D_{L k} λ^{l - k}

and

N_{L} (λ) = ⅀_{k = 0}^{l} N_{L k} λ^{l - k}

. Similarly, the right form is

H (λ) = N_{R} (λ) {[D_{R} (λ)]}^{- 1},

where

D_{R} (λ) = I λ^{l} + ⅀_{k = 1}^{l} D_{R k} λ^{l - k}

and

N_{R} (λ) = ⅀_{k = 0}^{l} N_{R k} λ^{l - k}

. The function

H (λ)

can also be expressed as follows:

H (λ) = [a d j [D_{L} (λ)] N_{L} (λ)] / ∆_{L} (λ) = [N_{R} (λ) a d j [D_{R} (λ)] / ∆_{R} (λ)]

. For an irreducible

H (λ) \in R^{p \times m} (λ)

, the denominators are coprime with the numerators, and the roots of

∆_{L} (λ) = \det (D_{L} (λ))

or roots of

∆_{R} (λ) = \det (D_{R} (λ))

are called the poles of

H (λ)

. Furthermore,

∆_{L} (λ) = ∆_{R} (λ)

and

a d j [D_{L} (λ)] N_{L} (λ) = N_{R} (λ) a d j [D_{R} (λ)]

[21,22,38].

▪: State-Space from Left MFD: A state-space realization of a proper rational left $λ$ -matrix $\{H (λ) = {[D_{L} (λ)]}^{- 1} N_{L} (λ)\} \in R^{p \times m} (λ)$ is given by

$\dot{x} (t) = A x (t) + B u (t); y (t) = C x (t) + D u (t) with x \in R^{n}; u \in R^{m}; y \in R^{p}; n = l m$

(34)

where A is the transition matrix with dim = n × n. B is the input matrix with dim = n × m. C is the observation matrix with dim = m × n. D is the direct term matrix with dim = m × m.

$A = [\begin{matrix} ○ & I_{m} & ○ \\ ⋮ & ⋱ \\ ○ & ○ & I_{m} \\ - D_{L l} & \dots & - D_{L 1} \end{matrix}]; B = {[\begin{matrix} I_{m} & ○ \\ D_{L 1} & I_{m} \\ ⋮ & ⋱ \\ D_{L (l - 1)} & \dots & D_{L 1} & I_{m} \end{matrix}]}^{- 1} [\begin{matrix} N_{L 1} - D_{L 1} N_{L 0} \\ N_{L 2} - D_{L 2} N_{L 0} \\ \begin{matrix} ⋮ \\ N_{L l} - D_{L l} N_{L 0} \end{matrix} \end{matrix}]; C = [I, ○, \dots, ○]; D = N_{L 0}$

(35)

An alternative way of determining the matrix coefficients of the LMF is

A = [\begin{matrix} ○ & \dots & ○ & - D_{L l} \\ I_{m} & ○ & ⋮ \\ ⋱ & - D_{L 2} \\ ○ & I_{m} & - D_{L 1} \end{matrix}]; B = [\begin{matrix} N_{L l} - D_{L l} N_{L 0} \\ \begin{matrix} \begin{matrix} ⋮ \\ N_{L 2} - D_{L 2} N_{L 0} \end{matrix} \end{matrix} \\ N_{L 1} - D_{L 1} N_{L 0} \end{matrix}]; C = [○, ○, \dots, I]; D = N_{L 0}

(36)

▪: Left MFD from State-Space: A minimal realization of the state-space system, described by {A, B, C, D}, can be represented by a proper rational left $λ$ -matrix as follows: $H (λ) = {[D_{L} (λ)]}^{- 1} N_{L} (λ)$ , if the state-space realization fulfills the dimensional requirement that the state dimension n, divided by the number of channels $l$ , equals an integral value k. The matrix coefficients of the rational left $λ$ -matrix, which all have the dimensions m × m, are then given by

$[D_{L l}, \dots, D_{L 1}] = - C A^{l} {[c o l {(C A^{i})}_{i = 0}^{l - 1}]}^{- 1}; [N_{L l}, \dots, N_{L 0}] = [D_{L l}, \dots, D_{L 1}, I] [\begin{matrix} D & ○ & \dots & ○ & ○ \\ C B & D \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ C A^{l - 2} B & C A^{l - 3} B & D & ○ \\ C A^{l - 1} B & C A^{l - 2} B & \dots & C B & D \end{matrix}]$

(37)

An alternative way of determining the matrix coefficients of the LMF is

[D_{L l} \dots D_{L 1}] = - C A^{l} Ω_{o}^{- 1}; [\begin{matrix} \begin{matrix} N_{L l} \\ ⋮ \end{matrix} \\ N_{L 1} \end{matrix}] = [\begin{matrix} D_{L (l - 1)} & \dots & D_{L 1} & I \\ ⋮ & ⋰ \\ D_{L 1} & ⋰ \\ I & ○ \end{matrix}] Ω_{o} B + [c o l {(D_{L (l - i)})}_{i = 0}^{l - 1}] N_{L 0}; Ω_{o} = c o l {(C A^{i})}_{i = 0}^{l - 1}

(38)

▪: State-Space from Right MFD: The controllable state-space representation of the system described by $\{H (λ) = N_{R} (λ) {[D_{R} (λ)]}^{- 1}\} \in R^{p \times m} (λ)$ is

$A = [\begin{matrix} ○ & I_{m} & ○ \\ ⋮ & ⋱ \\ ○ & ○ & I_{m} \\ - D_{R l} & \dots & - D_{R 1} \end{matrix}]; B = [\begin{matrix} ○ \\ ⋮ \\ ○ \\ I_{m} \end{matrix}]; C = [(N_{R l} - {N_{R 0} D}_{R l}), \dots, (N_{R 2} - D_{R 2} N_{R 0}), (N_{R 1} - N_{R 0} D_{R 1})]; D = N_{L 0}$

(39)
▪: Right MFD from State-Space: The matrix coefficients of the RMF can be obtained directly from the general state-space equations. Given the general state-space matrices {A, B, C, D} and let $N_{R 0} = D$ with $D_{R 0} = I_{m},$ we can then determine the matrix coefficients of

$[\begin{matrix} \begin{matrix} D_{R l} \\ ⋮ \end{matrix} \\ D_{R 1} \end{matrix}] = - Ω_{c}^{- 1} A^{l} B; [N_{R l} \dots N_{R 1}] = C Ω_{c} [\begin{matrix} D_{R (l - 1)} & \dots & D_{R 1} & I \\ ⋮ & ⋰ \\ D_{R 1} & ⋰ \\ I & ○ \end{matrix}] + N_{R 0} [r o w {(D_{R (l - i)})}_{i = 0}^{l - 1}]; Ω_{c} = r o w {(A^{i} B)}_{i = 0}^{l - 1}$

(40)
▪: Realization by General State-Space: For the general case, the Laplace transform of $E \dot{x} (t) = A x (t) + B u (t)$ and $y = C x (t) + B u (t)$ under the zero conditions results in the generalized function $H (λ) = C {(E λ - A)}^{- 1} B + D = C [a d j (E λ - A) \cdot ∆_{E}^{- 1} (λ)] B + D,$ with the characteristic equation of the form $∆_{E} (λ) = \det (E λ - A)$ . It can be shown that, in certain cases, the $H (λ)$ of linear singular systems cannot be derived. This depends entirely on the solvability of the system: only regular singular systems admit such a description. If the system is irregular and thus lacks a transfer function, it may still be represented in a generalized form, R(s)Y(s)=Q(s)U(s), where Y(s) and U(s) are the Laplace transforms of the output and input, respectively (Campbell, 1980 [18]). Since irregular systems may admit multiple or no solutions, the question of whether they are encountered in practice arises. To address this, Tsai in 1992 [39] proposed a constructive algorithm for deriving generalized state-space models from column-/row-pseudoproper or proper matrix fraction descriptions (MFDs). The resulting model, expressed in a special coordinate system, is guaranteed to be controllable-observable in the Rosenbrock–Cobb sense, with order equal to the determinantal degree of the MFD. For coprime MFDs, the algorithm ensures minimal realization, preserving both controllability and observability [15,21].

Definition 7

([5]). Consider a nonsingular polynomial matrix

D (λ) \in R^{m \times m} [λ]

, and let

k_{c i}

be the highest degree of the

i^{t h}

column of

D (λ)

and

k_{r i}

be the highest degree of the

i^{t h}

row of

D (λ)

. If

d e g d e t D (λ) = ⅀_{i = 1}^{m} k_{c i}

, then

D (λ)

is column-reduced. If

d e g d e t D (λ) = ⅀_{i = 1}^{m} k_{r i}

, then

D (λ)

is row-reduced.

Definition 8

([21,39]). Assume that the given right matrix fraction description (RMFD)

H (λ) = N (λ) D^{- 1} (λ)

is in column-based form. It is called column-pseudoproper if

k_{i} > r_{i}

,

i = 1, \dots, m

, where

k_{i}

and

r_{i}

are the

i^{t h}

generalized column degrees of

D (λ)

and

N (λ)

, respectively. This notion can be similarly extended to define a row-pseudoproper case.

The realization of a column-

p s e u d o p r o p e r

(or column-proper) rational transfer matrix in the right MFD is constructed as follows. Define

U (λ) = b l o c k d i a g \{[λ^{k_{c i}}]\}

,

V^{⟙} (λ) = b l o c k d i a g {[1, λ, . . ., λ^{k_{c i} - 1}]}

, and

i = 1,2, . . ., m

, where

⊤

denotes the transpose. Then,

D (λ) = D_{h c} U (λ) + D_{l c} V (λ) a n d N (λ) = C_{c} V (λ)

, with

D_{h c} \in R^{m \times m}

representing the highest-column-degree coefficient matrix,

D_{l c} \in R^{m \times n}

the lowest-column-degree coefficient matrix, and

C_{c} \in R^{p \times n}

a constant matrix. Next, we define the core realization:

\{\begin{matrix} E_{c} = I_{n} (a n n \times n d i m e n s i o n a l i d e n t i t y m a t r i x) \\ A_{c} = b l o c k d i a g {[\begin{matrix} \begin{matrix} 0 & 1 & \begin{matrix} 0 & 0 \end{matrix} \end{matrix} \\ \begin{matrix} 0 & 0 & \begin{matrix} ⋱ & ⋮ \end{matrix} \end{matrix} \\ \begin{matrix} \begin{matrix} ⋮ & ⋱ & \begin{matrix} ⋱ & 1 \end{matrix} \end{matrix} \\ \begin{matrix} 0 & \dots & \begin{matrix} 0 & 0 \end{matrix} \end{matrix} \end{matrix} \end{matrix}]}_{k_{c i} \times k_{c i}}, i = 1,2, . . ., m \\ W_{c} = b l o c k d i a g {[0,0, \dots, 1]}_{1 \times k_{c i}}, i = 1,2, . . ., m \\ H_{c} = P_{c} D_{h c} Q_{c} = [\begin{matrix} I_{q} \\ ○ \end{matrix}], q = rank (D_{h c}) \\ Z_{c} = {[○_{m}, ○_{m}, \dots, I_{m}]}_{m \times n} \end{matrix}

(41)

In the above core realization,

I_{q}

denotes the

q \times q

dimensional identity matrix. If the given right MFD is column-proper, then

H_{c} = I_{m}

,

P_{c} = D_{h c}^{- 1}

,

Q_{c} = I_{m}

and

B_{c} = W_{c}

. However, if the MFD is column-pseudoproper, then

B_{c} = Z_{c}^{⟙}

, while

P_{c}

and

Q_{c}

can be chosen arbitrarily based on

H_{c}

and

D_{h c}

. Using the definitions from the core realization above, a generalized realization for the system can then be obtained as follows:

\begin{matrix} E = & E_{c} + B_{c} H_{c} B_{c}^{⟙} - B_{c} B_{c}^{⟙} \\ A = & A_{c} G_{c} - B_{c} P_{c} D_{l c} G_{c} \\ B = & B_{c} P_{c} \\ C = & C_{c} G_{c} \end{matrix}

(42)

where

E \in R^{n \times n}

,

A \in R^{n \times n}

.

B \in R^{n \times m}

,

C \in R^{p \times n}

and

G_{c}

is defined as

G_{c} = I_{n - m} \oplus Q_{c}

.

Next, the realization of a row-

p s e u d o p r o p e r

(or row-proper) rational transfer matrix in the left MFD is constructed as follows: Let

V (λ) = b l o c k d i a g \{[1, λ, . . ., λ^{k_{r i} - 1}]\}

and

U (λ) = b l o c k d i a g \{[λ^{k_{r i}}]\}

,

i = 1, \dots, p

. Then,

(λ) = U (λ) D_{h r} + V (λ) D_{l r}

,

N (λ) = V (λ) B_{o}

, where

D_{h r} \in R^{p \times p}

is the highest-row-degree coefficient matrix,

D_{l r} \in R^{n \times p}

is the lowest-row-degree coefficient matrix, and

B_{o} \in R^{n \times m}

is a constant matrix.

Firstly, we define the core realization as follows:

\{\begin{matrix} E_{o} = I_{n} (an n \times n dimensional identity matrix) \\ A_{o} = blockdiag {[\begin{matrix} \begin{matrix} 0 & 0 & \begin{matrix} 0 & 0 \end{matrix} \end{matrix} \\ \begin{matrix} 1 & 0 & \begin{matrix} ⋱ & ⋮ \end{matrix} \end{matrix} \\ \begin{matrix} \begin{matrix} ⋮ & ⋱ & \begin{matrix} ⋱ & 0 \end{matrix} \end{matrix} \\ \begin{matrix} 0 & \dots & \begin{matrix} 1 & 0 \end{matrix} \end{matrix} \end{matrix} \end{matrix}]}_{k_{r i} \times k_{r i}}, i = 1,2, . . ., p \\ W_{o} = blockdiag {[0,0, \dots, 1]}_{1 \times k_{r i}}, i = 1,2, . . ., p \\ H_{o} = P_{o} D_{h r} Q_{o} = [\begin{matrix} I_{z} \\ ○ \end{matrix}], z = rank (D_{h r}) \\ Z_{o} = {[○_{p}, ○_{p}, \dots, I_{p}]}_{p \times n} \end{matrix}

(43)

In the above core realization,

I_{z}

denotes the

z \times z

identity matrix. If the given left MFD is row-proper, then

H_{o} = I_{p}

,

P_{o} = D_{h r}^{- 1}

,

Q_{o} = I_{p}

and

C_{o} = W_{o}

In contrast, if the MFD is row-pseudoproper, then

C_{o} = Z_{o}

,

P_{o}

and

Q_{o}

can be chosen arbitrarily based on

H_{o}

and

D_{h r}

. Using these definitions from the core realization, a generalized realization for the system can be obtained as follows:

\begin{matrix} E = & E_{o} + C_{o}^{⟙} H_{o} C_{o} - C_{o}^{⟙} C_{o} \\ A = & G_{o} A_{o} - G_{o} D_{l r} P_{o} C_{o} \\ B = & {G_{o} B}_{o} \\ C = & P_{o} C_{o} \end{matrix}

(44)

where

E \in R^{n \times n}

,

A \in R^{n \times n}

.

B \in R^{n \times m}

,

C \in R^{p \times n}

and

G_{o}

is defined as

G_{o} = I_{n - p} \oplus Q_{o}

.

8. The Proposed Control-Design Strategies

Building upon the algebraic theory of operator matrix polynomials, spectral divisors, and matrix fraction descriptions presented in the previous sections, we now focus on the development of a constructive feedback control strategy. The aim is to exploit the operator matrix polynomials framework to achieve block-pole placement in large-scale MIMO processes. Unlike conventional methods that rely on simple eigenstructure assignment, the proposed approach integrates realization theory and matrix polynomial techniques to ensure global block roots relocation. This section introduces the formulation of feedback control laws and establishes its mathematical foundations.

▪: Relocation of Block Poles via State Feedback: Pole placement through state feedback is a well-established method for closed-loop control design. For MIMO systems, the block-controllable canonical form offers a suitable framework for implementing block-pole placement. When the number of inputs divides the system order ( $n = l m$ ), the feedback gain matrix $K_{c}$ is chosen so that the closed-loop matrix $A_{d} = A_{c} - B_{c} K_{c}$ attains the prescribed right characteristic polynomial $D_{d} (λ)$ .

Theorem 26.

Let a linear time-invariant state model be given by

\dot{x} (t) = A x (t) + B u (t)

(or the discrete-time analog

x_{k + 1} = A x_{k} + B u_{k}

) with

x \in R^{n}

,

u \in R^{m}

. Assume

$n = l m$ for some integer $l$ .
The pair $(A, B)$ is block-controllable of index $l$ (i.e., $[r o w {(A^{i} B)}_{i = 0}^{l - 1}]$ is nonsingular).
A desired right characteristic polynomial $D_{d} (λ)$ is prescribed (from a set of block roots ${\{R_{i}\}}_{i = 1}^{l}$ ) and its block-companion matrix $A_{d} \in R^{n \times n}$ is formed in the controllable space.

Define the companion coordinates

A_{c} = T_{c} A T_{c}^{- 1}

,

B_{c} = T_{c} B;

and assume

B_{c}

has the standard companion form

B_{c} = {[○, \dots, ○, I_{m}]}^{⟙} \in R^{n \times m}

. Let

B_{c}^{⟙}

denote a left-inverse of

B_{c}

, satisfying

B_{c}^{⟙} B_{c} = I_{m}

. Let

Ω_{c} = [r o w {(A^{i} B)}_{i = 0}^{l - 1}] \in R^{n \times l m}

be the block-controllability matrix and define the companion transform

T_{c} = c o l {(B_{c}^{⟙} Ω_{c}^{- 1} A^{k})}_{k = 0}^{l - 1}

.

A state-feedback gain

K \in R^{m \times n}

that places the closed-loop block poles so that the matrix

A_{d} = T_{c} (A - B K) T_{c}^{- 1}

is given by the explicit, computable formula

K = \{B_{c}^{⟙} Ω_{c}^{- 1} . A^{l}\} - \{[R_{1}^{l} \dots R_{l}^{l}] . V_{R}^{- 1} . T_{c}\} with V_{R} = [row {(col {(R_{i}^{k})}_{k = 0}^{l - 1})}_{i = 1}^{l}]

(45)

Proof.

The closed-loop matrix is given by

A_{d} = T_{c} (A - B K) T_{c}^{- 1} ⟺ T_{c} A - T_{c} B K = A_{d} T_{c}

, that is,

T_{c} B K = T_{c} A - A_{d} T_{c} ⟺ B_{c} K = T_{c} A - A_{d} T_{c}

. From the knowledge of

B_{c}^{⟙} B_{c} = I_{m}

and

T_{c} A - A_{c} T_{c}

, we can write

B_{c}^{⟙} B_{c} K = B_{c}^{⟙} (A_{c} - A_{d}) T_{c} ⟺ K = B_{c}^{⟙} (A_{c} - A_{d}) T_{c}

or

K = B_{c}^{⟙} A_{c} T_{c} - B_{c}^{⟙} A_{d} T_{c} = B_{c}^{⟙} T_{c} A - B_{c}^{⟙} A_{d} T_{c}

, but

B_{c}^{⟙} A_{d} V_{R} = B_{c}^{⟙} V_{R} Λ_{R} = [R_{1}^{l} \dots R_{l}^{l}]

with

Λ_{R} = \oplus_{i = 1}^{l} R_{i}

,

V_{R} = [r o w {(c o l {(R_{i}^{k})}_{k = 0}^{l - 1})}_{i = 1}^{l}]

and

B_{c}^{⟙} = [○, \dots, ○, I_{m}]

; therefore, we obtain the following relation

B_{c}^{⟙} A_{d} = - [D_{d l} \dots D_{d 1}] = [r o w {(R_{i}^{l})}_{i = 1}^{l}] . V_{R}^{- 1}

. From the other side, we know that

B_{c}^{⟙} T_{c} = T_{l} = B_{c}^{⟙} Ω_{c}^{- 1} A^{l - 1} = B_{c}^{⟙} {[r o w {(A^{i} B)}_{i = 0}^{l - 1}]}^{- 1} A^{l - 1}

, so the feedback gain matrix is given by the formula

K = \{B_{c}^{⟙} {[r o w {(A^{i} B)}_{i = 0}^{l - 1}]}^{- 1} . A^{l}\} - \{[r o w {(R_{i}^{l})}_{i = 1}^{l}] . {[r o w {(c o l {(R_{i}^{k})}_{k = 0}^{l - 1})}_{i = 1}^{l}]}^{- 1} . T_{c}\}

with

T_{c} = c o l {(B_{c}^{⟙} {[r o w {(A^{i} B)}_{i = 0}^{l - 1}]}^{- 1} A^{k})}_{k = 0}^{l - 1}

. □

The above illustrations can be summarized into Algorithm 12.

Algorithm 12. Relocation of Block Poles via State Feedback

1.: Model setup: Start with the LTI system $\dot{x} (t) = A x (t) + B u (t)$ , $x \in R^{n}$ , $u \in R^{m}$ , where $n = l m$ .
2.: Compute the block-controllability matrix $Ω_{c} = [r o w {(A^{i} B)}_{k = 0}^{l - 1}]$
3.: Check controllability: Verify that $(A, B)$ is block-controllable of index $l$ , i.e., $Ω_{c}$ is nonsingular.
4.: Construct desired right characteristic polynomial $D_{d} (λ) = (λ I_{m} - R_{1}) \dots (λ I_{m} - R_{l})$ from ${\{R_{i}\}}_{i = 1}^{l}$ .
5.: Compute $T_{c}$ to obtain $A_{c} = T_{c} A T_{c}^{- 1}$ , $B_{c} = T_{c} B$ , where $B_{c} = {[○, \dots, ○, I_{m}]}^{⟙}$ .
6.: Form the block-companion matrix: Construct $A_{d}$ corresponding to the desired polynomial $D_{d} (λ)$ .
7.: Compute: $V_{R} = [r o w {(c o l {(R_{i}^{k})}_{k = 0}^{l - 1})}_{i = 1}^{l}]$ and $K = \{B_{c}^{⟙} Ω_{c}^{- 1} \cdot A^{l}\} - \{[R_{1}^{l} \dots R_{l}^{l}] . V_{R}^{- 1} . T_{c}\}$
8.: Closed-loop system: Apply $u (t) = - K x (t) + F r (t)$ . The closed-loop matrix becomes $A_{c l} = A - B K$ , whose block poles match the prescribed set ${\{R_{i}\}}_{i = 1}^{l}$ .

▪: Implementation and Checking: The formula is constructive: one first computes $Ω_{c}$ and inverts it to form $T_{c}$ , then constructs $A_{d}$ from the desired polynomial $D_{d} (λ)$ (or from $R_{i}$ ), and finally computes $K$ directly. Numerical stability requires careful formation of $Ω_{c}$ and $V_{R}$ and the use of numerically stable solvers for $V_{R}^{- 1}$ and $Ω_{c}^{- 1}$ . Under coprimeness or block-controllability assumptions, the resulting $K$ is the unique state feedback that produces the specified $A_{d}$ in the chosen companion coordinates. In terms of computational complexity, spectral factorization requires $O (n^{3} m)$ , block-companion transformations $O (n^{3})$ , and feedback gain extraction $O (n^{2} m)$ , while the memory footprint is $O (n^{2})$ , due to the storage of structured block Vandermonde matrices.

An alternative approach for constructing the linear state-feedback control law is developed using algebraic methods and subspace theory, serving as the counterpart to eigenstructure assignment in MIMO system design (see [40,41,42]). To complete the derivation, we consider control of the form

u (t) = K x (t) + r (t)

and focus on the case

D = ○

, with the development divided into open- and closed-loop parameterizations.

Theorem 27.

Consider the observable MIMO linear LTI system

\dot{x} (t) = A x (t) + B u (t)

and

y (t) = C x (t)

with matrix transfer function

H (λ) = C {(λ I - A)}^{- 1} B = C N_{R} (λ) \cdot D_{R}^{- 1} (λ),

where

N_{R} (λ) = ⅀_{i = 0}^{l} N_{i} λ^{l - i}

is a right numerator polynomial. Let the closed-loop dynamics be governed by

A_{c l s} = A + B K

or

H_{c l s} (λ) = C N_{R} (λ) \cdot D_{n e w}^{- 1} (λ)

. If the desired block roots

{\{R_{i}\}}_{i = 1}^{l}

, then the state-feedback gain matrix

K \in R^{m \times m}

is uniquely determined by

K = [r o w {\{D_{R} (R_{i})\}}_{i = 1}^{l}] . {[r o w {\{N_{R} (R_{i})\}}_{i = 1}^{l}]}^{- 1}

. Here, the coefficients of

N_{R} (R_{i})

and

D_{R} (R_{i})

are computed from

c o l {\{D_{R (l - i + 1)}\}}_{i = 1}^{l} = - Ω_{c}^{- 1} A^{l} B; [N_{R l} \dots N_{R 1}] = Ω_{c} M_{c}; Ω_{c} = [r o w {(A^{i} B)}_{i = 0}^{l - 1}]

and

M_{c}

is an upper-triangular block matrix formed by the coefficients of

D_{R} (λ)

.

Proof.

The first part: in open loop, we have

H (λ) = C {(λ I - A)}^{- 1} B = N_{R} (λ) \cdot D_{R}^{- 1} (λ)

. Assume that there exists a special factorization of the right numerator matrix polynomial

N_{R} (λ) = C N_{R} (λ) = ⅀_{i = 0}^{l} {C N}_{i} λ^{l - i}

, such that

H (λ) = C {[λ I - A]}^{- 1} B = C [⅀_{i = 0}^{l} N_{i} λ^{i}] D_{R}^{- 1} (λ)

, which is equivalent to

{(λ I - A)}^{- 1} B = (⅀_{i = 0}^{l} N_{i} λ^{i}) \cdot D_{R}^{- 1} (λ)

. More precisely,

(λ I - A) N_{R} (λ) = B D_{R} (λ) ⟺ λ N_{R} (λ) - A N_{R} (λ) = B D_{R} (λ)

The second part: in closed loop, we have

H_{c l s} (λ) = C {(λ I - A_{c l s})}^{- 1} B = N_{R} (λ) \cdot D_{n e w}^{- 1} (λ)

, where

A_{c l s} = A + B K

is the closed-loop matrix and

D_{n e w} (λ)

is the new denominator. By following the same proposition, we obtain

\begin{matrix} C {(λ I - A - B K)}^{- 1} B = C N_{R} (λ) . D_{n e w}^{- 1} (λ) ⟺ {(λ I - A - B K)}^{- 1} B = N_{R} (λ) . D_{n e w}^{- 1} (λ) \\ ⟺ λ N_{R} (λ) - A N_{R} (λ) - B K N_{R} (λ) = B D_{n e w} (λ) \end{matrix}

The set of desired block roots is given by

{\{R_{i}\}}_{i = 1}^{l}

, such that

D_{n e w} (R_{i}) = ○

. Now, the right evaluation of the solvent

R_{i}

in open and closed-loop equations gives

\{N_{R} (R_{i}) R_{i} - A N_{R} (R_{i}) = B D_{R} (R_{i}) a n d N_{R} (R_{i}) R_{i} - A N_{R} (R_{i}) - B K N_{R} (R_{i}) = B D_{n e w} (R_{i}) = ○

If we perform a subtraction, we obtain

K N_{R} (R_{i}) = D_{R} (R_{i})

. Now the feedback gain matrix can be written in terms of

N_{R} (R_{i})

and

D_{R} (R_{i})

by

K [N_{R} (R_{i}) \dots N_{R} (R_{l})] = [D_{R} (R_{1}) \dots D_{R} (R_{l})] ⟺ K = [r o w {\{D_{R} (R_{i})\}}_{i = 1}^{l}] . {[r o w {\{N_{R} (R_{i})\}}_{i = 1}^{l}]}^{- 1}

where the matrix coefficient of

N_{R} (R_{i})

and

D_{R} (R_{i})

are obtained using

[\begin{matrix} \begin{matrix} D_{R l} \\ ⋮ \end{matrix} \\ D_{R 1} \end{matrix}] = - Ω_{c}^{- 1} A^{l} B; [N_{R l} \dots N_{R 1}] = Ω_{c} M_{c} w i t h M_{c} = [\begin{matrix} D_{R (l - 1)} & \dots & D_{R 1} & I \\ ⋮ & ⋰ \\ D_{R 1} & ⋰ \\ I & ○ \end{matrix}]; Ω_{c} = [r o w {(A^{i} B)}_{i = 0}^{l - 1}] □

The above illustrations can be summarized into the following Algorithm 13.

Algorithm 13. Algebraic Construction of State Feedback via Subspace Methods

1.: System model: Consider the observable MIMO LTI system $\dot{x} (t) = A x (t) + B u (t)$ , $y (t) = C x (t)$ with transfer function $H (λ) = C {(λ I - A)}^{- 1} B = C N_{R} (λ) . D_{R}^{- 1} (λ)$ , where $N_{R} (λ) = ⅀_{i = 0}^{l} N_{i} λ^{l - i}$ .
2.: Check controllability/observability: Ensure that $(A, B)$ is controllable and $(C, A)$ is observable.
3.: Open-loop factorization: form the right matrix polynomial relation $λ N_{R} (λ) - A N_{R} (λ) = B D_{R} (λ)$ .
4.: Closed-loop formulation: For $u (t) = K x + r$ , write $λ N_{R} (λ) - A N_{R} (λ) - B K N_{R} (λ) = B D_{n e w} (λ)$ .
5.: Coefficient evaluation: Compute the coefficients of $N_{R} (R_{i})$ and $D_{R} (R_{i})$ from the following relation $c o l {(D_{R (l - i)})}_{i = 0}^{l - 1} = - Ω_{c}^{- 1} A^{l} B; [N_{R l} \dots N_{R 1}] = Ω_{c} M_{c}$ , where $Ω_{c} = [r o w {(A^{i} B)}_{i = 0}^{l - 1}]$ and $M_{c}$ is the upper-triangular block matrix constructed from the coefficients of $D_{R} (λ)$ .
6.: Desired block roots: Prescribe block roots ${\{R_{i}\}}_{i = 1}^{l}$ , such that $D_{n e w} (R_{i}) = ○$ .
7.: Gain computation: Evaluate at $R_{i}$ : $K N_{R} (λ) = D_{R} (λ)$ , $i = 1, \dots, l$ . Collecting all terms yields the explicit feedback law: $K = [r o w {\{D_{R} (R_{i})\}}_{i = 1}^{l}] . {[r o w {\{N_{R} (R_{i})\}}_{i = 1}^{l}]}^{- 1}$ .

▪: Relocation of Standard Structure via State Feedback: Consider a MIMO linear time-invariant (LTI) system with characteristic λ-matrix $A (λ) \in {C [λ]}^{m \times m}$ . The objective of this section is to design a state-feedback control law $u (t) = r (t) - K x (t)$ , by determining a gain matrix $K$ that relocates an admissible pair to a desired location.

Theorem 28.

Let the MIMO LTI system be

\dot{x} (t) = A x (t) + B u (t); y (t) = C x (t)

with characteristic λ-matrix

A (λ) = ⅀_{i = 0}^{l} A_{i} λ^{l - i} \in R^{m \times m} [λ]

, where

A_{0} = I_{m}

. Suppose

(X, J)

is an admissible pair of

A (λ)

with

\det (A_{0}) \neq 0

. Then,

A (X, J) = ⅀_{i = 0}^{l} A_{i} X J^{l - i} = 0

implies that

[A_{l} \dots A_{1}] = - X J^{l} {[c o l {(X J^{i - 1})}_{i = 1}^{l}]}^{- 1}

. Assume that the state-feedback control law is given by

u (t) = r_{c} (t) - K_{c} x_{c} (t)

and

K_{c} = [K_{c l} \dots K_{c 2} K_{c 1}] \in R^{m \times m l}

with

K_{c i} \in R^{m \times m}, i = 1, \dots, l

. Then, the explicit gain matrix

K = K_{c} T_{c}

is given constructively by

K = B_{c}^{⟙} . {[c o l {(A^{i - 1} B)}_{i = 1}^{l}]}^{- 1} . A^{l} - X J^{l} {[c o l {(X J^{i - 1})}_{i = 1}^{l}]}^{- 1} . T_{c}

(46)

Proof.

If we assume that

(X, J)

is an admissible pair of

A (λ)

with

\det (A_{0}) \neq 0

, then we have

A (X, J) = A_{0} X J^{l} + \dots + A_{l - 1} X J + A_{l} X = 0 ⟺ [A_{l} \dots A_{1}] c o l {(X J^{i - 1})}_{i = 1}^{l} = - A_{0} X J^{l}

so

[A_{l} \dots A_{1}] = - A_{0} X J^{l} {[c o l {(X J^{i - 1})}_{i = 1}^{l}]}^{- 1}; w i t h A_{0} = I_{m}

Let the state-feedback control law be

u (t) = r_{c} (t) - K_{c} x_{c} (t)

, where

r_{c} (t) \in R^{m}

is the reference input, and the feedback gain matrix is

K_{c} = [K_{c l} \dots K_{c 2} K_{c 1}] \in R^{m \times m l}

and

K_{c i} \in R^{m \times m} (i = 1, \dots, l)

; then, the explicit formula of

K = K_{c} T_{c}

becomes

K = B_{c}^{⟙} . {[c o l {(A^{i - 1} B)}_{i = 1}^{l}]}^{- 1} . A^{l} + [A_{l} \dots A_{1}] . T_{c} = B_{c}^{⟙} . {[c o l {(A^{i - 1} B)}_{i = 1}^{l}]}^{- 1} . A^{l} - X J^{l} {[c o l {(X J^{i - 1})}_{i = 1}^{l}]}^{- 1} . T_{c}

An alternative method for constructing the linear state-feedback control law based on an admissible pair is as follows. Let

A_{d} (λ) \in {C [λ]}^{m \times m}

denote the target matrix polynomial. Form the difference

A_{d} (λ) - A (λ) = K_{c l} λ^{l - 1} + \dots + K_{c 2} λ + K_{c 1}

and from the definition, the desired admissible pair satisfies

A_{d} (X, J) = ○

. Hence, using the admissible pair relation, we may write

- A (X, J) = K_{c l} X J^{l - 1} + \dots + K_{c 2} X J + K_{c 1} X

. Hence, the gain matrix can be expressed as

K_{c} = [K_{c l} \dots K_{c 2} K_{c 1}] = - A (X, J) . {[c o l {(X J^{i - 1})}_{i = 1}^{l}]}^{- 1}

. Finally, the state-feedback gain matrix is

K = [K_{l} \dots K_{2} K_{1}] = - A (X, J) . {[c o l {(X J^{i - 1})}_{i = 1}^{l}]}^{- 1} T_{c}

(47)

□

▪: Decoupling via Block Poles Assignment: The proposed procedure aims to decouple the MIMO dynamic system by placing spectral factors. First, the numerator matrix polynomial $N (λ)$ is factorized into a complete set of spectral factors using a standard algorithm. Then, block zeros are enforced by relocating them into the denominator through state feedback, thereby achieving decoupling. Consider the matrix function: $H (λ) = N_{R} (λ) D_{R}^{- 1} (λ) = [⅀_{i = 0}^{k} N_{i} λ^{k - i}] . {[⅀_{i = 0}^{l} D_{i} λ^{l - i}]}^{- 1}$ with $D_{l} = I_{m}$ is an identity matrix, $N_{i} \in R^{m \times m}$ , $(i = 0, 1, \dots, k)$ , $D_{i} \in R^{m \times m}$ , $(i = 0, 1, \dots, l)$ , $l > k$ . Assume that $N_{R} (λ)$ can be factorized into $k$ block zeros and $D_{R} (λ)$ into $l$ block roots, where $N_{R} (λ) = N_{0} (λ I_{m} - Z_{1}) \dots (λ I_{m} - Z_{k}); D_{R} (λ) = (λ I_{m} - Q_{1}) \dots (λ I_{m} - Q_{l})$ .

In addition, we know that

H (λ) = C {(λ I - A)}^{- 1} B .

Now, via the use of state feedback, the control law becomes state-dependent and can be rewritten as

u (t) = - K . x (t) + F . r (t)

. Hence, we obtain the following closed-loop system:

H_{c l o s e d} (λ) = C {(λ I - A + B K)}^{- 1} B F = N_{R} (λ) D_{R d}^{- 1} (λ) F

where

D_{R d} (λ) = (λ I_{m} - Q_{d 1}) \dots . (λ I_{m} - Q_{d l})

,

D_{R d}^{- 1} (λ) = {(λ I_{m} - Q_{d l})}^{- 1} \dots {(λ I_{m} - Q_{d 1})}^{- 1}

and

Q_{d i}

: are the desired spectral factors to be placed

H_{closed} (λ) = N_{0} (λ I_{m} - Z_{1}) \dots . (λ I_{m} - Z_{k}) {(λ I_{m} - Q_{d l})}^{- 1} \dots {(λ I_{m} - Q_{d 1})}^{- 1} F

Choose:

Q_{d l} = Z_{k}

,…,

Q_{d (l - k + 1)} = Z_{1}

,

Q_{d (l - k)} = N_{0}^{- 1} J_{l - k} N_{0}

,…,

Q_{d 1} = N_{0}^{- 1} J_{1} N_{0}

with

J_{i} = d i a g (λ_{i 1} \dots λ_{i m})

and

F = N_{0}^{- 1}

. Now, by assigning those prescribed block roots, the system is decoupled and the closed-loop matrix transfer function becomes

H_{closed} (λ) = {(λ I_{m} - J_{1})}^{- 1} \dots {(λ I_{m} - J_{l - k})}^{- 1} = d i a g (1 / Δ_{1} \dots 1 / Δ_{m})

, where

Δ_{i} = (λ + λ_{1 i}) \dots (λ + λ_{(l - k) i})

,

i = 1, . . ., m

. Let us summarize this in the next algorithmic version (Algorithm 14) to be more understandable and efficient for use in the linear MIMO control system.

Algorithm 14. Decoupling via Spectral Factors Assignment

1 Assume that all system states are available and measurable.
2 Verify the block controllability and observability of the given square dynamic system.
3 Construct the matrix polynomials

N_{R} (λ)

and

D_{R} (λ)

.
4 Factorize the

D_{R} (λ)

into a complete set of block spectral factors.
5 Assign

k

spectral factors of

N_{R} (λ)

as block roots of

D_{R} (λ)

, and set the remaining in diagonal form.
6 Construct the desired matrix polynomial form those obtained block spectral data.
7 Design the state-feedback gain matrix in controller form and then transform it to the original base.
Here, we are ready to design SISO tracking regulators for each input–output pairs, (i.e., the system is decoupled).

Alternatively, we can assign

k

block roots of

N_{R} (λ)

as block roots of

D_{R} (λ)

. Now, let

\{Z_{1}, \dots, Z_{k}\} \in R^{m \times m}

be the block zeros of

N_{R} (λ)

; then, the desired block poles are given directly by

R_{i} = Z_{i}

, for

i = 1, \dots, k

and

R_{j} = ○_{m}

for

j = k + 1, \dots, l

. Knowing that

R_{i}

and

R_{j}

are a block roots to

D_{R d} (λ)

means that

[D_{d l}, \cdot \cdot \cdot, D_{d 1}] V_{Z} = - [Z_{1}^{l}, \cdot \cdot \cdot, Z_{k}^{l}, ○_{m \times m (l - k)}]; w i t h V_{Z} = [\begin{matrix} I_{m} & \dots & I_{m} & I_{m} & \dots & I_{m} \\ Z_{1} & \dots & Z_{k} & ○_{m} & \dots & ○_{m} \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ Z_{1}^{l - 1} & \dots & Z_{k}^{l - 1} & ○_{m} & \dots & ○_{m} \end{matrix}]

Some algebraic manipulations give

K = - [[D_{l}, \cdot \cdot \cdot, D_{1}] V_{Z} + [Z_{1}^{l}, \cdot \cdot \cdot, Z_{k}^{l}, ○_{m \times m (l - k)}]] . {(T_{c}^{- 1} . V_{Z})}^{- 1}

(48)

These illustrations can be summarized into an algorithmic steps (Algorithm 15).

Algorithm 15. Decoupling via Block Poles Assignment

1 Assume that all states are available and measurable.
2 Check the block observability/controllability of the given square dynamic system.
3 Construct the right numerator

N_{R} (λ)

and right denominator

D_{R} (λ)

matrix polynomials.
4 Compute a complete set of block roots for the numerator matrix polynomial

N_{R} (λ)

.
5 Assign the k solvents of

N_{R} (λ)

as block roots to

D_{R} (λ)

and force the rest ones to zeros.
6 Design the state-feedback gain matrix to assign the complete set of block structures.
Here, we are ready to design SISO tracking regulators for each input–output pairs, (i.e., the system is decoupled).

9. Applications in Control System Engineering

9.1. Aeroelasticity in Flight Dynamics

Aeroelasticity, particularly flutter, has shaped aircraft development since the earliest days of flight. In modern high-speed aircraft, aeroelastic effects strongly influence structural and aerodynamic design due to the combined action of aerodynamic, inertial, and elastic forces. During maneuvers, lifting surfaces may experience flutter—self-excited oscillations that extract energy from the airflow, causing large, often destructive vibrations [31]. Suppressing flutter is therefore essential to prevent excessive deformation and potential structural failure of wings. In modern aviation, flight control system dynamics are included in the analysis, since closed-loop interactions can couple with aeroelastic effects. This integrated study, known as aeroservoelasticity, aims to analyze control systems under aeroelastic interactions. Accurate multivariable state-space models are thus essential to enable control law synthesis using advanced methods such as block-pole placement and compensator design [30]. Consider the typical section illustrated in Figure 1, where the wing is mounted on a flexible support consisting of a translational spring of stiffness

K_{h}

and a torsional spring of stiffness

K_{α}

, both attached at the airfoil’s shear center. The system thus exhibits two degrees of freedom, with

h

denoting the plunge displacement and

α

is the pitch angle and

β

is the control-surface flap deflection.

The governing equations of motion for the structure of the nonlinear aeroelastic system can be written as follows [31]:

M \ddot{q} + C_{s} \dot{q} + K_{s} q = Q_{a} (q, \dot{q}, z) + B_{u} u (t)

, where

M = [\begin{matrix} m & m x_{α} \\ m x_{α} & I_{α} \end{matrix}]; C_{s} = [\begin{matrix} c_{h} & 0 \\ 0 & c_{α} \end{matrix}]; K_{s} = [\begin{matrix} K_{h} & 0 \\ 0 & K_{α} \end{matrix}]; Q_{a} (t) = A_{\infty} q (t) + A_{1} \dot{q} (t) + A_{2} z (t)

(49)

and

B_{u} \in R^{2 \times m_{u}}

maps actuator commands generalized forces/torques.

Q_{a} (t)

models the aerodynamic generalized force vector with one aerodynamic lag state vector

z (t) \in R^{2}

. The lag dynamics (Roger form) are

\dot{z} (t) = (q (t) - z (t)) / τ

. Combining and isolating accelerations:

\ddot{q} (t) = M^{- 1} [(A_{\infty} - K_{s}) q (t) + (A_{1} - C_{s}) \dot{q} (t) + A_{2} z (t) + B_{u} u (t)]

(50)

If we choose

x (t) = {[h, α, \dot{h}, \dot{α}, z_{h}, z_{α}]}^{⟙} \in R^{6}

, we obtain a first-order state equation,

\dot{x} (t) = A x (t) + B u (t)

, that is, as follows:

\frac{d}{d t} [\begin{matrix} h \\ \begin{matrix} \begin{matrix} α \\ \dot{h} \\ \dot{α} \end{matrix} \\ z_{h} \end{matrix} \\ z_{α} \end{matrix}] = [\begin{matrix} ○_{2} & I_{2} & ○_{2} \\ M^{- 1} K_{e f f} & - M^{- 1} C_{e f f} & M^{- 1} A_{2} \\ \frac{1}{τ} I_{2} & ○_{2} & - \frac{1}{τ} I_{2} \end{matrix}] [\begin{matrix} h \\ \begin{matrix} \begin{matrix} α \\ \dot{h} \\ \dot{α} \end{matrix} \\ z_{h} \end{matrix} \\ z_{α} \end{matrix}] + [\begin{matrix} ○_{2 \times m_{u}} \\ M^{- 1} B_{u} \\ ○_{2 \times m_{u}} \end{matrix}] . [\begin{matrix} u_{1} (t) \\ u_{2} (t) \end{matrix}]

(51)

where

K_{e f f} = K_{s} - A_{\infty}

,

C_{e f f} = C_{s} - A_{1}

, and the outputs may be any measured signals

y (t) = C x (t) + D u (t)

. In the example below, we choose

y (t) = {[h, \dot{α}]}^{⟙} \in R^{2}

and

u (t) = {[u_{1} (t), u_{2} (t)]}^{⟙} \in R^{2}

, which are two actuator commands (radians). The terms

u_{1} (t), u_{2} (t)

represent two control-surface deflections whose effect on generalized forces/torques is given by

B_{u}

, which converts

r a d \to N

(plunge) and

N . m

(pitch) [30].

Parameters used (illustrative)

$m = 20 [k g]$ , $x_{α} = 0.2 m$ , $I_{α} = 2 [k g \cdot m^{2}]$ , $K_{h} = 10^{4} [N / m]$ , $K_{α} = 500 [N \cdot m / r a d]$ .
$c_{h} = 100 [N \cdot s / m]$ , $c_{α} = 5 [N \cdot m \cdot s / r a d]$ .
Aerodynamic fit (Roger, 1 lag) and the control-effectiveness (two actuators):

$A_{\infty} = [\begin{matrix} - 5000 & - 200 \\ - 300 & - 800 \end{matrix}], A_{1} = [\begin{matrix} - 200 & - 10 \\ - 10 & - 50 \end{matrix}], A_{2} = [\begin{matrix} 400 & 20 \\ 20 & 100 \end{matrix}], τ = 0.05 s, and B_{u} = [\begin{matrix} 0 & 0 \\ 10 & 5 \end{matrix}] .$

The numeric results below were computed exactly from those parameter values.

A = [\begin{matrix} 0.000 \\ 0.000 \\ \begin{matrix} - 1200 \\ \begin{matrix} 2250 \\ 20.00 \\ 0.000 \end{matrix} \end{matrix} \end{matrix} \begin{matrix} 0.0000 \\ 0.0000 \\ \begin{matrix} 200.00 \\ \begin{matrix} - 1050.0 \\ 0.0000 \\ 20.000 \end{matrix} \end{matrix} \end{matrix} \begin{matrix} 1.0000000 \\ 0.0000000 \\ \begin{matrix} - 23.333333 \\ \begin{matrix} 41.666667 \\ 0.0000000 \\ 0.0000000 \end{matrix} \end{matrix} \end{matrix} \begin{matrix} 0.0000000 \\ 1.0000000 \\ \begin{matrix} 8.3333333 \\ \begin{matrix} - 44.166667 \\ 0.0000000 \\ 0.0000000 \end{matrix} \end{matrix} \end{matrix} \begin{matrix} 0.0000 \\ 0.0000 \\ \begin{matrix} 30.000 \\ \begin{matrix} - 50.000 \\ - 20.000 \\ 0.0000 \end{matrix} \end{matrix} \end{matrix} \begin{matrix} 0.0000 \\ 0.0000 \\ \begin{matrix} - 15.000 \\ \begin{matrix} 80.000 \\ 0.0000 \\ - 20.000 \end{matrix} \end{matrix} \end{matrix}]; B = [\begin{matrix} 0.0000 \\ 0.0000 \\ \begin{matrix} - 1.6666 \\ \begin{matrix} 8.3333 \\ 00.000 \\ 0.0000 \end{matrix} \end{matrix} \end{matrix} \begin{matrix} 0.0000 \\ 0.0000 \\ \begin{matrix} - 0.5444 \\ \begin{matrix} 4.1666 \\ 00.000 \\ 0.0000 \end{matrix} \end{matrix} \end{matrix}]

A disturbance model is introduced to capture un-modeled dynamics, nonlinear flexibility, and hard nonlinearities. The disturbance vector includes filtered noise for broadband uncertainties, harmonic terms for periodic effects, and impulsive loads for sudden nonlinear events. These signals enter through a disturbance input matrix, providing a realistic framework for robustness and controller evaluation.

\dot{x} (t) = (A + Δ A) x (t) + B u (t) + B_{d} d (t) = A x (t) + B u (t) + f_{N} (t); w i t h f_{N} (t) = Δ A x (t) + B_{d} d (t)

(52)

where

d (t) = d_{c} (t) + d_{h a r m} (t) + d_{i m p} (t)

is the disturbance term, with components:

d_{c} (t) = η (t)

, such that

\dot{η} (t) = (ω (t) - η (t)) / τ_{d}

,

d_{h a r m} (t) = ⅀_{j} A_{j} \sin (ω_{j} t + ϕ_{j})

,

d_{i m p} (t) = [A_{i m p} e^{- (t - t_{0}) / τ_{i m p}}] . 1_{t \geq t_{0}}

and

Δ A = ⅀_{i} δ_{i} (t) A_{i}

with

|δ_{i} (t)| < δ_{m a x}

and each

δ_{i} (t)

represents an external disturbance channel. Define disturbance input matrix

B_{d}

mapping three disturbance channels

B_{d} = [○_{2 \times 4}; 1 0 0 1; ○_{2 \times 4}; 0 1 0 0]

.

The parameters of the disturbance model are given by

τ_{d} = 0.2 [s]

,

A_{1} = 0.01 [m]

,

f_{1} = 5 [H z]

(

ω_{1} = 2 π 5

),

t_{0} = 1.0 [s]

,

A_{i m p} = 200 [N]

applied to plunge for

τ_{i m p} = 0.05 [s]

(exponential decay). We use oscillator states

s \in R^{2}

and a filter state

η \in R^{2}

.

To enhance robustness against un-modeled dynamics and nonlinear disturbances, the block-pole state-feedback law is augmented with a neural-network compensator. The nominal gain

K_{0}

is obtained via block-pole placement to ensure desired closed-loop poles, while the neural network generates an adaptive correction

∆ K (x, t)

. The combined control law

u (t) = (K_{0} + ∆ K (x, t, θ)) x (t)

preserves the nominal stability structure and provides adaptive capability to reject uncertainties and nonlinear effects [23,32].

Compute a nominal state feedback

K_{0}

using the block-pole placement algorithm so that the linearized model has the desired characteristic matrix polynomial. Next, augment this nominal law with a small, state-dependent correction

∆ K (x, t, θ)

produced by a neural network. Finally, train/adapt

θ

offline (robustification) and/or online with an adaptation law (Lyapunov-/MRAC-style) to guarantee stability/performance and limit how much

∆ K

can move the poles [23]. The combined control law is written in the structured additive form

u (t) = K_{0} . x (t) + {N N}_{θ} (x (t))

, where

{N N}_{θ} (x) \in R^{m}

is the neural-network output or, equivalently, when a state-dependent gain correction is required,

u (t) = [K_{0} + ∆ K_{θ} (x)] . x (t)

, where

∆ K_{θ} (x) = r e s h a p e (v_{θ} (x), m, n)

. We parameterize the NN as a single hidden layer with parameters

θ = \{V, b, W\}

:

$K_{0} \in R^{m \times n}$ is nominal block-pole gain,
$ϕ (x) = (V^{⊤} x + b) \in R^{q}$ is a chosen feature vector (e.g., $[x; |x|; s a t (x)]$ ),
$h (x) = σ (ϕ (x)) \in R^{r}$ , $v_{θ} (x) = W^{⊤} h (x) = W^{⊤} σ (V^{⊤} x + b)$ ,
$σ (\cdot)$ is elementwise nonlinear basis (sigmoid/ReLU/Gaussian),
$V \in R^{n \times r}$ , $b \in R^{r}$ , $W \in R^{r \times (m n)}$ .

We used bounded activations (e.g.,

t a n h

) or an explicit final scaling to enforce

‖∆ K_{θ} (x)‖ \leq ε

, which limits perturbation of closed-loop poles. This yields linear + NN additive action; it is easier to analyze and to bound influence on closed-loop poles. Algorithm 16 illustrates in detail how to implement this neural network.

Algorithm 16. Practical implementation recipe (pseudo-algorithm)

1.

Compute nominal

K_{0}

with block-pole algorithm (offline). % Nominal stability is guaranteed before

2.

NN architecture: Use a single hidden layer with

p

neurons. Output dimension =

m \times n

(for direct increment

∆ K

). % The network is designed to respect stability margins by construction.

3.

Training: simulate many uncertainty scenarios and train NN to minimize

L = ⅀_{t} {‖u^{⋆} (t) - K_{0} x (t) - {N N}_{θ} (x (t))‖}^{2} + λ_{p e n a l t y} (e i g (A - B (K_{0} + ∆ K)))

.

% The penalty term explicitly enforces stability by penalizing deviations of the closed-loop eigenvalues from the desired region.

4.

Online control loop at each step:

○: measure state $x (t)$ and compute $u (t) = K_{0} . x (t) + {N N}_{θ} (x (t))$ (or $u (t) = [K_{0} + ∆ K_{θ} (x)] . x (t)$ ),
○: apply the control $u (t)$ , then compute error signal $e_{x}$ (e.g., $e_{x} = x - x_{r e f}$ or measured errors),
○: update $θ$ by adaptive law with projection: $\dot{θ} = - Γ J_{θ}^{⊤} (x (t)) q (t)$ , with the compact Jacobian $J_{θ} = [\partial v_{θ} (x) / \partial V, \partial v_{θ} (x) / \partial b, \partial v_{θ} (x) / \partial W]$ and $q (t) = x (t) \otimes B^{⊤} P e_{x}$ .

5.

Enforce bounds: clip

{N N}_{θ} (x (t))

or project

θ

into a convex compact set to ensure poles remain inside target region, preserving stability under uncertainties.

9.2. Comparative Study

Let us construct the desired block poles with a target latent values and vectors:

\{\begin{matrix} λ_{1} = - 10, λ_{2} = - 20 \\ v_{12} = [\begin{matrix} 0.1102 \\ 0.1376 \end{matrix} \begin{matrix} 0.4248 \\ 0.2211 \end{matrix}] \end{matrix}; \begin{matrix} λ_{3} = - 12, λ_{4} = - 15 \\ v_{34} = [\begin{matrix} 0.093198 \\ 0.388624 \end{matrix} \begin{matrix} 0.444068 \\ 0.139655 \end{matrix}] \end{matrix}; \begin{matrix} λ_{5} = - 25, λ_{6} = - 30 \\ v_{56} = [\begin{matrix} 0.4683 \\ 0.4543 \end{matrix} \begin{matrix} 0.1009 \\ 0.2231 \end{matrix}] \end{matrix}\}

To convert into eigenstructure, we use

x_{c k} = c o l {(λ_{k}^{i} v_{k})}_{i = 0}^{l - 1}

, where

v_{k}

are latent vectors, and

x_{c k}

are eigenvectors of

A_{(c l s) c} = T_{c} A_{c l s} T_{c}^{- 1}

, where

A_{c l s} = A - B K_{0}

.

The corresponding block roots to be assigned are given by

R_{1} = [\begin{matrix} - 27.1500 \\ - 8.92740 \end{matrix} \begin{matrix} 13.7355 \\ - 2.8500 \end{matrix}]; R_{2} = [\begin{matrix} - 15.2447 \\ - 1.02040 \end{matrix} \begin{matrix} 0.77810 \\ - 11.7553 \end{matrix}]; R_{3} = [\begin{matrix} - 21.0945 \\ 8.63860 \end{matrix} \begin{matrix} - 4.02610 \\ - 33.9055 \end{matrix}]

Note 1.

The optimal selection of these roots can be achieved by using optimization algorithms such as particle swarm optimization (PSO) method or others, which are a sensitive point as they increase the strength and effectiveness of the proposed method.

The coefficients of the desired closed-loop matrix polynomial that need to be assigned are given by the following expression

[D_{c l s}] = - [R_{1}^{3} R_{2}^{3} R_{3}^{3}] \cdot V_{R}^{- 1}

, that is, as follows:

D_{c l s} (λ) = [\begin{matrix} 1 \\ 0 \end{matrix} \begin{matrix} 0 \\ 1 \end{matrix}] λ^{3} + [\begin{matrix} 51.2048 \\ - 12.2468 \end{matrix} \begin{matrix} - 6.27970 \\ 60.7952 \end{matrix}] λ^{2} + [\begin{matrix} 960.04 \\ - 361.30 \end{matrix} \begin{matrix} - 357.660 \\ 1078.86 \end{matrix}] λ + [\begin{matrix} 6093.5 \\ - 2685.4 \end{matrix} \begin{matrix} - 3440.1 \\ 5947.0 \end{matrix}]

Since the block-controllability matrix

Ω_{c} = [B, A B, A^{2} B]

is full-rank, our model is transformable to the control space via the similarity operator

T_{c} = [T_{c 1}; T_{c 1} A; T_{c 1} A^{2}]

with

T_{c 1} = [○ ○ I] Ω_{c}^{- 1}

. The nominal control is given by

u_{n o m} (t) = - K_{0} u (t) + F_{0} r (t)

so that the closed-loop system is given by the equation

\dot{x} (t) = A_{c l s} x (t) + B_{c l s} x (t)

, where

A_{c l s} = A - B K_{0}

,

B_{c l s} = B F_{0} = {(C {(B K_{0} - A)}^{- 1})}^{⋕}

, with a state-feedback gain matrix,

K_{0} = \{B_{c}^{⟙} Ω_{c}^{- 1} \cdot A^{3}\} - \{[R_{1}^{3} R_{2}^{3} R_{3}^{3}] \cdot V_{R}^{- 1} \cdot T_{c}\}

, and with the following Vandermond matrix,

V_{R} = [I_{2} I_{2} I_{2}; R_{1} R_{2} R_{3}; R_{1}^{2} R_{2}^{2} R_{3}^{2}]

. The nominal state-feedback gain matrix is

K_{0} = [\begin{matrix} 183.3800 \\ - 1484.545 \end{matrix} \begin{matrix} - 345.325 \\ 173.770 \end{matrix} \begin{matrix} - 44.7820 \\ 110.486 \end{matrix} \begin{matrix} - 15.839 \\ 29.282 \end{matrix} \begin{matrix} 221.407 \\ 219.261 \end{matrix} \begin{matrix} 56.521 \\ 31.316 \end{matrix}]

The DC gain matrix

F_{0}

can be obtained by minimizing

{‖B F_{0} - {(C {(B K_{0} - A)}^{- 1})}^{⋕}‖}_{F}

.

The proposed control law ensures stabilization of the state-space model. It requires only measurable outputs and the system’s nominal model. Its Laplace-domain analysis and practical implementation for the aeroelastic airfoil system are illustrated through the schematic block diagram shown in Figure 2.

In the open-loop system, we verified two case studies by employing the chirp and sine signals. Responses of the system are calculated by fourth-order Runge–Kutta algorithm with MATLAB R2023a on a Windows 10 (64-bit) platform with an Intel Core i7-1165G7 CPU (2.80 GHz) and 16 GB RAM. The open-loop results are shown in Figure 3.

Figure 3a,b shows the output response of the aeroelastic system with cubic-spring nonlinearity under a chirp excitation signal, while Figure 3c,d depicts the system’s response to a sinusoidal excitation applied to both actuators. In the absence of control, the system exhibits periodic oscillations of equal amplitude, known as limit cycle oscillations (LCOs), in both plunge and pitch. The dominant response occurs in the pitch direction, driven by stiffness nonlinearity. With the proposed controller, these oscillations gradually decay, demonstrating effective suppression of the LCO. Figure 4 illustrates this behavior: (a) the pitch-direction phase diagram without control shows a sustained LCO, while (b) with the proposed controller, the trajectory converges, confirming oscillation suppression.

Simulations were conducted using the dynamic model of the aeroelastic wing implemented in MATLAB R2023a. Intelligent neural block-pole placement controllers were deployed to mitigate the vibrational motions. Disturbances were introduced into the system through the input/output channel, and the controller performance was evaluated accordingly. For the given wing, under a prescribed flight condition (2° angle of attack and 0° pitch angle), the required closed-loop performance specifications were defined as follows: settling time less than 1.10 s, peak overshoot below 6.2%, and steady-state error equal to zero. A disturbance of 5° was applied at 2 s. The control constraints were set to remain below 15° for actuator-1 deflection and 10° for actuator-2 deflection. The closed-loop control system was expected to exhibit strong disturbance-rejection capability. Figure 5 illustrates the response of the aeroelastic system under these conditions.

To highlight the advantages of the proposed adaptive block-pole placement strategy, we compare it with well-known classical techniques such as eigenstructure assignment, LQR, and

H_{2}

-control. The comparison is carried out in terms of robustness, decoupling capability, sensitivity to noise/disturbances, transient response, and steady-state performance.

▪: Performance Metrics: The following indices are used.

$σ_{m} (H_{c l s})$ , $σ_{M} (H_{c l s})$ : smallest/largest singular values of the closed-loop transfer function (robustness and conditioning).

$σ_{m} (H_{c l s}) = \underset{ω \in [0, \infty [}{s u p} [\sqrt{|λ_{m i n} (Γ_{H_{c l s}} (j ω))|}]; σ_{M} (H_{c l s}) = \underset{ω \in [0, \infty [}{s u p} [\sqrt{|λ_{m a x} (Γ_{H_{c l s}} (j ω))|}]$

where is the $Γ_{H_{c l s}} (j ω)$ Gram matrix: $Γ_{H_{c l s}} (j ω) = H_{c l s}^{H} (j ω) H_{c l s} (j ω)$ .
$χ (H_{c l s})$ : condition number of the closed-loop system (robust stability).

$χ (H_{c l s} (j ω)) = σ_{M} (H_{c l s}) / σ_{m} (H_{c l s})$

Here is a MATLAB code (Table 3) for the computation of measures

σ_{m}

and

σ_{M}

.

$ϱ_{1} = {‖H_{cls}‖}_{2}$ : the $H_{2}$ norm of the transfer function measures the average energy amplification from input disturbances to outputs in the closed-loop system

${‖H_{cls} (t)‖}_{2} = \sqrt \{|\frac{1}{2 π} \int_{- \infty}^{+ \infty} {‖H_{cls} (j ω)‖}_{F}^{2} d ω|\} = \sqrt{|t r a c e [C P C^{⊤}]|} = \sqrt{|t r a c e [B_{cls}^{⊤} Q B_{cls}]|}$

The matrices

P

and

Q

are the solution of Lyapunov equations

A_{c l s}^{⊤} Q + Q A_{c l s} = - C^{⊤} C

and

P A_{c l s}^{⊤} + A_{c l s} P = - B_{c l s} B_{c l s}^{⊤}

, or more explicitly, they are given by the formulas

P = \int_{0}^{\infty} e^{(A - B K) t} B_{c l s} B_{c l s}^{⊤} e^{{(A - B K)}^{⊤} t} d t

and

Q = \int_{0}^{\infty} e^{{(A - B K)}^{⊤} t} C^{⊤} C e^{(A - B K) t} d t

. Table 4 gives a code for the computation of such measures.

${‖S (j ω)‖}_{\infty}$ : peak sensitivity norm (disturbance rejection).

${‖S (j ω)‖}_{\infty} = \underset{ω \in [0, \infty [}{s u p} [\sqrt{|λ_{m a x} (S_{c l s}^{H} (j ω) S (j ω))|}]$

where $S (s) = {[I + {(s I - A_{c l s})}^{- 1} B_{c l s} K]}^{- 1}$ . Table 5 gives a code for the computation of such measures.
$μ (H_{c l s}, Δ H)$ : structured singular value (robustness against uncertainty). In robust control analysis, we evaluate the structured singular value frequency-wise: $μ (H_{c l s}, Δ H) = 1 / m i n \{‖Δ‖ : Δ \in Δ H, d e t (I - H_{c l s} (j ω) Δ) = 0\}$ for each ω, then the system is robustly stable against all uncertainties $Δ$ with $‖Δ‖ \leq 1$ .

Table 6 provides the comparative study of the above robustness metrics.

The proposed adaptive block-pole placement method balances robustness and efficiency by keeping the maximum singular value low

(σ_{M} = 1.81)

and the condition number minimal

(χ = 11.19)

, yielding a well-conditioned closed loop. Although its minimum singular value

(σ_{m} = 0.162)

is lower than those of LQR and

H_{2}

-control, this trade-off leads to the smallest

H_{2}

norm

({‖H_{cls}‖}_{2} = 4.64)

and thus, minimal energy amplification. Moreover, it requires the lowest control gains

({‖K‖}_{2} = 1528.1, {‖K‖}_{\infty} = 2048.7)

, reducing actuator effort. The sensitivity peak

({‖S (j ω)‖}_{\infty} = 168.31)

and structured singular value

(μ = 0.41)

confirm superior disturbance rejection and robustness against structured uncertainty. The method further maintains stable performance under parameter variations up to 10% and disturbance noise levels of 12%, while achieving near-complete decoupling (<5% cross-interaction). In contrast, eigenstructure assignment exhibits poor conditioning

(χ = 28.44)

and high effort

({‖K‖}_{\infty} = 3797.4)

, while LQR and

H_{2}

-control improve robustness relative to eigenstructure, but remain more sensitive to modeling errors and disturbances due to their reliance on fixed quadratic cost functions and stochastic assumptions.

To further illustrate the benefits of the proposed method, we compare transient response and steady-state regulation under reference tracking and disturbance-rejection scenarios. The following performance indices are considered.

Rise Time ( $t_{r}$ ): time to reach 90% of the final value.
Settling Time ( $t_{s}$ ): time to remain within ±2% of final value.
Overshoot (OS): maximum deviation beyond the steady-state value (%).
Steady-State Error (SSE): final tracking error (% of reference).
Disturbance Rejection (DR): percentage attenuation of a step disturbance.

Table 7 provides a comparative study of the transient and steady-state metrics.

The proposed method achieves the fastest rise and settling times while maintaining overshoot below 7%. The steady-state error is nearly eliminated (<1%), clearly outperforming LQR and

H_{2}

-control, which depend on cost-function tuning. Disturbance rejection is also significantly enhanced, with more than 90% attenuation compared to 62–81% for classical methods. Overall, the results confirm that the adaptive block-pole approach not only guarantees robustness but also delivers superior transient performance and regulation accuracy.

In addition to robustness and transient metrics, further insight can be gained through classical and integral performance indices. These include gain and phase margins (stability robustness), closed-loop bandwidth (tracking capability), integral error measures such as IAE and ISE (overall regulation quality), and peak control effort (actuator demand). These indices provide a complementary evaluation of the controllers under practical operating conditions.

Gain Margin (GM) and Phase Margin (PM): classical robustness margins.
Bandwidth ( $ω_{B W}$ ): The range over which the closed-loop tracks reference well.
${‖T‖}_{\infty}$ (with $T (ω) = I - S (ω)$ ): robustness to measurement noise.
$H_{2}$ norm ( ${‖T‖}_{2}$ ): energy gain from disturbance to output (performance).
Integral of Absolute Error (IAE): $\int |e (t)| d t$ .
Integral of Square Error (ISE): $\int e^{2} (t) d t$ .
Integral of Time-weighted Absolute Error (ITAE): penalizes long errors.
Peak Control Effort ( ${‖u‖}_{\infty}$ ): actuator demand (saturation risk).

Table 8 provides the comparative study of some additional performance indices.

The results demonstrate that the proposed method provides the largest gain and phase margins, ensuring superior robustness to model uncertainty. Its higher bandwidth enables faster response without sacrificing stability. Integral error (IAE) is minimized, confirming excellent tracking and regulation, while the required control effort remains the lowest among all methods, reducing actuator stress. Finally, both the

H_{2}

norm

{‖T‖}_{2}

) and the

H_{\infty}

norm (

{‖T‖}_{\infty}

) are significantly reduced, indicating improved overall performance and stronger noise attenuation. In contrast, eigenstructure assignment suffers from poor robustness and higher effort, while LQR and

H_{2}

-control improve robustness but remain less efficient than the proposed approach.

Final note: Noise can perturb contour integration and spectral factor computation by distorting the resolvent and generating spurious factors. To mitigate these effects, we applied regularized resolvent evaluation, low-order noise filtering, and companion-matrix preconditioning, which stabilize the computation without altering the underlying method. For numerical experiments, the grid and solver configurations are specified as follows: RK4 integration with a step size of 1 × 10⁻⁴, an adaptive tolerance of 1 × 10⁻⁷, and Tikhonov regularization of 1 × 10⁻⁶ applied in matrix inversion to ensure numerical stability.

10. Conclusions

This study has established a rigorous algebraic framework for operator matrix polynomials and demonstrated its relevance to control system engineering, with a particular focus on aeroelasticity in flight dynamics. By unifying spectral factorization, companion forms, and block-pole assignment within a constructive operator-theoretic setting, the work provides both theoretical depth and practical utility.

The proposed adaptive block-pole placement scheme, enhanced with a neural compensator, successfully addresses the dual challenges of robustness and numerical conditioning, while maintaining modest control effort. The aeroelastic wing application confirmed that the method not only stabilizes nonlinear dynamics but also ensures rapid transients, precise regulation, and effective disturbance rejection when compared against established benchmarks such as eigenstructure assignment, LQR, and

H_{2}

-control.

Beyond its immediate results, this work underscores the potential of algebraic operator methods as a unifying bridge between abstract mathematical structures and applied control synthesis. Future research should focus on experimental validation through hardware-in-the-loop and wind-tunnel testing, extension to higher-order and distributed aeroelastic models, and automated optimization-based block-root selection (e.g., PSO or similar solvers) to further improve robustness and performance. Additionally, integrating real-time adaptation schemes to tune the neural compensator under time-varying operating points and actuator nonlinearities will consolidate the framework’s practical value for robust aeroservoelastic control.

Author Contributions

B.B.: Conceptualization, methodology, software, formal analysis, investigation, data curation, original draft writing, visualization, and project administration. K.H. and G.R.D.: Supervision, validation, methodology, and review and editing of the manuscript. V.Z.: Contribution of resources, technical input, and manuscript review and editing. A.-N.S.: Visualization, methodology, formal analysis, investigation, overall supervision, project coordination, funding acquisition, draft writing and critical manuscript revision. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Bekhiti, B. A Novel Recursive Algorithm for Inverting Matrix Polynomials via a Generalized Leverrier–Faddeev Scheme: Application to FEM Modeling of Wing Vibrations in a 4th-Generation Fighter Aircraft. Mathematics 2025, 13, 2101. [Google Scholar] [CrossRef]
Bekhiti, B. Matrix Fraction Description in Large Scale MIMO Descriptor Systems: Matrix Polynomials Approaches. Symmetry 2025, 17, 1413. [Google Scholar] [CrossRef]
Zaitsev, V.; Kim, I. Matrix eigenvalue spectrum assignment for linear control systems by static output feedback. Linear Algebra Its Appl. 2021, 613, 115–150. [Google Scholar] [CrossRef]
Yaici, M.; Hariche, K. On eigenstructure assignment using block poles placement. Eur. J. Control 2014, 20, 217–226. [Google Scholar] [CrossRef]
Yaici, M.; Tim, C. A Contribution to the Polynomial Eigen Problem. Int. J. Math. Comput. Nat. Phys. Eng. 2014, 8, 1131–1338. [Google Scholar] [CrossRef]
Zaitsev, V.; Kim, I. Finite Spectrum Assignment Problem and Stabilization of Bilinear Systems with Both Lumped and Distributed Delay. IFAC Pap. 2024, 58, 25–30. [Google Scholar] [CrossRef]
Zaitsev, V.A. On Arbitrary Matrix Coefficient Assignment for the Characteristic Matrix Polynomial of Block Matrix Linear Control Systems. Vestn. Udmurtsk. Univ. Mat. Mekh. Komp. Nauki 2024, 34, 339–358. [Google Scholar] [CrossRef]
Vardulakis, A.I.G. Linear Multivariable Control: Algebraic Analysis and Synthesis Methods; John Wiley and Sons: New York, NY, USA, 1991. [Google Scholar]
Vayssettes, J. New developments for matrix fraction descriptions: A fully-parametrised approach. Automatica 2016, 66, 15–24. [Google Scholar] [CrossRef]
Sugimoto, K. Left-right Polynomial Matrix Factorization for MIMO Pole/Zero Cancellation with Application to FEL. Trans. Inst. Syst. Control Inf. Eng. 2019, 32, 32–38. [Google Scholar] [CrossRef]
Kurbatov, V.G. Computation of a function of a matrix with close eigenvalues by means of the Newton interpolating polynomial. Linear Multilinear Algebra 2015, 62, 111–122. [Google Scholar] [CrossRef]
Cohen, N. Spectral analysis of regular matrix polynomials. Integral Equ. Oper. Theory 1983, 6, 161–183. [Google Scholar] [CrossRef]
Hariche, K.; Denman, E.D. On Solvents and Lagrange Interpolating-Matrices. Appl. Math. Comput. 1988, 25, 321–332. [Google Scholar] [CrossRef]
Periera, E. On solvents of matrix polynomials. Appl. Numer. Math. 2003, 47, 197–208. [Google Scholar] [CrossRef]
Chen, C.T. Linear System Theory and Design; Holt, Reinhart and Winston: Austin, TX, USA, 1984. [Google Scholar]
Bekhiti, B. On λ-matrices and their applications in MIMO control systems design. Int. J. Model. Identif. Control 2018, 29, 281–294. [Google Scholar] [CrossRef]
Bekhiti, B. On the theory of λ-matrices based MIMO control system design. Control Cybern. 2015, 44, 421–443. [Google Scholar]
Campbell, S.L. Singular Systems of Differential Equations; Pitman Advanced Publishing Program: Lanham, MD, USA, 1980. [Google Scholar]
Gohberg, I.; Lancaster, P.; Rodman, L. Matrix Polynomials; Classics in Applied Mathematics; Society for Industrial and Applied Mathematics: Lancaster, PA, USA, 2009; Volume 58. [Google Scholar]
Higham, N.J. Functions of Matrices: Theory and Computation; SIAM: New Delhi, India, 2000. [Google Scholar]
Kailath, T.; Li, W. Linear Systems; Prentice-Hall: Englewood Cliffs, NJ, USA, 1980. [Google Scholar]
Bekhiti, B. Multivariable Control System Design Using the Theory of Matrix Polynomials. Ph.D. Thesis, University of M’Hamed Bougara, Boumerdes, Algeria, 2018. [Google Scholar]
Bekhiti, B. Neural Adaptive Nonlinear MIMO Control for Bipedal Walking Robot Locomotion in Hazardous and Complex Task Applications. Robotics 2025, 14, 84. [Google Scholar] [CrossRef]
Shieh, L.S. Block transformations of a class of multivariable control systems to four basic block companion forms. Applications 1983, 9, 703–714. [Google Scholar] [CrossRef][Green Version]
Lancaster, P.; Tismenetsky, M. The Theory of Matrices with Applications; Academic Press: Cambridge, MA, USA, 1985. [Google Scholar]
Hariche, K.; Denman, E.D. Interpolation theory and λ-matrices. J. Math. Anal. Appl. 1989, 143, 530–547. [Google Scholar] [CrossRef][Green Version]
Markus, A.S. Introduction to the Spectral Theory of Polynomial Operator Pencils; American Mathematical Society: Providence, RI, USA, 1988. [Google Scholar]
Bekhiti, B. On the block decomposition and spectral factors of λ-matrices. Control Cybern. 2020, 49, 41–77. [Google Scholar]
Kratz, W. Numerical Solution of Matrix Polynomial Equations by Newton’s Method. IMA J. Numer. Anal. 1987, 7, 355–369. [Google Scholar] [CrossRef]
Zhang, B. Nonlinear Aeroelastic System Identification Based on Neural Network. Appl. Sci. 2018, 8, 1916. [Google Scholar] [CrossRef]
Muñoz, Á. Active Flutter Suppression of a Wing Section in the Subsonic, Sonic and Supersonic Regimes by the $H_{\infty}$ Control Method. Aerospace 2024, 11, 198. [Google Scholar] [CrossRef]
Ossmann, D. Fault Tolerant Control of an Experimental Flexible Wing. Aerospace 2019, 6, 76. [Google Scholar] [CrossRef]
Steffensen, R. Practical System Identification and Incremental Control Design for a Subscale Fixed-Wing Aircraft. Actuators 2024, 13, 130. [Google Scholar] [CrossRef]
Anjali, B.S. Simulation and Analysis of Integral LQR Controller for Inner Control Loop Design of a Fixed Wing Micro Aerial Vehicle (MAV). Procedia Technol. 2016, 25, 76–83. [Google Scholar] [CrossRef][Green Version]
Dennis, J.E., Jr.; Traub, J.F.; Weber, R.P. Algorithms for Solvents of Matrix Polynomials. SIAM J. Numer. Anal. 1978, 15, 523–533. [Google Scholar] [CrossRef]
Shieh, L.S.; Chahin, N. A computer-aided method for the factorisation of matrix polynomials. Int. J. Syst. Sci. 1981, 12, 305–323. [Google Scholar] [CrossRef]
Tsai, J.S.H. A computer-aided method for solvents and spectral factors of matrix polynomials. Appl. Math. Comput. 1992, 47, 211–235. [Google Scholar] [CrossRef]
Buslowicz, M. The computation of a transfer function matrix from the given state equations for time-delays systems. IEEE Trans. Autom. Control 1981, 26, 949–951. [Google Scholar] [CrossRef]
Tsai, J.S.H.; Chen, W.S. Generalized realization with a special coordinate for matrix fraction description. Int. J. Syst. Sci. 1992, 23, 2197–2217. [Google Scholar] [CrossRef]
Liu, G.P.; Patton, R. Eigenstructure Assignment for Control System Design; John Wiley & Sons, Inc.: New York, NY, USA, 1998. [Google Scholar]
Wonham, W.M. On Pole Assignment in Multi-Input Controllable Linear System. IEEE Trans. Autom. Control 1967, 12, 660–665. [Google Scholar] [CrossRef]
Andry, A.N. Eigenstructure Assignment for Linear Systems. IEEE Trans. Aerosp. Electron. Syst. 1983, AES-19, 711–729. [Google Scholar] [CrossRef]

Figure 1. Flexible wing with trailing-edge control surface.

Figure 2. The diagram of the control system consisting of a regulator with observed states.

Figure 3. Response (pitch angle and plunge) of the system for chirp and sine input signals.

Figure 4. Pitch-direction phase diagram: (a) without controller and (b) with controller.

Figure 5. The response of the aeroelastic wing using neural block-pole placement control.

Table 1. The extended Broyden’s algorithm for solving the matrix equation F(X) = 0.

1	$A_{1} = [1 2; 3 2]$ ; $A_{2} = [4 3; 2 5]$ ; $A_{3} = [1 3; 5 5]$ ; $A_{4} = [8 8; 6 1]$ ; % coefficients of $F (X)$
2	$X_{1} = I - 0.1 ⋆ r a n d (2,2)$ ; $B_{1} = i n v (100 \times y e (4,4))$ ; % initialization
3	For k = 1:N
4	$F_{k} = X_{k} ⋆ A_{1} - A_{4} ⋆ X_{k} - X_{k} ⋆ A_{2} ⋆ X_{k} + A_{3}$ ;	% $F (X_{k}) = F_{k}$
5	$f_{k} = [F_{k} (:, 1); F_{k} (:, 2)]$ ; $x_{k} = [X_{k} (:, 1); X_{k} (:, 2)]$ ;	% $f_{k} = v e c (F_{k})$ and $x_{k} = v e c (X_{k})$
6	$x_{k + 1} = x_{k} - B ⋆ f_{k};$	% $v e c (X_{k + 1}) = v e c (X_{k}) - J_{k}^{- 1} . v e c (F_{k + 1})$
7	$X_{k + 1} = [x_{k + 1} (1 : 2, :) x_{k + 1} (3 : 4, :)];$	% $X_{k}$ from $x_{k}$ using ${v e c}^{- 1} (X_{k})$
8	$F_{k + 1} = X_{k + 1} ⋆ A_{1} - A_{4} ⋆ X_{k + 1} - X_{k + 1} ⋆ A_{2} ⋆ X_{k + 1} + A_{3}$ ;	% $F (X_{k + 1}) = F_{k + 1}$
9	$f_{k + 1} = [F_{k + 1} (:, 1); F_{k + 1} (:, 2)]$ ;	% $f_{k + 1} = v e c (F (F_{k + 1}))$
10	$x_{k + 1} = [X_{k + 1} (:, 1); X_{k + 1} (:, 2)]$ ;	% $x_{k + 1} = v e c (X_{k + 1})$
11	$y_{k} = f_{k + 1} - f_{k}$ ; $s_{k} = x_{k + 1} - x_{k}$ ;	% $y_{k} = f_{k + 1} - f_{k}$ and $s_{k} = x_{k + 1} - x_{k}$
12	$B_{k + 1} = B_{k} + [(s_{k} - B_{k} ⋆ y_{k}) ⋆ (s_{k}^{⟙} ⋆ B_{k})] / (s_{k}^{⟙} ⋆ B_{k} ⋆ y_{k})$ ;	% $J^{- 1} (x_{k})$
13	$X_{k} = X_{k + 1}$ ;	% update
14	End, $X_{k}$ % solution

Table 2. The quotient–difference subroutines for the right and left factorization.

\underline{\bar{\begin{matrix} \begin{matrix} \begin{matrix} Right Q . D . algorithm \end{matrix} \\ \begin{matrix} \begin{matrix} \begin{matrix} Q_{i}^{(k + 1)} = Q_{i}^{(k)} + E_{i}^{(k)} - E_{i - 1}^{(k + 1)} \\ E_{i}^{(k + 1)} = Q_{i + 1}^{(k)} E_{i}^{(k)} {[Q_{i}^{(k + 1)}]}^{- 1} \end{matrix} \\ E_{0}^{(k)} = E_{l}^{(k)} = ○ \end{matrix} \\ i = 1, \dots l - 1, k = 1,2, \dots \end{matrix} \end{matrix} & \begin{matrix} \begin{matrix} \begin{matrix} Left Q . D . algorithm \end{matrix} \\ \begin{matrix} \begin{matrix} \begin{matrix} Q_{i}^{(k + 1)} = Q_{i}^{(k)} + E_{i}^{(k)} - E_{i - 1}^{(k + 1)} \\ E_{i}^{(k + 1)} = {[Q_{i}^{(k + 1)}]}^{- 1} E_{i}^{(k)} Q_{i + 1}^{(k)} \end{matrix} \\ E_{0}^{(k)} = E_{l}^{(k)} = ○ \end{matrix} \\ i = 1, \dots l - 1, k = 1,2, \dots \end{matrix} \end{matrix} \end{matrix} \end{matrix}}}

Table 3. Coded implementation for closed-loop singular value analysis of the aeroelastic system.

I_{n} = e y e (6,6)

;

S_{M} = []

;

S_{m} = []

;
For

ω = 0 : 0.01 : 20

;

H_{c l s} = C ⋆ i n v (j ⋆ ω ⋆ I_{n} - A_{c l s}) ⋆ B_{c l s} + D

;

H_{c l s}^{H} = - B_{c l s}^{⟙} ⋆ i n v (j ⋆ ω ⋆ I_{n} + A_{c l s}^{⟙}) ⋆ C^{⟙} + D^{⟙}

;

s = s q r t (a b s \{e i g (H_{c l s}^{H} * H_{c l s})\})

;

s_{M} = m a x (s)

;

S_{M} = [S_{M} s_{M}]

;

s_{m} = m i n (s)

;

S_{m} = [S_{m} s_{m}]

;
End

σ_{M} = m a x (S_{M})

,

σ_{m} = m a x (S_{m})

, %. Additionally, we can use

σ_{M}

= norm(tf(sys),Inf).

Table 4. Computation of

H_{2}

performance indices for the closed-loop aeroelastic system.

Table 4. Computation of

H_{2}

performance indices for the closed-loop aeroelastic system.

s y s = s s (A_{c l s}, B_{c l s}, C, D)

;

P = l y a p (A_{c l s}, - B_{c l s} ⋆ B_{c l s}^{⊤})

;

Q = l y a p (A_{c l s}^{⊤}, - C^{⊤} ⋆ C)

;

ϱ_{1} = s q r t (a b s \{t r a c e (C ⋆ P ⋆ C^{⊤})\})

,

ϱ_{2} = s q r t (a b s \{t r a c e (B_{c l s}^{⊤} ⋆ Q ⋆ B_{c l s})\})

% -------------------------- Comparison to MATLAB results -------------------------------------%

ϱ_{3} = n o r m (s y s, 2)

,

ϱ_{4} = s q r t (t r a c e \{c o v a r (s y s, 1)\})

% Produces the same result as

ϱ_{3}

Table 5. Computation of peak sensitivity norm for disturbance rejection in the closed-loop system.

I_{n} = e y e (6,6)

;

S_{M} = []

;
For

ω = 0 : 0.01 : 20

;

H_{1} = i n v (j ⋆ ω ⋆ I_{n} - A_{c l s}) ⋆ B_{c l s}

;

H_{1}^{H} = - B_{c l s}^{⟙} ⋆ i n v (j ⋆ ω ⋆ I_{n} + A_{c l s}^{⟙})

;

S_{c l s} = i n v (I_{n} + H_{1} ⋆ K)

;

S_{c l s}^{H} = i n v (I_{n} + K^{⟙} ⋆ H_{1}^{H})

;

s = s q r t (a b s \{e i g (S_{c l s}^{H} ⋆ S_{c l s})\})

;

s_{M} = m a x (s)

;

S_{M} = [S_{M} s_{M}]

;
End

S_{M} = m a x (S_{M})

Table 6. Comparative robustness metrics (evaluated on ω ∈ [0, ∞[).

Method	$σ_{m} (H_{c l s})$	$σ_{M} (H_{c l s})$	${‖H_{c l s}‖}_{2}$	$χ (H_{c l s})$	${‖K‖}_{2}$	${‖K‖}_{\infty}$	${‖S (j ω)‖}_{\infty}$	$μ (H_{c l s}, Δ H)$
Eigenstructure [33]	0.28	4.1293	29.873	28.441	2856.2	3797.4	310.68	0.95
LQR [31]	0.40	3.7820	15.635	14.324	1827.2	3215.0	220.65	0.84
$H_{2}$ -control [32]	0.52	2.2807	12.142	12.525	1678.1	2870.5	190.23	0.73
Proposed Method	0.1617	1.8090	4.6418	11.187	1528.1	2048.7	168.31	0.41

Table 7. Comparative transient and steady-state metrics.

Method	$t_{r} (s)$	$t_{s} (s)$	OS (%)	SSE (%)	DR (%)
Eigenstructure [33]	0.45	1.82	18.0	5.5	62
LQR [31]	0.40	1.50	12.5	3.2	74
$H_{2}$ -control [32]	0.38	1.40	10.0	2.8	81
Proposed Method	0.35	1.10	6.2	0.9	92

Table 8. Additional frequency and energetic performance indices.

Method	$G M (°)$	$P M (°)$	$ω_{B W}$ (rad/s)	$I A E$	$Effort {‖u‖}_{\infty}$	${‖T‖}_{2}$	${‖T‖}_{\infty}$
Eigenstructure [33]	3.50	35	18.0	1.85	2.50	240.2	310.3
LQR [31]	5.00	45	22.5	1.20	2.10	195.1	220.2
$H_{2}$ -control [32]	5.80	48	25.3	1.05	1.95	180.3	190.7
Proposed Method	8.21	62	31	0.72	1.55	125.2	140.8

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Bekhiti, B.; Hariche, K.; Zaitsev, V.; Duan, G.R.; Sharkawy, A.-N. The Algebraic Theory of Operator Matrix Polynomials with Applications to Aeroelasticity in Flight Dynamics and Control. Math. Comput. Appl. 2025, 30, 131. https://doi.org/10.3390/mca30060131

AMA Style

Bekhiti B, Hariche K, Zaitsev V, Duan GR, Sharkawy A-N. The Algebraic Theory of Operator Matrix Polynomials with Applications to Aeroelasticity in Flight Dynamics and Control. Mathematical and Computational Applications. 2025; 30(6):131. https://doi.org/10.3390/mca30060131

Chicago/Turabian Style

Bekhiti, Belkacem, Kamel Hariche, Vasilii Zaitsev, Guangren R. Duan, and Abdel-Nasser Sharkawy. 2025. "The Algebraic Theory of Operator Matrix Polynomials with Applications to Aeroelasticity in Flight Dynamics and Control" Mathematical and Computational Applications 30, no. 6: 131. https://doi.org/10.3390/mca30060131

APA Style

Bekhiti, B., Hariche, K., Zaitsev, V., Duan, G. R., & Sharkawy, A.-N. (2025). The Algebraic Theory of Operator Matrix Polynomials with Applications to Aeroelasticity in Flight Dynamics and Control. Mathematical and Computational Applications, 30(6), 131. https://doi.org/10.3390/mca30060131

Article Menu

The Algebraic Theory of Operator Matrix Polynomials with Applications to Aeroelasticity in Flight Dynamics and Control

Abstract

1. Introduction

2. Fundamentals of Matrix Algebra and Linear Vector Spaces

3. Matrix Polynomials (λ-Matrices) and Spectral Divisors

4. Standard Structures of Matrix Polynomials and Realization

4.1. Triples of Matrix Polynomials (λ-Matrices)

4.2. Pairs of Matrix Polynomials (λ-Matrices)

4.3. Characterization of Solvents by Invariant Pairs

5. Determination of Operator Roots (Spectral Factors)

6. Transformations Between Solvents and Spectral Factors

7. MFD Realization and Transformation Between Canonical Forms

8. The Proposed Control-Design Strategies

9. Applications in Control System Engineering

9.1. Aeroelasticity in Flight Dynamics

9.2. Comparative Study

10. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI