Article

Zeroing Neural Network for Pseudoinversion of an Arbitrary Time-Varying Matrix Based on Singular Value Decomposition

by Mariya Kornilova 1, Vladislav Kovalnogov 1, Ruslan Fedorov 1, Mansur Zamaleev 1, Vasilios N. Katsikis 2, Spyridon D. Mourtas 2 and Theodore E. Simos 1,3,4,5,6,*
1 Laboratory of Inter-Disciplinary Problems in Clean Energy Production, Ulyanovsk State Technical University, 32 Severny Venetz Street, 432027 Ulyanovsk, Russia
2 Department of Economics, Division of Mathematics and Informatics, National and Kapodistrian University of Athens, Sofokleous 1 Street, 10559 Athens, Greece
3 Department of Medical Research, China Medical University Hospital, China Medical University, Taichung 40402, Taiwan
4 Data Recovery Key Laboratory of Sichuan Province, Neijiang Normal University, Neijiang 641100, China
5 Section of Mathematics, Department of Civil Engineering, Democritus University of Thrace, 67100 Xanthi, Greece
6 Department of Mathematics, University of Western Macedonia, 50100 Kozani, Greece
* Author to whom correspondence should be addressed.
Mathematics 2022, 10(8), 1208; https://doi.org/10.3390/math10081208
Submission received: 11 March 2022 / Revised: 5 April 2022 / Accepted: 6 April 2022 / Published: 7 April 2022
(This article belongs to the Special Issue Numerical Analysis and Scientific Computing II)

Abstract:
Many researchers have investigated the time-varying (TV) matrix pseudoinverse problem in recent years, owing to its importance in addressing TV problems in science and engineering. In this paper, the problem of calculating the inverse or pseudoinverse of an arbitrary TV real matrix is considered and addressed using the singular value decomposition (SVD) and the zeroing neural network (ZNN) approaches. Since SVD is frequently used to compute the inverse or pseudoinverse of a matrix, this research proposes a new ZNN model based on the SVD method, as well as the technique of Tikhonov regularization, for solving the problem in continuous time. Numerical experiments, involving the pseudoinversion of square, rectangular, singular, and nonsingular input matrices, indicate that the proposed models are effective for solving the problem of the inversion or pseudoinversion of time-varying matrices.

1. Introduction and Preliminaries

In this paper, the zeroing neural network (ZNN) approach is used to address the problem of calculating the inverse or pseudoinverse of an arbitrary time-varying (TV) real matrix. On the one hand, the pseudoinverse, or Moore–Penrose inverse, of $A \in \mathbb{R}^{m \times n}$ is the unique matrix $A^\dagger$ such that the system of Penrose equations holds for $X := A^\dagger$ [1,2,3]:

$$A = A X A, \quad X = X A X, \quad (X A)^T = X A, \quad (A X)^T = A X, \qquad (1)$$
where $A^T$ denotes the transpose of $A$. Note that, if $A$ is a nonsingular square matrix, $A^\dagger$ becomes the usual inverse $A^{-1}$. On the other hand, the singular value decomposition (SVD) of $A \in \mathbb{R}^{m \times n}$ is a factorization of the form [4]:

$$A = U S V^T, \qquad (2)$$
where $U \in \mathbb{R}^{m \times m}$ and $V \in \mathbb{R}^{n \times n}$ are orthogonal matrices, i.e., $U^T = U^{-1}$ and $V^T = V^{-1}$, while $S \in \mathbb{R}^{m \times n}$ is a rectangular (or square, in the case $m = n$) diagonal matrix with the singular values of $A$ on its main diagonal. SVD is frequently used to compute the inverse or pseudoinverse of a matrix, and it appears widely across fields of scientific research, such as medical treatment and industrial applications, lattice computing [5], automatic classification of electromyograms [6], and face recognition [7]. In a recent work [8], the authors provided a zeroing neural network for computing the singular value decomposition of an arbitrary matrix. This work moves things one step further by designing a new ZNN model for calculating the inverse or pseudoinverse of an arbitrary TV matrix based on the singular value decomposition. For comparison purposes, we build another model based on direct pseudoinversion in accordance with the paper [9], and the experiments section demonstrates the efficacy of the proposed SVD model.
Zhang et al. [10] developed a ZNN design for generating online solutions to TV problems. It is worth noting that most ZNN-based dynamical systems fall under the category of recurrent neural networks (RNNs) that are designed to find equation zeros. As a consequence, numerous valuable research findings have been presented in the literature. Addressing generalized inversion problems [11,12], tensor and matrix inversion problems [13], systems of linear equations [14,15], systems of matrix equations [14,16], quadratic optimization problems [17], and the approximation of diverse matrix functions [18,19] are the main applications of ZNNs. The first stage in developing ZNN dynamics is to design an error function $E(t)$ that is tailored to the underlying problem, commonly known as the Zhang function [20]. The second stage takes advantage of the proper dynamical evolution that follows:
$$\dot{E}(t) = \frac{\mathrm{d}E(t)}{\mathrm{d}t} = -\lambda \mathcal{F}(E(t)), \qquad (3)$$
where $\dot{E}(t) \in \mathbb{R}^{m \times n}$ is the time derivative of $E(t) \in \mathbb{R}^{m \times n}$, $\lambda > 0$ is the design parameter used for scaling the convergence, and $\mathcal{F}(\cdot): \mathbb{R}^{m \times n} \to \mathbb{R}^{m \times n}$ denotes elementwise application of an odd and increasing activation function to $E(t)$. In our research, we will consider the ZNN evolution (3) under the linear activation function. That is,
$$\dot{E}(t) = \frac{\mathrm{d}E(t)}{\mathrm{d}t} = -\lambda E(t). \qquad (4)$$
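Since (4) is linear, it integrates in closed form, which makes the role of the design parameter explicit; this is the standard fact behind the exponential convergence invoked later in Theorems 1 and 2:

$$E(t) = e^{-\lambda t} E(0),$$

so every entry of the error matrix decays to zero exponentially at rate $\lambda$.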
This work’s key points may be summarized as below:
  • A novel ZNN approach, which is based on SVD, is employed for solving the problem of calculating the pseudoinverse of an arbitrary TV real matrix.
  • Two ZNN models for calculating the pseudoinverse of an arbitrary TV matrix are offered: one called ZNNSVDP, which is based on SVD, and the other called ZNNP, which is based on a more direct approach to the problem and is offered for comparison purposes.
  • Four numerical experiments, involving the pseudoinversion of square, rectangular, singular, and nonsingular input matrices, indicate that both models are effective for solving the problem and that the ZNNSVDP model converges to the problem’s solution faster than the ZNNP model.
Additionally, it is worth mentioning some of the paper’s general notations: the symbols $\mathbf{1}_n$ and $\mathbf{0}_n$ denote vectors in $\mathbb{R}^n$ consisting of ones and zeros, respectively; $O_{n \times n} \in \mathbb{R}^{n \times n}$ denotes a zero matrix of $n \times n$ dimensions; $I_n \in \mathbb{R}^{n \times n}$ denotes the identity $n \times n$ matrix; $\otimes$ denotes the Kronecker product; $\operatorname{vec}(\cdot)$ denotes the vectorization technique; $\odot$ denotes the Hadamard (or elementwise) product; and $\|\cdot\|_F$ denotes the matrix Frobenius norm.
The paper is organized as follows. Section 2 and Section 3, respectively, define and analyse the ZNNSVDP and ZNNP models. Section 4 presents and discusses the results of four numerical experiments employing the pseudoinversion of square, rectangular, singular, and nonsingular input matrices. Lastly, the final remarks and conclusions are offered in Section 5.

2. Time-Varying Pseudoinverse Computation Based on SVD

This section presents and analyses the ZNNSVDP model for calculating the pseudoinverse of an arbitrary TV real matrix. Considering a smooth TV matrix $A(t) \in \mathbb{R}^{m \times n}$, the inverse or pseudoinverse of $A(t)$ based on the SVD (2) is the following:

$$\begin{cases} A^{-1}(t) = V(t) S^{-1}(t) U^T(t), & m = n = \operatorname{rank}(A(t)),\\ A^\dagger(t) = V(t) S^\dagger(t) U^T(t), & \text{otherwise}, \end{cases} \qquad (5)$$
where $U(t) \in \mathbb{R}^{m \times m}$ and $V(t) \in \mathbb{R}^{n \times n}$ are TV orthogonal matrices, and $S(t) \in \mathbb{R}^{m \times n}$ is a rectangular (or square, in the case $m = n$) diagonal matrix with the singular values of $A(t)$ on its main diagonal. Here, we consider the decomposition in which the singular values of $A(t)$ appear in descending order on the main diagonal of $S(t)$. Based on (2) and (5), the ZNNSVDP model considers the following group of error functions for calculating the inverse or pseudoinverse of $A(t)$:
$$\begin{aligned} E_1(t) &= A(t) V(t) - U(t) S(t),\\ E_2(t) &= U^T(t) U(t) - I_m,\\ E_3(t) &= V^T(t) V(t) - I_n,\\ E_4(t) &= X(t) - V(t) Y(t) U^T(t), \end{aligned} \qquad (6)$$
where $X(t)$ is the desired solution of the problem, i.e., the inverse or pseudoinverse of $A(t)$, and

$$Y(t) = \begin{cases} S^{-1}(t), & m = n = \operatorname{rank}(A(t)),\\ S^\dagger(t), & \text{otherwise}. \end{cases} \qquad (7)$$
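For a frozen time instant, the construction in (5) and (7) can be illustrated with a few lines of MATLAB; the sketch below (not from the paper; the test matrix is arbitrary) builds the pseudoinverse from the SVD factors and compares it with the built-in pinv:

```matlab
% Illustrative check of (5)/(7) at a fixed time: A^dagger = V*Y*U',
% where Y = S^dagger is formed from the nonzero singular values.
A = [1 2; 3 4; 5 6];              % arbitrary full-column-rank 3x2 matrix
[U, S, V] = svd(A);               % A = U*S*V', per (2)
w = rank(A);
Y = zeros(size(A'));              % Y = S^dagger, per (7)
Y(1:w, 1:w) = diag(1 ./ diag(S(1:w, 1:w)));
X = V * Y * U';                   % candidate pseudoinverse
disp(norm(X - pinv(A), 'fro'))    % essentially zero
```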
The following proposition about the structure and construction of the pseudoinverse of a diagonal matrix is offered; a full examination of this proposition can be found in [21].
Proposition 1.
For a rectangular (or a square singular) diagonal matrix $B \in \mathbb{R}^{m \times n}$, let $b_1, b_2, \ldots, b_w$, with $w = \operatorname{rank}(B)$, signify the nonzero elements of the main diagonal of $B$. Then, the pseudoinverse matrix of $B$ is the following:

$$B^\dagger = \begin{bmatrix} \bar{B}^{-1} & O_{w \times (m-w)}\\ O_{(n-w) \times w} & O_{(n-w) \times (m-w)} \end{bmatrix} \in \mathbb{R}^{n \times m}, \quad \text{with } \bar{B}^{-1} = \begin{bmatrix} \frac{1}{b_1} & 0 & \cdots & 0\\ 0 & \frac{1}{b_2} & \cdots & 0\\ \vdots & \vdots & \ddots & \vdots\\ 0 & 0 & \cdots & \frac{1}{b_w} \end{bmatrix}. \qquad (8)$$
In addition, the first time derivative of (6) is the following:
$$\begin{aligned} \dot{E}_1(t) &= \dot{A}(t) V(t) + A(t) \dot{V}(t) - \dot{U}(t) S(t) - U(t) \dot{S}(t),\\ \dot{E}_2(t) &= \dot{U}^T(t) U(t) + U^T(t) \dot{U}(t),\\ \dot{E}_3(t) &= \dot{V}^T(t) V(t) + V^T(t) \dot{V}(t),\\ \dot{E}_4(t) &= \dot{X}(t) - \dot{V}(t) Y(t) U^T(t) - V(t) \dot{Y}(t) U^T(t) - V(t) Y(t) \dot{U}^T(t), \end{aligned} \qquad (9)$$
where the first time derivative of $Y(t)$ is the following [22]:
$$\dot{Y}(t) = \begin{cases} \dot{S}^{-1}(t) = -S^{-1}(t) \dot{S}(t) S^{-1}(t), & m = n = \operatorname{rank}(A(t)),\\ \dot{S}^\dagger(t) = -S^\dagger(t) \dot{S}(t) S^\dagger(t) + \big(S^\dagger(t) (S^\dagger(t))^T\big) \dot{S}^T(t) \big(I_m - S(t) S^\dagger(t)\big)\\ \qquad\qquad + \big(I_n - S^\dagger(t) S(t)\big) \dot{S}^T(t) \big((S^\dagger(t))^T S^\dagger(t)\big), & \text{otherwise}, \end{cases} \qquad (10)$$
or, equivalently,
$$\dot{Y}(t) = -Y(t) \dot{S}(t) Y(t) + \big(Y(t) Y^T(t)\big) \dot{S}^T(t) \big(I_m - S(t) Y(t)\big) + \big(I_n - Y(t) S(t)\big) \dot{S}^T(t) \big(Y^T(t) Y(t)\big). \qquad (11)$$
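The derivative formula (11) is easy to sanity-check numerically. The sketch below (illustrative, not from the paper) compares the right-hand side of (11) with a central finite difference of $S^\dagger(t)$ for a diagonal $S(t)$ of constant rank:

```matlab
% Finite-difference check of (11) for an illustrative S(t) of constant rank 1.
m = 4; n = 2; h = 1e-5; t = 0.7;
Sf = @(t) [diag([2 + sin(t), 0]); zeros(m - n, n)];  % S(t), rank 1 for all t
S  = Sf(t);  Y = pinv(S);                            % Y(t) = S^dagger(t)
Sd = (Sf(t + h) - Sf(t - h)) / (2*h);                % central difference of S(t)
Yd = -Y*Sd*Y + (Y*Y') * Sd' * (eye(m) - S*Y) ...
     + (eye(n) - Y*S) * Sd' * (Y'*Y);                % right-hand side of (11)
YdFD = (pinv(Sf(t + h)) - pinv(Sf(t - h))) / (2*h);  % finite difference of Y(t)
disp(norm(Yd - YdFD, 'fro'))                         % O(h^2), i.e., tiny
```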
Notice that (11) simplifies to $\dot{Y}(t) = -Y(t) \dot{S}(t) Y(t)$ for $m = n = \operatorname{rank}(A(t))$. Then, combining (6), (9) and (11) with the ZNN design under the linear activation function (4), the following may be acquired:
$$\begin{aligned} \dot{A}(t) V(t) + A(t) \dot{V}(t) - \dot{U}(t) S(t) - U(t) \dot{S}(t) &= -\lambda E_1(t),\\ \dot{U}^T(t) U(t) + U^T(t) \dot{U}(t) &= -\lambda E_2(t),\\ \dot{V}^T(t) V(t) + V^T(t) \dot{V}(t) &= -\lambda E_3(t),\\ \dot{X}(t) - \dot{V}(t) Y(t) U^T(t) - V(t) \big({-}Y(t) \dot{S}(t) Y(t) + (Y(t) Y^T(t)) \dot{S}^T(t) (I_m - S(t) Y(t)) &\\ {}+ (I_n - Y(t) S(t)) \dot{S}^T(t) (Y^T(t) Y(t))\big) U^T(t) - V(t) Y(t) \dot{U}^T(t) &= -\lambda E_4(t). \end{aligned} \qquad (12)$$
Using vectorization and the Kronecker product, the dynamics of (12) are modified as follows:
$$\begin{aligned} -(S^T(t) \otimes I_m) \operatorname{vec}(\dot{U}(t)) + (I_n \otimes A(t)) \operatorname{vec}(\dot{V}(t)) - (I_n \otimes U(t)) \operatorname{vec}(\dot{S}(t)) &= \operatorname{vec}\!\big({-}\lambda E_1(t) - \dot{A}(t) V(t)\big),\\ (U^T(t) \otimes I_m) \operatorname{vec}(\dot{U}^T(t)) + (I_m \otimes U^T(t)) \operatorname{vec}(\dot{U}(t)) &= \operatorname{vec}\!\big({-}\lambda E_2(t)\big),\\ (V^T(t) \otimes I_n) \operatorname{vec}(\dot{V}^T(t)) + (I_n \otimes V^T(t)) \operatorname{vec}(\dot{V}(t)) &= \operatorname{vec}\!\big({-}\lambda E_3(t)\big),\\ -(I_m \otimes V(t) Y(t)) \operatorname{vec}(\dot{U}^T(t)) - (U(t) Y^T(t) \otimes I_n) \operatorname{vec}(\dot{V}(t)) - K_1(t) + I_{mn} \operatorname{vec}(\dot{X}(t)) &= \operatorname{vec}\!\big({-}\lambda E_4(t)\big), \end{aligned} \qquad (13)$$
where
$$\begin{aligned} K_1(t) = {}& -\big(U(t) Y^T(t) \otimes V(t) Y(t)\big) \operatorname{vec}(\dot{S}(t))\\ &+ \big(U(t) (I_m - S(t) Y(t))^T \otimes V(t) Y(t) Y^T(t)\big) \operatorname{vec}(\dot{S}^T(t))\\ &+ \big(U(t) Y^T(t) Y(t) \otimes V(t) (I_n - Y(t) S(t))\big) \operatorname{vec}(\dot{S}^T(t)). \end{aligned} \qquad (14)$$
Note that (13) must be simplified in order to produce a simple and explicit dynamical model that may easily calculate $U(t)$, $V(t)$, $S(t)$, and $X(t)$. As a result, the following lemmas about vectorization and the Kronecker product are offered; [23] provides a full examination of the content of Lemmas 1 and 2.
Lemma 1.
For matrices $A \in \mathbb{R}^{m \times n}$, $X \in \mathbb{R}^{n \times l}$, and $L \in \mathbb{R}^{l \times k}$, the matrix equation $A X L = Y$, with $Y \in \mathbb{R}^{m \times k}$, is equivalent to the vectorized linear system
$$(L^T \otimes A) \operatorname{vec}(X) = \operatorname{vec}(Y).$$
Lemma 2.
For $B \in \mathbb{R}^{m \times m}$, let $\operatorname{vec}(B) \in \mathbb{R}^{m^2}$ signify the vectorization of the matrix $B$. The following holds:
$$\operatorname{vec}(B^T) = Q_m \operatorname{vec}(B), \qquad (15)$$
where $Q_m \in \mathbb{R}^{m^2 \times m^2}$ is a constant permutation matrix defined exclusively by $m$.
Algorithm 1, below, presents an algorithmic process for obtaining the permutation matrix $Q_m$ in (15), which corresponds to a matrix of $m \times m$ dimensions. Note that the notations eye(·) and reshape(·) in Algorithm 1 have the typical notion of the related MATLAB functions [24].
Algorithm 1 Permutation matrix calculation
Require: The number of rows (or columns) m of a square matrix B ∈ R^(m×m).
 1: procedure Permutation_Matrix(m)
 2:      Set a = eye(m^2) and b = reshape(1:m^2, m, m)
 3:      return Q = a(:, reshape(b', 1, m^2))
 4: end procedure
Ensure: Q_m, i.e., the permutation matrix.
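A one-to-one MATLAB transcription of Algorithm 1 (a sketch; the transpose in step 3 follows the reconstruction above), together with a check of property (15):

```matlab
% Algorithm 1 in MATLAB, plus a check that vec(B') = Q_m * vec(B).
m = 3;
a = eye(m^2);
b = reshape(1:m^2, m, m);
Qm = a(:, reshape(b', 1, m^2));            % permutation (commutation) matrix Q_m
B = magic(m);                              % any m-by-m test matrix
disp(norm(Qm * B(:) - reshape(B', [], 1))) % 0 if Q_m satisfies (15)
```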
Furthermore, because $S(t)$ and $S^T(t)$ are rectangular (or square, in the case $m = n$) diagonal matrices, just the nonzero elements of $\dot{S}(t)$ and $\dot{S}^T(t)$ that are placed on their main diagonals must be obtained. By doing so, we may confine $S(t)$ to being a diagonal matrix, while also reducing the dimensions of (13). Hence, employing the nonzero elements on the main diagonals of $S(t)$ and $S^T(t)$, whose number is $w = \operatorname{rank}(A(t))$, we utilize the equations $\operatorname{vec}(\dot{S}(t)) = G_1 \dot{s}(t)$ and $\operatorname{vec}(\dot{S}^T(t)) = G_2 \dot{s}(t)$, respectively, to replace $\dot{S}(t)$ and $\dot{S}^T(t)$ in (14), where $\dot{s}(t) \in \mathbb{R}^w$ collects the derivatives of the $w$ nonzero diagonal elements and the matrices $G_1, G_2 \in \mathbb{R}^{mn \times w}$ are operational matrices that can be calculated using the algorithmic procedure presented in Algorithm 2; a MATLAB transcription is sketched after the algorithm. Additionally, the notations sum(·), min(·), zeros(·), mod(·) and floor(·) in Algorithm 2 have the typical notion of the related MATLAB functions [24].
Algorithm 2 Operational matrix calculation
Require: The numbers of rows and columns, respectively m and n, of a matrix B ∈ R^(m×n), and w = rank(B).
 1: procedure Operational_Matrix(m, n, w)
 2:     if w < min(m, n) then
 3:         Set h = w
 4:     else
 5:         Set h = m
 6:     end if
 7:     Set G = zeros(mn, w)
 8:     for k = 1 : mn do
 9:         Set c = mod(k − 1, h) + 1 and d = floor((k − 1)/h) + 1
 10:        if d == c then
 11:            Set G(k, c) = 1
 12:        end if
 13:    end for
 14:    return G
 15: end procedure
Ensure: The operational matrix G.
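A MATLAB transcription of Algorithm 2 follows (a sketch, not the authors' code), with a check of the defining property $\operatorname{vec}(S) = G_1 s$ on a rank-1 case of the shape used in the experiments of Section 4; $G_2$ would be obtained analogously from the dimensions of $S^T$, i.e., as operational_matrix(n, m, w), which is our assumption about the intended usage:

```matlab
% Algorithm 2 in MATLAB, plus a check that vec(S) = G1 * s for a rank-1,
% 4x2 rectangular diagonal S (the shape of Experiment 3).
m = 4; n = 2; w = 1;
G1 = operational_matrix(m, n, w);          % maps s to vec(S)
s = 5;  S = zeros(m, n);  S(1, 1) = s;     % diagonal S with w nonzero entries
disp(norm(G1 * s - S(:)))                  % 0 if G1 is correct

function G = operational_matrix(m, n, w)
    if w < min(m, n), h = w; else, h = m; end
    G = zeros(m*n, w);
    for k = 1:m*n
        c = mod(k - 1, h) + 1;             % candidate column index of G
        d = floor((k - 1) / h) + 1;
        if d == c, G(k, c) = 1; end        % mark the position of a diagonal entry
    end
end
```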
Based on the aforementioned discussion, (13) can be reformulated as follows:
$$\begin{aligned} -(S^T(t) \otimes I_m) \operatorname{vec}(\dot{U}(t)) + (I_n \otimes A(t)) \operatorname{vec}(\dot{V}(t)) - (I_n \otimes U(t)) G_1 \dot{s}(t) &= \operatorname{vec}\!\big({-}\lambda E_1(t) - \dot{A}(t) V(t)\big),\\ \big((U^T(t) \otimes I_m) Q_m + (I_m \otimes U^T(t))\big) \operatorname{vec}(\dot{U}(t)) &= \operatorname{vec}\!\big({-}\lambda E_2(t)\big),\\ \big((V^T(t) \otimes I_n) Q_n + (I_n \otimes V^T(t))\big) \operatorname{vec}(\dot{V}(t)) &= \operatorname{vec}\!\big({-}\lambda E_3(t)\big),\\ -(I_m \otimes V(t) Y(t)) Q_m \operatorname{vec}(\dot{U}(t)) - (U(t) Y^T(t) \otimes I_n) \operatorname{vec}(\dot{V}(t)) - K_2(t) \dot{s}(t) + I_{mn} \operatorname{vec}(\dot{X}(t)) &= \operatorname{vec}\!\big({-}\lambda E_4(t)\big), \end{aligned} \qquad (16)$$
where
$$\begin{aligned} K_2(t) = {}& -\big(U(t) Y^T(t) \otimes V(t) Y(t)\big) G_1 + \big(U(t) (I_m - S(t) Y(t))^T \otimes V(t) Y(t) Y^T(t)\big) G_2\\ &+ \big(U(t) Y^T(t) Y(t) \otimes V(t) (I_n - Y(t) S(t))\big) G_2. \end{aligned} \qquad (17)$$
As a result, setting
$$Z_1(t) = \begin{bmatrix} -(S^T(t) \otimes I_m)\\ (U^T(t) \otimes I_m) Q_m + (I_m \otimes U^T(t))\\ 0_{n^2 \times m^2}\\ -(I_m \otimes V(t) Y(t)) Q_m \end{bmatrix}, \quad Z_2(t) = \begin{bmatrix} (I_n \otimes A(t))\\ 0_{m^2 \times n^2}\\ (V^T(t) \otimes I_n) Q_n + (I_n \otimes V^T(t))\\ -(U(t) Y^T(t) \otimes I_n) \end{bmatrix},$$

$$Z_3(t) = \begin{bmatrix} -(I_n \otimes U(t)) G_1\\ 0_{m^2 \times w}\\ 0_{n^2 \times w}\\ -K_2(t) \end{bmatrix}, \quad Z_4(t) = \begin{bmatrix} 0_{mn \times mn}\\ 0_{m^2 \times mn}\\ 0_{n^2 \times mn}\\ I_{mn} \end{bmatrix}, \quad Z(t) = \begin{bmatrix} Z_1(t) & Z_2(t) & Z_3(t) & Z_4(t) \end{bmatrix}, \qquad (18)$$
$$q(t) = \begin{bmatrix} \operatorname{vec}\!\big({-}\lambda E_1(t) - \dot{A}(t) V(t)\big)\\ \operatorname{vec}\!\big({-}\lambda E_2(t)\big)\\ \operatorname{vec}\!\big({-}\lambda E_3(t)\big)\\ \operatorname{vec}\!\big({-}\lambda E_4(t)\big) \end{bmatrix}, \quad \dot{x}(t) = \begin{bmatrix} \operatorname{vec}(\dot{U}(t))\\ \operatorname{vec}(\dot{V}(t))\\ \dot{s}(t)\\ \operatorname{vec}(\dot{X}(t)) \end{bmatrix}, \quad x(t) = \begin{bmatrix} \operatorname{vec}(U(t))\\ \operatorname{vec}(V(t))\\ s(t)\\ \operatorname{vec}(X(t)) \end{bmatrix}, \qquad (19)$$
we propose the following ZNN model:
$$Z^T(t) Z(t)\, \dot{x}(t) = Z^T(t)\, q(t), \qquad (20)$$
where $Z^T(t) Z(t)$ is a singular mass matrix. To address the singularity, Tikhonov regularization is used and (20) is converted into:

$$\big(Z^T(t) Z(t) + \beta I_{m^2 + n^2 + w + mn}\big)\, \dot{x}(t) = Z^T(t)\, q(t), \qquad (21)$$

where $\beta \geq 0$ signifies the regularization parameter. The ZNN model (21) is termed the ZNNSVDP model and can be solved efficiently with an appropriate MATLAB ODE solver. The exponential convergence of the ZNNSVDP model (21) to the theoretical TV inverse or pseudoinverse of the input matrix $A(t)$ is proven in Theorem 1.
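Schematically, (21) can be handed to a MATLAB ODE solver by solving the regularized linear system for $\dot{x}$ inside the derivative function. In the sketch below, build_Zq is a hypothetical user-supplied helper (not from the paper) that assembles $Z(t)$ and $q(t)$ from (18) and (19) at the current state:

```matlab
% Schematic integration of (21); build_Zq is a hypothetical helper that
% assembles Z(t) and q(t) per (18)-(19), and x0 packs the initial
% vec(U), vec(V), s, vec(X) as in (19).
lambda = 10;  beta = 1e-8;                  % parameter values used in Section 4
odefun = @(t, x) znnsvdp_rhs(t, x, lambda, beta);
[tt, xx] = ode45(odefun, [0, 10], x0);      % x0: initial state x(0)

function dx = znnsvdp_rhs(t, x, lambda, beta)
    [Z, q] = build_Zq(t, x, lambda);        % hypothetical assembly per (18)-(19)
    M = Z.' * Z + beta * eye(size(Z, 2));   % Tikhonov-regularized matrix of (21)
    dx = M \ (Z.' * q);                     % solve (21) for x'(t)
end
```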
Remark 1.
According to the syntax of MATLAB's ODE solvers [24], the mass matrix is a matrix $M$ that relates the time derivative $\dot{x}$ of the generalized coordinate vector $x$ of a system to the right-hand side, through an equation of the form:

$$M(t, x)\, \dot{x} = f(t, x).$$
Theorem 1.
Let $U(t) \in \mathbb{R}^{m \times m}$, $V(t) \in \mathbb{R}^{n \times n}$, and $S(t) \in \mathbb{R}^{m \times n}$ be differentiable, with $S(t)$ a rectangular diagonal matrix. The ZNNSVDP model (21), starting from any initial value $x(0)$, converges exponentially to the theoretical TV inverse or pseudoinverse of the input matrix $A(t)$.
Proof. 
In order to obtain the solution $x(t)$, which corresponds to the TV inverse or pseudoinverse of the input matrix $A(t)$, the error matrix equation group is defined as in (6), in line with the ZNN design. Following that, by adopting the linear design formula (4) for zeroing (6), the model (12) is obtained. By Theorem 1 of [10], each error matrix equation in the error matrix equation group (12) converges exponentially to its theoretical solution as $t \to \infty$. As a result, the solution of (12) converges exponentially to the theoretical TV inverse or pseudoinverse of the input matrix $A(t)$ as $t \to \infty$. The claim then follows from the derivation procedure of (21) from (12), which completes the proof.    □

3. Alternative Time-Varying Pseudoinverse Computation

This section presents and analyses a ZNN model, namely ZNNP, for calculating the pseudoinverse of any TV real matrix, based on a recent work [9] on ZNN pseudoinverse computation; this model will serve as a strong and fair competitor to the proposed ZNNSVDP model. Considering a smooth TV matrix $A(t) \in \mathbb{R}^{m \times n}$: if $\operatorname{rank}(A(t)) = n < m$, the Moore–Penrose (MP) inverse $A^\dagger(t)$ becomes the left inverse $A^\dagger(t) = A_L^{-1}(t) = (A^T(t) A(t))^{-1} A^T(t)$ of $A(t)$, which satisfies $A^T(t) A(t) A^\dagger(t) = A^T(t)$. Otherwise, if $\operatorname{rank}(A(t)) = m < n$, the MP inverse $A^\dagger(t)$ becomes the right inverse $A^\dagger(t) = A_R^{-1}(t) = A^T(t) (A(t) A^T(t))^{-1}$, which satisfies $A^\dagger(t) A(t) A^T(t) = A^T(t)$. Therefore, we can design a ZNN model according to the following equations:
$$\begin{cases} A^T(t) A(t) A^{-1}(t) = A^T(t), & m = n = \operatorname{rank}(A(t)),\\ A^T(t) A(t) A^\dagger(t) = A^T(t), & \operatorname{rank}(A(t)) = n < m,\\ A^\dagger(t) A(t) A^T(t) = A^T(t), & \operatorname{rank}(A(t)) = m < n. \end{cases} \qquad (22)$$
Based on (22), the ZNNP model considers the following error function for calculating the inverse or pseudoinverse of $A(t)$:

$$E_D(t) = \begin{cases} A^T(t) A(t) X(t) - A^T(t), & \operatorname{rank}(A(t)) \le n \le m,\\ X(t) A(t) A^T(t) - A^T(t), & \operatorname{rank}(A(t)) \le m < n, \end{cases} \qquad (23)$$
where $X(t)$ is the desired solution of the problem, i.e., the inverse or pseudoinverse of $A(t)$. Furthermore, the first time derivative of (23) is the following:

$$\dot{E}_D(t) = \begin{cases} \dot{A}^T(t) A(t) X(t) + A^T(t) \dot{A}(t) X(t) + A^T(t) A(t) \dot{X}(t) - \dot{A}^T(t), & \operatorname{rank}(A(t)) \le n \le m,\\ \dot{X}(t) A(t) A^T(t) + X(t) \dot{A}(t) A^T(t) + X(t) A(t) \dot{A}^T(t) - \dot{A}^T(t), & \operatorname{rank}(A(t)) \le m < n. \end{cases} \qquad (24)$$
Then, combining (23) and (24) with the ZNN design (4), under the linear activation function, the following can be obtained:
$$\begin{cases} \dot{A}^T(t) A(t) X(t) + A^T(t) \dot{A}(t) X(t) + A^T(t) A(t) \dot{X}(t) - \dot{A}^T(t) = -\lambda \big(A^T(t) A(t) X(t) - A^T(t)\big), & \operatorname{rank}(A(t)) \le n \le m,\\ \dot{X}(t) A(t) A^T(t) + X(t) \dot{A}(t) A^T(t) + X(t) A(t) \dot{A}^T(t) - \dot{A}^T(t) = -\lambda \big(X(t) A(t) A^T(t) - A^T(t)\big), & \operatorname{rank}(A(t)) \le m < n. \end{cases} \qquad (25)$$
Using vectorization and the Kronecker product, the dynamics of (25) are modified as follows:
$$\begin{aligned} (I_m \otimes A^T(t) A(t)) \operatorname{vec}(\dot{X}(t)) &= \operatorname{vec}\!\big({-}\lambda (A^T(t) A(t) X(t) - A^T(t)) - \dot{A}^T(t) A(t) X(t) - A^T(t) \dot{A}(t) X(t) + \dot{A}^T(t)\big), && \operatorname{rank}(A(t)) \le n \le m,\\ (A(t) A^T(t) \otimes I_n) \operatorname{vec}(\dot{X}(t)) &= \operatorname{vec}\!\big({-}\lambda (X(t) A(t) A^T(t) - A^T(t)) - X(t) \dot{A}(t) A^T(t) - X(t) A(t) \dot{A}^T(t) + \dot{A}^T(t)\big), && \operatorname{rank}(A(t)) \le m < n. \end{aligned} \qquad (26)$$
As a result, setting
$$L(t) = \begin{cases} I_m \otimes A^T(t) A(t) + \beta I_{mn}, & \operatorname{rank}(A(t)) < n \le m,\\ I_m \otimes A^T(t) A(t), & \operatorname{rank}(A(t)) = n \le m,\\ A(t) A^T(t) \otimes I_n + \beta I_{mn}, & \operatorname{rank}(A(t)) < m < n,\\ A(t) A^T(t) \otimes I_n, & \operatorname{rank}(A(t)) = m < n, \end{cases}$$

$$r(t) = \begin{cases} \operatorname{vec}\!\big({-}\lambda (A^T(t) A(t) X(t) - A^T(t)) - \dot{A}^T(t) A(t) X(t) - A^T(t) \dot{A}(t) X(t) + \dot{A}^T(t)\big), & \operatorname{rank}(A(t)) \le n \le m,\\ \operatorname{vec}\!\big({-}\lambda (X(t) A(t) A^T(t) - A^T(t)) - X(t) \dot{A}(t) A^T(t) - X(t) A(t) \dot{A}^T(t) + \dot{A}^T(t)\big), & \operatorname{rank}(A(t)) \le m < n, \end{cases}$$

$$\dot{x}(t) = \operatorname{vec}(\dot{X}(t)), \quad x(t) = \operatorname{vec}(X(t)), \qquad (27)$$
where $\beta \geq 0$ signifies the Tikhonov regularization parameter, we have the following ZNN model:

$$L(t)\, \dot{x}(t) = r(t), \qquad (28)$$
where $L(t)$ is a mass matrix. Note that Tikhonov regularization is used in $L(t)$ in the cases $\operatorname{rank}(A(t)) < n \le m$ and $\operatorname{rank}(A(t)) < m < n$, respectively, because the products $A^T(t) A(t)$ and $A(t) A^T(t)$ are then singular. The ZNN model (28) is termed the ZNNP model and can be solved efficiently with a MATLAB ODE solver, while its exponential convergence to the theoretical TV inverse or pseudoinverse of the input matrix $A(t)$ is proven in Theorem 2.
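To make the ZNNP dynamics concrete, here is a minimal, self-contained MATLAB sketch of (28) for the full-column-rank branch of (25); the input matrix and its derivative are illustrative (not from the paper), and the terminal state is checked against pinv:

```matlab
% Minimal ZNNP sketch, full-column-rank branch of (25):
%   A'A*X' = -lambda*(A'A*X - A') - Ad'*A*X - A'*Ad*X + Ad'.
lambda = 10;  m = 3;  n = 2;
Af  = @(t) [sin(t) + 2, 1; 2, cos(t) + 3; 1, 2];   % illustrative A(t), rank 2
Adf = @(t) [cos(t), 0; 0, -sin(t); 0, 0];          % its time derivative
x0 = reshape(sign(pinv(Af(0))), [], 1);            % x(0) = sign(x*(0)), as in Section 4
[tt, xx] = ode45(@(t, x) znnp_rhs(t, x, Af, Adf, lambda, m, n), [0, 10], x0);
X = reshape(xx(end, :), n, m);                     % X(t_f)
disp(norm(X - pinv(Af(10)), 'fro'))                % small after convergence

function dx = znnp_rhs(t, x, Af, Adf, lambda, m, n)
    A = Af(t);  Ad = Adf(t);  X = reshape(x, n, m);
    R = -lambda * (A'*A*X - A') - Ad'*A*X - A'*Ad*X + Ad';
    dx = reshape((A'*A) \ R, [], 1);               % vec of X'(t)
end
```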
Theorem 2.
The ZNNP model (28), starting from any initial value $x(0)$, converges exponentially to the theoretical TV inverse or pseudoinverse of the input matrix $A(t)$.
Proof. 
In order to obtain the solution $x(t)$, which corresponds to the TV inverse or pseudoinverse of the input matrix $A(t)$, the error matrix equation is defined as in (23), in line with the ZNN design. Following that, by adopting the linear design formula (4) for zeroing (23), the model (25) is obtained. By Theorem 1 of [10], each error matrix equation in (25) converges to the theoretical solution as $t \to \infty$. As a consequence, the solution of (25) converges to the theoretical TV inverse or pseudoinverse of the input matrix $A(t)$ as $t \to \infty$. Moreover, from the derivation procedure of (28), we know it is (25) in a different form. The proof is, thus, completed.    □

4. Numerical Experiments

This section compares and contrasts the performance of the ZNNSVDP model (21) and the ZNNP model (28) on four numerical experiments (NE), involving the pseudoinversion of square, rectangular, singular, and nonsingular input matrices. In all NE, the time interval is restricted to $[0, 10]$ during the computation, which indicates that the starting time is $t_0 = 0$ and the ending time is $t_f = 10$, while the ZNN design parameter has been set to $\lambda = 10$ and the Tikhonov regularization parameter has been set to $\beta = 10^{-8}$. It is worth mentioning that the notations ZNNSVDP and ZNNP in the legends of Figure 1, respectively, denote the solutions produced by the ZNNSVDP and ZNNP models. Lastly, the MATLAB solver ode45 has been used, while the initial value for both models has been set to $x(0) = \operatorname{sign}(x^*(0))$, where $x^*(0)$ is the theoretical solution at $t = 0$ and $\operatorname{sign}$ is the signum function.

4.1. Experiment 1

This NE deals with the inversion of the following square matrix:
$$A(t) = \begin{bmatrix} 4/(t+18) & \sin(t) + 2\\ (t+18)/(2t+2) & \cos(t) - 20 \end{bmatrix}. \qquad (29)$$
Note that $A(t)$ is a full-rank matrix with dimensions $2 \times 2$.
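As a quick sanity check (illustrative; the minus sign in the $(2,2)$ entry of (29) is assumed, since the published layout drops signs), the determinant of $A(t)$ stays bounded away from zero over $[0, 10]$, consistent with the full-rank claim:

```matlab
% Determinant of (29) sampled over the computation interval [0, 10].
t = linspace(0, 10, 1001);
detA = 4 ./ (t + 18) .* (cos(t) - 20) - (sin(t) + 2) .* (t + 18) ./ (2*t + 2);
fprintf('min |det A| = %.3f\n', min(abs(detA)))   % stays well above zero
```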

4.2. Experiment 2

This NE deals with the pseudoinversion of the following rectangular matrix:
$$A(t) = \begin{bmatrix} 3\sin(t) + 7 & 3 + \cos(t) & \sin(2t) + 4\\ 5 - \sin(t) & 4/(t+8) & 7 - \sin(t)\\ 7/2 + \sin(3t) & 4 + \cos(t) & 6 + \cos(3t)\\ 3 - \sin(t) & \cos(t) - 20 & 7/2 + \sin(5t)\\ 5 - \sin(t) & \sin(t) + 1 & 3 + \cos(t) \end{bmatrix}. \qquad (30)$$
Notice that $A(t)$ is a full-column-rank matrix with dimensions $5 \times 3$.

4.3. Experiment 3

The pseudoinversion of the following rectangular matrix is the subject of this NE:
$$A(t) = \begin{bmatrix} 5\cos(\pi t) & 3 + \sin(\pi t) & 4\cos(t) & 1 + 3\sin(t) \end{bmatrix}^T \otimes \mathbf{1}_2^T \in \mathbb{R}^{4 \times 2}. \qquad (31)$$
The matrix $A(t)$ is rank deficient, with $\operatorname{rank}(A(t)) = 1$, and its dimensions are $4 \times 2$.

4.4. Experiment 4

This NE is related to the pseudoinversion of the rectangular matrix given below:
$$A(t) = \mathbf{1}_m \otimes \begin{bmatrix} 2 + \sin(t) & 2 + \tfrac{1}{2}\sin(t) & \cdots & 2 + \tfrac{1}{n}\sin(t) \end{bmatrix} \in \mathbb{R}^{m \times n}. \qquad (32)$$
With $\operatorname{rank}(A(t)) = 1$, the matrix $A(t)$ is rank deficient, and its dimensions are $m \times n$, where $m = 4$ and $n = 9$.

4.5. Analysis of Numerical Experiments—Results and Comparison

The performance of the ZNNSVDP and ZNNP models for calculating the inverse or pseudoinverse of an arbitrary matrix $A(t)$ is investigated through the four NE defined in Section 4.1, Section 4.2, Section 4.3 and Section 4.4. For all the experiments, the results produced by the ZNNSVDP and ZNNP models are depicted in Figure 1. It is worth noting that Figure 1 has the following layout: the figures in the first column show the convergence of the error functions, i.e., $E_i(t)$, $i = 1, \ldots, 4$, of the ZNNSVDP model and $E_D(t)$ of the ZNNP model; the figures in the second column show the convergence of the models according to the appropriate error function, i.e., the residual errors; the figures in the third column show the trajectories of the solutions generated by the models.
The following can be deduced from the NE of this section. Overall, the error functions of the ZNNSVDP model, i.e., $E_i(t)$, $i = 1, \ldots, 4$, attain lower values than the error function of the ZNNP model, i.e., $E_D(t)$, in all NE, as depicted in Figure 1a,d,g,j. When $X(t)$ corresponds to the solution of the ZNNSVDP model rather than the solution of the ZNNP model, the convergence in Figure 1b,h,k is faster, while the convergence in Figure 1e is almost identical. It is worth noting that Figure 1b depicts the residual error $\|I - A(t) X(t)\|_F$ in the case of NE Section 4.1, Figure 1e depicts the residual error $\|I - X(t) A(t)\|_F$ in the case of NE Section 4.2, Figure 1h depicts the residual error $\|A^T(t) - X(t) A(t) A^T(t)\|_F$ in the case of NE Section 4.3, and Figure 1k depicts the residual error $\|A^T(t) - A^T(t) A(t) X(t)\|_F$ in the case of NE Section 4.4. Finally, Figure 1c,f,i,l show that both models' solutions match the theoretical inverse in the case of NE Section 4.1, and the theoretical pseudoinverse in the cases of NE Section 4.2, Section 4.3 and Section 4.4.
According to the presented NE, the following general conclusions can be drawn. The ZNNSVDP model presented in this paper, which is based on the SVD method, shows better performance than the ZNNP model, which is based on a more direct approach for calculating the inverse or pseudoinverse. In addition, the ZNNSVDP model attains the smallest Frobenius-norm values for both the error functions and the residual errors. It is also important to note that, for both models, the larger the value of the design parameter $\lambda$, the faster the convergence.

5. Conclusions

The problem of calculating the inverse or pseudoinverse of an arbitrary TV real matrix is addressed using the ZNN approach in this paper. Two ZNN models for calculating the inverse or pseudoinverse of an arbitrary TV matrix, one called ZNNSVDP, which is based on SVD, and the other called ZNNP, which is based on a more direct approach to the problem, are defined, analysed and compared. Four numerical experiments, involving the pseudoinversion of square, rectangular, singular, and nonsingular input matrices, indicate that both models are effective for solving the problem and that the ZNNSVDP model converges to the problem’s solution faster than the ZNNP model.
Some potential study areas can be identified:
  • It is possible to explore variants of the ZNNSVDP and ZNNP models that are accelerated by nonlinear activation functions, including nonlinear ZNNSVDP and ZNNP model flows with terminal convergence in this direction.
  • Another option is to use carefully chosen fuzzy parameters to define future ZNN dynamics upgrades.
  • The presented ZNNSVDP and ZNNP models have the drawback of not being noise tolerant, because all types of noise have a substantial impact on the accuracy of the proposed ZNN approaches. As a consequence, future research could focus on adapting the ZNNSVDP and ZNNP models to an integration-enhanced and noise-handling ZNN class of dynamical systems.

Author Contributions

M.K.: conceptualization, methodology. V.K.: validation, investigation. R.F.: formal analysis, investigation. M.Z.: methodology, investigation. V.N.K.: conceptualization, methodology, validation, formal analysis, investigation, writing—original draft. S.D.M.: conceptualization, methodology, validation, formal analysis, investigation, writing—original draft. T.E.S.: methodology, formal analysis, investigation. All authors have read and agreed to the published version of the manuscript.

Funding

The research was supported by a Mega Grant from the Government of the Russian Federation within the framework of federal project No. 075-15-2021-584.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Penrose, R. A generalized inverse for matrices. Proc. Cambridge Philos. Soc. 1955, 51, 406–413.
  2. Sayevand, K.; Pourdarvish, A.; Machado, J.A.T.; Erfanifar, R. On the Calculation of the Moore–Penrose and Drazin Inverses: Application to Fractional Calculus. Mathematics 2021, 9, 2501.
  3. Chien, M.T. Numerical Range of Moore–Penrose Inverse Matrices. Mathematics 2020, 8, 830.
  4. Crane, D.K.; Gockenbach, M.S. The Singular Value Expansion for Arbitrary Bounded Linear Operators. Mathematics 2020, 8, 1346.
  5. Valverde-Albacete, F.J.; Peláez-Moreno, C. The Singular Value Decomposition over Completed Idempotent Semifields. Mathematics 2020, 8, 1577.
  6. Hazarika, A.; Barthakur, M.; Dutta, L.; Bhuyan, M. F-SVD based algorithm for variability and stability measurement of bio-signals, feature extraction and fusion for pattern recognition. Biomed. Signal Process. Control 2019, 47, 26–40.
  7. Wang, J.; Le, N.T.; Lee, J.; Wang, C. Illumination compensation for face recognition using adaptive singular value decomposition in the wavelet domain. Inf. Sci. 2018, 435, 69–93.
  8. Chen, J.; Zhang, Y. Online singular value decomposition of time-varying matrix via zeroing neural dynamics. Neurocomputing 2020, 383, 314–323.
  9. Katsikis, V.N.; Stanimirović, P.S.; Mourtas, S.D.; Xiao, L.; Karabasević, D.; Stanujkić, D. Zeroing Neural Network with Fuzzy Parameter for Computing Pseudoinverse of Arbitrary Matrix. IEEE Trans. Fuzzy Syst. 2021, Early Access.
  10. Zhang, Y.; Ge, S.S. Design and analysis of a general recurrent neural network model for time-varying matrix inversion. IEEE Trans. Neural Netw. 2005, 16, 1477–1490.
  11. Wang, X.; Che, M.; Wei, Y. Recurrent neural network for computation of generalized eigenvalue problem with real diagonalizable matrix pair and its applications. Neurocomputing 2016, 216, 230–241.
  12. Stanimirović, P.S.; Katsikis, V.N.; Zhang, Z.; Li, S.; Chen, J.; Zhou, M. Varying-parameter Zhang neural network for approximating some expressions involving outer inverses. Optim. Methods Softw. 2020, 35, 1304–1330.
  13. Ma, H.; Li, N.; Stanimirović, P.S.; Katsikis, V.N. Perturbation theory for Moore–Penrose inverse of tensor via Einstein product. Comput. Appl. Math. 2019, 38, 111.
  14. Katsikis, V.N.; Mourtas, S.D.; Stanimirović, P.S.; Zhang, Y. Solving Complex-Valued Time-Varying Linear Matrix Equations via QR Decomposition With Applications to Robotic Motion Tracking and on Angle-of-Arrival Localization. IEEE Trans. Neural Netw. Learn. Syst. 2021, Early Access.
  15. Stanimirović, P.S.; Katsikis, V.N.; Li, S. Hybrid GNN-ZNN models for solving linear matrix equations. Neurocomputing 2018, 316, 124–134.
  16. Stanimirović, P.S.; Katsikis, V.N.; Li, S. Integration enhanced and noise tolerant ZNN for computing various expressions involving outer inverses. Neurocomputing 2019, 329, 129–143.
  17. Zhang, Z.; Yang, S.; Zheng, L. A Penalty Strategy Combined Varying-Parameter Recurrent Neural Network for Solving Time-Varying Multi-Type Constrained Quadratic Programming Problems. IEEE Trans. Neural Netw. Learn. Syst. 2021, 32, 2993–3004.
  18. Katsikis, V.N.; Stanimirović, P.S.; Mourtas, S.D.; Li, S.; Cao, X. Towards Higher Order Dynamical Systems. In Generalized Inverses: Algorithms and Applications; Mathematics Research Developments; Nova Science Publishers, Inc.: Hauppauge, NY, USA, 2021; pp. 207–239.
  19. Katsikis, V.N.; Mourtas, S.D.; Stanimirović, P.S.; Zhang, Y. Continuous-Time Varying Complex QR Decomposition via Zeroing Neural Dynamics. Neural Process. Lett. 2021, 53, 3573–3590.
  20. Zhang, Y.; Guo, D. Zhang Functions and Various Models; Springer: Berlin/Heidelberg, Germany, 2015.
  21. Ben-Israel, A.; Greville, T.N.E. Generalized Inverses: Theory and Applications, 2nd ed.; CMS Books in Mathematics; Springer: New York, NY, USA, 2003.
  22. Golub, G.H.; Pereyra, V. The Differentiation of Pseudo-Inverses and Nonlinear Least Squares Problems Whose Variables Separate. SIAM J. Numer. Anal. 1973, 10, 413–432.
  23. Graham, A. Kronecker Products and Matrix Calculus with Applications; Courier Dover Publications: Mineola, NY, USA, 2018.
  24. Gupta, A.K. Numerical Methods Using MATLAB; MATLAB Solutions Series; Springer: New York, NY, USA, 2014.
Figure 1. The convergence of ZFs and the solutions’ convergence and trajectories in NEs Section 4.1, Section 4.2, Section 4.3 and Section 4.4. (a) NE Section 4.1: Convergence of ZFs. (b) NE Section 4.1: Solutions convergence. (c) NE Section 4.1: Solutions trajectories. (d) NE Section 4.2: Convergence of ZFs. (e) NE Section 4.2: Solutions convergence. (f) NE Section 4.2: Solutions trajectories. (g) NE Section 4.3: Convergence of ZFs. (h) NE Section 4.3: Solutions convergence. (i) NE Section 4.3: Solutions trajectories. (j) NE Section 4.4: Convergence of ZFs. (k) NE Section 4.4: Solutions convergence. (l) NE Section 4.4: Solutions trajectories.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
