Article

Tensor Global Extrapolation Methods Using the n-Mode and the Einstein Products

Alaa El Ichi, Khalide Jbilou and Rachid Sadaka
1 LABMIA-SI, Department of Mathematics, University Mohammed V Rabat, Rabat 10000, Morocco
2 Department of Mathematics LMPA, 50 rue F. Buisson, ULCO, 62100 Calais, France
3 Laboratory of Modeling Simulation, Mohammed VI Polytechnic University, Hay Moulay Rachid, Ben Guerir 43150, Morocco
* Author to whom correspondence should be addressed.
Mathematics 2020, 8(8), 1298; https://doi.org/10.3390/math8081298
Submission received: 9 June 2020 / Revised: 21 July 2020 / Accepted: 2 August 2020 / Published: 5 August 2020
(This article belongs to the Special Issue Matrix Structures: Numerical Methods and Applications)

Abstract: In this paper, we present new tensor extrapolation methods as generalizations of well-known vector, matrix and block extrapolation methods, such as polynomial extrapolation methods and ϵ-type algorithms. We define new tensor products that are used to introduce global tensor extrapolation methods. We discuss the application of these methods to the solution of linear and nonlinear tensor systems of equations and propose an efficient implementation of these methods via the global-QR decomposition.

1. Introduction

Scalar extrapolation methods have been developed to accelerate the convergence of sequences of numbers in $\mathbb{R}$ or $\mathbb{C}$. The process consists in transforming a sequence $(s_n)$ converging to $s$ into new ones by applying transformations $T_k$, $k = 1, 2, \ldots$, defined as follows:
$$T_k : s_n \longmapsto T_k^{(n)} = s_n + \sum_{i=1}^{k-1} a_i^{(n)} g_i^{(n)},$$
where $g_i^{(n)}$, $i = 1, \ldots, k-1$, are scalar sequences that define the method. One of the earliest extrapolation methods is the well-known Aitken $\Delta^2$ process [1], defined by
$$T_2^{(n)} = s_n - \frac{(\Delta s_n)^2}{\Delta^2 s_n},$$
where the first and second forward differences are defined by $\Delta s_n = s_{n+1} - s_n$ and $\Delta^2 s_n = \Delta s_{n+1} - \Delta s_n$; in this case $g_1^{(n)} = \Delta s_n$. This process was first proposed in 1926 by Aitken in [2]. It is well known that, under some assumptions, the Aitken sequence $(T_2^{(n)})$ converges to the same limit $s$ faster than $(s_n)$; see [1] for more details. Vector extrapolation methods have been proposed over the last decades, among them the minimal polynomial extrapolation (MPE) [3,4], the reduced rank extrapolation (RRE) method [5] and the modified minimal polynomial extrapolation (MMPE) [6,7]. For more details, see [1,8,9,10,11,12]. A second class of vector sequence transformations contains the topological ϵ-algorithm (TEA) [6]. Applications to the solution of large linear and nonlinear systems of equations have been considered in [8,13].
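To make the scalar case concrete, the following minimal Python sketch applies Aitken's $\Delta^2$ process to the partial sums of a slowly converging alternating series (the function name and the test sequence are ours, chosen for illustration only):

```python
import numpy as np

def aitken_delta2(s):
    """Aitken's Delta^2 process: T_2^(n) = s_n - (Delta s_n)^2 / (Delta^2 s_n)."""
    s = np.asarray(s, dtype=float)
    ds = s[1:] - s[:-1]        # first forward differences  Delta s_n
    d2s = ds[1:] - ds[:-1]     # second forward differences Delta^2 s_n
    return s[:-2] - ds[:-1] ** 2 / d2s

# Partial sums of log(2) = 1 - 1/2 + 1/3 - ...
s = np.cumsum([(-1) ** (n + 1) / n for n in range(1, 12)])
print(abs(np.log(2) - s[-1]))                 # error of the plain sequence
print(abs(np.log(2) - aitken_delta2(s)[-1]))  # markedly smaller error
```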
Vector extrapolation methods have been used in many applications, such as Google PageRank computations by Golub et al. [14,15], and in other fields such as statistics [16] or the solution of discretized Navier–Stokes problems [17]. In the present paper, we consider tensor sequence transformations and propose new tensor extrapolation methods that generalize the classical vector ones. Using the Einstein product, we define some new tensor products that allow us to develop the new methods, based on orthogonal or oblique projections onto subspaces of small dimensions. It will be shown that when the tensor sequence is generated linearly, the proposed methods are theoretically equivalent to some tensor Krylov subspace methods, such as the tensor versions of the GMRES and Lanczos methods developed recently in [18,19].
The remainder of this paper is organized as follows. In Section 2, we give notations and some basic definitions and properties related to tensors. In Section 3, we introduce the tensor versions of the vector polynomial extrapolation methods, namely the Tensor Global Reduced Rank Extrapolation (TG-RRE), the Tensor Global Minimal Polynomial Extrapolation (TG-MPE) and the Tensor Global Modified Minimal Polynomial Extrapolation (TG-MMPE). We also give a global tensor version of the topological ϵ-transformation. Section 4 describes the application of the proposed methods to the solution of linear and nonlinear tensor systems of equations. In Section 5, we introduce efficient implementations via the tensor global-QR decomposition, and in the last section we present some numerical experiments.

2. Preliminaries and Notations

In this section, we briefly review some concepts and notions that are used throughout the paper. A tensor is a multidimensional array of data and a natural extension of scalars, vectors and matrices to higher orders: a scalar is a 0th-order tensor, a vector is a 1st-order tensor and a matrix is a 2nd-order tensor. The order of a tensor is its number of indices, which are called modes or ways. For a given $N$-mode tensor $\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_N}$, the notation $x_{i_1,\ldots,i_N}$ (with $1 \le i_j \le I_j$, $j = 1, \ldots, N$) stands for element $(i_1, \ldots, i_N)$ of the tensor $\mathcal{X}$. For a given tensor $\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_N}$, the notation
$$\mathcal{X}_{\underbrace{:, :, \ldots, :}_{(N-1)\ \mathrm{times}},\, k}, \qquad k = 1, 2, \ldots, I_N,$$
denotes the tensor in $\mathbb{R}^{I_1 \times I_2 \times \cdots \times I_{N-1}}$ obtained by fixing the last index; it is called a frontal slice. Fibers are the higher-order analogue of matrix rows and columns: a fiber is obtained by fixing every index but one. A matrix column is a mode-1 fiber and a matrix row is a mode-2 fiber. Figure 1 shows the frontal, horizontal and lateral slices of a third-order tensor, as well as a mode-3 tube fiber.
The $n$-mode matrix (matricization) of a tensor $\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_N}$ is denoted by $X_{(n)}$ and arranges the mode-$n$ fibers as the columns of the resulting matrix $X_{(n)} \in \mathbb{R}^{I_n \times (I_1 I_2 \cdots I_{n-1} I_{n+1} \cdots I_N)}$, $n = 1, \ldots, N$. We have
$$X_{(n)}(i_n, j) = x_{i_1,\ldots,i_N},$$
where $j = 1 + \sum_{k=1, k \neq n}^{N} (i_k - 1) J_k$ with $J_k = \prod_{m=1, m \neq n}^{k-1} I_m$.
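As an illustration, the following NumPy sketch realizes the mode-$n$ matricization under exactly this column ordering (`unfold` is our name, not notation from the paper):

```python
import numpy as np

def unfold(X, n):
    """Mode-n matricization X_(n): the mode-n fibers become the columns.

    order='F' makes the first remaining index vary fastest, which matches
    the column ordering j = 1 + sum_{k != n} (i_k - 1) J_k given above."""
    return np.reshape(np.moveaxis(X, n, 0), (X.shape[n], -1), order='F')

X = np.arange(24).reshape(3, 4, 2)  # a small third-order tensor
print(unfold(X, 1).shape)           # (4, 6): mode-2 fibers as columns
```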
An important operation on tensors is the tensor-matrix multiplication [20], also known as the $n$-mode product. The $n$-mode product of a tensor $\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_N}$ with a matrix $U \in \mathbb{R}^{J \times I_n}$ is the tensor of size $I_1 \times \cdots \times I_{n-1} \times J \times I_{n+1} \times \cdots \times I_N$ whose entries are given by
$$(\mathcal{X} \times_n U)_{i_1, \ldots, i_{n-1}, j, i_{n+1}, \ldots, i_N} = \sum_{i_n=1}^{I_n} x_{i_1 \ldots i_N}\, u_{j i_n}.$$
The $n$-mode vector product of a tensor $\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_N}$ with a vector $v \in \mathbb{R}^{I_n}$, denoted by $\mathcal{X}\, \tilde{\times}_n\, v$, is the tensor of size $I_1 \times \cdots \times I_{n-1} \times I_{n+1} \times \cdots \times I_N$ whose entries are given by
$$(\mathcal{X}\, \tilde{\times}_n\, v)_{i_1, \ldots, i_{n-1}, i_{n+1}, \ldots, i_N} = \sum_{i_n=1}^{I_n} x_{i_1 \ldots i_N}\, v_{i_n}.$$
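Both $n$-mode products reduce to a single tensor contraction, as in this minimal sketch (function names are ours):

```python
import numpy as np

def mode_n_product(X, U, n):
    """n-mode product X x_n U with a matrix U of size J x I_n."""
    # Contract mode n of X with the second index of U; tensordot appends
    # the new axis last, so move it back to position n.
    return np.moveaxis(np.tensordot(X, U, axes=([n], [1])), -1, n)

def mode_n_vec_product(X, v, n):
    """n-mode vector product: contracts mode n of X with v and drops it."""
    return np.tensordot(X, v, axes=([n], [0]))

X = np.random.rand(4, 5, 6)
print(mode_n_product(X, np.random.rand(3, 5), 1).shape)   # (4, 3, 6)
print(mode_n_vec_product(X, np.random.rand(6), 2).shape)  # (4, 5)
```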
We recall the definition and some properties of the Einstein product; see [19].
Definition 1.
Let $\mathcal{A} \in \mathbb{R}^{I_1 \times \cdots \times I_L \times K_1 \times \cdots \times K_N}$ and $\mathcal{B} \in \mathbb{R}^{K_1 \times \cdots \times K_N \times J_1 \times \cdots \times J_M}$. The Einstein product of the tensors $\mathcal{A}$ and $\mathcal{B}$ is the tensor of size $I_1 \times \cdots \times I_L \times J_1 \times \cdots \times J_M$ defined by
$$(\mathcal{A} *_N \mathcal{B})_{i_1 \ldots i_L j_1 \ldots j_M} = \sum_{k_1=1}^{K_1} \sum_{k_2=1}^{K_2} \cdots \sum_{k_N=1}^{K_N} a_{i_1 \ldots i_L k_1 \ldots k_N}\, b_{k_1 \ldots k_N j_1 \ldots j_M}.$$
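Since the Einstein product contracts the last $N$ modes of $\mathcal{A}$ against the first $N$ modes of $\mathcal{B}$, it can be sketched in one call to np.tensordot (a hedged illustration, not code from the paper):

```python
import numpy as np

def einstein_product(A, B, N):
    """Einstein product A *_N B: contract the last N modes of A
    with the first N modes of B."""
    return np.tensordot(A, B, axes=N)

# A in R^{2x3x4x5}, B in R^{4x5x6}  ->  A *_2 B in R^{2x3x6}
A = np.random.rand(2, 3, 4, 5)
B = np.random.rand(4, 5, 6)
print(einstein_product(A, B, 2).shape)  # (2, 3, 6)
```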
Definition 2
([19]). Let $\mathcal{A} = (a_{i_1, \ldots, i_N, j_1, \ldots, j_M}) \in \mathbb{R}^{I_1 \times \cdots \times I_N \times J_1 \times \cdots \times J_M}$ and $\mathcal{B} = (b_{i_1, \ldots, i_M, j_1, \ldots, j_N}) \in \mathbb{R}^{J_1 \times \cdots \times J_M \times I_1 \times \cdots \times I_N}$. If $b_{i_1, \ldots, i_M, j_1, \ldots, j_N} = a_{j_1, \ldots, j_N, i_1, \ldots, i_M}$, then $\mathcal{B}$ is called the transpose of $\mathcal{A}$, denoted by $\mathcal{A}^T$.
A tensor $\mathcal{D} = (d_{i_1, \ldots, i_N, j_1, \ldots, j_N}) \in \mathbb{C}^{I_1 \times \cdots \times I_N \times I_1 \times \cdots \times I_N}$ is diagonal if $d_{i_1, \ldots, i_N, j_1, \ldots, j_N} = 0$ whenever the indices $(i_1, \ldots, i_N)$ differ from $(j_1, \ldots, j_N)$. If all the entries of a tensor are zero, we call it the zero tensor, denoted by $\mathcal{O}$.
The trace of a tensor $\mathcal{A} \in \mathbb{C}^{I_1 \times \cdots \times I_N \times I_1 \times \cdots \times I_N}$ is given by $\mathrm{tr}(\mathcal{A}) = \sum_{i_1 \ldots i_N} a_{i_1 \ldots i_N i_1 \ldots i_N}$.
The inner product of two tensors of the same size $\mathcal{A}, \mathcal{B} \in \mathbb{R}^{I_1 \times \cdots \times I_N \times J_1 \times \cdots \times J_M}$ is given by
$$\langle \mathcal{A}, \mathcal{B} \rangle = \mathrm{tr}(\mathcal{B}^T *_N \mathcal{A}),$$
and the associated norm is defined by
$$\|\mathcal{A}\|^2 = \mathrm{tr}(\mathcal{A}^T *_N \mathcal{A}).$$
Definition 3
([21]). A tensor $\mathcal{X} \in \mathbb{R}^{I_1 \times \cdots \times I_N \times I_1 \times \cdots \times I_N}$ is called the inverse of the square tensor $\mathcal{A} \in \mathbb{R}^{I_1 \times \cdots \times I_N \times I_1 \times \cdots \times I_N}$, and is denoted by $\mathcal{A}^{-1}$, if it satisfies
$$\mathcal{A} *_N \mathcal{X} = \mathcal{X} *_N \mathcal{A} = \mathcal{I},$$
where $\mathcal{I}$ is the identity (unit) tensor, whose entries are all zero except for the diagonal entries $\mathcal{I}_{i_1, \ldots, i_N, i_1, \ldots, i_N} = 1$.
In the following, we introduce a new product, denoted $\Diamond_{(N+M+1)}$, between two tensors; it is a generalization of the diamond product introduced in [22].
Definition 4.
The $\Diamond_{(N+M+1)}$ tensor product of two $(N+M+1)$-mode tensors $\mathcal{X} = [\mathcal{X}_1, \ldots, \mathcal{X}_\ell] \in \mathbb{R}^{I_1 \times \cdots \times I_N \times J_1 \times \cdots \times J_M \times \ell}$ and $\mathcal{Y} = [\mathcal{Y}_1, \ldots, \mathcal{Y}_p] \in \mathbb{R}^{I_1 \times \cdots \times I_N \times J_1 \times \cdots \times J_M \times p}$, where $\mathcal{X}_i, \mathcal{Y}_j \in \mathbb{R}^{I_1 \times \cdots \times I_N \times J_1 \times \cdots \times J_M}$, is the $\ell \times p$ matrix whose $(i, j)$ entry, $i = 1, \ldots, \ell$, $j = 1, \ldots, p$, is given by
$$(\mathcal{X} \Diamond_{(N+M+1)} \mathcal{Y})_{i,j} = \langle \mathcal{X}_i, \mathcal{Y}_j \rangle = \mathrm{tr}(\mathcal{Y}_j^T *_N \mathcal{X}_i).$$
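For real tensors, $\langle \mathcal{A}, \mathcal{B} \rangle = \mathrm{tr}(\mathcal{B}^T *_N \mathcal{A})$ reduces to the sum of elementwise products, so the $\Diamond_{(N+M+1)}$ product is simply a small Gram-type matrix of inner products. A minimal sketch, in which an $(N+M+1)$-mode tensor is represented as a Python list of its column tensors (our representation, chosen for simplicity):

```python
import numpy as np

def inner(A, B):
    """<A, B> = tr(B^T *_N A); for real tensors this equals sum(A * B)."""
    return np.sum(A * B)

def diamond(Xs, Ys):
    """Diamond product of X = [X_1, ..., X_l] and Y = [Y_1, ..., Y_p]:
    the l x p matrix with entries <X_i, Y_j>."""
    return np.array([[inner(Xi, Yj) for Yj in Ys] for Xi in Xs])

Xs = [np.random.rand(3, 4, 2) for _ in range(2)]
Ys = [np.random.rand(3, 4, 2) for _ in range(3)]
print(diamond(Xs, Ys).shape)  # (2, 3)
```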
Next, we give a proposition that will be used later.
Proposition 1.
Assume that $\mathcal{U} = [\mathcal{U}_1, \ldots, \mathcal{U}_m] \in \mathbb{R}^{I_1 \times \cdots \times I_N \times J_1 \times \cdots \times J_M \times m}$ is an $(N+M+1)$-mode tensor such that $\mathcal{U}_1, \ldots, \mathcal{U}_m \in \mathbb{R}^{I_1 \times \cdots \times I_N \times J_1 \times \cdots \times J_M}$, and let $y = (y_1, \ldots, y_m)^T \in \mathbb{R}^m$. Then, for an arbitrary $(N+M+1)$-mode tensor $\mathcal{Y} = [\mathcal{Y}_1, \ldots, \mathcal{Y}_m]$ such that $\mathcal{Y}_1, \ldots, \mathcal{Y}_m \in \mathbb{R}^{I_1 \times \cdots \times I_N \times J_1 \times \cdots \times J_M}$, we have
$$\mathcal{Y} \Diamond_{(N+M+1)} \left(\mathcal{U}\, \tilde{\times}_{(N+M+1)}\, y\right) = \left(\mathcal{Y} \Diamond_{(N+M+1)} \mathcal{U}\right) y.$$
Proof. 
The proof follows easily from the definitions of the two tensor products involved. □
Definition 5.
The set of $(N+M)$-mode tensors $\mathcal{U}_1, \ldots, \mathcal{U}_m \in \mathbb{R}^{I_1 \times \cdots \times I_N \times J_1 \times \cdots \times J_M}$ is called orthonormal if
$$\langle \mathcal{U}_i, \mathcal{U}_j \rangle = \delta_{i,j} \quad (= 1 \text{ if } i = j \text{ and } 0 \text{ elsewhere}).$$
Remark 1.
Suppose that $\mathcal{U} = [\mathcal{U}_1, \ldots, \mathcal{U}_m] \in \mathbb{R}^{I_1 \times \cdots \times I_N \times J_1 \times \cdots \times J_M \times m}$ is an $(N+M+1)$-mode tensor such that $\mathcal{U}_1, \ldots, \mathcal{U}_m \in \mathbb{R}^{I_1 \times \cdots \times I_N \times J_1 \times \cdots \times J_M}$. If the $(N+M)$-mode tensors $\mathcal{U}_1, \ldots, \mathcal{U}_m$ are orthonormal, then we have
$$\mathcal{U} \Diamond_{(N+M+1)} \mathcal{U} = I_m.$$

3. Tensor Extrapolation Methods

In this section, we present two classes of new tensor global extrapolation methods. The first class contains the tensor polynomial based methods while the second class is devoted to the global tensor topological ϵ -algorithm.

3.1. Tensor Global-Polynomial Extrapolation Methods

Let $(\mathcal{S}_n)$ be a sequence of tensors in $\mathbb{R}^{I_1 \times \cdots \times I_N \times K_1 \times \cdots \times K_M}$ and consider the transformations $T_k$, $k = 1, 2, \ldots$, from $\mathbb{R}^{I_1 \times \cdots \times I_N \times K_1 \times \cdots \times K_M}$ onto $\mathbb{R}^{I_1 \times \cdots \times I_N \times K_1 \times \cdots \times K_M}$ defined by
$$T_k^{(n)} = T_k(\mathcal{S}_n) = \mathcal{S}_n + \sum_{j=0}^{k-1} \alpha_j^{(k)} \mathcal{G}_j^{(n)} = \mathcal{S}_n + \mathcal{H}_k^{(n)}\, \tilde{\times}_{(N+M+1)}\, \alpha^{(k)}, \tag{2}$$
where $\alpha^{(k)} = (\alpha_0^{(k)}, \ldots, \alpha_{k-1}^{(k)})^T \in \mathbb{R}^k$ and $\mathcal{H}_k^{(n)} = [\mathcal{G}_0^{(n)}, \ldots, \mathcal{G}_{k-1}^{(n)}]$ is the $(N+M+1)$-mode tensor built from the given auxiliary tensor sequences $(\mathcal{G}_i^{(n)})_n \subset \mathbb{R}^{I_1 \times \cdots \times I_N \times K_1 \times \cdots \times K_M}$, $i = 0, \ldots, k-1$. We will see later how to choose the vector $\alpha^{(k)} \in \mathbb{R}^k$.
Let $\tilde{T}_k$ denote the new transformation obtained from $T_k$ as follows:
$$\tilde{T}_k(\mathcal{S}_n) = \mathcal{S}_{n+1} + \sum_{j=0}^{k-1} \alpha_j^{(k)} \mathcal{G}_j^{(n+1)} = \mathcal{S}_{n+1} + \mathcal{H}_k^{(n+1)}\, \tilde{\times}_{(N+M+1)}\, \alpha^{(k)}.$$
Notice that the scalars $\alpha_j^{(k)}$ are the same in the expressions of $T_k(\mathcal{S}_n)$ and $\tilde{T}_k(\mathcal{S}_n)$. We now define the generalized residual of $T_k^{(n)}$ by
$$\tilde{\mathcal{R}}_k^{(n)} = \tilde{\mathcal{R}}(T_k^{(n)}) = \tilde{T}_k(\mathcal{S}_n) - T_k(\mathcal{S}_n) = (\mathcal{S}_{n+1} - \mathcal{S}_n) + \sum_{j=0}^{k-1} \alpha_j^{(k)} \left(\mathcal{G}_j^{(n+1)} - \mathcal{G}_j^{(n)}\right) = \Delta \mathcal{S}_n + \sum_{j=0}^{k-1} \alpha_j^{(k)} \Delta \mathcal{G}_j^{(n)}.$$
Then we get
$$\tilde{\mathcal{R}}(T_k^{(n)}) = \Delta \mathcal{S}_n + \Delta \mathcal{H}_k^{(n)}\, \tilde{\times}_{(N+M+1)}\, \alpha^{(k)}, \tag{6}$$
where $\Delta \mathcal{S}_n = \mathcal{S}_{n+1} - \mathcal{S}_n$ is the first forward difference and $\Delta \mathcal{H}_k^{(n)} = [\Delta \mathcal{G}_0^{(n)}, \ldots, \Delta \mathcal{G}_{k-1}^{(n)}] \in \mathbb{R}^{I_1 \times \cdots \times I_N \times K_1 \times \cdots \times K_M \times k}$. The vector $\alpha^{(k)}$ is obtained from the orthogonality relation
$$\tilde{\mathcal{R}}(T_k^{(n)}) \in \left(\mathrm{span}\{\mathcal{Y}_0^{(n)}, \ldots, \mathcal{Y}_{k-1}^{(n)}\}\right)^{\perp}, \tag{7}$$
where $\mathcal{Y}_0^{(n)}, \ldots, \mathcal{Y}_{k-1}^{(n)} \in \mathbb{R}^{I_1 \times \cdots \times I_N \times K_1 \times \cdots \times K_M}$ are given tensors. Here, $\mathrm{span}\{\mathcal{Y}_0^{(n)}, \ldots, \mathcal{Y}_{k-1}^{(n)}\}$ is the tensor subspace spanned by the tensors $\mathcal{Y}_0^{(n)}, \ldots, \mathcal{Y}_{k-1}^{(n)}$.
If $\tilde{\mathcal{H}}_{k,n}$ and $\tilde{\mathcal{L}}_{k,n}$ denote the tensor subspaces $\tilde{\mathcal{H}}_{k,n} = \mathrm{span}\{\Delta \mathcal{G}_0^{(n)}, \ldots, \Delta \mathcal{G}_{k-1}^{(n)}\}$ and $\tilde{\mathcal{L}}_{k,n} = \mathrm{span}\{\mathcal{Y}_0^{(n)}, \ldots, \mathcal{Y}_{k-1}^{(n)}\}$, then from (6) and (7) the generalized residual satisfies the following relations:
$$\tilde{\mathcal{R}}(T_k^{(n)}) - \Delta \mathcal{S}_n \in \tilde{\mathcal{H}}_{k,n} \tag{8}$$
and
$$\tilde{\mathcal{R}}(T_k^{(n)}) \in (\tilde{\mathcal{L}}_{k,n})^{\perp}. \tag{9}$$
Let $\mathcal{L}_{k,n} = [\mathcal{Y}_0^{(n)}, \ldots, \mathcal{Y}_{k-1}^{(n)}] \in \mathbb{R}^{I_1 \times \cdots \times I_N \times K_1 \times \cdots \times K_M \times k}$; then the relations (8) and (9) can be expressed as follows:
$$\tilde{\mathcal{R}}(T_k^{(n)}) - \Delta \mathcal{S}_n = \Delta \mathcal{H}_k^{(n)}\, \tilde{\times}_{(N+M+1)}\, \alpha^{(k)} \tag{10}$$
and
$$\mathcal{L}_{k,n} \Diamond_{(N+M+1)} \tilde{\mathcal{R}}(T_k^{(n)}) = 0. \tag{11}$$
Assuming that the matrix $\mathcal{L}_{k,n} \Diamond_{(N+M+1)} \Delta \mathcal{H}_k^{(n)}$ is nonsingular, the vector $\alpha^{(k)}$ appearing in the expression (6) of the generalized residual $\tilde{\mathcal{R}}(T_k^{(n)})$ is given by
$$\alpha^{(k)} = -\left(\mathcal{L}_{k,n} \Diamond_{(N+M+1)} \Delta \mathcal{H}_k^{(n)}\right)^{-1} \left(\mathcal{L}_{k,n} \Diamond_{(N+M+1)} \Delta \mathcal{S}_n\right). \tag{12}$$
Therefore, the approximation $T_k^{(n)}$ is computed as
$$T_k^{(n)} = \mathcal{S}_n + \mathcal{H}_k^{(n)}\, \tilde{\times}_{(N+M+1)}\, \alpha^{(k)}. \tag{13}$$
For the tensor global polynomial extrapolation methods, namely Tensor Global MPE (TG-MPE), Tensor Global RRE (TG-RRE) and Tensor Global MMPE (TG-MMPE), the auxiliary sequences are given by
$$\mathcal{G}_\ell^{(n)} = \Delta \mathcal{S}_{n+\ell}, \quad \ell = 0, \ldots, k-1; \quad n \ge 0.$$
Let $\Delta^i \mathcal{V}_k^{(n)}$ be the tensor defined by
$$\Delta^i \mathcal{V}_k^{(n)} = [\Delta^i \mathcal{S}_n, \ldots, \Delta^i \mathcal{S}_{n+k-1}] \in \mathbb{R}^{I_1 \times \cdots \times I_N \times K_1 \times \cdots \times K_M \times k}, \quad i = 1, 2,$$
where $\Delta^2$ is the second forward difference: $\Delta^2 \mathcal{S}_n = \Delta \mathcal{S}_{n+1} - \Delta \mathcal{S}_n$.
In this case, using the relation (12) and the fact that $\mathcal{H}_k^{(n)} = \Delta \mathcal{V}_k^{(n)}$ and $\Delta \mathcal{H}_k^{(n)} = \Delta^2 \mathcal{V}_k^{(n)}$, the approximation $T_k^{(n)}$ given in (13) can be expressed as
$$T_k^{(n)} = \mathcal{S}_n - \Delta \mathcal{V}_k^{(n)}\, \tilde{\times}_{(N+M+1)}\, \left[\left(\mathcal{L}_{k,n} \Diamond_{(N+M+1)} \Delta^2 \mathcal{V}_k^{(n)}\right)^{-1} \left(\mathcal{L}_{k,n} \Diamond_{(N+M+1)} \Delta \mathcal{S}_n\right)\right]. \tag{16}$$
It is clear that $T_k^{(n)}$ exists and is unique if and only if the square matrix $\mathcal{L}_{k,n} \Diamond_{(N+M+1)} \Delta^2 \mathcal{V}_k^{(n)}$ is nonsingular. The generalized residual given in the relation (6) can then be expressed as
$$\tilde{\mathcal{R}}(T_k^{(n)}) = \Delta \mathcal{S}_n - \Delta^2 \mathcal{V}_k^{(n)}\, \tilde{\times}_{(N+M+1)}\, \left[\left(\mathcal{L}_{k,n} \Diamond_{(N+M+1)} \Delta^2 \mathcal{V}_k^{(n)}\right)^{-1} \left(\mathcal{L}_{k,n} \Diamond_{(N+M+1)} \Delta \mathcal{S}_n\right)\right].$$
The choice of the tensors $\mathcal{Y}_0^{(n)}, \ldots, \mathcal{Y}_{k-1}^{(n)} \in \mathbb{R}^{I_1 \times \cdots \times I_N \times K_1 \times \cdots \times K_M}$ required in the orthogonality relation (7) determines the global-polynomial tensor extrapolation method. For the three well-known polynomial-based extrapolation methods, we have the following choices:
$$\mathcal{Y}_\ell^{(n)} = \Delta \mathcal{S}_{n+\ell} \ \text{ for TG-MPE}, \qquad \mathcal{Y}_\ell^{(n)} = \Delta^2 \mathcal{S}_{n+\ell} \ \text{ for TG-RRE}, \qquad \mathcal{Y}_\ell^{(n)} = \mathcal{Y}_\ell \ \text{ for TG-MMPE},$$
where, for TG-MMPE, $\mathcal{Y}_0, \ldots, \mathcal{Y}_{k-1}$ are arbitrary chosen tensors.
Next, we propose an efficient implementation of these methods. For this purpose, we first express the approximation $T_k^{(n)}$ given in relation (2) in a different way. Using the relation (2), the fact that $\mathcal{G}_\ell^{(n)} = \Delta \mathcal{S}_{n+\ell}$, $\ell = 0, \ldots, k-1$, $n \ge 0$, and the orthogonality relation $\tilde{\mathcal{R}}(T_k^{(n)}) \in (\mathrm{span}\{\mathcal{Y}_0^{(n)}, \ldots, \mathcal{Y}_{k-1}^{(n)}\})^{\perp}$, the TG-RRE, TG-MPE and TG-MMPE extrapolation methods produce approximations $T_k^{(n)}$ of the form
$$T_k^{(n)} = \sum_{j=0}^{k} \gamma_j^{(k)} \mathcal{S}_{n+j},$$
where
$$\sum_{j=0}^{k} \gamma_j^{(k)} = 1 \quad \text{and} \quad \sum_{j=0}^{k} \eta_{\ell,j}\, \gamma_j^{(k)} = 0, \quad 0 \le \ell < k, \tag{17}$$
with $\eta_{\ell,j} = \langle \mathcal{Y}_\ell^{(n)}, \Delta \mathcal{S}_{n+j} \rangle$.
The system of linear Equation (17) can be written as
$$\begin{cases} \gamma_0^{(k)} + \gamma_1^{(k)} + \cdots + \gamma_k^{(k)} = 1, \\ \gamma_0^{(k)} \langle \mathcal{Y}_0^{(n)}, \Delta \mathcal{S}_n \rangle + \gamma_1^{(k)} \langle \mathcal{Y}_0^{(n)}, \Delta \mathcal{S}_{n+1} \rangle + \cdots + \gamma_k^{(k)} \langle \mathcal{Y}_0^{(n)}, \Delta \mathcal{S}_{n+k} \rangle = 0, \\ \gamma_0^{(k)} \langle \mathcal{Y}_1^{(n)}, \Delta \mathcal{S}_n \rangle + \gamma_1^{(k)} \langle \mathcal{Y}_1^{(n)}, \Delta \mathcal{S}_{n+1} \rangle + \cdots + \gamma_k^{(k)} \langle \mathcal{Y}_1^{(n)}, \Delta \mathcal{S}_{n+k} \rangle = 0, \\ \quad \vdots \\ \gamma_0^{(k)} \langle \mathcal{Y}_{k-1}^{(n)}, \Delta \mathcal{S}_n \rangle + \gamma_1^{(k)} \langle \mathcal{Y}_{k-1}^{(n)}, \Delta \mathcal{S}_{n+1} \rangle + \cdots + \gamma_k^{(k)} \langle \mathcal{Y}_{k-1}^{(n)}, \Delta \mathcal{S}_{n+k} \rangle = 0. \end{cases} \tag{18}$$
Let $\beta_\ell^{(k)} = \gamma_\ell^{(k)} / \gamma_k^{(k)}$ for $0 \le \ell \le k$; then
$$\gamma_\ell^{(k)} = \frac{\beta_\ell^{(k)}}{\sum_{i=0}^{k} \beta_i^{(k)}} \quad \text{for } 0 \le \ell < k, \quad \text{and} \quad \beta_k^{(k)} = 1.$$
With these notations, the linear system of Equation (18) becomes
$$\begin{cases} \beta_0^{(k)} \langle \mathcal{Y}_0^{(n)}, \Delta \mathcal{S}_n \rangle + \cdots + \beta_{k-1}^{(k)} \langle \mathcal{Y}_0^{(n)}, \Delta \mathcal{S}_{n+k-1} \rangle = -\langle \mathcal{Y}_0^{(n)}, \Delta \mathcal{S}_{n+k} \rangle, \\ \quad \vdots \\ \beta_0^{(k)} \langle \mathcal{Y}_{k-1}^{(n)}, \Delta \mathcal{S}_n \rangle + \cdots + \beta_{k-1}^{(k)} \langle \mathcal{Y}_{k-1}^{(n)}, \Delta \mathcal{S}_{n+k-1} \rangle = -\langle \mathcal{Y}_{k-1}^{(n)}, \Delta \mathcal{S}_{n+k} \rangle. \end{cases} \tag{19}$$
The above system of equations can also be expressed in the following compact form:
$$\left(\mathcal{L}_{k,n} \Diamond_{(N+M+1)} \Delta \mathcal{V}_k^{(n)}\right) \beta^{(k)} = -\left(\mathcal{L}_{k,n} \Diamond_{(N+M+1)} \Delta \mathcal{S}_{n+k}\right), \tag{20}$$
where $\beta^{(k)} = (\beta_0^{(k)}, \ldots, \beta_{k-1}^{(k)})^T$. Assume now that $\gamma_0^{(k)}, \gamma_1^{(k)}, \ldots, \gamma_k^{(k)}$ have been calculated, and introduce the new variables
$$\delta_0^{(k)} = 1 - \gamma_0^{(k)}, \qquad \delta_j^{(k)} = \delta_{j-1}^{(k)} - \gamma_j^{(k)}, \quad 1 \le j < k, \qquad \text{so that} \quad \delta_{k-1}^{(k)} = \gamma_k^{(k)}; \tag{22}$$
then the tensor approximation $T_k^{(n)}$ can be expressed as
$$T_k^{(n)} = \mathcal{S}_n + \sum_{j=0}^{k-1} \delta_j^{(k)} \Delta \mathcal{S}_{n+j} = \mathcal{S}_n + \Delta \mathcal{V}_k^{(n)}\, \tilde{\times}_{(N+M+1)}\, \delta^{(k)}, \tag{23}$$
where $\delta^{(k)} = (\delta_0^{(k)}, \ldots, \delta_{k-1}^{(k)})^T$.
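The whole of this subsection condenses into a few lines of code. The sketch below computes $T_k^{(0)}$ from a list of $k+2$ tensors $\mathcal{S}_0, \ldots, \mathcal{S}_{k+1}$ by solving (20) and applying (22) and (23); it is a dense, unoptimized illustration under our own naming, not the efficient implementation of Section 5:

```python
import numpy as np

def inner(A, B):
    return np.sum(A * B)  # <A,B> = tr(B^T *_N A) for real tensors

def tg_extrapolate(S, method="TG-RRE"):
    """T_k^(0) for TG-MPE (Y_l = Delta S_l) or TG-RRE (Y_l = Delta^2 S_l),
    given S = [S_0, ..., S_{k+1}]; see Equations (17)-(23)."""
    k = len(S) - 2
    dS = [S[j + 1] - S[j] for j in range(k + 1)]     # Delta S_j
    if method == "TG-MPE":
        Y = dS[:k]
    else:                                            # TG-RRE
        Y = [dS[j + 1] - dS[j] for j in range(k)]    # Delta^2 S_j
    L = np.array([[inner(Yi, dS[j]) for j in range(k)] for Yi in Y])
    rhs = -np.array([inner(Yi, dS[k]) for Yi in Y])
    beta = np.append(np.linalg.solve(L, rhs), 1.0)   # Eq. (20), with beta_k = 1
    gamma = beta / beta.sum()
    delta = 1.0 - np.cumsum(gamma[:-1])              # Eq. (22)
    T = S[0].astype(float)
    for j in range(k):
        T = T + delta[j] * dS[j]                     # Eq. (23)
    return T
```

For TG-MMPE, one would simply replace the list Y by the chosen tensors $\mathcal{Y}_0, \ldots, \mathcal{Y}_{k-1}$.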

3.2. The Tensor Global Topological ϵ-Transformation

In [6], Brezinski proposed a generalization of the scalar ϵ-algorithm to vector sequences, called the topological ϵ-algorithm (TEA). The matrix case was introduced by Jbilou and Sadok in [9]. In this section we define the tensor global topological ϵ-transformation (TG-TET).
Let $(\mathcal{S}_n)$ be a sequence of tensors in $\mathbb{R}^{I_1 \times \cdots \times I_N \times K_1 \times \cdots \times K_M}$ and consider approximations $E_k(\mathcal{S}_n) = E_k^{(n)}$ of the limit of the tensor sequence $(\mathcal{S}_n)_{n \in \mathbb{N}}$ such that
$$E_k^{(n)} = \mathcal{S}_n + \sum_{i=1}^{k} a_i^{(n)} \Delta \mathcal{S}_{n+i-1}, \quad n \ge 0, \tag{24}$$
where $a_i^{(n)} \in \mathbb{R}$ for $i = 1, \ldots, k$. We introduce the new tensor transformations $\tilde{E}_{k,j}^{(n)}$, $j = 1, \ldots, k$, defined by
$$\tilde{E}_{k,j}^{(n)} = \mathcal{S}_{n+j} + \sum_{i=1}^{k} a_i^{(n)} \Delta \mathcal{S}_{n+i+j-1}, \quad j = 1, \ldots, k.$$
We set $\tilde{E}_{k,0}^{(n)} = E_k^{(n)}$ and define the $j$-th tensor generalized residual as
$$\tilde{\mathcal{R}}_j(E_k^{(n)}) = \tilde{E}_{k,j}^{(n)} - \tilde{E}_{k,j-1}^{(n)}.$$
The coefficients $a_i^{(n)}$ involved in the expression (24) of $E_k^{(n)}$ are computed such that each $j$-th generalized residual is orthogonal to some chosen tensor $\mathcal{Y} \in \mathbb{R}^{I_1 \times \cdots \times I_N \times K_1 \times \cdots \times K_M}$, that is,
$$\langle \mathcal{Y}, \tilde{\mathcal{R}}_j(E_k^{(n)}) \rangle = 0, \quad j = 1, \ldots, k. \tag{27}$$
Let $D_{k,n}$ denote the following matrix:
$$D_{k,n} = \begin{pmatrix} \langle \mathcal{Y}, \Delta^2 \mathcal{S}_n \rangle & \cdots & \langle \mathcal{Y}, \Delta^2 \mathcal{S}_{n+k-1} \rangle \\ \langle \mathcal{Y}, \Delta^2 \mathcal{S}_{n+1} \rangle & \cdots & \langle \mathcal{Y}, \Delta^2 \mathcal{S}_{n+k} \rangle \\ \vdots & & \vdots \\ \langle \mathcal{Y}, \Delta^2 \mathcal{S}_{n+k-1} \rangle & \cdots & \langle \mathcal{Y}, \Delta^2 \mathcal{S}_{n+2k-2} \rangle \end{pmatrix}.$$
Then, from the orthogonality relation (27), $a^{(n)} = (a_1^{(n)}, \ldots, a_k^{(n)})^T \in \mathbb{R}^k$ is given by
$$a^{(n)} = -D_{k,n}^{-1}\, z^{(n)}, \tag{29}$$
where $z^{(n)} = (\langle \mathcal{Y}, \Delta \mathcal{S}_n \rangle, \ldots, \langle \mathcal{Y}, \Delta \mathcal{S}_{n+k-1} \rangle)^T$; here we assume that the matrix $D_{k,n}$ is nonsingular. Hence, the approximation $E_k^{(n)}$ exists, is unique, and can be expressed as
$$E_k^{(n)} = \mathcal{S}_n + \mathcal{W}_k^{(n)}\, \tilde{\times}_{(N+M+1)}\, a^{(n)},$$
where $\mathcal{W}_k^{(n)} = [\Delta \mathcal{S}_n, \ldots, \Delta \mathcal{S}_{n+k-1}]$. If the matrix $D_{k,n}$ is singular, the approximation $E_k^{(n)}$ is not defined and we have a breakdown of the method. One possibility to overcome this drawback is, instead of computing $a^{(n)}$ by (29), to solve the least squares problem
$$\min_{a^{(n)} \in \mathbb{R}^k} \left\| D_{k,n}\, a^{(n)} + z^{(n)} \right\|. \tag{30}$$
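A corresponding sketch of the TG-TET approximation $E_k^{(0)}$, using the least squares formulation (30) throughout so that the breakdown case is handled uniformly (again under our own naming):

```python
import numpy as np

def inner(A, B):
    return np.sum(A * B)

def tg_tet(S, Y, k):
    """E_k^(0) from Equations (24)-(30); S must contain S_0, ..., S_{2k}."""
    dS = [S[j + 1] - S[j] for j in range(len(S) - 1)]
    d2S = [dS[j + 1] - dS[j] for j in range(len(dS) - 1)]
    D = np.array([[inner(Y, d2S[i + j]) for j in range(k)] for i in range(k)])
    z = np.array([inner(Y, dS[j]) for j in range(k)])
    a = np.linalg.lstsq(D, -z, rcond=None)[0]  # min || D a + z ||, Eq. (30)
    E = S[0].astype(float)
    for i in range(k):
        E = E + a[i] * dS[i]                   # Eq. (24)
    return E
```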
In the next sections, we will see how these tensor global extrapolation methods can be applied to solve linear and nonlinear tensor equations.

4. Application to Tensor Linear/Nonlinear Systems of Equations

4.1. Application to Tensor Linear Systems

Consider the following tensor linear system of equations:
$$\mathcal{A} *_N \mathcal{X} = \mathcal{B}, \tag{31}$$
where $\mathcal{A} \in \mathbb{R}^{I_1 \times \cdots \times I_N \times I_1 \times \cdots \times I_N}$, $\mathcal{B} \in \mathbb{R}^{I_1 \times \cdots \times I_N \times J_1 \times \cdots \times J_M}$ and $\mathcal{X} \in \mathbb{R}^{I_1 \times \cdots \times I_N \times J_1 \times \cdots \times J_M}$ is the unknown tensor to be determined. We assume here that the square tensor $\mathcal{A}$ is nonsingular, so that (31) has the unique solution $\mathcal{X} = \mathcal{A}^{-1} *_N \mathcal{B}$. The purpose here is to apply the tensor extrapolation methods to the problem (31) to compute approximations of its solution $\mathcal{X}$. Starting from an initial tensor guess $\mathcal{S}_0$, we construct the linear sequence $(\mathcal{S}_n)_n$ as follows:
$$\mathcal{S}_{n+1} = (\mathcal{I} - \mathcal{A}) *_N \mathcal{S}_n + \mathcal{B}. \tag{32}$$
Notice that if the sequence $(\mathcal{S}_n)_n$ converges, then its limit $\mathcal{S} = \mathcal{X}$ is the solution of the system (31).
In [18,19], tensor Krylov subspace methods via the Einstein product were introduced, including the tensor GMRES and some tensor-based Lanczos methods. The next theorem shows that, when applied to the linearly generated sequence (32), the proposed tensor extrapolation methods are tensor Krylov subspace methods and are mathematically equivalent to some well-known Krylov-based methods such as the tensor GMRES.
Theorem 1.
When applied to the sequence generated by (32), TG-RRE and TG-MPE are tensor Krylov subspace methods and are mathematically equivalent to the tensor global GMRES and the tensor global Arnoldi methods, respectively.
Proof. 
Notice first that for the tensor linear sequence (32) the generalized residual coincides with the true residual. Indeed, from (32) we have $\Delta \mathcal{S}_n = \mathcal{B} - \mathcal{A} *_N \mathcal{S}_n = \mathcal{R}(\mathcal{S}_n)$, the residual of the tensor $\mathcal{S}_n$. Since $\Delta^2 \mathcal{S}_n = -\mathcal{A} *_N \Delta \mathcal{S}_n$, we have $\Delta^2 \mathcal{V}_k^{(n)} = [\Delta^2 \mathcal{S}_n, \ldots, \Delta^2 \mathcal{S}_{n+k-1}] = -\mathcal{A} *_N \Delta \mathcal{V}_k^{(n)}$, where $\Delta \mathcal{V}_k^{(n)} = [\Delta \mathcal{S}_n, \ldots, \Delta \mathcal{S}_{n+k-1}]$. Consequently, using (6) and (32), the generalized residual of the approximation $T_k^{(n)}$ is the true residual:
$$\tilde{\mathcal{R}}(T_k^{(n)}) = \mathcal{R}(T_k^{(n)}) = \mathcal{B} - \mathcal{A} *_N T_k^{(n)}.$$
For simplicity, and unless specified otherwise, we set $n = 0$, $T_k^{(0)} = T_k$, $\mathcal{S}_0 = \mathcal{X}_0$ (the initial guess) and drop the index $n$ in all our notations. When applied to the sequence generated by the linear relation (32), the TG-RRE, TG-MMPE and TG-MPE methods above produce approximations $\mathcal{X}_k = T_k$ whose residual $\mathcal{R}_k = \mathcal{B} - \mathcal{A} *_N T_k$ satisfies the relations
$$\mathcal{R}_k - \Delta \mathcal{S}_0 \in \tilde{\mathcal{H}}_k = \mathcal{A} *_N \tilde{\mathcal{V}}_k \qquad \text{and} \qquad \mathcal{R}_k \in (\tilde{\mathcal{L}}_k)^{\perp},$$
where $\tilde{\mathcal{V}}_k = \mathrm{span}\{\Delta \mathcal{S}_0, \ldots, \Delta \mathcal{S}_{k-1}\}$, and where $\tilde{\mathcal{L}}_k \equiv \tilde{\mathcal{H}}_k$ for TG-RRE, $\tilde{\mathcal{L}}_k \equiv \tilde{\mathcal{V}}_k$ for TG-MPE and $\tilde{\mathcal{L}}_k \equiv \tilde{\mathcal{Y}}_k = \mathrm{span}\{\mathcal{Y}_0, \ldots, \mathcal{Y}_{k-1}\}$ for TG-MMPE, with $\mathcal{Y}_0, \ldots, \mathcal{Y}_{k-1}$ some chosen tensors.
Notice that, since $\tilde{\mathcal{V}}_k = \mathcal{K}_k(\mathcal{A}, \mathcal{R}_0)$ (the tensor Krylov subspace defined in [18,19]), the extrapolation methods above are tensor global Krylov subspace methods. TG-RRE is an orthogonal projection method and is theoretically equivalent to the tensor global GMRES, while TG-MPE is an oblique projection method and is equivalent to the tensor global Arnoldi method. □
As TG-RRE is an orthogonal projection method, we also have the classical minimization property for the residual:
$$\|\mathcal{R}_{\mathrm{TG-RRE}}\| = \min_{\mathcal{X} \in \mathcal{X}_0 + \mathcal{K}_k(\mathcal{A}, \mathcal{R}_0)} \|\mathcal{B} - \mathcal{A} *_N \mathcal{X}\|.$$
When the linear process (32) is convergent, it is more useful in practice to apply the tensor extrapolation methods after some fixed number of basic iterations. To save memory, we can also use the algorithm in a cycling mode, which means that the iterations are restarted after a fixed number $m$ of iterations. The procedure is summarized in Algorithm 1, with a short sketch given after it.
Algorithm 1 TG-RRE, TG-MPE and TG-MMPE algorithms
Step 1. Set $k = 0$; choose $\mathcal{X}_0$ and the numbers $p$ and $m$.
Step 2. Basic iteration:
    Set $T_0 = \mathcal{X}_0$ and $\mathcal{Z}_0 = T_0$;
    $\mathcal{Z}_{j+1} = (\mathcal{I} - \mathcal{A}) *_N \mathcal{Z}_j + \mathcal{B}$, $j = 0, \ldots, p-1$.
Step 3. Extrapolation scheme:
    $\mathcal{S}_0 = \mathcal{Z}_p$;
    $\mathcal{S}_{n+1} = (\mathcal{I} - \mathcal{A}) *_N \mathcal{S}_n + \mathcal{B}$, $n = 0, \ldots, m$;
    Compute the approximation $T_m^{(0)}$ by TG-RRE, TG-MPE or TG-MMPE.
Set $\mathcal{X}_0 = T_m^{(0)}$, $k = k + 1$ and go to Step 2.
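A hedged Python sketch of Algorithm 1 in cycling mode with TG-RRE; it reuses tg_extrapolate from the sketch in Section 3.1, and all names are ours:

```python
import numpy as np

def tg_rre_linear(A, B, X0, N, m, p=1, tol=1e-6, max_cycles=50):
    """Algorithm 1: restarted TG-RRE for A *_N X = B, with the Einstein
    product computed as a contraction over the last/first N modes."""
    X = X0.astype(float)
    nrm_R0 = np.linalg.norm(B - np.tensordot(A, X0, axes=N))
    for _ in range(max_cycles):
        for _ in range(p):                          # Step 2: basic iterations
            X = X - np.tensordot(A, X, axes=N) + B  # X <- (I - A) *_N X + B
        S = [X]                                     # Step 3: extrapolation scheme
        for _ in range(m + 1):
            S.append(S[-1] - np.tensordot(A, S[-1], axes=N) + B)
        X = tg_extrapolate(S, method="TG-RRE")      # T_m^(0)
        R = B - np.tensordot(A, X, axes=N)
        if np.linalg.norm(R) / nrm_R0 < tol:
            break
    return X
```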

4.2. Application to Nonlinear Tensor Systems

Consider the nonlinear system of tensor equations
$$\mathcal{X} = G(\mathcal{X}), \tag{33}$$
where $G$ is an operator from $\mathbb{R}^{I_1 \times \cdots \times I_N \times J_1 \times \cdots \times J_M}$ onto $\mathbb{R}^{I_1 \times \cdots \times I_N \times J_1 \times \cdots \times J_M}$ and $\mathcal{X} \in \mathbb{R}^{I_1 \times \cdots \times I_N \times J_1 \times \cdots \times J_M}$ is a solution of the system of Equation (33). For any arbitrary tensor $\mathcal{X}$, the residual is given by
$$\mathcal{R}(\mathcal{X}) = G(\mathcal{X}) - \mathcal{X}.$$
Let $(\mathcal{S}_n)_n$ be the sequence of tensors generated from an initial guess $\mathcal{S}_0$ as follows:
$$\mathcal{S}_{n+1} = G(\mathcal{S}_n), \quad n = 0, 1, \ldots. \tag{34}$$
To compute approximations of a solution of (33), we apply the tensor extrapolation methods above to the sequence $(\mathcal{S}_n)$. As in the linear case, the different steps are summarized in Algorithm 2, with a short sketch given after it.
Algorithm 2 TG-RRE, TG-MPE and TG-MMPE for nonlinear systems
Step 1. Set $k = 0$; choose $\mathcal{X}_0$ and the numbers $p$ and $m$.
Step 2. Basic iteration:
    Set $T_0 = \mathcal{X}_0$ and $\mathcal{W}_0 = T_0$;
    $\mathcal{W}_{j+1} = G(\mathcal{W}_j)$, $j = 0, \ldots, p-1$.
Step 3. Extrapolation scheme:
    $\mathcal{S}_0 = \mathcal{W}_p$;
    If $\|\mathcal{S}_1 - \mathcal{S}_0\|_F < \epsilon$, stop;
    otherwise generate $\mathcal{S}_{n+1} = G(\mathcal{S}_n)$, $n = 0, \ldots, m$;
    Compute the approximation $T_m^{(0)}$ by TG-RRE, TG-MPE or TG-MMPE.
Set $\mathcal{X}_0 = T_m^{(0)}$, $k = k + 1$ and go to Step 2.
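The nonlinear variant differs only in the basic iteration, as in this short sketch (again reusing tg_extrapolate from the sketch in Section 3.1):

```python
import numpy as np

def tg_rre_nonlinear(G, X0, m, p=1, tol=1e-6, max_cycles=50):
    """Algorithm 2: restarted TG-RRE for the fixed-point problem X = G(X)."""
    X = X0.astype(float)
    for _ in range(max_cycles):
        for _ in range(p):                   # Step 2: W_{j+1} = G(W_j)
            X = G(X)
        if np.linalg.norm(G(X) - X) < tol:   # stopping test ||S_1 - S_0||_F
            break
        S = [X]                              # Step 3: generate S_0, ..., S_{m+1}
        for _ in range(m + 1):
            S.append(G(S[-1]))
        X = tg_extrapolate(S, method="TG-RRE")
    return X
```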
Remark 2.
As we stated earlier, when applied to linearly generated tensor sequences, the proposed tensor extrapolation methods produce the same iterates as some well-known tensor Krylov subspace methods, but differ in the way the approximations are computed. When the process generating the sequence $(\mathcal{S}_n)_n$ is not known and only the terms of the sequence are available, Krylov-based or Newton-type methods cannot be used; in that case, extrapolation methods are welcome. Such problems arise, for example, in statistics; see [16] for the application of the vector ϵ-algorithm of Wynn [23] within the expectation–maximization (EM) algorithm to find maximum likelihood estimates from incomplete or missing data.

5. The Global-QR Implementation of TG-MPE/TG-RRE

The purpose of this section is to give an efficient implementation of TG-MPE and TG-RRE using a generalization of the QR-based technique given in [11] for the vector MPE and RRE methods. We first introduce the tensor global-QR decomposition. Let $\mathcal{U} \in \mathbb{R}^{I_1 \times \cdots \times I_M \times J_1 \times \cdots \times J_N \times k}$ be an $(M+N+1)$-mode tensor with column tensors $\mathcal{U}_0, \ldots, \mathcal{U}_{k-1} \in \mathbb{R}^{I_1 \times \cdots \times I_M \times J_1 \times \cdots \times J_N}$. Then there exist an $(M+N+1)$-mode orthogonal tensor $\mathcal{Q} = [\mathcal{Q}_0, \ldots, \mathcal{Q}_{k-1}] \in \mathbb{R}^{I_1 \times \cdots \times I_M \times J_1 \times \cdots \times J_N \times k}$ satisfying $\mathcal{Q} \Diamond_{(M+N+1)} \mathcal{Q} = I_{k \times k}$ and an upper triangular matrix $R \in \mathbb{R}^{k \times k}$ such that
$$\mathcal{U} = \mathcal{Q} \times_{(M+N+1)} R^T. \tag{35}$$
The tensor decomposition (35) will be called the Tensor Global-QR (TG-QR) decomposition of $\mathcal{U}$; it is computed by the following Algorithm 3.
Algorithm 3 The global-QR decomposition
  • Given $\mathcal{U}_0$, compute the scalar $r_{0,0} = \langle \mathcal{U}_0, \mathcal{U}_0 \rangle^{1/2}$ and the tensor $\mathcal{Q}_0 = \mathcal{U}_0 / r_{0,0}$.
  • For $j = 1, \ldots, k-1$:
    (a) $\mathcal{W} = \mathcal{U}_j$
    (b) For $i = 0, \ldots, j-1$ do
        $r_{i,j} = \langle \mathcal{Q}_i, \mathcal{W} \rangle$; $\mathcal{W} = \mathcal{W} - r_{i,j}\, \mathcal{Q}_i$
        End for
    (c) $r_{j,j} = \langle \mathcal{W}, \mathcal{W} \rangle^{1/2}$
    (d) $\mathcal{Q}_j = \mathcal{W} / r_{j,j}$
  • End for
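In code, Algorithm 3 is a modified Gram-Schmidt process with respect to the inner product $\langle \cdot, \cdot \rangle$; a minimal sketch (the list-of-tensors representation and the names are ours):

```python
import numpy as np

def inner(A, B):
    return np.sum(A * B)

def tensor_global_qr(U):
    """Global-QR of U = [U_0, ..., U_{k-1}], a list of same-size tensors.
    Returns the orthonormal column tensors Q and the upper triangular
    matrix R such that U_j = sum_i R[i, j] * Q_i."""
    k = len(U)
    R = np.zeros((k, k))
    Q = []
    for j in range(k):
        W = U[j].astype(float)
        for i in range(j):
            R[i, j] = inner(Q[i], W)
            W = W - R[i, j] * Q[i]
        R[j, j] = np.sqrt(inner(W, W))
        Q.append(W / R[j, j])
    return Q, R
```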
To show that the tensor $\mathcal{Q}$ is orthogonal, one proceeds by induction on $k$. The entries of the matrix $R$ are the coefficients $r_{i,j}$ produced by Algorithm 3. Next, we give a proposition to be used below.
Proposition 2
([24]). Let $\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_N}$, $A \in \mathbb{R}^{J_n \times I_n}$ and $y \in \mathbb{R}^{J_n}$. Then we have
$$\mathcal{X} \times_{(n)} A\, \tilde{\times}_{(n)}\, y = \mathcal{X}\, \tilde{\times}_{(n)}\, (A^T y).$$
For simplicity, we set here $n = 0$; then the approximation $T_k^{(0)}$ (defined earlier in (23)) is given by
$$T_k^{(0)} = \mathcal{S}_0 + \sum_{j=0}^{k-1} \delta_j^{(k)} \Delta \mathcal{S}_j = \mathcal{S}_0 + \Delta \mathcal{V}_k^{(0)}\, \tilde{\times}_{(N+M+1)}\, \delta^{(k)}.$$
Substituting now $\Delta \mathcal{V}_k = \Delta \mathcal{V}_k^{(0)} = \mathcal{Q} \times_{(N+M+1)} R^T$ and using Proposition 2, we get
$$T_k^{(0)} = \mathcal{S}_0 + \Delta \mathcal{V}_k\, \tilde{\times}_{(N+M+1)}\, \delta^{(k)} = \mathcal{S}_0 + \mathcal{Q}\, \tilde{\times}_{(N+M+1)}\, (R\, \delta^{(k)}),$$
which gives
$$T_k^{(0)} = \mathcal{S}_0 + \sum_{j=0}^{k-1} \theta_j\, \mathcal{Q}_j,$$
where $\theta_j$ is the $(j+1)$-th component of the vector $R\, \delta^{(k)}$; the matrix $R$ and the tensors $\mathcal{Q}_j \in \mathbb{R}^{I_1 \times \cdots \times I_M \times J_1 \times \cdots \times J_N}$, $j = 0, \ldots, k-1$, are given by Algorithm 3. To compute $\delta^{(k)}$, we first have to compute $\beta^{(k)} = (\beta_0^{(k)}, \ldots, \beta_{k-1}^{(k)})^T$ and then use the relations given in (22).
For TG-MPE, the coefficients $\gamma_\ell^{(k)}$ of the vector $\gamma^{(k)}$ are determined by computing the vector $\beta^{(k)}$, the solution of the linear system of equations (20):
$$\left(\Delta \mathcal{V}_k \Diamond_{(N+M+1)} \Delta \mathcal{V}_k\right) \beta^{(k)} = -\left(\Delta \mathcal{V}_k \Diamond_{(N+M+1)} \Delta \mathcal{S}_k\right), \tag{38}$$
where
$$\Delta \mathcal{V}_k \Diamond_{(N+M+1)} \Delta \mathcal{V}_k = \begin{pmatrix} \langle \Delta \mathcal{S}_0, \Delta \mathcal{S}_0 \rangle & \cdots & \langle \Delta \mathcal{S}_0, \Delta \mathcal{S}_{k-1} \rangle \\ \langle \Delta \mathcal{S}_1, \Delta \mathcal{S}_0 \rangle & \cdots & \langle \Delta \mathcal{S}_1, \Delta \mathcal{S}_{k-1} \rangle \\ \vdots & & \vdots \\ \langle \Delta \mathcal{S}_{k-1}, \Delta \mathcal{S}_0 \rangle & \cdots & \langle \Delta \mathcal{S}_{k-1}, \Delta \mathcal{S}_{k-1} \rangle \end{pmatrix} \in \mathbb{R}^{k \times k}$$
and $\Delta \mathcal{V}_k \Diamond_{(N+M+1)} \Delta \mathcal{S}_k = (\langle \Delta \mathcal{S}_0, \Delta \mathcal{S}_k \rangle, \ldots, \langle \Delta \mathcal{S}_{k-1}, \Delta \mathcal{S}_k \rangle)^T \in \mathbb{R}^k$.
For TG-RRE, $\beta^{(k)}$ is determined by solving the linear system of equations
$$\left(\Delta^2 \mathcal{V}_k \Diamond_{(N+M+1)} \Delta \mathcal{V}_k\right) \beta^{(k)} = -\left(\Delta^2 \mathcal{V}_k \Diamond_{(N+M+1)} \Delta \mathcal{S}_k\right), \tag{39}$$
where
$$\Delta^2 \mathcal{V}_k \Diamond_{(N+M+1)} \Delta \mathcal{V}_k = \begin{pmatrix} \langle \Delta^2 \mathcal{S}_0, \Delta \mathcal{S}_0 \rangle & \cdots & \langle \Delta^2 \mathcal{S}_0, \Delta \mathcal{S}_{k-1} \rangle \\ \langle \Delta^2 \mathcal{S}_1, \Delta \mathcal{S}_0 \rangle & \cdots & \langle \Delta^2 \mathcal{S}_1, \Delta \mathcal{S}_{k-1} \rangle \\ \vdots & & \vdots \\ \langle \Delta^2 \mathcal{S}_{k-1}, \Delta \mathcal{S}_0 \rangle & \cdots & \langle \Delta^2 \mathcal{S}_{k-1}, \Delta \mathcal{S}_{k-1} \rangle \end{pmatrix} \in \mathbb{R}^{k \times k}$$
and $\Delta^2 \mathcal{V}_k \Diamond_{(N+M+1)} \Delta \mathcal{S}_k = (\langle \Delta^2 \mathcal{S}_0, \Delta \mathcal{S}_k \rangle, \ldots, \langle \Delta^2 \mathcal{S}_{k-1}, \Delta \mathcal{S}_k \rangle)^T \in \mathbb{R}^k$.
Finally, once $\beta^{(k)}$ is computed, the coefficients $\gamma_\ell^{(k)}$ are given by
$$\gamma_\ell^{(k)} = \frac{\beta_\ell^{(k)}}{\sum_{i=0}^{k} \beta_i^{(k)}} \quad \text{for } 0 \le \ell < k, \quad \text{with } \beta_k^{(k)} = 1. \tag{40}$$
Algorithm 4 summarizes the main steps.
Algorithm 4 Implementation of TG-MPE/TG-RRE via the global-QR decomposition
  • $\mathcal{S}_0$ is a given initial guess and $k$ is a fixed index.
  • Apply Algorithm 3 to compute the global-QR decomposition of the tensor $\Delta \mathcal{V}_k$.
  • Compute $\beta^{(k)}$ by solving the linear system (38) or (39).
  • Compute the coefficients $\gamma_j^{(k)}$ from (40).
  • Compute $\delta^{(k)}$ from the relations (22) and set $\theta_j$ to the $(j+1)$-th component of the vector $R\, \delta^{(k)}$.
  • Compute $T_k^{(0)} = \mathcal{S}_0 + \sum_{j=0}^{k-1} \theta_j\, \mathcal{Q}_j$.
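A sketch of Algorithm 4, reusing inner and tensor_global_qr from the sketch after Algorithm 3 (a hedged illustration of the steps above, not the authors' MATLAB code):

```python
import numpy as np

def tg_qr_extrapolate(S, method="TG-RRE"):
    """TG-MPE/TG-RRE via the global-QR decomposition of Delta V_k,
    for S = [S_0, ..., S_{k+1}]; see Equations (38)-(40) and (22)."""
    k = len(S) - 2
    dS = [S[j + 1] - S[j] for j in range(k + 1)]
    Q, R = tensor_global_qr(dS[:k])                 # global-QR of Delta V_k
    if method == "TG-MPE":
        L = dS[:k]
    else:
        L = [dS[j + 1] - dS[j] for j in range(k)]   # Delta^2 S_j for TG-RRE
    M = np.array([[inner(Li, dS[j]) for j in range(k)] for Li in L])
    rhs = -np.array([inner(Li, dS[k]) for Li in L])
    beta = np.append(np.linalg.solve(M, rhs), 1.0)  # system (38) or (39)
    gamma = beta / beta.sum()                       # Eq. (40)
    delta = 1.0 - np.cumsum(gamma[:-1])             # Eq. (22)
    theta = R @ delta                               # theta = R * delta^(k)
    T = S[0].astype(float)
    for j in range(k):
        T = T + theta[j] * Q[j]
    return T
```

Note that the Gram matrix in (38) equals $R^T R$ and could be formed directly from the QR factors; the direct computation above is kept for clarity.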
As a numerical test, we consider the linear system of tensor equations (31), where $\mathcal{A}$ is a random tensor with $N = 3$, $I_1 = I_2 = 20$, $I_3 = 10$, $J_1 = 20$, $J_2 = 10$ and $J_3 = 5$, as in [19]. The exact solution is the tensor $\mathcal{X}$ whose elements are all equal to one, and the right-hand side $\mathcal{B}$ is given by $\mathcal{B} = \mathcal{A} *_N \mathcal{X}$. The computations were carried out in MATLAB 7.4, with machine epsilon about $2 \cdot 10^{-16}$.
We took $m = 10$, $p = 1$ and stopped the iterations when the relative residual norm was less than $10^{-6}$. After $k = 6$ iterations (cycles), Algorithm 4 for the TG-RRE method gives the relative residual norm $\|\mathcal{R}_k\| / \|\mathcal{R}_0\| = 8.5 \times 10^{-7}$, with initial guess $\mathcal{X}_0 = \mathcal{O}$. We notice that the convergence of the original sequence $(\mathcal{S}_n)$ is very slow: at $n = 200$, the relative residual $\|\mathcal{R}(\mathcal{S}_n)\| / \|\mathcal{R}_0\|$ is still about $10^{-1}$. Generally, extrapolation methods are more effective than Krylov-based methods for nonlinear problems. More experimental studies and applications in areas such as the multilinear Google PageRank should be considered in the future.

6. Conclusions

In this paper, we introduced new tensor global extrapolation methods to accelerate the convergence of tensor sequences. The proposed methods are generalizations to the tensor case of some well-known vector extrapolation methods, such as the reduced rank extrapolation or the topological epsilon algorithm. The new methods were defined as orthogonal or oblique projection processes using the Einstein product and some new tensor products. We showed how to apply the derived algorithms to linear and nonlinear systems of tensor equations. Application to problems such as the multilinear PageRank is still under investigation.

Author Contributions

Formal analysis, A.E.I.; Methodology, R.S.; Writing—original draft, K.J. All authors have read and agreed to the published version of the manuscript.

Funding

For the first author, this project was financially supported by the Ministry of Europe and Foreign Affairs, the Ministry of Higher Education, Research and Innovation, and the French Institute of Rabat (PHC TOUBKAL 20XX (French-Morocco bilateral program), Grant Number: 12345AB).

Acknowledgments

We would like to thank the two referees for valuable remarks and helpful comments.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Brezinski, C.; Redivo Zaglia, M. Extrapolation Methods. Theory and Practice; North-Holland: Amsterdam, The Netherlands, 1991.
  2. Aitken, A.C. On Bernoulli's numerical solution of algebraic equations. Proc. R. Soc. Edinb. 1926, 46, 289–305.
  3. Cabay, S.; Jackson, L.W. A polynomial extrapolation method for finding limits and antilimits for vector sequences. SIAM J. Numer. Anal. 1976, 13, 734–752.
  4. Mesina, M. Convergence acceleration for the iterative solution of x = Ax + f. Comput. Meth. Appl. Mech. Eng. 1977, 10, 165–173.
  5. Eddy, R.P. Extrapolation to the limit of a vector sequence. In Information Linkage between Applied Mathematics and Industry; Wang, P.C.C., Ed.; Academic Press: New York, NY, USA, 1979; pp. 387–396.
  6. Brezinski, C. Généralisations de la transformation de Shanks, de la table de Padé et de l'ε-algorithme. Calcolo 1975, 12, 317–360.
  7. Pugatchev, B.P. Acceleration of the convergence of iterative processes and a method for solving systems of nonlinear equations. USSR Comput. Math. Math. Phys. 1978, 17, 199–207.
  8. Jbilou, K.; Sadok, H. Analysis of some vector extrapolation methods for linear systems. Numer. Math. 1995, 70, 73–89.
  9. Jbilou, K.; Sadok, H. Matrix polynomial and epsilon-type extrapolation methods with applications. Numer. Algorithms 2015, 68, 107–119.
  10. Jbilou, K.; Messaoudi, A.; Tabaa, K. Some Schur complement identities and applications to matrix extrapolation methods. Linear Algebra Appl. 2004, 392, 195–210.
  11. Sidi, A. Efficient implementation of minimal polynomial and reduced rank extrapolation methods. J. Comput. Appl. Math. 1991, 36, 305–337.
  12. Sidi, A.; Ford, W.F.; Smith, D.A. Acceleration of convergence of vector sequences. SIAM J. Numer. Anal. 1986, 23, 178–196.
  13. Jbilou, K.; Sadok, H. Vector extrapolation methods. Applications and numerical comparison. J. Comput. Appl. Math. 2000, 122, 149–165.
  14. Brezinski, C.; Redivo Zaglia, M.; Serra-Capizzano, S. Extrapolation methods for PageRank computations. Comptes Rendus Math. 2005, 340, 393–397.
  15. Kamvar, S.D.; Haveliwala, T.H.; Manning, C.D.; Golub, G.H. Extrapolation methods for accelerating PageRank computations. In Proceedings of the Twelfth International World Wide Web Conference (WWW 2003), Budapest, Hungary, 20–24 May 2003.
  16. Kuroda, M.; Sakakihara, M. Accelerating the convergence of the EM algorithm using the vector ϵ-algorithm. Comput. Stat. Data Anal. 2006, 51, 1549–1561.
  17. Duminil, S.; Sadok, H.; Silvester, D. Fast solvers for discretized Navier–Stokes problems using vector extrapolation. Numer. Algorithms 2014, 66, 89–104.
  18. El Guide, M.; El Ichi, A.; Jbilou, K.; Beik, F.P.A. Tensor GMRES and Golub–Kahan bidiagonalization methods via the Einstein product with applications to image and video processing. arXiv 2020, arXiv:2005.07458.
  19. Huang, B.; Xie, Y.; Ma, C. Krylov subspace methods to solve a class of tensor equations via the Einstein product. Numer. Linear Algebra Appl. 2019, 26, e2254.
  20. Kolda, T.G.; Bader, B.W. Tensor decompositions and applications. SIAM Rev. 2009, 51, 455–500.
  21. Liang, M.; Zheng, B. Further results on Moore–Penrose inverses of tensors with application to tensor nearness problems. Comput. Math. Appl. 2019, 77, 1282–1293.
  22. Bouyouli, R.; Jbilou, K.; Sadaka, R.; Sadok, H. Convergence properties of some block Krylov subspace methods for multiple linear systems. J. Comput. Appl. Math. 2006, 196, 498–511.
  23. Wynn, P. Acceleration techniques for iterated vector and matrix problems. Math. Comput. 1962, 16, 301–322.
  24. Beik, F.P.A.; Movahed, F.S.; Ahmadi-Asl, S. On the Krylov subspace methods based on tensor format for positive definite Sylvester tensor equations. Numer. Linear Algebra Appl. 2016, 23, 444–466.
Figure 1. (a) Frontal, (b) horizontal, and (c) lateral slices of a third-order tensor; (d) a mode-3 tube fiber.
