Article

A Multidimensional Principal Component Analysis via the C-Product Golub–Kahan–SVD for Classification and Face Recognition †

by Mustapha Hached 1,‡, Khalide Jbilou 2,‡, Christos Koukouvinos 3,‡ and Marilena Mitrouli 4,*,‡
1 University of Lille, CNRS, UMR 8524—Laboratoire Paul Painlevé, F-59000 Lille, France
2 Laboratoire LMPA, 50 rue F. Buisson, ULCO, 62228 Calais, France
3 Department of Mathematics, National Technical University of Athens, Zografou, 15773 Athens, Greece
4 Department of Mathematics, National and Kapodistrian University of Athens, Panepistimiopolis, 15784 Athens, Greece
* Author to whom correspondence should be addressed.
† This paper is dedicated to Mr Constantin M. Petridi.
‡ These authors contributed equally to this work.
Mathematics 2021, 9(11), 1249; https://doi.org/10.3390/math9111249
Submission received: 11 May 2021 / Revised: 18 May 2021 / Accepted: 25 May 2021 / Published: 29 May 2021
(This article belongs to the Special Issue Numerical Linear Algebra and the Applications)

Abstract:
Face recognition and identification are very important applications in machine learning. Due to the increasing amount of available data, traditional approaches based on matricization and matrix PCA methods can be difficult to implement. Moreover, tensorial approaches are a natural choice, due to the very structure of the databases, for example in the case of color images. Nevertheless, even though various authors have proposed factorization strategies for tensors, the size of the considered tensors can pose serious issues. Indeed, the most demanding part of the computational effort in recognition or identification problems resides in the training process. When only a few features are needed to construct the projection space, there is no need to compute an SVD of the whole data. Two versions of the tensor Golub–Kahan algorithm are considered in this manuscript as an alternative to the classical truncation strategies based on the tensor SVD. In this paper, we consider the Tensor Tubal Golub–Kahan Principal Component Analysis method, whose purpose is to extract the main features of images using the tensor singular value decomposition (SVD) based on the tensor cosine product, which uses the discrete cosine transform. This approach is applied to classification and face recognition, and numerical tests show its effectiveness.

1. Introduction

An important challenge of the last few years has been the extraction of the main information from the large datasets, measurements and observations that appear in signal and hyperspectral image processing, data mining and machine learning. Due to the increasing volume of data required by these applications, approximate low-rank matrix and tensor factorizations play a fundamental role in extracting latent components. The idea is to replace the initial, possibly noisy and ill-conditioned, large-scale data by a lower-dimensional approximate representation obtained via a matrix or multi-way array factorization or decomposition. Principal Component Analysis is a widely used technique for image recognition or identification. In the matrix case, it involves the computation of eigenvalue or singular value decompositions. In the tensor case, even though various factorization techniques have been developed over the last decades (high-order SVD (HOSVD), Candecomp–Parafac (CP) and Tucker decomposition), the recent tensor SVDs (t-SVD and c-SVD), based on the tensor t-product or c-product, offer a matrix-like framework for third-order tensors; see [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15] for more details on recent work related to tensors and applications. In the present work, we consider third-order tensors, which can be defined as three-dimensional arrays of data. As our study is based on the cosine transform product, we limit this work to third-order tensors.
For a given 3-mode tensor $\mathcal{X} \in \mathbb{R}^{n_1 \times n_2 \times n_3}$, we denote by $x_{i_1, i_2, i_3}$ the element $(i_1, i_2, i_3)$ of the tensor $\mathcal{X}$. A fiber is defined by fixing all the indexes except one. An element $\mathbf{c} \in \mathbb{R}^{1 \times 1 \times n}$ is called a tubal-scalar or simply a tube of length n. For more details, refer to [1,2].

2. Definitions and Notations

2.1. Discrete Cosine Transformation

In this subsection, we recall some definitions and properties of the discrete cosine transformation and of the c-product of tensors. In recent years, many advances have been made in order to establish a rigorous framework enabling the treatment of problems for which the data are stored in three-way tensors without having to resort to matricization [1,8]. One of the most important features of such a framework is the definition of a tensor-tensor product such as the t-product, based on the Fast Fourier Transform (FFT). For applications such as image processing, the tensor-tensor product based on the Discrete Cosine Transformation (DCT) has proven to be an interesting alternative to the FFT. We now give some basic facts on the DCT and its associated tensor-tensor product. The DCT of a vector $v \in \mathbb{R}^n$ is defined by
$$\tilde{v} = C_n v \in \mathbb{R}^n,$$
where $C_n$ is the $n \times n$ discrete cosine transform matrix with entries
$$(C_n)_{ij} = \sqrt{\frac{2 - \delta_{i1}}{n}}\, \cos\left(\frac{(i-1)(2j-1)\pi}{2n}\right), \qquad 1 \le i, j \le n,$$
where $\delta_{ij}$ is the Kronecker delta; see p. 150 in [16] for more details. It is known that the matrix $C_n$ is orthogonal, i.e., $C_n^T C_n = C_n C_n^T = I_n$; see [17]. Furthermore, for any vector $v \in \mathbb{R}^n$, the matrix-vector product $C_n v$ can be computed in $O(n \log(n))$ operations. Moreover, the authors of [17] have shown that a certain class of Toeplitz-plus-Hankel matrices can be diagonalized by $C_n$. More precisely, we have
$$C_n\, \mathrm{th}(v)\, C_n^{-1} = \mathrm{Diag}(\tilde{v}),$$
where
$$\mathrm{th}(v) = \underbrace{\begin{pmatrix} v_1 & v_2 & \cdots & v_n \\ v_2 & v_1 & \ddots & \vdots \\ \vdots & \ddots & \ddots & v_2 \\ v_n & \cdots & v_2 & v_1 \end{pmatrix}}_{\text{Toeplitz}} + \underbrace{\begin{pmatrix} v_2 & \cdots & v_n & 0 \\ \vdots & & & v_n \\ v_n & & & \vdots \\ 0 & v_n & \cdots & v_2 \end{pmatrix}}_{\text{Hankel}}$$
and Diag ( v ˜ ) is the diagonal matrix whose i-th diagonal element is ( v ˜ ) i .
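As a quick illustration, the following MATLAB sketch (assuming the Signal Processing Toolbox function dct is available) builds $C_n$ from the entries above and checks numerically that it is orthogonal and that $C_n v$ coincides with dct(v); this is only a verification snippet, not part of the method.

```matlab
% Build the n x n DCT matrix C_n from its entries and compare with dct().
n = 8;
C = zeros(n);
for i = 1:n
    for j = 1:n
        C(i,j) = sqrt((2 - (i==1))/n) * cos((i-1)*(2*j-1)*pi/(2*n));
    end
end
v = randn(n, 1);
orth_err = norm(C'*C - eye(n))   % close to machine precision: C_n is orthogonal
dct_err  = norm(C*v - dct(v))    % C_n * v matches MATLAB's dct(v)
```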

2.2. Definitions and Properties of the Cosine Product

In this subsection, we briefly review some concepts and notations which play a central role in the development of the tensor global iterative methods based on the c-product; see [18] for more details on the c-product.
Let $\mathcal{A} \in \mathbb{R}^{n_1 \times n_2 \times n_3}$ be a real-valued third-order tensor; then the operation mat and its inverse ten are defined by
$$\mathrm{mat}(\mathcal{A}) = \underbrace{\begin{pmatrix} A_1 & A_2 & \cdots & A_{n_3} \\ A_2 & A_1 & \ddots & \vdots \\ \vdots & \ddots & \ddots & A_2 \\ A_{n_3} & \cdots & A_2 & A_1 \end{pmatrix}}_{\text{Block Toeplitz}} + \underbrace{\begin{pmatrix} A_2 & \cdots & A_{n_3} & 0 \\ \vdots & & & A_{n_3} \\ A_{n_3} & & & \vdots \\ 0 & A_{n_3} & \cdots & A_2 \end{pmatrix}}_{\text{Block Hankel}} \in \mathbb{R}^{n_1 n_3 \times n_2 n_3}$$
and the inverse operation denoted by ten is simply defined by
ten ( mat ( A ) ) = A .
Let us denote by $\widetilde{\mathcal{A}}$ the tensor obtained by applying the DCT to all the tubes of the tensor $\mathcal{A}$. This operation and its inverse are implemented in MATLAB by the commands dct and idct as
A ˜ = dct ( A , [ ] , 3 ) , and idct ( A ˜ , [ ] , 3 ) = A ,
where idct denotes the Inverse Discrete Cosine Transform.
Remark 1.
Notice that the tensor A ˜ can be computed by using the 3-mode product defined in [2] as follows:
A ˜ = A × 3 M
where $M$ is the $n_3 \times n_3$ invertible matrix given by
$$M = W^{-1} C_{n_3} (I + Z),$$
where $C_{n_3}$ denotes the $n_3 \times n_3$ Discrete Cosine Transform (DCT) matrix, $W = \mathrm{diag}(C_{n_3}(:, 1))$ is the diagonal matrix made of the first column of the DCT matrix, $Z$ is the $n_3 \times n_3$ circulant upshift matrix, which can be computed in MATLAB using Z = diag(ones(n3-1,1),1), and $I$ is the $n_3 \times n_3$ identity matrix; see [18] for more details.
Let $A$ be the block diagonal matrix
$$A = \begin{pmatrix} A^{(1)} & & & \\ & A^{(2)} & & \\ & & \ddots & \\ & & & A^{(n_3)} \end{pmatrix} \in \mathbb{R}^{n_3 n_1 \times n_3 n_2},$$
where the matrices $A^{(i)}$ are the frontal slices of the tensor $\widetilde{\mathcal{A}}$. The block matrix $\mathrm{mat}(\mathcal{A})$ can also be block diagonalized by using the DCT matrix as follows:
$$(C_{n_3} \otimes I_{n_1})\, \mathrm{mat}(\mathcal{A})\, (C_{n_3}^T \otimes I_{n_2}) = A.$$
Definition 1.
The c-product of two tensors A R n 1 × n 2 × n 3 and B R n 2 × m × n 3 is the n 1 × m × n 3 tensor defined by:
A 🟉 c B = ten ( mat ( A ) mat ( B ) ) .
Notice that from Equation (3), we can show that the product C = A 🟉 c B is equivalent to C = A B . Algorithm 1 allows us to compute, in an efficient way, the c-product of the tensors A and B , see [18].
Algorithm 1 Computing the c-product.
Inputs: $\mathcal{A} \in \mathbb{R}^{n_1 \times n_2 \times n_3}$ and $\mathcal{B} \in \mathbb{R}^{n_2 \times m \times n_3}$
Output: $\mathcal{C} = \mathcal{A} \star_c \mathcal{B} \in \mathbb{R}^{n_1 \times m \times n_3}$
1.  Compute $\widetilde{\mathcal{A}} = \mathrm{dct}(\mathcal{A}, [\,], 3)$ and $\widetilde{\mathcal{B}} = \mathrm{dct}(\mathcal{B}, [\,], 3)$.
2.  Compute each frontal slice of $\widetilde{\mathcal{C}}$ by
$$\widetilde{C}^{(i)} = \widetilde{A}^{(i)}\, \widetilde{B}^{(i)}, \qquad i = 1, \ldots, n_3.$$
3.  Compute C = idct ( C ˜ , [ ] , 3 ) .
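For reference, a direct MATLAB transcription of Algorithm 1 could look as follows (a minimal sketch relying on the dct/idct commands mentioned above; the function name cprod is illustrative and dimension checks are omitted):

```matlab
function C = cprod(A, B)
% c-product of third-order tensors: A (n1 x n2 x n3), B (n2 x m x n3).
At = dct(A, [], 3);                     % transform the tubes of A
Bt = dct(B, [], 3);                     % transform the tubes of B
n3 = size(A, 3);
Ct = zeros(size(A,1), size(B,2), n3);
for i = 1:n3
    Ct(:,:,i) = At(:,:,i) * Bt(:,:,i);  % slice-wise matrix products
end
C = idct(Ct, [], 3);                    % back-transform along the third mode
end
```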
Next, we give some definitions and remarks on the c-product and related topics.
Definition 2.
The identity tensor I n 1 n 1 n 3 is the tensor such that each frontal slice of I ˜ n 1 n 1 n 3 is the identity matrix I n 1 n 1 .
An n 1 × n 1 × n 3 tensor A is said to be invertible if there exists a tensor B of order n 1 × n 1 × n 3 such that
A 🟉 c B = I n 1 n 1 n 3 and B 🟉 c A = I n 1 n 1 n 3 .
In that case, we denote B = A 1 . It is clear that A is invertible if and only if mat ( A ) is invertible.
The inner scalar product is defined by
$$\langle \mathcal{A}, \mathcal{B} \rangle = \sum_{i_1=1}^{n_1} \sum_{i_2=1}^{n_2} \sum_{i_3=1}^{n_3} a_{i_1 i_2 i_3}\, b_{i_1 i_2 i_3}$$
and its corresponding norm is given by $\|\mathcal{A}\|_F = \sqrt{\langle \mathcal{A}, \mathcal{A} \rangle}$.
An n 1 × n 1 × n 3 tensor Q is said to be orthogonal if Q T 🟉 c Q = Q 🟉 c Q T = I n 1 n 1 n 3 .
Definition 3
([1]). A tensor is called f-diagonal if its frontal slices are diagonal matrices. It is called upper triangular if all its frontal slices are upper triangular.
Next we recall the Tensor Singular Value Decomposition of a tensor (Algorithm 2); more details can be found in [19].
Theorem 1.
Let A be an n 1 × n 2 × n 3 real-valued tensor. Then A can be factored as follows
A = U 🟉 c S 🟉 c V T ,
where U and V are orthogonal tensors of order ( n 1 , n 1 , n 3 ) and ( n 2 , n 2 , n 3 ) , respectively, and S is an f-diagonal tensor of order ( n 1 × n 2 × n 3 ) . This factorization is called Tensor Singular Value Decomposition (c-SVD) of the tensor A .
Algorithm 2 The Tensor SVD (c-SVD).
Input:   A R n 1 × n 2 × n 3 Output: U , V and S .
1.  Compute A ˜ = dct ( A , [ ] , 3 ) .
2.  Compute the frontal slices of $\widetilde{\mathcal{U}}$, $\widetilde{\mathcal{V}}$ and $\widetilde{\mathcal{S}}$ from $\widetilde{\mathcal{A}}$ as follows:
     (a)    for i = 1 , , n 3
[ U ˜ ( i ) , S ˜ ( i ) , V ˜ ( i ) ] = s v d ( A ˜ ( i ) )
     (b)    End for
3.  Compute U = idct ( U ˜ , [ ] , 3 ) , S = idct ( S ˜ , [ ] , 3 ) and V = idct ( V ˜ , [ ] , 3 ) .
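Similarly, Algorithm 2 admits a short MATLAB transcription (again a sketch, with csvd as an illustrative name; a full SVD of every frontal slice is computed in the transformed domain):

```matlab
function [U, S, V] = csvd(A)
% Tensor SVD (c-SVD) of a third-order tensor A, so that A = U *c S *c V^T.
At = dct(A, [], 3);
[n1, n2, n3] = size(A);
Ut = zeros(n1, n1, n3);  St = zeros(n1, n2, n3);  Vt = zeros(n2, n2, n3);
for i = 1:n3
    [Ut(:,:,i), St(:,:,i), Vt(:,:,i)] = svd(At(:,:,i));  % SVD of each slice
end
U = idct(Ut, [], 3);  S = idct(St, [], 3);  V = idct(Vt, [], 3);
end
```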
Remark 2.
As for the t-product [19], we can show that if A = U 🟉 c S 🟉 c V T is a c-SVD of the tensor A , then we have
$$\sum_{k=1}^{n_3} A_k = \left( \sum_{k=1}^{n_3} U_k \right) \left( \sum_{k=1}^{n_3} S_k \right) \left( \sum_{k=1}^{n_3} V_k \right)^T,$$
where $A_k$, $U_k$, $S_k$ and $V_k$ are the frontal slices of the tensors $\mathcal{A}$, $\mathcal{U}$, $\mathcal{S}$ and $\mathcal{V}$, respectively, and
$$\mathcal{A} = \sum_{i=1}^{\min(n_1, n_2)} \mathcal{U}(:, i, :) \star_c \mathcal{S}(i, i, :) \star_c \mathcal{V}(:, i, :)^T.$$
Theorem 2.
Let $\mathcal{A} = \mathcal{U} \star_c \mathcal{S} \star_c \mathcal{V}^T$ be given by (5), and define, for $k \le \min(n_1, n_2)$, the tensor
$$\mathcal{A}_k = \sum_{i=1}^{k} \mathcal{U}(:, i, :) \star_c \mathcal{S}(i, i, :) \star_c \mathcal{V}(:, i, :)^T.$$
Then
$$\mathcal{A}_k = \arg\min_{\mathcal{X} \in \mathbb{M}} \|\mathcal{X} - \mathcal{A}\|_F,$$
where $\mathbb{M} = \{\mathcal{X} \star_c \mathcal{Y};\ \mathcal{X} \in \mathbb{R}^{n_1 \times k \times n_3},\ \mathcal{Y} \in \mathbb{R}^{k \times n_2 \times n_3}\}$.
Note that when n 3 = 1 this theorem reduces to the well known Eckart–Young theorem for matrices [20].
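Combining the two sketches above, the truncated approximation $\mathcal{A}_k$ of Theorem 2 can be assembled directly. In this hedged sketch the tensor transpose of a lateral slice is taken slice-wise (a permute of the first two modes), which we assume to be the natural transpose for the c-product; the function name trunc_csvd is illustrative:

```matlab
function Ak = trunc_csvd(A, k)
% Tubal rank-k approximation A_k = sum_{i=1}^k U(:,i,:) *c S(i,i,:) *c V(:,i,:)^T.
[U, S, V] = csvd(A);
Ak = zeros(size(A));
for i = 1:k
    Vi_t = permute(V(:,i,:), [2 1 3]);               % transposed lateral slice
    Ak = Ak + cprod(cprod(U(:,i,:), S(i,i,:)), Vi_t);
end
end
```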
Definition 4
(The tensor tubal-rank). Let $\mathcal{A}$ be an $n_1 \times n_2 \times n_3$ tensor and consider its c-SVD $\mathcal{A} = \mathcal{U} \star_c \mathcal{S} \star_c \mathcal{V}^T$. The tensor tubal rank of $\mathcal{A}$, denoted by $\mathrm{rank}_t(\mathcal{A})$, is defined as the number of non-zero tubes of the f-diagonal tensor $\mathcal{S}$, i.e.,
$$\mathrm{rank}_t(\mathcal{A}) = \#\{ i : \mathcal{S}(i, i, :) \neq 0 \}.$$
Definition 5.
The multi-rank of the tensor $\mathcal{A}$ is a vector $p \in \mathbb{R}^{n_3}$ whose $i$-th element is equal to the rank of the $i$-th frontal slice of $\widetilde{\mathcal{A}} = \mathrm{dct}(\mathcal{A}, [\,], 3)$, i.e.,
$$p(i) = \mathrm{rank}(\widetilde{A}^{(i)}), \qquad i = 1, \ldots, n_3.$$
The well known QR matrix decomposition can also be extended to the tensor case; see [19].
Theorem 3.
Let A be a real-valued tensor of order n 1 × n 2 × n 3 . Then A can be factored as follows
A = Q 🟉 c R ,
where $\mathcal{Q}$ is an $n_1 \times n_1 \times n_3$ orthogonal tensor and $\mathcal{R}$ is an $n_1 \times n_2 \times n_3$ f-upper triangular tensor.

3. Tensor Principal Component Analysis for Face Recognition

Principal Component Analysis (PCA) is a widely used technique in image classification and face recognition. Many approaches involve a conversion of color images to grayscale in order to reduce the training cost. Nevertheless, for some applications, color is an important feature, and tensor-based approaches offer the possibility of taking it into account. Moreover, especially in the case of facial recognition, they allow the treatment of enriched databases including, for instance, additional biometric information. However, one has to bear in mind that the computational cost is an important issue, as the volume of data can be very large. We first recall some background facts on the matrix-based approach.

3.1. The Matrix Case

One of the simplest and most effective PCA approaches used in face recognition systems is the so-called eigenface approach. This approach transforms faces into a small set of essential characteristics, the eigenfaces, which are the main components of the initial set of learning images (training set). Recognition is done by projecting a test image into the eigenface subspace, after which the person is classified by comparing their position in eigenface space with the positions of known individuals. The advantage of this approach over other face recognition strategies resides in its simplicity, speed and insensitivity to small or gradual changes on the face.
The process is defined as follows: Consider a set of training faces I 1 , I 2 , , I p . All the face images have the same size: n × m . Each face I i is transformed into a vector x i using the operation v e c : x i = v e c ( I i ) . These vectors are columns of the n m × p matrix
X = [ x 1 , , x p ] .
We compute the average image $\mu = \frac{1}{p} \sum_{i=1}^{p} x_i$. Set $\bar{x}_i = x_i - \mu$ and consider the new matrices
X ¯ = [ x ¯ 1 , , x ¯ p ] , and C = X ¯ X ¯ T .
Notice that the n m × n m covariance matrix C = X ¯ X ¯ T can be very large. Therefore, the computation of the n m eigenvalues and the corresponding eigenvectors (eigenfaces) can be very difficult. To circumvent this issue, we instead consider the smaller p × p matrix L = X ¯ T X ¯ .
Let v i be an eigenvector of L then L v i = X ¯ T X ¯ v i = λ i v i and
X ¯ L v i = X ¯ X ¯ T X ¯ v i = λ i X ¯ v i ,
which shows that X ¯ v i is an eigenvector of the covariance matrix C = X ¯ X ¯ T .
The p eigenvectors of L = X ¯ T X ¯ are then used to find the p eigenvectors u i = X ¯ v i of C that form the eigenface space. We keep only k eigenvectors corresponding to the largest k eigenvalues (eigenfaces corresponding to small eigenvalues can be omitted, as they explain only a small part of characteristic features of the faces.)
The next step consists of projecting each image of the training sample onto the eigenface space spanned by the orthogonal vectors u 1 , , u k :
U k = s p a n { u 1 , , u k } , with U k = [ u 1 , , u k ]
The matrix U k U k T is an orthogonal projector onto the subspace U k . A face image can be projected onto this face space as y i = U k T ( x i μ ) .
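To make the training stage concrete, here is a minimal MATLAB sketch of the eigenface computation described above; the names X, mu, Uk and the value of k are illustrative, and the small matrix L is used instead of the large covariance matrix C, as explained:

```matlab
% X: nm x p matrix whose columns are the vectorized training faces.
k    = 20;                              % number of eigenfaces kept (illustrative)
mu   = mean(X, 2);                      % average face
Xbar = X - mu;                          % centred training data
L    = Xbar' * Xbar;                    % small p x p matrix instead of nm x nm
[V, D] = eig(L);
[~, idx] = sort(diag(D), 'descend');    % order eigenpairs by decreasing eigenvalue
V  = V(:, idx(1:k));
Uk = Xbar * V;                          % eigenfaces u_i = Xbar * v_i
Uk = Uk ./ vecnorm(Uk);                 % normalize each eigenface
Y  = Uk' * Xbar;                        % projections y_i of the training faces
```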
We now give the steps of an image classification process based on this approach:
Let $x = \mathrm{vec}(I)$ be a test vector-image and project it onto the face space to get $y = U_k^T (x - \mu)$. Notice that the reconstructed image is given by
$$x_r = U_k\, y + \mu.$$
Compute the Euclidean distances
$$\epsilon_i = \| y - y_i \|, \qquad i = 1, \ldots, p.$$
A face is classified as belonging to the class $l$ when the minimum $\epsilon_l$ is below some chosen threshold $\theta$. Set
$$\theta = \frac{1}{2} \max_{i,j} \| y_i - y_j \|, \qquad i, j = 1, \ldots, p,$$
and let $\epsilon$ be the distance between the original test image $x$ and its reconstructed image $x_r$: $\epsilon = \| x - x_r \|$. Then
  • If $\epsilon \ge \theta$, then the input image is not even a face image and is not recognized.
  • If $\epsilon < \theta$ and $\epsilon_i \ge \theta$ for all $i$, then the input image is a face image, but it is an unknown face.
  • If $\epsilon < \theta$ and $\epsilon_l < \theta$ for some $l$, then the input image is recognized as the individual face image associated with the class vector $x_l$.
We now give some basic facts on the relation between the singular value decomposition (SVD) and PCA in this context:
Consider the singular value decomposition of the matrix $\bar{X}$:
$$\bar{X} = U \Sigma V^T = \sum_{i=1}^{p} \sigma_i\, u_i v_i^T,$$
where $U$ and $V$ are orthogonal matrices of sizes $nm$ and $p$, respectively. The singular values $\sigma_i$ are the square roots of the eigenvalues of the matrix $L = \bar{X}^T \bar{X}$, the $u_i$'s are the left singular vectors and the $v_i$'s are the right singular vectors. We have
$$L = \bar{X}^T \bar{X} = V \Delta V^T, \qquad \Delta = \mathrm{diag}(\sigma_1^2, \ldots, \sigma_p^2),$$
which is the eigendecomposition of the matrix $L$, and
$$C = \bar{X} \bar{X}^T = U D U^T, \qquad D = \mathrm{diag}(\sigma_1^2, \ldots, \sigma_p^2, 0, \ldots, 0).$$
In the PCA method, the projected eigenface space is then generated by the first $k$ columns $u_1, \ldots, u_k$ of the orthogonal matrix $U$ derived from the SVD of the matrix $\bar{X}$.
As only a small number $k$ of the largest singular values are needed in PCA, we can use the well-known Golub–Kahan algorithm to compute the desired singular values and the corresponding singular vectors that define the projection subspace.
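In MATLAB, for instance, this can be done with svds, which computes only the leading singular triplets through a Krylov-type (Lanczos/Golub–Kahan) process; a one-line sketch, with Xbar and k as in the previous snippet:

```matlab
[Uk, Sk, Vk] = svds(Xbar, k);   % k largest singular values/vectors, no full SVD
```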
In the next section, we explain how the SVD based PCA can be extended to tensors and propose an algorithm for facial recognition in this context.

4. The Tensor Golub–Kahan Method

As explained in the previous section, it is important to take into account the potentially large size of the datasets, especially for the training process. The idea of extending the matrix Golub–Kahan bidiagonalization algorithm to the tensor context has been explored in recent years for large and sparse tensors [21]. In [1], the authors established the foundations of a remarkable theoretical framework for tensor decompositions in association with the tensor-tensor t- and c-products, making it possible to generalize the main notions of linear algebra to tensors.

4.1. The Tensor C-Global Golub–Kahan Algorithm

Let $\mathcal{A} \in \mathbb{R}^{n_1 \times n_2 \times n_3}$ be a tensor and $s \ge 1$ an integer. The tensor c-global Golub–Kahan bidiagonalization algorithm (associated with the c-product) is described in Algorithm 3.
Algorithm 3 The Tensor Global Golub–Kahan algorithm (TGGKA).
1.  Choose a tensor $\mathcal{V}_1 \in \mathbb{R}^{n_2 \times s \times n_3}$ such that $\|\mathcal{V}_1\|_F = 1$ and set $\beta_0 = 0$.
2.  For $i = 1, 2, \ldots, k$
     (a)   $\mathcal{U}_i = \mathcal{A} \star_c \mathcal{V}_i - \beta_{i-1}\, \mathcal{U}_{i-1}$,
     (b)   $\alpha_i = \|\mathcal{U}_i\|_F$,
     (c)   $\mathcal{U}_i = \mathcal{U}_i / \alpha_i$,
     (d)   $\mathcal{V}_{i+1} = \mathcal{A}^T \star_c \mathcal{U}_i - \alpha_i\, \mathcal{V}_i$,
     (e)   $\beta_i = \|\mathcal{V}_{i+1}\|_F$,
     (f)   $\mathcal{V}_{i+1} = \mathcal{V}_{i+1} / \beta_i$.
     End
Let C k be the k × k upper bidiagonal matrix defined by
$$C_k = \begin{pmatrix} \alpha_1 & \beta_1 & & & \\ & \alpha_2 & \beta_2 & & \\ & & \ddots & \ddots & \\ & & & \alpha_{k-1} & \beta_{k-1} \\ & & & & \alpha_k \end{pmatrix}.$$
Let $\mathbb{V}_k$ and $\mathcal{A} \star_c \mathbb{V}_k$ be the $(n_2 \times (sk) \times n_3)$ and $(n_1 \times (sk) \times n_3)$ tensors with frontal slices $\mathcal{V}_1, \ldots, \mathcal{V}_k$ and $\mathcal{A} \star_c \mathcal{V}_1, \ldots, \mathcal{A} \star_c \mathcal{V}_k$, respectively, and let $\mathbb{U}_k$ and $\mathcal{A}^T \star_c \mathbb{U}_k$ be the $(n_1 \times (sk) \times n_3)$ and $(n_2 \times (sk) \times n_3)$ tensors with frontal slices $\mathcal{U}_1, \ldots, \mathcal{U}_k$ and $\mathcal{A}^T \star_c \mathcal{U}_1, \ldots, \mathcal{A}^T \star_c \mathcal{U}_k$, respectively. We set
$$\mathbb{V}_k := [\mathcal{V}_1, \ldots, \mathcal{V}_k], \qquad \mathcal{A} \star_c \mathbb{V}_k := [\mathcal{A} \star_c \mathcal{V}_1, \ldots, \mathcal{A} \star_c \mathcal{V}_k],$$
$$\mathbb{U}_k := [\mathcal{U}_1, \ldots, \mathcal{U}_k], \qquad \mathcal{A}^T \star_c \mathbb{U}_k := [\mathcal{A}^T \star_c \mathcal{U}_1, \ldots, \mathcal{A}^T \star_c \mathcal{U}_k],$$
with
$$\widetilde{C}_k^T = \begin{pmatrix} C_k^T \\ \beta_k e_k^T \end{pmatrix} \in \mathbb{R}^{(k+1) \times k}, \qquad e_k = (0, 0, \ldots, 0, 1)^T.$$
Then, we have the following results [13].
Proposition 1.
The tensors produced by the tensor c-global Golub–Kahan algorithm satisfy the following relations
$$\mathcal{A} \star_c \mathbb{V}_k = \mathbb{U}_k \circledast C_k,$$
$$\mathcal{A}^T \star_c \mathbb{U}_k = \mathbb{V}_{k+1} \circledast \widetilde{C}_k^T = \mathbb{V}_k \circledast C_k^T + \beta_k \left[ \mathcal{O}_{n_2 \times s \times n_3}, \ldots, \mathcal{O}_{n_2 \times s \times n_3}, \mathcal{V}_{k+1} \right],$$
where the product $\circledast$ is defined by
$$\mathbb{U}_k \circledast y = \sum_{j=1}^{k} y_j\, \mathcal{U}_j, \qquad y = (y_1, \ldots, y_k)^T \in \mathbb{R}^k.$$
We set the following notation:
$$\mathbb{U}_k \circledast C_k = \left[ \mathbb{U}_k \circledast C_k^{(1)}, \ldots, \mathbb{U}_k \circledast C_k^{(k)} \right],$$
where $C_k^{(i)}$ is the $i$-th column of the matrix $C_k$.
We note that, since the matrix $C_k$ is bidiagonal, $T_k = C_k^T C_k$ is symmetric and tridiagonal, and Algorithm 3 therefore computes the same information as the tensor global Lanczos algorithm applied to $\mathcal{A}^T \star_c \mathcal{A}$.
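A minimal MATLAB sketch of Algorithm 3 is given below. It reuses the cprod sketch from Section 2.2 and, as an assumption consistent with the c-product, takes the tensor transpose slice-wise (permuting the first two modes); the cells U{i}, V{i} store the tensors $\mathcal{U}_i$, $\mathcal{V}_i$ and the function name tggka is illustrative.

```matlab
function [U, V, alpha, beta] = tggka(A, s, k)
% Tensor c-global Golub-Kahan bidiagonalization (sketch of Algorithm 3).
[~, n2, n3] = size(A);
U = cell(k, 1);  V = cell(k+1, 1);
alpha = zeros(k, 1);  beta = zeros(k, 1);
V{1} = randn(n2, s, n3);
V{1} = V{1} / norm(V{1}(:));              % ||V_1||_F = 1
At = permute(A, [2 1 3]);                 % slice-wise tensor transpose of A
for i = 1:k
    W = cprod(A, V{i});
    if i > 1
        W = W - beta(i-1) * U{i-1};
    end
    alpha(i) = norm(W(:));                % alpha_i = ||U_i||_F
    U{i} = W / alpha(i);
    W = cprod(At, U{i}) - alpha(i) * V{i};
    beta(i) = norm(W(:));                 % beta_i = ||V_{i+1}||_F
    V{i+1} = W / beta(i);
end
end
```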

4.2. Tensor Tubal Golub–Kahan Bidiagonalisation Algorithm

First, we introduce some new products that will be useful in this section.
Definition 6
([13]). Let $\mathbf{a} \in \mathbb{R}^{1 \times 1 \times n_3}$ and $\mathcal{B} \in \mathbb{R}^{n_1 \times n_2 \times n_3}$; the tube fiber tensor product $\mathbf{a}\,\mathcal{B}$ is the $(n_1 \times n_2 \times n_3)$ tensor defined by
$$\mathbf{a}\,\mathcal{B} = \begin{pmatrix} \mathbf{a} \star_c \mathcal{B}(1, 1, :) & \cdots & \mathbf{a} \star_c \mathcal{B}(1, n_2, :) \\ \vdots & & \vdots \\ \mathbf{a} \star_c \mathcal{B}(n_1, 1, :) & \cdots & \mathbf{a} \star_c \mathcal{B}(n_1, n_2, :) \end{pmatrix}.$$
Definition 7
([13]). Let $\mathcal{A} \in \mathbb{R}^{n_1 \times m_1 \times n_3}$, $\mathcal{B} \in \mathbb{R}^{n_1 \times m_2 \times n_3}$, $\mathcal{C} \in \mathbb{R}^{n_2 \times m_1 \times n_3}$ and $\mathcal{D} \in \mathbb{R}^{n_2 \times m_2 \times n_3}$ be tensors. The block tensor
$$\begin{pmatrix} \mathcal{A} & \mathcal{B} \\ \mathcal{C} & \mathcal{D} \end{pmatrix} \in \mathbb{R}^{(n_1 + n_2) \times (m_1 + m_2) \times n_3}$$
is defined by forming, slice by slice, the corresponding block matrices from the frontal slices of the four tensors.
Definition 8.
Let $\mathcal{A} = [\mathcal{A}_1, \ldots, \mathcal{A}_{n_2}] \in \mathbb{R}^{n_1 \times n_2 \times n_3}$, where $\mathcal{A}_i \in \mathbb{R}^{n_1 \times 1 \times n_3}$. We denote by $\mathrm{TVect}(\mathcal{A})$ the tensor vectorization operator $\mathbb{R}^{n_1 \times n_2 \times n_3} \rightarrow \mathbb{R}^{n_1 n_2 \times 1 \times n_3}$ obtained by stacking the lateral slices $\mathcal{A}_i$ of $\mathcal{A}$, for $i = 1, \ldots, n_2$. In other words, for a tensor $\mathcal{A} = [\mathcal{A}_1, \ldots, \mathcal{A}_{n_2}] \in \mathbb{R}^{n_1 \times n_2 \times n_3}$ with $\mathcal{A}_i \in \mathbb{R}^{n_1 \times 1 \times n_3}$, we have:
$$\mathrm{TVect}(\mathcal{A}) = \begin{pmatrix} \mathcal{A}_1 \\ \mathcal{A}_2 \\ \vdots \\ \mathcal{A}_{n_2} \end{pmatrix} \in \mathbb{R}^{n_1 n_2 \times 1 \times n_3}.$$
Remark 3.
The TVect operator transforms a given tensor into a lateral slice. It is easy to see that, when $n_3 = 1$, the TVect operator coincides with the operator vec, which transforms a matrix into a vector.
Proposition 2.
Let $\mathcal{A} \in \mathbb{R}^{n_1 \times n_2 \times n_3}$ be a tensor. Then
$$\|\mathcal{A}\|_F = \|\mathrm{TVect}(\mathcal{A})\|_F.$$
Definition 9.
Let $\mathcal{A} = [\mathcal{A}_1, \ldots, \mathcal{A}_{n_2}] \in \mathbb{R}^{n_1 \times n_2 \times n_3}$, where $\mathcal{A}_i \in \mathbb{R}^{n_1 \times 1 \times n_3}$. We define the range space of $\mathcal{A}$, denoted by $\mathrm{Range}(\mathcal{A})$, as the c-linear span of the lateral slices of $\mathcal{A}$:
$$\mathrm{Range}(\mathcal{A}) = \left\{ \mathcal{A}_1 \star_c \mathbf{a}_1 + \cdots + \mathcal{A}_{n_2} \star_c \mathbf{a}_{n_2} \;\middle|\; \mathbf{a}_i \in \mathbb{R}^{1 \times 1 \times n_3} \right\}.$$
Definition 10
([14]). Let $\mathcal{A} \in \mathbb{R}^{n_1 \times n_2 \times n_3}$ and $\mathcal{B} \in \mathbb{R}^{m_1 \times m_2 \times n_3}$; the c-Kronecker product $\mathcal{A} \otimes \mathcal{B}$ of $\mathcal{A}$ and $\mathcal{B}$ is the $n_1 m_1 \times n_2 m_2 \times n_3$ tensor whose transformed tensor $\widetilde{(\mathcal{A} \otimes \mathcal{B})}$ has $i$-th frontal slice given by
$$\widetilde{(\mathcal{A} \otimes \mathcal{B})}^{(i)} = A^{(i)} \otimes B^{(i)}, \qquad i = 1, \ldots, n_3,$$
where A ( i ) and B ( i ) are the i-th frontal slices of the tensors A ˜ = dct ( A , [ ] , 3 ) and B ˜ = dct ( B , [ ] , 3 ) , respectively.
We now introduce a normalization algorithm allowing us to decompose a non-zero tensor $\mathcal{C} \in \mathbb{R}^{n_1 \times n_2 \times n_3}$ as
$$\mathcal{C} = \mathbf{a}\,\mathcal{Q}, \qquad \text{with } \langle \mathcal{Q}, \mathcal{Q} \rangle = \mathbf{e},$$
where $\mathbf{a} \in \mathbb{R}^{1 \times 1 \times n_3}$ is an invertible tube fiber, $\mathcal{Q} \in \mathbb{R}^{n_1 \times n_2 \times n_3}$, and $\mathbf{e}$ is the tube fiber in $\mathbb{R}^{1 \times 1 \times n_3}$ defined by $\mathrm{unfold}(\mathbf{e}) = (1, 0, \ldots, 0)^T$.
This procedure is described in Algorithm 4.
Algorithm 4 Normalization algorithm (Normalize).
1.  Input. A R n 1 × n 2 × n 3 and a tolerance t o l > 0 .
2.  Output. The tensor Q and the tube fiber a .
3.  Set Q ˜ = dct ( A , [ ] , 3 )
     (a)  For j = 1 , , n 3
         i.   $a_j = \|\widetilde{Q}^{(j)}\|_F$
         ii.  if $a_j > tol$, $\widetilde{Q}^{(j)} = \widetilde{Q}^{(j)} / a_j$
         iii. else $\widetilde{Q}^{(j)} = \mathrm{rand}(n_1, n_2)$; $a_j = \|\widetilde{Q}^{(j)}\|_F$;
              $\widetilde{Q}^{(j)} = \widetilde{Q}^{(j)} / a_j$; $a_j = 0$,
     (b)  End
4.   Q = idct ( Q ˜ , [ ] , 3 ) , a = idct ( a , [ ] , 3 )
5.  End
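A MATLAB version of Algorithm 4 might read as follows (a direct, hedged transcription; the function name normalize_tensor is illustrative):

```matlab
function [Q, a] = normalize_tensor(A, tol)
% Normalization: A = a Q with <Q, Q> = e (sketch of Algorithm 4).
[n1, n2, n3] = size(A);
Qt = dct(A, [], 3);
a  = zeros(1, 1, n3);
for j = 1:n3
    aj = norm(Qt(:,:,j), 'fro');
    if aj > tol
        Qt(:,:,j) = Qt(:,:,j) / aj;
        a(1,1,j)  = aj;
    else                                  % numerically zero slice: replace it
        Qt(:,:,j) = rand(n1, n2);
        Qt(:,:,j) = Qt(:,:,j) / norm(Qt(:,:,j), 'fro');
        a(1,1,j)  = 0;
    end
end
Q = idct(Qt, [], 3);
a = idct(a, [], 3);
end
```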
Next, we give the Tensor Tube Global Golub–Kahan (TTGGKA) algorithm; see [13]. Let $\mathcal{A} \in \mathbb{R}^{n_1 \times n_2 \times n_3}$ be a tensor and let $s \ge 1$ be an integer. The Tensor Tube Global Golub–Kahan bidiagonalization process is described in Algorithm 5.
Algorithm 5 The Tensor Tube Global Golub–Kahan algorithm (TTGGKA).
1.  Choose a tensor $\mathcal{V}_1 \in \mathbb{R}^{n_2 \times s \times n_3}$ such that $\langle \mathcal{V}_1, \mathcal{V}_1 \rangle = \mathbf{e}$ and set $\mathbf{b}_0 = 0$.
2.  For $i = 1, 2, \ldots, k$
   (a)   $\mathcal{U}_i = \mathcal{A} \star_c \mathcal{V}_i - \mathbf{b}_{i-1}\, \mathcal{U}_{i-1}$,
   (b)   $[\mathcal{U}_i, \mathbf{a}_i] = \mathrm{Normalize}(\mathcal{U}_i)$,
   (c)   $\mathcal{V}_{i+1} = \mathcal{A}^T \star_c \mathcal{U}_i - \mathbf{a}_i\, \mathcal{V}_i$,
   (d)   $[\mathcal{V}_{i+1}, \mathbf{b}_i] = \mathrm{Normalize}(\mathcal{V}_{i+1})$.
   End
Let $\mathcal{C}_k$ be the $k \times k \times n_3$ upper bidiagonal tensor (each frontal slice of $\mathcal{C}_k$ is a bidiagonal matrix) and $\widetilde{\mathcal{C}}_k$ the $k \times (k+1) \times n_3$ tensor defined by
$$\mathcal{C}_k = \begin{pmatrix} \mathbf{a}_1 & \mathbf{b}_1 & & \\ & \mathbf{a}_2 & \ddots & \\ & & \ddots & \mathbf{b}_{k-1} \\ & & & \mathbf{a}_k \end{pmatrix}, \qquad \widetilde{\mathcal{C}}_k = \begin{pmatrix} \mathbf{a}_1 & \mathbf{b}_1 & & & \\ & \mathbf{a}_2 & \ddots & & \\ & & \ddots & \mathbf{b}_{k-1} & \\ & & & \mathbf{a}_k & \mathbf{b}_k \end{pmatrix}.$$
Let V k and A 🟉 c V k be the ( n 2 × ( s k ) × n 3 ) and ( n 1 × ( s k ) × n 3 ) tensors with frontal slices V 1 , , V k and A 🟉 c V 1 , , A 🟉 c V k , respectively, and let U k and A T 🟉 c U k be the ( n 1 × ( s k ) × n 3 ) and ( n 2 × ( s k ) × n 3 ) tensors with frontal slices U 1 , , U k and A T 🟉 c U 1 , , A T 🟉 c U k , respectively. We set
V k : = V 1 , , V k , and A 🟉 c V k : = [ A 🟉 c V 1 , , A 🟉 c V k ] ,
U k : = U 1 , , U k , and A T 🟉 c U k : = [ A T 🟉 c U 1 , , A T 🟉 c U k ] ,
Then, we have the following results.
Proposition 3.
The tensors produced by the tensor TTGGKA algorithm satisfy the following relations
$$\mathcal{A} \star_c \mathbb{V}_k = \mathbb{U}_k \star_c (\mathcal{C}_k \otimes \mathcal{I}_{ssn_3}),$$
$$\mathcal{A}^T \star_c \mathbb{U}_k = \mathbb{V}_{k+1} \star_c (\widetilde{\mathcal{C}}_k^T \otimes \mathcal{I}_{ssn_3}) = \mathbb{V}_k \star_c (\mathcal{C}_k^T \otimes \mathcal{I}_{ssn_3}) + \mathcal{V}_{k+1} \star_c \big( (\mathbf{b}_k \star_c e_{1,k,:}) \otimes \mathcal{I}_{ssn_3} \big),$$
where $e_{1,k,:} \in \mathbb{R}^{1 \times k \times n_3}$ has a 1 in the $(1, k, 1)$ position and zeros elsewhere, $\mathcal{I}_{ssn_3} \in \mathbb{R}^{s \times s \times n_3}$ is the identity tensor and $\mathbf{b}_k$ is the fiber tube in the $(k, k+1, :)$ position of the tensor $\widetilde{\mathcal{C}}_k$.

5. The Tensor Tubal PCA Method

In this section, we describe a tensor-SVD based PCA method for third-order tensors, which naturally arise in problems involving images such as facial recognition. As in the matrix case, we consider a set of N training images, each one being encoded as an $n_1 \times n_2 \times n_3$ real tensor $\mathcal{I}_i$, $1 \le i \le N$. In the case of RGB images, each frontal slice contains the encoding of one color layer ($n_3 = 3$), but in order to store additional features, the case $n_3 > 3$ could also be considered.
Let us consider one training image $\mathcal{I}_{i_0}$. Each of the $n_3$ frontal slices $\mathcal{I}_{i_0}^{(j)}$ of $\mathcal{I}_{i_0}$ is reshaped into a column vector $\mathrm{vec}(\mathcal{I}_{i_0}^{(j)})$ of length $L = n_1 \times n_2$, and we form an $L \times 1 \times n_3$ tensor $\mathcal{X}_{i_0}$ defined by $\mathcal{X}_{i_0}(:, :, j) = \mathrm{vec}(\mathcal{I}_{i_0}^{(j)})$. Applying this procedure to each training image, we obtain N tensors $\mathcal{X}_i$ of size $L \times 1 \times n_3$. The average image tensor is defined as $\bar{\mathcal{X}} = \frac{1}{N} \sum_{i=1}^{N} \mathcal{X}_i$ and we define the $L \times N \times n_3$ training tensor $\mathcal{X} = [\bar{\mathcal{X}}_1, \ldots, \bar{\mathcal{X}}_N]$, where $\bar{\mathcal{X}}_i = \mathcal{X}_i - \bar{\mathcal{X}}$.
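For illustration, the training tensor can be assembled as in the following sketch, where imgs is assumed to be a cell array of $n_1 \times n_2 \times n_3$ image tensors (all names are illustrative):

```matlab
N = numel(imgs);
[n1, n2, n3] = size(imgs{1});
L = n1 * n2;
X = zeros(L, N, n3);
for i = 1:N
    for j = 1:n3
        X(:, i, j) = reshape(double(imgs{i}(:,:,j)), L, 1);  % vec of each layer
    end
end
Xmean = mean(X, 2);        % average image tensor, L x 1 x n3
X = X - Xmean;             % centred training tensor
```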
Let us now consider the c-SVD decomposition $\mathcal{X} = \mathcal{U} \star_c \mathcal{S} \star_c \mathcal{V}^T$ of $\mathcal{X}$, where $\mathcal{U}$ and $\mathcal{V}$ are orthogonal tensors of size $L \times L \times n_3$ and $N \times N \times n_3$, respectively, and $\mathcal{S}$ is an f-diagonal tensor of size $L \times N \times n_3$.
In the matrix context, it is known that just a few singular values suffice to capture the main features of an image; therefore, applying this idea to each of the three color layers, an RGB image can be approximated by a low tubal rank tensor. Let us consider an image tensor $\mathcal{T} \in \mathbb{R}^{n_1 \times n_2 \times n_3}$ and its c-SVD decomposition $\mathcal{T} = \mathcal{U} \star_c \mathcal{S} \star_c \mathcal{V}^T$. Choosing an integer r such that $r \le \min(n_1, n_2)$, we can approximate $\mathcal{T}$ by the tubal rank-r tensor
$$\mathcal{T}_r = \sum_{i=1}^{r} \mathcal{U}(:, i, :) \star_c \mathcal{S}(i, i, :) \star_c \mathcal{V}(:, i, :)^T.$$
In Figure 1, we show a 512 × 512 RGB image and the images obtained for various truncation indices. On the left part, we plot the singular values of one color layer of the RGB tensor (the exact same behaviour is observed on the two other layers). The rapid decrease of the singular values explains the good quality of the compressed images, even for small truncation indices.
Applying this idea to our problem, we want to obtain truncated tensor SVDs of the training tensor $\mathcal{X}$ without computing the whole c-SVD. After k iterations of the TTGGKA algorithm (for the case s = 1), we obtain three tensors $\mathbb{U}_k \in \mathbb{R}^{n_1 \times k \times n_3}$, $\mathbb{V}_{k+1} \in \mathbb{R}^{n_2 \times (k+1) \times n_3}$ and $\widetilde{\mathcal{C}}_k \in \mathbb{R}^{k \times (k+1) \times n_3}$, as defined in Equation (21), such that
$$\mathcal{A}^T \star_c \mathbb{U}_k = \mathbb{V}_{k+1} \star_c \widetilde{\mathcal{C}}_k^T.$$
Let $\widetilde{\mathcal{C}}_k = \Phi \star_c \Sigma \star_c \Psi^T$ be the c-SVD of $\widetilde{\mathcal{C}}_k$, noticing that $\widetilde{\mathcal{C}}_k \in \mathbb{R}^{k \times (k+1) \times n_3}$ is much smaller than $\bar{\mathcal{X}}$. Then the first tubal singular values and the left tubal singular tensors of $\bar{\mathcal{X}}$ are given by $\Sigma(i, i, :)$ and $\mathbb{U}_k \star_c \Phi(:, i, :)$, respectively, for $i \le k$; see [1] for more details.
In order to illustrate the ability of the TTGGKA algorithm to approximate the first singular elements of a tensor, we considered a 900 × 900 × 3 real tensor $\mathcal{A}$ whose frontal slices were matrices generated by a finite difference discretization of differential operators. In Figure 2, we display the error on the first diagonal coefficient $\mathcal{S}(1, 1, 1)$ of the first frontal slice as a function of the number of iterations of the Tensor Tube Golub–Kahan algorithm, where $\mathcal{A} = \mathcal{U} \star_c \mathcal{S} \star_c \mathcal{V}^T$ is the c-SVD of $\mathcal{A}$.
In Table 1, we report the errors, in the tensor Frobenius norm, on the singular tubes as a function of the number k of iterations of the Tensor Tube Golub–Kahan algorithm.
The same behaviour was observed on all the other frontal slices. This example illustrates the ability of the TTGGKA algorithm to approximate the largest singular tubes.
The projection space is generated by the lateral slices of the tensor $\mathcal{P} = \mathbb{U}_k \star_c \Phi(:, 1{:}k, :) \in \mathbb{R}^{n_1 \times k \times n_3}$ derived from the TTGGKA algorithm and the c-SVD decomposition of the bidiagonal tensor $\widetilde{\mathcal{C}}_k$, i.e., by the c-linear span of the first k lateral slices of $\mathcal{P}$; see [1,19] for more details.
The steps of the Tensor Tubal PCA algorithm for face recognition which finds the closest image in the training database for a given image I 0 are summarized in Algorithm 6:
Algorithm 6 The Tensor Tubal PCA algorithm (TTPCA).
1.  Inputs: Training image tensor $\mathcal{X}$ (N images), mean image tensor $\bar{\mathcal{X}}$, test image $\mathcal{I}_0$, truncation index r, number k of iterations of the TTGGKA algorithm ($k \ge r$).
2.  Output: Closest image in the training database.
3.  Run k iterations of the TTGGKA algorithm to obtain the tensors $\mathbb{U}_k$ and $\widetilde{\mathcal{C}}_k$.
4.  Compute $[\Phi, \Sigma, \Psi] = \text{c-SVD}(\widetilde{\mathcal{C}}_k)$.
5.  Compute the projection tensor $\mathcal{P}_r = [\mathcal{P}_r(:, 1, :), \ldots, \mathcal{P}_r(:, r, :)]$, where $\mathcal{P}_r(:, i, :) = \mathbb{U}_k \star_c \Phi(:, i, :) \in \mathbb{R}^{n_1 \times 1 \times n_3}$.
6.  Compute the projected training tensor $\widehat{\mathcal{X}}_r = \mathcal{P}_r^T \star_c \mathcal{X}$ and the projected centred test image $\widehat{\mathcal{I}}_r = \mathcal{P}_r^T \star_c (\mathcal{I}_0 - \bar{\mathcal{X}})$.
7.  Find $i^* = \arg\min_{i = 1, \ldots, N} \|\widehat{\mathcal{I}}_r - \widehat{\mathcal{X}}_r(:, i, :)\|_F$.
In the next section, we consider image identification problems on various databases.

6. Numerical Tests

In this section, we consider three examples of image identification. In the case of grayscale images, the global version of the Golub–Kahan algorithm was used to compute the dominant singular values in order to perform a PCA on the data. For the two other situations, we used the Tensor Tubal PCA (TTPCA) method based on the Tensor Tube Global Golub–Kahan (TTGGKA) algorithm in order to perform facial recognition on RGB images. The tests were performed with MATLAB 2019a, on an Intel i5 laptop with 16 GB of memory. We considered various truncation indices r for which the recognition rates were computed. We also report the CPU time for the training process.

6.1. Example 1

In this example, we considered the MNIST database of handwritten digits [22]. The database contains two subsets of 28 × 28 grayscale images (60,000 training images and 10,000 test images). A sample is shown in Figure 3. Each image was vectorized as a vector of length 28 × 28 = 784 and, following the process described in Section 3.1, we formed the training and the test matrices of sizes 784 × 60 , 000 and 784 × 10 , 000 , respectively.
Both matrices were centred by subtracting the mean training image, and the Golub–Kahan algorithm was used to generate an approximation of the r dominant singular values $s_i$ and left singular vectors $u_i$, $i = 1, \ldots, r$.
Let us denote by $\mathcal{U}_r$ the subspace spanned by the columns of $U_r = [u_1, \ldots, u_r]$. Let t be a (centred) test image and $\hat{t}_r = U_r^T t$ its projection onto $\mathcal{U}_r$. The closest image in the training dataset is determined by computing
$$i^* = \arg\min_{i = 1, \ldots, 60{,}000} \|\hat{t}_r - \widehat{X}_r(:, i)\|,$$
where $\widehat{X}_r = U_r^T X$.
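In MATLAB terms, the test stage then reduces to a projection followed by a nearest-neighbour search (a sketch; Ur, X and t are the quantities defined above, with t and the columns of X already centred):

```matlab
Xhat = Ur' * X;                          % projected training set, r x 60000
that = Ur' * t;                          % projected (centred) test image
[~, idx] = min(vecnorm(Xhat - that));    % index of the closest training image
```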
For various truncation indices r, we tested each image of the test subset and computed the recognition rate (i.e., a test is successful if the digit is correctly identified). The results are plotted in Figure 4 and show that a good level of accuracy is obtained with only a few approximate singular values. Given the large size of the training matrix, this validates the interest of computing only a few singular values with the Golub–Kahan algorithm.

6.2. Example 2

In this example, we used the Georgia Tech database GTDB_crop [23], which contains 750 face images of 50 persons with different illumination conditions, facial expressions and face orientations, as shown in Figure 5. The RGB JPEG images were resized to 100 × 100 × 3 tensors.
Each image file is coded as a 100 × 100 × 3 tensor and transformed into a 10,000 × 1 × 3 tensor, as explained in the previous section. We built the training and test tensors as follows: from the 15 pictures of each person in the database, five pictures were randomly chosen and stored in the test folder, and the 10 remaining pictures were used for the training tensor. Hence, the database was partitioned into two subsets containing 250 and 500 items, respectively, at each iteration of the simulation.
We applied the TTGGKA-based Algorithm 6 for various truncation indices. In Figure 6, we show a test image (top left position), the closest image in the database (top right), the mean image of the training database (bottom left) and the eigenface associated with the test image (bottom right).
In order to compute the recognition rate, we ran 100 simulations, counted the number of successes (i.e., a test is successful if the person is correctly identified) and report the best identification rates as a function of the truncation index r in Figure 7.
The results match the performance observed in the literature [24] for this database, and they confirm that the use of a Golub–Kahan strategy is interesting, especially because, in terms of training, the Tensor Tubal PCA algorithm required only 5 s instead of 25 s when using a c-SVD.

6.3. Example 3

In this third example, we used the larger AR face database (cropped version) [9], which contains 2600 bitmap pictures of human faces (50 males and 50 females, 26 pictures per person), with different facial expressions, lighting conditions and face orientations. The bitmap pictures were resized to 100 × 100 JPEG images. The same protocol as for Example 2 was followed: we partitioned the set of images into two subsets. Out of the 26 pictures of each person, 6 were randomly chosen as test images and the remaining 20 were put into the training folder. The training process took 24 s, while it would have taken 81.5 s using a c-SVD. An example of a test image, the closest match in the dataset, the mean image and its associated eigenface are shown in Figure 8.
We applied our approach (TTPCA) to the 10,000 × 2000 × 3 training tensor X and plotted the recognition rate as a function of the truncation index in Figure 9.
For all examples, it is worth noticing that, as expected in face identification problems, only a few of the first largest singular elements suffice to capture the main features of an image. Therefore, the Golub–Kahan based strategies such as the TTPCA method are an interesting choice.

7. Conclusions

In this manuscript, we focused on two types of Golub–Kahan factorizations. We used the recent advances in the field of tensor factorization and showed that this approach is efficient for image identification. The main feature of this approach resides in the ability of the global Golub–Kahan algorithms to approximate the dominant singular elements of a training matrix or tensor without computing the full SVD. This is particularly important, as the matrices and tensors involved in this type of application can be very large. Moreover, when color has to be taken into account, this approach does not require a conversion to grayscale, which can be very important for some applications. In future work, we would like to study the feasibility of implementing the promising randomized PCA approaches within the Golub–Kahan tensor algorithm in order to reduce the computational cost of the training process for very large datasets.

Author Contributions

Conceptualization, M.H., K.J., C.K. and M.M.; methodology, M.H. and K.J.; software, M.H.; validation, M.H., K.J., C.K. and M.M.; writing—original draft preparation, M.H., K.J., C.K. and M.M.; writing—review and editing, M.H., K.J., C.K. and M.M.; visualization, M.H., K.J., C.K. and M.M.; supervision, K.J.; project administration, M.H. and K.J. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

Mustapha Hached acknowledges support from the Labex CEMPI (ANR-11-LABX-0007-01).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Kilmer, M.E.; Braman, K.; Hao, N.; Hoover, R.C. Third-order tensors as operators on matrices: A theoretical and computational framework with applications in imaging. SIAM J. Matrix Anal. Appl. 2013, 34, 148–172. [Google Scholar]
  2. Kolda, T.G.; Bader, B.W. Tensor Decompositions and Applications. SIAM Rev. 2009, 3, 455–500. [Google Scholar] [CrossRef]
  3. Zhang, J.; Saibaba, A.K.; Kilmer, M.E.; Aeron, S. A randomized tensor singular value decomposition based on the t-product. Numer Linear Algebra Appl. 2018, 25, e2179. [Google Scholar] [CrossRef] [Green Version]
  4. Cai, S.; Luo, Q.; Yang, M.; Li, W.; Xiao, M. Tensor robust principal component analysis via non-convex low rank approximation. Appl. Sci. 2019, 9, 1411. [Google Scholar] [CrossRef] [Green Version]
  5. Kong, H.; Xie, X.; Lin, Z. t-Schatten-p norm for low-rank tensor recovery. IEEE J. Sel. Top. Signal Process. 2018, 12, 1405–1419. [Google Scholar] [CrossRef]
  6. Lin, Z.; Chen, M.; Ma, Y. The augmented Lagrange multiplier method for exact recovery of corrupted low-rank matrices. arXiv 2010, arXiv:1009.5055. [Google Scholar]
  7. Kang, Z.; Peng, C.; Cheng, Q. Robust PCA via nonconvex rank approximation. In Proceedings of the 2015 IEEE International Conference on Data Mining, Atlantic City, NJ, USA, 14–17 November 2015. [Google Scholar]
  8. Lu, C.; Feng, J.; Chen, Y.; Liu, W.; Lin, Z.; Yan, S. Tensor Robust Principal Component Analysis with a New Tensor Nuclear Norm. IEEE Anal. Mach. Intell. 2020, 42, 925–938. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  9. Martinez, A.M.; Kak, A.C. PCA versus LDA. IEEE Trans. Pattern Anal. Mach. Intell. 2001, 23, 228–233. [Google Scholar] [CrossRef] [Green Version]
  10. Guide, M.E.; Ichi, A.E.; Jbilou, K.; Sadaka, R. Tensor Krylov subspace methods via the T-product for color image processing. arXiv 2020, arXiv:2006.07133. [Google Scholar]
  11. Brazell, M.; Navasca, N.L.C.; Tamon, C. Solving Multilinear Systems Via Tensor Inversion. SIAM J. Matrix Anal. Appl. 2013, 34, 542–570. [Google Scholar] [CrossRef]
  12. Beik, F.P.A.; Jbilou, K.; Najafi-Kalyani, M.; Reichel, L. Golub–Kahan bidiagonalization for ill-conditioned tensor equations with applications. Numer. Algorithms 2020, 84, 1535–1563. [Google Scholar] [CrossRef]
  13. Ichi, A.E.; Jbilou, K.; Sadaka, R. On some tensor tubal-Krylov subspace methods via the T-product. arXiv 2020, arXiv:2010.14063. [Google Scholar]
  14. Guide, M.E.; Ichi, A.E.; Jbilou, K. Discrete cosine transform LSQR and GMRES methods for multidimensional ill-posed problems. arXiv 2020, arXiv:2103.11847. [Google Scholar]
  15. Vasilescu, M.A.O.; Terzopoulos, D. Multilinear image analysis for facial recognition. In Proceedings of the Object Recognition Supported by User Interaction for Service Robots, Quebec City, QC, Canada, 11–15 August 2002; pp. 511–514. [Google Scholar]
  16. Jain, A. Fundamentals of Digital Image Processing; Prentice–Hall: Englewood Cliffs, NJ, USA, 1989. [Google Scholar]
  17. Ng, M.K.; Chan, R.H.; Tang, W. A fast algorithm for deblurring models with Neumann boundary conditions. SIAM J. Sci. Comput. 1999, 21, 851–866. [Google Scholar] [CrossRef] [Green Version]
  18. Kernfeld, E.; Kilmer, M.; Aeron, S. Tensor-tensor products with invertible linear transforms. Linear Algebra Appl. 2015, 485, 545–570. [Google Scholar] [CrossRef]
  19. Kilmer, M.E.; Martin, C.D. Factorization strategies for third-order tensors. Linear Algebra Appl. 2011, 435, 641–658. [Google Scholar] [CrossRef] [Green Version]
  20. Golub, G.H.; Van Loan, C.F. Matrix Computations, 3rd ed.; Johns Hopkins University Press: Baltimore, MD, USA, 1996. [Google Scholar]
  21. Savas, B.; Eldén, L. Krylov-type methods for tensor computations I. Linear Algebra Appl. 2013, 438, 891–918. [Google Scholar] [CrossRef]
  22. LeCun, Y.; Cortes, C.; Burges, C.J.C. The MNIST Database. Available online: http://yann.lecun.com/exdb/mnist/ (accessed on 22 February 2021).
  23. Nefian, A.V. Georgia Tech Face Database. Available online: http://www.anefian.com/research/face_reco.htm (accessed on 22 February 2021).
  24. Wang, S.; Sun, M.; Chen, Y.; Pang, E.; Zhou, C. STPCA: Sparse tensor Principal Component Analysis for feature extraction. In Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012), Tsukuba, Japan, 11–15 November 2012; pp. 2278–2281. [Google Scholar]
Figure 1. Image compression.
Figure 2. $|\Sigma(1,1,1) - \mathcal{S}(1,1,1)|$ vs. number of TTGGKA iterations k.
Figure 3. First 16 images of MNIST training subset.
Figure 4. Identification rates for different truncation indices r.
Figure 5. Fifteen pictures of one individual in the database.
Figure 6. Test image, closest image, mean image and eigenface.
Figure 7. Identification rates for different truncation indices r.
Figure 8. Test image, closest image, mean image and eigenface.
Figure 9. Identification rates for different truncation indices r.
Table 1. $\|\mathcal{S}(i,i,:) - \Sigma(i,i,:)\|_F$ vs. k.

                 k = 10        k = 30        k = 50        k = 70
S(1,1,:)      3.6 × 10^-4   1.3 × 10^-5   5.1 × 10^-11   4.8 × 10^-17
S(2,2,:)      2.0 × 10^-3   1.6 × 10^-6   5.2 × 10^-7    3.1 × 10^-8
S(3,3,:)      4.9 × 10^-3   5.9 × 10^-4   2.3 × 10^-4    5.6 × 10^-8
S(4,4,:)      8.4 × 10^-3   8.8 × 10^-4   1.5 × 10^-4    1.0 × 10^-8
S(5,5,:)      1.4 × 10^-2   1.3 × 10^-3   2.7 × 10^-4    1.1 × 10^-8