The Georgia Tech dataset consists of 750 color images, 15 images for each of 50 subjects, and is widely used in face recognition. The images were taken in the Georgia Tech lab, and most of them include changes in illumination, expression, and pose.
Color FERET is a color version of the grayscale FERET face dataset, with 11,338 color face images from 994 people. Since the number of images varies from person to person, 200 subjects with seven images each were selected; these 1400 images form the Color FERET subset used here. The seven images of each person cover changes in illumination, expression, and pose.
AR is a face dataset composed of 3120 color face images, 26 images for each of 120 subjects. The images were taken from the front, so pose changes are relatively few. In addition to illumination and expression changes, the influence of occlusion factors is also considered.
LFW-A is a version of the LFW face dataset after face alignment processing, comprising 13,233 color images of 5749 subjects. As the images were collected from the Internet, the dataset is suitable for face recognition research in natural scenes. The face images vary in illumination, age, pose, expression, and occlusion, making the dataset very challenging. To distribute samples evenly, subjects with more than nine images were selected to construct a subset of LFW-A.
The images of each person in each dataset were randomly divided into a training set and a test set in a 2:1 ratio. Since the split of training and test samples was random, each experiment was repeated ten times and the average result was taken. For efficient testing, all images were uniformly resized to a fixed size.
Meanwhile, based on the experimental setting in this paper, the SVM classifier was implemented with the LIBLINEAR library using a linear kernel. The linear kernel is suitable when the number of samples is much smaller than the number of features (no mapping to a higher dimension is needed), or when both the number of samples and the number of features are large (mainly for training speed).
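This evaluation protocol can be sketched as follows. It is a minimal illustration with synthetic features: `LinearSVC` is scikit-learn's wrapper around LIBLINEAR, and all names, shapes, and the 10-subject layout here are assumptions for the sketch, not values from the paper.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.svm import LinearSVC

# Hypothetical features standing in for the extracted face descriptors.
rng = np.random.default_rng(0)
X = rng.normal(size=(150, 64))       # 150 samples, 64-dim features
y = np.repeat(np.arange(10), 15)     # 10 subjects, 15 images each

accuracies = []
for seed in range(10):               # repeat the random split ten times
    X_tr, X_te, y_tr, y_te = train_test_split(
        X, y, test_size=1 / 3, stratify=y, random_state=seed)  # 2:1 per subject
    clf = LinearSVC().fit(X_tr, y_tr)        # linear-kernel SVM via LIBLINEAR
    accuracies.append(clf.score(X_te, y_te))

print(f"mean accuracy over 10 runs: {np.mean(accuracies):.3f}")
```

With stratified splitting, each subject contributes ten training and five test images per run, matching the 2:1 division described above.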
3.1. Comparison of Algorithm Performance under Different Occlusion Conditions
Firstly, the QNSPCANet proposed in this paper was compared with three related algorithms, PCANet, QPCANet, and QSPCANet, under different occlusion conditions: occlusion contained in the dataset, self-added pure color occlusion, and salt-and-pepper noise occlusion. On the Color FERET, AR, and LFW-A datasets, recognition accuracy was also compared with other recent occlusion-robust algorithms on the same data. The Georgia Tech dataset was only compared against algorithms with a similar structure, since few occlusion methods have been applied to it.
Since there are no large-area occlusions in the face images of the Georgia Tech, Color FERET, and LFW-A datasets, the experiments on these three datasets are divided into three groups: (1) no occlusion, with training and test images randomly split 2:1; (2) self-added pure color occlusion, where a blue block covering 20% of the pixel area is added to randomly selected face images in the test set; and (3) salt-and-pepper noise occlusion, where noise blocks are added to randomly selected face images in the test set.
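The two self-added occlusion types can be sketched as below. This is a minimal NumPy sketch under stated assumptions: the paper does not specify block placement or the noise density inside the block, so random placement, a square block, and a 50% corruption probability are illustrative choices.

```python
import numpy as np

def add_color_block(img, area_frac=0.2, color=(0, 0, 255), rng=None):
    """Overlay a square solid-color block covering ~area_frac of the pixels."""
    if rng is None:
        rng = np.random.default_rng()
    h, w, _ = img.shape
    side = int(round(np.sqrt(area_frac * h * w)))   # square with the target area
    top = rng.integers(0, h - side + 1)
    left = rng.integers(0, w - side + 1)
    out = img.copy()
    out[top:top + side, left:left + side] = color
    return out

def add_salt_pepper_block(img, area_frac=0.2, noise_prob=0.5, rng=None):
    """Corrupt a square region with salt-and-pepper noise."""
    if rng is None:
        rng = np.random.default_rng()
    h, w, _ = img.shape
    side = int(round(np.sqrt(area_frac * h * w)))
    top = rng.integers(0, h - side + 1)
    left = rng.integers(0, w - side + 1)
    out = img.copy()
    block = out[top:top + side, left:left + side]
    mask = rng.random(block.shape[:2]) < noise_prob  # pixels to corrupt
    salt = rng.random(block.shape[:2]) < 0.5         # half salt, half pepper
    block[mask & salt] = 255
    block[mask & ~salt] = 0
    return out

img = np.full((60, 60, 3), 128, dtype=np.uint8)      # dummy gray image
occluded = add_color_block(img, rng=np.random.default_rng(0))
noisy = add_salt_pepper_block(img, rng=np.random.default_rng(1))
```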
For the AR dataset, each person has 26 face images: eight without occlusion, six with illumination changes, six with sunglasses occlusion, and six with scarf occlusion. The experiment is divided into five groups: (1) no occlusion, where nine images are randomly selected from the 14 occlusion-free images (including those with illumination changes) as the training set and the remaining five form the test set; (2) sunglasses occlusion, with the 14 occlusion-free images as the training set and the six sunglasses images as the test set; (3) scarf occlusion, with the 14 occlusion-free images as the training set and the six scarf images as the test set; (4) self-added pure color occlusion, where the training set consists of nine randomly selected occlusion-free images, and blue occlusion blocks (about 20% occlusion area) are added to the remaining five images to form the test set; and (5) salt-and-pepper noise occlusion, where nine of the 14 occlusion-free images are randomly selected as the training set and noise blocks are added to the remaining five as the test set. Color face samples from the AR dataset, images with self-added solid color occlusion, and salt-and-pepper noise occlusion of different areas are shown in Figure 3.
The convolution order of each principal component analysis network is set to 2, the number of convolution kernels in each layer and the histogram window size are fixed, the sampling matrix is set to the optimal size for each dataset, and the corresponding overlap rate is set to 0.5. For the quaternion sparse optimization problem in the convolution layer, an appropriate sparse parameter is selected for each feature vector, the update rate threshold is fixed, and the maximum number of iterations is set to 1000. Finally, a trained SVM classifier performs color face recognition on the extracted features.
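For intuition, the PCA filter-learning step at the core of PCANet-style networks can be sketched as follows. This is a real-valued simplification under stated assumptions: the patch size `k=7` and filter count `n_filters=8` are illustrative, and the paper's quaternion representation and non-convex sparse optimization are not reproduced here.

```python
import numpy as np

def learn_pca_filters(images, k=7, n_filters=8):
    """Learn PCANet-style convolution filters: collect all k-by-k patches,
    remove the per-patch mean, and take the leading eigenvectors of the
    patch covariance matrix as filters."""
    patches = []
    for img in images:
        h, w = img.shape
        for i in range(h - k + 1):
            for j in range(w - k + 1):
                patches.append(img[i:i + k, j:j + k].ravel())
    X = np.array(patches, dtype=float)
    X -= X.mean(axis=1, keepdims=True)        # remove per-patch mean
    cov = X.T @ X / len(X)                    # patch covariance (k*k by k*k)
    vals, vecs = np.linalg.eigh(cov)          # eigenvalues in ascending order
    filters = vecs[:, ::-1][:, :n_filters]    # top components as filters
    return filters.T.reshape(n_filters, k, k)

rng = np.random.default_rng(0)
imgs = rng.random((5, 20, 20))                # dummy grayscale images
filters = learn_pca_filters(imgs)
```

Each learned filter is then applied by convolution; stacking two such stages and hashing the responses into block histograms yields the PCANet feature.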
Table 1 shows the correct recognition rate of each algorithm on the Georgia Tech dataset under different occlusion conditions.
It can be seen from Table 1 that the algorithm introducing non-convex regularization achieves a relatively high recognition rate under different occlusion conditions. When there is no occlusion, the recognition rates of the two sparsity-constrained algorithms are close. With 20% pure color occlusion and with salt-and-pepper noise occlusion, QNSPCANet achieves the highest recognition rate. Because few occlusion methods have been applied to the Georgia Tech dataset, this part is only compared with the related PCANet methods.
The correct recognition rates of each algorithm on the Color FERET dataset under different occlusion conditions are shown in Table 2.
As can be seen from Table 2, on the Color FERET dataset QNSPCANet achieves the highest recognition rate under non-noise occlusion. With 20% salt-and-pepper noise, its recognition rate is slightly lower than that of GMSRC, but the overall recognition performance is still superior.
The correct recognition rates of each algorithm on the AR dataset under different occlusion conditions are shown in Table 3. The recognition rates of all algorithms on the AR dataset are generally higher. QNSPCANet performs best under all occlusion conditions, and the gap in recognition rate widens when occlusion is present, indicating that the non-convex regularization suppresses both outliers and occlusion well.
The correct recognition rates of each algorithm on the LFW-A dataset under different occlusion conditions are shown in Table 4. It can be seen from the table that QNSPCANet, which introduces non-convex regularization, achieves a higher recognition rate than PCANet and QPCANet under all occlusion conditions. Compared with QSPCANet, its recognition rate is close without occlusion, while with occlusion it is greatly improved. Compared with the other existing methods in the table, the proposed algorithm also achieves a better recognition effect. Although there is still a gap between QNSPCANet and MobileFaceNet without occlusion, QNSPCANet performs better under occlusion and noise.
Figure 4 shows the recognition rate curves of each algorithm under different solid color occlusion areas. As can be seen from the figure, as the occlusion area increases, the gap between the four algorithms' recognition accuracy gradually widens, and QNSPCANet performs better across all occlusion areas.
Since the literature on the other occlusion methods does not report specific training times, this paper only compares the training times of the four PCANet algorithms, as shown in Table 5.
As can be seen from Table 5, the overall training times of the four PCANet-related algorithms are short. The training time of the proposed QNSPCANet is longer, mainly because the non-convex sparse optimization incurs additional computation; however, the recognition accuracy is improved, and the overall recognition performance remains superior.
Based on the above comparative results, in the case of no occlusion the non-convex sparse convolution kernels improve the recognition rate, but the effect is not obvious: the images then contain few outliers, which have only a slight impact on the recognition results. Under occlusion, the strong sparsity of the non-convex regularization effectively reduces the influence of occlusion-induced outliers and thus improves the recognition accuracy of the model.
3.2. Algorithm Sparsity Verification
Secondly, the sparsity of the QNSPCANet proposed in this paper is verified. The validation experiment is carried out on the AR dataset, comparing the proposed non-convex regularization with the sparse regularization used in QSPCANet and with SCAD regularization [25]. To test the sparsity of the non-convex regularization, this section adopts two sparsity measures: the proportion of non-zero elements in the sparse matrix and Hoyer's sparsity measure. Hoyer's measure also accounts for components with small values and is therefore a more refined measure than the non-zero proportion; it is defined as

Hoyer(W_i) = (√N − ‖W_i‖₁ / ‖W_i‖₂) / (√N − 1),

where W_i represents the quaternion sparse vector matrix in the i-th convolution layer, and N represents the number of elements of the matrix.
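The two sparsity measures can be sketched as follows. This is a real-valued sketch assuming the standard Hoyer definition; for a quaternion matrix, the moduli of the entries would be used, and the tolerance for "non-zero" is an illustrative choice.

```python
import numpy as np

def hoyer_sparsity(W):
    """Hoyer's sparsity measure: 1 for a one-hot vector, 0 when all entries
    have the same magnitude."""
    x = np.abs(np.asarray(W, dtype=float)).ravel()
    n = x.size
    return (np.sqrt(n) - x.sum() / np.linalg.norm(x)) / (np.sqrt(n) - 1)

def nonzero_fraction(W, tol=1e-8):
    """Proportion of entries whose magnitude exceeds a small tolerance."""
    x = np.asarray(W, dtype=float)
    return float(np.mean(np.abs(x) > tol))
```

For example, a one-hot vector gives a Hoyer sparsity of 1, while a constant vector gives 0, so larger values indicate a sparser matrix.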
The convergence curves of the proportion of non-zero elements in the sparse matrix and of Hoyer's sparsity are shown in Figure 5.
As can be seen from Figure 5, the proportion of non-zero elements converges to 0.56 under the sparse regularization of QSPCANet, with a Hoyer's sparsity of about 0.70; to 0.41 under the proposed non-convex regularization, with a Hoyer's sparsity of about 0.77; and to 0.46 under SCAD regularization, with a Hoyer's sparsity of about 0.75. Relative to the QSPCANet regularization, the non-convex regularization reduces the proportion of non-zero elements by 26.8% and SCAD reduces it by 17.9%, while Hoyer's sparsity is improved by 10.0% and 7.1%, respectively. Therefore, the proposed non-convex regularization has stronger sparsity than both SCAD and the QSPCANet regularization.
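The relative changes reported above can be verified directly from the converged values; the variable names below are illustrative.

```python
# Converged values read from Figure 5 as reported in the text.
base_nz, ncvx_nz, scad_nz = 0.56, 0.41, 0.46   # non-zero proportions
base_h, ncvx_h, scad_h = 0.70, 0.77, 0.75      # Hoyer's sparsity

nz_drop_ncvx = (base_nz - ncvx_nz) / base_nz   # relative drop, non-convex
nz_drop_scad = (base_nz - scad_nz) / base_nz   # relative drop, SCAD
h_gain_ncvx = (ncvx_h - base_h) / base_h       # relative gain, non-convex
h_gain_scad = (scad_h - base_h) / base_h       # relative gain, SCAD

print(f"{nz_drop_ncvx:.1%} {nz_drop_scad:.1%} {h_gain_ncvx:.1%} {h_gain_scad:.1%}")
```

The computed ratios match the 26.8%, 17.9%, 10.0%, and 7.1% figures quoted in the text.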