1. Introduction
In the field of pattern recognition, subspace clustering (SC) is an important research topic [1,2,3,4]. In recent decades, researchers have developed many subspace clustering algorithms, among which spectral-type methods have shown good performance.
Sparse subspace clustering (SSC) [5] and low-rank representation (LRR) [6], which have achieved significant success, are two typical self-representation subspace clustering methods. A block diagonal structure is a matrix form in which the non-zero elements are confined to square blocks along the main diagonal, with all elements outside these blocks being zero. Recent studies [7,8] have shown that a block diagonal structure in the learned low-dimensional subspace representation helps to obtain the correct clustering results. However, SSC and LRR pursue the block diagonal representation (BDR) matrix only indirectly, since they impose the L1-norm and the nuclear norm, respectively, on the subspace representation. Furthermore, Feng et al. [8] imposed a block diagonal prior on the subspace representation matrices obtained using SSC and LRR, which improved their clustering performance. However, Feng's method is difficult to optimize since its rank constraint is NP-hard. To tackle this problem, Lu et al. [7] developed a simple BDR model that relaxes the rank constraint; compared with Feng's method, BDR is easier to optimize since it is smooth. Xu et al. [9] developed a projective learning model for BDR to deal with the large-scale subspace clustering problem. Xing et al. [10] proposed an enhanced version of DBSCAN, a highly prevalent data mining algorithm, which improves the clustering process using the block diagonal property of affinity matrices. Meanwhile, Guo et al. [11] put forward a spectral clustering algorithm with BDR for large-scale datasets.
The above-mentioned approaches are single-view methods, since they assume that there is only one data source. In practice, however, data generally come from various sources; for instance, one event can be represented by images, text, and videos. As such, multi-view clustering (MVC) methods, which often demonstrate better clustering performance than single-view methods [12,13,14,15,16,17,18,19], are becoming increasingly popular.
The authors of [20] proposed co-training learning to fuse multi-view features. Additionally, the study in [21] investigated the key factors contributing to the success of the co-training method. However, the co-training learning model is not robust enough against noise pollution, which can lead to error amplification. Kumar et al. [17] proposed an MSC framework in which the clustering hypotheses among views are co-regularized. Graph-based methods are another category of MSC methods; they generally use a multiple-graph fusion strategy to exploit the information among different views. De Sa [22] developed a two-view clustering method, which utilizes the different information between two views by constructing bipartite graphs. Moreover, the authors of [13] developed a robust multi-view spectral clustering algorithm based on low-rank and sparse decomposition (RMSC), achieving encouraging results on several real datasets. In the work of Cao et al. [15], diversity-induced MSC (DiMSC) was presented, leveraging the Hilbert–Schmidt Independence Criterion (HSIC), which plays a key role in exploiting complementary information among different views to enhance the clustering.
By assuming that the different views of an object come from a shared latent subspace, subspace learning MVC methods can be developed to capture this shared subspace. Blaschko and Lampert [23] introduced a spectral clustering technique that utilizes canonical correlation analysis (CCA), in its linear and kernel forms, for dimensionality reduction. In [24], a low-rank common subspace (LRCS) MVC method was proposed, which obtains compatible intrinsic information among views by using a common low-rank projection.
These MVC methods show promising performance in clustering applications; however, they only use pairwise associations between different views and may overlook the higher-order associations hidden in multi-view data [12,14,19,25,26,27]. Zhang et al. [12,19] developed a novel multi-view spectral clustering method named LTMSC, which incorporates low-rank tensor constraints. In this method, the subspace representations are stacked into a single tensor, which makes it possible to explore the higher-order relationships hidden in the multi-view data. Lu et al. [28] introduced an MSC method with hyper-Laplacian regularization and low-rank tensor constraints (HLR-MSCLRT), which can uncover the local information hidden in the data manifold. Nevertheless, the tensor norm employed in both LTMSC and HLR-MSCLRT lacks a clear physical interpretation.
Zhang et al. recently introduced the tensor nuclear norm (TNN) [29], leveraging the tensor singular value decomposition (t-SVD). The TNN, defined as the sum of the singular values, provides a rigorous measure of the low-rankness of tensor data. In [14], Xie et al. developed a t-SVD-based MSC model, namely t-SVD-MSC, which preserves the low-rank property through the TNN. With the TNN, t-SVD-MSC can more effectively explore the complementary information among all the views [30,31,32,33,34]. Furthermore, in [18], an essential tensor learning method for MSC with a TNN constraint, known as ETLMSC, was proposed. Pan et al. [35] proposed a non-negative non-convex low-rank tensor kernel function in an MSC model (NLRTGC) to reduce the bias introduced by the rank. To exploit high-dimensional hidden information, Pan et al. [36] proposed a low-rank fuzzy MSC learning algorithm with a TNN constraint (LRTGFL). Peng et al. [37] designed log-based non-convex functions to approximate the tensor rank and tensor sparsity in the Finger-MVC model; these are more precise than their convex counterparts. Wang et al. [38] integrated noise elimination and subspace learning into a unified MSC framework that captures the high-order associations of views constrained by the TNN. Du et al. [39] proposed a robust t-SVD-based multi-view clustering method that simultaneously uses low-rank and local smoothness priors. Luo et al. [40] used an adaptively weighted tensor Schatten-p norm with an adjustable p-value to eliminate the biased estimate of the rank.
The optimized BDR structure in affinity matrices inherently encodes cluster information, thereby substantially enhancing clustering efficacy. Low-rank tensor representations intrinsically capture the latent high-order correlations across multi-view data through subspace embeddings, leading to notable clustering improvements. In this paper, inspired by the optimized BDR structure and low-rank tensor representations, we propose a novel MSC method called TMSC-TNNBDR, which integrates the advantages of the TNN and BDR. The proposed model imposes BDR constraints on each subspace representation matrix, and all affinity matrices are combined into a tensor regularized by the TNN. Finally, an efficient optimization algorithm based on the augmented Lagrange multiplier (ALM) method is developed.
The primary contributions of our work are as follows:
The proposed TMSC-TNNBDR incorporates a BDR regularizer, which promotes a more pronounced block diagonal structure and improves clustering robustness.
In the TMSC-TNNBDR model, a TNN constraint is imposed on the stacked representation tensor, under which TMSC-TNNBDR captures the global structure across all views, thereby effectively exploiting the latent complementary information and high-order interactions among views.
We propose an ALM-based optimizer for TMSC-TNNBDR. The proposed method demonstrates superior clustering performance over comparative algorithms while maintaining competitive computational efficiency.
The remainder of this work is organized as follows. In Section 2, we summarize the notations used and some preliminary definitions. In Section 3, we briefly review two related methods, namely LRR [6] and BDR [7]. We then propose TMSC-TNNBDR and its solving procedure in Section 4. Subsequently, we document the experimental findings in Section 5. Finally, we conclude our work in Section 6.
4. The Proposed TMSC-TNNBDR
In this section, we introduce the TMSC-TNNBDR framework that extends classical LRR and BDR approaches. Subsequently, we derive an ALM-based optimization scheme to solve the resulting non-convex problem.
4.1. Problem Formulation
For the vth view, let its feature matrix and its subspace coefficient matrix be given. The loss function of TMSC-TNNBDR is formulated as follows:
where the tensor construction operator is a function that stacks all the subspace coefficient matrices (v = 1, 2, …, V) into a tensor and then applies a rotation transformation to the resulting tensor.
In Equation (10), the first term is the self-representation reconstruction error, the second term imposes the BDR constraint on the subspace representation, the third term imposes the TNN low-rank constraint on the stacked tensor, and the last term can be seen as a robust PCA term that removes the noise contained in H(v). Moreover, the three weights balancing these terms are tunable hyperparameters.
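For readability, a schematic of this objective is given below, written under assumed notation: $X^{(v)}$ for the feature matrix, $Z^{(v)}$ for the subspace coefficient matrix, $E^{(v)}$ for the noise term, $\Phi(\cdot)$ for the stack-and-rotate operator, $\|\cdot\|_{\mathrm{BD}}$ for the block diagonal regularizer, $\|\cdot\|_{\circledast}$ for the TNN, and $\lambda_1,\lambda_2,\lambda_3$ for the hyperparameters. The exact norms and constraints are those of the original Equation (10), so the following should be read as a sketch rather than the precise formulation:

```latex
\min_{\{Z^{(v)}\},\,\{E^{(v)}\}}\;
\sum_{v=1}^{V}\tfrac{1}{2}\bigl\|X^{(v)}-X^{(v)}Z^{(v)}-E^{(v)}\bigr\|_F^{2}
+\lambda_1\sum_{v=1}^{V}\bigl\|Z^{(v)}\bigr\|_{\mathrm{BD}}
+\lambda_2\bigl\|\Phi\bigl(Z^{(1)},\dots,Z^{(V)}\bigr)\bigr\|_{\circledast}
+\lambda_3\sum_{v=1}^{V}\bigl\|E^{(v)}\bigr\|_{2,1}
```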
4.2. Optimization
The loss function of TMSC-TNNBDR, i.e., Equation (10), can be optimized through the ALM. The theorem underlying the block diagonal regularizer is described as follows:
Theorem 1 ([41]). Suppose L is a symmetric positive semi-definite matrix; then, the following holds:
$$\sum_{i=n-k+1}^{n}\lambda_{i}(L)\;=\;\min_{W}\;\langle L,\,W\rangle,\quad \mathrm{s.t.}\;\;0\preceq W\preceq I,\;\operatorname{tr}(W)=k,$$
where $\lambda_{i}(L)$ denotes the i-th largest eigenvalue of L, so the left-hand side is the sum of the k smallest eigenvalues of L. In accordance with Theorem 1, Equation (10) can be rewritten as Equation (12):
To solve Equation (12), an auxiliary tensor variable is introduced to replace the stacked representation tensor. Then, the loss function of TMSC-TNNBDR is converted into the following:
Equation (13) is then converted into the augmented Lagrangian formulation, as follows:
where the introduced Lagrange multiplier and penalty parameter correspond to the equality constraint between the representation tensor and the auxiliary tensor.
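As a point of reference, a generic augmented Lagrangian for an equality constraint of this kind, written under assumed notation ($\mathcal{Z}$ for the stacked representation tensor, $\mathcal{G}$ for the auxiliary tensor, $\mathcal{Y}$ for the Lagrange multiplier, $\mu$ for the penalty parameter, and $f(\cdot)$ collecting the remaining terms of Equation (13)), takes the following form:

```latex
\mathcal{L}_{\mu}
= f\bigl(\{Z^{(v)}\},\{E^{(v)}\},\mathcal{G}\bigr)
+ \bigl\langle \mathcal{Y},\,\mathcal{Z}-\mathcal{G} \bigr\rangle
+ \frac{\mu}{2}\bigl\|\mathcal{Z}-\mathcal{G}\bigr\|_F^{2}
```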
We obtain the solutions to all the variables by alternately minimizing Equation (14) with respect to each variable while fixing the others. The steps are described as follows:
Subspace representation subproblem: to compute the subspace coefficient matrix of each view, we fix the other variables and tackle the following problem:
Differentiating with respect to this variable and setting the derivative to zero, we obtain the following:
W-subproblem: the auxiliary matrix W introduced by Theorem 1 is computed as follows:
For Equation (17), the closed-form solution is W = UU^T, where U is the matrix formed by concatenating the k eigenvectors that correspond to the k smallest eigenvalues of the corresponding Laplacian matrix [7].
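A minimal sketch of this eigenvector-based update is given below, assuming that the Laplacian is built from a symmetric non-negative matrix B as L = Diag(B1) - B, as in [7]; the function and variable names are illustrative only:

```python
import numpy as np

def update_W(B, k):
    """Closed-form update from Theorem 1: W = U U^T, where the columns of U
    are the eigenvectors of the Laplacian L associated with its k smallest
    eigenvalues (assumed construction L = Diag(B 1) - B, following [7])."""
    B = (B + B.T) / 2.0                      # enforce symmetry
    L = np.diag(B.sum(axis=1)) - B           # graph Laplacian of B
    eigvals, eigvecs = np.linalg.eigh(L)     # eigenvalues returned in ascending order
    U = eigvecs[:, :k]                       # eigenvectors of the k smallest eigenvalues
    return U @ U.T
```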
Block diagonal matrix subproblem: the block diagonal matrix can be computed as follows:
Equation (18) can be converted into the following:
The theorem in [7] enables Equation (19) to be solved.
Tensor subproblem: we fix the other variables and update the auxiliary low-rank tensor as follows:
The solution to Equation (20) can be obtained using the theorem in [14,29].
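The closed-form solution referenced here is typically realized as tensor singular value thresholding (t-SVT): apply the FFT along the third mode, soft-threshold the singular values of each frontal slice, and transform back [14,29]. The following is a minimal sketch under that assumption; depending on the TNN normalization used, the threshold tau may need to be scaled by the size of the third mode:

```python
import numpy as np

def tsvt(F, tau):
    """Tensor singular value thresholding: an assumed proximal operator of the
    TNN, i.e., a minimizer of tau*||G||_TNN + 0.5*||G - F||_F^2 via the t-SVD."""
    n1, n2, n3 = F.shape
    F_hat = np.fft.fft(F, axis=2)                  # FFT along the third mode
    G_hat = np.zeros_like(F_hat)
    for i in range(n3):                            # frontal slices in the Fourier domain
        U, s, Vh = np.linalg.svd(F_hat[:, :, i], full_matrices=False)
        s_thr = np.maximum(s - tau, 0.0)           # soft-threshold the singular values
        G_hat[:, :, i] = (U * s_thr) @ Vh
    return np.real(np.fft.ifft(G_hat, axis=2))     # back to the original domain
```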
Multiplier update: the Lagrange multiplier can be updated as follows:
Finally, the TMSC-TNNBDR procedure is outlined in Algorithm 1.
Algorithm 1 TMSC-TNNBDR
Input: the multi-view feature matrices, the hyperparameters, and the cluster number k.
Output: the clustering result.
Initialize: the subspace representations, the auxiliary variables, the Lagrange multiplier, and the penalty parameter.
While not converged do
  for v = 1 to V do
    Update the subspace representation in accordance with Equation (16);
    Update W in accordance with Equation (17);
    Update the block diagonal matrix in accordance with Equation (19);
  end
  Update the auxiliary tensor in accordance with Equation (20);
  Update the Lagrange multiplier in accordance with Equation (21);
  Update the penalty parameter;
  Check the convergence conditions.
end
Construct the affinity matrix S from the learned representations; perform spectral clustering on S.
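As a minimal illustration of the final step of Algorithm 1, the sketch below fuses the per-view representations into an affinity matrix (assuming the common choice of averaging the symmetrized absolute representations, which may differ from the exact rule used in the paper) and feeds it to spectral clustering; the name Z_list is hypothetical:

```python
import numpy as np
from sklearn.cluster import SpectralClustering

def cluster_from_representations(Z_list, k):
    """Build an affinity matrix S from the per-view representations and run
    spectral clustering on it (assumed fusion rule, for illustration only)."""
    V = len(Z_list)
    n = Z_list[0].shape[0]
    S = np.zeros((n, n))
    for Z in Z_list:
        S += (np.abs(Z) + np.abs(Z).T) / 2.0   # symmetrize each view's representation
    S /= V                                      # average over the V views
    model = SpectralClustering(n_clusters=k, affinity="precomputed")
    return model.fit_predict(S)
```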
4.3. Computational Complexity and Convergence
Calculating the subspace representation involves matrix multiplication and matrix inversion. Computing W is dominated by eigenvalue decomposition and matrix products. The computation of the block diagonal matrix mainly depends on matrix multiplication. As for the auxiliary tensor, its cost is determined by the tensor singular value decomposition. The per-iteration complexities of these steps together determine the total complexity of TMSC-TNNBDR.
The optimization problem of TMSC-TNNBDR is non-convex, which means that a globally optimal solution cannot be guaranteed. Nevertheless, TMSC-TNNBDR can converge to a locally optimal point. In fact, each subproblem in Algorithm 1 has a closed-form solution; consequently, the value of the loss function decreases monotonically and is bounded below. Clustering experiments performed on several classic datasets show that TMSC-TNNBDR converges stably.