Next Article in Journal
EDPNet (Efficient DB and PARSeq Network): A Robust Framework for Online Digital Meter Detection and Recognition Under Challenging Scenarios
Previous Article in Journal
Forced Oscillation Detection via a Hybrid Network of a Spiking Recurrent Neural Network and LSTM
Previous Article in Special Issue
Surface Defect Detection for Small Samples of Particleboard Based on Improved Proximal Policy Optimization
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Small Defects Detection of Galvanized Strip Steel via Schatten-p Norm-Based Low-Rank Tensor Decomposition

by
Shiyang Zhou
1,2,†,
Xuguo Yan
1,3,†,
Huaiguang Liu
1,2 and
Caiyun Gong
1,2,*
1
Key Laboratory of Metallurgical Equipment and Control Technology, Ministry of Education, Wuhan University of Science and Technology, Wuhan 430081, China
2
Hubei Key Laboratory of Mechanical Transmission and Manufacturing Engineering, Wuhan University of Science and Technology, Wuhan 430081, China
3
State Key Laboratory of Intelligent Manufacturing Equipment and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Sensors 2025, 25(8), 2606; https://doi.org/10.3390/s25082606
Submission received: 26 February 2025 / Revised: 12 April 2025 / Accepted: 17 April 2025 / Published: 20 April 2025
(This article belongs to the Special Issue Sensing and Imaging for Defect Detection: 2nd Edition)

Abstract

:

Highlights

What are the main findings?
  • Compared with the deep learning method for surface defect detection, the proposed SLRTD-based detection method is simpler and more effective in a galvanized strip steel production line.
What is the implication of the main finding?
  • The proposed SLRTD-based detection method of surface defect can be applied for other industrial products, such as glass, fabric, LCD, and AMOLED.

Abstract

Accurate and efficient white-spot defects detection for the surface of galvanized strip steel is one of the most important guarantees for the quality of steel production. It is a fundamental but “hard” small target detection problem due to its small pixel occupation in low-contrast images. By fully exploiting the low-rank and sparse prior information of a surface defect image, a Schatten-p norm-based low-rank tensor decomposition (SLRTD) method is proposed to decompose the defect image into low-rank background, sparse defect, and random noise. Firstly, the original defect images are transformed into a new patch-based tensor mode through data reconstruction for mining valuable information of the defect image. Then, considering the over-shrinkage problem in the low-rank component estimation caused by a vanilla nuclear norm and a weighted nuclear norm, a nonlinear reweighting strategy based on a Schatten p -norm is incorporated to improve the decomposition performance. Finally, a solution framework is proposed via a well-designed alternating direction method of multipliers to obtain the white-spot defect target image by a simple segmenting algorithm. The white-spot defect dataset from a real-world galvanized strip steel production line is constructed, and the experimental results demonstrate that the proposed SLRTD method outperforms existing state-of-the-art methods qualitatively and quantitatively.

1. Introduction

Galvanized strip steel is widely used in automobile manufacturing, household electrical appliances, and other daily-use products; surface defects might lead to some product quality issues for both in-progress and downstream products. The white-spot defects mainly caused by random zinc dross and ash in the hot dip or electro galvanizing process are considered the most serious threat to the steel surface quality due to their high concurrence and typical periodicity. This defect can be detected and recorded by the surface defect inspection based on machine vision, and then the quality issues with strip steel will be controlled at the early stage [1,2]. In fact, industrial production images frequently contain some prior information; for example, if a defect rarely appears, the vast majority of images show no defects. The white-spot defects are always ultra-tiny in size, occupying less than 5% of the whole raw image. The inherent issues of tiny size and low-contrast of white-spot defects under complex clutters and heavy noise constitute the significant obstacle in the detection and segmentation of the defect target: (a) they are sensitive to tiny defects because of their extreme small sizes, and (b) they cannot accurately locate the random appeared defects whose semantic relationship with the background context is weak.
In industrial machine vision, small or tiny object detection has always been a significantly studied issue, especially in industrial applications [3,4,5,6,7]. The research can be classified into three categories: traditional filtering-based methods, sparse and low-rank representation-based methods, and data-driven methods [8]. Filtering-based methods typically design relevant filtering operators in either grayscale or derivative spaces to suppress background noise and enhance targets. While these methods offer a certain level of real-time performance, they often require manual design of operators tailored to specific scenes when facing complex backgrounds, which limits their generalization ability and adaptability. Unlike the methods above that rely on manually designed features based on data priors, data-driven approaches utilize convolutional neural networks (CNNs) for deep feature extraction automatically [9,10]. Although these methods perform well and can extract deeper information, they depend heavily on image samples, which affects their real-time applicability. Due to the size limitations of small targets, which typically consist of only a few pixels, there is a risk of losing critical information during target detection, which may be ineffective or even degrade detection performance. In contrast, some traditional methods offer strong interpretability and maintain a certain level of robustness when faced with complex scenarios, maintaining certain advantages.
In recent years, tensor decomposition-based sparse and low-rank representation methods have achieved remarkable success, especially in infrared or remote sensing small target detection [11,12,13,14,15]. These methods leverage the low-rank nature of the infrared background and the sparse characteristics of small infrared targets; the background is modeled as a low-rank component, while the infrared targets are typically modeled as sparse components. The tensor model effectively preserves spatial structures and utilizes temporal information across multiple frames, which helps achieve more accurate target detection. These methods can combine spatial and temporal local information and iteratively optimize to approach the optimal detection results, significantly improving robustness to detect small targets in complex scenes. As the defect is random in the production line of strip steel, different image frames are irrelated, and the spatial and temporal information between different image frames are lost. As single-frame information is limited, these methods often miss targets or retain excessive false alarms when dealing with edge overlapping or complex backgrounds, thus failing to provide optimal decomposition guidance without temporal information across frames [16]. As shown in Figure 1, the white-spot defect of galvanized steel sheet is random and keeps changing all the time; small or tiny white-spot defects among 4096 × 2048 raw images are still easily ignored due to insufficient appearance information. At the same time, the highly varied nature of the background image and small target characteristics make the detection process extremely difficult.
Motivated by the above discussions, we establish the adaptive tensor model to accurately detect white-spot defects on galvanized steel sheet. The main contributions of this article are outlined as follows:
  • We propose an SLRTD method by digging out inter-patch correlation-ships of surface defect images of galvanized strip steel. The separated defect foreground target information with sparse outliers is embedded in the background of low-rank representation.
  • To achieve an accurate estimation of non-defect background rank, we incorporate weighted Schatten p -norm regularization for the background component, allowing for better noise removal while preserving edges, ultimately leading to improved detection results. Concurrently, a nonlinear reweighting strategy and tensor singular value decomposition (t-SVD) are adopted to help the model more delicately balance the low-rank and sparse components throughout the iterative process, which elevates the separation accuracy between the defect target and non-defect background.
  • According to the alternating direction method of multipliers (ADMM), we solve the sparse and low-rank decomposition problem. Experiment results demonstrate the feasibility and effectiveness of the proposed SLRTD method.
The remainder of this paper is structured as follows: Section 2 provides a concise overview of related work. Section 3 presents a comprehensive introduction to the architecture and components of the proposed SLRTD approach. Section 4 demonstrates numerical comparisons and module analysis. Finally, Section 5 concludes the paper. In our paper, tensors, matrices, and vectors are denoted using calligraphic letters, uppercase, and lowercase italics.

2. Related Works

In this section, we provide a brief overview of existing defect detection methods based on filtering, data-driven, and tensor decomposition methods.

2.1. Filtering-Based Methods

Generally, filter-based methods design relevant filtering operators in grayscale or derivative spaces to suppress background clutter and enhance the target. These methods can be further classified into the threshold method (e.g., Otsu, cross-entropy), morphological method (e.g., morphological operations, template matching), and spectral method (e.g., Fourier transform, Gabor transform, wavelet transform). However, the threshold and morphological methods often need manual adjustments to find the defect regions, especially when the distributions of scene intensity and texture are complicated. The images are transformed to frequency–domain in the spectral method, which often miss some small defects targets. These methods show poor adaptability and robustness [17].

2.2. Data-Driven-Based Methods

During the past few years, tremendous efforts have been devoted to small defect target detection based on deep learning [18]. Zhang et al. [19] constructed a novel feature enhancement network to improve the performance of small object detection. Hou et al. [20] proposed a contextual information and spatial attention that is based on a network for detecting small defects in the manufacturing industry. Yang et al. [21] introduced an RSADUnet model for metal surface tiny defect inspection. However, the satisfactory performance of these methods relies heavily on expensive labeled images, large numbers of parameters, and computational complexity to obtain strong feature awareness. Additionally, they are always used for natural or medical scenes where the shape of the specific object does not change much. Thus, these methods may not be suitable for steel surface defect detection in which the defect is random, especially for small defects with varied shapes and sizes. At the same time, a small defect occupies small pixels and lacks obvious texture and structure characteristics, thus it is difficult to provide reliable defect features.

2.3. Tensor Decomposition-Based Methods

In contrast, the tensor decomposition-based sparse and low-rank model always transforms the detection problem into an iterative optimization task by exploring new representations of the target and background features. Each patch is directly used as the obverse side slice to construct tensor data, which can guarantee the local features of the defect image can be reserved and is conductive to withdrawing the prior information. Many methods proposed different types of regularization to constrain low-rank and sparse attributes. The reweighted infrared patch-tensor (RIPT) model proposed by Dai et al. [22] utilizes the target sparse prior and background nonlocal self-correlation prior, and the local structure weight is designed for the target tensor term and serves as an edge indicator in the weighted model. However, RIPT still has limitations because the nuclear norm cannot accurately estimate the background. To alleviate this issue, inspired by t-SVD and a non-convex approximation of rank based on the tensor nuclear norm, Lu et al. [23] exploited the tensor robust principal component (TRPCA) method, which regularizes all singular values of tensor data equally and shrinks all singular values with the same parameter. Zhang et al. [24] introduced a novel nonconvex low-rank constraint, the partial sum of the tensor nuclear norm (PSTNN), combined with a weighted l1 norm to effectively suppress the background while effectively preventing target over-shrinkage. Gao et al. [25] developed an enhanced TRPCA (ETRPCA) model by the weighted tensor Schatten p -norm minimization, which makes the large singular values shrink less in the tensor nuclear norm minimization. The Schatten p -norm can better approximate the NP-Hard problem, optimize the recovery of its convex relaxation, and will not over-shrink the low-rank components of the data. Chen et al. [26] introduced logarithmic norm regularization as the nonconvex surrogate of matrix and tensor rank, which achieves more accurate low-rank approximation and high computational efficiency.
As it does not consider the influence from noise and clutter, the target detection effect will be reduced greatly in the face of the defect image in a more complex background. Luo et al. [27] utilized the sparse components of small targets along with a background dictionary, employing the fully connected tensor nuclear norm for low-rank estimation. Wang et al. [28] built a unique regularization term as a tensor-correlated total variation, which essentially encodes both low-rank and sparse priors of a tensor simultaneously. Geng et al. [29] developed a nonconvex and nonlocal TRPCA (NN-TRPCA) model based on the tensor adjustable logarithmic norm, which adaptively shrinks small singular values more and shrinks large singular values less. Huang et al. [30] introduced a two-stage feature complementary improved tensor low-rank sparse decomposition method, which is divided into two stages: tensor initialization and tensor decomposition, effectively integrating local and nonlocal features. Based on above analysis, the detection capability of these methods is dependent on the contrast between the target and the background. When there is strong contrast, these methods yield good detection results; otherwise, they tend to have a high false-positive rate. Moreover, these methods depend on the measurement of background rank in terms of accuracy, and obviously reduce performance and are time-consuming in the face of dim targets with complex backgrounds.

3. Methodology

The overall framework of the proposed SLRTD method is shown in Figure 2, which consists of construction of tensor model for defect image, model solution, and model analysis.

3.1. Construction of Tensor Model for Defect Image

Tensor can be expanded into matrix along n-modes and its i-th frontal slice are denoted as X i . We denote X ¯ n 1 × n 2 × n 3 as the result of Discrete Fourier Transformation (DFT) along its third dimension by using X ¯ , that is, X ¯ = fft X ,   , 3 . The inverse operator computes X from X ¯ , that is, X = fft X ¯ ,   , 3 . The tensor nuclear norm is defined as X = i = 1 r S i , i , j = 1 n 3 i = 1 r j = 1 n 3 S ¯ i , i , j , where r = rank t X and S ¯ i , i , 1 are the entries on the diagonal of the first slice of S ¯ , which has a decreasing order property. And we denote the l0 norm, the l1 norm, and the Frobenius norm as X 0 , X 1 , X F , respectively.
The defect image is mainly composed of defect target, background, and noise, which can be expressed as follows:
f D = f T + f B + f N
where f D , f T , f B , and f N represent the input of original defect image, defect image, background image, and noise defect, respectively.
The original defect image I is two-dimensional data, which can be converted to tensor data. Assume original defect image I and the tensor structure obtained after data reconstruction is D . The image I will be browsed by m × n sliding window from left to right and from top to bottom; the total sliding time is l ; the small patch obtained each time as the frontal slice of D m × n × l is shown in Figure 2. The tensor model of defect image can be constructed as follows:
D = T + B + N
where D , T , B , and N represent defect image patch-tensor, defect target patch-tensor, background patch-tensor, and random noise patch-tensor, respectively.
Defect target patch-tensor T :
For defect image from the strip steel production line, defects with fewer pixels with respect to the whole image, it is sparse compared to most of the normal background regions. Thus, the corresponding defect target patch-tensor is an extremely sparse tensor, which can be depicted as follows:
T 0 d
where d is an integer that is related with defect target characteristics of number and size.
Background patch-tensor B :
As illustrated in Figure 1, the gray-value of normal regions is almost uniform, which means that the local patches are highly correlated with each other. Actually, the mode-1, mode-2, and mode-3 unfolding matrices of the background patch-tensor are also low-rank. In Figure 3b–d, the horizontal axes denote the number of singular values; the vertical axes denote the singular values. Assuming the defect image is 200 × 200, the patch is 40 × 40 and the step size is 10, the patch-tensor D 40 × 40 × 289 , the size of mode-1, mode-2, and mode-3 unfolding matrices are 40 × 11,560, 40 × 11,560, 289 × 1600, respectively. From Figure 3, it can be observed that the singular values of three mode-unfolding matrices all exhibit a sharp decreasing trend and decline into approximate zero rapidly, with only a small fraction being significantly greater than zero. It indicates that the background patch-tensor B is correlated along each mode direction and further reflects its inherent low-rank structure within the subspaces corresponding to each mode [31,32]. Based on this property, we can consider the background patch-tensor B as a low-rank tensor, and their unfolding matrices are also all low-rank defined as follows:
rank B 1 r 1 rank B 2 r 2 rank B 3 r 3
where B 1 , B 2 , and B 3 denote model-1, model-2, and model-3 unfolding matrices of B , respectively; r 1 , r 2 , and r 3 denote the background complexity.
Noise patch-tensor N :
The noise is usually modeled as additive white Gaussian noise, and it satisfies N F 2 δ , where δ > 0 denotes the Gaussian noise level.
Based on above analysis, the small white-spot defects detection on galvanized strip steel based on tensor decomposition can be formulated as follows:
min T , B , N rank B + λ T 0 + η N F 2 s . t .   D = B + T + N
where rank · denotes matrix rank and · 0 denotes l 0 norm, · F 2 denotes F norm, λ and η denote regularization parameters to balance the low-rank and sparse characteristics.
A reweighted strategy is always adopted to solve low-rank and sparse tensor decomposition. The principle of weighted Schatten p -norm minimization (WSNM) is to assign different weights to different singular values [33,34], which can recover the low-rank component and separate target from background more accurately by adjusting p . Therefore, we incorporate the WSNM to improve the detection accuracy of defect target. It can be defined as follows:
B W p = 1 n 3 i = 1 r j = 1 n 3 W i , i , j S ¯ i , i , j p 1 p
where n 3 is the number of patches, W i , i , j = ξ n 1 n 2 S ¯ i , i , j + E B denotes a weight tensor, which is defined in Algorithm 1; S ¯ : , : , j represents the singular values of B : , : , j , ξ is a parameter, E B is a positive constant.
Algorithm 1: Solving Equation (11)
Input: X n 1 × n 2 × n 3 , p
Output: F λ X
step 1: Conduct FFT operation: X ¯ = fft X ,   , 3
step 2: Conduct SVD operation on each frontal slice X ¯ i of X ¯
      for i = 1 , 2 , , n 3 + 1 2  do
         U , Σ , V = svd X ¯ i , Σ = diag σ 1 , , σ r
        Compute w = w 1 , w 2 , , w r
          for j = 1 , 2 , , r  do
               δ i = GST σ i , λ w i , p
          end for
         Σ = diag δ 1 , , δ r , W ¯ i = U Σ V T
      end for
      for i = n 3 + 1 2 + 1 , , n 3  do
            W ¯ i = conj W ¯ n 3 i + 2
      end for
step 3: Compute F X = ifft W ¯ ,   , 3
Additionally, l 0 norm is replaced by l 2 , 1 norm for solving the model. Hence, Equation (5) can be transformed as follows:
min T , B , N B W p + λ T 2 , 1 + η N F 2 s . t .   D = B + T + N

3.2. Model Solution

In order to solve Equation (7), the ADMM are selected to solve the separable convex optimization problem. The Lagrange function can be constructed as shown in Equation (8), which can be converted to Equation (9), where, Y r × r × n 3 , μ and · denote the Lagrangian multiplier tensor, inner product among tensors, respectively.
O B , T , N , Y , μ = B W p + λ T 2 , 1 + η N F 2 + μ 2 D B T N F 2 + Y ,   D B T N
O B , T , N , Y , μ = B W p + λ T 2 , 1 + η N F 2 + μ 2 D B T N + Y μ F 2
Based on ADMM, O can be broken into several sub-problems for iterative update:
(i) Fix the rest of the variables and update of B , which is defined by Equation (10):
B k + 1 = arg min B 1 2 D T k N k + Y k μ k B k F 2 + 1 μ k B W p
It can be solved by Generalized Soft-Thresholding (GST) and tensor singular value thresholding (t-SVT) [35] as follows:
B k + 1 = F 1 μ k D T k N k + Y k μ k
where F · denotes solving WSNM-tensor problem, and the detailed process is provided in Algorithm 1. · k represents the value of variables in k-th iteration.
(ii) Fix the rest of the variables and update of T , which is confirmed by Equation (12):
T k + 1 = arg min T λ T 2 , 1 + μ k 2 D B k + 1 N k + Y k μ k T F 2
As l 2 , 1 -norm of T is defined as the sum of l 2 -norm of each mode-2 fiber, we matricize each tensor along the 2nd mode, so T k + 1 2 , 1 = T 2 k + 1 2 , 1 . It can be transformed into the matrix form with Equation (13):
T 2 k + 1 = arg min T 2 λ T 2 2 , 1 + μ k 2 D 2 B 2 k + 1 N 2 k + Y 2 k μ k T 2 F 2
Let X = D 2 B 2 k + 1 N 2 k + Y 2 k μ k , so
T 2 k + 1 = arg min T 2 λ T 2 2 , 1 + μ k 2 X T 2 F 2
According to [36], it has the following close-form solution with Equation (15):
T 2 k + 1 : , j = X : , j 2 λ μ k X : , j 2 X : , j ,             if   X : , j 2 > λ μ k 0 ,                                                             otherwise
where X : , j represents the j-th column of the matrix X .
After T 2 k + 1 is solved, it can be transformed into tensor form T k + 1 .
(iii) Fix the rest of the variables and update of N is obtained by Equation (16):
N k + 1 = arg min N η N F 2 + μ k 2 D B k + 1 T k + 1 + Y k μ k N F 2
Differentiating it with respect to N , and let it be zero:
2 η N μ k D B k + 1 T k + 1 + Y k μ k N = 0
Then, we have the following:
N k + 1 = μ k 2 η + μ k D B k + 1 T k + 1 + Y k μ k
(iv) Fix the rest of the variables and update of Y :
Y k + 1 = Y k + μ k D B k + 1 T k + 1 N k + 1
(v) Update of μ :
μ k + 1 = min ρ μ k , μ max
where 0 < ρ < 1 , μ max = 10 5 .
(vi) Iteration termination condition:
D B k + 1 T k + 1 N k + 1 F D F < ε
where 10 3 < ε < 10 5 .
Finally, we summarize the algorithmic process of the proposed SLRTD method in Algorithm 2.
From target-background separation, the tensor D is decomposed into the defect target T and background B . The defect target image f T and background image f B are reconstructed from T and B , respectively. Finally, the defect targets can be extracted and segmented by the adaptive thresholding segmentation method.
Algorithm 2: Solving Equation (7) by ADMM
Input: Original defect image sequence tensor D R ^ m × n × l , power   p ,   λ , η
Output: B , T , N
Initialize: D , T 0 = B 0 = N 0 = 0 , W = I , Y = 0 , μ max = 10 5 , ρ = 1.1 , k = 0
While: not converged, do
step 1: Update B k + 1 by Equation (11)
step 2: Update T k + 1 by Equation (15)
step 3: Update N k + 1 by Equation (16)
step 4: Update Y k + 1 by Equation (19)
step 5: Update μ k + 1 by Equation (20)
step 6: Check the convergence condition
step 7: Update k = k + 1
end While

3.3. Model Analysis

3.3.1. Computational Complexity

For surface defect image D, it could be constructed into the tensor D m × n × l with sliding window’s size m × n and times l . The computation complexity of the solution of B includes three operations [34] of FFT, SVD, and GST; FFT operation is O A m n l log l , SVD operation is O A m n 2 l + 1 2 , GST operation is O B m n , where A denotes iterations number in Algorithm 1, B denotes iterations number in GST. Thus, the computation complexity of solution B is O A m n l log l + A m n 2 l + 1 2 + B m n . For solution T , the computation complexity is O m n . Therefore, the whole computation complexity of model is O C A m n l log l + A m n 2 l + 1 2 + B m n + m n , where C is the iterations number in ADMM.

3.3.2. Convergence of Algorithm

The convergence of model Equation (9) has been proven in [25]. We evaluate the convergence of the proposed SLRTD method to empirically show the convergence through experiments in different iterations. We set a convergence error value of D B T N F / D F and found that the algorithm takes approximately 0.5 s per frame for small target detection. Figure 4 shows the variation in convergence error as the number of iterations progresses from 0 to 30. It can be observed that after 10 iterations, the error of our algorithm gradually decreases, and the convergence curve stabilizes, indicating that our method meets the convergence requirements and successfully achieves optimal or near-optimal solutions.

4. Experiment

To evaluate the effectiveness of the proposed SLRTD method, we conduct a series of qualitative and quantitative experiments. Firstly, the details of data collection and preprocessing, and evaluation metrics are described. Secondly, the key parameters and robustness are analyzed. Finally, the performance comparison is conducted between our method and the other four typical methods.

4.1. Experimental Setup

4.1.1. Data Collection and Preprocessing

The proposed SLRTD method is validated on a dataset from a real-world galvanized strip steel production line, and the image acquisition platform can be observed in Figure 5. We crop the raw images’ 4096 × 1024 pixels into 200 × 200 pixels to make the dataset. In total, we obtain 100 defect images and 100 non-defect images. For each image, the pixel-level ground-truth is manually marked by using “1” to denote defective pixels and “0” to denote defect-free pixels. The size of the white-spot defect in the production line is about 1 mm to 3 mm, which is about 5 to 8 pixels in the 200 × 200 image. Defect samples and the corresponding masks of the constructed datasets are shown in Figure 6.

4.1.2. Evaluation Metrics

In order to evaluate the detection performance of the proposed SLRTD model, Precision, Recall, Precision-Recall (P-R) curve, receiver operating characteristic (ROC) curve, area under ROC curve (AUC) average F-Measure (Fζ), and mean square error (MAE) are adopted as evaluation metrics.
Precision is used to evaluate how many positive pixels are correctly classified. It can be defined as follows:
Precision = T P T P + F P
where true positive ( T P ) indicates the number of defect pixels which are correctly classified as defect, false positive ( F P ) indicates the number of background pixels mistakenly identified as defect.
Recall is used to evaluate how many pixels are correctly classified. It can be defined as follows:
Recall = T P T P + F N
where false negative ( F N ) indicates the number of defect pixels that are not classified to defect class.
F ζ represents the weighted harmonic mean of precision and recall. It can be defined as follows:
F ζ = 1 N i = 1 N ζ 2 + 1 × Precision × Recall ζ 2 × Precision + Recall
MAE measures the dissimilarity between segmentation images B W and corresponding ground-truth image G . It can be defined as follows:
MAE = i = 1 H j = 1 W B W i , j G i , j H × W
where H and W denote height and width of surface defect image.
Using Otsu’s threshold, we determine the Precision and Recall by changing the thresholds from 0 to 255 to obtain pairs of Precision and Recall.

4.2. Validation of the Proposed SLRTD Method

4.2.1. Parameter Analysis

For the proposed SLRTD method, the patch size, sliding step size, and value of Schatten- p are evaluated by the AUC and MAE, where the regularized parameter λ is set to λ = l max m , n , which is adaptive to the patch size and number of patches.
  • Patch size
To balance the detection performance and computational complexity, we change the patch size from 20 to 50 with 10 intervals and provide the corresponding AUC and MAE in Table 1. As the patch size increases, the detection performance of the SLRTD method decreases. When the patch size is 50 × 50, the degradation is more serious. The reason is the larger patch size may destroy the low-rank and sparsity between the different patches, and influence the decomposition of the defect target and background.
b.
Step size
In practice, we hope to take a larger step in exchange for the reduction in computational complexity. We fix the patch size and vary the step size from 10 to 40 with 10 intervals. The evaluation results are shown in Table 1. It could be observed that the performances of larger steps are always better than that of smaller ones, but when the step size is similar with the patch size, the performance becomes worse.
c.
Value of Schatten- p
As shown in Table 2, the AUC and MAE with p change from 0.1 to 1 with 0.1 intervals. The detection performance of p = 1 is worse than lower p as the over-shrinkage is serious. When p is decreased, the low-rank background may be closer to true rank, while more high-rank components would become zeros [34]. When p is too small, it would also make some low-rank backgrounds become zeros to lead the bad detection performance. Therefore, in order to make the tradeoff between detection accuracy and error, p is set to 0.7 in the following experiments.

4.2.2. Robustness to Noise

The noise conditions of a real-world galvanized strip steel production line are complex and random; we evaluate the proposed SLRTD method’s performance in the noise case with different levels. Additive Gaussian noise with different signal-to-noise ratios (SNRs) are introduced to the original defect image, including 36 dB, 32 dB, and 28 dB. The experimental results are shown in Figure 7 and Table 3. It can be observed that the SLRTD is robust to noise in most cases; binarization defect images obtained by SLRTD are clear and accurate. When SNR decreases gradually, the AUC and MAE metrics can remain a relatively high level; for example, AUC still remains around 0.85 under the condition of SNR = 28   dB , which is considered as less sensitive to noise. However, some pixels of the background are misclassified for defects, which is still a great challenge for the SLRTD method.

4.3. Comparison with the State-of-the-Art Methods

The proposed SLRTD method is compared with four state-of-the-art algorithms, including TRPCA [23], PSTNN [24], ETRPCA [25], and NN-TRPCA [29].

4.3.1. Qualitative Comparison

To facilitate a more intuitive comparison of the performance of each algorithm, we selected representative defect images from the dataset. The false-detected targets are highlighted with green circles, which are shown in Figure 8. The qualitative comparison results between the proposed SLRTD method and the other four methods are shown in Figure 8. These images show various illumination and gray levels. NN-TRPCA and PSTNN can enhance defect targets, but they also enhance many clutters and noises of the background and the defect object cannot be uniformly highlighted, which can cause a high false alarm rate. These methods may lose the targets with heavy noise. NN-TRPCA exhibits poor robustness in detecting small targets against complex backgrounds, resulting in considerable background residue. The TRPCA and ETRPCA methods can detect the targets in the 2nd and 4th rows, but some background residue still remains. There are many pixels that belong to the background and are misjudged by defects, which leads to low accuracy. By contrast, SLRTD separates the defect objects from the image background successfully and locates various defects precisely. It more efficiently highlights the whole defect object with well-defined boundaries than the other methods. It can be concluded that WSNM regularization could improve the performance. In addition, WSNM regularization of SLRTD contributes to good performance, and thus, is the important factor to obtain more precise segmentation results than other methods. What is more, it is reasonable to conclude that treating the matrix singular values differently is the method by which the most important characteristics of the defects or background can be preserved. The superiority of the proposed SLRTD method can suppress the noise and detect the defect targets clearly, which can generate high-quality binary segmentation results by a simple threshold method.

4.3.2. Quantitative Comparison

To further demonstrate the superiority of the proposed SLRTD method, four competitive methods of TRPCA, ETRPCA, NN-TRPCA, PSTNN are selected for comparison. From the P-R and ROC curves demonstrated in Figure 9, for the same false alarm ratio, our method can achieve the highest detection performance. By imposing a p -norm to the patch-image, SLRTD can suppress background noise effectively for all defect images and lay a good foundation for the subsequent target segmentation. Table 4 summarizes the quantitative results of five methods, and the best results are marked in bold. It demonstrates that SLRTD also has better performance than the other four methods. Most of the AUC results are higher than 85%, and SLRTD achieves 0.9560. MAE of SLRTD is typically the lowest among all the methods. Compared with TRPCA, it is increased by 2.08% and 1.55% in AUC and MAE, respectively. Based on the above qualitative and quantitative analyses, it confirms that our proposed SLRTD method consistently outperforms some state-of-the-art methods and verifies the effectiveness of the proposed SLRTD method.

5. Conclusions

In order to detect small white-spot surface defects of galvanized strip steel accurately and rapidly, the SLRTD method is proposed in this paper. The nonlinear reweighting strategy based on Schatten p -norm is adopted to separate the defect image into a smooth non-defect background and random noise. A solution framework is proposed by the ADMM algorithm to obtain the defect target image. Based on the self-constructed defect dataset, experiments are conducted by the qualitative and quantitative method, which achieves the best performance among the state-of-the-art defect detection methods. In the future, we will focus on improving the ability of the proposed SLRTD method to detect the other tiny defects under weak illumination conditions or irregular defect-like texture surfaces. In the future, we will conduct binocular and structured-light stereo vision to obtain three-dimensional information of surface defects in industrial production.

Author Contributions

S.Z. designed the SLRTD model and performed the evaluation experiments. X.Y. collaborated closely and contributed equally to this work. H.L. arranged the datasets, as well as reviewed the article. C.G. developed the automatic optical inspection procedure. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China, grant number 52205537 and 51805386, the open fund of State Key Laboratory of Intelligent Manufacturing Equipment and Technology (Huazhong University of Science and Technology), grant number IMETKF2025024, and the Innovative Research Group Project of the National Natural Science Foundation of Hubei Province of China, grant number 2024AFA026.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data are not publicly available due to privacy issues.

Acknowledgments

The authors would like to thank the editor and anonymous reviewers.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Diers, J.; Pigorsch, C. A survey of methods for automated quality control based on images. Int. J. Comput. Vision 2023, 131, 2553–2581. [Google Scholar] [CrossRef]
  2. Zhu, J.P.; He, G.H.; Zhou, P. MFNet: A novel multilevel feature fusion network with multibranch structure for surface defect detection. IEEE Trans. Instrum. Meas. 2023, 72, 1–11. [Google Scholar] [CrossRef]
  3. Cheng, G.; Yuan, X.; Yao, X.W.; Yan, K.B.; Zeng, Q.H.; Xie, X.X.; Han, J.W. Towards large-scale small object detection: Survey and benchmarks. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 13467–13488. [Google Scholar] [CrossRef]
  4. Li, J.; Wu, R.; Zhang, S.; Chen, Y.L.; Dong, Z.C. FASCNet: An edge-computational defect detection model for industrial parts. IEEE Internet. Things. 2023, 11, 6622–6637. [Google Scholar] [CrossRef]
  5. Luo, Q.W.; Chen, Y.W.; Su, J.J.; Yang, C.H.; Silvén, O.; Liu, L. Prior-guided YOLOX for tiny roll mark detection on strip steel. IEEE Sens. J. 2024, 24, 15575–15587. [Google Scholar] [CrossRef]
  6. Ameri, R.; Hsu, C.C.; Band, S.S. A systematic review of deep learning approaches for surface defect detection in industrial applications. Eng. Appl. Artif. Intel. 2023, 130, 107717. [Google Scholar] [CrossRef]
  7. Khanam, R.; Hussain, M.; Hill, R.; Allen, P. A comprehensive review of convolutional neural networks for defect detection in industrial applications. IEEE Access 2024, 12, 94250–94295. [Google Scholar] [CrossRef]
  8. Zou, Z.X.; Chen, K.Y.; Shi, Z.W.; Guo, Y.H.; Ye, J.P. Object detection in 20 years: A survey. Proc. IEEE 2023, 111, 257–276. [Google Scholar] [CrossRef]
  9. Jha, S.B.; Babiceanu, R.F. Deep CNN-based visual defect detection: Survey of current literature. Comput. Ind. 2023, 14, 103911. [Google Scholar] [CrossRef]
  10. Liu, G.H.; Chu, M.X.; Gong, R.F.; Zheng, Z.H. Global attention module and cascade fusion network for steel surface defect detection. Pattern Recogn. 2024, 158, 110979. [Google Scholar] [CrossRef]
  11. Zare, A.; Ozdemir, A.; Iwen, M.A.; Aviyente, S. Extension of PCA to higher order data structures: An introduction to tensors, tensor decompositions, and tensor PCA. Proc. IEEE 2018, 106, 1341–1358. [Google Scholar] [CrossRef]
  12. Sun, Y.; Yang, J.G.; An, W. Infrared dim and small target detection via multiple subspace learning and spatial-temporal patch-tensor model. IEEE T. Geosci. Remote. 2020, 59, 3737–3752. [Google Scholar] [CrossRef]
  13. Yu, Q.; Yang, M. Low-rank tensor recovery via non-convex regularization, structured factorization and spatio-temporal characteristics. Pattern Recogn. 2023, 137, 109343. [Google Scholar] [CrossRef]
  14. Wang, M.H.; Hong, D.F.; Han, Z.; Li, J.X.; Yao, J.; Gao, L.R.; Zhang, B.; Chanussot, J. Tensor decompositions for hyperspectral data processing in remote sensing: A comprehensive review. IEEE Geosci. Remote Sens. Mag. 2023, 11, 26–72. [Google Scholar] [CrossRef]
  15. Luo, Y.; Li, X.R.; Chen, S.H. 5-D spatial-temporal information-based infrared small target detection in complex environments. Pattern Recogn. 2024, 158, 111003. [Google Scholar] [CrossRef]
  16. Zeng, N.Y.; Wu, P.S.; Wang, Z.D.; Li, H.; Liu, W.; Liu, X.H. A small-sized object detection oriented multi-scale feature fusion approach with application to defect detection. IEEE Trans. Instrum. Meas. 2022, 71, 3507014. [Google Scholar] [CrossRef]
  17. Wen, X.; Shan, J.; He, Y.; Song, K. Steel surface defect recognition: A survey. Coatings 2023, 13, 17. [Google Scholar] [CrossRef]
  18. Li, D.; Li, Y.; Xie, Q.; Wu, Y.; Yu, Z.; Wang, J. Tiny defect detection in high-resolution aero-engine blade images via a coarse-to-fine framework. IEEE Trans. Instrum. Meas. 2021, 70, 1–11. [Google Scholar] [CrossRef]
  19. Zhang, H.Y.; Li, M.; Miao, D.Q.; Pedrycz, W.; Wang, Z.G.; Jiang, M.H. Construction of a feature enhancement network for small object detection. Pattern Recogn. 2023, 143, 109801. [Google Scholar] [CrossRef]
  20. Hou, X.Q.; Liu, M.Q.; Zhang, S.L.; Wei, P.; Chen, B.D. CANet: Contextual information and spatial attention based network for detecting small defects in manufacturing industry. Pattern Recogn. 2023, 140, 109558. [Google Scholar] [CrossRef]
  21. Yang, B.Y.; Liu, Z.Y.; Duan, G.F.; Tan, J.R. Residual shape adaptive dense-nested Unet: Redesign the long lateral skip connections for metal surface tiny defect inspection. Pattern Recogn. 2023, 147, 110073. [Google Scholar] [CrossRef]
  22. Dai, Y.; Wu, Y.Q. Reweighted infrared patch-tensor model with both nonlocal and local priors for single-frame small target detection. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2017, 10, 3752–3767. [Google Scholar] [CrossRef]
  23. Lu, C.Y.; Feng, J.S.; Chen, Y.D.; Liu, W.; Lin, Z.C.; Yan, S.C. Tensor robust principal component analysis with a new tensor nuclear norm. IEEE Trans. Pattern Anal. Mach. Intell. 2019, 42, 925–938. [Google Scholar] [CrossRef] [PubMed]
  24. Zhang, L.D.; Peng, Z.M. Infrared small target detection based on partial sum of the tensor nuclear norm. Remote Sens. 2019, 11, 382. [Google Scholar] [CrossRef]
  25. Gao, Q.X.; Zhang, P.; Xia, W.; Xie, D.Y.; Gao, X.B.; Tao, D.C. Enhanced tensor RPCA and its application. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 43, 2133–2140. [Google Scholar] [CrossRef]
  26. Chen, L.; Jiang, X.; Liu, X.Z.; Zhou, Z.X. Logarithmic norm regularized low-rank factorization for matrix and tensor completion. IEEE Trans. Image Process. 2021, 30, 3434–3449. [Google Scholar] [CrossRef]
  27. Luo, Y.; Li, X.R.; Yan, Y.F.; Xia, C.Q. Spatial-temporal tensor representation learning with priors for infrared small target detection. IEEE Trans. Aerosp. Electron. Syst. 2023, 59, 9598–9620. [Google Scholar] [CrossRef]
  28. Wang, H.L.; Peng, J.J.; Qin, W.J.; Wang, J.J.; Meng, D.Y. Guaranteed tensor recovery fused low-rankness and smoothness. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 10990–11007. [Google Scholar] [CrossRef]
  29. Geng, X.Y.; Guo, Q.; Hui, S.X.; Yang, M.; Zhang, C.M. Tensor robust PCA with nonconvex and nonlocal regularization. Comput. Vis. Image Und. 2024, 243, 104007. [Google Scholar] [CrossRef]
  30. Huang, Z.X.; Zhao, E.W.; Zheng, W.; Peng, X.D.; Niu, W.L.; Yang, Z. Infrared small target detection via two-stage feature complementary improved tensor low-rank sparse decomposition. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2024, 17, 17690–17709. [Google Scholar] [CrossRef]
  31. Liu, C.T.; Wang, H. Research on infrared dim and small target detection algorithm based on low-rank tensor recovery. J. Syst. Eng. Electron. 2023, 34, 861–872. [Google Scholar] [CrossRef]
  32. Guo, J.; Wu, Y.Q.; Dai, Y.M. Small target detection based on reweighted infrared patch-image model. IET Image Process. 2018, 12, 70–79. [Google Scholar] [CrossRef]
  33. Xie, Y.; Gu, S.H.; Liu, Y.; Zuo, W.M.; Zhang, W.S.; Zhang, L. Weighted Schatten p-norm minimization for image denoising and background subtraction. IEEE Trans. Image Process. 2016, 25, 4842–4857. [Google Scholar] [CrossRef]
  34. Sun, Y.; Yang, J.G.; Li, M.; An, W. Infrared small target detection via spatial-temporal infrared patch-tensor model and weighted Schatten p-norm minimization. Infrared Phys. Techn. 2019, 102, 103050. [Google Scholar] [CrossRef]
  35. Zuo, W.M.; Meng, D.Y.; Zhang, L.; Feng, X.C.; Zhang, D. A generalized iterated shrinkage algorithm for non-convex sparse coding. In Proceedings of the 2013 IEEE International Conference on Computer Vision 2013, Sydney, NSW, Australia, 1–8 December 2013; IEEE: Piscataway, NJ, USA, 2014; pp. 217–224. [Google Scholar] [CrossRef]
  36. Xie, D.Y.; Yang, M.; Gao, Q.X.; Song, W. Non-convex tensorial multi-view clustering by integrating l1-based sliced-Laplacian regularization and l2,p-sparsity. Pattern Recognit. 2024, 154, 110605. [Google Scholar] [CrossRef]
Figure 1. White-spot defects on the surface of galvanized steel sheet: the rightmost is the proportion of defective pixels in the whole image.
Figure 1. White-spot defects on the surface of galvanized steel sheet: the rightmost is the proportion of defective pixels in the whole image.
Sensors 25 02606 g001
Figure 2. The flowchart of the proposed SLRTD method for surface defect detection.
Figure 2. The flowchart of the proposed SLRTD method for surface defect detection.
Sensors 25 02606 g002
Figure 3. Illustration of the nonlocal similarity and the low-rank property of background patch-tensors: (a) one representative defect image; (bd) corresponding singular values distributions curve of the mode-1, mode-2, and mode-3 unfolding matrices of background patch-tensors.
Figure 3. Illustration of the nonlocal similarity and the low-rank property of background patch-tensors: (a) one representative defect image; (bd) corresponding singular values distributions curve of the mode-1, mode-2, and mode-3 unfolding matrices of background patch-tensors.
Sensors 25 02606 g003
Figure 4. Convergence curve of the proposed SLRTD method.
Figure 4. Convergence curve of the proposed SLRTD method.
Sensors 25 02606 g004
Figure 5. Industrial image acquisition platform in the galvanized strip steel production line: (a) strip steel production line; (b) image captured from the steel surface.
Figure 5. Industrial image acquisition platform in the galvanized strip steel production line: (a) strip steel production line; (b) image captured from the steel surface.
Sensors 25 02606 g005
Figure 6. Three-dimensional maps of defect samples and the corresponding ground-truth: (a) original image; (b) ground-truth.
Figure 6. Three-dimensional maps of defect samples and the corresponding ground-truth: (a) original image; (b) ground-truth.
Sensors 25 02606 g006
Figure 7. Detection results of binarization by Otsu’s method in different noise cases: (a) original image; (b) ground-truth; (c) 0 dB; (d) 36 dB; (e) 32 dB; (f) 28 dB.
Figure 7. Detection results of binarization by Otsu’s method in different noise cases: (a) original image; (b) ground-truth; (c) 0 dB; (d) 36 dB; (e) 32 dB; (f) 28 dB.
Sensors 25 02606 g007
Figure 8. Qualitative comparison results: (a) input image; (b) ground-truth image; (c) TRPCA; (d) ETRPCA; (e) NN-TRPCA; (f) PSTNN; (g) ours.
Figure 8. Qualitative comparison results: (a) input image; (b) ground-truth image; (c) TRPCA; (d) ETRPCA; (e) NN-TRPCA; (f) PSTNN; (g) ours.
Sensors 25 02606 g008
Figure 9. Quantitative comparison results with P-R curves and ROC curves.
Figure 9. Quantitative comparison results with P-R curves and ROC curves.
Sensors 25 02606 g009
Table 1. Experimental results of AUC and MAE with different patch sizes and step sizes.
Table 1. Experimental results of AUC and MAE with different patch sizes and step sizes.
PatchStepAUCMAE
20 × 20100.93410.0009
200.97120.0030
30 × 30100.95010.0007
200.97280.0019
300.97750.0046
40 × 40100.95600.0005
200.97370.0017
300.97690.0034
400.97620.0034
50 × 50100.95890.0006
200.97320.0014
300.97600.0021
400.97440.0018
500.97310.0015
Table 2. Experiment of AUC and MAE with different p .
Table 2. Experiment of AUC and MAE with different p .
pAUCMAE
0.10.99940.0613
0.20.98950.0263
0.30.98870.0105
0.40.97690.0042
0.50.97370.0018
0.60.96580.0009
0.70.95600.0005
0.80.94150.0005
0.90.93260.0005
10.92550.0005
Table 3. Experiment with different noise levels.
Table 3. Experiment with different noise levels.
SNRNo Noise36 dB32 dB28 dB
Index
AUC0.95600.93860.90580.8272
MAE0.0050.16100.17310.1939
Table 4. Comparison of AUC and MAE of different methods.
Table 4. Comparison of AUC and MAE of different methods.
MethodTRPCAETRPCANN-TRPCAPSTNNOurs
Index
AUC0.93520.92590.94270.89250.9560
F ζ 0.54070.65690.67300.40280.8364
MAE0.01600.00040.00710.00030.0005
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhou, S.; Yan, X.; Liu, H.; Gong, C. Small Defects Detection of Galvanized Strip Steel via Schatten-p Norm-Based Low-Rank Tensor Decomposition. Sensors 2025, 25, 2606. https://doi.org/10.3390/s25082606

AMA Style

Zhou S, Yan X, Liu H, Gong C. Small Defects Detection of Galvanized Strip Steel via Schatten-p Norm-Based Low-Rank Tensor Decomposition. Sensors. 2025; 25(8):2606. https://doi.org/10.3390/s25082606

Chicago/Turabian Style

Zhou, Shiyang, Xuguo Yan, Huaiguang Liu, and Caiyun Gong. 2025. "Small Defects Detection of Galvanized Strip Steel via Schatten-p Norm-Based Low-Rank Tensor Decomposition" Sensors 25, no. 8: 2606. https://doi.org/10.3390/s25082606

APA Style

Zhou, S., Yan, X., Liu, H., & Gong, C. (2025). Small Defects Detection of Galvanized Strip Steel via Schatten-p Norm-Based Low-Rank Tensor Decomposition. Sensors, 25(8), 2606. https://doi.org/10.3390/s25082606

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop