You are currently viewing a new version of our website. To view the old version click .
Sensors
  • Article
  • Open Access

20 April 2025

Small Defects Detection of Galvanized Strip Steel via Schatten-p Norm-Based Low-Rank Tensor Decomposition

,
,
and
1
Key Laboratory of Metallurgical Equipment and Control Technology, Ministry of Education, Wuhan University of Science and Technology, Wuhan 430081, China
2
Hubei Key Laboratory of Mechanical Transmission and Manufacturing Engineering, Wuhan University of Science and Technology, Wuhan 430081, China
3
State Key Laboratory of Intelligent Manufacturing Equipment and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
*
Author to whom correspondence should be addressed.
This article belongs to the Special Issue Sensing and Imaging for Defect Detection: 2nd Edition

Abstract

Highlights

What are the main findings?
  • Compared with the deep learning method for surface defect detection, the proposed SLRTD-based detection method is simpler and more effective in a galvanized strip steel production line.
What is the implication of the main finding?
  • The proposed SLRTD-based detection method of surface defect can be applied for other industrial products, such as glass, fabric, LCD, and AMOLED.

Abstract

Accurate and efficient white-spot defects detection for the surface of galvanized strip steel is one of the most important guarantees for the quality of steel production. It is a fundamental but “hard” small target detection problem due to its small pixel occupation in low-contrast images. By fully exploiting the low-rank and sparse prior information of a surface defect image, a Schatten-p norm-based low-rank tensor decomposition (SLRTD) method is proposed to decompose the defect image into low-rank background, sparse defect, and random noise. Firstly, the original defect images are transformed into a new patch-based tensor mode through data reconstruction for mining valuable information of the defect image. Then, considering the over-shrinkage problem in the low-rank component estimation caused by a vanilla nuclear norm and a weighted nuclear norm, a nonlinear reweighting strategy based on a Schatten p -norm is incorporated to improve the decomposition performance. Finally, a solution framework is proposed via a well-designed alternating direction method of multipliers to obtain the white-spot defect target image by a simple segmenting algorithm. The white-spot defect dataset from a real-world galvanized strip steel production line is constructed, and the experimental results demonstrate that the proposed SLRTD method outperforms existing state-of-the-art methods qualitatively and quantitatively.

1. Introduction

Galvanized strip steel is widely used in automobile manufacturing, household electrical appliances, and other daily-use products; surface defects might lead to some product quality issues for both in-progress and downstream products. The white-spot defects mainly caused by random zinc dross and ash in the hot dip or electro galvanizing process are considered the most serious threat to the steel surface quality due to their high concurrence and typical periodicity. This defect can be detected and recorded by the surface defect inspection based on machine vision, and then the quality issues with strip steel will be controlled at the early stage [,]. In fact, industrial production images frequently contain some prior information; for example, if a defect rarely appears, the vast majority of images show no defects. The white-spot defects are always ultra-tiny in size, occupying less than 5% of the whole raw image. The inherent issues of tiny size and low-contrast of white-spot defects under complex clutters and heavy noise constitute the significant obstacle in the detection and segmentation of the defect target: (a) they are sensitive to tiny defects because of their extreme small sizes, and (b) they cannot accurately locate the random appeared defects whose semantic relationship with the background context is weak.
In industrial machine vision, small or tiny object detection has always been a significantly studied issue, especially in industrial applications [,,,,]. The research can be classified into three categories: traditional filtering-based methods, sparse and low-rank representation-based methods, and data-driven methods []. Filtering-based methods typically design relevant filtering operators in either grayscale or derivative spaces to suppress background noise and enhance targets. While these methods offer a certain level of real-time performance, they often require manual design of operators tailored to specific scenes when facing complex backgrounds, which limits their generalization ability and adaptability. Unlike the methods above that rely on manually designed features based on data priors, data-driven approaches utilize convolutional neural networks (CNNs) for deep feature extraction automatically [,]. Although these methods perform well and can extract deeper information, they depend heavily on image samples, which affects their real-time applicability. Due to the size limitations of small targets, which typically consist of only a few pixels, there is a risk of losing critical information during target detection, which may be ineffective or even degrade detection performance. In contrast, some traditional methods offer strong interpretability and maintain a certain level of robustness when faced with complex scenarios, maintaining certain advantages.
In recent years, tensor decomposition-based sparse and low-rank representation methods have achieved remarkable success, especially in infrared or remote sensing small target detection [,,,,]. These methods leverage the low-rank nature of the infrared background and the sparse characteristics of small infrared targets; the background is modeled as a low-rank component, while the infrared targets are typically modeled as sparse components. The tensor model effectively preserves spatial structures and utilizes temporal information across multiple frames, which helps achieve more accurate target detection. These methods can combine spatial and temporal local information and iteratively optimize to approach the optimal detection results, significantly improving robustness to detect small targets in complex scenes. As the defect is random in the production line of strip steel, different image frames are irrelated, and the spatial and temporal information between different image frames are lost. As single-frame information is limited, these methods often miss targets or retain excessive false alarms when dealing with edge overlapping or complex backgrounds, thus failing to provide optimal decomposition guidance without temporal information across frames []. As shown in Figure 1, the white-spot defect of galvanized steel sheet is random and keeps changing all the time; small or tiny white-spot defects among 4096 × 2048 raw images are still easily ignored due to insufficient appearance information. At the same time, the highly varied nature of the background image and small target characteristics make the detection process extremely difficult.
Figure 1. White-spot defects on the surface of galvanized steel sheet: the rightmost is the proportion of defective pixels in the whole image.
Motivated by the above discussions, we establish the adaptive tensor model to accurately detect white-spot defects on galvanized steel sheet. The main contributions of this article are outlined as follows:
  • We propose an SLRTD method by digging out inter-patch correlation-ships of surface defect images of galvanized strip steel. The separated defect foreground target information with sparse outliers is embedded in the background of low-rank representation.
  • To achieve an accurate estimation of non-defect background rank, we incorporate weighted Schatten p -norm regularization for the background component, allowing for better noise removal while preserving edges, ultimately leading to improved detection results. Concurrently, a nonlinear reweighting strategy and tensor singular value decomposition (t-SVD) are adopted to help the model more delicately balance the low-rank and sparse components throughout the iterative process, which elevates the separation accuracy between the defect target and non-defect background.
  • According to the alternating direction method of multipliers (ADMM), we solve the sparse and low-rank decomposition problem. Experiment results demonstrate the feasibility and effectiveness of the proposed SLRTD method.
The remainder of this paper is structured as follows: Section 2 provides a concise overview of related work. Section 3 presents a comprehensive introduction to the architecture and components of the proposed SLRTD approach. Section 4 demonstrates numerical comparisons and module analysis. Finally, Section 5 concludes the paper. In our paper, tensors, matrices, and vectors are denoted using calligraphic letters, uppercase, and lowercase italics.

3. Methodology

The overall framework of the proposed SLRTD method is shown in Figure 2, which consists of construction of tensor model for defect image, model solution, and model analysis.
Figure 2. The flowchart of the proposed SLRTD method for surface defect detection.

3.1. Construction of Tensor Model for Defect Image

Tensor can be expanded into matrix along n-modes and its i-th frontal slice are denoted as X i . We denote X ¯ n 1 × n 2 × n 3 as the result of Discrete Fourier Transformation (DFT) along its third dimension by using X ¯ , that is, X ¯ = fft X ,   , 3 . The inverse operator computes X from X ¯ , that is, X = fft X ¯ ,   , 3 . The tensor nuclear norm is defined as X = i = 1 r S i , i , j = 1 n 3 i = 1 r j = 1 n 3 S ¯ i , i , j , where r = rank t X and S ¯ i , i , 1 are the entries on the diagonal of the first slice of S ¯ , which has a decreasing order property. And we denote the l0 norm, the l1 norm, and the Frobenius norm as X 0 , X 1 , X F , respectively.
The defect image is mainly composed of defect target, background, and noise, which can be expressed as follows:
f D = f T + f B + f N
where f D , f T , f B , and f N represent the input of original defect image, defect image, background image, and noise defect, respectively.
The original defect image I is two-dimensional data, which can be converted to tensor data. Assume original defect image I and the tensor structure obtained after data reconstruction is D . The image I will be browsed by m × n sliding window from left to right and from top to bottom; the total sliding time is l ; the small patch obtained each time as the frontal slice of D m × n × l is shown in Figure 2. The tensor model of defect image can be constructed as follows:
D = T + B + N
where D , T , B , and N represent defect image patch-tensor, defect target patch-tensor, background patch-tensor, and random noise patch-tensor, respectively.
Defect target patch-tensor T :
For defect image from the strip steel production line, defects with fewer pixels with respect to the whole image, it is sparse compared to most of the normal background regions. Thus, the corresponding defect target patch-tensor is an extremely sparse tensor, which can be depicted as follows:
T 0 d
where d is an integer that is related with defect target characteristics of number and size.
Background patch-tensor B :
As illustrated in Figure 1, the gray-value of normal regions is almost uniform, which means that the local patches are highly correlated with each other. Actually, the mode-1, mode-2, and mode-3 unfolding matrices of the background patch-tensor are also low-rank. In Figure 3b–d, the horizontal axes denote the number of singular values; the vertical axes denote the singular values. Assuming the defect image is 200 × 200, the patch is 40 × 40 and the step size is 10, the patch-tensor D 40 × 40 × 289 , the size of mode-1, mode-2, and mode-3 unfolding matrices are 40 × 11,560, 40 × 11,560, 289 × 1600, respectively. From Figure 3, it can be observed that the singular values of three mode-unfolding matrices all exhibit a sharp decreasing trend and decline into approximate zero rapidly, with only a small fraction being significantly greater than zero. It indicates that the background patch-tensor B is correlated along each mode direction and further reflects its inherent low-rank structure within the subspaces corresponding to each mode [,]. Based on this property, we can consider the background patch-tensor B as a low-rank tensor, and their unfolding matrices are also all low-rank defined as follows:
rank B 1 r 1 rank B 2 r 2 rank B 3 r 3
where B 1 , B 2 , and B 3 denote model-1, model-2, and model-3 unfolding matrices of B , respectively; r 1 , r 2 , and r 3 denote the background complexity.
Figure 3. Illustration of the nonlocal similarity and the low-rank property of background patch-tensors: (a) one representative defect image; (bd) corresponding singular values distributions curve of the mode-1, mode-2, and mode-3 unfolding matrices of background patch-tensors.
Noise patch-tensor N :
The noise is usually modeled as additive white Gaussian noise, and it satisfies N F 2 δ , where δ > 0 denotes the Gaussian noise level.
Based on above analysis, the small white-spot defects detection on galvanized strip steel based on tensor decomposition can be formulated as follows:
min T , B , N rank B + λ T 0 + η N F 2 s . t .   D = B + T + N
where rank · denotes matrix rank and · 0 denotes l 0 norm, · F 2 denotes F norm, λ and η denote regularization parameters to balance the low-rank and sparse characteristics.
A reweighted strategy is always adopted to solve low-rank and sparse tensor decomposition. The principle of weighted Schatten p -norm minimization (WSNM) is to assign different weights to different singular values [,], which can recover the low-rank component and separate target from background more accurately by adjusting p . Therefore, we incorporate the WSNM to improve the detection accuracy of defect target. It can be defined as follows:
B W p = 1 n 3 i = 1 r j = 1 n 3 W i , i , j S ¯ i , i , j p 1 p
where n 3 is the number of patches, W i , i , j = ξ n 1 n 2 S ¯ i , i , j + E B denotes a weight tensor, which is defined in Algorithm 1; S ¯ : , : , j represents the singular values of B : , : , j , ξ is a parameter, E B is a positive constant.
Algorithm 1: Solving Equation (11)
Input: X n 1 × n 2 × n 3 , p
Output: F λ X
step 1: Conduct FFT operation: X ¯ = fft X ,   , 3
step 2: Conduct SVD operation on each frontal slice X ¯ i of X ¯
      for i = 1 , 2 , , n 3 + 1 2  do
         U , Σ , V = svd X ¯ i , Σ = diag σ 1 , , σ r
        Compute w = w 1 , w 2 , , w r
          for j = 1 , 2 , , r  do
               δ i = GST σ i , λ w i , p
          end for
         Σ = diag δ 1 , , δ r , W ¯ i = U Σ V T
      end for
      for i = n 3 + 1 2 + 1 , , n 3  do
            W ¯ i = conj W ¯ n 3 i + 2
      end for
step 3: Compute F X = ifft W ¯ ,   , 3
Additionally, l 0 norm is replaced by l 2 , 1 norm for solving the model. Hence, Equation (5) can be transformed as follows:
min T , B , N B W p + λ T 2 , 1 + η N F 2 s . t .   D = B + T + N

3.2. Model Solution

In order to solve Equation (7), the ADMM are selected to solve the separable convex optimization problem. The Lagrange function can be constructed as shown in Equation (8), which can be converted to Equation (9), where, Y r × r × n 3 , μ and · denote the Lagrangian multiplier tensor, inner product among tensors, respectively.
O B , T , N , Y , μ = B W p + λ T 2 , 1 + η N F 2 + μ 2 D B T N F 2 + Y ,   D B T N
O B , T , N , Y , μ = B W p + λ T 2 , 1 + η N F 2 + μ 2 D B T N + Y μ F 2
Based on ADMM, O can be broken into several sub-problems for iterative update:
(i) Fix the rest of the variables and update of B , which is defined by Equation (10):
B k + 1 = arg min B 1 2 D T k N k + Y k μ k B k F 2 + 1 μ k B W p
It can be solved by Generalized Soft-Thresholding (GST) and tensor singular value thresholding (t-SVT) [] as follows:
B k + 1 = F 1 μ k D T k N k + Y k μ k
where F · denotes solving WSNM-tensor problem, and the detailed process is provided in Algorithm 1. · k represents the value of variables in k-th iteration.
(ii) Fix the rest of the variables and update of T , which is confirmed by Equation (12):
T k + 1 = arg min T λ T 2 , 1 + μ k 2 D B k + 1 N k + Y k μ k T F 2
As l 2 , 1 -norm of T is defined as the sum of l 2 -norm of each mode-2 fiber, we matricize each tensor along the 2nd mode, so T k + 1 2 , 1 = T 2 k + 1 2 , 1 . It can be transformed into the matrix form with Equation (13):
T 2 k + 1 = arg min T 2 λ T 2 2 , 1 + μ k 2 D 2 B 2 k + 1 N 2 k + Y 2 k μ k T 2 F 2
Let X = D 2 B 2 k + 1 N 2 k + Y 2 k μ k , so
T 2 k + 1 = arg min T 2 λ T 2 2 , 1 + μ k 2 X T 2 F 2
According to [], it has the following close-form solution with Equation (15):
T 2 k + 1 : , j = X : , j 2 λ μ k X : , j 2 X : , j ,             if   X : , j 2 > λ μ k 0 ,                                                             otherwise
where X : , j represents the j-th column of the matrix X .
After T 2 k + 1 is solved, it can be transformed into tensor form T k + 1 .
(iii) Fix the rest of the variables and update of N is obtained by Equation (16):
N k + 1 = arg min N η N F 2 + μ k 2 D B k + 1 T k + 1 + Y k μ k N F 2
Differentiating it with respect to N , and let it be zero:
2 η N μ k D B k + 1 T k + 1 + Y k μ k N = 0
Then, we have the following:
N k + 1 = μ k 2 η + μ k D B k + 1 T k + 1 + Y k μ k
(iv) Fix the rest of the variables and update of Y :
Y k + 1 = Y k + μ k D B k + 1 T k + 1 N k + 1
(v) Update of μ :
μ k + 1 = min ρ μ k , μ max
where 0 < ρ < 1 , μ max = 10 5 .
(vi) Iteration termination condition:
D B k + 1 T k + 1 N k + 1 F D F < ε
where 10 3 < ε < 10 5 .
Finally, we summarize the algorithmic process of the proposed SLRTD method in Algorithm 2.
From target-background separation, the tensor D is decomposed into the defect target T and background B . The defect target image f T and background image f B are reconstructed from T and B , respectively. Finally, the defect targets can be extracted and segmented by the adaptive thresholding segmentation method.
Algorithm 2: Solving Equation (7) by ADMM
Input: Original defect image sequence tensor D R ^ m × n × l , power   p ,   λ , η
Output: B , T , N
Initialize: D , T 0 = B 0 = N 0 = 0 , W = I , Y = 0 , μ max = 10 5 , ρ = 1.1 , k = 0
While: not converged, do
step 1: Update B k + 1 by Equation (11)
step 2: Update T k + 1 by Equation (15)
step 3: Update N k + 1 by Equation (16)
step 4: Update Y k + 1 by Equation (19)
step 5: Update μ k + 1 by Equation (20)
step 6: Check the convergence condition
step 7: Update k = k + 1
end While

3.3. Model Analysis

3.3.1. Computational Complexity

For surface defect image D, it could be constructed into the tensor D m × n × l with sliding window’s size m × n and times l . The computation complexity of the solution of B includes three operations [] of FFT, SVD, and GST; FFT operation is O A m n l log l , SVD operation is O A m n 2 l + 1 2 , GST operation is O B m n , where A denotes iterations number in Algorithm 1, B denotes iterations number in GST. Thus, the computation complexity of solution B is O A m n l log l + A m n 2 l + 1 2 + B m n . For solution T , the computation complexity is O m n . Therefore, the whole computation complexity of model is O C A m n l log l + A m n 2 l + 1 2 + B m n + m n , where C is the iterations number in ADMM.

3.3.2. Convergence of Algorithm

The convergence of model Equation (9) has been proven in []. We evaluate the convergence of the proposed SLRTD method to empirically show the convergence through experiments in different iterations. We set a convergence error value of D B T N F / D F and found that the algorithm takes approximately 0.5 s per frame for small target detection. Figure 4 shows the variation in convergence error as the number of iterations progresses from 0 to 30. It can be observed that after 10 iterations, the error of our algorithm gradually decreases, and the convergence curve stabilizes, indicating that our method meets the convergence requirements and successfully achieves optimal or near-optimal solutions.
Figure 4. Convergence curve of the proposed SLRTD method.

4. Experiment

To evaluate the effectiveness of the proposed SLRTD method, we conduct a series of qualitative and quantitative experiments. Firstly, the details of data collection and preprocessing, and evaluation metrics are described. Secondly, the key parameters and robustness are analyzed. Finally, the performance comparison is conducted between our method and the other four typical methods.

4.1. Experimental Setup

4.1.1. Data Collection and Preprocessing

The proposed SLRTD method is validated on a dataset from a real-world galvanized strip steel production line, and the image acquisition platform can be observed in Figure 5. We crop the raw images’ 4096 × 1024 pixels into 200 × 200 pixels to make the dataset. In total, we obtain 100 defect images and 100 non-defect images. For each image, the pixel-level ground-truth is manually marked by using “1” to denote defective pixels and “0” to denote defect-free pixels. The size of the white-spot defect in the production line is about 1 mm to 3 mm, which is about 5 to 8 pixels in the 200 × 200 image. Defect samples and the corresponding masks of the constructed datasets are shown in Figure 6.
Figure 5. Industrial image acquisition platform in the galvanized strip steel production line: (a) strip steel production line; (b) image captured from the steel surface.
Figure 6. Three-dimensional maps of defect samples and the corresponding ground-truth: (a) original image; (b) ground-truth.

4.1.2. Evaluation Metrics

In order to evaluate the detection performance of the proposed SLRTD model, Precision, Recall, Precision-Recall (P-R) curve, receiver operating characteristic (ROC) curve, area under ROC curve (AUC) average F-Measure (Fζ), and mean square error (MAE) are adopted as evaluation metrics.
Precision is used to evaluate how many positive pixels are correctly classified. It can be defined as follows:
Precision = T P T P + F P
where true positive ( T P ) indicates the number of defect pixels which are correctly classified as defect, false positive ( F P ) indicates the number of background pixels mistakenly identified as defect.
Recall is used to evaluate how many pixels are correctly classified. It can be defined as follows:
Recall = T P T P + F N
where false negative ( F N ) indicates the number of defect pixels that are not classified to defect class.
F ζ represents the weighted harmonic mean of precision and recall. It can be defined as follows:
F ζ = 1 N i = 1 N ζ 2 + 1 × Precision × Recall ζ 2 × Precision + Recall
MAE measures the dissimilarity between segmentation images B W and corresponding ground-truth image G . It can be defined as follows:
MAE = i = 1 H j = 1 W B W i , j G i , j H × W
where H and W denote height and width of surface defect image.
Using Otsu’s threshold, we determine the Precision and Recall by changing the thresholds from 0 to 255 to obtain pairs of Precision and Recall.

4.2. Validation of the Proposed SLRTD Method

4.2.1. Parameter Analysis

For the proposed SLRTD method, the patch size, sliding step size, and value of Schatten- p are evaluated by the AUC and MAE, where the regularized parameter λ is set to λ = l max m , n , which is adaptive to the patch size and number of patches.
  • Patch size
To balance the detection performance and computational complexity, we change the patch size from 20 to 50 with 10 intervals and provide the corresponding AUC and MAE in Table 1. As the patch size increases, the detection performance of the SLRTD method decreases. When the patch size is 50 × 50, the degradation is more serious. The reason is the larger patch size may destroy the low-rank and sparsity between the different patches, and influence the decomposition of the defect target and background.
b.
Step size
Table 1. Experimental results of AUC and MAE with different patch sizes and step sizes.
In practice, we hope to take a larger step in exchange for the reduction in computational complexity. We fix the patch size and vary the step size from 10 to 40 with 10 intervals. The evaluation results are shown in Table 1. It could be observed that the performances of larger steps are always better than that of smaller ones, but when the step size is similar with the patch size, the performance becomes worse.
c.
Value of Schatten- p
As shown in Table 2, the AUC and MAE with p change from 0.1 to 1 with 0.1 intervals. The detection performance of p = 1 is worse than lower p as the over-shrinkage is serious. When p is decreased, the low-rank background may be closer to true rank, while more high-rank components would become zeros []. When p is too small, it would also make some low-rank backgrounds become zeros to lead the bad detection performance. Therefore, in order to make the tradeoff between detection accuracy and error, p is set to 0.7 in the following experiments.
Table 2. Experiment of AUC and MAE with different p .

4.2.2. Robustness to Noise

The noise conditions of a real-world galvanized strip steel production line are complex and random; we evaluate the proposed SLRTD method’s performance in the noise case with different levels. Additive Gaussian noise with different signal-to-noise ratios (SNRs) are introduced to the original defect image, including 36 dB, 32 dB, and 28 dB. The experimental results are shown in Figure 7 and Table 3. It can be observed that the SLRTD is robust to noise in most cases; binarization defect images obtained by SLRTD are clear and accurate. When SNR decreases gradually, the AUC and MAE metrics can remain a relatively high level; for example, AUC still remains around 0.85 under the condition of SNR = 28   dB , which is considered as less sensitive to noise. However, some pixels of the background are misclassified for defects, which is still a great challenge for the SLRTD method.
Figure 7. Detection results of binarization by Otsu’s method in different noise cases: (a) original image; (b) ground-truth; (c) 0 dB; (d) 36 dB; (e) 32 dB; (f) 28 dB.
Table 3. Experiment with different noise levels.

4.3. Comparison with the State-of-the-Art Methods

The proposed SLRTD method is compared with four state-of-the-art algorithms, including TRPCA [], PSTNN [], ETRPCA [], and NN-TRPCA [].

4.3.1. Qualitative Comparison

To facilitate a more intuitive comparison of the performance of each algorithm, we selected representative defect images from the dataset. The false-detected targets are highlighted with green circles, which are shown in Figure 8. The qualitative comparison results between the proposed SLRTD method and the other four methods are shown in Figure 8. These images show various illumination and gray levels. NN-TRPCA and PSTNN can enhance defect targets, but they also enhance many clutters and noises of the background and the defect object cannot be uniformly highlighted, which can cause a high false alarm rate. These methods may lose the targets with heavy noise. NN-TRPCA exhibits poor robustness in detecting small targets against complex backgrounds, resulting in considerable background residue. The TRPCA and ETRPCA methods can detect the targets in the 2nd and 4th rows, but some background residue still remains. There are many pixels that belong to the background and are misjudged by defects, which leads to low accuracy. By contrast, SLRTD separates the defect objects from the image background successfully and locates various defects precisely. It more efficiently highlights the whole defect object with well-defined boundaries than the other methods. It can be concluded that WSNM regularization could improve the performance. In addition, WSNM regularization of SLRTD contributes to good performance, and thus, is the important factor to obtain more precise segmentation results than other methods. What is more, it is reasonable to conclude that treating the matrix singular values differently is the method by which the most important characteristics of the defects or background can be preserved. The superiority of the proposed SLRTD method can suppress the noise and detect the defect targets clearly, which can generate high-quality binary segmentation results by a simple threshold method.
Figure 8. Qualitative comparison results: (a) input image; (b) ground-truth image; (c) TRPCA; (d) ETRPCA; (e) NN-TRPCA; (f) PSTNN; (g) ours.

4.3.2. Quantitative Comparison

To further demonstrate the superiority of the proposed SLRTD method, four competitive methods of TRPCA, ETRPCA, NN-TRPCA, PSTNN are selected for comparison. From the P-R and ROC curves demonstrated in Figure 9, for the same false alarm ratio, our method can achieve the highest detection performance. By imposing a p -norm to the patch-image, SLRTD can suppress background noise effectively for all defect images and lay a good foundation for the subsequent target segmentation. Table 4 summarizes the quantitative results of five methods, and the best results are marked in bold. It demonstrates that SLRTD also has better performance than the other four methods. Most of the AUC results are higher than 85%, and SLRTD achieves 0.9560. MAE of SLRTD is typically the lowest among all the methods. Compared with TRPCA, it is increased by 2.08% and 1.55% in AUC and MAE, respectively. Based on the above qualitative and quantitative analyses, it confirms that our proposed SLRTD method consistently outperforms some state-of-the-art methods and verifies the effectiveness of the proposed SLRTD method.
Figure 9. Quantitative comparison results with P-R curves and ROC curves.
Table 4. Comparison of AUC and MAE of different methods.

5. Conclusions

In order to detect small white-spot surface defects of galvanized strip steel accurately and rapidly, the SLRTD method is proposed in this paper. The nonlinear reweighting strategy based on Schatten p -norm is adopted to separate the defect image into a smooth non-defect background and random noise. A solution framework is proposed by the ADMM algorithm to obtain the defect target image. Based on the self-constructed defect dataset, experiments are conducted by the qualitative and quantitative method, which achieves the best performance among the state-of-the-art defect detection methods. In the future, we will focus on improving the ability of the proposed SLRTD method to detect the other tiny defects under weak illumination conditions or irregular defect-like texture surfaces. In the future, we will conduct binocular and structured-light stereo vision to obtain three-dimensional information of surface defects in industrial production.

Author Contributions

S.Z. designed the SLRTD model and performed the evaluation experiments. X.Y. collaborated closely and contributed equally to this work. H.L. arranged the datasets, as well as reviewed the article. C.G. developed the automatic optical inspection procedure. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China, grant number 52205537 and 51805386, the open fund of State Key Laboratory of Intelligent Manufacturing Equipment and Technology (Huazhong University of Science and Technology), grant number IMETKF2025024, and the Innovative Research Group Project of the National Natural Science Foundation of Hubei Province of China, grant number 2024AFA026.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The data are not publicly available due to privacy issues.

Acknowledgments

The authors would like to thank the editor and anonymous reviewers.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Diers, J.; Pigorsch, C. A survey of methods for automated quality control based on images. Int. J. Comput. Vision 2023, 131, 2553–2581. [Google Scholar] [CrossRef]
  2. Zhu, J.P.; He, G.H.; Zhou, P. MFNet: A novel multilevel feature fusion network with multibranch structure for surface defect detection. IEEE Trans. Instrum. Meas. 2023, 72, 1–11. [Google Scholar] [CrossRef]
  3. Cheng, G.; Yuan, X.; Yao, X.W.; Yan, K.B.; Zeng, Q.H.; Xie, X.X.; Han, J.W. Towards large-scale small object detection: Survey and benchmarks. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 13467–13488. [Google Scholar] [CrossRef]
  4. Li, J.; Wu, R.; Zhang, S.; Chen, Y.L.; Dong, Z.C. FASCNet: An edge-computational defect detection model for industrial parts. IEEE Internet. Things. 2023, 11, 6622–6637. [Google Scholar] [CrossRef]
  5. Luo, Q.W.; Chen, Y.W.; Su, J.J.; Yang, C.H.; Silvén, O.; Liu, L. Prior-guided YOLOX for tiny roll mark detection on strip steel. IEEE Sens. J. 2024, 24, 15575–15587. [Google Scholar] [CrossRef]
  6. Ameri, R.; Hsu, C.C.; Band, S.S. A systematic review of deep learning approaches for surface defect detection in industrial applications. Eng. Appl. Artif. Intel. 2023, 130, 107717. [Google Scholar] [CrossRef]
  7. Khanam, R.; Hussain, M.; Hill, R.; Allen, P. A comprehensive review of convolutional neural networks for defect detection in industrial applications. IEEE Access 2024, 12, 94250–94295. [Google Scholar] [CrossRef]
  8. Zou, Z.X.; Chen, K.Y.; Shi, Z.W.; Guo, Y.H.; Ye, J.P. Object detection in 20 years: A survey. Proc. IEEE 2023, 111, 257–276. [Google Scholar] [CrossRef]
  9. Jha, S.B.; Babiceanu, R.F. Deep CNN-based visual defect detection: Survey of current literature. Comput. Ind. 2023, 14, 103911. [Google Scholar] [CrossRef]
  10. Liu, G.H.; Chu, M.X.; Gong, R.F.; Zheng, Z.H. Global attention module and cascade fusion network for steel surface defect detection. Pattern Recogn. 2024, 158, 110979. [Google Scholar] [CrossRef]
  11. Zare, A.; Ozdemir, A.; Iwen, M.A.; Aviyente, S. Extension of PCA to higher order data structures: An introduction to tensors, tensor decompositions, and tensor PCA. Proc. IEEE 2018, 106, 1341–1358. [Google Scholar] [CrossRef]
  12. Sun, Y.; Yang, J.G.; An, W. Infrared dim and small target detection via multiple subspace learning and spatial-temporal patch-tensor model. IEEE T. Geosci. Remote. 2020, 59, 3737–3752. [Google Scholar] [CrossRef]
  13. Yu, Q.; Yang, M. Low-rank tensor recovery via non-convex regularization, structured factorization and spatio-temporal characteristics. Pattern Recogn. 2023, 137, 109343. [Google Scholar] [CrossRef]
  14. Wang, M.H.; Hong, D.F.; Han, Z.; Li, J.X.; Yao, J.; Gao, L.R.; Zhang, B.; Chanussot, J. Tensor decompositions for hyperspectral data processing in remote sensing: A comprehensive review. IEEE Geosci. Remote Sens. Mag. 2023, 11, 26–72. [Google Scholar] [CrossRef]
  15. Luo, Y.; Li, X.R.; Chen, S.H. 5-D spatial-temporal information-based infrared small target detection in complex environments. Pattern Recogn. 2024, 158, 111003. [Google Scholar] [CrossRef]
  16. Zeng, N.Y.; Wu, P.S.; Wang, Z.D.; Li, H.; Liu, W.; Liu, X.H. A small-sized object detection oriented multi-scale feature fusion approach with application to defect detection. IEEE Trans. Instrum. Meas. 2022, 71, 3507014. [Google Scholar] [CrossRef]
  17. Wen, X.; Shan, J.; He, Y.; Song, K. Steel surface defect recognition: A survey. Coatings 2023, 13, 17. [Google Scholar] [CrossRef]
  18. Li, D.; Li, Y.; Xie, Q.; Wu, Y.; Yu, Z.; Wang, J. Tiny defect detection in high-resolution aero-engine blade images via a coarse-to-fine framework. IEEE Trans. Instrum. Meas. 2021, 70, 1–11. [Google Scholar] [CrossRef]
  19. Zhang, H.Y.; Li, M.; Miao, D.Q.; Pedrycz, W.; Wang, Z.G.; Jiang, M.H. Construction of a feature enhancement network for small object detection. Pattern Recogn. 2023, 143, 109801. [Google Scholar] [CrossRef]
  20. Hou, X.Q.; Liu, M.Q.; Zhang, S.L.; Wei, P.; Chen, B.D. CANet: Contextual information and spatial attention based network for detecting small defects in manufacturing industry. Pattern Recogn. 2023, 140, 109558. [Google Scholar] [CrossRef]
  21. Yang, B.Y.; Liu, Z.Y.; Duan, G.F.; Tan, J.R. Residual shape adaptive dense-nested Unet: Redesign the long lateral skip connections for metal surface tiny defect inspection. Pattern Recogn. 2023, 147, 110073. [Google Scholar] [CrossRef]
  22. Dai, Y.; Wu, Y.Q. Reweighted infrared patch-tensor model with both nonlocal and local priors for single-frame small target detection. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2017, 10, 3752–3767. [Google Scholar] [CrossRef]
  23. Lu, C.Y.; Feng, J.S.; Chen, Y.D.; Liu, W.; Lin, Z.C.; Yan, S.C. Tensor robust principal component analysis with a new tensor nuclear norm. IEEE Trans. Pattern Anal. Mach. Intell. 2019, 42, 925–938. [Google Scholar] [CrossRef] [PubMed]
  24. Zhang, L.D.; Peng, Z.M. Infrared small target detection based on partial sum of the tensor nuclear norm. Remote Sens. 2019, 11, 382. [Google Scholar] [CrossRef]
  25. Gao, Q.X.; Zhang, P.; Xia, W.; Xie, D.Y.; Gao, X.B.; Tao, D.C. Enhanced tensor RPCA and its application. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 43, 2133–2140. [Google Scholar] [CrossRef]
  26. Chen, L.; Jiang, X.; Liu, X.Z.; Zhou, Z.X. Logarithmic norm regularized low-rank factorization for matrix and tensor completion. IEEE Trans. Image Process. 2021, 30, 3434–3449. [Google Scholar] [CrossRef]
  27. Luo, Y.; Li, X.R.; Yan, Y.F.; Xia, C.Q. Spatial-temporal tensor representation learning with priors for infrared small target detection. IEEE Trans. Aerosp. Electron. Syst. 2023, 59, 9598–9620. [Google Scholar] [CrossRef]
  28. Wang, H.L.; Peng, J.J.; Qin, W.J.; Wang, J.J.; Meng, D.Y. Guaranteed tensor recovery fused low-rankness and smoothness. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 10990–11007. [Google Scholar] [CrossRef]
  29. Geng, X.Y.; Guo, Q.; Hui, S.X.; Yang, M.; Zhang, C.M. Tensor robust PCA with nonconvex and nonlocal regularization. Comput. Vis. Image Und. 2024, 243, 104007. [Google Scholar] [CrossRef]
  30. Huang, Z.X.; Zhao, E.W.; Zheng, W.; Peng, X.D.; Niu, W.L.; Yang, Z. Infrared small target detection via two-stage feature complementary improved tensor low-rank sparse decomposition. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2024, 17, 17690–17709. [Google Scholar] [CrossRef]
  31. Liu, C.T.; Wang, H. Research on infrared dim and small target detection algorithm based on low-rank tensor recovery. J. Syst. Eng. Electron. 2023, 34, 861–872. [Google Scholar] [CrossRef]
  32. Guo, J.; Wu, Y.Q.; Dai, Y.M. Small target detection based on reweighted infrared patch-image model. IET Image Process. 2018, 12, 70–79. [Google Scholar] [CrossRef]
  33. Xie, Y.; Gu, S.H.; Liu, Y.; Zuo, W.M.; Zhang, W.S.; Zhang, L. Weighted Schatten p-norm minimization for image denoising and background subtraction. IEEE Trans. Image Process. 2016, 25, 4842–4857. [Google Scholar] [CrossRef]
  34. Sun, Y.; Yang, J.G.; Li, M.; An, W. Infrared small target detection via spatial-temporal infrared patch-tensor model and weighted Schatten p-norm minimization. Infrared Phys. Techn. 2019, 102, 103050. [Google Scholar] [CrossRef]
  35. Zuo, W.M.; Meng, D.Y.; Zhang, L.; Feng, X.C.; Zhang, D. A generalized iterated shrinkage algorithm for non-convex sparse coding. In Proceedings of the 2013 IEEE International Conference on Computer Vision 2013, Sydney, NSW, Australia, 1–8 December 2013; IEEE: Piscataway, NJ, USA, 2014; pp. 217–224. [Google Scholar] [CrossRef]
  36. Xie, D.Y.; Yang, M.; Gao, Q.X.; Song, W. Non-convex tensorial multi-view clustering by integrating l1-based sliced-Laplacian regularization and l2,p-sparsity. Pattern Recognit. 2024, 154, 110605. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.