Unsupervised Classification of Surface Defects in Wire Rod Production Obtained by Eddy Current Sensors

Saludes-Rodil, Sergio; Baeyens, Enrique; Rodríguez-Juan, Carlos P.

doi:10.3390/s150510100

Open AccessArticle

Unsupervised Classification of Surface Defects in Wire Rod Production Obtained by Eddy Current Sensors

by

Sergio Saludes-Rodil

^1,*,

Enrique Baeyens

² and

Carlos P. Rodríguez-Juan

³

¹

Centro Tecnológico CARTIF, Parque Tecnológico de Boecillo 205, 47151 Boecillo, Valladolid, Spain

²

Instituto de las Tecnologías Avanzadas de la Producción, Universidad de Valladolid, Paseo del cauce 59, 47011 Valladolid, Spain

³

ISEND S.A., Parque Tecnológico de Boecillo, Luis Proust 10, 47151 Boecillo, Valladolid, Spain

^*

Author to whom correspondence should be addressed.

Sensors 2015, 15(5), 10100-10117; https://doi.org/10.3390/s150510100

Submission received: 5 March 2015 / Revised: 20 April 2015 / Accepted: 22 April 2015 / Published: 29 April 2015

(This article belongs to the Section Physical Sensors)

Download

Browse Figures

Versions Notes

Abstract

: An unsupervised approach to classify surface defects in wire rod manufacturing is developed in this paper. The defects are extracted from an eddy current signal and classified using a clustering technique that uses the dynamic time warping distance as the dissimilarity measure. The new approach has been successfully tested using industrial data. It is shown that it outperforms other classification alternatives, such as the modified Fourier descriptors.

Keywords:

dynamic time warping; cluster analysis; modified Fourier descriptors; unsupervised classification; wire rod manufacturing; eddy current inspection; nondestructive testing

1. Introduction

Wire rods made using the hot rolling technique can present surface defects. Several techniques have been applied to detect the surface defects that appear during wire rod manufacturing. Approaches based on image processing have been proposed in [1,2]. Alternatively to computer vision-based techniques, the eddy current nondestructive technique is effectively used to detect surface defects [3]. The basic instrument for eddy current inspection is a coil fed with an alternating electric current. The complex impedance of the coil Z₀ changes in accordance with the eddy current redistribution due to material defects or inhomogeneities [4,5].

Besides detection, defect classification is of industrial interest, and a great deal of research has been devoted to this issue. Many approaches are based on signal processing or shape characterization followed by a supervised classifier. This implies the use of labeled defect sets covering all possible defect types. A complete knowledge base that includes as many examples as possible of every type of possible defect is crucial to develop a good classification procedure. This is a serious drawback, because it is not always easy nor even possible in industrial practice to collect a number of examples large enough to build a useful knowledge base.

There is an interest in developing automated eddy current-based inspection systems able to detect and classify defects. However, the lack of adequate defect collections in hot rolling industrial plants prevents the design of supervised defect classifiers. Motivated by this fact, we proposed an unsupervised classifier that can aid the plant operators to build their knowledge bases and classify and analyze the surface defects appearing in their products.

The rest of the paper is organized as follows. In Section 2, the problem of the classification of surface defects obtained by eddy current supervision in the manufacturing of a wire rod is described and formulated. A revision of related work and the main contributions in our work are also included. In Section 3, several methods of unsupervised classification are explained, and the normalized dynamic time warping distance that will be used as a dissimilarity metric between defect sequences is also introduced. The results of an experiment using real data obtained during the production of the wire rod are reported in Section 4. Section 5 contains the discussion of the results. Finally, some conclusions are given in Section 6.

2. Description of the Problem

2.1. Eddy Current Inspection in Wire Rod Manufacturing

Wire rods are an intermediate steel product of approximately a round solid cross section that is wound into coils and transported in this form. It is primarily used for subsequent drawing and finishing by wire drawers and is ultimately used to manufacture a variety of products, including electric welded chains, cold-drawn bars, springs, nails, reinforcing wire mesh, chain link fence and many different types of wires. Bar and wire rods are produced by hot rolling, and surface defects can appear on rods during the manufacturing process. These defects can be detected by an eddy current inspection system.

The layout of a wire rod mill and its inspection system are schematically represented in Figure 1. The eddy current probe is placed at the end of the mill process and operates when the wire is still hot, at around 927 °C, depending on the material.

The probe used belongs to the differential class and was managed by an ISEND HOTanalyzer system. The authors would like to not disclose the operation details due to confidentiality reasons. The methods presented in this paper are independent of the probe operational parameters.

The eddy current inspection system directly acquires impedance measurements from the product line. It isolates the recorded parts where the impedance of the coil probe experiences a change that corresponds to a surface defect.

The inspection signal is a sequence of measurements of the complex impedance:

Z_{0} (t) = x (t) + j y (t)

(1)

A surface defect is a finite subsequence of the complex impedance signal Z₀(t).

The industrial supervision system of Figure 1 is continuously collecting measurements and produces a large amount of data that are not possible to analyze by a human operator without the aid of some automatic computer system. Our solution is motivated by the requirements of the operators in a real manufacturing plant and consists of an automatic system that analyzes the eddy current signal recorded during a production batch and extracts every subsequence of interest where the impedance changes. These subsequences are the collection Σ of surface defects to be classified.

The defect collection Σ is classified offline using unsupervised classification methods, and the results are provided to the operators for their posterior analysis. Hence, our solution avoids a very tedious, unpractical and almost unfeasible human classification process and can be considered as an initial step towards an online unsupervised classification system.

2.2. Problem Formulation

In order to formalize the problem, the impedance measurement at time t is assumed to belong to a metric space ( Sensors 15 10100i1 , d) where is the underlying set and d is the distance on that set. In our case, the set can be either the complex plane ℂ or the Euclidean plane R², which are mathematically equivalent. Let Sensors 15 10100i2 ( ) denote the set of every finite sequence on . Hence, any defect is modeled as an element of ( Sensors 15 10100i1 ), and the collection of defects is given by:

\sum = {x_{k} \in S (F) : k = 1, \dots, K}

(2)

The problem of interest can be posed as follows: given a collection Σ of unlabeled defects corresponding to an unknown number K of defect types, determine the number K and find a partition {Σ_k : k = 1,…, N}, such that each defect in the subset Σ_k belongs to the same defect type. A partition {Σ_k : k = 1,…, K} of Σ satisfies that Σ_k ⊂ Σ for any k, Σ_j ∩ Σ_k = 0 for j ≠ k and $\cup_{k = 1}^{K} \sum_{k} = \sum$ .

The main difficulties in this problem are that the defects are unlabeled, and the different types of defects are unknown in advance. Besides, the sequences representing individual defects have different lengths, even for defects belonging to the same class. The features characterizing different defect classes are related to the shape and orientation of the polar graph of the complex impedance, but they are independent of the length and scale. These difficulties motivate the use of unsupervised classification methods and the definition of a metric for sequences of different lengths that allows the right classification of defects. The unsupervised classification methods considered are the K-medoids clustering algorithm and the evolving self-organizing map algorithm. As a metric for the dissimilarity of defect sequences, the normalized dynamic time warping distance is used.

2.3. Related Work

Classification algorithms for eddy current testing can be arranged into two main categories:

Signal processing based: Eddy current signals are processed in order to extract some characteristics from them, allowing differentiating among defect types, as in [6–11].
Shape based: The shape of the eddy current signals in the impedance plane are processed to find out the contours or appearances associated with every type of defect, as in [12–15].

Signal processing-based techniques are the most commonly found in the literature. Every technique in this class processes the eddy current signals in order to obtain a finite set of numeric values that unequivocally characterizes every type of defect.

Time-frequency transformation, like wavelet analysis, has been extensively used to process eddy current signals. In [6], several applications to detect defects in nuclear power generation components are reported. In [7], the wavelet analysis is used to enhance the eddy current signals prior to defect detection. Time domain methods have also been proposed [8]. The Hilbert transform [9] and the principal component analysis [10] are some of the techniques used to extract features from the eddy current signals. Furthermore, neural networks have been applied in [11].

Contrarily to signal processing based methods, there are other methods that rely on the shape that the impedance takes in the complex plane. Most of these methods are based on the modified Fourier descriptors [16], which are used to describe closed curves by a finite set of numerical features. This technique is briefly described in Section 3. Modified Fourier descriptors have been applied to classify eddy current signals in [12–15].

2.4. Main Contributions

The main contribution in this paper is an efficient unsupervised method for classifying surface defects in the manufacturing of wire rods using eddy current inspection. This method comprises two key elements. First, a new defect dissimilarity measure for eddy current signals is introduced. This measure uses the normalized dynamic time warping (DTW). Second, a clustering approach that uses the DTW distance is applied in an unsupervised way. The K-medoids clustering algorithm has been successfully tested. In addition, an evolving self-organizing map (ESOM) has been applied to obtain a set of defect prototypes that are later classified using the K-medoids clustering algorithm. A defect is classified by the cluster corresponding to the closest prototype in the DTW distance. The ESOM also uses the normalized DTW measure, and its goal is to obtain a parsimonious representation of the defects collection Σ that can be preserved from a production shift to the next one. The ESOM is evolving with any new defect, but the clustering process is accomplished only once for each shift. The techniques used in this paper are not new; however, to the knowledge of the authors, they have not been previously used in conjunction to classify surface defects in wire rod manufacturing. The resultant approach has been demonstrated to be very effective and outperforms other alternatives based on modified Fourier descriptors that have been extensively used in feature extraction of signal obtained by eddy current inspection.

3. Methods

3.1. Modified Fourier Descriptors

Let x ∈ Sensors 15 10100i2 (ℂ) be a finite sequence of complex numbers representing the impedances corresponding to a surface defect. Let N = |x| be the length of the sequence, then:

x = {x_{k} : k = 1, \dots, N}

(3)

The sequence x can be equivalently represented by the Fourier descriptors ${f_{k} : k = - \frac{N}{2} + 1, \dots, \frac{N}{2}}$ , which are the coefficients of the Fourier transform of x:

x_{k} = \sum_{ℓ = - \frac{N}{2} + 1}^{\frac{N}{2}} f_{k} e^{j 2 π k ℓ / N}

(4)

with:

f_{k} = \frac{1}{N} \sum_{ℓ = 0}^{N} x_{k} e^{- j 2 π k ℓ / N}

(5)

The defect shape in the impedance plane is completely described by the Fourier descriptors f_k. However, they are sensitive to signal transformations, such as translation, scale change and reverse description.

An alternative description is proposed in [16] to overcome this drawback. It consists of using nonlinear combinations of the Fourier descriptors:

b_{k} = \frac{f_{1 + k} f_{1 - k}}{f_{1}^{2}} with k = 2, 3, \dots, \frac{N}{2} - 1

(6)

b_{1} = \frac{f_{2} | f_{1} |}{f_{1}^{2}}

(7)

These are the Grandlun's modified Fourier descriptors. They contain information about shape and are invariant under translation and scale change. Only b₁ is sensitive to rotation, which provides information about the overall defect phase.

The main disadvantage of Grandlun's modified Fourier descriptors is that they are affected by reverse description, i.e., their value depends on the direction that the defect passes through the probe. To avoid this problem, a modified formulation is proposed in [13]:

d_{k} = \frac{f_{k} f_{- k}}{| f_{1} f_{- 1} |} with k = 1, 2, \dots, \frac{N}{2} - 1

(8)

The Oukhellou modified Fourier descriptors contain information about the shape of the defect and are invariant under translation, scale change and reverse description. Besides, they are also sensitive to rotation changes, so they provide information about the defect phase.

3.2. Dynamic Time Warping

Dynamic time warping (DTW) [17,18] is a well-known technique to obtain the optimal alignment between two given time-dependent sequences under certain restrictions. Intuitively, the sequences are warped in a nonlinear fashion to match each other.

Let x,y ∈ Sensors 15 10100i2 (ℝ²) be two sequences of length N = |x| and M = |y|, respectively, where d is the Euclidean distance in ℝ². In order to align these sequences using DTW, a matrix N-by-M is constructed. The element (i,j) of this matrix contains the Euclidean distance d (x_i, y_j) between the two points x_i ∈ x and y_j ∈ y. A warping path w is a finite sequence of K pairs of natural numbers w ≔ {w_k ∈ ℕ × ℕ: k = 1,2,…, K} satisfying the following conditions:

Path length: the length of the warping path is bounded by :

$max {N, M} \leq L \leq N + M - 1$

(9)
Boundary condition: the initial and final values of the warping path are given by :

$\begin{array}{l} w_{1} = (1, 1), & w_{L} = (N, M) \end{array}$

(10)
Step size condition: the warping path cannot increase more than one in each dimension :

$\begin{array}{r} w_{k + 1} - w_{k} \in {(0, 1), (1, 0), (1, 1)} \\ k = 1, \dots, L - 1 \end{array}$

(11)

Let Sensors 15 10100i3 (x, y) denote the set of all possible warping paths for two finite sequences x and y of elements of the set Sensors 15 10100i1 . The distance D (w; x, y) of the sequences x and y with respect to the warping path w ∈ (x, y) is defined as:

D (w; x, y) : = \sum_{ℓ = 1}^{| w |} d (x_{w_{ℓ, 1}}, y_{w_{ℓ, 2}})

(12)

Furthermore, an optimal warping path for the sequences x and y is a warping path w* ∈ Sensors 15 10100i3 (x, y) having minimal distance for those sequences with respect to all possible warping paths. The DTW distance D*(x, y) between the sequences x and y is then defined as the distance of those sequences with respect to an optimal warping path:

D^{*} (x, y) = min {D (w; x, y) : w \in W (x, y)}

(13)

The optimal path is computed by applying dynamic programming to Equation (12) that defines the distance with respect to the warping path.

The DTW distance is sensitive to the length of the sequences. Since the DTW distance is usually applied to sequences of different lengths, it can be normalized dividing by the length of the optimal warping path. The normalized DTW distance between two finite length sequences x,y ∈ Sensors 15 10100i2 ( Sensors 15 10100i1 ) is defined as:

Δ (x, y) = K^{- 1} D^{*} (x, y)

(14)

where K is the length of the optimal warping path, i.e., K = |w|.

An algorithm that computes the normalized DTW distance is given in Algorithm 1.


Algorithm 1. Normalized DTW distance.

Let x, y ∈ be two finite sequences of lengths N = \|x\| and M = \|y\| , respectively. The normalized DTW distance Δ(x, y) is computed as follows:
1.	Initialize δ_(0,0) = 0, δ₍_n_,0) = ∞ for n = 1,…, N and δ_(0,_m₎ = ∞ for m = 1,…, M and compute the terms of the DTW matrix, using the difference equation:
	δ₍_n_, _m₎ = min {δ₍_n_−1,_m₎, δ₍_n_,_m₋₁₎, δ₍_n_−1,_m₋₁₎} + d(x_n, x_m)
	for (n, m) ∈ {1,…, N} × {1,…, M}.
2.	Initialize ℓ = 0, v_ℓ = (N, M) and compute the sequence v ∈ ℕ² as follows:
	while [v_ℓ ≠ (1,1)], repeat:
	ℓ = ℓ + 1,
	v_ℓ ∈ argmin{δ_v : v ∈ {v_ℓ₋₁ − (1,0), v_ℓ₋₁ − (0,1), v_ℓ₋₁ − (1,1)}}
	end while loop
	then K = \|v\| and w* = {w_ℓ : w_ℓ = s_K₋_ℓ ₊₁(v)}.
3.	The normalized DTW distance between the sequences x and y is Δ(x, y) = k⁻¹δ₍_N_,_M₎.

3.3. The K-Medoids Algorithm

Clustering methods are used to classify a collection of objects Σ into different classes without human intervention. A well-known hard clustering method is the given by the K-medoids algorithm [19]. Each cluster is represented by a vector selected among the elements Σ, which is a set of sequences to be classified into K groups. The representative element of each class is called a medoid. Apart form its medoid, each cluster contains all sequences in Σ that are not used as medoids in other clusters and lie closer to its medoid than to the medoids representing the other clusters. An algorithm to perform K-medoids clustering is given in Algorithm 2.

3.4. The Evolving Self-Organizing Map

The evolving self-organizing map (ESOM) [20,21] is used to obtain a parsimonious representation of a given set of elements Σ in terms of a reduced number of prototype elements and certain relationships between them. The ESOM is an evolving version of the self-organizing map (SOM). The main differences are that no topological constraint is given a priori for the feature map and that prototype elements are not organized onto a lattice. The ESOM is represented by a graph, where each prototype element is a node or vertex, and the relationships are represented by edges of different weights. The ESOM provides a preserving topology representation of the input space in terms of a reduced number of defect prototypes. This representation contains the relevant information about the defect classes that is preserved among production shifts.


Algorithm 2. K-medoids algorithm.

Let Δ(x, y) denote the distance between two elements x, y ∈ Σ.
1.	Choose an arbitrary partition _k = {Σ_k : k = 1,…, K} of Σ and an arbitrary set of medoids Λ = {m(Σ_k) ∈ Σ_k : k = 1,…, K}.
2.	For every Σ_k ∈ _K, compute the elements that are wrongly classified, i.e., x ∈ Σ_k satisfying Δ(x, m(Σ_j)) < Δ(x, m(Σ_k)) for Σ_j ≠ Σ_k. For these elements, update the partition as follows:
	$\sum_{k}^{'} = \sum_{k} - {x}$ and $\sum_{j}^{'} = \sum_{j} + {x}$ . The resultant partition is $P_{K}^{'} = {\sum_{k}^{'} : k = 1, \dots, K}$ .
3.	Obtain the medoids set $\land^{'} = {m (\sum_{k}^{'}) : \sum_{k}^{'} \in P_{K}^{'}}$ for the new partition $P_{K}^{'}$ by solving the following K optimization programs:
	$m (\sum_{k}^{'}) \in arg min {max {γ_{k} : Δ (x, y) \leq γ_{k}, y \in \sum_{k}^{'}} : x \in \sum_{k}^{'}}, \sum_{k}^{'} \in P_{K}^{'}$
4.	If the medoids set does not change, i.e., if Λ′ = Λ, then the clustering process is completed. Otherwise, do Λ = Λ′, and go to Step 2.

The ESOM network starts without any vertex. During learning, the network is updated to capture the on-line incoming data, creating new nodes and

edges when necessary. Edges are used to maintain the neighborhood relationships between close nodes. The connection strength is determined by the distance between connected nodes. If the distance is large, the edge weight is weak and it can be disregarded. In this way, the feature map can be split apart, and data structures, such as clusters and outliers can emerge.

The ESOM network is characterized by a triplet:

N = (V, E, s)

(15)

where

⊂

(ℝ²) is the vertex set containing the prototype nodes, Sensors 15 10100i6

⊂

×

is the edge set and s : Sensors 15 10100i6

→ ℝ is a function that provides the edge weights. For a set of defects Σ, the ESOM is obtained by applying an iterative algorithm with a set of parameters Sensors 15 10100i4

= {ε, σ, γ, τ}. The parameter ϵ controls the distance between different prototypes; γ is the learning rate; σ controls the spread of neighborhood; and τ is used for the preservation of the weakest connections. Usually σ = ϵ [20]. The learning process can be summarized in Algorithm .


Algorithm 3. Evolving self-organizing map.

1.	Start with k = 0, = ∅, = ∅.
2.	Choose a new x ∈ Σ and compute: $V_{m} (x) = {y \in V : Δ (x, y) < ϵ}$ (16) If _m(x) = ∅ go to 4.
3.	Update: $V^{'} = V \cup {x}$ (17) $E^{'} = E \cup {(x, y_{1}), (x, y_{2})}$ (18) where: $Δ (x, y_{1}) = min {y : y \in V}$ (19) $Δ (x, y_{2}) = min {y : y \neq y_{1}, y \in V}$ (20) and go to 5.
4.	Let y* be such that: $Δ (x, y^{}) = min {Δ (x, y) : y \in V_{m}}$ (21) and (y) = {y : (y,y) ∈ }. Update: $V^{'} = {ϕ (y) : y \in V}$ (22) Where: $ϕ (y) = {\begin{array}{l} (1 - α) y + α x & y \in {y^{} \cup N (y^{*})} \\ y & otherwise \end{array}$ (23) and: $α = γ e^{- {\| Δ (z, x) \|}^{2} / 2 σ^{2}}$ (24)
5.	Update the connection strengths as follows: $s (y_{i}, y_{j}) = ϵ / Δ (y_{i}, y_{j})$ (25) for any (y_i, y_j) ∈ ′ × ′ and i ≠ j.
6.	Do k′ = k + 1; if mod (k′, τ) = 0, remove the weakest connection.
7.	Do k = k′, = and = , and go to 2.

The distance between sequences Δ(x, y) is obtained using the DTW. Besides, since the sequences have different lengths, the sum operation in Equation (23) is not trivial, but it can be computed using the warping path. If x and y are sequences in Sensors 15 10100i2 (ℝ²) with DTW distance Δ(x, y) and warping path w of length L, then the sum sequence z = x + y is a sequence of length L:

z = {z_{i} : i = 1, \dots, L}

(26)

where:

\begin{array}{l} z_{i} = x_{w_{1, i}} + y_{w_{2, i}}, & i = 1, \dots, L \end{array}

(27)

The ESOM learning process is continuous and lasts indefinitely, so strict convergence of the algorithm is not a critical issue.

Clustering with the ESOM is accomplished over the prototype defects contained in the vertex set Sensors 15 10100i5 . The K-medoids clustering algorithm can also be applied.

4. Experimental Results

4.1. Data Description

The operators of a manufacturing plant of wire rods identified and labeled the surface defects obtained for several production shifts. This has been a very tedious and time-consuming task, because it required unwinding long wire rod coils, searching the surface defects by visual inspection and classifying and putting them in correspondence with the signal recorded by the eddy current inspection system. After this manual process, a collection of labeled defects is available for validation of the developed unsupervised classification method. The surface defects have been classified by the experts into four different classes. The corresponding eddy current signals associated with them have been represented in the complex impedance plane and labeled as defects belonging to Classes A, B, C and D, respectively. An individual sequence representing each of these groups is depicted in Figure 2. The length of the available labeled sequences ranges between 101 and 996 samples. Samples of the defect classes are shown in Figure 3.

From a morphological viewpoint, Classes A and B feature lobes spreading across the second and fourth quadrants. Defects in Class A have more than two lobes, while defects in Class B exhibit exactly two lobes. Defects belonging to Classes C and D have only one lobe. The lobe of Class C defects elongates along the right side of the impedance plane, while the lobe of Class D defects goes through the left one. In our testing experiment, there are mime defects of Type A, 51 of Type B, 19 of Type C and 16 of Type D.

Two different approaches have been applied to this classification problem. Both of them are unsupervised classification approaches, as an alternative to the supervised approaches found in the literature; see Section 2. One of the approaches is based on the MFDand the other on the normalized DTW distance. Our results demonstrate that the method based on the DTW distance outperforms that based on MFD for this application.

4.2. Unsupervised Classification with Modified Fourier Descriptors

Unsupervised classification using the MFD is accomplished in two steps. The first one consists of computing the MDF for every defect according to Equation (8). In the second step, a clustering algorithm is applied to the MFD obtained in the first step.

The MFDs have been preprocessed through principal component analysis (PCA). The MFDs until the order 30 were computed, and the PCA analysis revealed that the two first principal components retained 99.63% of the variance. Several clustering algorithms were tried, but only spectral clustering [22] and ESOM-based clustering produced satisfactory results.

The adjacency matrix used as the starting point for spectral clustering has been computed over a k nearest neighbors (k-NN) similarity graph [22] with k = 15. The algorithm used is the normalized version, and the confusion matrix is presented in Figure 4. The silhouette index [19] is Sensors 15 10100i8 = 0.321. The silhouette index is a quantitative method of evaluating the results of a clustering process. It was proposed by Russeeuw in [23]. The confusion matrix shows that this method is not capable of discriminating defects in Classes C and D. Moreover, Class B is split into two different clusters, and one of the sequences is mixed with Class A, which is rightly assigned to a cluster.

The ESOM parameters are selected as σ = ϵ = 0.025, γ = 0.05 and τ = 10. The most critical parameter is ϵ and was found empirically, while γ has a small influence. The clustering method is based on computing the minimum spanning tree of the graph shaped by the prototypes and their connections. Prototypes in the same cluster are those that remain linked when inconsistent edges in the minimum spanning tree are removed. An edge is inconsistent when its weight is at least twice the mean of the weights associated with the other edges. The amount of edges averaged is chosen to maximize the silhouette index. This method can be considered as a gestalt clustering approach [24]. The original defects are clustered according to the closest prototype.

The confusion matrix shown in Figure 5 summarizes the results. Seven clusters have been found, but two of them are negligible, because they contain only one element. Defects in Classes A and B are mainly assigned to Clusters C1 and C2, respectively. Defects in Class C are assigned to Cluster C4. Most of the defects in Class D are also assigned to Cluster C4. Only five defects from this class are assigned to Cluster C5. The value of the silhouette index is Sensors 15 10100i8 = 0.348.

4.3. Unsupervised Classification with DTW

Two different unsupervised classification methods based on the normalized DTW distance have been developed and tested. The first one directly applies the K-medoids algorithm over the defect set, while the second one applies the K-medoids algorithm to the prototypes obtained by the ESOM.

4.3.1. Results with DTW and K-Medoids

The pairwise DTW distance between all of the defects in the dataset has been computed, and the K-medoids algorithm was applied to discover the underlying defect classes.

To find out the number of clusters, different values of K ∈ {2,…, 10} have been tried out. The K value with the highest global silhouette value is selected as the number of defect classes. Then, the K-medoids algorithm is applied. This clustering algorithm is sensitive to the initialization, which is performed by randomly choosing K defect classes. To minimize this effect, the algorithm is executed 100-times for every K value.

Four classes are found, and their medoids are depicted in Figure 6. Medoid 1 has only one lobe spreading across the right side of the complex impedance plane. Medoid 2 has two lobes in the first and third quadrant. Medoid 3 exhibits more than two lobes in the same quadrants. Finally, Medoid 4 has only one lobe in the left side of the impedance plane. The medoid shapes agree with the representative elements of each defect class shown in Figure 2. The global silhouette index is Sensors 15 10100i8 = 0.597.

The labels assigned by the clustering algorithm are arbitrary. Looking at the resulting medoids, it is evident that Medoid 1 correspond to Class C, Medoid 2 to Class B, Medoid 3 to Class A and Medoid 4 to Class D. It is possible to rearrange the label names and compute a confusion matrix. The confusion matrix is shown in Figure 7. It can be seen that the unsupervised classifier is capable of gathering in the same cluster all pf the defects belonging to the same class with no error.

4.3.2. Results with DTW and ESOM

In this final case, the K-medoids algorithm is applied to the prototypes obtained by the ESOM. The parameters used in the ESOM were σ = ϵ = 0.025, γ = 0.01, and τ = 10. They were empirically found. As before, the number of clusters is selected by maximizing the silhouette index. It is of importance to note that once the prototypes have been clustered, the defects are assigned to a cluster according to the closest prototype. The confusion matrix is shown in Figure 8, and the global silhouette index is Sensors 15 10100i8 = 0.597.

Temperature influence has not been considered, because it did not change during experimentation, so the possible influence on system behavior could not be studied. The processing speed changes in a natural way during production. Since the wire is pulled by the forming coil, the speed increases linearly along time. Due to a constant sampling rate, the signal associated with a defect shrinks as speed increases. The DTW deals with this effect by nature provided the sampling rate is high enough to allow a precise shape reconstruction.

5. Discussion

The results obtained show that both DTW-based methods, with ESOM and without ESOM processing, are capable of classifying the defects in an unsupervised fashion without error. Moreover, the methods that apply the normalized DTW distance outperform the MFD-based methods for the problem of classifying surface defects in the wire rod obtained by a eddy current inspection system. A schematic representation of the methods used is shown in Figure 9. The ESOM is very sensitive to the value given to ϵ. For instance, if ϵ = 0.035, the algorithm merges Classes C and D. The reason is that the ϵ parameter controls the number of prototypes and the distances between them. If ϵ is large, the number of prototypes is small and the distance is large. Hence, two clusters can merge into one. Since the class population is unbalanced, a small value of ϵ is needed to ensure that every class has enough prototypes. For ϵ = 0.025, the number of prototypes is 85, which is close to the total number of defects.

The inclusion of the ESOM processing in the DTW-based clustering algorithm presents an important advantage. ESOM is an on-line learning method, which is able to adapt the prototypes each time that a new defect is processed. Thus, a parsimonious representation of the historical surface defects is encoded in the ESOM network by a number of prototypes and their connections. Hence, a large database of defects need not be stored.

6. Conclusions

An efficient new unsupervised method for classifying surface defects in wire rod manufacturing has been developed. The defects are obtained by an eddy current inspection system. The new method is based on the DTW distance, which is used to measure the dissimilarity between the defects and uses an evolving self-organizing map to obtain a representative set of defect prototypes for each production shift. These prototypes are later classified using a K-medoids clustering algorithm.

The performance of the new method was demonstrated using a collection of real defects obtained in a manufacturing plant. This collection of defects was labeled by experts. The proposed method outperforms the classification methods based on modified Fourier descriptors that have also been applied to classify eddy current signals.

The developed method was conceived of as a computer tool to be applied offline, after a production shift, and to help the plant operators to automatically discover and classify the possible surface defects in the manufactured product. The DTW properties allows the method to deal with possible changes in production speed and the different sizes of defects belonging to the same class.

Acknowledgements

This work has been partially supported by the Spanish Ministry of Economy and Competitiveness through the INNPACTO program, Project Code IPT–2012–0755–420000. The authors would like to thank Bruno Chedal Anglay and Karen Gallardo, who are with Global Steel Wire S.A., for granting access to data and providing technical support in labeling defects.

Author Contributions

Sergio Saludes-Rodil proposed the idea. Enrique Baeyens and Sergio Saludes-Rodil arranged the methods. Carlos P. Rodíguez-Juan gathered and prepared the data. All of the authors analyzed the results and revised the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

Li, J.; Shi, J.; Chang, T. On-line seam detection in rolling processes using snake projection and discrete wavelet transform. J. Manuf. Sci. Technol. 2007, 129, 926–933. [Google Scholar]
Yun, J.P.; Choi, D.; Jeon, Y.; Park, C.; Kim, S.W. Defect inspection system for steel wire rods produced by hot rolling process. Int. J. Adv. Manuf. Technol. 2014, 70, 1625–1634. [Google Scholar]
Blitz, J. Electrical and Magnetic Methods of Non-Destructive Testing; Springer: Berlin/Heidelberg, Germany, 1997; Volume 3. [Google Scholar]
Rajesh, S.N.; Udpa, L.; Udpa, S.S. Numerical model based approach for estimating probability of detection in NDE applications. IEEE Trans. Magn. 1993, 29, 1857–1860. [Google Scholar]
McMaster, R.C. Nondestructive Testing Handbook; American Society for Nondestructive Testing, Ronals Press: New York, NY, USA, 1959. [Google Scholar]
Grman, J.; Ravas, R.; Syrova, L. Application of wavelet transformation in eddy current testing of steam generator tubes. Proceedings of the IEEE Instrumentation and Measurement Technology Conference, Budapest, Hungary, 21–23 May 2001.
Ma, X.; Peyton, A.J. Feature detection and monitoring of eddy current imaging data by means of wavelet based singularity analysis. NDT E Int. 2010, 43, 687–694. [Google Scholar]
Udpa, L.; Ramuhalli, P.; Benson, J.; Udpa, S. Automated analysis of eddy current signals in steam generator tube inspection. Proceedings of the 16th WCNDT 2004—World Conference on NDT, Montreal, Canada, 30 August–3 September 2004.
Chen, T.; Tian, G.Y.; Sophian, A.; Que, P.W. Feature extraction and selection for defect classification of pulsed eddy current NDT. NDT E Int. 2008, 41, 467–476. [Google Scholar]
Sophian, A.; Tian, G.Y.; Taylor, D.; Rudlin, J. A feature extraction technique based on principal component analysis for pulsed eddy current NDT. NDT E Int. 2003, 36, 37–41. [Google Scholar]
De Mesquita, R.N.; Ting, D.K.S.; Cabral, E.L.L.; Upadhyaya, B.R. Classification of steam generator tube defects for real-time applications using eddy current test data and Self-Organising Maps. Real-Time Syst. 2004, 27, 49–70. [Google Scholar]
Lord, W.; Satish, S.R. Fourier Descriptor Classification of Differential Eddy Current Probe Impedance Plane Trajectories. In Review of Progress in Quantitative Nondestructive Evaluation; Springer-Verlag US: La Jolla, CA, USA, 1984; Volume 3A, pp. 589–603. [Google Scholar]
Oukhellou, L.; Aknin, P. Modified Fourier Descriptors: A new parametrisation of eddy current signatures applied to the rail defect classification. Proceedings of the III International Workshop on Advances in Signal Processing for Non Destructive Evaluation of Materials, Québec City, Quebec, Canada, 5–8 August 1997.
Lingvall, F.; Stepinski, T. Automatic detecting and classifying defects during eddy current inspection of riveted lap-joints. NDT E Int. 2000, 33, 47–55. [Google Scholar]
Smid, R.; Docekal, A.; Kreidl, M. Automated classification of eddy current signatures during manual inspection. NDT E Int. 2005, 38, 462–470. [Google Scholar]
Granlund, G.H. Fourier Preprocessing for Hand Print Character Recognition. IEEE Trans. Comput. 1972, 21, 192–201. [Google Scholar]
Sakoe, H.; Chiba, S. Dynamic Programming Algorithm Optimization for Spoken Word Recognition. IEEE Trans. Acoust. Speech Signal Process. 1978, ASSP-26, 43–49. [Google Scholar]
Ten Holt, G.A.; Reinders, M.J.T.; Hendriks, E.A. Multi-dimensional dynamic time warping for gesture recognition. Proceedings of the Thirteen Annual Conference of the Advanced School for Computing and Imaging, Zeewolde, The Netherlands, 13–15 June 2007.
Theodoridis, S.; Koutrumbas, K. Pattern Recognition, 4th ed.; Academic Press: Waltham, MA, USA, 2009. [Google Scholar]
Deng, D.; Kasabov, N. Evolving Self-organizing Maps for On-line Learning, Data Analysis and Modelling; The Information Science Discussion Paper Series, University of Otago: Dunedin, New Zealand, 2000. [Google Scholar]
Deng, D.; Kasabov, N. On-line pattern analysis by evolving self-organizing maps. Neurocomputing 2003, 51, 87–103. [Google Scholar]
Von Luxburg, U. A tutorial on spectral clustering. Stat. Comput. 2007, 17, 395–416. [Google Scholar]
Rousseeuw, PJ. Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 1987, 20, 53–65. [Google Scholar]
Zahn, C.T. Graph-Theoretical Methods for Detecting and Describing Gestalt Clusters. IEEE Trans. Comput. 1971, C-20, 68–86. [Google Scholar]

Figure 1. Layout of a wire rod mill with the eddy current inspection system.

Figure 2. Individual defects belonging to each class represented in the complex impedance plane.

Figure 3. Macro photography of individual defects belonging to each class.

Figure 4. Confusion matrix for MFD and spectral clustering.

Figure 5. Confusion matrix for MFD and ESOM-based clustering.

Figure 6. Resultant medoids represented in the complex impedance plane.

Figure 7. Confusion matrix for DTW-based clustering.

Figure 8. Confusion matrix for DTW and ESOM-based clustering.

Figure 9. Algorithm description.

© 2015 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license ( http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Saludes-Rodil, S.; Baeyens, E.; Rodríguez-Juan, C.P. Unsupervised Classification of Surface Defects in Wire Rod Production Obtained by Eddy Current Sensors. Sensors 2015, 15, 10100-10117. https://doi.org/10.3390/s150510100

AMA Style

Saludes-Rodil S, Baeyens E, Rodríguez-Juan CP. Unsupervised Classification of Surface Defects in Wire Rod Production Obtained by Eddy Current Sensors. Sensors. 2015; 15(5):10100-10117. https://doi.org/10.3390/s150510100

Chicago/Turabian Style

Saludes-Rodil, Sergio, Enrique Baeyens, and Carlos P. Rodríguez-Juan. 2015. "Unsupervised Classification of Surface Defects in Wire Rod Production Obtained by Eddy Current Sensors" Sensors 15, no. 5: 10100-10117. https://doi.org/10.3390/s150510100

APA Style

Saludes-Rodil, S., Baeyens, E., & Rodríguez-Juan, C. P. (2015). Unsupervised Classification of Surface Defects in Wire Rod Production Obtained by Eddy Current Sensors. Sensors, 15(5), 10100-10117. https://doi.org/10.3390/s150510100

Article Menu

Unsupervised Classification of Surface Defects in Wire Rod Production Obtained by Eddy Current Sensors

Abstract

1. Introduction

2. Description of the Problem

2.1. Eddy Current Inspection in Wire Rod Manufacturing

2.2. Problem Formulation

2.3. Related Work

2.4. Main Contributions

3. Methods

3.1. Modified Fourier Descriptors

3.2. Dynamic Time Warping

3.3. The K-Medoids Algorithm

3.4. The Evolving Self-Organizing Map

4. Experimental Results

4.1. Data Description

4.2. Unsupervised Classification with Modified Fourier Descriptors

4.3. Unsupervised Classification with DTW

4.3.1. Results with DTW and K-Medoids

4.3.2. Results with DTW and ESOM

5. Discussion

6. Conclusions

Acknowledgements

Author Contributions

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI