Next Article in Journal
Studies on Three-Dimensional (3D) Accuracy Optimization and Repeatability of UAV in Complex Pit-Rim Landforms As Assisted by Oblique Imaging and RTK Positioning
Next Article in Special Issue
Ultrasound Defect Localization in Shell Structures with Lamb Waves Using Spare Sensor Array and Orthogonal Matching Pursuit Decomposition
Previous Article in Journal
A Method for Identification and Assessment of Radioxenon Plumes by Absorption in Polycarbonates
Previous Article in Special Issue
Nonlinear Ultrasound Crack Detection with Multi-Frequency Excitation—A Comparison
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Intelligence Method for Recognizing Multiple Defects in Rail

School of Electrical and Electronic Engineering, Shang Hai Institute of Technology, Shanghai 201418, China
*
Author to whom correspondence should be addressed.
Sensors 2021, 21(23), 8108; https://doi.org/10.3390/s21238108
Submission received: 28 October 2021 / Revised: 25 November 2021 / Accepted: 29 November 2021 / Published: 3 December 2021

Abstract

:
Ultrasonic guided waves are sensitive to many different types of defects and have been studied for defect recognition in rail. However, most fault recognition algorithms need to extract features from the time domain, frequency domain, or time-frequency domain based on experience or professional knowledge. This paper proposes a new method for identifying many different types of rail defects. The segment principal components analysis (S-PCA) is developed to extract characteristics from signals collected by sensors located at different positions. Then, the Support Vector Machine (SVM) model is used to identify different defects depending on the features extracted. Combining simulations and experiments of the rails with different kinds of defects are established to verify the effectiveness of the proposed defect identification techniques, such as crack, corrosion, and transverse crack under the shelling. There are nine channels of the excitation-reception to acquire guided wave detection signals. The results show that the defect classification accuracy rates are 96.29% and 96.15% for combining multiple signals, such as the method of single-point excitation and multi-point reception, or the method of multi-point excitation and reception at a single point.

1. Introduction

As an infrastructure, the structural health of the rails has attracted much attention in the fields of engineering and NDT. Because of the influence of the manufacturing process, the operating situation, and the geographic conditions, rails are prone to various defects. Based on the analysis of the operating situation, rolling contact is the main reason for the rail surface crack. In the manufacturing process, the inclusion in the railhead can lead to an area-shaped section within the rail, which will lead to the formation of transverse cracks under the shelling. Moreover, the natural status, such as air pollution, natural rainfall, and temperature change, provides favorable conditions for the generation of corrosion. In the case of in-service rail, the expansion speed of defects will increase as the size of the defect increases, and the expansion speed is different for different types of defects [1,2,3]. Therefore, further research determining the type and size of rail defects is necessary to ensure proper and effective maintenance and replacement. Attracted by multi-mode and low attenuation, ultrasonic guided waves can perform nondestructive testing of multiple types of defects in long-range rails [4,5,6,7,8]. For instance, the vertical vibration mode is sensitive to cracks at the bottom of rail [9]; the SH mode can detect a transverse defect in rail [8]; the flexural mode can measure axial stress to monitor rail breakage [10]; Evans et al. [11] used ultrasonic guided waves to detect defects in rail level crossings. Lee et al. [12] presented a hybrid analytical-FEM technique based on the dispersion characteristics of the guided wave to design the sensor which can excite specific modes and frequencies for identifying transverse cracking under the shelling. Xing et al. [13] constructed a mathematical model composed of a modal vibration factor and a modal orthogonal factor to select the guided wave modal with the highest sensitivity to detect rail cracks. According to the variation regularity of dispersion characteristics along the longitudinal direction, a three-dimensional dispersion surface with cross-section position information was proposed by Chen et al. [14] for damage detection in turnout straight switch rail. Based on the Lamb wave, Deng et al. [15] proposed the method of automatically searching defects to achieve defect location in the plate structure.
Influenced by the mode of vibration and wavelength, ultrasonic guided waves display different sensitivity to different rail defects. These will induce changes in signal features [16,17,18]. Various signal processing techniques are used to extract signal features from the time, frequency, or time-frequency domain in the field of NDT. Power Spectral Density (PSD) is used to retrieve frequency domain information. The short-term Fourier transformation (STFT) can reflect the characteristics of the time-frequency domain. Researchers extracted various features, including maximum, peak power frequency (PPF), median power frequency (MPF), and STFT coefficients from the original signals, and used an SVM classifier to realize 4 kinds of corrosion defects recognition in the rail foot [19]. Jiang et al. [20] used laser ultrasound to detect defects of different depths on the rail surface. The information generated by the wave-packet transformation, including time-frequency coefficients, local entropy, and energy from the collected detection signals, are used for identifying different defects. Zhou et al. [21] proposed that extracting features by data-driven was used for recognizing defects. Furthermore, it is based on empirical analysis that six features from different domains are extracted from the detection signal. According to the experience, Li et al. [22] extracted six features from the time domain and frequency domain of the guided wave detection signal to identify rail cracks. Torkamani et al. [23] introduced an innovative time-domain damage index named Normalized Correlation Moment (NCM) for identifying laminated composites based on the guided wave. The result was shown to have significant advantages over the signal difference coefficient (SDC), including sensitivity to attenuation of the signal and decreased sensitivity to signal noise. Luca et al. [24] calculated damage indexes (DI’s) to quantify the variations of the signal amplitude caused by the induced damage and chose the DI’s for the Probability Ellipse (PE) method to estimate the probability of the presence of the damage. All the above methods can identify defects. However, these all require professional knowledge or experience to extract the damage index reflecting the defect from the guided wave detection signal to identify the defects.
This paper proposes a new method for identifying rail defects to get away from the limitation of professional knowledge and experience. It can distinguish the defects by combining multiple signals and extracting the features by the S-PCA. The process of identifying rail weaknesses is shown in Figure 1. First, guided wave detection signals are obtained through experiments and simulations of several single-excitation and single-reception methods, as shown in the area enclosed by the round dash line. Then, the S-PCA is used to extract features from the obtained signals, as shown in the area enclosed by the round dotted line. Finally, the SVM classification model enables one to qualitatively and quantitatively identify rail defects based on the extracted features. In signal processing, the restriction of professional knowledge is avoided by dividing the guided wave detection signal and extracting features from each segment. Compared with PCA, S-PCA can dig more weak damage information from guided wave detection signals, which is more conducive to defect identification.

2. Materials and Methods

2.1. Signal Dividing

The guided wave detection signal is a typical nonstationary time-series signal. The existent defects may cause modal conversion and dispersion of the guided wave signal, and the most intuitive effect is the change of the wave shape. The diverse defect information may be retained in the local zone of the guided wave detection signal. Therefore, we consider that extracting features from each local area realizes the feature extraction of the detection signal. The dividing is an effective means that the feature extraction of the high-dimensional signal is converted into the feature extraction of P low-dimensional signals.
T c = L V p = K f P = floor ( T T c )
L is a parament in signal segmentation and is determined by both Lmin and N. The special expression is L = N × Lmin, in which Lmin is the smallest size of the defect detected by the guided wave selected. N is a constant that needs to be determined experimentally. The choice of N or L will be discussed in the following section. Tc, as shown in Equation (1), is equal to the time interval between two adjacent dividing points. VP is the guided wave group velocity. Figure 2 is a schematic diagram of dividing signals. The guided wave detection signal of crack defect is divided by Tc (12.5 μs) or L (5 mm).

2.2. Principal Component Analysis

PCA is a common method to reduce data dimensions. Its theme is to use low-dimensional data to reflect the valuable information contained in high-dimensional data based on the mapping. Gottumukka et al. [25] used a modular PCA method to improve face recognition technology. Senneville B et al. [26] applied PCA to the motion estimation of abdominal organs. Mazzeo et al. [27] combined wavelet transformed with PCA for the preprocessing of bolts image.
The role of PCA for rail defect recognition is manifested in two aspects: removing redundant information and extracting features. The specific implementation steps are as follows:
  • Construct a sample set X of the rail damage characteristics. Matrix A is composed of guided wave detection signals of different defect types or the same type of defects with different levels of damage. As shown in Equation (2), m is the number of detection signals, and n is the number of features in each sample; a m represents the mth detected signal sample with n data, and a m 1 represents the first data in the mth sample.
    A = a 1 a 2 a m m × 1 = a 11 a 12 a 21 a 22 a 1 n a 2 n a m 1 a m 2 a mn m × n
    The set of samples X of the rail damage characteristic is obtained after centralized processing A. The centralization is expressed in Equation (3), which A ¯ represents the average value of each column in A.
    X = A A ¯
  • Construct the damage covariance matrix C. The correlation coefficient in guided wave detection signals can be used to characterize different defects. A covariance matrix is an efficient tool for characterizing the correlation coefficient. The covariance matrix C is described by Equation (4).
    C = 1 m 1 X T X
  • Calculate the eigenvalues and eigenvectors of the damage covariance matrix C. According to the matrix decomposition method, the eigenvalues and corresponding eigenvectors of matrix C are solved. Then the eigenvalues are ranked in descending order, λ 1 λ 2 λ n , and the corresponding eigenvectors in sequential order, x 1   x 2   x n . Among them, each eigenvector represents a principal component.
  • Determine the number of principal components K. The information percentage of a principal component is an important reference when the number of principal components is determined. The ratio of one eigenvalue to the sum of all eigenvalues is the information percentage of this principal component, as shown in Equation (5).
    θ i = λ i i = 1 n λ i
    Set a threshold θ (0 < θ < 1) and accumulate the information percentages of the principal components sorted in sequence. When the cumulative sum of the information percentages of the Kth principal component is greater than or equal to θ, there is selected for the K principal components.
    θ i = 1 K λ i i = 1 n λ i
  • Extract features. The K principal components extracted from the covariance matrix C are gathered to form a local weight matrix W. The process of feature extraction is shown as Equation (7), where X’ is a feature set extracted.
    X = X W

2.3. Build S-PCA

Figure 3 is a schematic diagram of the S-PCA. There are two steps based on the S-PCA to extract features. The first step is to divide. According to the time interval Tc, the detection signal S is divided into P segments of equal length. Si represents the ith signal segment (1 ≤ iP). The second step is feature extraction in two sub-steps. The first sub-step is extracting features from each segment by PCA with a threshold θP (0 < θP < 1) to form a feature set named F. The second sub-step is to get the final features from F by PCA with a threshold θ (0 < θ < 1).

2.4. SVM Classification Model

SVM realizes classification by searching for the optimal hyperplane determined by a certain number of support vectors. It is often used for solving linearly separable problems. For nonlinear separable problems, the introduction of the kernel function maps the sample data to a high-dimensional space, making it a linearly separable problem, which is then solved by linear classification. The Gaussian radial basis function (RBF) has excellent characteristics of nonlinearity and continuity, so it is often used as the kernel function of SVM. Equation (8) is the expression of RBF, in which g is the kernel size of the RBF, and X and Xi are feature vectors with the same dimension.
Kernel X , X i = exp X X i 2 2 g 2

2.5. Classification Model Evaluation

Evaluating the performance of the model provides a basis for the credibility of the model classification results. These are called respectively the precision rate and the recall rate, which are usually used for evaluating the model performance. The precision rate reflects the ability of the model to detect positive samples from the sample set, its expression is shown in Equation (9). The recall rate reflects the ability of the model to detect the number of correctly classified positive samples from all positive samples and is expressed as Equation (10). This article also introduces the F1-score to balance the precision rate and the recall rate to make the model evaluation more accurate, as shown in Equation (11). All the equations above, True Positive (TP) is the number of samples whose labels and the predictions are positive samples. False Positive (FP) is the number of samples where the object labels are negative and the predicted labels inverted. True Negative (TN) is the number of samples whose object labels are positive and the predicted labels inverse. False Negative (FN) is the number of samples whose labels of the object and predicted are negative samples.
P = TP TP + FP
R = TP TP + FN
F 1 = 2 × P × R P + R

3. Experimental Setup

There are 7 different degrees of crack, 5 different degrees of transverse crack under the shelling, and 4 different degrees of corrosion within the railhead. The detect signals of the above defects are acquired by combining the experiments and the simulations.

3.1. Experimental Setup

The experimental system is shown in Figure 4. The experimental object is a 60 kg/m rail with a length of 250 mm. Ceramic piezoelectric sheets (PZT-5H, diameter 14 mm, thickness 1 mm, center frequency 200 kHz) are arranged symmetrically on the railhead and marked in sequence as 1, 2, 3, A, B, C. The distance between the center of the piezoelectric films and the rail end is 7 mm. The vertical distance between the rail tread and the center of the piezoelectric films 1, 3, A, or C is 23.57 mm. The piezoelectric plates at positions 1, 2, and 3 are used to excite guided wave signals, and the piezoelectric plates at positions A, B, and C are used to receive detection signals. Any exciting sensors are paired with any receiving sensors to form a signal acquisition channel. There are 9 channels for acquiring signals, like 1TA, 1TB, 1TC, 2TA, 2TB, 2TC, 3TA, 3TB, 3TC. Figure 5 is a connection diagram of the detection equipment. The arbitrary function generator (TektronixAFG3021B) can modulate a sinusoid exciting signal of 200 kHz and 5 circles loaded alternately on piezoelectric slices 1, 2, or 3. The guided wave signal will propagate in the rail. The detection signals are collected using piezoelectric slices A, B, or C. Both the excitation signal and the reception signal are input into the oscilloscope (Tektronix DPO4054) with a specific sampling frequency.
As shown in Figure 4, the location of the crack is indicated by the red square. The crack is artificially cut in the fillet radius on one side of the railhead, located 125.5 mm from the rail end. The crack depth gradually increases at an interval of 1 mm to reflect the varying degrees of crack damage. In this study, 8 kinds of crack defects with different damage levels, 0 mm, 1 mm, 2 mm, 3 mm, 4 mm, 5 mm, 6 mm, 7 mm, are considered, and the corresponding sample labels are 1, 2, 3, 4, 5, 6, 7, 8, respectively.
In the experiment, uncertain physical factors are inevitable, such as the tightness of coupling between the piezoelectric sheet and the surface of the rail. Hence the experiments are repeated 10 times under each excitation-reception mode for each degree of crack to reduce the effect of tightness. The detection signal is picked up at a sampling frequency of 100 MHz, and 10 samples are taken accordingly for each crack degree. 80 samples may be obtained in each signal acquisition mode, and a total of 720 may be obtained from 9 signal acquisition channels. Table 1 shows the types of cracks and the number of samples taken through the experiments of 1TA.

3.2. Numerical Simulation

The finite element simulation software ABAQUS is used to build a model consistent with the experimental object. Table 2 shows the material parameters of the rail model. ρ represents density, E represents the Elastic Modules, and υ represents Poisson’s ratio. The rail model is divided by grid cells C3D8, and the grid size is 1.5 mm. The arrangement of the piezoelectric sheets on the railhead is consistent with the experiment, as shown in Figure 6. The total time to analyze the model is 2 ms, and the time step is 0.01 μs.
Figure 7a is the model of rail defects made by ABAQUS, such as crack, corrosion. The position of the black box corresponds to the location of the failure. The method of fine-tuning the defect parameters is adopted to make the numerical simulation and the experimental effect more consistent. Moreover, more detect signals are collected. For crack, the depth of the defect is continuously adjusted with a deviation of 0.001 mm. Taking a 1 mm sample as an example, the crack depth of 0.999 mm, 0.998 mm, 1.001 mm, and 1.002 mm, etc., and these defects are classed as 1 mm cracks. According to the above method, 10 samples are expanded under each excitation-reception mode, there are a total of 80 samples expanded for 8 damage categories. In practice, the depth of crack defect is always irregular. The author uses rounding to make the mark of defects in integers. For instance, the depth of crack less than 1.5 mm carries a 1 mm tag, and the depth of crack which equals 1.5 mm or more carries a 2 mm tag. If the difference between the marked value and the defect depth is within 0.2 mm, the defect depth is indicated with the marked value. For example, the crack depth of 1.2 mm, 1.1 mm, 0.9 mm, and 0.8 mm are all classified as 1 mm cracks. The type of sample expansion is only in the crack depth of 1 mm and 7 mm in this study, and the total number of expanded samples is 58, as shown in Table 3. With respect to corrosion, four defect models of different damage levels, 6 mm2, 12 mm2, 25 mm2 and 30 mm2, are constructed by ABAQUS. And the corresponding samples are marked as 9, 10, 11, and 12, respectively. For each type of corrosion defect, the corrosion size is modified by 0.004 mm2 to produce more samples. 80 samples are obtained per the excitation-reception method. The transverse crack under shelling is also an area defect. One rectangle is used to approximate the shape of the internal transverse crack. Figure 7b is a rail model with a 6 mm × 7 mm transverse crack under the shelling defect constructed by ABAQUS. 6 mm is the defect’s width, and 7 mm is the length of the defect. The defect is located at a rail length of 125.5 mm and a depth of 12.5 mm from the rail tread. Five transverse cracks under shelling models with different damage levels, 2 mm × 3 mm, 3 mm × 4 mm, 4 mm × 5 mm, 5 mm × 6 mm, 6 mm × 7 mm, are constructed through numerical simulation. Moreover, sample labels corresponding to defect sizes are marked as 13, 14, 15, 16, 17. Under each excitation-reception channel, 21 samples are collected under each type of transverse crack under shelling by adjusting the length or width with the deviation of 0.001 mm.
Table 4 shows that the total number of samples obtained by experiments and simulations under the channel of 1TA, and the sample labels corresponding to defects. Subsequently, all samples obtained by 1TA are made into a dataset including samples and the sample labels. Moreover, the dataset is divided into two parts, which randomly take 80% of the dataset as the training set, the remaining 20% as the testing set. The classify model is trained and tested by training set and testing set, respectively. The data obtained from the remaining 8 signal acquisition methods are processed by the same method with 1TA for subsequent rail defect identification.

4. Feature Extraction

4.1. Signal Analysis

Figure 8 shows the waveform diagram of the crack detection signal obtained at the time range of 0–0.002 s by 2TA in the experiment, and the cracks’ depth is 2 mm, 4 mm, 6 mm, 7 mm, respectively. Based on the guided wave theory, the guided wave group velocity is about 3845 m/s in the experiment. And the speed in the simulation is 3850 m/s, approximately like the result of the experiment. In addition, the experimental signal waveform is like the simulated signal waveform, under different degrees of defect damage. Thus, the simulated signals are used for the expansion of defect samples.
Figure 8, Figure 9 and Figure 10 show the signal waveforms of crack defects, transverse cracks under the shelling, and corrosion defects at different damage levels, respectively. We find that the waveforms of detection signals caused by different damage degrees of defects at the same position are generally similar. According to the elastic wave theory, the corresponding wave packet will cause amplitude changes and overlap due to some factors such as defects. However, the existing method is difficult to distinguish the specific defects by the time domain waveform. In this paper, the amplitude corresponding to each time point on the detection signal is regarded as a feature value in each sample. The statistical analysis and machine learning algorithms are used to find the features between the sample data of defects and correspondence defect types and eliminate redundant features.

4.2. Feature Extraction

Feature extraction is a significant part of structural health monitoring (SHM). Redundant features are removed from the detection signal through feature extraction, and several important features are retained to achieve accurate and rapid identification of defects.
Tc is an important parameter to achieve feature extraction by S-PCA. And the selection of Tc depends on parament L and the group velocity VP of the guided wave signal. In this study, the guided wave group velocity VP is 3850 m/s. The L participating in the discussion is shown in Table 5. KP is the number of data points in each segment. The influence of different Tc on defect identification is stated in Section 6.
PCA is a means of globally reducing data dimensions. Selecting an adequate number of principal components is critical when the PCA algorithm filters the feature. In the process of extracting features from detection signals with S-PCA, θP is a threshold to extract principal components from sub-signal segments using PCA. θ is a threshold for extracting the principal components from feature set F. The influences of these two parameters on rail defect identification are outlined in Section 6, respectively.

5. Model Parameter Adjustment

The number of support vectors in the SVM classification model is an essential factor influencing the model performance [28]. The number of support vectors for nonlinear SVM is modified by adjusting the penalty coefficient C and the kernel radius g. There are always optimal C and g values for different datasets to achieve accurate classification of the samples. It is considered that the method of grid search automatically finds suitable C and g within the range of [−3, 20], and [−25, 0], for nine datasets.

6. Experiment Results and Discuss

6.1. Influence of Tc on the Defect Recognition

The guided wave detection signals are divided into P sub-signal segments with a time interval Tc. To select a suitable Tc, we assume that both θP and θ are equal to 99%. When Tc is equal to 0.5 μs, 2.5 μs, 5 μs, 7.5 μs, 10 μs, and 12.5 μs, the accuracy of SVM varies in shown in Table 6.
In Table 6, the red number represents the maximum classification accuracy attained by the classifier based on the selected parameters in each dataset. When Tc is 5 μs, seven datasets, such as 1TA, 1TB, 1TC, 2TA, 2TB, 3TA, and 3TB, reach the maximum classification accuracy rate. When Tc is equal to 7.5 μs, both 2TC and 3TC reach the maximum classification accuracy rate. But when Tc is 5 μs, the classification results of 2TC and 3TC are not much different from the optimal result maintaining a relatively high classification level. According to the content above, most of the nine datasets may achieve the best classification effect when Tc is equal to 5 μs. So 5 μs is selected as the optimal parament of Tc for subsequent analysis.

6.2. Influence of θP on The Rail Defects Identification

θP controls the number of principal components extracted from each sub-signal. Different θP can extract different numbers of principal components from the sub-signal segments and affects the classification results of the classification model. It is the initial step that Tc is equal to 5 μs and θ is equal to 99%. Table 7 shows the variation of the classification results in the nine datasets with θP selected in turn from 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, and 99%.
As shown in Table 7, when θP equals 50%, any one of 1TB, 1TC, 2TA, 2TB, 2TC, 3TA, and 3TC reaches the maximum classification accuracy. For 1TB, when θP is equal to 60%, the classification accuracy is the same as the optimal classification result. However, when θP equals 50%, the number of the features used to classify defects is 5. When θP equals 60%, the number is 7. Thus 50% is taken at 1TB to reduce the loss of computation. When θP equals 60%, 1TA and 3TB reach the best classification outcomes at 91.95% and 94.25%, respectively. For 1TC and 2TC, when θP equals 10%, they attain the maximum classification accuracy. Moreover, when θP is equal to 20%, 30%, 40%, and 50%, in turn, the classification accuracy keeps the same classification result equaled to the optimal result. However, when θP equals 10%, 20%, 30%, and 40%, in turn, the 13th category cannot be identified in 3TB, as shown in Table 8. The reason is that the value of θP is too low to extract enough principal components. When θP equals 50%, all categories can be recognized, and when θP equals 60%, the classification accuracy rate is 94.25% that is the best-classified outcome for 3TB. According to the analysis above, most of the nine datasets can maintain a relatively high classification accuracy when θP equals 50%. Thus 50% is selected as the optimal parament of θP for subsequent research.

6.3. Influence of θ on the Rail Defect Identification

θ is an important parameter to extract features from F which are final features. Table 9 shows that when Tc equals 5 μs and θP equals 50%, the classification results of the nine datasets vary with θ changed from 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, and 99%. According to Table 9, the classification accuracy rates of the nine datasets show an increasing trend as the θ value increases. Therefore, 99% is chosen as a fixed parameter of θ for follow-up research.
In summary, 5 μs, 50%, and 99% are selected, respectively, as the optimal parameters, which are the fixed values of Tc, θP, θ.

6.4. Classification Result of Single-Channel Acquisition Signal

The signals collected by different signal acquisition channels are turned into corresponding datasets. Once each dataset is normalized, the S-PCA model with selected paraments is utilized to extract features. The dataset is split into the training set and the testing set. The outcome of the classification is the average of the 10 results of the classification. The variance S represents the standard deviation of 10 classification results, which is used for evaluating the stability of the classification. And the results of nine datasets are presented in Table 10.
Table 10 shows that among the 9 signal acquisition methods, the accuracy rates of the SVM classification model exceed 76% that is a medium score. In 1TA, 1TB, 2TA, 2TB, 2TC, and 3TB, the SVM classification accuracy rates reach nearly or more than 90%, which is a good score. Both 2TA and 2TB have excellent classification because the sensor at position 2 can be smoothly attached to the track surface, making the defect information in the collected detection signal more obvious. For 3TB, the distance between the sensor and the defect is relatively close, which makes the difference of the defect signals more prominent, which is conducive to the identification of the defect. Both 1TA and 1TB have higher classification accuracy because the signal reflected by the defect can be received by the A and B sensors, which is conducive to the recognition defect. In addition, the S-PCA is an effective method of extracting features and can extract the local differences in the detection signal, which is useful for the classifier to make the correct distinction. Similarly, the distance between the exciting sensor and the receiving sensor is relatively long, so the classification results of 1TC and 3TA are 76.15% and 76.53%, respectively, lower than the signals collected by other signal acquisition methods.

6.5. Classification Result of Multi-Channel Signal Combination

The above research shows that a single channel collecting signal can identify different types and degrees of defects damage on the railhead. Based on theoretical analysis, a single-channel acquisition signal can only reflect part of the characteristics of the defect. Thus, 1TC and 3TA classification accuracy rates are lower. The two types of combination methods are comprehensively analyzed to explore the influence of multi-channel signals on rail defect recognition, such as multi-point excitation and single-point reception or single-point excitation and multi-point reception. The types of combinations and the corresponding classification results are indicated in Table 11 and Table 12.
Table 11 shows the results of the classification of the combined methods of single-point excitation and multi-points reception signals. With the number of combined signals increasing, it is found that the classification accuracy rate shows an increasing trend, such as 1TAC, 1TBC, and 1TABC; 2TAB, 2TAC, 2TBC, and 2TABC; 3TAB, 3TAC, 3TBC, and 3TABC. Moreover, compared to the classification of signals collected by one single channel, the standard deviation S of the classification is slightly reduced after multiple signal combinations. The above results show that the accuracy of the classification can be effectively enhanced by the combination method. Since the propagation path of each signal is different, the effective combination can more widely reflect the health status of the detected object. The same results can also be found in Table 12.

7. Conclusions

Aiming to identify multiple defects in rail, experiments and numerical simulations are used to focus on the ultrasonic guided wave detection signals of crack defect, transverse crack under shelling, and corrosion defect. By modifying and combining the sensor positions, a sample library of defect detection signals is obtained by 9 different signal excitation-reception methods. The S-PCA algorithm is proposed to extract the features of the signal to eliminate the dependency on professional knowledge. Furthermore, the extracting features are input into the SVM classifier to identify the type and extent of defects. At the end of the research, the following conclusions are drawn:
  • The method of extracting features from the segments of detection signal by PCA can effectively eliminate redundant information in the signal and retain adequate information, which improves the accuracy of quantitative and qualitative identification of rail defects.
  • The detection signals collected from different excitation-reception positions describe the overall health of the different parts of the rail. Obtaining the combined detection signal through single-point excitation and multi-point reception or multi-point excitation and single-point reception can more comprehensively describe the health status of the detection object, which is good to improve the accuracy of defect recognition.
  • The S-PCA algorithm is an efficient method of extracting features based on statistical theory. It does not rely too much on the professional knowledge of guided wave detection, which reduces the difficulty of rail defect identification. Furthermore, the method could be more easily implemented in practical engineering in the future.

Author Contributions

Conceptualization, F.D. and S.-Q.L.; methodology, F.D. and S.-Q.L.; software, S.-Q.L.; validation, X.-R.Z., L.Z. and J.-B.H.; formal analysis, F.D., S.-Q.L., X.-R.Z., L.Z. and J.-B.H.; investigation, S.-Q.L.; resources, S.-Q.L. and C.Z.; data curation, S.-Q.L.; writing—original draft preparation, S.-Q.L.; writing—review and editing, X.-R.Z., L.Z. and J.-B.H.; visualization, L.Z.; supervision, J.-B.H.; project administration, F.D.; funding acquisition, F.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (11202137), Shanghai Alliance Program (2019025), Collaborative innovation fund of the Shanghai Institute of Technology (XTCX2018-11).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to express profound gratitude to Beijing University of Technology, Department of Mechanical and Electrical Engineering.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Tian, G.Y.; Gao, B.; Gao, Y.L.; Wang, P.; Wang, H.T.; Shi, Y.S. Summary of inspection and monitoring technology for railway rail defects and damage. Chin. J. Sci. Instrum. 2016, 37, 1763–1780. [Google Scholar]
  2. Hernandez, F.C.; Plascencia, G.; Koch, K. Rail base corrosion problem for North American transit systems. Eng. Fail. Anal. 2009, 16, 281–294. [Google Scholar] [CrossRef]
  3. Zerbst, U.; Lundén, R.; Edel, K.O.; Smith, R.A. Introduction to the damage tolerance behaviour of railway rails—A review. Eng. Fract. Mech. 2009, 76, 2563–2601. [Google Scholar] [CrossRef] [Green Version]
  4. Karahaliou, A. Evaluation of Railway Rails with Non-Destructive Techniques. Key Eng. Mater. 2014, 605, 641–644. [Google Scholar] [CrossRef]
  5. Zhang, H.; Song, Y.N.; Wang, Y.N.; Liang, Z.C.; Zhao, M. Summary of non-destructive testing and evaluation technology for rail defects. Chin. J. Sci. Instrum. 2019, 40, 11–25. [Google Scholar]
  6. He, C.F.; Zheng, M.F.; Lv, Y.; Deng, P.; Zhao, H.M.; Liu, X.C.; Song, G.R.; Liu, Z.H.; Jiao, J.P.; Wu, B. Development, Application and Challenge of ultrasonic guided wave detection technology. Chin. J. Sci. Instrum. 2016, 37, 1713–1735. [Google Scholar]
  7. He, C.F.; Wu, B.; Fan, J.W. Research progress and application of ultrasonic cylindrical guided wave technology. Adv. Mech. 2001, 2, 203–214. [Google Scholar]
  8. Rose, J.L.; Avioli, M.J.; Mudge, P.; Sanderson, R. Guided wave inspection potential of defects in rail. NDT E Int. 2003, 37, 153–161. [Google Scholar] [CrossRef]
  9. Lu, C.; Sheng, H.J.; Song, K.; Lin, J.M.; He, F.C. Scattering characteristics of ultrasonic guided waves of oblique cracks at the bottom of rails. Non-Destr. Test. 2016, 38, 18–22, 25. [Google Scholar]
  10. Loveday, P.W. Guided Wave Inspection and Monitoring of Railway Track. J. Nondestruct. Eval. 2012, 31, 303–309. [Google Scholar] [CrossRef]
  11. Evans, M.; Lucas, A.; Ingram, I. The inspection of level crossing rails using guided waves. Constr. Build. Mater. 2018, 179, 614–618. [Google Scholar] [CrossRef]
  12. Lee, C.; Joseph, L.; Younho, C. A guided wave approach to defect detection under shelling in rail. NDT E Int. 2009, 42, 174–180. [Google Scholar] [CrossRef]
  13. Xing, B.; Yu, Z.; Xu, X.; Zhu, L.; Shi, H. Mode Selection Model for Rail Crack Detection Based on Ultrasonic Guided Waves. Shock Vib. 2020, 2020, 19. [Google Scholar] [CrossRef]
  14. Chen, R.; Hu, C.; Xu, J.; Gong, Z.; Liu, L.; Wang, P.; Chen, X. Research on guided wave propagation characteristics in turnout rails with variable cross-section. J. Sound Vib. 2021, 494, 115853. [Google Scholar] [CrossRef]
  15. Deng, F.; Chen, H.L. A Defects localization Algorithm Based on the Lamb Wave of Plate Structure. J. Nondestruct. Eval. Diagn. Progn. Eng. Syst. 2021, 4, 021001. [Google Scholar] [CrossRef]
  16. Liu, X.; Liu, Z.Q. Research on the inverse problem of ultrasonic nondestructive testing. Non-Destr. Test. 2001, 10, 441–443. [Google Scholar]
  17. Wu, B.; Deng, F.; He, C.F. The research progress of signal processing in ultrasonic guided wave nondestructive testing. J. Beijing Univ. Technol. 2007, 4, 342–348. [Google Scholar]
  18. Zhou, J.M.; Xu, Q.Y.; Li, P.; Wan, Q.; Liao, X.S. Ultrasonic guided wave technology in nondestructive testing of rails. Instrum. Technol. Sens. 2015, 6, 99–102, 106. [Google Scholar]
  19. Moustakidis, S.; Kappatos, V.; Karlsson, P.; Selcuk, C.; Gan, T.H.; Hrissagis, K. An Intelligent Methodology for Railways Monitoring Using Ultrasonic Guided Waves. J. Nondestruct. Eval. 2014, 33, 694–710. [Google Scholar] [CrossRef]
  20. Jiang, Y.; Wang, H.; Tian, G.Y.; Yi, Q.J.; Zhao, J.Y.; Zhen, K. Fast classification for rail defect depths using a hybrid intelligent method. Optik 2019, 180, 455–468. [Google Scholar] [CrossRef]
  21. Zhou, C.; Deng, F.; Liu, Y.; Liu, X.C.; Cheng, H.L. Research on Identification of Corrosion Damage Degree of Guided Wave Bend Pipe Based on Neural Network and Support Vector Machine. J. Mech. Eng. 2021, 57, 136–144. [Google Scholar]
  22. Li, S.Q.; Deng, F.; Zhou, C.; Zhang, X.R. Identification of rail crack defects based on support vector machine and artificial neural network. In Proceedings of the 15th Symposium on Piezoelectricity, Acoustic Waves and Device Applications, SPAWDA 2020, Zhengzhou, China, 16–19 April 2021. [Google Scholar]
  23. Torkamani, S.; Roy, S.; Barkey, M.; Sazonov, E.; Burkett, S.; Kotru, S. A novel damage index for damage identification using guided waves with application in laminated composites. Smart Mater. Struct. 2014, 23, 095015. [Google Scholar] [CrossRef]
  24. Luca, D.; Fenza, A.; Petrone, G.; Caputo, F. Guided wave SHM system for damage detection in complex composite. Struct. Theor. Appl. Fract. Mech. 2020, 105, 102408. [Google Scholar] [CrossRef]
  25. Gottumukkal, R.; Asari, V.K. An improved face recognition technique based on modular PCA approach. Pattern Recognit. Lett. 2004, 25, 429–436. [Google Scholar] [CrossRef]
  26. Senneville, D.B.; El Hamidi, A.; Moonen, C. A direct PCA-based approach for real-time description of physiological organ deformations. IEEE Trans. Med. Imaging 2015, 34, 974–982. [Google Scholar] [CrossRef]
  27. Mazzeo, P.L.; Nitti, M.; Stella, E.; Distante, A. Visual recognition of fastening bolts for railroad maintenance. Pattern Recognit. Lett. 2004, 25, 669–677. [Google Scholar] [CrossRef]
  28. Wang, Z.Y.; Zhu, D.H. An accurate detection method for surface defects of complex components based on support vector machine and spreading algorithm. Measurement 2019, 147, 106886. [Google Scholar] [CrossRef]
Figure 1. Flow chart of the segmented principal components analysis (S-PCA) for rail defect identification. The enclosed area with the dashed line represents the process of acquiring data; the enclosed area with the dotted line represents the process of extracting features by S-PCA, and the rest area represents the process of classification. Si is the ith sub-signal of the origin signal. And W is the matrix of correlation coefficient obtained via PCA with the threshold θP.
Figure 1. Flow chart of the segmented principal components analysis (S-PCA) for rail defect identification. The enclosed area with the dashed line represents the process of acquiring data; the enclosed area with the dotted line represents the process of extracting features by S-PCA, and the rest area represents the process of classification. Si is the ith sub-signal of the origin signal. And W is the matrix of correlation coefficient obtained via PCA with the threshold θP.
Sensors 21 08108 g001
Figure 2. Diagram of the signal divided by Tc (12.5 μs) or L (5 mm), in which the black boxes represent the dividing points. Tc is the time interval between two adjacent dividing points. The vertical axis is the signal amplitude. The horizontal axis is the sampling time of the signal.
Figure 2. Diagram of the signal divided by Tc (12.5 μs) or L (5 mm), in which the black boxes represent the dividing points. Tc is the time interval between two adjacent dividing points. The vertical axis is the signal amplitude. The horizontal axis is the sampling time of the signal.
Sensors 21 08108 g002
Figure 3. Diagram of segmented principal component analysis of ultrasonic guided waves. S is the original signal; F is a feature set that combines features from each segment. Si represents the ith subsection; Tc is the time interval used for dividing S; θP is a threshold used for extracting features from each segment; θ is a threshold used for extracting features from F. The grey chip represents a feature.
Figure 3. Diagram of segmented principal component analysis of ultrasonic guided waves. S is the original signal; F is a feature set that combines features from each segment. Si represents the ith subsection; Tc is the time interval used for dividing S; θP is a threshold used for extracting features from each segment; θ is a threshold used for extracting features from F. The grey chip represents a feature.
Sensors 21 08108 g003
Figure 4. The experimental system for using ultrasonic guided waves detection, in which the position of the red box represents the location of the crack. The PZTs are located at the position of 1, 2, 3, A, B, C, respectively. The distance between the center of PZT and the rail end is 7 mm.
Figure 4. The experimental system for using ultrasonic guided waves detection, in which the position of the red box represents the location of the crack. The PZTs are located at the position of 1, 2, 3, A, B, C, respectively. The distance between the center of PZT and the rail end is 7 mm.
Sensors 21 08108 g004
Figure 5. The connection diagram of the detection equipment, in which the Tektronix AFG3021B is the arbitrary function generator and the oscilloscope is used to sample the signals. The PZTs are coupled via ultrasonic gel with the rail surface.
Figure 5. The connection diagram of the detection equipment, in which the Tektronix AFG3021B is the arbitrary function generator and the oscilloscope is used to sample the signals. The PZTs are coupled via ultrasonic gel with the rail surface.
Sensors 21 08108 g005
Figure 6. Sensor layouts in the rail simulation model, in which the black dots represent the PZTs.
Figure 6. Sensor layouts in the rail simulation model, in which the black dots represent the PZTs.
Sensors 21 08108 g006
Figure 7. Rail defect model diagram. (a) describes the position of the crack and corrosion defect model where the black sequence is the position of the defect, such as corrosion and crack; (b) describes the position of the transverse crack under the shelling model.
Figure 7. Rail defect model diagram. (a) describes the position of the crack and corrosion defect model where the black sequence is the position of the defect, such as corrosion and crack; (b) describes the position of the transverse crack under the shelling model.
Sensors 21 08108 g007
Figure 8. Comparison of experimental signals waveforms of four types of different crack defects with different degrees of damage.
Figure 8. Comparison of experimental signals waveforms of four types of different crack defects with different degrees of damage.
Sensors 21 08108 g008
Figure 9. Comparison of the detection signal waveforms of four different corrosion defects with different degrees of damage.
Figure 9. Comparison of the detection signal waveforms of four different corrosion defects with different degrees of damage.
Sensors 21 08108 g009
Figure 10. Comparison of five transverse cracks under shelling detection signal waveforms with different levels of damage.
Figure 10. Comparison of five transverse cracks under shelling detection signal waveforms with different levels of damage.
Sensors 21 08108 g010
Table 1. The types of cracks and the quantities of samples taken from the experiments of exciting at position 1 and receiving the signal at position A.
Table 1. The types of cracks and the quantities of samples taken from the experiments of exciting at position 1 and receiving the signal at position A.
Damage DegreeNumber of SamplesDamage DegreeNumber of Samples
intact104 mm10
1 mm105 mm10
2 mm106 mm10
3 mm107 mm10
Table 2. Configuring the rail model built by ABAQUS.
Table 2. Configuring the rail model built by ABAQUS.
ModelLengthρEυ
60 kg/m250 mm7.85 × 10−92 × 1050.29
Table 3. The type of crack and the quantity of samples collected in a simulation of exciting at position 1 and receiving the signal at position A.
Table 3. The type of crack and the quantity of samples collected in a simulation of exciting at position 1 and receiving the signal at position A.
Defection TypeDefection Category (mm)Total Number of SamplesActual Damage (mm)Number of Samples
intactintact10010
crack1301.010
1.110
1.210
2301.810
1.910
2.010
3103.010
4104.010
510510
610610
7286.96
710
7.16
7.26
Table 4. Sample type and the total number of samples collected by exciting at the position of 1 and receiving the signal at position A.
Table 4. Sample type and the total number of samples collected by exciting at the position of 1 and receiving the signal at position A.
Defection TypeSample LabelDefection
Category
Number of SamplesTotal Number of Samples
intact1-20403
Crack defect21 mm40
32 mm40
43 mm20
54 mm20
65 mm20
76 mm20
87 mm38
Corrosion
defect
96 mm220
1012 mm220
1125 mm220
1230 mm220
Internal nuclear defect132 mm × 3 mm21
143 mm × 4 mm21
154 mm × 5 mm21
165 mm × 6 mm21
176 mm × 7 mm21
Table 5. Selection of the parameter L and time interval Tc.
Table 5. Selection of the parameter L and time interval Tc.
NL(mm)Tc (μs)KpP
20.20.550400
101.02.525080
202.05.050040
303.07.575027
404.010.0100020
505.012.5125016
Table 6. The influence of Tc on the accuracy of defect classifications, in which the red number is the best-classified result in this dataset.
Table 6. The influence of Tc on the accuracy of defect classifications, in which the red number is the best-classified result in this dataset.
0.5 μs2.5 μs5.0 μs7.5 μs10.0 μs12.5 μs
1TA86.2186.2187.3686.2186.2186.21
1TB94.2595.4095.4090.8091.9590.80
1TC77.0177.0179.3178.1679.3178.16
2TA94.2595.4095.4095.4095.4093.10
2TB90.8090.8093.1091.9589.6688.51
2TC88.5188.5188.5189.6689.6688.51
3TA72.4174.7175.8673.5672.4172.41
3TB89.6689.6690.8089.6689.6688.51
3TC90.8191.9591.9593.1090.8191.95
Table 7. The effect of θP on classification accuracy, in which the red number is the best classification result in this dataset.
Table 7. The effect of θP on classification accuracy, in which the red number is the best classification result in this dataset.
10%20%30%40%50%60%70%80%90%99%
1TA89.6689.6689.6689.6690.8091.9590.8089.6689.6688.51
1TB88.5188.5188.5188.5194.2594.2589.6690.8088.5187.36
1TC79.3179.3179.3179.3179.3177.0170.1178.1675.8678.16
2TA95.4095.4095.4095.4096.5593.1094.2593.1094.2590.80
2TB94.2594.2594.2594.2596.5596.5595.4095.4095.4094.25
2TC90.8090.8090.8090.8090.8088.5188.5188.5188.5183.91
3TA72.4172.4172.4172.4173.5668.9768.9767.8268.9767.82
3TB81.6181.6181.6181.6189.6694.2591.9587.3683.9187.36
3TC86.2186.2186.2186.2190.8089.6687.3688.5189.6686.21
Table 8. 3TB’s confusion matrix when θP equals 10%, Tc equals 5 μs, and θ equals 99%.
Table 8. 3TB’s confusion matrix when θP equals 10%, Tc equals 5 μs, and θ equals 99%.
1234567891011121314151617
precision1.000.750.751.001.001.000.750.900.751.000.751.0001.000.800.601.00
recall1.000.670.751.001.001.001.000.900.750.801.001.0001.000.800.500.71
F1-score1.000.710.751.001.001.000.860.900.750.890.861.00-1.000.800.550.83
Table 9. The influence of θ on classification accuracy in which the red circle is the best classification result in this dataset.
Table 9. The influence of θ on classification accuracy in which the red circle is the best classification result in this dataset.
10%20%30%40%50%60%70%80%90%99%
1TA49.4349.4349.4349.4374.7174.7174.7177.0180.4690.80
1TB42.5342.5342.5363.2263.2263.2273.5673.5685.0694.25
1TC50.5750.5750.5766.6766.6766.6766.6766.6771.2678.16
2TA54.0254.0254.0275.8675.8675.8683.9189.6689.6696.55
2TB58.6258.6258.6258.6282.7682.7690.8090.8093.1095.40
2TC48.2848.2848.2848.2873.5673.5673.5683.9189.6688.51
3TA42.5342.5342.5342.5355.1755.1755.1759.7764.3773.56
3TB58.6258.6258.6258.6281.6181.6181.6185.0687.3694.25
3TC50.5750.5750.5770.1170.1174.7174.7180.4678.1690.80
Table 10. The SVM’s classification results of single-channel acquisition signal.
Table 10. The SVM’s classification results of single-channel acquisition signal.
AccuracySPrecisionRecallF1-Score
1TA90.99%0.0391.20%90.13%89.83%
1TB90.22%0.0291.95%90.07%90.19%
1TC76.15%0.0482.01%73.76%74.43%
2TA93.63%0.0294.12%93.54%93.32%
2TB92.75%0.0294.68%92.36%92.05%
2TC90.11%0.0390.79%89.26%89.06%
3TA76.53%0.0383.10%79.71%76.89%
3TB90.77%0.0291.80%90.26%90.29%
3TC88.02%0.0389.90%87.88%87.43%
Table 11. The SVM’s classification results of feature combinations method of single-point excitation and multi-point reception.
Table 11. The SVM’s classification results of feature combinations method of single-point excitation and multi-point reception.
Combination TypeAccuracySPrecisionRecallF1-Score
1TAB96.15%0.0296.29%95.59%95.51%
1TAC93.08%0.0193.67%92.37%92.08%
1TBC91.21%0.0292.56%90.29590.50%
1TABC94.73%0.0295.32%94.00%94.20%
2TAB93.74%0.0295.42%93.51%93.20%
2TAC94.07%0.0194.99%93.66%93.58%
2TBC94.95%0.0296.34%95.10%94.87%
2TABC96.15%0.0196.63%96.13%95.93%
3TAB89.01%0.0390.25%88.36%88.51%
3TAC88.79%0.0290.57%88.49%87.77%
3TBC88.90%0.0290.62%88.38%88.34%
3TABC90.55%0.0391.64%90.31%89.99%
Table 12. The SVM’s classification results of the combination method of multi-point excitation and single-point reception.
Table 12. The SVM’s classification results of the combination method of multi-point excitation and single-point reception.
Combination TypeAccuracySPrecisionRecallF1-Score
1TA + 2TA96.29%0.0296.34%96.06%95.70%
1TA + 3TA89.78%0.0190.39%87.82%87.58%
2TA + 3TA92.75%0.0293.23%92.72%92.43%
1TA + 2TA + 3TA95.49%0.0296.12%95.20%95.03%
1TB + 2TB95.38%0.0196.32%95.35%94.94%
1TB + 3TB93.08%0.0294.54%92.35%92.59%
2TB + 3TB94.03%0.0295.31%94.71%94.10%
1TB + 2TB + 3TB93.41%0.0295.23%93.79%93.45%
1TC + 2TC92.09%0.0293.02%92.05%92.00%
1TC + 3TC90.22%0.0391.97%89.74%89.06%
2TC + 3TC91.43%0.0393.35%90.98%90.83%
1TC + 2TC + 3TC92.09%0.0393.44%92.22%91.95%
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Deng, F.; Li, S.-Q.; Zhang, X.-R.; Zhao, L.; Huang, J.-B.; Zhou, C. An Intelligence Method for Recognizing Multiple Defects in Rail. Sensors 2021, 21, 8108. https://doi.org/10.3390/s21238108

AMA Style

Deng F, Li S-Q, Zhang X-R, Zhao L, Huang J-B, Zhou C. An Intelligence Method for Recognizing Multiple Defects in Rail. Sensors. 2021; 21(23):8108. https://doi.org/10.3390/s21238108

Chicago/Turabian Style

Deng, Fei, Shu-Qing Li, Xi-Ran Zhang, Lin Zhao, Ji-Bing Huang, and Cheng Zhou. 2021. "An Intelligence Method for Recognizing Multiple Defects in Rail" Sensors 21, no. 23: 8108. https://doi.org/10.3390/s21238108

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop