Next Article in Journal
Dental Treatment of White Spots and a Description of the Technique and Digital Quantification of the Loss of Enamel Volume
Previous Article in Journal
Biochemical Manifestations of Gastroesophageal Reflux Disease Progression in Children: A Single Center Case-Control Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Evaluation of Time and Frequency Condition Indicators from Vibration Signals for Crack Detection in Railway Axles

by
Réne-Vinicio Sánchez
1,*,†,
Pablo Lucero
1,†,
Jean-Carlo Macancela
1,†,
Higinio Rubio Alonso
2,†,
Mariela Cerrada
1,†,
Diego Cabrera
1,† and
Cristina Castejón
2,†
1
GIDTEC Research Group, Universidad Politécnica Salesiana, Cuenca 010105, Ecuador
2
MAQLAB Research Group, Department of Mechanical Engineering, Universidad Carlos III de Madrid, 28911 Madrid, Spain
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Appl. Sci. 2020, 10(12), 4367; https://doi.org/10.3390/app10124367
Submission received: 3 March 2020 / Revised: 22 May 2020 / Accepted: 21 June 2020 / Published: 25 June 2020
(This article belongs to the Section Acoustics and Vibrations)

Abstract

:
Railway safety is a matter of importance as a single failure can involve risks associated with economic and human losses. The early fault detection in railway axles and other railway parts represents a broad field of research that is currently under study. In the present work, the problem of the early crack detection in railway axles is addressed through condition-based monitoring, with the evaluation of several condition indicators of vibration signals on time and frequency domains. To achieve this goal, we applied two different approaches: in the first approach, we evaluate only the vibrations signals captured by accelerometers placed along the longitudinal direction and, in the second approach, a data fusion technique at the condition indicator level was conducted, evaluating six accelerometers by merging the indicator conditions according to the sensor placement. In both cases, a total of 54 condition indicators per vibration signal was calculated and selecting the best features by applying the Mean Decrease Accuracy method of Random Forest. Finally, we test the best indicators with a K-Nearest Neighbor classifier. For the data collection, a real bogie test bench has been used to simulate crack faults on the railway axles, and vibration signals from both the left and right sides of the axle were measured. The results not only show the performance of condition indicators in different domains, but also show that the fusion of condition indicators works well together to detect a crack fault in railway axles.

1. Introduction

The railway transportation has a rapid growth worldwide, railway safety is a subject of high interest in the research field. Railway axles are one of the most critical elements in railway transportation systems, and failures such as a cracked axle can lead to the derailment and probably human and economic losses. Therefore, the early detection of faults in railway axles is crucial in railway safety [1,2].
The ultrasound technique can be used to perform condition monitoring over railway axles; however, its disadvantage is that it does not provide continuous information between different tests; therefore, it is not possible to detect fast-growing faults [3]. Other techniques that involve variables, such as temperature [4], acoustics [5], and acoustic emission [1], have been used for the continuous monitoring of faults on railway axles. However, vibration signal monitoring has become the most common monitoring technique due to its high reliability. Fault diagnosis based on vibration signals enables early fault detection, online condition monitoring, and when combined with different signal processing methods and artificial intelligence, better diagnostic results are obtained [6,7,8].
Fault diagnosis based on vibration signals with a data-driven approach is generally accomplished in four phases: (a) acquisition and conditioning of the vibration signal, (b) extraction of features, also called condition indicators, (c) selection of features, and (d) classification. After acquisition and conditioning of the vibration signal, in the feature extraction phase, we study the change of the signal behavior that can be an indicator of the fault condition. The study of these changes in the signal can be focused on the time, frequency or time-frequency domains. From these domains, condition indicators (CIs) or features can be extracted, which allows monitoring or detecting different faults [9]. The evaluation of the CIs has been widely studied, leading to achieve good results in the detection of faults in the railway axles. In the time-frequency domain, the energy calculated by means of Wavelet Packet Transform (WPT) has been used, allowing crack detection with excellent results [10,11]. The above-mentioned works only measured vibrations in the railway axle and bearings in insulation, without the bogie.
The work developed by Gómez et al. [7] (which is the same case of study of the present work), used real railway axles installed in a real Y21 bogie, where the vibration signals from six accelerometers were processed by means of the WPT energy. The feature selection stage was carried out by means of a visual analysis, the energy packages were selected to increase their values with the depth of the crack, and the packages varied with the change of speed. In the classification stage, a radial-basis function-based artificial neural network was used with 32 inputs corresponding to the selected energy packets, the load and speed values; the two possible outputs of the network were healthy or cracked conditions. This work highlighted that the six accelerometers provide important information for detection and better results are achieved at certain speeds. A recent work, over this same case, presented by Lucero at al. [12] evaluated the signals from the six accelerometers; thirty features from the time-domain signal were extracted, then, feature selection was applied, and finally the classification was implemented through a random forest classifier. The best accuracy in the classification was found with ten features, extracted from vibration signals measured with the accelerometers located in the longitudinal direction. The results in this work also show that features such as Wilson Amplitude (WAMP), Wave length (WL), Zero crossing (ZC), Slope Sign Change (SSC), mean, Energy Operator (EO), and the skewness are well-suitable to handle the fault classification. On the other hand, vibrations signals by nature exhibit random behavior in a wide range of applications. To reveal the strengths of different signal domains, the Fast Fourier Transform can be used to switch from time to frequency domain, and it has been noticed changes in the vibration signature of railway elements such as axle-boxes [13]. Moreover, if we want to understand how the strength of a signal is distributed in the frequency domain, we can use the power spectral density (PSD), which describes the power of the signal as a function per frequency unit. Therefore, the PSD can be used to infer normal operation or fault conditions of railway vehicle [14,15].
Through an analysis of CIs, it is possible to obtain adequate information for the understanding and interpretation of the machinery condition, such as the definition of limit values of certain indicators to establish that the machinery is in normal or abnormal conditions. This would support the diagnosis process and the maintenance decision-making [16,17]. On the other hand, the result of the diagnosis from the analysis of CIs can be improved by having several sensors to monitor the machinery because it would allow performing Data -Fusion [18,19]. The extraction of indicators in time and frequency domains requires a lower computational cost than the required one for calculating indicators in the time-frequency domain [20].
Data Fusion refers to the combination of data from multiple sensors of either the same or different types, and can be defined as the use of techniques that combine data from multiple sources. Thus, by using data fusion, a more reliable and realistic inference, deduction or discrimination can be made by using data from different isolated sources in data-driven approaches [21].
The goal of this work is the evaluation of the performance of condition indicators in time and frequency domains for crack detection in railway axles through the use of vibration signals. Two approaches are proposed: (1) the first one evaluates the indicators extracted from vibration signal of two accelerometers, in time domain, frequency domain, and their combination, and (2) the second approach evaluates the data fusion of the indicators of the six accelerometers. The rest of this paper is organized as follows. Section 2 presents the condition indicators used in this work, the selection, and classification methods. Section 3 contains the experimental set-up and data acquisition. In Section 4, the proposed methodology for evaluating the condition indicators performance is detailed. Then, Section 5 shows the results and discussion, and finally in Section 6, the conclusions are addressed.

2. Background

2.1. Condition Indicators

The analysis of signals can be done with several techniques and the use of signals on different domains can be useful to enrich the information obtained from a signal, leading us to a better understanding of its nature. If we are interested in quantifying some signal properties, we can use mathematics, statistics-based values or condition indicators to measure the different signal characteristics; this can help revealing the hidden information inside the signal. These condition indicators (CIs) are often called features. In the present work, the approach used in this feature extraction phase is a combination of different features computed from the signal in both time and frequency domains.
In case of time domain, we used 30 statistical indicators resulting from a mixture of common features mainly used in fault diagnosis and features coming from the Electromyography (EMG) field, such as described in [9]. Additionally, we used frequency-based CIs due to the fact that the frequency domain can reveal valuable information about the change in the monitored system condition in a different shape.
The frequency spectrum X ( k ) of a discrete-time signal x ( i ) can be computed by using the Fast Fourier Transform (FFT). The spectral analysis shows all the harmonic components of a signal, leading to a better understanding of the underlying phenomenon behavior. Another common approach to search signal characteristics in the frequency domain is the Power Spectral Density (PSD). The power spectrum of a signal P ( k ) is given by
P ( k ) = 1 K | X ( k ) | 2
where X ( k ) is a previously obtained frequency spectrum and K denotes the number of points in the power spectrum. The PSD measures the average power of a signal in terms of the frequency, and it also shows periodicities [22,23]. The knowledge about the power distribution among the frequency components contained in a signal is also useful to understand the signal nature.
In frequency domain, we have used 24 condition indicators: 15 of them computed over the frequency spectrum and nine over the power spectrum. These 24 condition indicators are presented in Table 1; here, f k is the frequency value of the spectrum in the corresponding frequency bin k, whereas K denotes the total number of samples in the frequency and power spectrum.

2.2. Random Forest for Feature Selection

Random Forest (RF) is a machine learning algorithm designed by Breiman in 2001 [24] for classification and regression. RF uses multiple decision trees to classify a sample; each decision tree is built by using bootstrap sampling with a random feature selection implementation. Their predictions are used in a voting system where, from all trees, a majority class is calculated. The out-of-bag (OOB) error is mostly used to compute the expected model generalization performance [25].
In Random Forest, two methods defined by Breiman can be used for feature selection through of a feature ranking (feature importance): Mean Decrease Impurity (MDI) and Mean Decrease Accuracy (MDA). In this work, MDA is used for feature selection because if a feature does not impact the model, the method permutes the features values, such that the prediction accuracy should not decrease over the OBB observations.

Feature Selection by MDA

Given a data set D n = { ( X 1 , Y 1 ) , , ( X n , Y n ) } of n samples and p independent variables with X i = ( X i ( 1 ) , , X i ( p ) ) ( i 1 , , n ) being a training sample, the importance of the j-th feature X ( j ) = X 1 ( j ) , , X n ( j ) ( j 1 , , p ) , is calculated averaging the OBB errors of all permutations of trees [25].
Denote D l , n the out-of-bag data set of the l t h tree and D l , n j the same data set, where the values of X ( j ) have been randomly permuted. Keeping in mind that the m n ( · ; Θ l ) represents the l-th tree estimate and where Θ 1 , . . . , Θ M are independent random variable used to resample the training set before the growth of individual trees, MDA takes the form
M D A ( X ( j ) ) = 1 M l = 1 M R n [ m n ( · ; Θ l ) , D l , n j ] R n [ m n ( · ; Θ l ) , D l , n ]
where R n is defined for D = D l , n or D = D l , n j by
R n m n ( · ; Θ l ) , D = 1 | D | i : ( X i , Y i ) D Y i m n ( X i ; Θ l ) 2

2.3. k-Nearest Neighbor Classifier

The K-Nearest Neighbor classifier (KNN) is a popular algorithm directly based on the training samples and commonly used in pattern classification [26].
To classify an unknown sample, two steps are followed. First, KNN calculates the distance between the unknown point q and points x i in the training data, according to a distance metric d ( q , x i ) . Generally, d ( · , · ) can be Euclidean, Manhattan, Minkowski, Cosine, Chebychev Euclidean, Mahalanobis Standardized Euclidean, Hassana, or Chi-Square distance [27,28], just to mention a few distance metrics. Second, the k nearest neighbors are used to determine the class of q . The specified distance rule for classification of a new sample was Simple Voting. Then, an approach where new observation is assigned to the class of the majority of the k nearest points was used [29].

3. Experimental Set-Up

3.1. Bogie Test Bench

The bogie test bench was designed and manufactured by Danobat Railway Systems. It allows simulating different faults in the elements of the bogie. Figure 1 shows the main parts of the bogie, which is composed by the fixed wheels set (1) resting on the structure anchored to the floor (3). Wheels are connected by the fixed shaft (2), whereas the set of rotating wheels (6) are connected by the rotary axis (7); here is where the faults are simulated. The rotating wheels are driven by the rollers (10), and the speed is controlled by the roll driver (4), which is operated manually. The load is simulated by two hydraulic cylinders (9), and transmitted through a chain that pushes a beam (12) against the bogie structure (5). Three accelerometers are located on the left side (11) and right side (8) to measure acceleration in all three directions on each side.

3.2. Signal Acquisition and Experimental Conditions

A pair of bearings is included inside each axle box, supporting the rotating wheels. Three uni-axial accelerometers were placed at each axle box of the wheelset as indicated in Figure 2a, three on the right side (RS) and three on the left side (LS) oriented in three directions: left vertical (LV) accelerometer, left longitudinal (LL) accelerometer, and left axial (LA) accelerometer; right vertical (RV) accelerometer, right longitudinal (RL) accelerometer, and right axial (RA) accelerometer, as presented in Figure 2b. The accelerometer model is CMSS-RAIL-9100 with sensitivity of 100 mV/g, frequency range 0.52 Hz–8 kHz and resonance frequency 25 kHz, the accelerometer is coupled to the conditioner system SKF Multilog IMX-R, that was connected to a computer with the software SKF @ptitude Observer. This conditioner system is for industrial use and reduces the possibility of the signals having noise or interference. The sampling frequency was 12.8 kHz for a time of 1.2 s. The experimental conditions for load, speed, the rotation direction, and crack depths in the railway axle (see Figure 2) are presented in Table 2 for the four experimental conditions; at least 60 samples were acquired for each condition. We named the data obtained from each accelerometer: right-hand accelerometer data set (RD), and for the left and accelerometer data set (LD). Crack faults were artificially generated by an abrasive grinding process. Further details of the experimental conditions can be found in [7].

4. The Proposed Approachs

This work proposes evaluating CIs for crack detection in railway axles. Two approaches are developed: the first approach evaluates the CIs of two accelerometers in the time domain, frequency domain, and combining them, the second approach evaluates the fusion of the CIs of all six accelerometers.

4.1. Proposed Approach 1

This approach evaluates condition indicators extracted from the vibration signal in time domain, frequency domain, and a combination of both called time + frequency domain for crack detection in railway axles. In this approach, we analyze the two accelerometers located along the longitudinal direction: on the right side (RL) and another on the left side (LL). According to the work developed by Lucero et al. [12], these accelerometers and their orientation offer the best information.
Figure 3a shows the workflow of the first approach. It has four stages: (1) Data Acquisition, (2) Feature Extraction, (3) Feature Selection, and (4) Classification. Each stage includes the following tasks:
  • Data acquisition: two sets of signals are acquired, from the left accelerometer (LL) along the longitudinal direction and from the right accelerometer (RL) along the same direction under different fault, load, and speed conditions, as explained in Section 3.2.
  • Feature extraction: the feature extraction was applied as explained in Section 3.2. Figure 4 shows examples of vibration signals in time domain, and signals in the frequency domain using FFT and PSD from both the left and right accelerometers, for normal (healthy) condition and fault level 1. For each signal, different CIs were obtained according to three domains: (i) 30 CIs for time domain (CI_T), (ii) 24 CIs for frequency domain (CI_F), and (iii) 54 CIs from time + frequency domain (CI_TF). Therefore, six data sets were completed. Three data sets were obtained with the left-side accelerometer along the longitudinal direction (LL): the first CI_TL contains CIs extracted from the left side accelerometer and in time domain, the second CI_FL contains the CIs in frequency domain for the same accelerometer, and finally, the third CI_TFL has CIs in the time + frequency domain, also with the same accelerometer. Similarly, with the right-side accelerometer along the longitudinal direction (RL), we obtained other three data sets: CI_TR, CI_FR, and CI_TFR on time, frequency and time + frequency domain, respectively.
  • Feature selection: this process starts by removing correlated features. Correlation coefficient value on 0.8–1 between two CIs suggests these two features are highly correlated [30]. In this study, a threshold value equal to or greater than 0.95 (95%) was selected to identify two highly-correlated features, in all data sets. This threshold was stated after obtaining good performance in classification. Next, a normalization process was applied by scaling characteristics between −1 and 1. Normalization can be applied to Normal distributed data, as well as to data with another type of distribution. This process helps KNN to give equal importance to all CIs. After the preprocessing step, a Random Forest model was implemented with 40 trees as the main parameter. MDA metric was used to select the ten most important CIs, and then, the selection of the ten best-ranked features from each dataset.
  • Classification: each dataset is organized in a matrix of samples in the rows and CIs in the columns. Each pre-selected dataset (built from 10 ranked CIs) in stage 3 is classified using KNN (a value of k = 3 was chosen as the main parameter after testing different number of neighbors) and the cosine distance metric was used.
    Usually, when using k-folds cross-validation, values of k = 5 or k = 10 are chosen due to the good results obtained empirically [31]. In our work, the best results were obtained for k = 5. Therefore, a five-fold cross-validation strategy was carried out; the average accuracy and standard deviation ( s t d ) on the cross-validation process were calculated from 5 runs.
    Finally, many classifications were performed; these starts by using the first feature for a classification, next, the two first features are used for a new classification, and so on, until reaching the ten first features. The purpose is to analyze the contribution of each ranked feature.

4.2. Proposed Approach 2

The second approach evaluates the signals from the six accelerometers using data fusion at the indicator level. Figure 3b shows the workflow of this approach. It has five stages: (1) Data acquisition, (2) Feature extraction, (3) Data-fusion, (4) Feature selection, and (5) Classification. Each of the stages is explained below.
  • Data acquisition: the six vibration signals, three accelerometers mounted on the left side LV, LL, LA, and three accelerometers placed on the right side RV, RL, RA, were acquired under different faults, load and velocity conditions.
  • Feature extraction: for each signal, different condition indicators were obtained according to two domains: (i) 30 CIs in time domain and (ii) 24 CIs in the frequency domain. All those condition indicators are combined, so, for each vibration signal, 54 CIs are obtained. Therefore, six data sets were used, three different sets of CIs per each accelerometer.
  • Data-Fusion: from the six data sets of CIs obtained in the feature extraction stage, data fusion is performed at the indicator level. Three new sets of indicators are obtained: (i) The sets of CIs extracted from signals of the three accelerometers on the left side (LV, LA, LL) are fused, obtaining the first set of fused CIs called CI_DFL; (ii) the CIs of the three accelerometers on the right side (RV, RA, RL) are fused, obtaining a fused data set called CI_DFR; and finally (iii) the CIs of the six accelerometers are fused, obtaining the third set of fused CIs called CI_DFRL.
  • Feature selection and classification: from the three data sets fused in stage 3, the next stages 4 (feature selection) and 5 (classification) were carried out by following the same procedures, experimental conditions, and CIs analysis of the stages Feature Selection and Classification of the approach 1 detailed in Section 4.1 and presented in Figure 3a.

5. Results and Discussion

5.1. Results of Approach 1

Table 3, Table 4, Table 5, Table 6, Table 7 and Table 8 show the approaches to the results of approach 1. The top 10 CIs and their averaged accuracies for each data set are shown in Table 3, Table 5 and Table 7. Table 4, Table 6 and Table 8 show the accuracies per class for CI_TL, CI_FL and CI_TFL, respectively.
In Table 3, the best accuracy of 92.89%, was obtained by using CI_TL with 7 CIs. Otherwise, to CI_TR a maximum accuracy of 95.17% was obtained with 8 CIs. Per-class accuracy values for CI_TL are presented in Table 4. The best performance with 10 attributes shows that class 1 has the highest classification accuracy over 98% followed by class 2 with accuracy over 93%. Classes 3 and 4 have similar classification accuracy over 86%. Class 3 does not reach accuracy over 89% in any case. Class 4 can reach accuracy over 92% opposite to the accuracy of classes 1 and 2. Regarding the top 10 of time domain CIs presented in the Table 3, nine of the 10 are common: zero crossing, wave length, SSC, WAMP, kurtosis, energy operator, CPT3, shape factor, and skewness.
In Table 5, the best accuracy of 91.29% was obtained using CI_FL with 7 CIs. By contrast, to CI_FR a maximum accuracy of 94.36% was obtained with 9 CIs. Per-class accuracy values for CI_FL are presented in Table 6. The best performance with 10 attributes shows that class 4 has the highest classification accuracy over 98% followed by class 2 with accuracy over 89%. Classes 1 and 3 have similar classification accuracy over 85%. Classes 1, 2 and 3 do not reach accuracy over 89% in any case. Regarding the top 10 frequency domain CIs presented in the Table 5, seven of the 10 are common PKF, CP2, FR, skewnessf, centroid of spectrum, VCF, and spectrum spread.
In Table 7, the best accuracy of 96.62% due to its low std 0.59, was obtained using CI_TFL with 10 CIs. By contrast, to CI_TFR a maximum accuracy of 97.14% was obtained with also 10 CIs. Per-class values for CI_TFL are presented in Table 8. The best performance with 10 attributes shows that classes 1 and 4 have similar high classification accuracy over 97%. Classes 2 and 3 have also similar classification accuracy over 95%. From the use of seven attributes, all the classes reach classification accuracy over 90%. Regarding the top 10 time + frequency domain CIs presented in the Table 7, six of the 10 are common; four belong to the frequency domain: PKF, CP2, FR, SM4; and two to the time domain: zero crossing and SCC. This result shows that the use of combined features from time and frequency domains can improve the classification accuracy for each class.
To evaluate the CIs in time, frequency and time + frequency domain, we compare the accuracy of each ranked feature set. Figure 5 shows the accuracy results for each data set. Here, the X-axis represents the number of k first CIs most important selected by RF. The Y-axis represents the average accuracy of the k first CIs.
In order to illustrate the importance of feature selection, a scatter plot obtained from the best case, related to the CI_FL set, is presented in Figure 6, where the class 1 is the red color, class 2 is the green color, class 3 is the cyan color, and class 4 is the violet color. This set provides the high accuracy and low s t d using 3 attributes. The formation of small clusters can be observed for each of the classes in different locations in the three-dimensional space, and no perfect boundaries are exhibited. However, these three attributes can provide over 80% of accuracy, according to Figure 5.

5.2. Results of Approach 2

The top 10 CIs and their accuracies are presented in Table 9 and Table 10 by using a Data Fusion approach, for all three data sets. The set of fused CIs from the left-side accelerometers (LV, LA, and LL) named CI_DFL, the set of fused CIs from the right-side accelerometers (RV, RA, and RL) named CI_DFR as presented in Table 9, and the third set of fused CIs from all six accelerometers named CI_DFRL as presented in Table 10. Figure 5 shows the accuracy results for each data set. Here, the X-axis represents the number of k first CIs most important selected by RF. The Y-axis represents the average accuracy of the k first CIs. Regarding the 10 top CIs presented in the Table 9 and Table 10, five of the 10 are common, four belong to the frequency domain, i.e., PKF, FSK, VCF, skewnessf, and one to the time domain, i.e. SCC.
The highest accuracy (acc) using 10 condition indicators in CI_DFL was 97.56% with a std of 0.88, for CI_DFR was 98.01% with std of 0.88, and for CI_DFRL was 98.37% with std of 0.76. Of the latter, its accuracy by class is presented in the Table 11, where the class with the highest accuracy was number 4 obtaining 100% accuracy with 9 CIs.
In order to illustrate the importance of feature selection and data fusion in the approach 2, a scatter plot obtained from the best case, related to the CI_DFRL set, is presented in Figure 7. This set provides the highest accuracy (98.37%) and lowest s t d (0.76) using 3 CIs. Best small clusters can be observed for each class in contrast to Figure 6. The accuracy using 3 CIs was of 88.24%. These features were MDF-Freq-LV, VCF-Freq-RL, and CP2-Freq-RA.

5.3. Discussion

The proposed approaches provide the most efficient CIs to be evaluated for crack detection. The results in Table 3, Table 5, Table 7, Table 9 and Table 10 show the approach 1 achieves an accuracy rate of more than 91% from the seventh CI for all data sets while approach 2 achieves a rate of more than 93% from the fifth CI. Furthermore, the s t d obtained by each K-fold is presented in these tables where it can be seen that new CIs help reducing the s t d of the cross-validation process. This result indicates the classifier becomes more robust to new features. Note that using the s t d metric from the cross-validation process is a more useful, unlike the typical single train/test experiment.
In Figure 5, the trend of the curves indicates a fast growth of the accuracy up to the seven CIs, later the value of the accuracy flattens, i.e. the classifier will not significantly improve the accuracy after adding more CIs. It is important to note that all trends present a continuous increment in the accuracy as CIs increases except for CI_FL, for which the fourth CI decreases its performance.
Figure 8 shows the maximum classification accuracy of each data set and the number of indicators for which it was obtained. Approach 1, which employs a single accelerometer, achieves the best results by combining the CIs of time + frequency domain for both the left and right side accelerometers. Slightly better classification results are achieved with approach 2; however, data fusion by using six-accelerometers may involve greater challenges to perform the system diagnostic. For the three domains analyzed, better results are achieved with the right-side accelerometers, which may be due to the fact that the drive is not on this side and the signal information is less noisy.
Regarding the other works in this same case, Lucero et al. [12] reached a maximum accuracy of 96.43% and Gomez et al. [7] achieved the overall probability of detection at 95%, with 32 features (energy computed on WPT). In this work, the maximum accuracy is 97.14% with the approach 1, and 98.37% with the approach 2.
The ZC and SCC condition indicators of time domain and PKF, CP2 in frequency domain, are the common CIs among the top 10 CIs in both approach 1 and 2.
One of the main benefits of approach 1 is the possibility of classifying cracks in railway axles with a few CIs of time + frequency domain extracted from signals of a single accelerometer. In this way, we can reduce the number of machine sensors to detect a crack. On the other hand, eliminating non-informative indicators improves the performance of the classification algorithm. In addition, the use of few features reduces the computational cost of data processing.

6. Conclusions

In this work, it was possible to develop a practical and straightforward methodology to evaluate the performance of condition indicators (CIs) in time and frequency domains. Its application was tested for crack detection in railway axles, by using vibration signals measured with accelerometers located on the right and left side of the railway axles, along the vertical, axial, and longitudinal directions. This test bed is made of real components and has been used in previous works for testing other approaches of vibration analysis.
Two approaches were proposed. Approach 1 only analyzes the CIs extracted from the vibrations signals measured with the accelerometers placed along the longitudinal direction, on the right and left sides of the axles, as an extension for comparing to previous works analyzing this case. Approach 2 uses a data fusion analysis, by combining the CIs extracted from all the six accelerometers.
In approach 1, CIs extracted from the accelerometer on the right side in the different domains show slightly better results of classification accuracy than the condition indicators from the accelerometer on the left side. The right side accelerometer time + frequency domain data set (CI_TFR) had the highest performance, with classification accuracy over 97.14%, which is better than the results obtained in previous works. This fact may be due because the drive motor is placed on the right side, and the driven motor is on the left side which provides movement to the rotating wheels.
In approach 2, the best performance was achieved with CIs in the data set CI_DFRL which combines all the indicators extracted from the six accelerometers. Classification accuracy was over 98.38%, and this result is better than those one by using approach 1, showing that Data Fusion is a good approach to improve accuracy in fault classification.
The best results in terms of classification accuracy were achieved by using a combination of indicators extracted from time and frequency domains. The CIs named ZC and SCC in the time domain and PKF, skewnessf, CP2 in the frequency domain, are the common CIs among the top 10 CIs in both approach 1 and approach 2.
The main advantage of this methodology is that it supports the identification of the relevant CIs for fault detection in different domains of the vibration signal. This allows identifying which CIs should be monitored on-line, mainly the CIs of time domain as they have low computational cost regarding the signal processing.
As future work, the contribution of each single attribute to improving the accuracy values regarding the crack detection in railway axles will be analyzed. Moreover, the analysis of CIs coming from other application domains will be developed. Additionally, it is expected to have a repository of discriminating and independent features to develop effective data-driven classification algorithms.

Author Contributions

R.-V.S. proposed the idea, H.R.A. and C.C. provided the experimental design and data, J.-C.M. implemented the data processing algorithms and wrote a draft of the paper, P.L. implemented classification algorithms and performed results analysis, and M.C. and D.C. supervised the work and reviewed the final manuscript. All authors contributed in the proposed methodology. All authors have read and agree to the published version of the manuscript.

Funding

This research received no external funding, the APC was funded by Universidad Politécnica Salesiana through the research group GIDTEC.

Acknowledgments

Authors would like to thank the support provided by the Spanish Government, through the MAQ-STATUS DPI2015-69325-C2-1-R project, and Universidad Politécnica Salesiana through the research group GIDTEC.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zhang, X.; Zou, Z.; Wang, K.; Hao, Q.; Wang, Y.; Shen, Y.; Hu, H. A new rail crack detection method using LSTM network for actual application based on AE technology. Appl. Acoust. 2018, 142, 78–86. [Google Scholar] [CrossRef]
  2. Hassan, M.; Bruni, S.; Carboni, M. Crack detection in railway axle using horizontal and vertical vibration measurements. In Proceedings of the 7th IET Conference on Railway Condition Monitoring 2016 (RCM 2016), Birmingham, UK, 27–28 September 2016. [Google Scholar]
  3. Cavuto, A.; Martarelli, M.; Pandarese, G.; Revel, G.; Tomasini, E. Experimental investigation by laser ultrasonics for high speed train axle diagnostics. Ultrasonics 2015, 55, 48–57. [Google Scholar] [CrossRef] [PubMed]
  4. Cheng, Y.; Wang, Z.; Zhang, W. A novel condition-monitoring method for axle-box bearings of high-speed trains using temperature sensor signals. IEEE Sens. J. 2018, 19, 205–213. [Google Scholar] [CrossRef]
  5. Artagan, S.S.; Ciampoli, L.B.; D’Amico, F.; Calvi, A.; Tosti, F. Non-destructive assessment and health monitoring of railway infrastructures. Surv. Geophys. 2019, 41, 1–37. [Google Scholar] [CrossRef]
  6. Sabnavis, G.; Kirk, R.G.; Kasarda, M.; Quinn, D. Cracked shaft detection and diagnostics: A literature review. Shock Vib. Dig. 2004, 36, 287. [Google Scholar] [CrossRef]
  7. Gómez, M.J.; Corral, E.; Castejón, C.; García-Prada, J.C. Effective Crack Detection in Railway Axles Using Vibration Signals and WPT Energy. Sensors 2018, 18, 1603. [Google Scholar] [CrossRef] [Green Version]
  8. Li, Y.; Zhang, W.; Xiong, Q.; Lu, T.; Mei, G. A novel fault diagnosis model for bearing of railway vehicles using vibration signals based on symmetric alpha-stable distribution feature extraction. Shock Vib. 2016, 2016. [Google Scholar] [CrossRef] [Green Version]
  9. Sánchez, R.V.; Lucero, P.; Vasquez, R.; Cerrada, M.; Macancela, J.; Cabrera, D. Feature ranking for multi-fault diagnosis of rotating machinery by using random forest and KNN. J. Intell. Fuzzy Syst. 2018, 34, 3463–3473. [Google Scholar] [CrossRef]
  10. Gómez, M.; Castejón, C.; García-Prada, J. Review of recent advances in the application of the wavelet transform to diagnose cracked rotors. Algorithms 2016, 9, 19. [Google Scholar] [CrossRef]
  11. Wei, J.; Liu, C.; Ren, T.; Liu, H.; Zhou, W. Online condition monitoring of a rail fastening system on high-speed railways based on wavelet packet analysis. Sensors 2017, 17, 318. [Google Scholar] [CrossRef]
  12. Lucero, P.; Sánchez, R.V.; Macancela, J.C.; Cabrera, D.; Cerrada, M.; Li, C.; Alonso, H.R. Accelerometer Placement Comparison for Crack Detection in Railway Axles Using Vibration Signals and Machine Learning. In Proceedings of the 2019 Prognostics and System Health Management Conference (PHM-Paris), Paris, France, 2–5 May 2019; pp. 291–296. [Google Scholar]
  13. Trilla, A.; Gratacòs, P. Maintenance of bogie components through vibration inspection with intelligent wireless sensors: A case study on axle-boxes and wheel-sets using the empirical mode decomposition technique. J. Rail Rapid Trans. 2016, 230, 1408–1414. [Google Scholar] [CrossRef]
  14. Aravanis, T.C.I.; Sakellariou, J.S.; Fassois, S.D. Spectral analysis of railway vehicle vertical vibration under normal operating conditions. Int. J. Rail Transp. 2016, 4, 193–207. [Google Scholar] [CrossRef]
  15. Rolek, P.; Bruni, S.; Carboni, M. Condition monitoring of railway axles based on low frequency vibrations. Int. J. Fatigue 2016, 86, 88–97. [Google Scholar] [CrossRef]
  16. Liu, T.; Chen, J.; Dong, G. Zero crossing and coupled hidden Markov model for a rolling bearing performance degradation assessment. J. Vib. Control 2014, 20, 2487–2500. [Google Scholar] [CrossRef]
  17. Ninoslav, Z.F.; Rusmir, B.; Cvetkovic, D. Vibration feature extraction methods for gear faults diagnosis—A review. Facta Univ. Ser. Work. Living Environ. Prot. 2015, 12, 63–72. [Google Scholar]
  18. Cai, G.; Zhao, J.; Song, Q.; Zhou, M. System architecture of a train sensor network for automatic train safety monitoring. Comput. Ind. Eng. 2019, 127, 1183–1192. [Google Scholar] [CrossRef]
  19. Hua, Z.; Sun, Z.; Jia, H.; Zhao, Z.; Hou, L. Track Occupation Detection Based on a Maximum Posterior Probability Model Using Multisensor Data Fusion. Comput. Sci. Eng. 2018, 21, 40–54. [Google Scholar] [CrossRef]
  20. Li, H.; Xiao, D.Y. Fault diagnosis based on power spectral density basis transform. J. Vib. Control 2015, 21, 2416–2433. [Google Scholar] [CrossRef]
  21. Khaleghi, B.; Khamis, A.; Karray, F.O.; Razavi, S.N. Multisensor data fusion: A review of the state-of-the-art. Inf. Fusion 2013, 14, 28–44. [Google Scholar] [CrossRef]
  22. Mendel, J.M. Tutorial on higher-order statistics (spectra) in signal processing and system theory: Theoretical results and some applications. Proc. IEEE 1991, 79, 278–305. [Google Scholar] [CrossRef]
  23. Toledo, E.; Pinhas, I.; Aravot, D.; Akselrod, S. Bispectrum and bicoherence for the investigation of very high frequency peaks in heart rate variability. In Proceedings of the Computers in Cardiology 2001, Rotterdam, The Netherlands, 23–26 September 2001; pp. 667–670. [Google Scholar]
  24. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
  25. Biau, G.; Scornet, E. A random forest guided tour. Test 2016, 25, 197–227. [Google Scholar] [CrossRef] [Green Version]
  26. Duda, R.O.; Hart, P.E. Pattern Classification and Scene Analysis; Wiley: New York, NY, USA, 1973. [Google Scholar]
  27. Hu, L.Y.; Huang, M.W.; Ke, S.W.; Tsai, C.F. The distance function effect on k-nearest neighbor classification for medical datasets. SpringerPlus 2016, 5, 1304. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. Prasath, V.; Alfeilat, H.A.A.; Lasassmeh, O.; Hassanat, A.; Tarawneh, A.S. Distance and Similarity Measures Effect on the Performance of K-Nearest Neighbor Classifier–A Review. arXiv 2017, arXiv:1708.04321. [Google Scholar]
  29. Cunningham, P.; Delany, S.J. k-Nearest neighbour classifiers. Mult. Classif. Syst. 2007, 34, 1–17. [Google Scholar]
  30. Swinscow, T.D.V.; Campbell, M.J. Statistics at Square One; BMJ: London, UK, 2002. [Google Scholar]
  31. Casella, G.; Fienberg, S.; Olkin, I. An Introduction to statistical learning with Applications in R. In Springer Texts in Statistics; Springer: New York, NY, USA, 2013. [Google Scholar]
Figure 1. Components of the bed test. (a) Back view of the test bed. (b) Front view of the test bed.
Figure 1. Components of the bed test. (a) Back view of the test bed. (b) Front view of the test bed.
Applsci 10 04367 g001
Figure 2. Location of accelerometers and crack fault. (a) Location of the crack in the railway axle. (b) Location of accelerometers and crack fault.
Figure 2. Location of accelerometers and crack fault. (a) Location of the crack in the railway axle. (b) Location of accelerometers and crack fault.
Applsci 10 04367 g002
Figure 3. Flowcharts of the proposed approaches. (a) Flowchart approach 1 (b) Flowchart approach 2.
Figure 3. Flowcharts of the proposed approaches. (a) Flowchart approach 1 (b) Flowchart approach 2.
Applsci 10 04367 g003
Figure 4. Sample vibration signal. Left and right longitudinal accelerometers with load of 4 tons, speed of 20 km/h, and crack level 1. (a) Time Signal left Accelerometer; (b) Time Signal right Accelerometer; (c) Frequency Spectrum left Accelerometer; (d) Frequency Spectrum right Accelerometer; (e) Power Density Spectrum left Accelerometer; (f) Power Density Spectrum right Accelerometer.
Figure 4. Sample vibration signal. Left and right longitudinal accelerometers with load of 4 tons, speed of 20 km/h, and crack level 1. (a) Time Signal left Accelerometer; (b) Time Signal right Accelerometer; (c) Frequency Spectrum left Accelerometer; (d) Frequency Spectrum right Accelerometer; (e) Power Density Spectrum left Accelerometer; (f) Power Density Spectrum right Accelerometer.
Applsci 10 04367 g004
Figure 5. Accuracy vs. different numbers of selected CIs.
Figure 5. Accuracy vs. different numbers of selected CIs.
Applsci 10 04367 g005
Figure 6. Scatter plot using 3 best CIs of CI_FL.
Figure 6. Scatter plot using 3 best CIs of CI_FL.
Applsci 10 04367 g006
Figure 7. Scatter plot using 3 best CIs of CI_DFRL.
Figure 7. Scatter plot using 3 best CIs of CI_DFRL.
Applsci 10 04367 g007
Figure 8. Highest classification accuracy achieved per number of CIs.
Figure 8. Highest classification accuracy achieved per number of CIs.
Applsci 10 04367 g008
Table 1. Features.
Table 1. Features.
Mean of spectrum F 1 = k = 1 K X ( k ) K
Variance of spectrum F 2 = k = 1 K ( X ( k ) F 1 ) 2 K 1
Skewness of spectrum (Skewnessf) F 3 = k = 1 K ( X ( k ) F 1 ) 3 K ( F 2 ) 3
Kurtosis of spectrum F 4 = k = 1 K ( X ( k ) F 1 ) 4 K ( F 2 ) 4
Central Frequency F 5 = k = 1 K f k X ( k ) k = 1 K X ( k )
STD of spectrum F 6 = k = 1 K ( f k F 5 ) 2 X ( k ) k = 1 K X ( k )
RMS of spectrum F 7 = k = 1 K f k 2 X ( k ) X ( k )
CP1 F 8 = k = 1 K ( f k F 5 ) 3 X ( k ) K ( F 6 ) 3
CP2 F 9 = F 6 F 5
CP3 F 10 = k = 1 K ( f k F 5 ) 1 2 X ( k ) K F 6
CP4 F 11 = k = 1 K ( f k F 5 ) 3 X ( k ) F 6 2 K
CP5 F 12 = k = 1 K f k 4 X ( k ) k = 1 K f k 2 X ( k )
Centroid of Spectrum F 13 = k = 1 K k X ( k ) k = 1 K X ( k )
Spectrum Spread F 14 = k = 1 K ( k F 13 ) 2 X ( k ) k = 1 K X ( k )
Entropy of spectrum F 15 = k = 1 K 1 P n ( k ) l o g 2 [ P n ( k ) ]
where P n is the normalized total spectral energy
P n ( k ) = X ( k ) k = 1 K X ( k )
Total power F 16 = k = 1 K P ( k )
Median Frequency (MDF) F 17 = 1 2 k = 1 K P ( k )
Peak frequency (PKF) F 18 = m a x ( P ( k ) ) , k = 1 , . . . K .
First Spectral Moment F 19 = k = 1 K P ( k ) f k
Second Spectral Moment F 20 = k = 1 K P ( k ) f k 2
Third Spectral Moment F 21 = k = 1 K P ( k ) f k 3
Fourth Spectral Moment (SM4) F 22 = k = 1 K P ( k ) f k 4
Spectral moment ratio (VCF) F 23 = F 20 F 16 ( F 19 F 16 ) 2
Frequency ratio (FR) F 24 = L L C = f m i n U L C = f m a x / 2 P ( k ) / L H C = f m a x 2 + 1 U H C = f m a x P ( k )
Table 2. Experimental conditions.
Table 2. Experimental conditions.
3 loads- 4, 10, y 16 tons
2 speeds conditions- 20 km/h
- 50 km/h
6 vibration measurement placements- Left vertical (LV) accelerometer, left longitudinal (LL) accelerometer,
left axial (LA) accelerometer
- Right vertical (RV) accelerometer, right longitudinal (RL) accelerometer,
right axial (RA) accelerometer
Rotational directions- Clockwise and counterclockwise
4 fault conditions, see Figure 2- Healthy (Normal)
- Crack level 1 (e = 5.7 mm)
- Crack level 2 (e = 10.9 mm)
- Crack level 3 (e = 15 mm)
Table 3. Top 10 CIs and accuracies average over time domain.
Table 3. Top 10 CIs and accuracies average over time domain.
CI_TLCL_RT
#CIsAccuracy (%)Std#CIsAccuracy (%)Std
1Zero crossing25.60.201SSC25.850.26
2Wave length60.453.262SRAV54.743.63
3Kurtosis76.632.043Wave length85.433.15
4WAMP85.071.294WAMP87.211.32
5SSC86.931.505Shape factor89.271.83
6CPT489.691.126Skewness90.431.85
7Skewness92.891.457Zero crossing93.652.46
8Shape factor92.891.188Kurtosis95.171.30
9CPT3921.619Energy operator94.640.93
10Energy operator91.640.7410CPT394.361.38
Table 4. Accuracy per class to CI_TL (time domain).
Table 4. Accuracy per class to CI_TL (time domain).
#CIsClasses (Accuracy%)
1234
1Zero crossing100000
2Wave length65.9555.4448.5871.92
3Kurtosis86.4774.0471.5874.06
4WAMP89.2384.5683.0983.21
5SSC90.9784.5686.3485.76
6CPT494.4588.7784.5590.9
7Skewness97.5792.6388.8492.32
8Shape factor98.2594.0487.7591.23
9CPT397.5793.6888.1588.32
10Energy operator98.2693.6886.3387.95
Table 5. Top 10 features and accuracies average over frequency domain.
Table 5. Top 10 features and accuracies average over frequency domain.
CI_FLCI_FR
#CIsAccuracy (%)Std#CIsAccuracy (%)Std
1PKF25.60.151Skewnessf25.850.17
2CP256.82.932Spectrum spread41.772.44
3SM485.511.353PKF71.382.65
4FR722.544CP282.753.13
5Skewnessf79.032.575MDF83.992.48
6Spectrum spread87.212.436FR89.361.58
7VCF91.292.927CP394.011.16
8Centroid of Spectrum90.42.828SM193.471.32
9Kurtosisf90.131.089VCF94.360.93
10Entropy of spectrum90.231.9710Centroid of Spectrum94.271.70
Table 6. Accuracy per class to CI_FL (frequency domain).
Table 6. Accuracy per class to CI_FL (frequency domain).
#CIsClasses (Accuracy%)
1234
1PKF100000
2CP253.865.6156.1451.47
3SM478.1485.9684.5393.8
4FR71.8668.7768.3379.16
5Skewnessf81.9877.8974.7981.41
6Spectrum spread85.483.8684.1895.62
7VCF89.2489.8287.7798.54
8Centroid of Spectrum87.8689.8286.3297.81
9Kurtosisf87.1590.8884.8697.81
10Entropy of spectrum87.8589.4785.6398.17
Table 7. Top 10 CIs and accuracies average over time and frequency domain.
Table 7. Top 10 CIs and accuracies average over time and frequency domain.
CI_TFLCI_TFR
#CIsDomainAccuracy (%)Std#CIsDomainAccuracy (%)Std
1Zero crossingTime25.60.151SSCTime25.850.17
2PKFFreq52.81.332FRFreq50.811.77
3WAMPTime73.433.373SM1Freq71.561.86
4CP2Freq78.851.914SkewnessfFreq88.561.88
5SSCTime82.491.895Spectrum spreadFreq93.831.67
6Shape factorTime92.621.706Zero crossingTime95.261.19
7SkewnessTime93.692.087SM4Freq94.631.15
8KurtosisTime94.490.898CP3Freq96.870.55
9SM4Freq94.42.359CP2Freq96.62.46
10FRFreq96.620.5910PKFFreq97.141.29
Table 8. Accuracy per class to CI_TFL (time and frequency domain).
Table 8. Accuracy per class to CI_TFL (time and frequency domain).
#CIsDomainClasses (Accuracy%)
1234
1Zero crossingTime100000
2PKFFreq63.9143.5141.3662.4
3WAMPTime79.8964.2161.1688.69
4CP2Freq77.1175.0972.6490.88
5SSCTime84.0679.376.6390.15
6Shape factorTime93.0489.4792.7995.25
7SkewnessTime96.1990.8896.7790.86
8KurtosisTime96.8892.2896.3992.32
9SM4Freq96.8891.5897.4891.59
10FRFreq97.2195.0996.497.81
Table 9. Top 10 features and accuracies average over time and frequency domain using left and right sensors.
Table 9. Top 10 features and accuracies average over time and frequency domain using left and right sensors.
CI_DFLCI_DFR
#CIs - DomainSensorAcc(%)Std#CIs - DomainSensorAcc(%)Std
1CPT6 - TimeLL25.50.131SSC - TimeRA25.50.17
2PKF - FreqLA62.213.452Skewnessf - FreqRV46.111.45
3FR -FreqLV84.721.723SSC - TimeRL69.432.58
4MDF - FreqLV93.670.854Skewnessf - FreqRL87.161.2
5PKF - FreqLV94.750.695CP2 - FreqRL93.671.85
6Kurtosis - TimeLL96.381.46PKF - FreqRV93.761.55
7Skewnessf - FreqLV96.471.267ShapeFactor - TIimeRA96.481.78
8SSC - TimeLV96.561.598MDF - FreqRL96.651.22
9SM4 - FreqLA97.20.929VCF - FreqRL97.921.05
10VCF - FreqLV97.560.8810ZC - TimeRL98.010.88
Table 10. Top 10 features and accuracies average over time and frequency domain using all sensors, CI_DFRL.
Table 10. Top 10 features and accuracies average over time and frequency domain using all sensors, CI_DFRL.
CI_DFRL
#CIsDomainSensorAccuracy (%)Std
1MDFFreqLV25.50.19
2VCFFreqRL63.473.96
3CP2FreqRA88.241.12
4SSCTimeLV92.681.67
5CP2FreqLL94.211.87
6PKFFreqLV96.381.24
7PKFFreqRL97.020.82
8CP2FreqRL97.21.14
9ZCTimeLL97.380.38
10SkewnessfFreqLV98.370.76
Table 11. Accuracy per class to CI_DFRL.
Table 11. Accuracy per class to CI_DFRL.
#CIsDomainSensorClasses (Accuracy%)
1234
1MDFFreqLV100000
2VCFFreqRL70.9446.9425.8239.88
3CP2FreqRA84.464.7566.1861.97
4SSCTimeLV90.7684.7279.6493.65
5CP2FreqLL96.4694.2985.8298.13
6PKFFreqLV96.893.9386.5597.77
7PKFFreqRL97.5396.4493.0998.88
8CP2FreqRL96.8396.0994.1899.62
9ZCTimeLL97.8897.5196.36100
10SkewnessfFreqLV98.5897.8696.7398.88

Share and Cite

MDPI and ACS Style

Sánchez, R.-V.; Lucero, P.; Macancela, J.-C.; Rubio Alonso, H.; Cerrada, M.; Cabrera, D.; Castejón, C. Evaluation of Time and Frequency Condition Indicators from Vibration Signals for Crack Detection in Railway Axles. Appl. Sci. 2020, 10, 4367. https://doi.org/10.3390/app10124367

AMA Style

Sánchez R-V, Lucero P, Macancela J-C, Rubio Alonso H, Cerrada M, Cabrera D, Castejón C. Evaluation of Time and Frequency Condition Indicators from Vibration Signals for Crack Detection in Railway Axles. Applied Sciences. 2020; 10(12):4367. https://doi.org/10.3390/app10124367

Chicago/Turabian Style

Sánchez, Réne-Vinicio, Pablo Lucero, Jean-Carlo Macancela, Higinio Rubio Alonso, Mariela Cerrada, Diego Cabrera, and Cristina Castejón. 2020. "Evaluation of Time and Frequency Condition Indicators from Vibration Signals for Crack Detection in Railway Axles" Applied Sciences 10, no. 12: 4367. https://doi.org/10.3390/app10124367

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop