You are currently viewing a new version of our website. To view the old version click .
Mathematics
  • Article
  • Open Access

26 September 2023

Intelligent Fault Diagnosis of Robotic Strain Wave Gear Reducer Using Area-Metric-Based Sampling

,
,
and
1
Department of Mechanical, Robotics and Energy Engineering, Dongguk University-Seoul, 30 Pil-dong 1 Gil, Jung-gu, Seoul 04620, Republic of Korea
2
George W. Woodruff School of Mechanical Engineering, Georgia Institute of Technology, North Ave NW, Atlanta, GA 30332, USA
*
Authors to whom correspondence should be addressed.
This article belongs to the Special Issue Recent Advances in Machine Learning Methods for Mechanical Engineering

Abstract

The main challenge with rotating machine fault diagnosis is the condition monitoring of machines undergoing nonstationary operations. One possible way of efficiently handling this situation is to use the deep learning (DL) method. However, most DL methods have difficulties when the issue of imbalanced datasets occurs. This paper proposes a novel framework to mitigate this issue by developing an area-metric-based sampling method. In the proposed process, the new sampling scheme can identify which locations of the datasets can potentially have a high degree of surprise. The basic idea of the proposed method is whenever significant deviations from the area metrics are observed to populate more sample points. In addition, to improve the training accuracy of the DL method, the obtained sampled datasets are transformed into a continuous wavelet transform (CWT)-based scalogram representing the time–frequency component. The dilated convolutional neural network (CNN) is also introduced as a classification process with the altered images. The efficacy of the proposed method is demonstrated with fault diagnosis problems for welding robots. The obtained results are also compared with existing methods.

1. Introduction

In intelligent manufacturing industrial systems, robots are widely used to autonomously carry out tasks. However, their long-term and continuous operation cause malfunctions in subcomponents, such as gears and bearings. If maintenance is not promptly carried out, these malfunctions can create various failures in the system. These failures can affect productivity and increase the probability of a shutdown. At this point, preventive maintenance of industrial machinery is a common procedure to improve the reliability and safety of the equipment. This eliminates unnecessary maintenance procedures, significantly reducing operating costs, while revitalizing research on “condition-based maintenance”. Recently, vibration signal-based condition monitoring (CM) has been commonly used for system health monitoring []. However, vibration signal-based CM needs additional sensors for complicated data acquisition systems and is only suitable for large electric machines [].
In modern industrial systems, the motor current signal can be easily acquired with a current transformer that is part of the protection system, and current-based monitoring can be performed remotely from the motor control center. However, robots operate under various operating and load conditions in the industry, resulting in nonstationary current signals. Several CM technologies have been developed to detect faults in the nonstationary system. Fault diagnosis methods typically include model-based methods and data-driven methods. Data-driven models are more suitable for diagnosing systems with complex signals, because unlike model-based methods, they do not require signal patterns and error mechanisms []. Machine learning (ML) algorithms, such as artificial neural networks (ANNs), support vector machines (SVMs), principal component analysis (PCA), and K-nearest neighbor (KNN), have been applied to data-driven methods [,,]. However, ML algorithms rely on human-engineered features to train, which can lead to inaccurate classification results []. Research on deep learning (DL)-based methods is actively being conducted to achieve much better accuracy under various operating conditions. Because DL is based on distributed feature learning, it can automatically learn correlations and characteristics among data, eliminating the need for feature engineering processes. However, autonomous learning of these features requires massive amounts of data.
Recently, many studies have been actively carried out to overcome the problem of data imbalance and data shortage through data augmentation. In real-world image augmentation techniques, image classification through DL includes image enlargement and reduction, rotation, and flipping [,,]. For the time series, data augmentation techniques include window warping, magnitude warping, cropping, generative adversarial networks (GANs), and seasonal and trend decomposition using loess (STL) [,,,]. However, in the data augmentation methods mentioned above, the augmented data highly depend on specific input data and may not reflect the physical meaning of the original data. In previous studies, deep learning has provided excellent prediction and classification performance. However, it is impossible to interpret the model’s internal parameters for the results. If model analysis and explanation are not possible, it is difficult to convince engineers and other stakeholders, because it is challenging to present a rationale for the premaintenance and management schedule through system condition monitoring []. In this study, the challenges of data imbalance and shortage are addressed by enhancing the model’s reliability using a minimal number of samples through the area-metric-based sampling method. This approach involves a systematic sampling technique based on geometric characteristics within the gear mechanism. This method ensures that the augmented data maintain the physical essence of the original data, allowing for a more accurate representation. Subsequently, a specialized fault diagnosis approach using convolutional neural networks (CNNs) tailored for nonstationary systems is applied. Notably, this strategy not only improves the model’s interpretability but also facilitates a clear explanation of the results using explainable artificial intelligence (XAI).
The remainder of this paper is structured as follows: Section 2 introduces prior studies on ML- and DL-based fault diagnosis, data imbalance and data shortage problems, and XAI. Section 3 introduces the proposed intelligent fault diagnosis of the robotic strain wave gear reducer, while Section 4 describes the experiment in detail. Section 5 discusses robotic strain wave gear reducer fault diagnosis to verify the proposed method and interpret the results as XAI, and Section 6 then concludes this work.

3. Proposed Methodology

In this section, the theoretical background and explanation of the proposed method are described. Figure 1 shows the flow of the system fault diagnosis method using area-metric-based sampling. First, area-metric-based data sampling identifies locations in datasets that have a potentially high degree of surprise. Second, the sampled data are converted into a continuous wavelet transform (CWT)-based scalogram image containing time–frequency information. Finally, the converted image is trained on an improved CNN model to proceed with classification. The basis for the judgment in each model layer is expressed in a heat map format using XAI.
Figure 1. Overall process of the proposed fault diagnosis method.
The choice of the CWT for the data conversion arose from its effectiveness in analyzing signals in both the time and frequency domains simultaneously. Unlike traditional Fourier-based approaches, CWT provides a time-scale representation, making it suitable for the analysis of nonstationary signals. This representation enables the capture of localized features at various scales, a critical aspect for detecting faults in dynamic systems. By utilizing CWT, essential time–frequency characteristics of the signal are preserved, thereby enhancing the representational power for subsequent analysis.
CNNs were employed for fault diagnosis because of their exceptional ability to automatically learn and extract hierarchical features from input data. CNNs excel in pattern recognition tasks, making them apt for our fault diagnosis objectives. The hierarchical feature learning in CNNs, through multiple convolutional layers, enables the model to identify intricate patterns and representations in the transformed data obtained through CWT. CNNs have proven to be highly successful in various signal processing and image recognition tasks, making them a suitable choice for our fault diagnosis application. The integration of CWT for data transformation and CNN for fault diagnosis represents a synergistic approach that leverages the strengths of each method. CWT improves the data representation by extracting meaningful features in the time–frequency domains, and CNN efficiently learns and processes these features to classify and diagnose faults. This combination ensures that the diagnostic model can effectively recognize fault patterns in the transformed data, contributing to accurate and reliable fault diagnosis in nonstationary systems.

3.1. Area Metric

The area metric is a validation metric that performs quantitative comparisons mainly by calculating the area difference between the cumulative distribution function (CDF) graph of simulation prediction, and the CDF of the actual experimental results []. As the graph area difference is small, it can be expected that the simulation prediction result is similar to the actual data []. The CDF calculation for the comparison target is required to perform the area metric. The CDF of the random variable ( X ) is defined as follows:
F X x = P X x ,   f o r   a l l   x R
As shown in Figure 2, the area metric ( M ) is represented as the area between the experimental results CDF ( F a ( x ) ) and the simulation prediction results ( F s ( x ) ) [].
M = 1 μ ¯ a F s x F a ( x ) d x
where μ ¯ a is the absolute mean of the actual data, and M is normalized by μ ¯ a . In this study, an area-metric-based approach was employed for data scanning and analysis. The computed area-based metrics provide a quantifiable measure of the dissimilarity between the experimental and simulated CDFs, aiding in assessing the accuracy of the simulation predictions. A smaller area metric indicates a closer match between the experimental and simulated data distributions, signifying a more reliable simulation. Conversely, a larger area metric suggests discrepancies between the experimental and simulated data distributions, indicating areas for potential improvement in the simulation model.
Figure 2. Illustration depicting the computation of the area-based metrics.

Area-Metric-Based Sampling Method

Step 1. Data resampling
Data acquisition (DAQ) collects motor current data for each axis of the robot operating under various operating conditions. Since the data length is different for each operating condition, the data length is set to be the same through resampling. The resampled data ( X R E ), measured through the experiment, is defined as follows:
X R E = { x i j : x i j R n × m }
The data can be represented by an n × m matrix ( i = 1 ,   2 ,   , n   ,     j = 1 ,   2 ,   , m ). Each row x i R n × 1 is the resampled data from the DAQ, and each column x j   R 1 × m is the number of different operating conditions.
Step 2. Data segmentation and area metric calculation
In this study, the empirical cumulative distributive function (ECDF) is derived for the area metric calculation. Based on the resampled data, a reference ECDF, F R E x , which can indicate the trend of the entire data, is derived from the entire scenario data for each operating condition. The reference ECDF is defined as follows, where I represents an indicator function:
F R E x = 1 m i = 1 m I x i x
As shown in Figure 3a, the data are segmented into a specific window size ( k ) to find the abnormal part of the data. Thus, the segmented data ( X S E ) are defined as:
X S E = { x i j : x i j R k × l } ,   i = 1 ,   2 ,   ,   k ,   j = 1 ,   2 ,   ,   l
Figure 3. (a) Segmentation of data into specific window sizes ( k ) to identify anomalous segments within the dataset. (b) Representation of the reference CDF and the segmented data CDF.
Figure 3b shows the representation of the reference ECDF and the segmented data ECDF, offering a graphical understanding of the comparison crucial for the subsequent analysis and decision making.
Where, X S E is represented by a k × l matrix. Each row, x i R k × 1 , denotes each segmented data divided into k lengths, while each column, x j R 1 × l , denotes the number of segmented data.
Each segmented data are converted into an ECDF. The ECDF of the segmented data, F S E x , is defined as:
F S E x = 1 l k = 1 l I x k x
The area metrics ( M R E S E ) can be determined by calculating the area difference between the ECDFs of the reference and the segmentation data:
M R E S E = 1 μ ¯ S E F R E x F S E ( x ) d x
Step 3. Data sampling
The data are sequentially scanned for each operating condition, and the area metrics of the reference ECDF and the segmented data ECDF are obtained. The difference in the area metrics between the reference ECDF and the segmented data ECDF is calculated to identify the segments of data that significantly deviate from the overall data trend. Figure 4 illustrates a schematic of the area-metric-based sampling.
Figure 4. The area-metric-based sampling method.
Step 4. Time–frequency imaging
Unlike other time–frequency analysis methods, a wavelet transform provides multiresolution to facilitate dynamic analysis in the time–frequency domain. The wavelet transform is a method of decomposing a signal into different frequency bands through a wavelet function in the form of a wave []. The wavelet transform is performed using the dot product of the original signal and a set of wavelets. The set of wavelets, called analyzing wavelet or continuous wavelet, is a wavelet family realized through translating and scaling the basic wavelet, ψ ( t ) . The continuous wavelet can be described as:
ψ a , b ( t ) = a 1 / 2   ψ ( t b a )
where a is the scale parameter, and b is the orientation parameter.
The wavelet transform has high-frequency and low-time resolution in the low-frequency band []. Still, it has high-time resolution and low-frequency resolution in the high-frequency band []. The CWT of the signal x ( t ) can be conducted through a convolution operation between x ( t ) and the complex conjugate of the wavelet family. If a is continuous variables, the CWT of the signal with finite energy ( W x a , b ) is mathematically defined as []:
W x a , b = x t , ψ a , b ( t ) = a 1 / 2 x t   ψ * t b a d t
where   ψ * is the complex conjugate of ψ . The absolute value of the CWT of a signal is the scalogram. The CWT of the signal can be represented as an image via a scalogram. The energy of x ( t ) at scale a   ( S a ) is as follows:
S a = W x a , b = W x a , b 2 d b 1 2
Figure 5 shows the conversion of a sample extracted by area metric data sampling to a CWT-scalogram:
Figure 5. Time–frequency imaging based on CWT.

3.2. Dilated CNN

To solve complex problems and improve accuracy, the CNN structure is deepened by stacking layers, and the number of hyperparameters and parameters increases accordingly. Sometimes, this feature causes fatal drawbacks. As the network layer increases, backpropagation of the gradient suffers from vanishing gradient problems []. This may result in a sharp drop or saturation in the network performance. The issues mentioned above can be mitigated through dilated CNN. As shown in Figure 6, the dilated CNN is performed using dilated convolution kernels, unlike the traditional CNN, which uses traditional convolutional kernels.
Figure 6. Deep learning model framework: (a) CNN model; (b) dilated-CNN.
Let P 0 ,   P 1 ,   ,   P n 1 :   Z 2 R be discrete functions, and ω 0 ,   ω 1 ,   ,   ω n 2 :   Ω R be discrete filter with size 3 × 3 . For the application of filters with exponentially increasing dilation, this can be expressed as:
P k + 1 = P k 2 k   ω 0   f o r   k = 1 , 2 , , n 2
The receptive field of an element β , a component of P k + 1 , is defined as a set of elements existing in P 0 that can estimate the value of P k + 1 ( β ). The size of the receptive field of each component in P k + 1 is ( 2 k + 2 1 ) ( 2 k + 2 1 ) . Figure 7 shows the convolutional kernel according to the dilation rate. In Figure 7a, P 1 is made from P 0 by a one-dilated convolution in which every element in P 1 has a receptive field of size 3 × 3. (Figure 7b) P 2 is made from P 1 by a two-dilated convolution, in which every element in P 2 has a receptive field of size 7 × 7. (Figure 7c) P 3 is made from P 2 by a two-dilated convolution in which every element in P 2 has a receptive field of size 15 × 15. As the dilation rate increases, the receptive field becomes more expansive, but the number of parameters remains the same. More information can be accommodated through this dilated convolution kernel without increasing the amount of computation.
Figure 7. (a) Receptive field change according to the dilation rate (b) Two-dilated convolution with a receptive field of size 7 × 7 (c) Two-dilated convolution with a receptive field of size 15 × 15 [].

3.3. Grad-CAM

CAM is applicable only to CNN models based on global average pooling (GAP) architecture, but Grad-CAM is a model that can be applied to CNN models of other structures. Grad-CAM obtains a class activation map by calculating the gradient of the class score ( y c ), for class, with respect to feature maps ( A k ) of the convolutional layer. Then, a global average pooling on the gradient flowing back can be performed:
a k c = 1 Z   i j y c A i j k
where i ,   j represent each pixel of the feature map, A k . H and W are rows and columns, and Z represents the number of pixels of the A k . Weight a k c captures the importance of feature map A k . Based on Equation (13), the rectified linear unit (ReLU) can be derived by removing negative values []:
L G r a d C A M c = R e L U   k a k c A k

4. Case Study

4.1. Data Acquisition System

This study introduces the methodologies introduced in Section 3 for robotic strain wave gear reducer fault diagnosis (the Robostar R004 welding robot with 6 degrees of freedom). Each axis of the robot is controlled by repeating acceleration/deceleration through a servomotor. As shown in Figure 8, the robotic strain wave gear reducer has two states: normal and faulty. The fault type is the bearing inner ring defect, and the failure is applied to axis 3. Table 3 shows detailed information on the robot:
Figure 8. Target robot Robostar R004 and fault types.
Table 3. Description of the robot.

4.2. Experimental Set-Up

In this study, we analyze normal and fault motor current data in the 3rd axis to which faults are applied. The experiment was performed, as shown in Figure 9. The current data collection system is as follows. The robot operation cycle was divided using the programmable logic controller (PLC). The PLC provides trigger signals at the start and end of the robot motion, and stores them in the PC. All axes of the robot are equipped with WCS2720 sensors, which are based on the Hall effect. The sensor signals are collected through the DAQ module, and the sampling rate was fs = 2048 Hz.
Figure 9. Experimental set-up of target robot: (a) experimental set-up; (b) car body welding; (c) welding points of car door.
Target robots are widely used for body welding. Table 4 shows detailed information on the robot’s operating conditions. A total of six working conditions (Operation 1_Normal, Operation 1_Fault, Operation 2_Normal, Operation 2_Fault, Operation 3_Normal, and Operation 3_Fault) used for car body welding were set. The robot operates under the set operating conditions. The robot repeats the welding motion in the sequence of car door points A–B–C–D, as shown in Figure 9c. After welding each point (A, B, C, and D), the robot moves to the next point (e.g., A → B, B → C, C → D). First, the robot completes the A-point welding of the car door in Operation 1. After that, it moves to point B at 40% of the maximum operating speed. In Operation 2, after the welding is completed at point B, it moves to point C at a rate of 20% of the maximum operating speed. In Operation 3, when it is completed at point C, it moves to point D at the maximum operating speed of 60%, and completes welding at point D. Figure 10 shows the motor current signal according to the robot’s operating conditions.
Table 4. Operating condition specifications.
Figure 10. Various operation conditions: (a) normal and fault states of Operation 1; (b) normal and fault state of Operation condition 2; (c) normal and fault states of Operation 3.
In this study, 500 samples of length (28 × 28) were extracted using the previously proposed area metric sampling method under each of six operating and health conditions, as summarized in Table 5. Afterward, the extracted samples were converted into CWT-based scalogram images, and classified through a deep learning model. Table 6 shows examples sampled of converted CWT-scalogram images in the experimental dataset using the proposed method.
Table 5. Description of the health condition in the experimental dataset.
Table 6. Examples of the converted CWT-scalogram images in the experimental dataset.

5. Results and Discussion

5.1. Evaluation of the Proposed Method

The proposed methodology is verified by comparison with previous studies. Table 7 is a previous study to be compared and includes raw signal input, grayscale image input, and STFT-spectrogram input.
Table 7. Description of the comparison target.
The performance of the proposed method was evaluated using accuracy among the four evaluation indicators (precision, recall, F1-score, and accuracy) shown in Table 8. Precision is the proportion of actual true data among the data judged to be true by the model. Conversely, recall is the proportion of data judged to be true by the model out of the actual true data. The F1-score is the harmonic mean value of the precision and recall. Accuracy is the proportion of data that the model correctly predicted out of the total data.
Table 8. Evaluation indicators.
In Table 8, T P denotes the true positive, F P is the false positive, T N represents the true negative, and F N represents the false negative. Dilated CNN and traditional CNN models have an adaptive moment algorithm optimizer with a batch size of 32 set as the optimization algorithm for fast convergence. The execution rate was set to 0.001. For both models, a total of 100 training epochs were performed. Of the total data, the training data and test data were divided into 70 and 30%. Figure 11 shows a structural diagram of the CNN model. The proposed dilated CNN and the traditional CNN model had the same structure, but, in the case of the proposed dilated CNN model, a dilation rate = 2 was applied in the convolutional layer.
Figure 11. Structure of the CNN model.
Table 9 summarizes the performance comparisons of the various methods. In the traditional CNN model, the raw signal input shows a classification accuracy of 48.89% when sampled with random sampling and 83.44% with area-metric-based sampling. The grayscale image input shows a classification accuracy of 58.78% when sampled with random sampling and 87.67% with area-metric-based sampling. The STFT-spectrogram input shows a classification accuracy of 68.89% when sampling with random sampling and 91.78% classification accuracy when sampling with area-metric-based sampling. The CWT-scalogram input shows an accuracy of 87.78% when sampling with random sampling. When sampling with area-metric-based sampling, the classification accuracy is 92.11%. When training with dilated-CNN, the classification accuracy for CWT-scalogram input is 88.33% for random sampling and 95.44% for area-metric sampling, confirming that the classification accuracy improved to 0.55 and 3.33%, respectively. The proposed method shows the highest accuracy among the comparison group. This means that the classification performance is the best.
Table 9. The results of the various methods.
Figure 12 shows the confusion matrix of the dilated-CNN using the CWT-scalogram results for area-metric-based sampling and random sampling. Figure 13 shows the confusion matrix of the CNN using the CWT-scalogram results for the area-metric-based sampling and random sampling. Figure 14 shows the confusion matrix of the CNN using the STFT-spectrogram results for the area-metric-based sampling and random sampling. Figure 15 shows the confusion matrix of the CNN using the grayscale image results for area-metric-based sampling and random sampling. Figure 16 shows the confusion matrix of the CNN using 1D raw signal results for area-metric-based sampling and random sampling.
Figure 12. Testing confusion matrix of the dilated-CNN using CWT-scalogram: (a) area-metric-based sampling method confusion matrix (95.44% accuracy); (b) random sampling method confusion matrix (88.33% accuracy).
Figure 13. Testing the confusion matrix of the CNN using CWT-scalogram: (a) area-metric-based sampling method confusion matrix (92.11% accuracy); (b) random sampling method confusion matrix (87.78% accuracy).
Figure 14. Testing confusion matrix of the CNN using STFT-spectrogram: (a) area-metric-based sampling method confusion matrix (91.76% accuracy); (b) random sampling method confusion matrix (68.89% accuracy).
Figure 15. Testing confusion matrix of the CNN using grayscale image: (a) area-metric-based sampling method confusion matrix (87.67% accuracy); (b) random sampling method confusion matrix (58.78% accuracy).
Figure 16. Testing confusion matrix of the CNN using grayscale image: (a) area-metric-based sampling method confusion matrix (83.44% accuracy); (b) random sampling method confusion matrix (48.89% accuracy).

5.2. Presenting the Basis for Judgment: Based on Explainable Artificial Intelligence

Table 10 shows the results of the attention map for each state of the proposed method as a heat map based on Grad-CAM. Since the robot is controlled by repeating acceleration and deceleration instantaneously, the frequency changes over time. A CWT-based scalogram representing the change in frequency over time of nonstationary data is used for model training. The model is trained by weighting the time-varying frequency components that have physical meaning in the input image and deriving a result.
Table 10. Attention map of the proposed method for each condition.
Compared to the traditional CNN model, the dilated-CNN derives a classification accuracy improvement of up to 3.33%. Using Grad-CAM, the basis for judgment in each layer is visualized through a heat map to present the basis for improving the classification accuracy. The receptive field becomes more expansive in the convolutional layer of the dilated CNN because dilated convolution kernels are used. In the heat map using Grad-CAM, it can be observed that the dilated-CNN model had more active parts than the traditional CNN model in each convolutional layer. In addition, in a grayscale image in which signals are arranged, a judgment is performed on each layer without any special criteria. On the other hand, in the STFT-spectrogram and CWT-scalogram, which are time–frequency images, each layer performs a judgment by focusing on the frequency component of the image.
Unlike a STFT-spectrogram, a CWT-scalogram provides multiresolution for the time–frequency domains. Since the CWT-scalogram contains various time–frequency information compared to the STFT-spectrogram, it provides improved classification accuracy, because there are many areas where judgment weights are placed in each layer. Table 11 is the result of visualizing the weights in each convolutional layer as a heat map using the Grad-CAM.
Table 11. The basis for judgment in each layer visualized through a heat map.

6. Conclusions

In this research, a new data sampling method was developed to solve the problems of data imbalance and shortage. In addition, we introduced the judgment basis for deep learning-based defect diagnosis using XAI. The proposed method was verified using data of nonstationary current signals from industrial robots operating under various operating conditions. The verification results are summarized as follow.
First, the data were extracted using area-metric-based sampling. The data were scanned sequentially for each operating condition, and the area metrics of the reference ECDF and segmented data ECDF were obtained. Second, The sampled data were transformed into CWT–scalogram images containing time–frequency information. A dilated CNN using dilated convolution kernels was introduced to increase the receptive field, which made it possible to accommodate a lot of information without increasing the number of computations and number of hyperparameters. Third, the proposed method was verified by performing robotic strain wave gear reducer fault diagnosis. The test classification accuracy, precision, recall, and F1-score were used as evaluation indices. As a result, the proposed method showed the highest accuracy among the comparison groups at 95.44%, which means that the classification performance is the best. For all comparison groups, area-metric-based sampling showed a higher test classification accuracy by at least 4.33% and up to 34.55% than the conventional random sampling. This indicates that the proposed area-based sampling can overcome the limitations of traditional sampling, and solve the problem of data imbalance and shortage. In addition, the CWT–scalogram showed a minimum of 0.33% and a maximum of 38.89% higher test classification accuracy compared to other input data under the same comparison criteria. This indicates that the CWT-scalogram has the least data loss compared to other data analysis methods. When a dilated CNN was used instead of the traditional CNN as a classification model, the test classification accuracy improved by at least 0.55% and, at most, 3.33%. Through Grad-CAM, the basis for judgment in the convolutional layer of the two models was presented using a heat map. Compared to the traditional CNN, the dilated CNN had more active parts, indicating that the dilated CNN was based on more feature information than the traditional CNN. The proposed method can derive high diagnostic accuracy even with a small amount of data, so it can be applied to industries where it is difficult to secure sufficient quality data. In addition, by presenting the basis for the diagnosis result, the reliability of the diagnosis model can be increased, and it can be widely applied to actual industrial sites. Future work will involve further exploring the adaptability and robustness of the proposed method across different industrial domains and expanding the dataset to encompass a broader range of operating conditions for a comprehensive validation of the approach. Additionally, investigating ensemble techniques and exploring real-time implementation possibilities will be essential steps to enhance the method’s practical applicability.

Author Contributions

Conceptualization, S.-K.C.; content preparation, Y.R.N. and S.K.; writing—original draft preparation, Y.R.N. and H.S.K.; writing—review and editing, S.K., H.S.K. and S.-K.C.; supervision, H.S.K. and S.-K.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the MOTIE (Ministry of Trade, Industry, and Energy) in Korea, under the Fostering Global Talents for Innovative Growth Program (P0017307) supervised by the Korea Institute for Advancement of Technology (KIAT).

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Jing, L.; Zhao, M.; Li, P.; Xu, X. A Convolutional Neural Network Based Feature Learning and Fault Diagnosis Method for the Condition Monitoring of Gearbox. Measurement 2017, 111, 1–10. [Google Scholar] [CrossRef]
  2. Zhou, W.; Lu, B.; Habetler, T.G.; Harley, R.G. Incipient Bearing Fault Detection via Motor Stator Current Noise Cancellation Using Wiener Filter. IEEE Trans. Ind. Appl. 2009, 45, 1309–1317. [Google Scholar] [CrossRef]
  3. Huang, K.; Wu, S.; Li, Y.; Yang, C.; Gui, W. A Multi-Rate Sampling Data Fusion Method for Fault Diagnosis and Its Industrial Applications. J. Process Control 2021, 104, 54–61. [Google Scholar] [CrossRef]
  4. Kankar, P.K.; Sharma, S.C.; Harsha, S.P. Fault Diagnosis of Ball Bearings Using Machine Learning Methods. Expert Syst. Appl. 2011, 38, 1876–1886. [Google Scholar] [CrossRef]
  5. Samanta, S.; Bera, J.N.; Sarkar, G. KNN Based Fault Diagnosis System for Induction Motor. In Proceedings of the 2016 2nd International Conference on Control, Instrumentation, Energy & Communication (CIEC), Kolkata, India, 28–30 January 2016; pp. 304–308. [Google Scholar]
  6. Widodo, A.; Kim, E.Y.; Son, J.-D.; Yang, B.-S.; Tan, A.C.C.; Gu, D.-S.; Choi, B.-K.; Mathew, J. Fault Diagnosis of Low Speed Bearing Based on Relevance Vector Machine and Support Vector Machine. Expert Syst. Appl. 2009, 36, 7252–7261. [Google Scholar] [CrossRef]
  7. Zhang, S.; Zhang, S.; Wang, B.; Habetler, T.G. Deep Learning Algorithms for Bearing Fault Diagnostics—A Comprehensive Review. IEEE Access 2020, 8, 29857–29881. [Google Scholar] [CrossRef]
  8. Khan, A.; Hwang, H.; Kim, H.S. Synthetic Data Augmentation and Deep Learning for the Fault Diagnosis of Rotating Machines. Mathematics 2021, 9, 2336. [Google Scholar] [CrossRef]
  9. Mikołajczyk, A.; Grochowski, M. Data Augmentation for Improving Deep Learning in Image Classification Problem. In Proceedings of the 2018 International Interdisciplinary PhD Workshop (IIPhDW), Świnouście, Poland, 9–12 May 2018; pp. 117–122. [Google Scholar]
  10. Wang, J.; Mall, S.; Perez, L. The Effectiveness of Data Augmentation in Image Classification Using Deep Learning. arXiv 2017, arXiv:1712.04621. [Google Scholar]
  11. Guennec, A.L.; Malinowski, S.; Tavenard, R. Data Augmentation for Time Series Classification Using Convolutional Neural Networks. In ECML/PKDD Workshop on Advanced Analytics and Learning on Temporal Data; Springer: Berlin/Heidelberg, Germany, 2016. [Google Scholar]
  12. Um, T.T.; Pfister, F.M.J.; Pichler, D.; Endo, S.; Lang, M.; Hirche, S.; Fietzek, U.; Kulić, D. Data Augmentation of Wearable Sensor Data for Parkinson’s Disease Monitoring Using Convolutional Neural Networks. In Proceedings of the 19th ACM International Conference on Multimodal Interaction, Glasgow, UK, 13–17 November 2017; Association for Computing Machinery: New York, NY, USA, 2017; pp. 216–220. [Google Scholar]
  13. Li, Y.; Hu, H.; Zhou, G. Using Data Augmentation in Continuous Authentication on Smartphones. IEEE Internet Things J. 2019, 6, 628–640. [Google Scholar] [CrossRef]
  14. Frid-Adar, M.; Klang, E.; Amitai, M.; Goldberger, J.; Greenspan, H. Synthetic Data Augmentation Using GAN for Improved Liver Lesion Classification. In Proceedings of the 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018), Washington, DC, USA, 4–7 April 2018; pp. 289–293. [Google Scholar]
  15. Chen, H.-Y.; Lee, C.-H. Vibration Signals Analysis by Explainable Artificial Intelligence (XAI) Approach: Application on Bearing Faults Diagnosis. IEEE Access 2020, 8, 134246–134256. Available online: https://ieeexplore.ieee.org/abstract/document/9131692 (accessed on 2 November 2022). [CrossRef]
  16. Xie, T.; Huang, X.; Choi, S.-K. Intelligent Mechanical Fault Diagnosis Using Multisensor Fusion and Convolution Neural Network. IEEE Trans. Ind. Inform. 2022, 18, 3213–3223. [Google Scholar] [CrossRef]
  17. Hoang, D.T.; Kang, H.J. A Motor Current Signal-Based Bearing Fault Diagnosis Using Deep Learning and Information Fusion. IEEE Trans. Instrum. Meas. 2020, 69, 3325–3333. [Google Scholar] [CrossRef]
  18. Janssens, O.; Slavkovikj, V.; Vervisch, B.; Stockman, K.; Loccufier, M.; Verstockt, S.; Van de Walle, R.; Van Hoecke, S. Convolutional Neural Network Based Fault Detection for Rotating Machinery. J. Sound Vib. 2016, 377, 331–345. [Google Scholar] [CrossRef]
  19. Park, S.; Song, J.; Kim, H.S.; Ryu, D. Non-Contact Detection of Delamination in Composite Laminates Coated with a Mechanoluminescent Sensor Using Convolutional AutoEncoder. Mathematics 2022, 10, 4254. [Google Scholar] [CrossRef]
  20. Saufi, S.R.; Ahmad, Z.A.B.; Leong, M.S.; Lim, M.H. Gearbox Fault Diagnosis Using a Deep Learning Model with Limited Data Sample. IEEE Trans. Ind. Inform. 2020, 16, 6263–6271. [Google Scholar] [CrossRef]
  21. Zhong, C.; Yan, K.; Dai, Y.; Jin, N.; Lou, B. Energy Efficiency Solutions for Buildings: Automated Fault Diagnosis of Air Handling Units Using Generative Adversarial Networks. Energies 2019, 12, 527. [Google Scholar] [CrossRef]
  22. Kumar, P.; Hati, A.S. Dilated Convolutional Neural Network Based Model for Bearing Faults and Broken Rotor Bar Detection in Squirrel Cage Induction Motors. Expert Syst. Appl. 2022, 191, 116290. [Google Scholar] [CrossRef]
  23. Yang, D.; Liu, Y.; Li, S.; Li, X.; Ma, L. Gear Fault Diagnosis Based on Support Vector Machine Optimized by Artificial Bee Colony Algorithm. Mech. Mach. Theory 2015, 90, 219–229. [Google Scholar] [CrossRef]
  24. Qian, L.; Pan, Q.; Lv, Y.; Zhao, X. Fault Detection of Bearing by Resnet Classifier with Model-Based Data Augmentation. Machines 2022, 10, 521. [Google Scholar] [CrossRef]
  25. Hu, T.; Tang, T.; Lin, R.; Chen, M.; Han, S.; Wu, J. A Simple Data Augmentation Algorithm and a Self-Adaptive Convolutional Architecture for Few-Shot Fault Diagnosis under Different Working Conditions. Measurement 2020, 156, 107539. [Google Scholar] [CrossRef]
  26. Liu, J.; Qu, F.; Hong, X.; Zhang, H. A Small-Sample Wind Turbine Fault Detection Method with Synthetic Fault Data Using Generative Adversarial Nets. IEEE Trans. Ind. Inform. 2019, 15, 3877–3888. [Google Scholar] [CrossRef]
  27. Liu, Y.; Jiang, H.; Liu, C.; Yang, W.; Sun, W. Data-Augmented Wavelet Capsule Generative Adversarial Network for Rolling Bearing Fault Diagnosis. Knowl. Based Syst. 2022, 252, 109439. [Google Scholar] [CrossRef]
  28. Li, Z.; Zheng, T.; Wang, Y.; Cao, Z.; Guo, Z.; Fu, H. A Novel Method for Imbalanced Fault Diagnosis of Rotating Machinery Based on Generative Adversarial Networks. IEEE Trans. Instrum. Meas. 2021, 70, 3500417. [Google Scholar] [CrossRef]
  29. Wang, Y.; Sun, G.; Jin, Q. Imbalanced Sample Fault Diagnosis of Rotating Machinery Using Conditional Variational Auto-Encoder Generative Adversarial Network. Appl. Soft Comput. 2020, 92, 106333. [Google Scholar] [CrossRef]
  30. Sairam, S.; Seshadhri, S.; Marafioti, G.; Srinivasan, S.; Mathisen, G.; Bekiroglu, K. Edge-Based Explainable Fault Detection Systems for Photovoltaic Panels on Edge Nodes. Renew. Energy 2022, 185, 1425–1440. [Google Scholar] [CrossRef]
  31. Srinivasan, S.; Arjunan, P.; Jin, B.; Sangiovanni-Vincentelli, A.L.; Sultan, Z.; Poolla, K. Explainable AI for Chiller Fault-Detection Systems: Gaining Human Trust. Computer 2021, 54, 60–68. [Google Scholar] [CrossRef]
  32. Morichetta, A.; Casas, P.; Mellia, M. EXPLAIN-IT: Towards Explainable AI for Unsupervised Network Traffic Analysis. In Proceedings of the 3rd ACM CoNEXT Workshop on Big DAta, Machine Learning and Artificial Intelligence for Data Communication Networks, Orlando, FL, USA, 9 December 2019; Association for Computing Machinery: New York, NY, USA, 2019; pp. 22–28. [Google Scholar]
  33. Shrikumar, A.; Greenside, P.; Kundaje, A. Learning Important Features Through Propagating Activation Differences. In Proceedings of the 34th International Conference on Machine Learning, Sydney, Australia, 6–11 August 2017. [Google Scholar]
  34. Selvaraju, R.R.; Cogswell, M.; Das, A.; Vedantam, R.; Parikh, D.; Batra, D. Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization. arXiv 2016, arXiv:1610.02391. [Google Scholar]
  35. Hwang, C.; Lee, T. E-SFD: Explainable Sensor Fault Detection in the ICS Anomaly Detection System. IEEE Access 2021, 9, 140470–140486. [Google Scholar] [CrossRef]
  36. Hu, J.; Zhou, Q.; McKeand, A.; Xie, T.; Choi, S.-K. A Model Validation Framework Based on Parameter Calibration under Aleatory and Epistemic Uncertainty. Struct. Multidiscip. Optim. 2021, 63, 645–660. [Google Scholar] [CrossRef]
  37. Lee, D.; Kim, N.H.; Kim, H.-S. Validation and Updating in a Large Automotive Vibro-Acoustic Model Using a P-Box in the Frequency Domain. Struct. Multidiscip. Optim. 2016, 54, 1485–1508. [Google Scholar] [CrossRef]
  38. Strang, G.; Nguyen, T. Wavelets and Filter Banks; SIAM: Bangkok, Thailand, 1996; ISBN 978-0-9614088-7-9. [Google Scholar]
  39. Barros, J.; Diego, R.I. Analysis of Harmonics in Power Systems Using the Wavelet Packet Transform. In Proceedings of the 2005 IEEE Instrumentationand Measurement Technology Conference Proceedings, Ottawa, ON, Canada, 16–19 May 2005; Volume 2, pp. 1484–1488. [Google Scholar]
  40. Liang, P.; Deng, C.; Wu, J.; Yang, Z.; Zhu, J.; Zhang, Z. Single and Simultaneous Fault Diagnosis of Gearbox via a Semi-Supervised and High-Accuracy Adversarial Learning Framework. Knowl. Based Syst. 2020, 198, 105895. [Google Scholar] [CrossRef]
  41. Li, Y.; Zhang, X.; Chen, D. CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 19–21 June 2018; pp. 1091–1100. [Google Scholar]
  42. Yu, F.; Koltun, V. Multi-Scale Context Aggregation by Dilated Convolutions. arXiv 2016, arXiv:1511.07122. [Google Scholar]
  43. Gozzi, N.; Malandri, L.; Mercorio, F.; Pedrocchi, A. XAI for Myo-Controlled Prosthesis: Explaining EMG Data for Hand Gesture Classification. Knowl. Based Syst. 2022, 240, 108053. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.