Time-Frequency Fusion Features-Based GSWOA-KELM Model for Gear Fault Diagnosis

Hu, Qin; Zhou, Haiting; Wang, Chengcheng; Zhu, Chenxi; Shen, Jiaping; He, Peng

doi:10.3390/lubricants12010010

by Qin Hu ¹, Haiting Zhou ¹,*, Chengcheng Wang ², Chenxi Zhu ¹, Jiaping Shen ¹ and Peng He ¹

¹ School of Quality and Safety Engineering, China Jiliang University, Hangzhou 310018, China

² Instrumental Technol & Econ Inst, Beijing 100032, China

* Author to whom correspondence should be addressed.

Lubricants 2024, 12(1), 10; https://doi.org/10.3390/lubricants12010010

Submission received: 5 December 2023 / Revised: 27 December 2023 / Accepted: 28 December 2023 / Published: 29 December 2023

(This article belongs to the Special Issue Tribology and Machine Learning: New Perspectives and Challenges)

Abstract

To improve the accuracy of gear fault diagnosis and to overcome the low diagnostic accuracy caused by manual parameter selection, a diagnostic model is proposed that combines time-frequency fusion features with a kernel extreme learning machine (KELM) whose parameters are optimized by the improved global search whale optimization algorithm (GSWOA). First, the time-domain and frequency-domain features of the gear fault state are extracted separately, and feature vectors are constructed through feature fusion, which overcomes the limitations of single features. Second, the GSWOA based on three strategies is used to optimize the regularization coefficient C and kernel function parameter γ of KELM, and a GSWOA-KELM fault diagnosis model is built to avoid the low fault diagnosis accuracy caused by the manual selection of KELM parameters. Finally, the public dataset from Southeast University is used to verify the performance of the proposed model by comparing it with the KELM, SSA-KELM, and WOA-KELM models. The experimental results demonstrate that the time-frequency fusion features-based GSWOA-KELM model shows faster convergence and stronger global search ability. Compared with the KELM, SSA-KELM, and WOA-KELM models, the diagnostic accuracy of the proposed model is higher by 11.33, 8.67, and 1.33 percentage points, respectively.

Keywords:

gear fault diagnosis; kernel extreme learning machine; global search whale optimization algorithm; feature fusion; machine learning

1. Introduction

Gears are essential components for the normal operation of rotating machinery and are widely used in industrial scenarios such as wind turbines and automotive transmissions [1,2,3]. However, gear mesh surfaces often operate under poor lubrication, as evidenced by mechanical impurities in the oil, oil film disruption, etc., which causes friction on the tooth surface and leads to surface deterioration, scuffing, and permanent deformation [4]. In addition, due to quenching, fatigue, grinding, and cyclic loading, gears are often subject to cracks and fractures [5]. A gear failure not only greatly reduces the safety and reliability of the equipment but can also cause serious production safety accidents, with considerable economic and social consequences. Therefore, it is of great importance to detect gear faults and take timely measures to ensure the safe operation of equipment.

At present, intelligent diagnosis based on machine learning is a major research trend in gear fault diagnosis [6,7,8]. Reference [9] used frequency-modulated empirical mode decomposition to extract vibration signal features and calculated the energy entropy as a feature vector, which was input into a support vector machine (SVM) to realize gear fault diagnosis. Reference [10] input the extracted vibration signal characteristic parameters into a K-nearest neighbor (KNN) fault diagnosis model, which effectively achieved the predictive maintenance of rolling bearing faults. Although traditional machine learning algorithms can complete fault diagnosis, they still have shortcomings in training speed and diagnosis accuracy. Therefore, scholars are gradually turning their attention to neural network-based algorithms. Reference [11] combined the traditional machine learning algorithm SVM with a convolutional neural network (CNN) to build a CNN-SVM fault diagnosis model, avoiding manual feature extraction and improving fault diagnosis accuracy and stability. In reference [12], a BP-AdaBoost strong classifier model for gear faults, based on the BP neural network and the AdaBoost algorithm, was proposed and verified experimentally, and the results showed that the proposed method has higher accuracy than traditional fault diagnosis methods. Reference [13] proposed a new hierarchical refined composite multiscale fluctuation dispersion entropy (HRCMFDE) feature extraction method, in which the extracted features are reduced in dimension using Relief and then input into a regularized extreme learning machine (RELM), effectively improving the practicality and versatility of model fault diagnosis.

The kernel extreme learning machine (KELM), as a novel feed-forward neural network, has higher generalization ability and stability than the BP neural network, RBF neural network, and ELM, so it has greater application advantages [14]. However, the performance of KELM depends on the regularization coefficient C and the kernel function parameter γ, and the classification accuracy of KELM is low when these two parameters are selected manually from experience [15]. To solve this problem, researchers have proposed using intelligent optimization algorithms to tune the KELM parameters and thereby improve fault diagnosis accuracy. The particle swarm optimization (PSO) algorithm [16], the sparrow search algorithm (SSA) [17], and the Harris Hawks optimization (HHO) [18] have been used to optimize the KELM model in order to reduce the errors caused by manual parameter selection. However, experimental results showed that all of these optimization algorithms have their own shortcomings [19,20]. At the same time, the whale optimization algorithm (WOA) has developed rapidly because of its powerful search capability and the small number of parameters it requires [21]. However, WOA also has some problems, such as slow convergence speed and insufficient global search capacity [22]. Therefore, an improved whale optimization algorithm is needed to solve these problems. Reference [23] proposed an improved global search whale optimization algorithm (GSWOA) based on three strategies: an adaptive weight strategy, a variable spiral position update strategy, and an optimal neighborhood perturbation strategy. Together, these improve the global search performance and convergence speed of the whale optimization algorithm.

In addition, feature extraction is an indispensable step before applying machine learning methods for fault diagnosis. Reference [24] extracted 17 time-domain features from circuit breaker vibration signals and input them into the XGBoost model to diagnose the mechanical condition of the circuit breaker. Reference [25] used sparse filtering to automatically extract the frequency-domain features of gear vibration signals and input them into a Softmax classifier as feature vectors to realize gear fault diagnosis. However, extracting features from only the time domain or only the frequency domain often represents the information in the signal insufficiently, which affects the accuracy of fault diagnosis [26].

Therefore, an improved time-frequency fusion features-based GSWOA-KELM model is proposed in this study. First, the time-domain and frequency-domain features of the gear fault state are extracted separately, and feature vectors are constructed through feature fusion, which overcomes the limitations of single features. Second, the GSWOA based on three strategies is used to optimize the regularization coefficient C and kernel function parameter γ of KELM, and a GSWOA-KELM fault diagnosis model is built to avoid the low fault diagnosis accuracy caused by the manual selection of KELM parameters. Finally, the public dataset from Southeast University is used to verify the performance of the proposed model by comparing it with the KELM, SSA-KELM, and WOA-KELM models. The experimental results demonstrate that the GSWOA-KELM model shows faster convergence and stronger global search ability. Compared with the KELM, SSA-KELM, and WOA-KELM models, the diagnostic accuracy of the proposed model is higher by 11.33, 8.67, and 1.33 percentage points, respectively.

The main innovations and contributions of this paper are as follows:

(1): This study proposes the GSWOA-KELM model for the first time. In the new model, the GSWOA is used to find the optimal parameters of the KELM, and the results show that compared with the existing model, the proposed GSWOA-KELM model has higher diagnostic accuracy, faster convergence speed, and stronger global search capability;
(2): The time-domain and frequency-domain features are extracted and fused in this study, which overcomes the limitations of single-domain features and improves the fault diagnosis ability of the model. Meanwhile, the superiority of multi-domain features in representing information ability is examined in this study, which provides a reference basis for the application of feature extraction work in other aspects.

2. Time-Frequency Features Extraction

2.1. Time-Domain Features

Since the time-domain signal of gears tends to change when they are faulty, the time-domain characteristic parameters of the gear vibration signal can be analyzed to make an effective diagnosis of the type of fault.

Dimensional time-domain characteristic parameters and dimensionless time-domain characteristic parameters are two commonly used time-domain characteristic parameters. The 13 dimensional and dimensionless time-domain characteristic parameters extracted in this study and their calculation formulas are shown in Table 1 [27].
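As a rough illustration of what such parameters look like in practice, the sketch below computes a handful of widely used dimensional statistics (mean, RMS, peak, standard deviation) and dimensionless ones (kurtosis, crest factor) for one vibration segment. Table 1 is not reproduced here, so this particular selection and the function name `time_domain_features` are assumptions for illustration and not necessarily the 13 parameters used in the study.

```python
import math

def time_domain_features(x):
    """Return a few common time-domain statistics of one vibration segment.

    Hypothetical helper: the feature set is an illustrative assumption,
    not the exact list from Table 1.
    """
    n = len(x)
    mean = sum(x) / n
    rms = math.sqrt(sum(v * v for v in x) / n)
    peak = max(abs(v) for v in x)
    var = sum((v - mean) ** 2 for v in x) / n
    std = math.sqrt(var)
    # Kurtosis: fourth central moment divided by the squared variance.
    kurtosis = sum((v - mean) ** 4 for v in x) / (n * var ** 2)
    return {
        "mean": mean,                 # dimensional
        "rms": rms,                   # dimensional
        "peak": peak,                 # dimensional
        "std": std,                   # dimensional
        "kurtosis": kurtosis,         # dimensionless
        "crest_factor": peak / rms,   # dimensionless
    }

# Toy square-like signal, not data from the paper.
segment = [0.0, 1.0, 0.0, -1.0] * 4
features = time_domain_features(segment)
```

Dimensionless parameters such as kurtosis and crest factor are ratios, so they do not change when the signal is rescaled, whereas the dimensional ones do.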

2.2. Frequency-Domain Features

Extracting and analyzing the frequency-domain characteristic parameters of the gear fault vibration signal is also one of the efficient methods for gear fault diagnosis.

Therefore, in this study, the original time-domain vibration signal is transformed into the frequency domain using Fourier transform to observe the characteristics of the vibration signal from the perspective of frequency. The conversion from time-domain to frequency-domain can be defined as:

s(n) = \sum_{k=0}^{N-1} x(k \Delta t) \, e^{-\frac{2 \pi j n k}{N}}, \quad n = 1, 2, \dots, N-1

(1)

where x(k∆t) represents the sample value; N denotes the number of sample points; ∆t indicates the sampling interval; k refers to the discrete index of the time-domain signal; and n is the index of the frequency component.

After conversion into frequency signals, the corresponding frequency-domain characteristic parameters can be calculated according to the corresponding frequency-domain statistical index formula. The five frequency-domain characteristic parameters extracted in this study and their calculation formulas are shown in Table 2 [28].
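The following sketch shows one way the transform of Equation (1) and two frequency-domain statistics could be computed. It uses a direct O(N²) discrete Fourier transform for clarity (an FFT would normally be used), and the two statistics chosen, mean spectral amplitude and frequency centroid, are assumptions for illustration because Table 2 is not reproduced here. The names `dft_magnitudes` and `frequency_domain_features` are hypothetical.

```python
import cmath
import math

def dft_magnitudes(x):
    """Single-sided amplitude spectrum of x using a direct DFT."""
    n = len(x)
    mags = []
    for m in range(n // 2 + 1):
        s = sum(x[k] * cmath.exp(-2j * math.pi * m * k / n) for k in range(n))
        mags.append(abs(s) / n)
    return mags

def frequency_domain_features(x, fs):
    """Two illustrative spectral statistics (assumed, not from Table 2)."""
    mags = dft_magnitudes(x)
    n = len(x)
    freqs = [m * fs / n for m in range(len(mags))]
    total = sum(mags)
    return {
        "mean_amplitude": total / len(mags),
        # Amplitude-weighted mean frequency of the spectrum.
        "frequency_centroid": sum(f * a for f, a in zip(freqs, mags)) / total,
    }

# Toy 8 Hz tone sampled at 64 Hz, not data from the paper.
fs = 64.0
tone = [math.sin(2 * math.pi * 8.0 * k / fs) for k in range(64)]
spec = frequency_domain_features(tone, fs)
```

For a pure tone that falls exactly on a frequency bin, almost all spectral amplitude sits in that bin, so the centroid coincides with the tone frequency.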

2.3. Fusion Features

The 13 time-domain features and 5 frequency-domain features above form the time-domain feature matrix T and the frequency-domain feature matrix F, respectively, with one row per sample. Assuming that the total number of samples is n, then:

T = \begin{bmatrix} t_{1,1} & \dots & t_{13,1} \\ \vdots & \ddots & \vdots \\ t_{1,n} & \dots & t_{13,n} \end{bmatrix}

(2)

F = \begin{bmatrix} f_{1,1} & \dots & f_{5,1} \\ \vdots & \ddots & \vdots \\ f_{1,n} & \dots & f_{5,n} \end{bmatrix}

(3)

The time-domain feature matrix T and the frequency-domain feature matrix F are then fused to form the fused feature matrix TF, which has 18 columns:

TF = \begin{bmatrix} x_{1,1} & \dots & x_{18,1} \\ \vdots & \ddots & \vdots \\ x_{1,n} & \dots & x_{18,n} \end{bmatrix}

(4)
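A minimal sketch of the fusion step is given below, assuming that fusion means concatenating the 13 time-domain and 5 frequency-domain columns of each sample, and that each feature is min-max normalized beforehand. Both the concatenation order and the normalization scheme are assumptions; the helper names `min_max_normalize` and `fuse` are hypothetical.

```python
def min_max_normalize(rows):
    """Scale each column of a row-major matrix to [0, 1]."""
    cols = list(zip(*rows))
    lo = [min(c) for c in cols]
    hi = [max(c) for c in cols]
    return [
        [(v - l) / (h - l) if h > l else 0.0 for v, l, h in zip(row, lo, hi)]
        for row in rows
    ]

def fuse(T, F):
    """Concatenate time- and frequency-domain features sample by sample."""
    if len(T) != len(F):
        raise ValueError("T and F must describe the same samples")
    return [list(t_row) + list(f_row) for t_row, f_row in zip(T, F)]

# Toy matrices with n = 4 samples, not data from the paper.
n = 4
T = [[float(i + j) for j in range(13)] for i in range(n)]
F = [[float(10 * i + j) for j in range(5)] for i in range(n)]
TF = fuse(min_max_normalize(T), min_max_normalize(F))
```

Each row of `TF` then holds the 18 fused features of one sample, matching the shape of Equation (4).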

3. GSWOA-KELM Fault Diagnosis Model

3.1. Kernel Extreme Learning Machine

The KELM is an improved algorithm developed on the basis of an extreme learning machine (ELM). It introduces a kernel function on the basis of an ELM, has better generalization performance, and has a faster learning ability [29].

The ELM is a feed-forward neural network including input, hidden, and output layers [20], and its typical neural network structure is given in Figure 1.

The mathematical expression of an ELM is as follows:

H β = T

(5)

H(w_{1}, \dots, w_{l}, b_{1}, \dots, b_{l}, x_{1}, \dots, x_{n}) = \begin{pmatrix} g(w_{1} \cdot x_{1} + b_{1}) & \dots & g(w_{l} \cdot x_{1} + b_{l}) \\ \vdots & \ddots & \vdots \\ g(w_{1} \cdot x_{n} + b_{1}) & \dots & g(w_{l} \cdot x_{n} + b_{l}) \end{pmatrix}

(6)

where H represents the output matrix of the hidden layer, β is the output weight, T denotes the target output matrix, w_{l} is the weight of the l-th neuron in the hidden layer, and b_{l} denotes the bias of the l-th neuron in the hidden layer.

The learning process of an ELM is the process of solving the output weight β, which is solved using the least squares method:

\beta_{ELM} = H^{T} (H H^{T})^{-1} T = H^{+} T

(7)

where H^{+} represents the generalized inverse matrix of H.

In KELM, the regularization coefficient C and the kernel function parameter γ are introduced to improve the performance of the model, and the kernel function matrix is expressed as:

\Omega = H H^{T}

(8)

\Omega_{ij} = h(x_{i}) \cdot h(x_{j}) = K(x_{i}, x_{j})

(9)

Then, the least squares solution for the output weight β of the KELM is:

\beta_{KELM} = H^{T} \left( \frac{I}{C} + H H^{T} \right)^{-1} T

(10)

Based on the above equations, the output function of the KELM can be expressed as:

f(x) = \begin{bmatrix} K(x, x_{1}) \\ \vdots \\ K(x, x_{n}) \end{bmatrix}^{T} \left( \Omega + \frac{I}{C} \right)^{-1} T

(11)

In addition, the radial basis function (RBF) is chosen as the kernel function in this research, whose expression is as follows:

K(x_{i}, x_{j}) = \exp \left( - \frac{\| x_{i} - x_{j} \|^{2}}{2 \gamma^{2}} \right)

(12)

where γ is the kernel parameter.
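Equations (8) to (12) can be sketched in a few lines. The example below builds Ω + I/C with the RBF kernel, solves for (Ω + I/C)⁻¹T by Gauss-Jordan elimination, and classifies a new point with Equation (11). One-hot target encoding, the arg-max decision rule, and the helper names `rbf`, `solve`, `kelm_fit`, and `kelm_predict` are assumptions for illustration; this is not the authors' implementation.

```python
import math

def rbf(a, b, gamma):
    """RBF kernel of Equation (12)."""
    d2 = sum((p - q) ** 2 for p, q in zip(a, b))
    return math.exp(-d2 / (2.0 * gamma ** 2))

def solve(A, B):
    """Solve A X = B by Gauss-Jordan elimination with partial pivoting."""
    n = len(A)
    M = [list(A[i]) + list(B[i]) for i in range(n)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        piv = M[c][c]
        M[c] = [v / piv for v in M[c]]
        for r in range(n):
            if r != c and M[r][c] != 0.0:
                f = M[r][c]
                M[r] = [vr - f * vc for vr, vc in zip(M[r], M[c])]
    return [row[n:] for row in M]

def kelm_fit(X, labels, n_classes, C, gamma):
    """Return alpha = (Omega + I/C)^{-1} T for one-hot targets T (assumed)."""
    n = len(X)
    A = [[rbf(X[i], X[j], gamma) + (1.0 / C if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    T = [[1.0 if labels[i] == c else 0.0 for c in range(n_classes)]
         for i in range(n)]
    return solve(A, T)

def kelm_predict(x, X, alpha, gamma):
    """Evaluate Equation (11) and return the class with the largest output."""
    k = [rbf(x, xi, gamma) for xi in X]
    scores = [sum(k[i] * alpha[i][c] for i in range(len(X)))
              for c in range(len(alpha[0]))]
    return max(range(len(scores)), key=scores.__getitem__)

# Toy two-class data, not data from the paper.
X = [[0.0, 0.0], [0.1, 0.0], [1.0, 1.0], [0.9, 1.0]]
y = [0, 0, 1, 1]
alpha = kelm_fit(X, y, n_classes=2, C=10.0, gamma=0.5)
pred_a = kelm_predict([0.05, 0.0], X, alpha, gamma=0.5)
pred_b = kelm_predict([1.0, 0.9], X, alpha, gamma=0.5)
```

Because the hidden-layer mapping only appears through the kernel, training reduces to one linear solve, and the only quantities left to choose are C and γ.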

3.2. Whale Optimization Algorithm

The WOA is a swarm intelligence optimization algorithm that imitates the hunting process of whales in nature, which can be divided into three stages: the encircling prey stage, the bubble-net attacking stage, and the random hunting prey stage [30,31]. In each stage, the position of the whale is updated. The process of using the whale optimization algorithm to solve the problem is to represent the position of each whale as a feasible solution and obtain the optimal solution by constantly updating the position of the whale.

During the encircling prey phase, the whale’s position update formula can be expressed as:

\begin{cases} X(t+1) = X^{*}(t) - A \cdot D \\ D = |C \cdot X^{*}(t) - X(t)| \end{cases}

(13)

where t represents the number of iterations; X(t) indicates the current position of the whale; X^{*}(t) represents the optimal whale location; D is the distance between the whale and the prey; and A and C are coefficients given by:

\begin{cases} A = 2a \cdot r_{1} - a \\ C = 2 r_{2} \\ a = 2 - 2t / t_{max} \end{cases}

(14)

During the bubble-net attacking phase, the position update of the whale can be described using two mechanisms, namely, the contraction surround mechanism and the spiral update mechanism. The mathematical expression of the spiral update mechanism is

\begin{cases} X(t+1) = X^{*}(t) + D \cdot e^{bl} \cos(2 \pi l) \\ D = |C \cdot X^{*}(t) - X(t)| \end{cases}

(15)

where l represents a random number in the range between 0 and 1, and b is a constant that reflects the shape of the helix.

It is worth noting that during the bubble-net attacking stage, the whale not only approaches the prey in a spiral shape but also shrinks the encircling circle, which is then mathematically modeled as:

X(t+1) = \begin{cases} X^{*}(t) - A \cdot |C \cdot X^{*}(t) - X(t)|, & p < 0.5 \\ X^{*}(t) + D \cdot e^{bl} \cos(2 \pi l), & p \geq 0.5 \end{cases}

(16)

where p represents a random number in the range between 0 and 1.

During the prey search phase, whales use a random search mechanism to search for prey globally. At this time, the updating method of whale position is determined by the range of A: if |A| < 1, the position is updated by spiral encircling; if |A| ≥ 1, the location is updated by random search. The mathematical model of the random search mechanism updating location is

X(t+1) = X_{rand}(t) - A \cdot |C \cdot X_{rand}(t) - X(t)|

(17)

where X_{rand}(t) denotes the position of a random whale.
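The three update rules in Equations (13) to (17) can be combined into a single step function, sketched below. It assumes that A and C are drawn as scalars once per whale, that r_1 and r_2 are uniform on [0, 1], and that the helix constant defaults to b = 1; none of these details is fixed by the text above. The function name `woa_step` is hypothetical.

```python
import math
import random

def woa_step(x, best, population, t, t_max, rng, b=1.0):
    """One WOA position update for whale x, following Equations (13)-(17)."""
    a = 2.0 - 2.0 * t / t_max            # Equation (14), decreases from 2 to 0
    A = 2.0 * a * rng.random() - a       # r_1 assumed uniform on [0, 1]
    C = 2.0 * rng.random()               # r_2 assumed uniform on [0, 1]
    p = rng.random()
    if p < 0.5:
        # |A| < 1: encircle the best whale (13); otherwise random search (17).
        ref = best if abs(A) < 1.0 else rng.choice(population)
        return [r - A * abs(C * r - xi) for r, xi in zip(ref, x)]
    # p >= 0.5: spiral update (15), with l drawn from [0, 1] as stated above.
    l = rng.random()
    return [
        bs + abs(C * bs - xi) * math.exp(b * l) * math.cos(2.0 * math.pi * l)
        for bs, xi in zip(best, x)
    ]

# Toy population in a 2-D search space, with the sphere function as fitness.
rng = random.Random(0)
population = [[rng.uniform(1, 20), rng.uniform(1, 20)] for _ in range(10)]
best = min(population, key=lambda w: sum(v * v for v in w))
moved = woa_step(population[0], best, population, t=1, t_max=60, rng=rng)
```

Since a shrinks linearly to 0, |A| ≥ 1 becomes impossible once a < 1, so the random search branch is only reachable in the first half of the run.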

3.3. Global Search Whale Optimization Algorithm

In order to improve the convergence speed and global search ability of traditional whale optimization algorithms, an improved global search whale optimization algorithm (GSWOA) is proposed based on three strategies, namely, adaptive weight strategy, variable spiral position update strategy, and optimal neighborhood perturbation strategy [32].

First, the adaptive weight strategy is to introduce an adaptive inertia weight based on the number of iterations t into the position update of the whale, and its expression is as follows:

w(t) = 0.2 \cos \left( \frac{\pi}{2} \cdot \left( 1 - \frac{t}{t_{max}} \right) \right)

(18)

where w(t) is the adaptive inertia weight, which lies within [0, 1]; t is the current iteration number; and t_{max} indicates the maximum iteration number.

According to Equation (18), in the early stage of the algorithm, the weight value is small but changes quickly; in the later stage of the algorithm, with the increase in the number of iterations, the weight is large, but the change speed slows down, thus improving the convergence of the algorithm.
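To make this behavior concrete, the short sketch below evaluates Equation (18) over a run of 60 iterations (the iteration budget used later in this study) and compares the per-iteration change at the start and at the end. The function name `adaptive_weight` is hypothetical.

```python
import math

def adaptive_weight(t, t_max):
    """Adaptive inertia weight of Equation (18)."""
    return 0.2 * math.cos(math.pi / 2.0 * (1.0 - t / t_max))

t_max = 60
weights = [adaptive_weight(t, t_max) for t in range(t_max + 1)]

# Change in the weight over the first and the last iteration.
early_change = weights[1] - weights[0]
late_change = weights[t_max] - weights[t_max - 1]
```

With this schedule the weight starts at 0 and ends at 0.2, and `early_change` is much larger than `late_change`, which matches the description of a small but fast-changing weight early on and a large but slowly changing weight later.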

The position update formula of the improved whale optimization algorithm is

X(t+1) = \begin{cases} w(t) \cdot X^{*}(t) - A \cdot |C \cdot X^{*}(t) - X(t)|, & p < 0.5 \\ w(t) \cdot X^{*}(t) + D \cdot e^{bl} \cos(2 \pi l), & p \geq 0.5 \end{cases}

(19)

X(t+1) = w(t) \cdot X_{rand}(t) - A \cdot |C \cdot X_{rand}(t) - X(t)|

(20)

Second, the variable spiral position update strategy refers to changing the constant b, which reflects the spiral shape in the bubble-net attacking stage, to a dynamically adjusted variable based on the number of iterations. The mathematical formula is as follows:

b = e^{5 \cdot \cos \left( \pi \cdot \left( 1 - \frac{t}{t_{max}} \right) \right)}

(21)

From Equation (21), it can be seen that in the early phase of the algorithm, the spiral shape range is larger, and the whale can search for optimization in a larger range and has a stronger global search ability; with the increase of the number of iterations, the spiral shape range becomes smaller, and the whale can search in a smaller range to improve the optimization accuracy.

Now, the position update formula of the improved whale optimization algorithm is

X(t+1) = w(t) \cdot X^{*}(t) + b D \cdot e^{bl} \cos(2 \pi l)

(22)
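The sketch below evaluates the variable spiral coefficient of Equation (21) and applies the update of Equation (22) to one whale. The scalar coefficient C and the random number l are passed in explicitly so the example is deterministic; the function names `spiral_coefficient` and `spiral_update` are hypothetical.

```python
import math

def spiral_coefficient(t, t_max):
    """Variable spiral coefficient b of Equation (21)."""
    return math.exp(5.0 * math.cos(math.pi * (1.0 - t / t_max)))

def spiral_update(x, best, w, b, C, l):
    """Weighted spiral position update of Equation (22)."""
    return [
        w * bs + b * abs(C * bs - xi) * math.exp(b * l) * math.cos(2.0 * math.pi * l)
        for bs, xi in zip(best, x)
    ]

t_max = 60
b_start = spiral_coefficient(0, t_max)       # exp(-5)
b_mid = spiral_coefficient(t_max // 2, t_max)  # exp(0) = 1
b_end = spiral_coefficient(t_max, t_max)     # exp(5)

# With l = 0.25 the cosine factor vanishes, leaving only w * best.
new_pos = spiral_update([3.0, 4.0], [1.0, 2.0], w=0.1, b=b_start, C=1.0, l=0.25)
```

Under Equation (21) as written, b spans roughly four orders of magnitude between the first and the last iteration, so a practical implementation would likely need to clip the updated position to the search bounds.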

Finally, the optimal neighborhood perturbation strategy is to expand the search scope of the optimal location to the vicinity of the current optimal location when the whale position is updated and search the nearby space simultaneously instead of being limited to the current optimal location. In this way, the search efficiency of the whale and the convergence speed of the algorithm can be enhanced. The mathematical expression for generating a disturbance in the neighborhood of the current optimal location and generating a new location is

\hat{X}(t) = \begin{cases} X^{*}(t) + 0.5 \cdot \mathrm{rand1} \cdot X^{*}(t), & \mathrm{rand2} < 0.5 \\ X^{*}(t), & \mathrm{rand2} \geq 0.5 \end{cases}

(23)

where rand1 and rand2 indicate uniform random numbers in the range [0, 1], and \hat{X}(t) is the generated new location.

If the generated new position is better than the original position, the new position is kept. If the generated new position is inferior to the original position, the original position is retained. The formula is expressed as:

X^{*}(t) = \begin{cases} \hat{X}(t), & f(\hat{X}(t)) < f(X^{*}(t)) \\ X^{*}(t), & f(X^{*}(t)) \leq f(\hat{X}(t)) \end{cases}

(24)

where f(x) represents the fitness value when the position is x.
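Equations (23) and (24) amount to a random perturbation followed by greedy selection, which can be sketched as follows. The example assumes that rand1 is a single scalar applied to every dimension and that a lower fitness is better; the function name `perturb_best` and the toy fitness are hypothetical.

```python
import random

def perturb_best(best, fitness, rng):
    """Perturb the current best position (23) and keep the better one (24)."""
    if rng.random() < 0.5:                      # rand2
        r1 = rng.random()                       # rand1, assumed scalar
        candidate = [v + 0.5 * r1 * v for v in best]
    else:
        candidate = list(best)
    # Greedy selection: accept the candidate only if it is strictly better.
    return candidate if fitness(candidate) < fitness(best) else list(best)

def toy_fitness(x):
    """Illustrative fitness with its minimum at (3, 3); not from the paper."""
    return sum((v - 3.0) ** 2 for v in x)

start = [2.0, 2.0]
results = [perturb_best(start, toy_fitness, random.Random(seed)) for seed in range(20)]
```

Because of the greedy selection, the fitness of the stored best position can never get worse after this step, whichever branch of Equation (23) is taken.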

The overall flow of the GSWOA is shown in Figure 2.

3.4. Kernel Extreme Learning Machine Optimized Using the Global Search Whale Optimization Algorithm

In this study, GSWOA is used to intelligently optimize the regularization coefficient C and kernel function parameter γ of KELM, and a GSWOA-KELM gear fault diagnosis model is constructed to avoid the problem of low fault diagnosis efficiency of KELM caused by artificial parameter selection. The process of using GSWOA to optimize KELM parameters is shown in Figure 3. The specific steps are as follows:

Step 1: Initialize the parameters of the GSWOA, set the whale population size to 10, the maximum number of iterations to 60, the problem dimension to 2, and the whale exploration boundary to [1,20];

Step 2: Initialize the whale position and map it to the initialization parameters of KELM: regularization coefficient C and kernel function parameter γ;

Step 3: Calculate the fitness value of each whale in the whale population and find the optimal whale location in the population;

Step 4: Update the current optimal location using Formulas (23) and (24);

Step 5: Randomly generate the update parameter p. If p < 0.5 and |A| < 1, update the whale position using Formula (16); if p < 0.5 and |A| ≥ 1, use Formula (20) to update the position of the whale. If p ≥ 0.5, the whale position is updated using Formula (22), where A is the step coefficient of the convergence factor optimization;

Step 6: Determine whether the maximum number of iterations has been reached. If it is, output the whale position at this moment as the optimal parameters of KELM, input these optimal parameters into the KELM model, and train the model for fault diagnosis; if not, repeat the above steps until the maximum number of iterations is reached.
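Steps 1 to 6 can be sketched as the loop below, using the settings from Step 1 (population 10, 60 iterations, dimension 2, boundary [1, 20]). Several choices are assumptions made only so the example runs on its own: the fitness is a hypothetical quadratic stand-in for a KELM-based fitness, positions are clipped to the boundary, A and C are scalars per whale, and the best position is carried over between iterations.

```python
import math
import random

LOW, HIGH = 1.0, 20.0            # exploration boundary from Step 1
POP, T_MAX, DIM = 10, 60, 2      # position = (C, gamma), see Step 2

def clip(x):
    return [min(HIGH, max(LOW, v)) for v in x]

def fitness(x):
    """Hypothetical stand-in for a KELM-based fitness; minimum at (5, 2)."""
    return (x[0] - 5.0) ** 2 + (x[1] - 2.0) ** 2

def gswoa(seed=0):
    rng = random.Random(seed)
    whales = [[rng.uniform(LOW, HIGH) for _ in range(DIM)] for _ in range(POP)]
    best = min(whales, key=fitness)                       # Step 3
    history = []
    for t in range(1, T_MAX + 1):
        # Step 4: neighborhood perturbation with greedy selection, (23)-(24).
        if rng.random() < 0.5:
            r1 = rng.random()
            cand = clip([v + 0.5 * r1 * v for v in best])
            if fitness(cand) < fitness(best):
                best = cand
        w = 0.2 * math.cos(math.pi / 2.0 * (1.0 - t / T_MAX))     # (18)
        b = math.exp(5.0 * math.cos(math.pi * (1.0 - t / T_MAX)))  # (21)
        a = 2.0 - 2.0 * t / T_MAX
        moved = []
        for x in whales:                                  # Step 5
            A = 2.0 * a * rng.random() - a
            C = 2.0 * rng.random()
            p = rng.random()
            if p < 0.5 and abs(A) < 1.0:                  # Formula (16)
                new = [bs - A * abs(C * bs - xi) for bs, xi in zip(best, x)]
            elif p < 0.5:                                 # Formula (20)
                ref = rng.choice(whales)
                new = [w * r - A * abs(C * r - xi) for r, xi in zip(ref, x)]
            else:                                         # Formula (22)
                l = rng.random()
                new = [w * bs + b * abs(C * bs - xi) * math.exp(b * l)
                       * math.cos(2.0 * math.pi * l) for bs, xi in zip(best, x)]
            moved.append(clip(new))
        whales = moved
        best = min(whales + [best], key=fitness)
        history.append(fitness(best))
    return best, history                                  # Step 6

best, history = gswoa(seed=0)
```

In an actual diagnosis pipeline, `fitness` would train a KELM with the candidate C and γ and return a quantity to be minimized, and the returned `best` would supply the final parameters.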

4. Experimental Verification and Result Analysis

4.1. Data Acquisition and Preprocessing

The open gearbox fault data set collected from the drivetrain dynamic simulator (DDS) of Southeast University is used to verify the proposed method, whose data acquisition test platform is shown in Figure 4. This test platform includes a brake, brake controller, planetary gearbox, reduction gearbox, motor, motor controller, and other components. Gear bearings are mounted on the second-stage drive shaft of the reduction gearbox or the second-stage planetary shaft of the planetary gearbox, and seven vibration sensors of type 608A11 are mounted in the direction of the x, y, and z-axes of the planetary gearbox and reduction gearbox as well as in the direction of the motor’s z-axis with a sampling frequency of 5120 Hz. Its collected data include two working conditions, with the speed and load of 20 Hz/0 V and 30 Hz/2 V, respectively. There are five types of gear faults: health, chipped, root, miss, and surface. A detailed description of them is shown in Table 3. The gears of different fault states are processed in advance, the variable speed can be realized via the motor controller, and the change of load is realized via the load controller.

In this study, the data under the 20 Hz/0 V condition are selected for research; each type of fault intercepts 100 sample groups, and each sample group contains 2048 sample points. The data set is randomly divided according to the ratio of the training set to the test set = 7:3, and the labels for the five types of gearbox faults are established, as shown in Table 3.
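The sample construction and the 7:3 split described above can be sketched as follows. The example assumes non-overlapping windows and a plain random split over all 500 samples; whether the original split was stratified by fault type is not stated. The function names `segment` and `split_indices` are hypothetical.

```python
import random

def segment(signal, length=2048, count=100):
    """Cut `count` non-overlapping windows of `length` points from a signal."""
    if len(signal) < length * count:
        raise ValueError("signal too short for the requested windows")
    return [signal[i * length:(i + 1) * length] for i in range(count)]

def split_indices(n_samples, train_ratio=0.7, seed=0):
    """Randomly split sample indices into training and test subsets."""
    idx = list(range(n_samples))
    random.Random(seed).shuffle(idx)
    n_train = round(n_samples * train_ratio)
    return idx[:n_train], idx[n_train:]

# Five fault types with 100 sample groups each.
n_classes, per_class = 5, 100
train_idx, test_idx = split_indices(n_classes * per_class)

# Placeholder signal of zeros, used only to exercise the windowing.
windows = segment([0.0] * (2048 * 100))
```

With 500 samples in total, this yields 350 training samples and 150 test samples.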

4.2. Time-Frequency Features Extraction

Thirteen time-domain features described in Table 1 and five frequency-domain features described in Table 2 are extracted and normalized, respectively. The data distributions of different time-domain features and frequency-domain features in different gear fault states are shown in Figure 5 and Figure 6, respectively.

4.3. Fault Diagnosis and Result Analysis

4.3.1. Fault Diagnosis and Result Analysis without Feature Fusion

The time-domain feature matrix T, frequency-domain feature matrix F, and fusion-feature matrix TF obtained from the above feature extraction are each input into the GSWOA-KELM fault diagnosis model. In this model, the whale population size is set to 10, the maximum number of iterations is 60, the dimension is 2, the lower bound is 1, and the upper bound is 20. The dataset partitioning and labeling are as shown in Table 3. The fault diagnosis results under the three different inputs are shown in Figure 7, Figure 8 and Figure 9, respectively, where Figure 7a, Figure 8a and Figure 9a show the comparison of the predicted and actual classifications, and Figure 7b, Figure 8b and Figure 9b show the confusion matrices of the classification results. In Figure 7b, Figure 8b and Figure 9b, the blue line indicates the number and proportion of samples correctly classified and the red line indicates the number and proportion of samples misclassified.

The accuracy rate of fault diagnosis under three different inputs is shown in Table 4.

As can be seen from Table 4, the classification accuracy of GSWOA-KELM is lower when the inputs are single-domain features: the classification accuracy of the training set is 86.67% with time-domain features and 85.33% with frequency-domain features. When the inputs are fusion features, GSWOA-KELM has the highest classification accuracy, reaching 100%, which is 13.33 and 14.67 percentage points higher than with the time-domain and frequency-domain features alone, respectively. Therefore, extracting multi-domain features of gear faults as inputs to the model can significantly improve the classification accuracy of the fault diagnosis model.

4.3.2. Fault Diagnosis and Result Analysis with Feature Fusion

First, in order to verify the superiority of the GSWOA-KELM model in terms of convergence speed and global search performance, the fusion feature vector TF is input into the SSA-KELM, WOA-KELM, and GSWOA-KELM fault diagnosis models, respectively, and the fitness curves of the three models are compared, as shown in Figure 10. During fault diagnosis, the parameters of the three models are set to be consistent, where the population number is 10, the maximum number of iterations is 60, and the dimension is 2. The dataset partitioning and label setting are set as shown in Table 3.

Figure 10a shows that SSA-KELM converges after 50 iterations, with a final convergence value of 0.098. Figure 10b shows that when WOA is used to optimize KELM, the fitness curve is a straight line, which suggests that the model falls into a local optimum from the beginning. As shown in Figure 10c, GSWOA-KELM needs only two iterations to find the optimal value, and its final convergence value is 0, which is lower than that of SSA-KELM, indicating faster convergence and higher optimization accuracy. Meanwhile, compared with WOA-KELM, GSWOA-KELM avoids premature convergence to a local optimum and has a stronger global search ability.

Next, in order to verify the fault diagnosis accuracy of the GSWOA-KELM model, the KELM, SSA-KELM, and WOA-KELM models are selected for comparison. The fusion feature vector TF is input into each of the four fault diagnosis models, in which the population size of the SSA, WOA, and GSWOA optimization algorithms is set to 10, the maximum number of iterations is 60, and the dimension is 2. In addition, the dataset partitioning and label settings are as shown in Table 3. The classification accuracy of each fault diagnosis model is shown in Table 5. The fault diagnosis results of each model are shown in Figure 11, Figure 12, Figure 13 and Figure 14, where Figure 11a, Figure 12a, Figure 13a and Figure 14a show the comparison of the predicted and actual classifications of each model, and Figure 11b, Figure 12b, Figure 13b and Figure 14b show the confusion matrices of the classification results of each model.

As shown in Table 5, the fault diagnosis classification accuracies of KELM, SSA-KELM, WOA-KELM, and GSWOA-KELM models are 88.67%, 91.33%, 98.67%, and 100%, respectively. Among them, GSWOA-KELM, established in this study, has the highest accuracy, which reaches 100%. Compared with the other three models, the fault diagnosis classification accuracy of GSWOA-KELM is improved by 11.33%, 8.67%, and 1.33%, respectively.
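The arithmetic behind these figures can be checked with a few lines. The sketch assumes that the accuracies are measured on the 150-sample test set implied by the 7:3 split of 500 samples; that test-set size is an inference and is not stated alongside Table 5.

```python
# Assumption: 30% of 5 fault types x 100 sample groups form the test set.
n_test = 150

accuracy = {
    "KELM": 88.67,
    "SSA-KELM": 91.33,
    "WOA-KELM": 98.67,
    "GSWOA-KELM": 100.0,
}

# Number of correctly classified samples implied by each accuracy.
correct = {m: round(a / 100.0 * n_test) for m, a in accuracy.items()}

# Accuracy gain of GSWOA-KELM over each baseline, in percentage points.
gain = {
    m: round(accuracy["GSWOA-KELM"] - a, 2)
    for m, a in accuracy.items()
    if m != "GSWOA-KELM"
}
```

Under that assumption the reported accuracies correspond to 133, 137, 148, and 150 correctly classified samples out of 150, and the gains are differences in percentage points rather than relative improvements.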

As can be seen from Figure 11, Figure 12, Figure 13 and Figure 14, the GSWOA-KELM model accurately identifies all fault types with no misclassification. In the other three models, the misclassified fault types are mainly Miss and Surface. Figure 11 shows that, in the KELM model, three Root samples are misclassified as Miss samples, one Root sample is misclassified as a Surface sample, and two Miss samples are misclassified as Root samples. Figure 12 shows that, in the SSA-KELM model, four Root samples are misclassified as Miss samples, two Miss samples are misclassified as Root samples, and seven Miss samples are misclassified as Surface samples. As can be seen from Figure 13, in the WOA-KELM model, one Miss sample and one Surface sample are each misclassified as Root samples.

5. Conclusions

To address the low accuracy caused by manual parameter selection in KELM fault diagnosis, an improved time-frequency fusion features-based GSWOA-KELM model is proposed. The results confirm that the model proposed in this study is effective for gear fault diagnosis. The specific conclusions are as follows:

(1): Compared with KELM, SSA-KELM, and WOA-KELM, the GSWOA-KELM has faster convergence speed, stronger global search capability, and higher recognition accuracy;
(2): When constructing a GSWOA-KELM model for gear fault diagnosis, the GSWOA-KELM performance can be improved by considering the fusion features rather than the single time-domain or frequency-domain features;
(3): Compared to KELM, SSA-KELM, and WOA-KELM, the GSWOA-KELM model proposed in this study improved the fault diagnosis accuracy by 11.33%, 8.67%, and 1.33%, respectively.

Since real-world gear systems usually have different signal-to-noise ratios, the proposed method needs to be tested in more practical engineering application scenarios. In the next step, it is necessary to study the applicability of the proposed method under different signal-to-noise ratios and port the proposed algorithm to an embedded terminal to realize on-line diagnosis of gear faults [10].

Author Contributions

Conceptualization, Q.H. and C.Z.; methodology, Q.H., J.S. and C.Z.; software, Q.H. and C.W.; validation, Q.H. and H.Z.; formal analysis, Q.H., H.Z. and P.H.; investigation, Q.H. and C.W.; resources, Q.H. and C.W.; data curation, Q.H. and C.Z.; writing—original draft preparation, Q.H. and H.Z.; writing—review and editing, Q.H. and P.H.; visualization, Q.H. and C.Z.; supervision, Q.H. and C.W.; project administration, H.Z. and Q.H.; funding acquisition, H.Z., C.W. and Q.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key Research and Development Plan Young Scientists Project under Grant No. 2021YFF0603400, Zhejiang Provincial Natural Science Foundation of China under Grant No. LQ21E050019, State Administration for Market Regulation Science and Technology Plan Project under Grant No. 2023MK228.

Data Availability Statement

The data presented in this study are available upon request from the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Hagemann, T.; Ding, H.; Radtke, E.; Schwarze, H. Operating Behavior of Sliding Planet Gear Bearings for Wind Turbine Gearbox Applications—Part II: Impact of Structure Deformation. Lubricants 2021, 9, 98.
2. Issaadi, I.; Hemsas, K.E.; Soualhi, A. Wind Turbine Gearbox Diagnosis Based on Stator Current. Energies 2023, 16, 5286.
3. Shi, Z.; Liu, S.; Yue, H.; Wu, X. Noise Analysis and Optimization of the Gear Transmission System for Two-Speed Automatic Transmission of Pure Electric Vehicles. Mech. Sci. 2023, 14, 333–345.
4. Vasić, M.P.; Stojanović, B.; Blagojević, M. Fault Analysis of Gearboxes in Open Pit Mine. Appl. Eng. Lett. 2020, 5, 50–61.
5. Bai, Z.; Ning, Z. Dynamic Responses of the Planetary Gear Mechanism Considering Dynamic Wear Effects. Lubricants 2023, 11, 255.
6. De las Morenas, J.; Moya-Fernández, F.; López-Gómez, J.A. The Edge Application of Machine Learning Techniques for Fault Diagnosis in Electrical Machines. Sensors 2023, 23, 2649.
7. Ma, G.; Yue, X.; Zhu, J.; Liu, Z.; Lu, S. Deep Learning Network Based on Improved Sparrow Search Algorithm Optimization for Rolling Bearing Fault Diagnosis. Mathematics 2023, 11, 4634.
8. Liu, H.; Song, X.; Zhang, F. Fault diagnosis of new energy vehicles based on improved machine learning. Soft Comput. 2021, 25, 12091–12106.
9. Zhang, C.; Peng, Z.; Chen, S.; Li, Z.; Wang, J. A Gearbox Fault Diagnosis Method Based on Frequency-Modulated Empirical Mode Decomposition and Support Vector Machine. Proc. Inst. Mech. Eng. Part C J. Mech. Eng. Sci. 2018, 232, 369–380.
10. Wang, H.; Yu, Z.; Guo, L. Real-Time Online Fault Diagnosis of Rolling Bearings Based on KNN Algorithm. J. Phys. Conf. Ser. 2020, 1486, 032019.
11. Yuan, L.; Lian, D.; Kang, X.; Chen, Y.; Zhai, K. Rolling Bearing Fault Diagnosis Based on Convolutional Neural Network and Support Vector Machine. IEEE Access 2020, 8, 137395–137406.
12. Zhang, Y.; Jia, Y.; Wu, W.; Cheng, Z.; Su, X.; Lin, A. A Diagnosis Method for the Compound Fault of Gearboxes Based on Multi-Feature and BP-AdaBoost. Symmetry 2020, 12, 461.
13. Zhang, W.; Lu, H.; Zhang, Y.; Li, Z.; Wang, Y.; Zhou, J.; Mei, J.; Wei, Y. A Fault Diagnosis Scheme for Gearbox Based on Improved Entropy and Optimized Regularized Extreme Learning Machine. Mathematics 2022, 10, 4585.
14. Bai, X.; Ma, Z.; Chen, W.; Wang, S.; Fu, Y. Fault Diagnosis Research of Laser Gyroscope Based on Optimized-Kernel Extreme Learning Machine. Comput. Electr. Eng. 2023, 111, 108956.
15. Liang, R.; Chen, Y.; Zhu, R. A Novel Fault Diagnosis Method Based on the KELM Optimized by Whale Optimization Algorithm. Machines 2022, 10, 93.
16. Li, X.; Fang, Y.; Liu, L. Kernel Extreme Learning Machine for Flatness Pattern Recognition in Cold Rolling Mill Based on Particle Swarm Optimization. J. Braz. Soc. Mech. Sci. Eng. 2020, 42, 270.
17. Song, C.; Yao, L.; Hua, C.; Ni, Q. Comprehensive Water Quality Evaluation Based on Kernel Extreme Learning Machine Optimized with the Sparrow Search Algorithm in Luoyang River Basin, China. Environ. Earth Sci. 2021, 80, 521.
18. Zhou, F.; Gong, J.; Yang, X.; Han, T.; Yu, Z. A New Gear Intelligent Fault Diagnosis Method Based on Refined Composite Hierarchical Fluctuation Dispersion Entropy and Manifold Learning. Measurement 2021, 186, 110136.
19. Huang, Y.; Yuan, B.; Xu, S.; Han, T. Fault Diagnosis of Permanent Magnet Synchronous Motor of Coal Mine Belt Conveyor Based on Digital Twin and ISSA-RF. Processes 2022, 10, 1679.
20. Li, X.; Zhao, H. Performance Prediction of Rolling Bearing Using EEMD and WCDPSO-KELM Methods. Appl. Sci. 2022, 12, 4676.
21. Yuan, X.; Miao, Z.; Liu, Z.; Yan, Z.; Zhou, F. Multi-Strategy Ensemble Whale Optimization Algorithm and Its Application to Analog Circuits Intelligent Fault Diagnosis. Appl. Sci. 2020, 10, 3667.
22. Huang, W.; Zhang, G.; Jiao, S.; Wang, J. Bearing Fault Diagnosis Based on Stochastic Resonance and Improved Whale Optimization Algorithm. Electronics 2022, 11, 2185.
23. Yang, T.; Li, W.; Huang, Z.; Huang, Z.; Peng, L.; Yang, J. Short-term prediction of wind power generation based on VMD-GSWOA-LSTM model. AIP Adv. 2023, 13, 085215.
24. Qi, J.; He, Q.; Jiang, Y.; Xu, Y. Mechanical Fault Diagnosis of Circuit Breakers Based on XGBoost and Time-Domain Features. J. Phys. Conf. Ser. 2020, 1616, 012105.
25. Wang, J.; Li, S.; Xin, Y.; An, Z. Gear Fault Intelligent Diagnosis Based on Frequency-Domain Feature Extraction. J. Vib. Eng. Technol. 2019, 7, 159–166.
26. Yan, X.; Jia, M. A novel optimized SVM classification algorithm with multi-domain feature and its application to fault diagnosis of rolling bearing. Neurocomputing 2018, 313, 47–64.
27. He, Z.; Li, Q.; Chu, M.; Liu, G. Dynamic Weighing Algorithm for Dairy Cows Based on Time Domain Features and Error Compensation. Comput. Electron. Agric. 2023, 212, 108077.
28. Zhang, Q.; Huo, R.; Zheng, H.; Huang, T.; Zhao, J. A Fault Diagnosis Method With Bitask-Based Time- and Frequency-Domain Feature Learning. IEEE Trans. Instrum. Meas. 2023, 72, 1–11.
29. Huang, G.B.; Zhu, Q.Y.; Siew, C.K. Extreme learning machine: Theory and applications. Neurocomputing 2006, 70, 489–501.
30. Huang, G.B.; Wang, D.H.; Lan, Y. Extreme Learning Machines: A Survey. Int. J. Mach. Learn. Cybern. 2011, 2, 107–122.
31. Mirjalili, S.; Lewis, A. The Whale Optimization Algorithm. Adv. Eng. Softw. 2016, 95, 51–67.
32. Wu, Z.; Lu, X. Microgrid Fault Diagnosis Based on Whale Algorithm Optimizing Extreme Learning Machine. J. Electr. Eng. Technol. 2023, 1–10.

Figure 1. The network structure of ELM.

Figure 2. Flow chart for GSWOA.

Figure 3. Flow chart of the KELM optimized using GSWOA.

Figure 4. Southeast University gearbox data acquisition test platform.

Figure 5. Distribution of time-domain feature parameters.

Figure 6. Distribution of frequency-domain feature parameters.

Figure 7. GSWOA-KELM fault diagnosis results for time-domain features as input. (a) Predictive classification results; (b) confusion matrix for the test set.

Figure 8. GSWOA-KELM fault diagnosis results for frequency-domain features as input. (a) Predictive classification results; (b) confusion matrix for the test set.

Figure 9. GSWOA-KELM fault diagnosis results for fusion-domain features as input. (a) Predictive classification results; (b) confusion matrix for the test set.

Figure 10. Convergence curves for three models. (a) SSA-KELM; (b) WOA-KELM; and (c) GSWOA-KELM.

Figure 11. The fault diagnosis results of KELM. (a) Predictive classification results; (b) confusion matrix for the test set.

Figure 12. The fault diagnosis results of SSA-KELM. (a) Predictive classification results; (b) confusion matrix for the test set.

Figure 13. The fault diagnosis results of WOA-KELM. (a) Predictive classification results; (b) confusion matrix for the test set.

Figure 14. The fault diagnosis results of GSWOA-KELM. (a) Predictive classification results; (b) confusion matrix for the test set.

Table 1. The 13 time-domain characteristic parameters and their calculation formulas.

Dimensional Parameter	Formula	Dimensionless Parameter	Formula
Mean Value	$\bar{x} = \frac{1}{N} \sum_{n=1}^{N} x(n)$	Pulse Factor	$I = \frac{\max |x(n)|}{\bar{x}}$
Standard Deviation	$\sigma_{x} = \sqrt{\frac{1}{N-1} \sum_{n=1}^{N} [x(n) - \bar{x}]^{2}}$	Margin Factor	$L = \frac{\max |x(n)|}{\left( \frac{1}{N} \sum_{n=1}^{N} \sqrt{|x(n)|} \right)^{2}}$
Root-Mean-Square Value	$x_{rms} = \sqrt{\frac{1}{N} \sum_{n=1}^{N} x^{2}(n)}$	Waveform Factor	$W = \frac{x_{rms}}{\bar{x}}$
Maximum Value	$x_{max} = \max_{n} x(n)$	Kurtosis	$K = \frac{\sum_{n=1}^{N} [x(n) - \bar{x}]^{4}}{(N-1) \sigma_{x}^{4}}$
Minimum Value	$x_{min} = \min_{n} x(n)$	Skewness	$S = \frac{\sum_{n=1}^{N} [x(n) - \bar{x}]^{3}}{(N-1) \sigma_{x}^{3}}$
Peak-peak Value	$x_{pp} = x_{max} - x_{min}$	Amplitude Factor	$A = \frac{x_{max}}{x_{rms}}$
		Energy	$E = \sum_{n=1}^{N} x^{2}(n)$

Note: where x(n) represents the time-domain sequence of the signal, n = 1, 2, …, N; N is the sample number.
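
The formulas in Table 1 can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' code: the function name `time_domain_features` and the dictionary keys are hypothetical, and the sketch follows the table as printed, so the pulse and waveform factors divide by the plain mean and require a signal whose mean is nonzero.

```python
import numpy as np

def time_domain_features(x):
    """Return the 13 time-domain parameters of Table 1 for a 1-D signal.

    Hypothetical helper: names and return type are illustrative only.
    """
    x = np.asarray(x, dtype=float)
    n = x.size
    mean = x.mean()
    std = x.std(ddof=1)                 # 1/(N-1) normalisation, as in Table 1
    rms = np.sqrt(np.mean(x ** 2))
    x_max, x_min = x.max(), x.min()
    peak = np.max(np.abs(x))            # max |x(n)|
    centered = x - mean
    return {
        # dimensional parameters
        "mean": mean,
        "std": std,
        "rms": rms,
        "max": x_max,
        "min": x_min,
        "peak_to_peak": x_max - x_min,
        # dimensionless parameters (plus energy, listed in the same column)
        "pulse_factor": peak / mean,
        "margin_factor": peak / np.mean(np.sqrt(np.abs(x))) ** 2,
        "waveform_factor": rms / mean,
        "kurtosis": np.sum(centered ** 4) / ((n - 1) * std ** 4),
        "skewness": np.sum(centered ** 3) / ((n - 1) * std ** 3),
        "amplitude_factor": x_max / rms,
        "energy": np.sum(x ** 2),
    }

# Toy signal, used only to exercise the function.
features = time_domain_features([1.0, 2.0, 3.0, 4.0])
```

Returning a dictionary keeps each value tied to its name in Table 1; converting it to a fixed-order vector is left to the caller.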

Table 2. The five frequency-domain characteristic parameters and their calculation formulas.

Frequency-Domain Characteristic Parameter	Formula
Amplitude Mean	$AM = \frac{1}{K} \sum_{k=1}^{K} s(k)$
Center Frequency	$CF = \frac{\sum_{k=1}^{K} f_{k} \cdot s(k)}{\sum_{k=1}^{K} s(k)}$
Mean Square Frequency	$MSF = \frac{\sum_{k=1}^{K} f_{k}^{2} \cdot s(k)}{\sum_{k=1}^{K} s(k)}$
Root-Mean-Square Frequency	$RMSF = \sqrt{\frac{\sum_{k=1}^{K} f_{k}^{2} \cdot s(k)}{\sum_{k=1}^{K} s(k)}}$
Frequency Variance	$FVAR = \frac{\sum_{k=1}^{K} (f_{k} - CF)^{2} \cdot s(k)}{\sum_{k=1}^{K} s(k)}$

Note: where s(k) stands for the spectrum of signal x(n), k = 1, 2, …, K; K is the number of spectral lines; $f_{k}$ denotes the frequency value of the k-th spectral line.
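
The five parameters of Table 2 can be illustrated with a short NumPy sketch. Several choices here are assumptions rather than details given in the paper: s(k) is taken to be the one-sided amplitude spectrum from a real FFT, the sampling rate `fs` is supplied by the caller, and the function name `frequency_domain_features` is hypothetical.

```python
import numpy as np

def frequency_domain_features(x, fs):
    """Return the five frequency-domain parameters of Table 2.

    Assumption: s(k) is the one-sided amplitude spectrum of x(n);
    fs is the sampling rate in Hz.
    """
    x = np.asarray(x, dtype=float)
    s = np.abs(np.fft.rfft(x)) / x.size        # spectrum s(k)
    f = np.fft.rfftfreq(x.size, d=1.0 / fs)    # frequency f_k of each line
    total = s.sum()
    cf = np.sum(f * s) / total                 # center frequency
    msf = np.sum(f ** 2 * s) / total           # mean square frequency
    return {
        "AM": s.mean(),
        "CF": cf,
        "MSF": msf,
        "RMSF": np.sqrt(msf),
        "FVAR": np.sum((f - cf) ** 2 * s) / total,
    }

# Toy example: a 50 Hz sine sampled at 1 kHz for one second.
t = np.arange(1000) / 1000.0
spectral = frequency_domain_features(np.sin(2 * np.pi * 50 * t), fs=1000.0)
```

Because the weights s(k) are normalised by their sum, the definitions imply FVAR = MSF − CF², which gives a quick consistency check on any implementation.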

Table 3. Dataset fault type description and classification label.

Fault Type	Fault Description	Classification Label	Train Set Samples	Test Set Samples	Total Sample Number
Health	Healthy gear.	1	70	30	100
Chipped	The gear is cracked or even broken.	2	70	30	100
Miss	Gear defect.	3	70	30	100
Root	There is a crack at the root of the gear.	4	70	30	100
Surface	Gear surface wear.	5	70	30	100
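
The per-class split in Table 3 (70 training and 30 test samples out of 100 for each of the five labels) can be reproduced with a sketch like the following. How the authors chose which samples go into each set is not stated here, so the random permutation, the seed, and the function name `split_per_class` are assumptions for illustration.

```python
import numpy as np

def split_per_class(labels, n_train=70, seed=0):
    """Split sample indices so each class contributes n_train training samples.

    Assumption: samples are assigned at random within each class.
    """
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)
    train_idx, test_idx = [], []
    for c in np.unique(labels):
        idx = rng.permutation(np.flatnonzero(labels == c))
        train_idx.extend(idx[:n_train])   # first 70 shuffled indices
        test_idx.extend(idx[n_train:])    # remaining 30 indices
    return np.array(train_idx), np.array(test_idx)

# Labels 1..5 with 100 samples each, as in Table 3.
labels = np.repeat(np.arange(1, 6), 100)
train_idx, test_idx = split_per_class(labels)
```

This yields 350 training and 150 test samples in total, which matches the column sums of Table 3.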

Table 4. The accuracy rate for different inputs.

Input	Accuracy Rate
T	86.67%
F	85.33%
TF	100%

Table 5. The classification accuracy rate of each fault diagnosis model.

Fault Diagnosis Model	Fault Type	Accuracy Rate	Overall Accuracy
KELM	Health	100%	88.67%
	Chipped	100%
	Miss	88.0%
	Root	93.3%
	Surface	65.7%
SSA-KELM	Health	100%	91.33%
	Chipped	100%
	Miss	82.6%
	Root	83.5%
	Surface	79.4%
WOA-KELM	Health	100%	98.67%
	Chipped	100%
	Miss	100%
	Root	100%
	Surface	96.8%
GSWOA-KELM	Health	100%	100%
	Chipped	100%
	Miss	100%
	Root	100%
	Surface	100%
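
The improvements quoted in the conclusions follow directly from the overall accuracies in Table 5. The short check below recomputes them as percentage-point differences and converts each accuracy into a count of correctly classified test samples, assuming the 150-sample test set implied by Table 3 (5 classes × 30 samples); the variable names are illustrative.

```python
# Overall accuracies (%) copied from Table 5.
overall = {
    "KELM": 88.67,
    "SSA-KELM": 91.33,
    "WOA-KELM": 98.67,
    "GSWOA-KELM": 100.0,
}
n_test = 150  # assumption: 5 classes x 30 test samples, per Table 3

# Percentage-point gain of GSWOA-KELM over each baseline.
gains = {
    model: round(overall["GSWOA-KELM"] - acc, 2)
    for model, acc in overall.items()
    if model != "GSWOA-KELM"
}

# Number of correctly classified test samples implied by each accuracy.
correct = {model: round(acc / 100 * n_test) for model, acc in overall.items()}

for model, gain in gains.items():
    print(f"{model}: +{gain} points, {correct[model]}/{n_test} correct")
```

Under this assumption the baselines classify 133, 137, and 148 of the 150 test samples correctly, so the 1.33-point gap to WOA-KELM corresponds to two samples.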

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

