Bearing Fault Feature Extraction Based on Adaptive OMP and Improved K-SVD

: The condition of the bearing is closely related to the condition and remaining life of the rotating machine. Targeting the problem of the large number of harmonic signals and noise signals during the operation of rolling bearings, and given that it is difﬁcult to identify the fault in time, an adaptive orthogonal matching pursuit algorithm (OMP) and an improved K-singular value decomposition (K-SVD) for bearing fault feature extraction are proposed. An adaptive OMP algorithm is applied, which uses the Fourier dictionary to improve the solution method of the OMP algorithm so that it can separate the harmonic components in the signal faster and more accurately. At the same time, the stopping criterion of the adaptive sparsity is improved in dictionary learning. There is no need to manually set the sparsity in the algorithm initialization process, which avoids the problem of algorithm performance degradation due to improper sparsity settings, and improves the efﬁciency of the K-SVD algorithm. As shown by theoretical veriﬁcation, algorithm comparison, and experimental comparisons, the algorithm has certain advantages in fault feature extraction during rolling bearing operation, and the algorithm still has considerable practical value in long-duration and strong noise environments.


Introduction
In total, 40% of the failure of rotating machinery come from rolling bearings, and the failure of bearings is a progressive process [1]. When premature failure occurs, the operator is difficult to find with the naked eye, and when an obvious failure occurs, it has caused damage to the machinery. Therefore, finding an abnormal bearing in time and accurately, and making decisions are the keys to preventing this kind of failure.
The vibration analysis method is the most used bearing fault diagnosis method at present. By installing sensors on the bearing base or inside the box, the vibration signals generated by the bearing during the operation of mechanical equipment are collected, and the vibration signals obtained by the sensors are processed by computer software to determine the type and nature of the bearing faults [2][3][4].
The processing and analysis of the bearing signal mainly includes time domain, frequency domain, and time-frequency domain analysis. The evolutionary trend of bearing faults can be determined by time domain analysis, and the location and degree of bearing faults can be determined by frequency domain analysis. Vibration analysis shows some superior features to other methods of fault diagnosis analysis of advantages, such as being suitable for bearings of multiple types and variable working conditions, which can realize In order to separate harmonic components from vibration signals, we can choose a suitable over-complete dictionary to separate harmonic components from vibration signals using an orthogonal matching tracking algorithm. We choose the Fourier dictionary as the over-complete basis because it is comprised of a series of sine and cosine functions, in line with the waveform characteristics of harmonic signals. The Fourier dictionary is set as D 1 , and its sparsity means that the sparse matrix is α 1 . By introducing the over-complete dictionary, the original vibration signal can be expressed as s = I(t) + D 1 α 1 +n(t) (2) Atoms in a Fourier dictionary can be represented by the function γ = ( f , υ), in which υ ∈ {0, 1} determines the type of phase, and when υ = 0, the atomic waveform is a cosine function; when the waveform of the υ = 1 atom is a sine function, the mathematical expression is ϕ( f , 0)= cos(2π f t) ϕ( f , 1)= sin(2 f t) In which f is the frequency parameter, and the optimization problem of matching vibration signals and separating harmonic components through a Fourier dictionary can be described as follows: An orthogonal matching pursuit algorithm is used to solve the problem. The solution process is split into a traversal process and a temporary solution updating process. Harmonic components in vibration signals can be expressed by the result of the multiplication of D 1 and α 1 . In order to select the most suitable column in the dictionary, it is necessary to verify every column in the dictionary. Set the columns in the D 1 dictionary as d i , i ∈ {1 ≤ i ≤ m, N + }, where m is the number of columns in the dictionary, the behavior γ i , i ∈ {1 ≤ i ≤ m, N + } in the sparse coefficient matrix α 1 , and m is the number of rows in the coefficient matrix. In the jth traversal, the error can be expressed as (j) = d j γ j − s 2 (5) Processes 2022, 10, 675 4 of 23 By minimizing its error, it can be obtained from Equation (5): Then, the error is Therefore, in the traversal process of an orthogonal matching pursuit, the error value between the product of multiplying the current dictionary atoms by sparse coefficients and the original signal can be obtained, which is very similar to Equation (7): where r k−1 is the residual after the (k − 1)th iteration. In order to minimize the error, we only need to maximize the productivity of the dictionary atom d T j , which is selected for the kth time, and the residual r k−1 . This process is the process of obtaining the maximum inner product (the absolute value form): where d j k represents the atoms extracted for the kth time, d i represents the ith atom in Fourier dictionary D 1 , and the atoms obtained from the dictionary are stored in support set A k . In the stage of updating the temporary solution, ||D 1 α 1 − s|| 2 2 is minimized for sparse coefficient matrix α 1 under the condition of support set A k .
We write the matrix composed of the columns in the dictionary D 1 as D A k ∈ R n×|A k | ; at this time, it becomes necessary to find the minimum value of ||D A k α A k − s|| 2 2 , which is the remaining part of sparse matrix α 1 after removing zero elements. Making the derivative of this quadratic form equal to zero can provide the solution of the minimization problem, as follows: Equation (10) uses the residual formula of the kth iteration, r k = s − D 1 γ k = s − D A k α A k . This relationship shows that the column selected in the support set A k in Fourier dictionary D 1 must be orthogonal to the residual r k , which also leads to the next iteration. Because the atoms in the Fourier dictionary are sine wave unit functions of a specific frequency and orthogonal to each other, the inner product of any two atoms is zero. The interpretation of the vibration signal from the energy point of view can be expressed as: There are two kinds of stopping conditions for the traditional orthogonal matching pursuit algorithm: one is that the residual error is less than the preset value, and the other is that it reaches the preset number of iterations and then stops.
An adaptive orthogonal matching pursuit algorithm is used to separate harmonic components. From Formula (11), it can be seen that harmonic components and modulation components are distributed on sine waves with definite frequencies, such that we can perform the calculation according to the sparse coefficients of these atoms. Supposing that the maximum value of the sparse coefficients obtained after k iterations is a and the Processes 2022, 10, 675 5 of 23 minimum value is b, when a > ρb, the iteration stops outputting sparse coefficients. In order to better recover signals, ρ = 5 can be set through experiments. Finally, the residual signal is the composite signal of the impact component and noise. The flow chart of the use of the adaptive orthogonal matching pursuit algorithm to separate harmonic and modulation components is shown in Figure 1.
Processes 2022, 10, x FOR PEER REVIEW 5 of 24 the adaptive orthogonal matching pursuit algorithm to separate harmonic and modulation components is shown in Figure 1.

K-SVD Dictionary Learning Algorithm with an Improved Termination Criterion
In order to solve the problem that the pre-built dictionary has poor adaptability when faced with different types of fault signals, a feature extraction method of learning dictionaries is proposed. For different types of fault signals, the feature information can be extracted more accurately through dictionary learning, and the interference of irrelevant information can be reduced so that the fault types can be detected. The sparse feature extraction method of learning of a dictionary is divided into two essential steps: a sparse representation solution and dictionary learning. We improve the K-SVD algorithm to optimize the stopping criterion of the dictionary learning feature extraction method, in order to better identify the fault information.
K-SVD is an over-complete dictionary learning algorithm, which is proposed to reduce the sparsity of the sparse coefficient matrix corresponding to the dictionary, and the atoms learned by K-SVD can be used to better represent the original signal. The problem can be described as where ∈ × represents the original signal, ∈ × represents the dictionary matrix containing atoms , and ∈ × represents the sparse coefficient matrix, where represents the number of samples.. The K-SVD algorithm is divided into two stages: sparse coding and dictionary learning. In the sparse representation solving stage, the orthogonal matching pursuit algorithm with an improved error threshold (OMPerr) is used to replace the orthogonal matching pursuit algorithm, and the objective function and constraint conditions of Formula (12) are changed:

K-SVD Dictionary Learning Algorithm with an Improved Termination Criterion
In order to solve the problem that the pre-built dictionary has poor adaptability when faced with different types of fault signals, a feature extraction method of learning dictionaries is proposed. For different types of fault signals, the feature information can be extracted more accurately through dictionary learning, and the interference of irrelevant information can be reduced so that the fault types can be detected. The sparse feature extraction method of learning of a dictionary is divided into two essential steps: a sparse representation solution and dictionary learning. We improve the K-SVD algorithm to optimize the stopping criterion of the dictionary learning feature extraction method, in order to better identify the fault information.
K-SVD is an over-complete dictionary learning algorithm, which is proposed to reduce the sparsity of the sparse coefficient matrix corresponding to the dictionary, and the atoms learned by K-SVD can be used to better represent the original signal. The problem can be described as min where s ∈ R m×n represents the original signal, D ∈ R m×K represents the dictionary matrix containing k atoms d k , and α ∈ R K×n represents the sparse coefficient matrix, where m represents the number of samples. The K-SVD algorithm is divided into two stages: sparse coding and dictionary learning. In the sparse representation solving stage, the orthogonal matching pursuit algorithm with Processes 2022, 10, 675 6 of 23 an improved error threshold (OMPerr) is used to replace the orthogonal matching pursuit algorithm, and the objective function and constraint conditions of Formula (12) are changed: The biggest advantage of changing the constraint condition is that it is no longer necessary to set the sparsity to stop the program, and it also avoids the problem that the sparsity of the sparse coefficient matrix cannot meet the precision requirement due to there being too few iterations; thus, the original signal cannot be accurately restored, which also avoids the problem that the calculation time is too long due to there being too many iterations. By setting the error goal of the objective function, the iteration is stopped when the error of the original signal and the sparse approximation is less than a given value, such that the accurate form of the signal can be recovered and a more accurate reconstruction can be realized.
Each symbol in Formula (14) has the same meaning as the symbol in Formula (12), and α is the set of solution vectors of s. By solving the optimization problem in Formula (13), we obtain the sparse coefficient vector corresponding to the initialization dictionary through the sparse representation solution process. Next, the dictionary learning stage-knowing the sparse vectors obtained by the sparse coding process-updates the dictionary column by column. Suppose we want to update the kth atom of the dictionary, denoted as d k , and the kth coefficient α k T of the sparse coefficient matrix.
In which E k = s − K ∑ j =k d j α j T represents the calculation residual, and the optimization problem at this time is transformed into finding It is necessary to find the optimal solution of d k , α k T to obtain the minimum error. This is a least square problem, which can be solved by the least square method or singular value decomposition (SVD). In this paper, the SVD method is utilized to solve the optimal solution of two variables. However, if E k is directly used to solve this, the sparsity of α k T obtained will be larger or no longer sparse, such that the elements corresponding to zero elements in E k and α k T can be eliminated here to form a new residual matrix E k , as shown in Figure 2.
The mathematical expression is used to describe this step. The position of non-zero in the sparse coefficient vector is recorded as 1 and the position of zero element is recorded as 0; that is, set a set A k = i|1 < i < K, α k T (i) = 0 represents the index value when α k T = 0, define a matrix B k as a matrix of N×l(A k ), and its element at the position of (A k (i), i) is 1, and the elements in the remaining positions are 0: At this time, the objective equation can be described as Therefore, we require the best d k , α k T to perform singular value decomposition on E k : In which = − ∑ represents the calculation residual, and the optimi tion problem at this time is transformed into finding min , − ( It is necessary to find the optimal solution of , to obtain the minimum err This is a least square problem, which can be solved by the least square method or singu value decomposition (SVD). In this paper, the SVD method is utilized to solve the optim solution of two variables. However, if is directly used to solve this, the sparsity of obtained will be larger or no longer sparse, such that the elements corresponding to ze elements in and can be eliminated here to form a new residual matrix , as show in Figure 2.  Take the first column u 1 = U(•, 1) of the left singular matrix U as d k , that is, d k = u 1 . Take the product of the first row of the right singular matrix and the first singular value as α k T , that is, α k T = Σ(1, 1)V T (1, •). Use this method to replace the columns in the original dictionary in turn to generate a new learning dictionary. After this repeated process, the dictionary matrix and the coefficient matrix corresponding to the original signal s can be calculated again and again. The iteration stops if ||s − Dα|| 2 2 < ε. At this time, the sparse vector matrix and the learning dictionary matrix calculated by the K-SVD algorithm are output, and the reconstructed signal is obtained by multiplying the learning dictionary by the sparse coefficient vector. At this time, the reconstructed signal contains rich transient impact components, which eliminate the interference of a large number of harmonic components and noise components. The envelope spectrum can be used to obtain the fault frequency of the reconstructed signal, so as to identify the fault type of the faulty bearing, highlighting the effectiveness and reliability of this method of extracting the transient pulse component.

Bearing Fault Feature Extraction Method Based on Adaptive OMP and Improved K-SVD
The overall method of bearing fault feature extraction based on adaptive OMP and improved K-SVD is given in Figure 3.
First, the Fourier dictionary is constructed, and the adaptive OMP algorithm is used to predominantly process the signal in order to separate the harmonic signal.
Second, the sparse objective equation is constructed, and the termination condition modified in this paper is used as the termination condition of the equation to update the constructed dictionary, in order to carry out the next iteration. Finally, the envelope analysis of the signal is performed in order to extract the fault features.

Comparison of the Harmonic Component Extraction Simulation of the Adaptive OMP Algorithm
Amplitude Modulation (AM) and Frequency Modulation (FM) are the most frequent signals in our lives. They often overlap and interfere with each other to form complex background noise, which causes serious interference for the extraction of transient pulse components. According to the theory of the analytical model, a synthetic signal ( ) is generated, which consists of a harmonic signal, ( ) ; two modulation signals, (t) and ( ); and Gaussian white noise with a variance of 1 and a mean of 0, as in Formula (20). The harmonic components, noise, and time-domain waveforms of the synthesized signal are presented in Figure 4.

Comparison of the Harmonic Component Extraction Simulation of the Adaptive OMP Algorithm
Amplitude Modulation (AM) and Frequency Modulation (FM) are the most frequent signals in our lives. They often overlap and interfere with each other to form complex background noise, which causes serious interference for the extraction of transient pulse components. According to the theory of the analytical model, a synthetic signal y(t) is generated, which consists of a harmonic signal, y 1 (t); two modulation signals, y 2 (t) and y 3 (t); and Gaussian white noise n with a variance of 1 and a mean of 0, as in Formula (20). The harmonic components, noise, and time-domain waveforms of the synthesized signal are presented in Figure 4.
In which: The time is t ∈ [0, 1], and the sampling interval is 0.0005 s. Harmonic components are extracted by an adaptive orthogonal matching pursuit algorithm, wavelet basis decomposition, and EEMD. The wavelet base decomposition adopts the SYM wavelet base, and the number of decomposition layers is five layers. EEMD decomposition discards the first IMF component because it denoises EEMD the most frequently. Figure 5 shows the experimental results of the comparison between the proposed method and the other two methods for extracting harmonic components. The blue lines are the harmonic components, and the red lines are the harmonic components extracted from the noise. The harmonic component signal extracted by the adaptive quadrature matching pursuit algorithm is smooth, and the noise component is greatly suppressed. Compared with other methods, the effect is the best and the calculation time is shorter, and the harmonic separation signal is at the peak of the original harmonic signal. The similarity with the trough is high, and the best extraction effect is achieved. The calculation time of the three algorithms to extract the harmonic components and the similarity between the extracted harmonic components and the original signal are shown in Table 1.  The time is ∈ [0,1], and the sampling interval is 0.0005 s. Harmonic components are extracted by an adaptive orthogonal matching pursuit algorithm, wavelet basis decomposition, and EEMD. The wavelet base decomposition adopts the SYM wavelet base, and the number of decomposition layers is five layers. EEMD decomposition discards the first IMF component because it denoises EEMD the most frequently. Figure 5 shows the experimental results of the comparison between the proposed method and the other two methods for extracting harmonic components. The blue lines are the harmonic components, and the red lines are the harmonic components extracted from the noise. The harmonic component signal extracted by the adaptive quadrature matching pursuit algorithm is smooth, and the noise component is greatly suppressed. Compared with other methods, the effect is the best and the calculation time is shorter, and the harmonic separation signal is at the peak of the original harmonic signal. The similarity with the trough is high, and the best extraction effect is achieved. The calculation time of the three algorithms to extract the harmonic components and the similarity between the extracted harmonic components and the original signal are shown in Table 1.    In order to verify the effectiveness and accuracy of the method proposed in this paper, the method proposed in this paper and the traditional K-SVD method are used to analyze the same set of synthetic signals. It can be known from the mechanical vibration model that a set of synthetic signals usually consists of harmonic components, shock components, and noise. Its specific expression is as follows: The initial position of the shock component is T 0 = 0.025 s, the failure period is T = 0.1 s, the failure frequency is f = 1 T = 10 Hz, and the time t ∈ [0, 1]; then, the entire cycle contains 10 shock components to learn dictionary atoms. Natural frequency f n = 500 Hz, and damping coefficient ξ = 0.1. Harmonic interference frequency f 1 = 33 Hz, and f 2 = 35 Hz; we add Gaussian white noise with variance 1 and mean 0, and its amplitude A = 0.3. The sampling frequency is at least twice the natural frequency. In this simulation, the sampling frequency is set to 4000 Hz, and the number of sampling points is 4000. The time-domain waveform of the harmonic components is shown in Figure 6a, the time-domain waveform of impulse components is shown in Figure 6b, and the time-domain waveform of noise is shown in Figure 6c. The figure is shown in Figure 6d.
First, a Fourier dictionary conforming to the waveform of the harmonic components is constructed, and the harmonic components in the synthesized signal are separated by an adaptive quadrature matching pursuit algorithm. The iteration stops when the largest element in the sparse vector is five times or more the smallest element, and the harmonic components are separated from the composite signal. Through the calculation, it can be shown that the sparseness of the harmonic and modulation components of the simulated signal is 4, which appear at the disposal of the 67th, 71st, 501st and 2008th points, and their values are 25.2898, 24.9902, 9.7865 and 2.9822, respectively. In the obtained sparse matrix, all of the coefficients except these four points are zero. Finally, the Fourier dictionary is used to multiply the sparse coefficient matrix to obtain the harmonics and modulation components in the simulated signal, as shown in Figure 7a. The harmonic components are plotted in the same graph as those extracted from the simulated signal by adaptive quadrature matching tracing, with the original harmonic components in red and the adaptive quadrature matching tracing extracted from the simulated signal in blue. The extracted harmonic components are shown in Figure 7b; it can be seen that the extracted harmonic components are very similar to the original harmonic components, and the correlation is calculated by the corroded function to obtain 0.9967, which shows the effectiveness of this method in eliminating harmonic components. When the harmonic components are removed from the simulated signal, as shown in Figure 8, the combined time-domain waveform of the remaining impulse components and noise can be seen, which further proves the effectiveness of the method in separating the interference of harmonic components. and damping coefficient = 0.1. Harmonic interference frequency = 33 Hz, and = 35 Hz; we add Gaussian white noise with variance 1 and mean 0, and its amplitude = 0.3. The sampling frequency is at least twice the natural frequency. In this simulation, the sampling frequency is set to 4000 Hz, and the number of sampling points is 4000. The time-domain waveform of the harmonic components is shown in Figure 6a, the time-domain waveform of impulse components is shown in Figure 6b, and the time-domain waveform of noise is shown in Figure 6c. The figure is shown in Figure 6d. First, a Fourier dictionary conforming to the waveform of the harmonic components is constructed, and the harmonic components in the synthesized signal are separated by an adaptive quadrature matching pursuit algorithm. The iteration stops when the largest element in the sparse vector is five times or more the smallest element, and the harmonic components are separated from the composite signal. Through the calculation, it can be shown that the sparseness of the harmonic and modulation components of the simulated signal is 4, which appear at the disposal of the 67th, 71st, 501st and 2008th points, and their values are 25.2898, 24.9902, 9.7865 and 2.9822, respectively. In the obtained sparse matrix, all of the coefficients except these four points are zero. Finally, the Fourier dictionary is used to multiply the sparse coefficient matrix to obtain the harmonics and modulation components in the simulated signal, as shown in Figure 7a. The harmonic components are plotted in the same graph as those extracted from the simulated signal by adaptive quadrature matching tracing, with the original harmonic components in red and the adaptive quadrature matching tracing extracted from the simulated signal in blue. The extracted harmonic components are shown in Figure 7b; it can be seen that the extracted harmonic components are very similar to the original harmonic components, and the correlation is calculated by the corroded function to obtain 0.9967, which shows the effective-  In order to verify the advantages of this method in separating the harmonic components in the signal, the OMP algorithm is now used to select a set of control experiments for the separation of harmonic components with a sparsity K = 7 and a set of sparsity K = 10 as a comparison. When the sparsity K = 7, it can be seen from Figure 9. Although the  In order to verify the advantages of this method in separating the harmonic components in the signal, the OMP algorithm is now used to select a set of control experiments for the separation of harmonic components with a sparsity K = 7 and a set of sparsity K = In order to verify the advantages of this method in separating the harmonic components in the signal, the OMP algorithm is now used to select a set of control experiments for the separation of harmonic components with a sparsity K = 7 and a set of sparsity K = 10 as a comparison. When the sparsity K = 7, it can be seen from Figure 9. Although the transient shock components are consistent with the original set time at the shock time, there are still transient shock components submerged in the harmonic components and noise, which is relatively sparse. The separation effect is insignificant when K = 4. When we increase the sparsity to K = 10, we obtain the results shown in Figure 10. It can be observed that with the increase of sparsity, the effect of harmonic separation is worse and worse. However, it is impossible to calculate every sparsity to obtain the best sparsity during the experiment; the adaptive orthogonal matching pursuit algorithm used in this paper makes up for this defect, and can quickly calculate the optimal sparsity. The calculation time is greatly reduced, and the harmonic components are better separated from the signal. The adaptive quadrature matching pursuit algorithm is used to separate the harmonic components from the synthesized signal, and the K-SVD-an algorithm with an improved termination criterion-is used to denoise the signal. Set errorGoal = 0.35 to construct a random over-complete dictionary. Through the algorithm proposed in this paper, the sparse coefficient and the learning dictionary are calculated. Six atoms in the learning dictionary are randomly selected, and their waveforms are shown in Figure 11. It can be observed that the waveforms of the atoms are in the form of unilateral attenuation, which is consistent with the waveform of the transient shock component. The transient impact component can be reconstructed using a sparse coefficient and a learning dictionary. The adaptive quadrature matching pursuit algorithm is used to separate the harmonic components from the synthesized signal, and the K-SVD-an algorithm with an improved termination criterion-is used to denoise the signal. Set errorGoal = 0.35 to construct a random over-complete dictionary. Through the algorithm proposed in this paper, the sparse coefficient and the learning dictionary are calculated. Six atoms in the learning dictionary are randomly selected, and their waveforms are shown in Figure 11. It can be observed that the waveforms of the atoms are in the form of unilateral attenuation, which is consistent with the waveform of the transient shock component. The transient impact component can be reconstructed using a sparse coefficient and a learning dictionary. The adaptive quadrature matching pursuit algorithm is used to separate the harmonic components from the synthesized signal, and the K-SVD-an algorithm with an improved termination criterion-is used to denoise the signal. Set errorGoal = 0.35 to construct a random over-complete dictionary. Through the algorithm proposed in this paper, the sparse coefficient and the learning dictionary are calculated. Six atoms in the learning dictionary are randomly selected, and their waveforms are shown in Figure 11. It can be observed that the waveforms of the atoms are in the form of unilateral attenuation, which is consistent with the waveform of the transient shock component. The transient impact component can be reconstructed using a sparse coefficient and a learning dictionary.
The sparse coefficient matrix and learning dictionary are obtained at the end of the K-SVD algorithm cycle. The signal is reconstructed and restored to obtain the time-domain waveform of the sparse signal, as shown in Figure 12a. It can be observed that most of the noise has been removed and restored. The signal is close to the impact component of the composite signal without adding noise and harmonic components, and the transient impact period is discernible. Using the classical K-SVD algorithm and the orthogonal matching pursuit algorithm, the time-domain waveform of the sparse signal is obtained, as in Figure 13a. The reconstructed signal is still disturbed by noise and harmonic components, and the transient impact period is not obvious. The envelope spectrum analysis of the sparse signal is carried out, as shown in Figure 12b; it can be seen that the spectral peak of the fault frequency and the spectral peak at the frequency doubling are more obvious, and all of the transient shock components in the cut-off period are identified. The higher harmonics are almost eliminated. In Figure 13b it is found that the spectral peaks at the multiplier of the fault frequency are not obvious; some transient impulse components are submerged in the noise, and are difficult to identify. This shows the effectiveness and reliability of this algorithm in the extraction of transient impulse components in intense noise. The adaptive quadrature matching pursuit algorithm is used to separate the harmonic components from the synthesized signal, and the K-SVD-an algorithm with an improved termination criterion-is used to denoise the signal. Set errorGoal = 0.35 to construct a random over-complete dictionary. Through the algorithm proposed in this paper, the sparse coefficient and the learning dictionary are calculated. Six atoms in the learning dictionary are randomly selected, and their waveforms are shown in Figure 11. It can be observed that the waveforms of the atoms are in the form of unilateral attenuation, which is consistent with the waveform of the transient shock component. The transient impact component can be reconstructed using a sparse coefficient and a learning dictionary. The sparse coefficient matrix and learning dictionary are obtained at the end of the K-SVD algorithm cycle. The signal is reconstructed and restored to obtain the time-domain waveform of the sparse signal, as shown in Figure 12a. It can be observed that most of the noise has been removed and restored. The signal is close to the impact component of the composite signal without adding noise and harmonic components, and the transient In order to illustrate the applicability of the improved K-SVD, we further compare the performance of the original K-SVD with that of the improved K-SVD. For the same simulation signal, the termination condition needs to be set manually in traditional K-SVD, which requires a certain discipline background. In order to achieve better results, it usually needs to be set many times to achieve the optimal sparsity, while the improved K-SVD can automatically terminate only by setting the final goal, which is better in terms of the time and labor costs of practical application; see Table 2 for details. Table 2. Comparison of the level of performance between the original K-SVD and the improved K-SVD.

Experimental Verification 1
Experimental data were used to verify the validity and reliability of the proposed method for extracting transient shock features, and more authoritative public data in this field were selected [32]. The rolling bearing vibration signal of Case Western Reserve University Bearing Laboratory (CWRU) was invoked as the target signal for the analysis. The schematic diagram of the rolling bearing test bench and the test bench is illustrated in Figure 14. A 2HP (1.5KW) three-phase induction motor (Reliance Electric 2HP IQPreAlert motor) is arranged on the left, and the torque sensor in the middle is connected to the right through a self-aligning coupling. A dynamometer is attached to the motor in order to ensure that the desired torque load level is achieved, and the rolling bearing supporting the motor at the drive end is our bearing to be tested. The accelerator is installed on the motor casing of the driving end to sense vibration, and a 16-channel DAT recorder is used to In order to illustrate the applicability of the improved K-SVD, we further compare the performance of the original K-SVD with that of the improved K-SVD. For the same simulation signal, the termination condition needs to be set manually in traditional K-SVD, which requires a certain discipline background. In order to achieve better results, it usually needs to be set many times to achieve the optimal sparsity, while the improved K-SVD can automatically terminate only by setting the final goal, which is better in terms of the time and labor costs of practical application; see Table 2 for details. Table 2. Comparison of the level of performance between the original K-SVD and the improved K-SVD.

Experimental Verification 1
Experimental data were used to verify the validity and reliability of the proposed method for extracting transient shock features, and more authoritative public data in this field were selected [32]. The rolling bearing vibration signal of Case Western Reserve University Bearing Laboratory (CWRU) was invoked as the target signal for the analysis. The schematic diagram of the rolling bearing test bench and the test bench is illustrated in Figure 14. A 2HP (1.5KW) three-phase induction motor (Reliance Electric 2HP IQPreAlert motor) is arranged on the left, and the torque sensor in the middle is connected to the right through a self-aligning coupling. A dynamometer is attached to the motor in order to ensure that the desired torque load level is achieved, and the rolling bearing supporting the motor at the drive end is our bearing to be tested. The accelerator is installed on the motor casing of the driving end to sense vibration, and a 16-channel DAT recorder is used to In order to illustrate the applicability of the improved K-SVD, we further compare the performance of the original K-SVD with that of the improved K-SVD. For the same simulation signal, the termination condition needs to be set manually in traditional K-SVD, which requires a certain discipline background. In order to achieve better results, it usually needs to be set many times to achieve the optimal sparsity, while the improved K-SVD can automatically terminate only by setting the final goal, which is better in terms of the time and labor costs of practical application; see Table 2 for details. Table 2. Comparison of the level of performance between the original K-SVD and the improved K-SVD.

Experimental Verification 1
Experimental data were used to verify the validity and reliability of the proposed method for extracting transient shock features, and more authoritative public data in this field were selected [32]. The rolling bearing vibration signal of Case Western Reserve University Bearing Laboratory (CWRU) was invoked as the target signal for the analysis. The schematic diagram of the rolling bearing test bench and the test bench is illustrated in Figure 14. A 2HP (1.5KW) three-phase induction motor (Reliance Electric 2HP IQPreAlert motor) is arranged on the left, and the torque sensor in the middle is connected to the right through a self-aligning coupling. A dynamometer is attached to the motor in order to ensure that the desired torque load level is achieved, and the rolling bearing supporting the motor at the drive end is our bearing to be tested. The accelerator is installed on the motor casing of the driving end to sense vibration, and a 16-channel DAT recorder is used to collect vibration signals with a sampling frequency of 12 kHz. The model of the bearing to be tested is the SKF6205-2RS-type bearing. The single-point damage with a width of 0.1778 mm and a depth of 0.2794 mm is machined by EDM. The center position is at 6 o'clock, in order to detect the outer ring fault of the rolling bearing. The detailed parameters of the rolling bearing are given in Table 3.
Processes 2022, 10, x FOR PEER REVIEW collect vibration signals with a sampling frequency of 12 kHz. The model of the to be tested is the SKF6205-2RS-type bearing. The single-point damage with a w 0.1778 mm and a depth of 0.2794 mm is machined by EDM. The center position o'clock, in order to detect the outer ring fault of the rolling bearing. The detailed p ters of the rolling bearing are given in Table 3. Figure 14. CWRU bearing test bench and its schematic diagram. In the experiment, the rotating speed of the motor is 1797 r/min, the rotat quency is 29.95 Hz, and the fault frequency of the outer ring of the rolling bearin culated to be 107.36 Hz. In total, 12,000 data points in 1s were intercepted for the mental analysis. In order to reflect the performance of the algorithm proposed in per for the processing of long signals, 0.3 (2 ) is added to the original signal the harmonic components of = 30 Hz and Gaussian white noise with varianc mean 0 are added to the original signal. The difficulty lies in extracting the transien component in the signal to be detected. Its time-domain waveform diagram with is shown in Figure 15, and the transient impulse components are all submerged i and harmonic components. The envelope spectrum analysis is performed on th after adding noise, and the obtained envelope spectrum is given in Figure 16. It seen that due to the influence of noise, only the first and second harmonic of t frequency can be observed; the other multiple frequencies are not obvious, and the harmonic is more prominent.  In the experiment, the rotating speed of the motor is 1797 r/min, the rotating frequency is 29.95 Hz, and the fault frequency of the outer ring of the rolling bearing is calculated to be 107.36 Hz. In total, 12,000 data points in 1s were intercepted for the experimental analysis. In order to reflect the performance of the algorithm proposed in this paper for the processing of long signals, 0.3sin(2π f r t) is added to the original signal, where the harmonic components of f r = 30 Hz and Gaussian white noise with variance 1 and mean 0 are added to the original signal. The difficulty lies in extracting the transient pulse component in the signal to be detected. Its time-domain waveform diagram within 0-1 s is shown in Figure 15, and the transient impulse components are all submerged in noise and harmonic components. The envelope spectrum analysis is performed on the signal after adding noise, and the obtained envelope spectrum is given in Figure 16. It can be seen that due to the influence of noise, only the first and second harmonic of the fault frequency can be observed; the other multiple frequencies are not obvious, and the higher harmonic is more prominent. The adaptive orthogonal matching pursuit algorithm is used to separate the harmonic components in the original signal, and the sparsity and sparse vector coefficients are identified as shown in Figure 17. The sparsity is obtained as 44, which can be obtained by using the 44 atoms in the Fourier dictionary. This represents the harmonic component, the absolute value of the coefficient of which is 23.1857, and the absolute value of the coefficient is 4.5894. At this time, the maximum value of the coefficient is larger than five times the minimum value of the coefficient, the iteration stops, and the mixed signal is obtained by multiplying the Fourier dictionary and the sparse vector matrix. The harmonic components in the signal are shown in Figure 18. The harmonic components are removed from the signal, and the filtered signal is obtained as shown in Figure 19. After the calculation and processing of the adaptive quadrature matching pursuit algorithm, most of the harmonic signals can be eliminated from the original signal, and the period of the transient influence component is gradually more obvious. The adaptive orthogonal matching pursuit algorithm is used to separate the harmonic components in the original signal, and the sparsity and sparse vector coefficients are identified as shown in Figure 17. The sparsity is obtained as 44, which can be obtained by using the 44 atoms in the Fourier dictionary. This represents the harmonic component, the absolute value of the coefficient of which is 23.1857, and the absolute value of the coefficient is 4.5894. At this time, the maximum value of the coefficient is larger than five times the minimum value of the coefficient, the iteration stops, and the mixed signal is obtained by multiplying the Fourier dictionary and the sparse vector matrix. The harmonic components in the signal are shown in Figure 18. The harmonic components are removed from the signal, and the filtered signal is obtained as shown in Figure 19. After the calculation and processing of the adaptive quadrature matching pursuit algorithm, most of the harmonic signals can be eliminated from the original signal, and the period of the transient influence component is gradually more obvious. The adaptive orthogonal matching pursuit algorithm is used to separate the harmonic components in the original signal, and the sparsity and sparse vector coefficients are identified as shown in Figure 17. The sparsity is obtained as 44, which can be obtained by using the 44 atoms in the Fourier dictionary. This represents the harmonic component, the absolute value of the coefficient of which is 23.1857, and the absolute value of the coefficient is 4.5894. At this time, the maximum value of the coefficient is larger than five times the minimum value of the coefficient, the iteration stops, and the mixed signal is obtained by multiplying the Fourier dictionary and the sparse vector matrix. The harmonic components in the signal are shown in Figure 18. The harmonic components are removed from the signal, and the filtered signal is obtained as shown in Figure 19. After the calculation and processing of the adaptive quadrature matching pursuit algorithm, most of the harmonic signals can be eliminated from the original signal, and the period of the transient influence component is gradually more obvious.   Set the initialization dictionary, set errorGoal = 0.9, and use the K-SVD algorithm with improved termination criteria for denoising. The sparse representation is performed in the dictionary domain where the original signal is used as the initial dictionary. With the continuous updating of the learning dictionary, the obtained reconstructed signal is more sparse, as shown in Figure 20. The extracted transient shock component is closer to the real transient shock component, and its shock period becomes more and more obvious. The envelope spectrum analysis of the reconstructed signal is illustrated in Figure 21. It can be seen that the fault frequency and frequency multiplication of the outer race of the rolling bearing are outstanding, and the filtering effect of high-order harmonics is excellent, which is not displayed too much on the envelope spectrum. It is highlighted that the method proposed in this paper can accurately extract the fault frequency of the faulty bearing under the interference of strong noise and high-frequency harmonic components, and that it can further improve the effectiveness and reliability of the extraction of transient pulse components.   Set the initialization dictionary, set errorGoal = 0.9, and use the K-SVD algorithm with improved termination criteria for denoising. The sparse representation is performed in the dictionary domain where the original signal is used as the initial dictionary. With the continuous updating of the learning dictionary, the obtained reconstructed signal is more sparse, as shown in Figure 20. The extracted transient shock component is closer to the real transient shock component, and its shock period becomes more and more obvious. The envelope spectrum analysis of the reconstructed signal is illustrated in Figure 21. It can be seen that the fault frequency and frequency multiplication of the outer race of the rolling bearing are outstanding, and the filtering effect of high-order harmonics is excellent, which is not displayed too much on the envelope spectrum. It is highlighted that the method proposed in this paper can accurately extract the fault frequency of the faulty bearing under the interference of strong noise and high-frequency harmonic components, and that it can further improve the effectiveness and reliability of the extraction of transient pulse components. Set the initialization dictionary, set errorGoal = 0.9, and use the K-SVD algorithm with improved termination criteria for denoising. The sparse representation is performed in the dictionary domain where the original signal is used as the initial dictionary. With the continuous updating of the learning dictionary, the obtained reconstructed signal is more sparse, as shown in Figure 20. The extracted transient shock component is closer to the real transient shock component, and its shock period becomes more and more obvious. The envelope spectrum analysis of the reconstructed signal is illustrated in Figure 21. It can be seen that the fault frequency and frequency multiplication of the outer race of the rolling bearing are outstanding, and the filtering effect of high-order harmonics is excellent, which is not displayed too much on the envelope spectrum. It is highlighted that the method proposed in this paper can accurately extract the fault frequency of the faulty bearing under the interference of strong noise and high-frequency harmonic components, and that it can further improve the effectiveness and reliability of the extraction of transient pulse components.

Experimental Verification 2
In order to further verify the effectiveness of the method proposed in this paper to extract transient shock components in a strong noise background, the data collected by the Gearbox Dynamics Simulator (GDS) was used for analysis. The test bench consists of

Experimental Verification 2
In order to further verify the effectiveness of the method proposed in this paper to extract transient shock components in a strong noise background, the data collected by the Gearbox Dynamics Simulator (GDS) was used for analysis. The test bench consists of

Experimental Verification 2
In order to further verify the effectiveness of the method proposed in this paper to extract transient shock components in a strong noise background, the data collected by the Gearbox Dynamics Simulator (GDS) was used for analysis. The test bench consists of four parts: a signal acquisition device, motor, gearbox and load device. The acquisition device is comprised of a computer and a piezoelectric sensor of a signal acquisition instrument. The motor is a 3HP three-phase asynchronous motor with a maximum speed of 5000 rpm. It also includes a tachometer, a controller, and other display devices to form a power unit. The test bench is illustrated in Figure 22.

Experimental Verification 2
In order to further verify the effectiveness of the method proposed in this paper to extract transient shock components in a strong noise background, the data collected by the Gearbox Dynamics Simulator (GDS) was used for analysis. The test bench consists of four parts: a signal acquisition device, motor, gearbox and load device. The acquisition device is comprised of a computer and a piezoelectric sensor of a signal acquisition instrument. The motor is a 3HP three-phase asynchronous motor with a maximum speed of 5000 rpm. It also includes a tachometer, a controller, and other display devices to form a power unit. The test bench is illustrated in Figure 22. Using the HG-8916 data acquisition system, the maximum number of sampling points of the signal acquisition card is 32,768, and the maximum sampling frequency is 50 kHz, which is saved in .
format and imported into MATLAB for analysis. Owing to the limited conditions during the experiment, only the acceleration sensor could be installed on the outside of the gearbox to collect vibration data. Sensors are mounted on seven positions in total (P1−P7 in Figure 23). A piezoelectric accelerometer is placed at the point, mainly collecting vibration data, and a laser speed sensor is installed at the number 7 measuring point, mainly measuring the motor speed. The experimental bearing is an ER-16 K deep groove ball bearing, which supports the input shaft of the drive end. The bearing parameters are given in Table 4. Using the HG-8916 data acquisition system, the maximum number of sampling points of the signal acquisition card is 32,768, and the maximum sampling frequency is 50 kHz, which is saved in .txt format and imported into MATLAB for analysis. Owing to the limited conditions during the experiment, only the acceleration sensor could be installed on the outside of the gearbox to collect vibration data. Sensors are mounted on seven positions in total (P1−P7 in Figure 23). A piezoelectric accelerometer is placed at the point, mainly collecting vibration data, and a laser speed sensor is installed at the number 7 measuring point, mainly measuring the motor speed. The experimental bearing is an ER-16 K deep groove ball bearing, which supports the input shaft of the drive end. The bearing parameters are given in Table 4.  In this experiment, the sampling frequency is set to 20 kHz, the motor rotation frequency is 23.5 Hz, the acquisition time is 1s, and there are a total of 20,000 data points. According to the bearing data, the outer ring fault frequency is 83.94 Hz. This section intercepts a total of 10,000 data points within 0.5 s for analysis. The time-domain waveform  In this experiment, the sampling frequency is set to 20 kHz, the motor rotation frequency is 23.5 Hz, the acquisition time is 1s, and there are a total of 20,000 data points. According to the bearing data, the outer ring fault frequency is 83.94 Hz. This section intercepts a total of 10,000 data points within 0.5 s for analysis. The time-domain waveform diagram and its envelope spectrum are illustrated in Figure 24. It can be seen from the time-domain waveform diagram that the transient impulse component is completely submerged in the strong background noise and harmonic components, and only the fault characteristic frequency of its double frequency can be seen in the envelope spectrum; all of the other multipliers are submerged. In the high-frequency harmonic components and noise, it is difficult to distinguish the type of bearing failure. The method proposed in this paper is used to extract the transient pulse component in the original signal, and to identify the fault type of the rolling bearing.

Projects
Parameters Bearing Designation ER-166K Pitch diameter D/mm 38.506 Rolling elements diameter d/mm 8.006 Number of rolling elements 9 Contact angle β/° 0 In this experiment, the sampling frequency is set to 20 kHz, the motor rotation frequency is 23.5 Hz, the acquisition time is 1s, and there are a total of 20,000 data points. According to the bearing data, the outer ring fault frequency is 83.94 Hz. This section intercepts a total of 10,000 data points within 0.5 s for analysis. The time-domain waveform diagram and its envelope spectrum are illustrated in Figure 24. It can be seen from the time-domain waveform diagram that the transient impulse component is completely submerged in the strong background noise and harmonic components, and only the fault characteristic frequency of its double frequency can be seen in the envelope spectrum; all of the other multipliers are submerged. In the high-frequency harmonic components and noise, it is difficult to distinguish the type of bearing failure. The method proposed in this paper is used to extract the transient pulse component in the original signal, and to identify the fault type of the rolling bearing. First, the adaptive quadrature matching pursuit algorithm is used to separate the harmonic and modulation components, and an over-complete Fourier dictionary of 10,000 × 20,000 is established. The sparsity of the dictionary is 6, which appears in numbers 39, First, the adaptive quadrature matching pursuit algorithm is used to separate the harmonic and modulation components, and an over-complete Fourier dictionary of 10,000 × 20,000 is established. The sparsity of the dictionary is 6, which appears in numbers 39, 77, 6998, 7036, and 16,323, respectively. At the atoms of the six Fourier dictionaries of 16,361, the harmonic components and low-frequency noise are extracted from the original signal as shown in Figure 25. The harmonic components and low-frequency noise are extracted from the original signal, and the obtained filtered signal is shown in Figure 26. It can be seen that the signal processed by the adaptive quadrature matching pursuit algorithm has eliminated most of the harmonic signals from the original signal, and the transient impact component is gradually more obvious. After that, the K-SVD algorithm with an improved termination criterion is employed for noise reduction, and the transient shock components are successfully separated from the original signal.
Set the number of initial dictionaries as 600, set the length as 150, set errorGoal = 1.3, and use the K-SVD algorithm with improved termination criteria to continuously perform sparse representation and dictionary learning in the dictionary domain. With the continuous updating of the learning dictionary, the obtained reconstructed signal is more sparse, and the transient impact component period is more and more obvious. As shown in Figure 27, the envelope spectrum analysis of the reconstructed signal is shown in Figure 28. It can be seen that the fault frequency of the outer ring of the rolling bearing and its frequency multiplication are more bulbous, and the filtering effect of high-order harmonics and noise is better. tracted from the original signal, and the obtained filtered signal is shown in Figure 26. It can be seen that the signal processed by the adaptive quadrature matching pursuit algorithm has eliminated most of the harmonic signals from the original signal, and the transient impact component is gradually more obvious. After that, the K-SVD algorithm with an improved termination criterion is employed for noise reduction, and the transient shock components are successfully separated from the original signal. Set the number of initial dictionaries as 600, set the length as 150, set errorGoal = 1.3, and use the K-SVD algorithm with improved termination criteria to continuously perform sparse representation and dictionary learning in the dictionary domain. With the continuous updating of the learning dictionary, the obtained reconstructed signal is more sparse, and the transient impact component period is more and more obvious. As shown in Figure 27, the envelope spectrum analysis of the reconstructed signal is shown in Figure  28. It can be seen that the fault frequency of the outer ring of the rolling bearing and its frequency multiplication are more bulbous, and the filtering effect of high-order harmonics and noise is better. tracted from the original signal, and the obtained filtered signal is shown in Figure 26. It can be seen that the signal processed by the adaptive quadrature matching pursuit algorithm has eliminated most of the harmonic signals from the original signal, and the transient impact component is gradually more obvious. After that, the K-SVD algorithm with an improved termination criterion is employed for noise reduction, and the transient shock components are successfully separated from the original signal. Set the number of initial dictionaries as 600, set the length as 150, set errorGoal = 1.3, and use the K-SVD algorithm with improved termination criteria to continuously perform sparse representation and dictionary learning in the dictionary domain. With the continuous updating of the learning dictionary, the obtained reconstructed signal is more sparse, and the transient impact component period is more and more obvious. As shown in Figure 27, the envelope spectrum analysis of the reconstructed signal is shown in Figure  28. It can be seen that the fault frequency of the outer ring of the rolling bearing and its frequency multiplication are more bulbous, and the filtering effect of high-order harmonics and noise is better.  From the two experimental results, it can be seen that for the signal data in differen situations, the algorithm proposed in this paper has a superior effect regarding the extrac tion of the shock signal. Because of the speed and accuracy of the adaptive OMP algo rithm, it provides strong support for the K-SVD algorithm. However, in experiment 2, i can be seen that in an environment with more natural noise, the high-frequency signal i still very obvious, which also proves that this method has certain practicability.

Conclusions
Targeting the problem of the low efficiency of bearing fault data extraction, this pape combines the commonly used OMP algorithm with the improved K-SVD algorithm, and proposes a new bearing fault feature extraction method, which is mainly as follows: (1) According to the characteristics of harmonics, the Fourier dictionary is used to improve the OMP algorithm, which further improves the distinguishing ability and dis tinguishing speed of harmonic components.
(2) The termination criterion of the K-SVD algorithm is improved, and there is no need to manually set the sparsity during the initialization process of the algorithm, which avoids the problem of the performance degradation of the algorithm due to imprope sparsity setting. The calculation accuracy of the algorithm reduces the reconstruction erro of the signal.
After signal simulation and two sets of experimental verifications, the algorithm can effectively extract fault features, especially for high-frequency fault features. The ad vantages of the bearing fault feature extraction method based on adaptive OMP and im proved K-SVD are more obvious. Because of this feature, this method may produce good results in practical fault diagnosis applications for high-speed running machines, such a vehicles and aerospace machines. From the two experimental results, it can be seen that for the signal data in different situations, the algorithm proposed in this paper has a superior effect regarding the extraction of the shock signal. Because of the speed and accuracy of the adaptive OMP algorithm, it provides strong support for the K-SVD algorithm. However, in experiment 2, it can be seen that in an environment with more natural noise, the high-frequency signal is still very obvious, which also proves that this method has certain practicability.

Conclusions
Targeting the problem of the low efficiency of bearing fault data extraction, this paper combines the commonly used OMP algorithm with the improved K-SVD algorithm, and proposes a new bearing fault feature extraction method, which is mainly as follows: (1) According to the characteristics of harmonics, the Fourier dictionary is used to improve the OMP algorithm, which further improves the distinguishing ability and distinguishing speed of harmonic components.
(2) The termination criterion of the K-SVD algorithm is improved, and there is no need to manually set the sparsity during the initialization process of the algorithm, which avoids the problem of the performance degradation of the algorithm due to improper sparsity setting. The calculation accuracy of the algorithm reduces the reconstruction error of the signal.
After signal simulation and two sets of experimental verifications, the algorithm can effectively extract fault features, especially for high-frequency fault features. The advantages of the bearing fault feature extraction method based on adaptive OMP and improved K-SVD are more obvious. Because of this feature, this method may produce good results in practical fault diagnosis applications for high-speed running machines, such as vehicles and aerospace machines.