Feature Enhancement Method of Rolling Bearing Based on K-Adaptive VMD and RBF-Fuzzy Entropy

The complex and harsh working environment of rolling bearings cause the fault characteristics in vibration signal contaminated by the noise, which make fault diagnosis difficult. In this paper, a feature enhancement method of rolling bearing signal based on variational mode decomposition with K determined adaptively (K-adaptive VMD), and radial based function fuzzy entropy (RBF-FuzzyEn), is proposed. Firstly, a phenomenon called abnormal decline of center frequency (ADCF) is defined in order to determine the parameter K of VMD adaptively. Then, the raw signal is separated into K intrinsic mode functions (IMFs). A coefficient En for selecting optimal IMFs is calculated based on the center frequency bands (CFBs) of all IMFs and frequency spectrum for original signal autocorrelation operation. After that, the optimal IMFs of which En are bigger than the threshold are selected to reconstruct signal. Secondly, RBF is introduced as an innovative fuzzy function to enhance the feature discrimination of fuzzy entropy between bearings in different states. A specific way for determination of parameter r in fuzzy function is also presented. Finally, RBF-FuzzyEn is used to extract features of reconstructed signal. Simulation and experiment results show that K-adaptive VMD can effectively reduce the noise and enhance the fault characteristics; RBF-FuzzyEn has strong feature differentiation, superior noise robustness, and low dependence on data length.


Introduction
Rolling bearing is one of the most widely used components in mechanical equipment. The working environment of bearing is relatively bad, and the failure rate is high [1]. Once a failure of bearing occurs, it will seriously affect the safe operation of equipment. Thus, measurement for rolling bearings plays an increasingly important role. Adamczak et al. [2] compared various measurement systems for evaluating the vibrations of rolling bearings, which can reflect the statement of bearing. At present, fault diagnosis of rolling bearings mainly focuses on signal noise reduction and feature extraction. The commonly used denoising methods for rolling bearing vibration signal mainly include wavelet analysis [3], empirical mode decomposition (EMD) [4], ensemble empirical mode decomposition (EEMD) [5], local mean decomposition (LMD) [6], singular value decomposition (SVD) [7], stochastic resonance [8], and so on.
In 2014, Dragomiretskiy and Zosso proposed a new signal mode decomposition method, namely variational mode decomposition (VMD) [9]. VMD employs the variational frame and alternating direction multiplier method to decompose the mode of signal and extract the center frequency of each mode at the same time. It overcomes the shortcoming of mode aliasing in EMD. The essence of VMD is multiple adaptive Wiener filter banks, which has a solid theoretical foundation. The advantages of VMD make it widely used in the field of rolling bearing fault diagnosis. However, there remain two main problems to be solved in realizing noise reduction of rolling bearing vibration data by VMD: determine the number of modes K and penalty factor α; select IMFs and reconstruct the signal. Scholars have made some research achievements on these two issues.
VMD decomposes the input signal x into K modes u k (t) with a certain bandwidth and center frequency, so as to minimize the sum of bandwidths for each mode [9]. In order to reckon the bandwidth of each mode, Hilbert transform is carried out on u k (t), and its unilateral spectrum is obtained. Then the unilateral spectrum is mixed with an estimated center frequency and integrated into the baseband. Finally, the L 2 norm square is calculated for the gradient of fundamental band mode, and the following variational constraint problem is worked out: where K is the total number of modes; u k is the decomposed single-component AM-FM signal; ω is the center frequency of each component; x is original signal. The quadratic penalty factor α and Lagrangian multiplication operator λ are introduced in order to transform the above constrained problem into an unconstrained optimization problem. The augmented Lagrangian function is as follows: Equation (2) can be equivalently decomposed into sub-problems of alternately finding u k and ω k through Alternate Direction Method of Multipliers (ADMM). The calculation formulas ofû n+1 k and ω n+1 k are shown in Equations (3) and (4).
where,x(ω),û(ω) andλ(ω) are the Fourier transform of x(t), u(t) and λ(t), respectively. Afterû(ω) the updating of ω k andλ should be calculated by the following formula: where τ denotes the tolerance of noise. The optimization will end until the following condition is satisfied. < ε (6) where ε is the tolerance of convergence criterion.

Fuzzy Entropy
Fuzzy entropy, proposed by Chen et al. [23], is a widely used statistical feature, which is applied to measure the complexity of data. The specific calculation steps are listed as below: Entropy 2022, 24, 197 5 of 29 Step 1. Get the coarse-grained data. Work out X(i) from the original signal sequence x based on Equation (7). x(i + l) (7) where N is the length of x; m is the length of coarse-grained sequence X(i). Referring to paper [23], the value of m is taken as 2.
Step 2. Calculate the maximum absolute distance d m ij between X(i) and X(j).
Step 3. Work out similarity D m ij between X(i) and X(j) through fuzzy function based on maximum absolute distance d m ij . D m ij = f d m ij (9) The fuzzy function in paper [23] is f 1 d m ij = exp − d m ij n /r , and in [24] is Both functions contain two parameters r and n, which determine the width and the gradient of boundary for the fuzzy functions respectively. The forms of fuzzy functions are diverse, but must meet the following limitations: the function possesses continuity; the function should be convex, which maximizes the self-similarity.

K-Adaptive VMD
We proposed a signal decomposition method K-adaptive VMD, which does not set an upper limit of K. Based on whether there is ADCF in IMF, the maximum number of mode K is obtained. The calculation process of K-adaptive VMD is shown in Figure 1, which is divided into two parts: K value determination and component selecting.

Determination of K
Parameters K and α in VMD have certain impact on decomposition results, but many scholars have pointed out that the key to VMD lies in the optimization of decomposition layers K [12,18,25]. The determination of K is an extremely important step in VMD. When α is fixed, for small K value, multiple components of the original signal may appear in one IMF simultaneously, cause the signal and noise cannot be separated effectively, which is called under decomposition; for big K value, some components of the original signal will be decomposed into multiple IMFs, which is called over decomposition. But the problem of over-decomposition can be solved by selecting and reconstructing the corresponding components. With the increase of K, the signal is separated more finely in the whole frequency band, and the energy of the component is more concentrated. Through a great number of simulation and experimental signal research, we found that the center frequency of IMFs will decrease at some components, which is called ADCF. In addition to K and α, Entropy 2022, 24,197 6 of 29 VMD also contains two parameters: τ and ε, which represent the tolerance of noise and the tolerance of convergence criterion, respectively. τ affects the Lagrange multiplier λ. If the purpose of VMD is noise reduction, using the quadratic penalty only and keeping the value of λ at 0 by setting τ to 0. ε is typically around 10 −7 . Based on the above analysis, the specific steps for determination of K are listed as below: Step 1. Initialization parameters: K temp = 2, α = 2000, τ = 0, ε = 10 −7 , num = 0.
Step 2. Perform VMD on raw signal and get IMFs.
Step 3. If ADCF occurs, go to step 4, otherwise num = 0 and go to step 5.
Step 4. num = num + 1. If num is 2, that means ADCF occurs twice continuously, go to step 6, otherwise go to step 5.
Step 6. Number of optimized modes K = K temp − 2.

Determination of K
Parameters K and α in VMD have certain impact on decomposition results, but many scholars have pointed out that the key to VMD lies in the optimization of decomposition layers K [12,18,25]. The determination of K is an extremely important step in VMD. When α is fixed, for small K value, multiple components of the original signal may appear in one IMF simultaneously, cause the signal and noise cannot be separated effectively, which is called under decomposition; for big K value, some components of the original signal will be decomposed into multiple IMFs, which is called over decomposition. But the problem of over-decomposition can be solved by selecting and reconstructing the corresponding components. With the increase of K, the signal is separated more finely in the whole frequency band, and the energy of the component is more concentrated. Through a great number of simulation and experimental signal research, we found that the center frequency of IMFs will decrease at some components, which is called ADCF. In addition to K and α, VMD also contains two parameters: τ and ε, which represent the tolerance of noise and the tolerance of convergence criterion, respectively. τ affects the Lagrange multiplier λ. If the purpose of VMD is noise reduction, using the quadratic penalty only and keeping the value of λ at 0 by setting τ to 0. ε is typically around 10 −7 . Based on the above analysis, the specific steps for determination of K are listed as below: Step 1. Initialization parameters: Ktemp = 2, α = 2000, τ = 0, ε = 10 −7 , num = 0.
Step 2. Perform VMD on raw signal and get IMFs.

Selection of IMFs
The goal of the K-adaptive VMD is to get an optimal K as big as possible, but a big value of K will increase the difficulty of component selecting. So component selecting is very important for the proposed K-adaptive VMD. We introduced a component selecting index En based on center frequency band (CFB), frequency spectrum energy of original signal after autocorrelation operating and prior knowledge in this subsection. The amplitude of the frequency spectrum of autocorrelation operation is abbreviate as amp corr . The specific calculation steps are listed as below: Step 1. Calculate autocorrelation operation R x (τ) of original signal x(t) by Equation (12) and perform fast Fourier transform to work out the autocorrelation operation frequency spectrum, i = 1. x(t)x(t + τ)dt (12) Entropy 2022, 24,197 7 of 29

R x (τ) = lim
Step 2. Compute CFB i of IMF i . CFB i is centered on the center frequency ω i of IMF i , and the energy of IMF i frequency spectrum in CFB i is b times of the total energy of this component.
where f i is half of CFB i . The energy ratio b is between 0 and 1. The larger the value of b, the wider the CFB. In order to get a more concentrated frequency band, this paper takes the energy ratio b as 0.3.
Step 3. Calculate the energy En(i) of the amp corr for raw signal in CFB i .
Step 5. Determine whether prior knowledge is needed to assist in selecting IMF. If so, update En according to the knowledge and then go to step 6, otherwise go to step 6.
Step 6. Select all components corresponding to En index greater than 0.03 × ∑En, and reconstruct the signal.
Two aspects of prior knowledge are employed for IMF selecting supplementary when the method K-adaptive VMD is applied to reduce the noise of rolling bearing signals:

•
Prior knowledge one: delete the low-frequency components. Bearing vibration signals frequency bands can be divided into low-frequency, mid-frequency, and highfrequency in frequency domain. Low-frequency signals are usually components related to rotating speed [26]. These low-frequency components will affect the noise reduction effect of K-adaptive VMD. Therefore, the low-frequency components should not be contained in when reconstructing the signal by IMFs, so we can ignore these IMFs by setting En value of the components in low frequency to 0. We set En value of the IMF1 to IMF[K/3] components to 0 (IMF is sorted according to its center frequency from low to high, and [•] means rounded to 0). • Prior knowledge two: delete the components containing interference frequencies. In the actual engineering environment, bearing vibration signal is often interfered by natural frequency and rotation frequency of the equipment. In order to remove the influence of these interference frequencies, it is necessary to set En corresponding to the CFBs with these interference frequencies to 0.

RBF-FuzzyEn
Fuzzy entropy can represent the complexity of data. Normal bearing vibration signal often present relatively random distribution in amplitude value. The uncertainty of the change for normal bearing vibration amplitude is remarkable, which leads to a big value of fuzzy entropy. When the bearing fault occurs, the distribution of vibration signal amplitude in time domain become more certain, that is, the complexity of signal reduces and the fuzzy entropy is smaller. Therefore, fuzzy entropy of vibration signal in time domain can effectively characterize the state of bearing.
The key to the calculation of fuzzy entropy is fuzzy function. The limitations that need to be met by the fuzzy function have been introduced in Section 2.2. Kernel of radial basis function, which is commonly used in classification algorithms, satisfies the limitations of fuzzy function. The formula of RBF kernel is presented in Equation (14).
where x i − x j is a measure of the distance between x i and x j . Similarly, d m ij in fuzzy entropy is the distance between the coarse-grained sequences X i and X j . Hence the (14) can be replaced by d m ij to form a new fuzzy function based on RBF kernel. Equation of the new fuzzy function is as followings: It is apparent that Equation (15) only has one parameter r which determines the waveform of the fuzzy function. Figure 2 shows the curve of fuzzy function f d m ij when r is 0.01, 0.10 and 0.50, respectively. According to Figure 2, the curve becomes flatter with the increase of r. That is, the smaller r value is, and the similarity D m ij is more approximate to 0. So as to affect the overall mean of ϕ m , and ultimately affects the value of fuzzy entropy.
x , x = exp(− x − x /(2 )) where x − x is a measure of the distance between xi and xj. Similarly entropy is the distance between the coarse-grained sequences Xi and X x − x in Equation (14) can be replaced by to form a new fuzzy func RBF kernel. Equation of the new fuzzy function is as followings: It is apparent that Equation (15) only has one parameter r which d waveform of the fuzzy function. Figure 2 shows the curve of fuzzy function r is 0.01, 0.10 and 0.50, respectively. According to Figure 2, the curve becom the increase of r. That is, the smaller r value is, and the similarity is more to 0. So as to affect the overall mean of , and ultimately affects the valu tropy. For small value of r, most of is about 0, resulting in these h effect on which means a lot of information about the signal distributio For big value of r, most of is close to 1 which will also lead to the s Thus, it is very important to determine r adaptively according to the distribu We proposed that most of should not close to 0 based on the abo analysis. To achieve this goal, we need to determine a critical threshold of fies the following limitations for the reason that is a monotone dec tion: most of are small than this critical threshold; the fuzzy function critical threshold is close to 0. Firstly, ( ) + 3 ( ) is taken as the crit according to these two limitations. Furthermore, ( ) + 3 ( ) is equ considered that the value is close to 0 when the value is less than 10 − 6 ). Accor tion (15) and ( ) + 3 ( ) = 10 , the formula of r can be deduced where ( ) and ( ) are the mean and the standard deviation of For the calculation of multiple groups of data, r is calculated based on the largest ( ). This is because that the greater the standard deviation of , persed its distribution which means there are more large . Differ from methods for calculating fuzzy entropy, all data sets adopt a same r during tional process of RBF-FuzzyEn. The specific calculation process of RBF-Fuz hereafter: For small value of r, most of f d m ij is about 0, resulting in these d m ij have almost no effect on ϕ m which means a lot of information about the signal distribution will be lost. For big value of r, most of f d m ij is close to 1 which will also lead to the same problem. Thus, it is very important to determine r adaptively according to the distribution of signal. We proposed that most of f d m ij should not close to 0 based on the abovementioned analysis. To achieve this goal, we need to determine a critical threshold of d m ij that satisfies the following limitations for the reason that f d m ij is a monotone decreasing function: most of d m ij are small than this critical threshold; the fuzzy function valued of this critical threshold is close to 0. Firstly, µ d m ij + 3σ d m ij is taken as the critical threshold according to these two limitations. Furthermore, f µ d m ij + 3σ d m ij is equal to 10 −6 (it is considered that the value is close to 0 when the value is less than 10 −6 ). According to Equation (15) and , the formula of r can be deduced as follows: where µ d m ij and σ d m ij are the mean and the standard deviation of d m ij , respectively. For the calculation of multiple groups of data, r is calculated based on the data with the largest σ d m ij . This is because that the greater the standard deviation of d m ij , the more dispersed its distribution which means there are more large d m ij . Differ from the existing methods for calculating fuzzy entropy, all data sets adopt a same r during the computational process of RBF-FuzzyEn. The specific calculation process of RBF-FuzzyEn is listed hereafter: Step 1. Coarse the original data.
Step 2. Calculate the maximum absolute distance d m ij and d m+1 ij between X(i) and X(j) respectively.
Step 4. Determine the similarity f (d m ij ) and f (d m+1 ij ) based on Equation (15).

Simulation Analysis and Verification
where t = 1 s. It can be perceived from Equation (17) that f sig1 (t) consist of three frequency components, which are 7.5 Hz, 30 Hz frequency modulated by 5 Hz and 100 Hz amplitude modulated by 5 Hz, respectively. The sampling frequency is 1k Hz. The SNR of simulation signals is often set within the range of −5~8dB [11,14,18]. We set the Gaussian white noise η(t) with a SNR of −6 dB to simulate higher noise interference. Figure 3 shows the time domain waveform and frequency spectrum of original signal and noisy signal.
Entropy 2022, 23, x FOR PEER REVIEW 9 of 31 Step 2. Calculate the maximum absolute distance and between X(i) and X(j) respectively.

Analysis Based on Nonlinear AM-FM Simulation Signal
In this section, a nonlinear AM-FM simulation signals with Gaussian white noise are employed to illustrate the flow and feasibility of the algorithm. The simulation signal fsig1(t) is: where t = 1 s. It can be perceived from Equation (17) that ( ) consist of three frequency components, which are 7.5 Hz, 30 Hz frequency modulated by 5 Hz and 100 Hz amplitude modulated by 5 Hz, respectively. The sampling frequency is 1k Hz. The SNR of simulation signals is often set within the range of −5dB~8dB [11,14,18]. We set the Gaussian white noise η(t) with a SNR of -6 dB to simulate higher noise interference. Figure  3 shows the time domain waveform and frequency spectrum of original signal and noisy signal. In Figure 3, we can see that noise has strong interference to the original signal in both time domain and frequency domain. The time-domain waveform of original signal has been submerged by noise, and there contains more noises in high frequency range of the frequency spectrum.
In order to verify the K-adaptive VMD method proposed in Section 3.1, K is taken as 15, 35 and 150 for VMD decomposition of the AM-FM simulation signal respectively. These three different K represent the maximum K among the most works of VMD-based bearing fault diagnosis, the optimal decomposition levels obtained by K-adaptive VMD and a very large K, respectively. Select the components consistent with the main frequency of original signal when K is 13, 35 and 150 respectively, as shown in Figure 4. It can be concluded that with the increase of K, the noise in component decreases and all compo- In Figure 3, we can see that noise has strong interference to the original signal in both time domain and frequency domain. The time-domain waveform of original signal has been submerged by noise, and there contains more noises in high frequency range of the frequency spectrum.
In order to verify the K-adaptive VMD method proposed in Section 3.1, K is taken as 15, 35 and 150 for VMD decomposition of the AM-FM simulation signal respectively. These three different K represent the maximum K among the most works of VMD-based bearing fault diagnosis, the optimal decomposition levels obtained by K-adaptive VMD and a very large K, respectively. Select the components consistent with the main frequency of original signal when K is 13, 35 and 150 respectively, as shown in Figure 4. It can be concluded that with the increase of K, the noise in component decreases and all components become smoother by comparing the three subgraphs of Figure 4d-f. This is a manifestation that the band energy of IMF component becomes more concentrated with the increase of K value. nents become smoother by comparing the three subgraphs of Figure 4d-f. This is a manifestation that the band energy of IMF component becomes more concentrated with the increase of K value. As can be seen from Figure 4 that the amplitude modulation component 1 + 0.3cos(10 ) sin(200 ) of 100 Hz will gradually transform from IMF4 to IMF6 with the increase of K; while component of the 7.5 Hz has always been the lowest frequency component; the sideband of 30 ± 5 Hz component will appear in different IMF with the increase of K. For example, when K = 150, 35 Hz is in the IMF2 and IMF3.
When K is equal to 36 and 37, the central frequencies reduce at the high-frequency components as shown in Figure 5b,c, that is, ADCF occurs twice continuously. Therefore, K is determined as 35 for AM-FM simulation signal based on the proposed K-adaptive VMD. The phenomenon of ADCF becomes more serious with the increase of K, as indicated in Figure 5d. We can observe from the figure that when K is big, VMD cannot work out the IMFs with a gentle increase in the center frequency. The center frequency of many components decrease in decomposition results, and the center frequency showed an overall downward trend after about the 100th IMF. These phenomena show that VMD can no longer guarantee a steady increase of IMFs center frequency with the continuous increase of K value.  (d-f), frequency spectra of (a-c).
As can be seen from Figure 4 that the amplitude modulation component (1 + 0.3 cos(10πt)) sin(200πt) of 100 Hz will gradually transform from IMF4 to IMF6 with the increase of K; while component of the 7.5 Hz has always been the lowest frequency component; the sideband of 30 ± 5 Hz component will appear in different IMF with the increase of K. For example, when K = 150, 35 Hz is in the IMF2 and IMF3.
When K is equal to 36 and 37, the central frequencies reduce at the high-frequency components as shown in Figure 5b,c, that is, ADCF occurs twice continuously. Therefore, K is determined as 35 for AM-FM simulation signal based on the proposed K-adaptive VMD. The phenomenon of ADCF becomes more serious with the increase of K, as indicated in Figure 5d. We can observe from the figure that when K is big, VMD cannot work out the IMFs with a gentle increase in the center frequency. The center frequency of many components decrease in decomposition results, and the center frequency showed an overall downward trend after about the 100th IMF. These phenomena show that VMD can no longer guarantee a steady increase of IMFs center frequency with the continuous increase of K value.
As illustrated in Figure 3f, frequency spectrum of noisy signal has a lot of noise interference. Calculating the amp corr of original signal can reduce the white noise to a certain extent. Figure 6a shows the frequency spectrum of autocorrelation operation of original signal, several clear peaks correspond to the main frequency of original signal appeared. The mode corresponding to the frequency peak can be accurately selected by calculating the energy value of amp corr in CFBs. Therefore, the original characteristics of signal are enhanced and the noise is reduced. The energy value En is shown in Figure 6b. The red dotted line in the figure is 0.03 × ∑ En. According to this threshold, IMF1, 2, and 6 are selected to reconstruct the signal. As illustrated in Figure 3f, frequency spectrum of noisy signal has a lot of noise interference. Calculating the ampcorr of original signal can reduce the white noise to a certain extent. Figure 6a shows the frequency spectrum of autocorrelation operation of original signal, several clear peaks correspond to the main frequency of original signal appeared. The mode corresponding to the frequency peak can be accurately selected by calculating the energy value of ampcorr in CFBs. Therefore, the original characteristics of signal are enhanced and the noise is reduced. The energy value En is shown in Figure 6b. The red dotted line in the figure is 0.03 × ∑ . According to this threshold, IMF1, 2, and 6 are selected to reconstruct the signal.  Figure 7 shows the frequency spectrum of reconstructed signal. It can be noticed that noise in the signal reconstructed by K-adaptive VMD is effectively suppressed by comparing Figures 7a and 3f. The noise distributed between 30 Hz and 100 Hz and in the range greater than 100 Hz in the signal has been suppressed to a great extent after noise reduction. For further verification, the simulation signal is also decomposed and reconstructed by frequency band entropy-VMD (FBE-VMD) [10] and tentative VMD (TVMD) [12]. FBE-VMD, TVMD and K-adaptive VMD are comprised of two parts: determination of K and component selecting. In the first part for determination of K, FBE-VMD takes the kurtosis of IMF's envelope spectrum as the objective function to optimize K, and limits K in the range of 2-15. TVMD determines the optimal K by comparing the change characteristics of the maximum kurtosis of IMFs when the number of modes is K and K + 1 respectively. In the second part for component selecting, FBE-VMD operates FBE to judge the fault information in each component. TVMD determines the number of major components according to K at first. After that, take the component with the largest kurtosis as optimal component, reckon the distance between optimal component and the other components through dynamic time warping (DTW). The components with smaller distance are selected as the optimal components, and reconstruct the signal by averaging the optimal components. All of the experiments in this paper based on FBE-VMD and TVMD are set as α = 2000, τ = 0 and ε = 10 −7 .  As illustrated in Figure 3f, frequency spectrum of noisy signal has a lot of n terference. Calculating the ampcorr of original signal can reduce the white noise to a extent. Figure 6a shows the frequency spectrum of autocorrelation operation of o signal, several clear peaks correspond to the main frequency of original signal app The mode corresponding to the frequency peak can be accurately selected by calc the energy value of ampcorr in CFBs. Therefore, the original characteristics of signal hanced and the noise is reduced. The energy value En is shown in Figure 6b. T dotted line in the figure is 0.03 × ∑ . According to this threshold, IMF1, 2, an selected to reconstruct the signal.  Figure 7 shows the frequency spectrum of reconstructed signal. It can be notic noise in the signal reconstructed by K-adaptive VMD is effectively suppressed b paring Figures 7a and 3f. The noise distributed between 30 Hz and 100 Hz and in th greater than 100 Hz in the signal has been suppressed to a great extent after noise tion. For further verification, the simulation signal is also decomposed and recons by frequency band entropy-VMD (FBE-VMD) [10] and tentative VMD (TVMD) [12 VMD, TVMD and K-adaptive VMD are comprised of two parts: determination o component selecting. In the first part for determination of K, FBE-VMD takes the k of IMF's envelope spectrum as the objective function to optimize K, and limits K range of 2-15. TVMD determines the optimal K by comparing the change charact of the maximum kurtosis of IMFs when the number of modes is K and K + 1 respe In the second part for component selecting, FBE-VMD operates FBE to judge th information in each component. TVMD determines the number of major compone cording to K at first. After that, take the component with the largest kurtosis as o component, reckon the distance between optimal component and the other comp through dynamic time warping (DTW). The components with smaller distance lected as the optimal components, and reconstruct the signal by averaging the o components. All of the experiments in this paper based on FBE-VMD and TVMD as α = 2000, τ = 0 and ε = 10 −7 .  Figure 7 shows the frequency spectrum of reconstructed signal. It can be noticed that noise in the signal reconstructed by K-adaptive VMD is effectively suppressed by comparing Figures 3f and 7a. The noise distributed between 30 Hz and 100 Hz and in the range greater than 100 Hz in the signal has been suppressed to a great extent after noise reduction. For further verification, the simulation signal is also decomposed and reconstructed by frequency band entropy-VMD (FBE-VMD) [10] and tentative VMD (TVMD) [12]. FBE-VMD, TVMD and K-adaptive VMD are comprised of two parts: determination of K and component selecting. In the first part for determination of K, FBE-VMD takes the kurtosis of IMF's envelope spectrum as the objective function to optimize K, and limits K in the range of 2-15. TVMD determines the optimal K by comparing the change characteristics of the maximum kurtosis of IMFs when the number of modes is K and K + 1 respectively. In the second part for component selecting, FBE-VMD operates FBE to judge the fault information in each component. TVMD determines the number of major components according to K at first. After that, take the component with the largest kurtosis as optimal component, reckon the distance between optimal component and the other components through dynamic time warping (DTW). The components with smaller distance are selected as the optimal components, and reconstruct the signal by averaging the optimal components. All of the experiments in this paper based on FBE-VMD and TVMD are set as α = 2000, τ = 0 and ε = 10 −7 .
The number of decomposition layers computed by FBE-VMD is 15, and the selected components are IMF11 and IMF13; the optimal K estimated by TVMD is 3, and the selected component is IMF2. Figure 7b,c are the frequency spectrum of reconstructed signal decomposed and selected based on FBE-VMD and TVMD, respectively. The conclusion can be drawn that in the signal reconstructed by these two methods, many key frequencies of original signal are not effectively enhanced, but are lost instead. As demonstrated in Figure 7b, none of clear peaks appear at frequencies of 7.5 Hz in sinusoidal signal, 30 ± 5 Hz in FM signal and 100 Hz in AM signal. Moreover, the noise amplitude at high frequencies is relatively high. In Figure 7c, except for the weak amplitude at 100 Hz, the amplitude at other main frequencies is not clear, and noise also dominate the high frequency band. By comparing Figure 7a-c, it can be summarized that noise reduction effect of the proposed method is more pronounced than that of FBE-VMD and TVMD, which not only enhances the spectral amplitude at three groups of main frequencies, but also reduces most of noise. ure 7b, none of clear peaks appear at frequencies of 7.5 Hz in sinusoidal signal, 30 ± 5 Hz in FM signal and 100 Hz in AM signal. Moreover, the noise amplitude at high frequencies is relatively high. In Figure 7c, except for the weak amplitude at 100 Hz, the amplitude at other main frequencies is not clear, and noise also dominate the high frequency band. By comparing Figure 7a-c, it can be summarized that noise reduction effect of the proposed method is more pronounced than that of FBE-VMD and TVMD, which not only enhances the spectral amplitude at three groups of main frequencies, but also reduces most of noise. K-adaptive VMD has great noise reduction effect when the length of simulation signal is 1000. In order to further analyze the noise reduction effect of this method when the sample length changes, five more simulated signal in Equation (17) with 500, 800, 1200, 1500 and 2000 sample length respectively are analyzed. All these data are added with -6dB of noise. The noise reduction results obtained by K-adaptive VMD are shown in Table  1. We can draw the conclusion that the value of K is slightly different when data length changed, but both are greater than 30. Additionally, all of the results have accurately screened out the components corresponding to the three main frequencies. According to the SNR of reconstructed signal, it can be observed that the noise reduction effect is relatively stable with the change of data length, and maintains an excellent level. In the abovementioned results, the SNR of reconstructed signal is the lowest for the group of data with length of 2000, which is 3.77, but it is still significantly improved than −6 dB in the original signal. K-adaptive VMD has great noise reduction effect when the length of simulation signal is 1000. In order to further analyze the noise reduction effect of this method when the sample length changes, five more simulated signal in Equation (17) with 500, 800, 1200, 1500 and 2000 sample length respectively are analyzed. All these data are added with -6dB of noise. The noise reduction results obtained by K-adaptive VMD are shown in Table 1. We can draw the conclusion that the value of K is slightly different when data length changed, but both are greater than 30. Additionally, all of the results have accurately screened out the components corresponding to the three main frequencies. According to the SNR of reconstructed signal, it can be observed that the noise reduction effect is relatively stable with the change of data length, and maintains an excellent level. In the abovementioned results, the SNR of reconstructed signal is the lowest for the group of data with length of 2000, which is 3.77, but it is still significantly improved than −6 dB in the original signal.

Analysis Based on Nonlinear AM-FM Simulation Signal with Close Frequencies
In Section 4.1.1, the three frequencies in the AM-FM signal of are separated. This subsection will analyze the simulated signal f sig2 (t) composed of three components with relatively close frequencies, so as to verify the noise reduction and decomposition ability of K-adaptive VMD when the main frequencies of the signals are relatively close. The simulation signal f sig2 (t) is: (18) where t = 2 s. It can be perceived from Equation (18) that f sig2 (t) consist of three frequency components, which are 7.5 Hz, 20 Hz frequency modulated by 1 Hz and 30 Hz amplitude modulated by 1 Hz, respectively. Similarly, −6 dB Gaussian white noise also added in f sig2 (t). Figure 8 shows the waveforms of f sig2 (t).
( ) = sin 15 + 5 + cos(40 + sin10 ) + 1 + 0.3cos(10 ) sin(60 ) + ( ) where t = 2 s. It can be perceived from Equation (18) that ( ) consist of three frequency components, which are 7.5 Hz, 20 Hz frequency modulated by 1 Hz and 30 Hz amplitude modulated by 1 Hz, respectively. Similarly, -6dB Gaussian white noise also added in ( ). Figure 8 shows the waveforms of ( ). The noised signal ( ) is adaptively decomposed into 37 components by K-adaptive VMD. The corresponding change in the center frequency of IMFs and distribution of En is shown in Figure 9. It is clear in Figure 9b that the optimal IMFs obtained by K-adaptive VMD is IMF1, 2 and 3. The time domain waveforms of these three optimal components are shown in Figure 10a-c, in which the blue dotted line is the original signal waveform corresponding to the IMF. It can be summarized that these three components have good frequency restoration to the original signal. Furthermore, the waveform of IMF1 is very close to the sinusoidal waveform of the original signal. Although part of the amplitude of IMF2 is different from its corresponding FM component, the frequency change is consistent with the original signal, which is more important for FM signals. The amplitude variation trend of IMF3 The noised signal f sig2 (t) is adaptively decomposed into 37 components by K-adaptive VMD. The corresponding change in the center frequency of IMFs and distribution of En is shown in Figure 9.
added in ( ). Figure 8 shows the waveforms of ( ). The noised signal ( ) is adaptively decomposed into 37 components by K-ada tive VMD. The corresponding change in the center frequency of IMFs and distribution En is shown in Figure 9. It is clear in Figure 9b that the optimal IMFs obtained by K-adaptive VMD is IMF1 and 3. The time domain waveforms of these three optimal components are shown in F ure 10a-c, in which the blue dotted line is the original signal waveform corresponding the IMF. It can be summarized that these three components have good frequency resto tion to the original signal. Furthermore, the waveform of IMF1 is very close to the sinu oidal waveform of the original signal. Although part of the amplitude of IMF2 is differe from its corresponding FM component, the frequency change is consistent with the ori nal signal, which is more important for FM signals. The amplitude variation trend of IM It is clear in Figure 9b that the optimal IMFs obtained by K-adaptive VMD is IMF1, 2 and 3. The time domain waveforms of these three optimal components are shown in Figure 10a-c, in which the blue dotted line is the original signal waveform corresponding to the IMF. It can be summarized that these three components have good frequency restoration to the original signal. Furthermore, the waveform of IMF1 is very close to the sinusoidal waveform of the original signal. Although part of the amplitude of IMF2 is different from its corresponding FM component, the frequency change is consistent with the original signal, which is more important for FM signals. The amplitude variation trend of IMF3 is consistent with the AM signal as depicted in Figure 10c. Figure 10d is the frequency spectrum in double logarithmic coordinate of the reconstructed signal obtained by summing IMF1, 2 and 3. The amplitude of the reconstructed signal is very obvious at 7.5 Hz, 20 Hz and 30 Hz, meaning that noise in the signal is effectively suppressed. Additionally, FBE-VMD and TVMD are also used for noise reduction, and the results are shown in Figure 10e,f, respectively. The optimal number of K obtained by FBE-VMD according to the component kurtosis is 15, which is the maximum value of the optimization range defined by this method. In fact, when K gradually increases in the range of 2~15, the maximum kurtosis of the component shows an upward trend. If FBE-VMD does not predetermine the range of K, the optimal decomposition layers of VMD is likely to be greater than 15. After decomposition, IMF5, 6, 7, 11, 13 and 15 are screened out according to FBE. The spectrum of reconstructed signal is shown in Figure 10e. It can be observed that none of the three frequencies has a clear amplitude, and the reconstructed signal still contain many noise. TVMD decomposes the original signal into three components adaptively, and selects IMF3 as the optimal component. As shown in Figure 10f, TVMD also does not achieve effective noise reduction. the range of K, the optimal decomposition layers of VMD is likely to be greater than 15. After decomposition, IMF5, 6, 7, 11, 13 and 15 are screened out according to FBE. The spectrum of reconstructed signal is shown in Figure 10e. It can be observed that none of the three frequencies has a clear amplitude, and the reconstructed signal still contain many noise. TVMD decomposes the original signal into three components adaptively, and selects IMF3 as the optimal component. As shown in Figure 10f, TVMD also does not achieve effective noise reduction.

Analysis Based on Rolling Bearing Fault Simulation Signal
In this section, we will verify the noise reduction effect of K-adaptive VMD through a set of simulation signals of rolling bearing with inner race fault. The formula of simulation signal is: where s(t) is the periodic impulse component; amplitude A0 = 0.3; rotation frequency fr = 30 Hz; attenuation coefficient C = 700; resonance frequency fn = 4k Hz; ball pass frequency on inner race BPFI = 1/T = 120 Hz; n(t) is Gaussian white noise component with SNR = -13 dB; sampling frequency is set to 16k Hz; length of data is 4096. The time domain and frequency waveform of impulse signal s(t) and inner race fault simulation signal x(t) are shown in Figure 11. Comparing Figure 11a,b, it is found that

Analysis Based on Rolling Bearing Fault Simulation Signal
In this section, we will verify the noise reduction effect of K-adaptive VMD through a set of simulation signals of rolling bearing with inner race fault. The formula of simulation signal is: where s(t) is the periodic impulse component; amplitude A 0 = 0.3; rotation frequency f r = 30 Hz; attenuation coefficient C = 700; resonance frequency f n = 4k Hz; ball pass frequency on inner race BPFI = 1/T = 120 Hz; n(t) is Gaussian white noise component with SNR = −13 dB; sampling frequency is set to 16 kHz; length of data is 4096. The time domain and frequency waveform of impulse signal s(t) and inner race fault simulation signal x(t) are shown in Figure 11. Comparing Figure 11a,b, it is found that periodic pulse in simulation signal is completely submerged by noise, and fault characteristics cannot be found through time domain signal. Figure 11c shows the envelope spectrum of noised simulation signal x(t), and no prominent frequency components can be found. Besides, there is no clear peak at BPFI in the envelope spectrum (0-500 Hz) of Figure 11d.
By using the K-adaptive VMD, 35 is determined as the optimal K for rolling bearing inner race fault signal decomposition. The curve of central frequencies of all IMFs is displayed in Figure 12a. After acquiring the IMFs, En is calculated through the proposed IMF selecting method, the detail value of En can be found in Figure 12b. Since the signal in this section is a bearing fault signal, it is necessary to adopt the first item of prior knowledge mentioned in Section 3.1.2. The composition of simulation signal in this section is clear and there is no obvious interference frequency, so a priori input of interference frequency is not needed. In summary, the input of prior knowledge needs to set En corresponding to the 1st to 11th components to 0, and updated En is shown in Figure 12c. Same as Figure 6b, the red dotted line in Figure 12c is set at 0.03 × ∑En, and the blue column in histogram implies that the optimal IMFs with En value greater than the threshold. That is, the optimal components are selected as IMF15, 16, and 26 for reconstruction. The CFBs of IMF15 and 16 are 3921.88~3960.94 Hz and 4078.12~4109.37 Hz, respectively. CFBs of these two IMFs are the closest frequency bands to the resonance frequency 4k Hz in all IMFs. As can be seen from Figure 12c, En of these two components near the resonance frequency are much higher than other values. periodic pulse in simulation signal is completely submerged by noise, and fault charac teristics cannot be found through time domain signal. Figure 11c shows the envelope spec trum of noised simulation signal x(t), and no prominent frequency components can b found. Besides, there is no clear peak at BPFI in the envelope spectrum (0-500 Hz) of Fig  ure 11d. By using the K-adaptive VMD, 35 is determined as the optimal K for rolling bearing inner race fault signal decomposition. The curve of central frequencies of all IMFs is dis played in Figure 12a. After acquiring the IMFs, En is calculated through the proposed IM selecting method, the detail value of En can be found in Figure 12b. Since the signal in thi section is a bearing fault signal, it is necessary to adopt the first item of prior knowledg mentioned in Section 3.1.2. The composition of simulation signal in this section is clea and there is no obvious interference frequency, so a priori input of interference frequency is not needed. In summary, the input of prior knowledge needs to set En correspondin to the 1st to 11th components to 0, and updated En is shown in Figure 12c. Same as Figur 6b, the red dotted line in Figure 12c is set at 0.03 × ∑ , and the blue column in histogram implies that the optimal IMFs with En value greater than the threshold. That is, the opti mal components are selected as IMF15, 16, and 26 for reconstruction. The CFBs of IMF1 and 16 are 3921.88~3960.94 Hz and 4078.12~4109.37 Hz, respectively. CFBs of these tw IMFs are the closest frequency bands to the resonance frequency 4k Hz in all IMFs. As can be seen from Figure 12c, En of these two components near the resonance frequency ar much higher than other values.  By using the K-adaptive VMD, 35 is determined as the optimal K for rolling bearing inner race fault signal decomposition. The curve of central frequencies of all IMFs is displayed in Figure 12a. After acquiring the IMFs, En is calculated through the proposed IMF selecting method, the detail value of En can be found in Figure 12b. Since the signal in this section is a bearing fault signal, it is necessary to adopt the first item of prior knowledge mentioned in Section 3.1.2. The composition of simulation signal in this section is clear and there is no obvious interference frequency, so a priori input of interference frequency is not needed. In summary, the input of prior knowledge needs to set En corresponding to the 1st to 11th components to 0, and updated En is shown in Figure 12c. Same as Figure  6b, the red dotted line in Figure 12c is set at 0.03 × ∑ , and the blue column in histogram implies that the optimal IMFs with En value greater than the threshold. That is, the optimal components are selected as IMF15, 16, and 26 for reconstruction. The CFBs of IMF15 and 16 are 3921.88~3960.94 Hz and 4078.12~4109.37 Hz, respectively. CFBs of these two IMFs are the closest frequency bands to the resonance frequency 4k Hz in all IMFs. As can be seen from Figure 12c, En of these two components near the resonance frequency are much higher than other values. The reconstructed signal will be acquired after summing these three components. The envelope spectrum of reconstructed signal is shown in Figure 13a, in which clear peaks at BPFI and 2 × BPFI can be recognized. These are significant signs for inner race fault of rolling bearing. Through the comparison of signal envelope spectrums before and after noise reduction, it can be encapsulated that K-adaptive VMD has good noise reduction ability. At the same time, FBE-VMD and TVMD are introduced for the noise reduction effect comparison with K-adaptive VMD. IMF6, 8 and 9 are taken as the optimal components from fourteen IMFs by FBE-VMD. An optimal K of 5 is estimated by TVMD, and IMF3 and 4 are selected to reconstruct the new signal. Figure 13b,c are the envelope spectra of the reconstructed signals worked out by these two methods, respectively. There is no clear peak at fault characteristic frequency in Figure 13b, and the noise reduction effect is poor. Even though clear peaks at BPFI and 2 × BPFI appear in Figure 13c, there are more interference frequencies in the frequency band of 300-500 Hz. The envelope spectrum component of signal computed by K-adaptive VMD is simpler. Additionally, the amplitude value of BPFI in Figure 13a is much larger than that of Figure 13c

Verification of Noise Reduction Effect
In this section, different intensity of noise are added to the above two groups of simulation signals, and the K-adaptive VMD, FBE-VMD and TVMD algorithms are brought in to decompose and reconstruct simulation signal to verify noise reduction ability of these three methods under different noise levels.
Add noise to the to the AM-FM signals described in Section 4.1.1 to get noisy signal with 6 and −6dB SNR, respectively. Table 2 shows the noise reduction results of those three VMD methods. All experiments are conducted in the MATLAB (R2020b) installed in high performance computing platform of Beijing Jiaotong University with 48 core CPU. From the SNR of reconstructed signal in Table 2, K-adaptive VMD has the best noise reduction effect on these two signals with different noise levels. As revealed in Table 2, the K computed by K-adaptive VMD is 17 and 35 respectively, while the K obtained by the other two methods are 12, 15 and 3, 3, respectively. Because we do not set an upper limit of K, the optimal K reckoned by K-adaptive VMD is bigger than those of the comparison methods. This illustrate that setting an upper limit of K may not necessarily for optimizing it. We can draw the conclusion based on CFBs of the selected IMFs that K-adaptive VMD can accurately select the components close to the three main frequencies of simulation signal at 7, 30, and 100 Hz. However, central frequencies of the selected components using FBE-VMD and TVMD are quite different from those three main frequencies. Since FBE-VMD needs to manually filter optimal IMFs according to the change of the FBE, and does not realize adaptive decomposition and reconstruction, its running time cannot be counted, and the method is the most complicated. It can also be easily concluded that, K-adaptive VMD takes more time in two signals than TVMD and has a medium complexity. Although K-adaptive VMD has certain computational burden, the high performance computing platform widely used nowadays can be very good solution to his problem. The reconstructed signal will be acquired after summing these three components. The envelope spectrum of reconstructed signal is shown in Figure 13a, in which clear peaks at BPFI and 2 × BPFI can be recognized. These are significant signs for inner race fault of rolling bearing. Through the comparison of signal envelope spectrums before and after noise reduction, it can be encapsulated that K-adaptive VMD has good noise reduction ability. At the same time, FBE-VMD and TVMD are introduced for the noise reduction effect comparison with K-adaptive VMD. IMF6, 8 and 9 are taken as the optimal components from fourteen IMFs by FBE-VMD. An optimal K of 5 is estimated by TVMD, and IMF3 and 4 are selected to reconstruct the new signal. Figure 13b,c are the envelope spectra of the reconstructed signals worked out by these two methods, respectively. There is no clear peak at fault characteristic frequency in Figure 13b, and the noise reduction effect is poor. Even though clear peaks at BPFI and 2 × BPFI appear in Figure 13c, there are more interference frequencies in the frequency band of 300-500 Hz. The envelope spectrum component of signal computed by K-adaptive VMD is simpler. Additionally, the amplitude value of BPFI in Figure13a is much larger than that of Figure 13c.

Verification of Noise Reduction Effect
In this section, different intensity of noise are added to the above two groups of simulation signals, and the K-adaptive VMD, FBE-VMD and TVMD algorithms are brought in to decompose and reconstruct simulation signal to verify noise reduction ability of these three methods under different noise levels.
Add noise to the to the AM-FM signals described in Section 4.1.1 to get noisy signal with 6dB and −6dB SNR, respectively. Table 2 shows the noise reduction results of those three VMD methods. All experiments are conducted in the MATLAB (R2020b) installed in high performance computing platform of Beijing Jiaotong University with 48 core CPU. From the SNR of reconstructed signal in Table 2, K-adaptive VMD has the best noise reduction effect on these two signals with different noise levels. As revealed in Table 2, the K computed by K-adaptive VMD is 17 and 35 respectively, while the K obtained by the other two methods are 12, 15 and 3, 3, respectively. Because we do not set an upper limit of K, the optimal K reckoned by K-adaptive VMD is bigger than those of the comparison methods. This illustrate that setting an upper limit of K may not necessarily for optimizing it. We can draw the conclusion based on CFBs of the selected IMFs that K-adaptive VMD can accurately select the components close to the three main frequencies of simulation signal at 7, 30, and 100 Hz. However, central frequencies of the selected components using FBE-VMD and TVMD are quite different from those three main frequencies. Since FBE-VMD needs to manually filter optimal IMFs according to the change of the FBE, and does not realize adaptive decomposition and reconstruction, its running time cannot be counted, and the method is the most complicated. It can also be easily concluded that, Kadaptive VMD takes more time in two signals than TVMD and has a medium complexity. Although K-adaptive VMD has certain computational burden, the high performance computing platform widely used nowadays can be very good solution to his problem.  A noisy signal of inner race bearing fault is added on the basis of the simulation signal in Section 4.1.3 to verify the performance of three methods when the vibration signal of rolling bearing is at different noise levels. Since the noise content of the simulation signal in Section 4.1.3 is −13 dB and the SNR is relatively low, the SNR of the new signal is set slightly higher, which is −6 dB. The above three methods are utilized to reduce noise of these two sets of rolling bearing inner race fault simulation signals with different noise, and the results are shown in Table 3. Similar to the results of AM-FM signal, the K optimized by K-adaptive VMD is much bigger than those of the other two methods. It is clear in Table 3 that the noise reduction performance of the proposed method is much better than that of FBE-VMD. For the signal with −6 dB noise, although the SNR after noise reduction using K-adaptive VMD is slight lower than TVMD, the noise has also been significantly suppressed. For signals with higher noise level, SNR = −13 dB, K-adaptive VMD achieves better noise suppression effect than TVMD.

Analysis and Verification of RBF-FuzzyEn
The purpose of this section is to verify the influence of noise and data length of bearing vibration signal in different fault types on feature discrimination of RBF-FuzzyEn. On the basis of inner race bearing fault signal in Section 4.1.3, an outer race bearing fault simulation signal is added in this section, and the formula is as follows: where amplitude A = 0.3; rotation frequency f r = 30 Hz; attenuation coefficient C = 200; resonance frequency f n = 4 kHz; ball pass frequency on outer race BPFO = 1/T = 80 Hz; sampling frequency is set as 16 kHz; the number of analysis points is 4096; and n(t) is Gaussian white noise.

Analysis of the Distribution for Maximum Absolute Distance
The maximum absolute distance d ij is the independent variable of fuzzy function which directly determines the value of fuzzy entropy. Thus, understanding the distribution of d ij is of great consequence for the analysis of RBF-FuzzyEn.  Figure 14. Figure 14a,b show all the d ij . The abscissa is (i = 1, j = 2~N − m + 1), . . . , (i = N − m + 1, j = 1~N − m) successively, and ordinate represents the corresponding maximum distance value. Figure 14c,d show d ij when i = 1 and j = 2~N − m + 1. Figure 14a,b reveal that d ij also has the same periodic characteristics as fault impulse in the whole range which we will call it big cycle. In addition, d ij shows the same cycle as the fault impulse in a small range from inspecting Figure 14c,d, which can be called small cycle. Moreover, whether d ij is in big cycle or small cycle, the amplitude modulation phenomenon appears, which share the same characteristics with inner race fault signal.
For the purpose of observing the distribution of maximum absolute distance matrix d ij for bearing signals in two different states, K-adaptive VMD is implemented to reduce the noise of these two groups of signal. Then extract the fuzzy entropy feature from denoised signal. The points of d ij for the two groups of data are the same. The number of d m ij is 16,764,930, and the number of d m+1 ij is 16,756,742. The d ij is counted at a resolution of 2000 points to figure out the distribution of d ij for two states, as presented in Figure 15. The abscissa means the value of d m ij or d m+1 ij for inner race fault and outer race fault. The ordinate represents the total numbers of d ij which is abbreviated as Numd.
As can be noticed from Figure 15, the distance matrix distribution of the bearing signals in two states are quite different. The distance distribution of outer race fault is more scattered than that of inner race fault, indicating that the self-similarity of outer race fault data is lower than that of inner race fault data. For outer race fault signal, the distribution curve of distance matrix is smoother than inner race fault signal no matter for d m ij or d m+1 ij . The standard deviation of these two signals' d m ij , 0.18 and 0.25 respectively, indicating that d m ij of the outer race fault signal has more dispersed distribution. For the same state, the distance distribution when m = 3 is more dispersed than m = 2. It can also be revealed from the statistical figure that the maximum Numd of d m ij is close to zero. Thus, the Numd of d m ij decline gradually for both inner and outer race fault data. While, the distribution curves for d m+1 ij show an upward trend and then a downward trend. Based on the above analysis, the distance d ij of bearing vibration signals under various fault types shows quite different distribution. Therefore, a constant r value, that is, the same similarity function, can be executed to estimate the entropy value to characterize the states of different data. According to d ij properties of the above two state data, it can be noticed that the distribution of outer race fault data is broader. Therefore, r is calculated by d m ij of the outer race fault simulation signal, and r = 0.2065 is worked out.
the distance matrix and of the periodic fault impulse signal s(t) based on Equation (8). The number of points for calculated from each set of data is (N-m) × (N-m + 1), where N is the length of data. The simulation signal employed in this section is 4096 points, so the number of and are 16764930 and 16756742, respectively. The values of and are shown in Figure 14. Figure 14a,b show all the dij. The abscissa is (i = 1, j = 2~N-m + 1), …, (i = N-m + 1, j = 1~N-m) successively, and ordinate represents the corresponding maximum distance value. Figure 14c,d show dij when i = 1 and j = 2~N-m + 1. Figure 14a,b reveal that dij also has the same periodic characteristics as fault impulse in the whole range which we will call it big cycle. In addition, dij shows the same cycle as the fault impulse in a small range from inspecting Figure 14c,d, which can be called small cycle. Moreover, whether dij is in big cycle or small cycle, the amplitude modulation phenomenon appears, which share the same characteristics with inner race fault signal. For the purpose of observing the distribution of maximum absolute distance matrix dij for bearing signals in two different states, K-adaptive VMD is implemented to reduce the noise of these two groups of signal. Then extract the fuzzy entropy feature from denoised signal. The points of dij for the two groups of data are the same. The number of is 16,764,930, and the number of is 16,756,742. The dij is counted at a resolution of 2000 points to figure out the distribution of dij for two states, as presented in Figure 15. The abscissa means the value of or for inner race fault and outer race fault. The ordinate represents the total numbers of dij which is abbreviated as Numd. As can be noticed from Figure 15, the distance matrix distribution of the bearing sig nals in two states are quite different. The distance distribution of outer race fault is mor scattered than that of inner race fault, indicating that the self-similarity of outer race faul data is lower than that of inner race fault data. For outer race fault signal, the distributio curve of distance matrix is smoother than inner race fault signal no matter for o . The standard deviation of these two signals' , 0.18 and 0.25 respectively, indicat ing that of the outer race fault signal has more dispersed distribution. For the sam state, the distance distribution when m = 3 is more dispersed than m = 2. It can also b revealed from the statistical figure that the maximum Numd of is close to zero. Thus the Numd of decline gradually for both inner and outer race fault data. While, th distribution curves for show an upward trend and then a downward trend. Based on the above analysis, the distance dij of bearing vibration signals under various fault type shows quite different distribution. Therefore, a constant r value, that is, the same similar ity function, can be executed to estimate the entropy value to characterize the states o

Analysis and Verification under Different Noise Level
We add noise to inner and outer race fault simulation signals, with the SNR ranging from 20dB to -20dB with step -1dB to evaluate the relationship between RBF-FuzzyEn and noise. Besides RBF-FuzzyEn, two different fuzzy entropy calculation methods [23,24], approximate entropy (ApEn) [21], sample entropy (SampEn) [22] and multiscale fuzzy entropy (MFE) [27] are compared at the same time. The fuzzy function in paper [23] is f 1 d m ij = exp − d m ij n /r , and in [24] is Compared with the function proposed in this paper, there is one more parameter n in these two fuzzy functions. According to papers [23,24], m = 2, n = 2, and r = 0.2 × σ (data). FuzzyEn1 and FuzzyEn2 are applied to represent the entropy worked out by methods in paper [23,24], respectively. ApEn and SampEn both have the parameters of m and r, which are set the same as FuzzyEn1 and FuzzyEn2, that is, m = 2, and r = 0.2 × σ (data). With regard to MFE, we set m = 2, n = 2, r = 0.15 × σ (data) and the scale factor τ = 20 according to paper [27]. For RBF-FuzzyEn and FuzzyEn1, the entropy differentiation between inner race and outer race fault signals is remarkable despite the increase of noise as described in Figure  16a,b In addition, the two curves of inner and outer race fault data maintain the same trend while SNR is reducing. Moreover, RBF-FuzzyEn achieve the most apparent differentiation when noise content stay at the same level by comparing with FuzzyEn1. The entropy difference between inner and outer race fault signals under different noise levels are calculated. The average value of them for RBF-FuzzyEn and FuzzyEn1 are 0.4826 and 0.3480, respectively, which means that the differentiation of RBF-FuzzyEn is better than FuzzyEn1. Different from Figure 16b, the change of RBF-FuzzyEn is small with the decrease of SNR. It is clear in Figure 16c that FuzzyEn2 of the two fault types signals close to each other gradually with the increase of noise content in the signal, blurring the distinction between two states. And the curves begin to overlap when SNR is less than −6dB, demonstrating that FuzzyEn2 is susceptible to noise. The entropy curves of ApEn and SampEn are very similar, and the entropies of different states are almost mixed together. Only when SNR is within the range of 0~10 dB, the entropy obtained by ApEn and SampEn has small distinction between inner and outer race faults. With regard to MFE, the difference between two states is very small, which can not effectively represent the bearing state. All the above comparisons reach a conclusion that the RBF-FuzzyEn possesses strong noise robustness.

Analysis and Verification with Various Data Length
Fuzzy entropy has the advantage of low dependence on data length N. To analyze and verify the relationship between RBF-FuzzyEn and data length, this section generates fault simulation signals of inner and outer race fault of rolling bearings with different data lengths based on Equations (19) and (20). The data length N range from 800 to 8000 with a step length of 800. Each state is composed of 10 groups of data, a total of 20 groups of data. Noise is added to each group of data, making the SNR of all data set equal to −13dB. First, K-adaptive VMD is implemented on these 20 groups of data to reduce the noise to some extent. Then, calculating fuzzy entropy of the denoised signal based on RBF-FuzzyEn, Fuzzyen1, Fuzzyen2, ApEn, SampEn and MFE method successively. Figure 17 shows the variation of entropy of five calculation methods when N takes different values. For RBF-FuzzyEn and FuzzyEn1, the entropy differentiation between inner race and outer race fault signals is remarkable despite the increase of noise as described in Figure 16a,b In addition, the two curves of inner and outer race fault data maintain the same trend while SNR is reducing. Moreover, RBF-FuzzyEn achieve the most apparent differentiation when noise content stay at the same level by comparing with FuzzyEn1. The entropy difference between inner and outer race fault signals under different noise levels are calculated. The average value of them for RBF-FuzzyEn and FuzzyEn1 are 0.4826 and 0.3480, respectively, which means that the differentiation of RBF-FuzzyEn is better than FuzzyEn1. Different from Figure 16b, the change of RBF-FuzzyEn is small with the decrease of SNR. It is clear in Figure 16c that FuzzyEn2 of the two fault types signals close to each other gradually with the increase of noise content in the signal, blurring the distinction between two states. And the curves begin to overlap when SNR is less than −6dB, demonstrating that FuzzyEn2 is susceptible to noise. The entropy curves of ApEn and SampEn are very similar, and the entropies of different states are almost mixed together. Only when SNR is within the range of 0~10 dB, the entropy obtained by ApEn and SampEn has small distinction between inner and outer race faults. With regard to MFE, the difference between two states is very small, which can not effectively represent the bearing state. All the above comparisons reach a conclusion that the RBF-FuzzyEn possesses strong noise robustness.

Analysis and Verification with Various Data Length
Fuzzy entropy has the advantage of low dependence on data length N. To analyze and verify the relationship between RBF-FuzzyEn and data length, this section generates fault simulation signals of inner and outer race fault of rolling bearings with different data lengths based on Equations (19) and (20). The data length N range from 800 to 8000 with a step length of 800. Each state is composed of 10 groups of data, a total of 20 groups of data. Noise is added to each group of data, making the SNR of all data set equal to −13dB. First, K-adaptive VMD is implemented on these 20 groups of data to reduce the noise to some extent. Then, calculating fuzzy entropy of the denoised signal based on RBF-FuzzyEn, Fuzzyen1, Fuzzyen2, ApEn, SampEn and MFE method successively. Figure 17 shows the variation of entropy of five calculation methods when N takes different values. As depicted in Figure 17, the entropy values estimated though RBF-FuzzyEn and FuzzyEn1 show good stability with the change of data length N. And the distinction of fuzzy entropy calculated based on data in different states is always significant. However, the entropy of inner and outer race fault data computed by FuzzyEn2, ApEn and SampEn vary greatly. Furthermore, the entropy of two states are very close when N is 800 and 1600 for FuzzyEn2. As for ApEn, the entropy can be hardly distinguished under 1600 length of data. Although the MFE curve is very stable with the change of N, the two curves almost overlap in the whole range of N and the feature discrimination is very poor. The mean values of fuzzy entropy difference between inner and outer race faults are 0.4534, 0.3253, 0.0334, 0.0456, 0.0484 and 0.0085, respectively. Obviously, RBF-FuzzyEn has higher feature discrimination than FuzzyEn1, and both of them are much greater than FuzzyEn2, ApEn, SampEn and MFE. To sum up, RBF-FuzzyEn has the advantages of low data length dependence and excellent distinction compared with FuzzyEn1, FuzzyEn2, ApEn, SampEn and MFE, which is more suitable to distinguish the bearing state by experimental vibration signal.

Case 1: Bearing Data of CWRU
The experimental data of rolling bearing failures introduced in this section are derived from the bearing dataset of Case Western Reserve University (CWRU) [28]. The sampling frequency of selected data sets is 12k Hz, and all of them contain three accelerometers which are located on motor drive end, motor fan end and bench base. Signal collected by sensor located on the base of bench contains high noise due to the influence of transmission path and complicated working environment. Compared with the signal collected from drive end and fan end, the base sensor signal has lower SNR, making noise reduction more difficult. In this section, two sets of base signals in drive end bearing fault dataset are selected to verify the noise reduction performance of K-adaptive VMD. Details of the data are itemized in Table 4.  As depicted in Figure 17, the entropy values estimated though RBF-FuzzyEn and FuzzyEn1 show good stability with the change of data length N. And the distinction of fuzzy entropy calculated based on data in different states is always significant. However, the entropy of inner and outer race fault data computed by FuzzyEn2, ApEn and SampEn vary greatly. Furthermore, the entropy of two states are very close when N is 800 and 1600 for FuzzyEn2. As for ApEn, the entropy can be hardly distinguished under 1600 length of data. Although the MFE curve is very stable with the change of N, the two curves almost overlap in the whole range of N and the feature discrimination is very poor. The mean values of fuzzy entropy difference between inner and outer race faults are 0.4534, 0.3253, 0.0334, 0.0456, 0.0484 and 0.0085, respectively. Obviously, RBF-FuzzyEn has higher feature discrimination than FuzzyEn1, and both of them are much greater than FuzzyEn2, ApEn, SampEn and MFE. To sum up, RBF-FuzzyEn has the advantages of low data length dependence and excellent distinction compared with FuzzyEn1, FuzzyEn2, ApEn, SampEn and MFE, which is more suitable to distinguish the bearing state by experimental vibration signal.

Case 1: Bearing Data of CWRU
The experimental data of rolling bearing failures introduced in this section are derived from the bearing dataset of Case Western Reserve University (CWRU) [28]. The sampling frequency of selected data sets is 12k Hz, and all of them contain three accelerometers which are located on motor drive end, motor fan end and bench base. Signal collected by sensor located on the base of bench contains high noise due to the influence of transmission path and complicated working environment. Compared with the signal collected from drive end and fan end, the base sensor signal has lower SNR, making noise reduction more difficult. In this section, two sets of base signals in drive end bearing fault dataset are selected to verify the noise reduction performance of K-adaptive VMD. Details of the data are itemized in Table 4. Each group of data was analyzed with 6000 points, that is, 0.5 s. Figure 18 shows the amp corr of data set 169 and 158. The frequency bands indicated by red box in Figure 18a,b contain clear Entropy 2022, 24, 197 21 of 29 frequency peaks located at 4140 Hz along with sidebands at intervals of 60 Hz which is twice of motor rotational frequency. Thus, these frequencies appearing in the experiments of different bearing states and different working conditions are comprised by motor rotation frequency. Because the peaks at these interference frequency are apparent, the corresponding En of CFBs containing these frequency components will also be considerable when calculating the energy of amp corr in CFB. This will result in failure to select the components containing fault information. Based on this prior knowledge, En corresponding to these interference frequencies should be set to 0 when the method proposed in this paper is applied to experimental signal of case 1.
Each group of data was analyzed with 6000 points, that is, 0.5 s. Figure 18 show ampcorr of data set 169 and 158. The frequency bands indicated by red box in Figure contain clear frequency peaks located at 4140 Hz along with sidebands at intervals Hz which is twice of motor rotational frequency. Thus, these frequencies appearing i experiments of different bearing states and different working conditions are comp by motor rotation frequency. Because the peaks at these interference frequency are ap ent, the corresponding En of CFBs containing these frequency components will al considerable when calculating the energy of ampcorr in CFB. This will result in failu select the components containing fault information. Based on this prior knowledg corresponding to these interference frequencies should be set to 0 when the method posed in this paper is applied to experimental signal of case 1.  Analyze data through K-adaptive VMD, the dataset 169 is decomposed into thi IMFs. The time domain signal of all IMFs is presented in Figure 20. Table 5 lists the of all IMFs. The low-frequency components in these 13 IMFs are IMF1~3. The time do waveform of IMF3 has very obvious impulse which appeared15 times in the 0.5 s si corresponding to the rotational frequency. The amplitude changes of IMF9 and IMF time domain are relatively similar. All of them is about 60Hz, which is twice the rotat ampcorr of data set 169 and 158. The frequency bands indicated by red box in Figure contain clear frequency peaks located at 4140 Hz along with sidebands at intervals Hz which is twice of motor rotational frequency. Thus, these frequencies appearing i experiments of different bearing states and different working conditions are comp by motor rotation frequency. Because the peaks at these interference frequency are ap ent, the corresponding En of CFBs containing these frequency components will al considerable when calculating the energy of ampcorr in CFB. This will result in failu select the components containing fault information. Based on this prior knowledg corresponding to these interference frequencies should be set to 0 when the method posed in this paper is applied to experimental signal of case 1. Analyze data through K-adaptive VMD, the dataset 169 is decomposed into thi IMFs. The time domain signal of all IMFs is presented in Figure 20. Table 5 lists the of all IMFs. The low-frequency components in these 13 IMFs are IMF1~3. The time do waveform of IMF3 has very obvious impulse which appeared15 times in the 0.5 s si corresponding to the rotational frequency. The amplitude changes of IMF9 and IMF time domain are relatively similar. All of them is about 60Hz, which is twice the rotat Analyze data through K-adaptive VMD, the dataset 169 is decomposed into thirteen IMFs. The time domain signal of all IMFs is presented in Figure 20. Table 5 lists the CFBs of all IMFs. The low-frequency components in these 13 IMFs are IMF1~3. The time domain waveform of IMF3 has very obvious impulse which appeared15 times in the 0.5 s signal, corresponding to the rotational frequency. The amplitude changes of IMF9 and IMF10 in time domain are relatively similar. All of them is about 60 Hz, which is twice the rotational frequency. It is also revealed in Table 5 that the CFBs of these two components both contain the interference frequencies analyzed above. Moreover, IMF11 also contains the interference frequencies. In addition, En of IMF1-IMF[K/3], i.e., En of the 1st to the 4th component should also be set to 0 according the prior knowledge one mentioned in 3.1.2.
frequency. It is also revealed in Table 5 that the CFBs of these two components both contain the interference frequencies analyzed above. Moreover, IMF11 also contains the interference frequencies. In addition, En of IMF1-IMF[K/3], i.e., En of the 1st to the 4th component should also be set to 0 according the prior knowledge one mentioned in 3.1.2.  Set the En of IMF1, 2, 3, 4, 9, 10, 11 (as displayed in the red column diagram in Figure  21a) to 0, and get the updated En distribution as presented in Figure 21b. According to the updated En, IMF5, 6, 7, 8 and 12 are selected. By summing all the selected IMFs, the reconstructed signal is obtained, and its envelope spectrum is shown in Figure 22a. It is obvious that K-adaptive VMD can suppress noise and enhance fault characteristics effectively by comparing Figures 22a and 19b. Figure 22b shows the envelope spectrum of reconstructed signal worked out by FBE-VMD.  Set the En of IMF1, 2, 3, 4, 9, 10, 11 (as displayed in the red column diagram in Figure 21a) to 0, and get the updated En distribution as presented in Figure 21b. According to the updated En, IMF5, 6, 7, 8 and 12 are selected.  Set the En of IMF1, 2, 3, 4, 9, 10, 11 (as displayed in the red column diagram in F 21a) to 0, and get the updated En distribution as presented in Figure 21b. According t updated En, IMF5, 6, 7, 8 and 12 are selected. By summing all the selected IMFs, the reconstructed signal is obtained, and its e lope spectrum is shown in Figure 22a. It is obvious that K-adaptive VMD can sup noise and enhance fault characteristics effectively by comparing Figures 22a and 19b ure 22b shows the envelope spectrum of reconstructed signal worked out by FBE-V By summing all the selected IMFs, the reconstructed signal is obtained, and its envelope spectrum is shown in Figure 22a. It is obvious that K-adaptive VMD can suppress noise and enhance fault characteristics effectively by comparing Figures 19b and 22a. Figure 22b shows the envelope spectrum of reconstructed signal worked out by FBE-VMD. The number of modes K is 14, and IMF5 and 7 are selected as the optimal components. Figure 22c is the envelope spectrum of reconstructed signal computed by TVMD, with mode number K = 3, and IMF2 is chosen as optimal component. Comparing Figure 22a-c, it can be summarized that the noise reduction effect of K-adaptive VMD is better than the other two methods. Although the envelope spectrum of the reconstructed signal Entropy 2022, 24, 197 23 of 29 acquired by FBE-VMD contains BPFI and corresponding sidebands, peaks located at 2 × BPFI and its sidebands are not existent. In the envelope spectrum of signal reconstructed by TVMD, only BPFI appears with small amplitude, and there is no 2 × BPFI peak and sideband either. While, the envelope spectrum of signal reconstructed by K-adaptive VMD contains remarkable BPFI, 2 × BPFI and corresponding sidebands.
The number of modes K is 14, and IMF5 and 7 are selected as the optimal components. Figure 22c is the envelope spectrum of reconstructed signal computed by TVMD, with mode number K = 3, and IMF2 is chosen as optimal component. Comparing Figure 22a-c, it can be summarized that the noise reduction effect of K-adaptive VMD is better than the other two methods. Although the envelope spectrum of the reconstructed signal acquired by FBE-VMD contains BPFI and corresponding sidebands, peaks located at 2 × BPFI and its sidebands are not existent. In the envelope spectrum of signal reconstructed by TVMD, only BPFI appears with small amplitude, and there is no 2 × BPFI peak and sideband either. While, the envelope spectrum of signal reconstructed by K-adaptive VMD contains remarkable BPFI, 2 × BPFI and corresponding sidebands.  Figure 23 indicated that fault impact are not clear in time domain waveform due to noise. In the envelope spectrum, there exist small peaks at the fault characteristic frequency and its harmonic. Through K-adaptive VMD, the optimal number of decomposition components is determined to be 16 firstly, corresponding time domain waveforms and CFBs are itemized in Figure 24 and Table 6, respectively. The first five components are considered as low-frequency components according to prior knowledge. They are not used to reconstruct the signal because they generally consist of rotational frequency. The time domain waveforms of IMF3 and IMF4 in Figure 24 have relatively similar periodic shocks of about 30 Hz, that is, the rotational frequency. Among the high-frequency components, IMF12 and IMF13 have the same periodicity of 60 Hz, which is consistent with the prior knowledge two put forward in Section 3.1.2 and the interference frequencies in Figure 18b. These analyses both verify the validity of the two prior knowledge in Section 3.1.2. Vibration signal from bench base sensor contained in the 158th data set is taken as example to test and verify the effect of K-adaptive VMD for bearing outer race fault. Figure 23 shows time domain waveform and envelope spectrum of 0.5 s data selected from this data set.
it can be summarized that the noise reduction effect of K-adaptive VMD is better than t other two methods. Although the envelope spectrum of the reconstructed signal acquir by FBE-VMD contains BPFI and corresponding sidebands, peaks located at 2 × BPFI an its sidebands are not existent. In the envelope spectrum of signal reconstructed by TVM only BPFI appears with small amplitude, and there is no 2 × BPFI peak and sideband ther. While, the envelope spectrum of signal reconstructed by K-adaptive VMD contai remarkable BPFI, 2 × BPFI and corresponding sidebands. Vibration signal from bench base sensor contained in the 158th data set is taken example to test and verify the effect of K-adaptive VMD for bearing outer race fault. F ure 23 shows time domain waveform and envelope spectrum of 0.5 s data selected fro this data set.  In the envelope spectrum, there exist small peaks at the fault characteristic fr quency and its harmonic. Through K-adaptive VMD, the optimal number of decompo tion components is determined to be 16 firstly, corresponding time domain waveform and CFBs are itemized in Figure 24 and Table 6, respectively. The first five componen are considered as low-frequency components according to prior knowledge. They are n used to reconstruct the signal because they generally consist of rotational frequency. T time domain waveforms of IMF3 and IMF4 in Figure 24 have relatively similar period shocks of about 30 Hz, that is, the rotational frequency. Among the high-frequency com ponents, IMF12 and IMF13 have the same periodicity of 60 Hz, which is consistent wi the prior knowledge two put forward in Section 3.1.2 and the interference frequencies  In the envelope spectrum, there exist small peaks at the fault characteristic frequency and its harmonic. Through K-adaptive VMD, the optimal number of decomposition components is determined to be 16 firstly, corresponding time domain waveforms and CFBs are itemized in Figure 24 and Table 6, respectively. The first five components are considered as low-frequency components according to prior knowledge. They are not used to reconstruct the signal because they generally consist of rotational frequency. The time domain waveforms of IMF3 and IMF4 in Figure 24 have relatively similar periodic shocks of about 30 Hz, that is, the rotational frequency. Among the high-frequency components, IMF12 and IMF13 have the same periodicity of 60 Hz, which is consistent with the prior knowledge two put forward in Section 3.1.2 and the interference frequencies in Figure 18b. These analyses both verify the validity of the two prior knowledge in Section 3.1.2.
Based on the above mentioned analysis, the En of IMF1~IMF5 and IMF10~14 should also be set to 0 based on prior knowledge. Figure 25 illustrate the En value before and after updating. Finally, IMF6,7,8,9 and 15 are selected for reconstruction.
The envelope spectrum of reconstructed signal acquired by summing the above selected components is shown in Figure 26a. Clear peaks at BPFO and its 2×~4× harmonics can be easily found in Figure 26a, and interference frequency components have been suppressed comparing with Figure 23b. In FBE-VMD, IMF8, 11, and 12 out of 12 components are chosen for signal reconstruction, its envelope spectrum is present in Figure 26b. Meanwhile, IMF2 is taken as the optimal component in TVMD method, while K = 3. Figure 26c shows the envelope spectrum of IMF2 evaluated by TVMD. By comparing Figure 26a-c, the conclusion can be drawn that K-adaptive VMD is superior to FBE-VMD and TVMD in terms of noise reduction and characteristic enhancement performance.  Based on the above mentioned analysis, the En of IMF1~IMF5 and IMF10~14 should also be set to 0 based on prior knowledge. Figure 25 illustrate the En value before and after updating. Finally, IMF6, 7, 8, 9 and 15 are selected for reconstruction. The envelope spectrum of reconstructed signal acquired by summing the above selected components is shown in Figure 26a. Clear peaks at BPFO and its 2× ~4 × harmonics can be easily found in Figure 26a, and interference frequency components have been suppressed comparing with Figure 23b. In FBE-VMD, IMF8, 11, and 12 out of 12 components are chosen for signal reconstruction, its envelope spectrum is present in Figure 26b. Meanwhile, IMF2 is taken as the optimal component in TVMD method, while K = 3. Figure 26c shows the envelope spectrum of IMF2 evaluated by TVMD. By comparing Figure 26a-c, the conclusion can be drawn that K-adaptive VMD is superior to FBE-VMD and TVMD in terms of noise reduction and characteristic enhancement performance.   Based on the above mentioned analysis, the En of IMF1~IMF5 and IMF10~14 shou also be set to 0 based on prior knowledge. Figure 25 illustrate the En value before and af updating. Finally, IMF6, 7, 8, 9 and 15 are selected for reconstruction. The envelope spectrum of reconstructed signal acquired by summing the above lected components is shown in Figure 26a. Clear peaks at BPFO and its 2× ~4 × harmon can be easily found in Figure 26a, and interference frequency components have been su pressed comparing with Figure 23b. In FBE-VMD, IMF8, 11, and 12 out of 12 componen are chosen for signal reconstruction, its envelope spectrum is present in Figure 26b. Mea while, IMF2 is taken as the optimal component in TVMD method, while K = 3. Figure 2 shows the envelope spectrum of IMF2 evaluated by TVMD. By comparing Figure 26a the conclusion can be drawn that K-adaptive VMD is superior to FBE-VMD and TVM in terms of noise reduction and characteristic enhancement performance.

Feature Extraction
In this section, two sets of reconstruction signals estimated by K-adaptive VMD in Section 5.1.1 and a set of normal bearing vibration signal from CWRU dataset are used to test and verify the proposed fuzzy entropy method in feature extraction. Due to bench base data under normal conditions are not applied in the dataset, we select vibration signal from fan end sensor contained in the 97th dataset as an alternative, corresponding rotational speed is 1796 rpm. The length of these three groups of data also be set to 6000 points. We employed N, IF and OF as abbreviations for normal, inner race fault and outer  In this section, two sets of reconstruction signals estimated by K-adaptive VMD in Section 5.1.1 and a set of normal bearing vibration signal from CWRU dataset are used to test and verify the proposed fuzzy entropy method in feature extraction. Due to bench base data under normal conditions are not applied in the dataset, we select vibration signal from fan end sensor contained in the 97th dataset as an alternative, corresponding rotational speed is 1796 rpm. The length of these three groups of data also be set to 6000 points. We employed N, IF and OF as abbreviations for normal, inner race fault and outer race bearing fault, respectively.
The standard deviations of d m ij for these three data are 0.0159, 0.0112 and 0.0070, respectively. For the reason that the standard deviation of N is the largest among them, d m ij of normal bearing is substituted into Equation (16), and the parameter r of similarity function is calculated to be 0.0131. According to Equations (10), (11) and (15), fuzzy entropy of N, IF and OF data can be computed as 1.0523, 0.5963 and 0.4784, respectively. It can be observed that the entropy of normal bearing signal is much bigger than that of fault bearing signal, which conform to the characteristics of entropy and the vibration mechanism of bearing. The spalling of bearing will cause periodic impact, which will enhance the certainty of signal's distribution in time domain. Therefore, the entropy value of signal decreases. Furthermore, distinction on entropy among three groups of data is also very obvious. For the purpose of verifying effectiveness of RBF-FuzzyEn, this section compared the fuzzy entropy of FuzzyEn1 and FuzzyEn2. The fuzzy entropy computed based on these three methods are recorded in Table 7. According to the characteristics of five methods, the r of each state of data in FuzzyEn1, FuzzyEn2, ApEn, SampEn and MFE is different, and they are all related to the distribution of data. We can observe from Table 7 that, for the same bearing data, FuzzyEn1, FuzzyEn2, ApEn and SampEn share the same r value, because all of those methods estimate r with the formula of 0.2 × σ (data). Nevertheless, RBF-FuzzyEn utilizes the same r for different groups of data. It can also be concluded from Table 7 that the entropy values of three states are all around 0.1. Meanwhile, the entropy values between N and IF judged by FuzzyEn1 are very close, with a difference of only 0.0011, resulting in poor distinguishing between different bearing states. Considering FuzzyEn2, the three entropy values are all above 1. Contrary to common sense, fuzzy entropy of faults signals are greater than that of normal signal, which also cause a failure distinguish on bearing states. The method of determining r according to the standard deviation in FuzzyEn1 and FuzzyEn2 leads to unique fuzzy function in each group of data. In RBF-FuzzyEn, r is judged according to normal bearing data. So, three sets of data share the same r, that is, the same fuzzy function. To sum up, RBF-FuzzyEn is superior to the other two methods in terms of bearing states distinguishing ability based on entropy features and good consistency between entropy values and bearing fault state. The results of ApEn and SampEn are similar to FuzzyEn2, and the entropy of bearing fault data is greater than that of normal data. Although the normal bearing entropy obtained by the MFE method is relatively large, the entropy of IF and OF are very close with a difference of only 0.0331, which is much less than 0.1179 of RBF-FuzzyEn.

Case 2: Bearing Data of IMS
The experimental data of rolling bearing in this section comes from the rolling bearing life cycle data set of American NSFI/UCRC Center for Intelligent Maintenance Systems (IMS) [29]. Four bearings are installed on a rotating shaft of the test bench. The rotating speed is constant at 2000 rpm. Each of data set contains one second vibration signal collected at 20k Hz sampling frequency every 10 min. Three times of experiments were carried out. At the end of the second experiment, it is verified that the defect occurred on bearing one outer race, and the corresponding fault characteristic frequency is 236.4 Hz. We select the data at 5410 min in the second experiment for research.
First, we investigate the autocorrelation operation spectrum of signal at 5410 min. It is clear in Figure 27a that peak at 986 Hz, which is neither related to rotation frequency nor BPFO, dominates in the whole spectrum. In order to figure out this phenomenon, signal at 100 min, when bearing is under normal state, is checked by autocorrelation operation and spectral analysis. Once again, as displayed in Figure 27b, peak at 986 Hz has the highest amplitude, which bear no relation to bearing states. Therefore, peak at 986 Hz should be regarded as interference, En corresponding to the CFB containing this frequency should be modified to 0 when performing K-adaptive VMD. mal data. Although the normal bearing entropy obtained by the MFE method is relativ large, the entropy of IF and OF are very close with a difference of only 0.0331, which much less than 0.1179 of RBF-FuzzyEn.

Case 2: Bearing Data of IMS
The experimental data of rolling bearing in this section comes from the rolling be ing life cycle data set of American NSFI/UCRC Center for Intelligent Maintenance Syste (IMS) [29]. Four bearings are installed on a rotating shaft of the test bench. The rotati speed is constant at 2000 rpm. Each of data set contains one second vibration signal c lected at 20k Hz sampling frequency every 10 min. Three times of experiments were c ried out. At the end of the second experiment, it is verified that the defect occurred bearing one outer race, and the corresponding fault characteristic frequency is 236.4 H We select the data at 5410 min in the second experiment for research.
First, we investigate the autocorrelation operation spectrum of signal at 5410 min is clear in Figure 27a that peak at 986 Hz, which is neither related to rotation frequen nor BPFO, dominates in the whole spectrum. In order to figure out this phenomenon, s nal at 100 min, when bearing is under normal state, is checked by autocorrelation ope tion and spectral analysis. Once again, as displayed in Figure 27b, peak at 986 Hz has highest amplitude, which bear no relation to bearing states. Therefore, peak at 986 should be regarded as interference, En corresponding to the CFB containing this f quency should be modified to 0 when performing K-adaptive VMD. We take the first 0.5 s data at 5410 min to analyze. Figure 28 shows time-dom waveform and envelope spectrum of this data. As depicted in Figure 28 that peaks BPFO and its 2× −4 × harmonics exist in envelope spectrum, but there are some peaks We take the first 0.5 s data at 5410 min to analyze. Figure 28 shows time-domain waveform and envelope spectrum of this data. As depicted in Figure 28 that peaks at BPFO and its 2 × −4 × harmonics exist in envelope spectrum, but there are some peaks of noise interference around 2 × BPFO and 3 × BPFO, making the amplitude does not show a decreasing state from 2× to 4 × harmonic.
py 2022, 23, x FOR PEER REVIEW 28 of noise interference around 2 × BPFO and 3 × BPFO, making the amplitude does not sho a decreasing state from 2 × to 4× harmonic. In order to suppress noise interference and enhance fault characteristic, K-adapti VMD is put into effect. After determining the optimal K, the signal is decomposed into IMFs, their CFBs are itemized in Table 8. According to the prior knowledge, IMF3 contai the 986 Hz interference frequency. In addition, the first [K/3] IMFs, that is, the 1st-9 component, are also be considered as interference. In order to suppress noise interference and enhance fault characteristic, K-adaptive VMD is put into effect. After determining the optimal K, the signal is decomposed into 28 IMFs, their CFBs are itemized in Table 8. According to the prior knowledge, IMF3 contains the 986 Hz interference Entropy 2022, 24, 197 27 of 29 frequency. In addition, the first [K/3] IMFs, that is, the 1st-9th component, are also be considered as interference.
As illustrated in Figure 29a, the original En of IMF3 is huge. We update the indicators by modifying the En of IMF1-9 to 0. Finally, IMF10, 11, 12, 13, and 14, as the blue column described in Figure 29b, are selected for reconstruction. Figure 30a shows the envelope spectrum of reconstructed signal. Comparing with Figure 23b, it can be demonstrated that peaks of noise interference around 2 × BPFO and 3 × BPFO is significantly suppressed, and amplitude from 2× to 4 × harmonics present a downtrend. Figure 30b shows the envelope spectrum of signal reconstructed by FBE-VMD, in which IMF13 is selected out of 14 components. Although peaks at BPFO and its harmonics appear in spectrum, they experience similar interference in the envelope spectrum of the original signal. In TVMD, IMF2 is chosen from three IMFs and its envelope spectrum is illustrated in Figure 30c. Peaks of fault characteristic can be only found at BPFO and 2 × BPFO, and the amplitude of peaks are not very clear. The conclusion can be drawn by comparing Figure 30a-c that the K-adaptive VMD possesses satisfactory noise suppression and fault feature enhancement performance in processing bearing vibration signal.  In order to suppress noise interference and enhance fault characteristic, K-adapti VMD is put into effect. After determining the optimal K, the signal is decomposed into IMFs, their CFBs are itemized in Table 8. According to the prior knowledge, IMF3 contai the 986 Hz interference frequency. In addition, the first [K/3] IMFs, that is, the 1st-9 component, are also be considered as interference. As illustrated in Figure 29a, the original En of IMF3 is huge. We update the indicato by modifying the En of IMF1-9 to 0. Finally, IMF10, 11, 12, 13, and 14, as the blue colum described in Figure 29b, are selected for reconstruction.  Figure 30a shows the envelope spectrum of reconstructed signal. Comparing w Figure 23b, it can be demonstrated that peaks of noise interference around 2 × BPFO a 3 × BPFO is significantly suppressed, and amplitude from 2× to 4 × harmonics presen downtrend. Figure 30b shows the envelope spectrum of signal reconstructed by FB VMD, in which IMF13 is selected out of 14 components. Although peaks at BPFO and harmonics appear in spectrum, they experience similar interference in the envelope spe trum of the original signal. In TVMD, IMF2 is chosen from three IMFs and its envelo spectrum is illustrated in Figure 30c. Peaks of fault characteristic can be only found

Conclusions
In this paper, VMD and fuzzy entropy are improved from aspects of noise reduction and feature extraction respectively, forming a feature enhancement method that combines the K-adaptive VMD and RBF-FuzzyEn for the fault diagnosis of bearing. The main contributions of this paper can be summarized as below:

Conclusions
In this paper, VMD and fuzzy entropy are improved from aspects of noise reduction and feature extraction respectively, forming a feature enhancement method that combines the K-adaptive VMD and RBF-FuzzyEn for the fault diagnosis of bearing. The main contributions of this paper can be summarized as below: 1.
An algorithm of K-adaptive VMD was developed, which can obtain K adaptively based on ADCF, and select the optimal IMFs by a coefficient En. The analysis of reconstructed signal of five types of signals (nonlinear AM-FM simulation signal, nonlinear AM-FM simulation signal with close frequencies, inner race fault simulation signal, CWRU rolling bearing inner and outer race fault signal and IMS bearing outer race fault signal) demonstrated that compared with FBE-VMD and TVMD, the K-adaptive VMD has good noise reduction ability. The envelope spectrum of simulation and experimental bearing fault signals proved that K-adaptive VMD can realize the effective extraction of fault impulses. 2.
RBF-FuzzyEn was proposed, which introduced an innovative fuzzy function and carried out a specific way for determination of parameter r in fuzzy function. To verify the entropy feature extraction method proposed in this paper, RBF-FuzzyEn, FuzzyEn1, FuzzyEn2, ApEn, SampEn and MFE are used to extract the entropy of bearing fault simulation signals under different SNR and data length, as well as bearing fault experiment signals. The conclusion can be obtained from the results that RBF-FuzzyEn outperformed FuzzyEn1, FuzzyEn2, ApEn, SampEn and MFE in distinction between different sates of bearing. Meanwhile, the proposed RBF-FuzzyEn shows a marked noise robustness than other entropy methods and it is independent on data length N.

Conflicts of Interest:
The authors declare no conflict of interest.