1. Introduction
Rolling element bearings (REBs) are essential components in rotating machinery, commonly used in industries such as manufacturing and energy [
1,
2,
3]. These bearings are subject to heavy operational loads and non-stationary working conditions, which often result in wear and failure over time, leading to costly downtime and significant economic losses. As a result, fault diagnosis is crucial for maintaining equipment reliability and operational safety [
4]. Among various monitoring techniques, vibration signal analysis has emerged as the most widely adopted method due to its sensitivity to mechanical faults and its ability to capture dynamic system responses [
5,
6].
When a fault occurs in a rolling element bearing, periodic impacts excite the structural resonances of the bearing and adjacent components, generating transient impulses that manifest at characteristic frequencies. However, due to the complexity of mechanical structures and harsh operational environments, these fault impulses are often obscured by significant background noise and other interferences, making fault diagnosis particularly challenging [
7]. The accurate filtering of noise and extraction of meaningful fault features remain critical challenges in vibration-based fault detection. Over the past few decades, various methods have been developed to address this problem, including wavelet transform (WT) [
8,
9,
10], empirical mode decomposition (EMD) [
11,
12], local mean decomposition (LMD) [
13,
14] and singular value decomposition (SVD) [
15,
16], spectral kurtosis (SK) [
17,
18] and variational mode decomposition (VMD), etc.
Among these, VMD has gained notable attention due to its ability to decompose complex signals into a set of band-limited, quasi-orthogonal intrinsic mode functions (IMFs). Unlike EMD or LMD, which suffer from a lack of theoretical rigor, VMD formulates the decomposition as a constrained variational problem, enabling better mode separation and noise resilience. These advantages have led to its successful application in various domains, including machinery fault detection, signal denoising, and condition monitoring. For instance, Wang et al. [
19] investigated the equivalent filtering characteristics of VMD and applied it to detect multiple rubbing-caused signatures for rotor-stator fault diagnosis. Zhang et al. [
20] proposed a hybrid fault diagnosis approach based on VMD and total variation minimization for bearing fault diagnosis. Moreover, Zhang et al. [
21] applied VMD to diagnose faults in bearings of a multistage centrifugal pump. Nevertheless, the accuracy and effectiveness of VMD are highly sensitive to the selection of key parameters: the number of modes and the quadratic penalty term, which governs bandwidth constraints. Typically, these parameters are manually set based on empirical knowledge, limiting the method’s adaptability and potentially yielding suboptimal decomposition. To address this, researchers have proposed various strategies for parameter optimization. Li et al. [
22] introduced an independence-oriented VMD method that uses peak searching and similarity principles to determine the most suitable mode number, although it does not account for the impact of the bandwidth control parameter. Zhang et al. [
23] presented a parameter-adaptive VMD method using the grasshopper optimization algorithm (GOA) to estimate the optimal mode number and bandwidth control parameter adaptively. Lian et al. [
24] proposed an adaptive VMD that automatically determines the mode number based on the characteristics of the intrinsic mode functions.
Although several improved VMD approaches have been proposed, most focus on optimizing parameters such as the mode number and penalty factor using optimization algorithms, which are often time consuming and complex to implement. Moreover, the influence of the initial center frequencies —an essential factor for stable and accurate decomposition—has been largely overlooked. Since the selection of key parameters in VMD strongly influences its decomposition performance and diagnostic reliability, a fully adaptive strategy is essential. To this end, this paper proposes an Improved Variational Mode Decomposition (IVMD) method that incorporates a scale space representation to adaptively determine both the initial center frequencies and the number of modes in a data-driven manner. In addition, multipoint kurtosis (MKurt) is introduced to guide the selection and merging of modes, ensuring that critical fault features are preserved. Specifically, the IVMD method begins by generating the scale space representation through the inner product between the signal’s Fourier spectrum and a Gaussian function. The initial center frequencies are then determined based on the local maxima in the scale space. Once these parameters are established, the signal is decomposed into subcomponents using VMD. Subcomponents with large MKurt that contain fault-related features are merged to improve fault detection. Experimental results from bearings with various faults demonstrate that IVMD effectively extracts fault features, even in the presence of significant noise and interference. By adaptively selecting key parameters, IVMD outperforms conventional VMD, making it a robust and reliable tool for fault diagnosis in rolling element bearings.
The main contributions of this work can be summarized as follows:
- (1)
A novel IVMD framework is proposed, which employs scale space representation to adaptively determine both the number of modes and their initial center frequencies, thereby overcoming the reliance on manually set parameters in conventional VMD.
- (2)
The MKurt-based mode merging strategy is introduced to enhance fault-relevant components and suppress redundant modes, improving the accuracy of impulse extraction in noisy environments.
- (3)
Experimental validation on locomotive bearings with both single and compound faults demonstrates that the proposed method significantly outperforms conventional VMD in fault detection capability.
The remainder of this paper is organized as follows.
Section 2 provides a comprehensive review of VMD.
Section 3 introduces the challenges associated with parameter selection, such as mode number and center frequency initialization.
Section 4 details the proposed IVMD based on scale space representation.
Section 5 validates the effectiveness of the proposed IVMD through experimental studies, including comparisons with traditional VMD using vibration signals from bearings with compound faults. Finally,
Section 6 concludes the paper.
2. Theory of Variational Mode Decomposition
The VMD is a signal processing technique that decomposes a given signal into a set of band-limited sub-components or modes
uk, each centered around a specific frequency, while preserving the ability to reconstruct the original signal [
25]. The decomposition is formulated as a constrained variational problem aimed at minimizing the sum of bandwidths of all modes under the constraint that their sum reconstructs the original signal:
where
are the modes and the center frequencies, respectively.
K is the number of decomposed modes. In (1), the problem consists of three key steps. First, Hilbert Transform is applied to obtain the analytic signal and compute the unilateral frequency spectrum of each mode
uk; second, frequency shifting is conducted by demodulating the spectrum to the baseband via multiplication with an exponential function tuned to the estimated center frequency
wk; third, bandwidth estimation is achieved by computing the squared
L2-norm of the gradient of the demodulated signal.
To solve the constrained variational problem in (1), it is converted into an unconstrained optimization form using the augmented Lagrangian method. This involves introducing a quadratic penalty term
α and Lagrangian multiplier
λ(
t), resulting in the following formulation:
The optimization problem in Equation (2) is then solved using the alternating direction method of multipliers (ADMM). The entire solving process of VMD is summarized in Algorithm 1, which iteratively updates
,
and
, until a convergence condition defined by threshold
c is met.
Algorithm 1: Variational Mode Decomposition |
Initialize |
repeat |
for k = 1 to K do |
Update : |
Update : |
Update : |
end for |
|
until convergence: |
For the minimization problems of
,
in Algorithm 1, the solution for the intrinsic modes
can be obtained by Wiener filtering in Fourier domain [
26], and the center frequency
can be updated as the center of gravity of the mode’s power spectrum. Specifically, the mode
and center frequency
can be represented by
From the above descriptions, there are several parameters that need to be specified in advance. They are the mode number
K, initial center frequencies
, the quadratic penalty term
α, the noise-tolerance
τ and the tolerance of convergence criterion
c. Based on prior studies [
22,
23,
27], it has been demonstrated that the values of
τ and
c exert minimal influence on the decomposition performance. Therefore, the default settings in the original VMD algorithm are adopted, with
τ = 0 and
c = 1 × 10
−6. For vibration signals, a penalty term
α = 2000 has proven effective.
Notably, initial center frequencies of all K modes, should be selected carefully based on the application scenario and significantly influence the decomposition results.
3. Motivation of the Proposed IVMD
To illustrate the impact of key parameters, namely the initial center frequencies
and mode number
K, on the decomposition performance of VMD, a simulated vibration signal containing compound bearing faults, specifically inner race and outer race defects, is constructed as follows [
28]:
The first term simulates discrete harmonic interferences originating from rotors, shafts, gearboxes, or other rotating components, where Am and αm denote amplitude and initial phase, respectively. The second and third terms represent fault impulses generated by the inner race and outer race faults. Here, Di indicates impulse amplitude, and Td is the time interval between successive impulses. To simulate the slip phenomenon commonly observed in bearing vibrations, a random variable τi with zero mean, typically accounting for 1–2% of the fault period, is introduced. n(t) represents additive Gaussian white noise, simulating environmental interference or sensor noise commonly present in real-world measurements.
When a rolling element strikes a surface defect, it excites structural resonance in the system, producing a sequence of impulses as rolling elements pass over the damaged area. The frequency of these impulses is uniquely determined by the rotational speed, fault location, and geometric configuration of the bearing. Notably, multiple faults usually excite distinct or overlapping resonant frequencies [
28]. Generally, the defect impulses can generally be described as an exponentially decaying sinusoidal waveform:
where
fr is the structural resonance frequency excited by the impact, and
ξ is the decay rate.
This signal model (5) effectively mimics the dynamic characteristics of real bearing fault signals, including overlapping impulse sequences, harmonic interference, and stochastic fluctuations, thereby providing a suitable case for validating the decomposition and fault feature extraction capabilities of the proposed IVMD. In this simulation, the outer race is fixed, and the inner race rotates with the shaft. Both outer and inner race faults are present, exciting two distinct resonance frequencies. The simulation parameters are summarized in
Table 1, and waveform illustrations are provided in
Figure 1.
Figure 1a–d show outer race fault impulses, inner race fault impulses, harmonics, and Gaussian noise, respectively. The final synthetic signal is obtained by superimposing harmonic components, defect impulses, and Gaussian noise, achieving a signal-to-noise ratio (SNR) of −5 dB, as shown in
Figure 1e. Meanwhile, the frequency spectrum of mixed signal is displayed in
Figure 1f, from which the rotating frequency and two resonance frequency bands are obviously observed.
The initialization of center frequencies
,
k = 1, 2, …,
K significantly influences the performance of VMD. Common strategies for setting
include: uniformed spaced distribution
, zero initial
and initialize randomly
[
29]. However, these fixed schemes are generally non-adaptive and may yield suboptimal decomposition results depending on the input signal. To demonstrate this, a simulation is performed with
α = 2000 and
K = 3. When
is initialized using
, VMD fails to separate the components effectively (see
Figure 2a). In contrast, when
, the modes are accurately located (
Figure 2b). This highlights the fact that VMD does not guarantee convergence to a global minimum, and its results heavily depend on the initialization of center frequencies. The above analysis indicates the importance of proper center frequency selection. Thus, adaptive selection of
remains a crucial yet challenging aspect in practical applications.
Moreover, to evaluate the effect of under- and over-estimating mode number K, the simulated signal is decomposed with different mode numbers ranging from 2 to 5, while keeping the quadratic penalty term α = 2000 and selecting as a prior initialization.
The decomposition results when
K = 2–5 are shown in
Figure 3. When the number of modes
K is underestimated (e.g.,
K = 2), one important component is lost, leading to incomplete decomposition (
Figure 3a). With the optimal mode number selection (e.g.,
K = 3), three components are successfully extracted; however, they do not correspond to the three components of interest (
Figure 3b). In the near-optimal case (
K = 4), two components are accurately identified, but one remains indistinct (
Figure 3c). When
K is overestimated (e.g.,
K = 5), the three main components are still extracted; however, the additional modes primarily capture noise (
Figure 3d).
These results suggest that both the number of modes K and the initial center frequencies play a pivotal role in determining the accuracy and quality of the decomposition. Therefore, careful selection of these parameters is essential for achieving optimal performance.
4. The Proposed IVMD for Bearing Fault Diagnosis
As previously discussed, the initialization of center frequencies of K modes is crucial for ensuring the effective performance of VMD. To address this issue, this paper introduces IVMD based on scale-space representation, which enables data-driven determination of the initial center frequencies. Meanwhile, a multipoint kurtosis-based mode merging strategy is also developed for fault feature enhancement. Together, these two aspects significantly improve the decomposition accuracy and robustness of the method, particularly in the context of bearing fault diagnosis.
4.1. Scale Space Representation Based Parameter Initialization
For a discrete-time vibration signal
x(
t), its discrete Fourier transform (DFT)
X(
f) is defined as
To construct the scale-space representation, a kernel function is introduced as
, where
n is called the scale parameter. The discrete scale space representation of the signal’s Fourier spectrum is then defined as
where
M must be sufficiently large to minimize the approximation error caused by truncation of the Gaussian kernel. A common choice is to set
with
. In this study,
C = 6 is adopted to ensure the approximation error remains below
To extract mechanical fault characteristic frequencies while ensuring that each decomposed sub-component retains fault-related information, the scale parameter should suppress characteristics below a predefined threshold
, i.e.,
[
30,
31]. Since the fault characteristic frequency
and its harmonics provide strong evidence for localized defects, a relatively large-scale parameter is typically preferred. However, excessively large values may mask weak fault features. To balance resolution and sensitivity, the scale parameter is set as
where
. The setting of
is not restricted but suggested in this range. According to [
32], the scale space representation
L(
f,
n) becomes smoother with increasing
n, and small fluctuations in
n have minimal effect on the analysis results. Based on trial and error,
is selected.
The scale space representation of the simulated signal is shown in
Figure 4. The scale-space representation involves smoothing the frequency spectrum at various scales, which helps in detecting prominent peaks corresponding to significant frequency components. By identifying these local maxima, one can effectively initialize the center frequencies for each mode in VMD, leading to improved decomposition performance. For example, in
Figure 4, it is evident that the center frequencies corresponding to Mode 2 and Mode 3 are clearly identifiable.
However, low-frequency components falling below the
may be excluded from the initialization process. As shown in
Figure 4, while the center frequencies for Mode 2 and Mode 3 are clearly captured, Mode 1, which corresponds to a low-frequency component, is initially omitted because its frequency falls below the detection threshold. To address this, we consistently assign the first center frequency as
. This manual setting is a general rule in our method, designed to ensure that low-frequency content—such as fundamental rotating frequencies and their harmonics—is retained. This consideration is particularly critical in applications like bearing fault diagnosis, where such low-frequency features often carry significant fault-related information. By including
as part of the standard initialization, it enhances the completeness of the frequency decomposition and reduces the risk of missing diagnostically relevant components.
4.2. Multipoint Kurtosis-Based Mode Merging
Once the initial center frequencies are determined via scale-space representation, VMD is performed to decompose the signal into a set of intrinsic mode functions (IMFs), each represented as
where
and
are the instantaneous amplitude and phase of the
k-th mode, respectively.
Given that multiple IMFs may contain similar or overlapping information, it is essential to eliminate redundancy by merging modes with analogous fault-related features. For this purpose, the multipoint kurtosis (MKurt) of the envelope spectrum is employed as a criterion. MKurt is defined as
where
N is the length,
y is the envelope spectrum and
is the target vector.
Recognizing that rolling bearings commonly exhibit faults in the outer race, inner race, rolling elements, and so on, the MKurt value for each IMF is evaluated under these typical fault scenarios. This targeted analysis enables the identification of IMFs that are most relevant to specific fault types. An IMF is considered to contain meaningful fault-related features if its MKurt exceeds a certain threshold under a given condition. Once the relevant IMFs are identified, those corresponding to the same fault type are merged to consolidate diagnostic information, suppress redundancy, and improve feature clarity in the envelope spectrum.
In this study, a threshold of 1 × 10−3 is adopted based on extensive simulation and experimental results, as it offers a practical balance between sensitivity to weak fault impulses and robustness against noise interference. Importantly, this threshold is not fixed and can be adjusted according to the characteristics of the signal, such as noise level, fault type, and severity. A threshold that is too low may lead to the retention of noise-dominated components, while a threshold that is too high risks discarding weak yet diagnostically valuable features. Therefore, the selected value serves as a reliable default under typical conditions but can be flexibly adapted in other application scenarios as needed.
4.3. The Flowchart of Proposed IVMD for Bearing Fault Diagnosis
The proposed IVMD for bearing fault diagnosis is structured to systematically extract and analyze fault-related features from vibration signals. The flowchart, as depicted in
Figure 5, encompasses the following detailed steps:
Step 1: Vibration signal acquisition and frequency spectrum computation
Acquire the vibration signal and compute its frequency spectrum via DFT to obtain the spectral distribution of the signal.
Step 2: Scale space representation for initialization
Apply scale space representation to the frequency spectrum to determine the initial center frequencies of K modes for VMD by detecting local maxima. This data-driven initialization improves the adaptivity and accuracy of mode decomposition.
Step 3: Variational mode decomposition and multipoint kurtosis analysis
Perform VMD using the initialized center frequencies. Compute the MKurt of the envelope spectrum for each IMF under the three common fault types (outer race, inner race, rolling elements). Identify IMFs whose MKurt exceeds the threshold and merge them based on fault type to reduce redundancy.
Step 4: Envelope spectrum analysis for fault feature identification
Conduct envelope analysis on the merged IMFs to extract characteristic fault frequencies, enabling accurate identification of specific bearing defects.