1. Introduction
Rolling bearings are critical elements of mechanical equipment systems, the failure of which always causes a chain reaction, resulting in the occurrence of machine damage of varying degrees that may lead to collapses or even accidents in severe cases. Therefore, the health status of rolling bearings is directly related to the reliability of mechanical equipment operation [
1,
2,
3]. Therefore, to guarantee their security and stability during operation, rolling bearing real-time service state monitoring and fault diagnosis is essential [
4,
5]. Yet, due to the complicated internal composition of the equipment and the harsh operating environment, the captured vibration signals are often coupled with multiple component vibratory modes and noise. Hence, the key to accurately diagnosing bearing faults is extracting fault characteristic information from the vibration signal with disturbance information [
6,
7]. Traditional methods for fault feature information extraction include Fourier-transform-based spectral analysis methods, short-time Fourier transform (STFT), Wigner distribution (WD), and wavelet transform (WT) [
8,
9,
10,
11]. Due to the use of fixed basis functions, these analysis methods often lead to analysis results that lose physical significance and fail to extract the intrinsic characteristics of the signal [
12,
13]. Many experts and scholars from home and abroad have carried out much research in order to select the basis function or its parameters automatically according to the characteristics of the signal itself during signal decomposition so that the intrinsic characteristics of the mechanical fault vibration signal can be extracted effectively, and many beneficial results have been published. According to how the basic functions are identified, there are two major categories of adaptive signal decomposition methods: parametric and non-parametric.
Among them, the parametric adaptive signal decomposition method needs to select the basis function according to the characteristics of the signal in advance, then determine the optimal parameters or coefficients of the basis function adaptively to realize the optimal matched signal during decomposition [
14,
15], whereas the non-parametric adaptive signal decomposition method automatically selects the basis function according to the characteristics of the signal itself, the basis function of which does not have a definite analytic expression [
16,
17].
The non-parametric adaptive signal analysis method can automatically choose the basis function or its parameters according to the characteristics of the signal itself in the process of signal decomposition in order to obtain physically meaningful components. It thus can effectively extract the intrinsic characteristics of mechanical fault vibration signals. Compared with the parametric adaptive signal analysis method, the nonparametric adaptive signal analysis method does not need to construct a complete dictionary library based on the characteristics of the signal in advance, resulting in higher adaptiveness as well as decomposed components with physical significance. This is why it has been popularly adopted for mechanical fault diagnosis. At present, the common non-parametric adaptive signal analysis methods include Hilbert–Huang transform (HHT) [
18], local mean decomposition (LMD) [
19] and local characteristic-scale decomposition (LCD) [
20], etc. The ideas of these methods are similar: obtaining physically meaningful decomposition results by fitting the extreme value points. Compared with the parametric adaptive signal analysis methods, these methods are superior in terms of adaptiveness and physical significance of decomposition results, but they have two drawbacks in common. Firstly, they must all fit the extreme value points in the decomposition process. At the same time, such problems as over-enveloping, under-enveloping, frequency confusion, endpoint effect, etc., occur inevitably in the process of fitting the extreme value points. Next, there is no rigorous mathematical justification for whether the defined single-component signals are physically meaningful or not.
Combining with the symplectic geometry theory, Pan et al. proposed the symplectic geometry mode decomposition (SGMD) method [
21], and by decomposing the signal, certain symplectic geometry components (SGCs) with independent modes can be obtained. The Hamiltonian matrix’s eigenvalues are solved utilizing the symplectic geometric similar transform via SGMD, which is then used to reconstruct the single-component signal with the corresponding eigenvectors. The SGMD possesses advantages in that there is no subjective customization of parameters, and it can effectively be used to reconstruct existing modes and eliminate noise. However, SGMD suffers from the defects that the calculation efficiency decreases rapidly as the amount of data increases, in addition to invalid SGCs affecting the decomposition accuracy during reconstruction. Aiming at these issues, this paper takes advantage of the fact that composite multiscale fuzzy entropy (CMFE) can effectively evaluate the complexity of each initial single component of SGMD, as well as being able to overcome mutations in the signal initial single component similarity index [
22]. Firstly, the RCMFE operator is constructed to evaluate the complexity of each initial single component after reconstruction and constrain the residual energy to be minimized; then, it is combined with the constructed partial reconstruction threshold indicator to terminate the merge. Accordingly, this paper proposes a signal-denoising method based on partial reconstruction SGMD. Relative to the original version, PRSGMD only needs to deal with the part of the initial single component that contains significant modes, and the computation efficiency does not decrease with the increase in data amount, which can effectively improve decomposition speed. Meanwhile, PRSGMD can increase the decomposition accuracy while improving the speed of SGMD analysis owing to removing the effects of noisy and other invalid modes on decomposition results. The analysis results show that PRSGMD is more effective than the existing adaptive signal decomposition algorithms in denoising and extracting fault characteristic information.
The remaining parts are arranged in the following manner. 
Section 2 suggests the PRSGMD approach, which is motivated by the fundamental SGMD theory; 
Section 3 compares the PRSGMD, SGMD, VMD, and EEMD in simulation; in 
Section 4, the experimental signals are analyzed adopting the PRSGMD method; the last section is the conclusion.
  2. The Theory of the PRSGMD
  2.1. SGMD
SGMD solves the Hamiltonian matrix eigenvalues by adopting a symplectic geometric similar transform and reconstructs SGCs based on their relevant eigenvectors, thereby denoising the complex signal and performing the adaptive decomposition. SGMD consists of the following three major procedures.
Set the original signal time sequence as 
, where n is the data length. From the taken embedding theorem, employing a time sequence deferred topology equivalence on a one-dimensional signal can reconstruct a poly-dimensional signal and thus obtain the trajectory matrix 
, where 
 is the embedding dimension, 
.  is the delay time, and 
.
        
- 2.
- Symplectic Obtain s geometric initial single component 
In order to construct the Hamiltonian matrix, the autocorrelation analysis is carried out on the trajectory matrix to obtain the covariance symmetry matrix 
:
Decomposition of the matrix  yields the eigenvector matrix , where  is the eigenvector of the matrix  corresponding to the eigenvalue .
The transformed coefficient matrix 
 is gained by the unitary matrix eigenvectors and the trajectory matrix, and then it is converted to gain the initial single component matrix 
.
        
Diagonal averaging is employed to convert the initial single component matrix 
 to obtain the symplectic geometric initial single component 
, where 
.
        
- 3.
- Single Component Reconstruction 
 single-component signals are acquired through the trajectory matrix decomposition, but at this time, not all single components are independent of each other, so each group of components is likely to possess identical cyclic components, identical frequency components, etc. As a result, each initial single component needs to be recomposed. SGMD utilizes period similarity as the evaluation index; firstly, the matrix 
 is a 
 matrix, as the main parts are arranged in its front row, so the period similarity is compared between 
 and the remaining components. The first component 
 is acquired from the reconstruction of components with high similarity, while those who have been involved in the reconstruction of 
 will not participate in the rest of the reconstruction process; the remainder is denoted as 
, then the remainder signal 
 is produced by summarizing the remainder component matrix to calculate the NMSE (normalized mean squared error) between the remainder signal and the raw signal, when. W it is smaller than the specified threshold, decomposition stops; if not, the remainder component matrix is treated as the original matrix to continue iteration before reaching the iteration termination criteria. Set 
 as the number of component sequences obtained; then, the final decomposition result is
        
  2.2. Composite Multiscale Fuzzy Entropy
Given that different characteristic information and noise of the signal tend to be distributed at different scales, the initial single component containing noise tends to be of higher complexity, while the initial single component containing fault characteristic information is of inferior complexity. Therefore, the complexity of each initial single component is evaluated and ordered using the CMFE.
CMFE utilizes fuzzy entropy to overcome the mutation of the similarity index of the SGC component in the signal. Meanwhile, aiming at the effect on fuzzy entropy calculation due to the shortening of the time sequence in the coarse granulation process, the mean value of the fuzzy entropy of different crude granulation sequences under the same scale factor is used as the fuzzy entropy value under this scale factor. CMFE is calculated as follows:
At first, for a given signal 
x(
n) with 
 data points, calculate different coarse-grained time sequences 
 with scale factor 
, where
        
        where 
, 
.
Next, for each scale factor, the fuzzy entropy of each coarse-grained sequence 
 is calculated, then the mean value of the 
 entropy values is calculated, making 
 the fuzzy entropy calculation of the signal; thus, the CMFE of this scale factor 
 is obtained:
  2.3. Partial Reconstruction Symplectic Geometry Mode Decomposition
After transforming the initial single component matrix 
 by diagonal averaging, SGMD obtains 
 initial symplectic geometric single components 
, where 
 is the embedding dimension, usually set as 
. Therefore, when SGMD is used to process signals with data length 
, the single component reconstruction link needs to carry out cyclic iteration on 
 initial single components 
 to compare the similarity between 
 and other initial single components. When the amount of data increases, the calculation amount of SGMD increases rapidly correspondingly, and the calculation time becomes longer, which is not conducive to the practicability and effectiveness of SGMD. At the same time, the initial single component containing invalid modes, such as noise, is not distinguished during reconstruction, which affects the decomposition accuracy of SGC. Aiming at the above deficiencies, the Partial Reconstruction Symplectic Geometry Modeprsgm method is proposed in this paper. For the signal 
, the iterative process of the PRSGMD method is as follows, and the iterative flow is shown in 
Figure 1.
(1) Set .
(2) Construct the phase space trajectory matrix 
.
        
Here,  is the data length,  is the embedding dimension usually set to , and  is the delay time, . Selecting the appropriate embedding dimension  and delay time , the corresponding reconstruction matrix  can be obtained.
(3) Obtain the initial single component matrix  after diagonal averaging, including , as shown in Equation (4).
(4) Calculate the RCMFE operator of each initial single component.
        
Among them, the scale factor  of the CMFE is usually set to 3, and  can be used to assess the complexity of the initial single component of different scales. The different characteristic information and noise of the signal tend to be distributed in different scales; the initial single component containing noise tends to be of higher complexity, while the initial single component containing fault characteristic information is of inferior complexity.  can be used to assess the initial single component energy; the larger  is, the larger the initial single component energy, and the smaller the decomposition residual.
The initial single component matrix 
 is reordered by RCMFE, which separates the fault characteristic information from the noise using complexity quantization under the condition that the components obtained from the decomposition are valid. Therefore, 
 is sorted by the RCMFE values of 
 from largest to smallest as follows:
(5) Build  as the partial reconstruction threshold index, selecting  part of the initial single component refactoring, and obtain the partial reconstruction initial single component matrix  that contains significant modes of the raw signal, the. T remaining large number of weak invalid components will not participate in the reconstruction process, thus reducing the amount of calculation and improving decomposition speed.
(6) Merge  with the other initial single components in turn, recalculate RCMFE, merge again if it increases, and remove the merged initial single components in  so that  is the final merged obtained .
(7) Calculate the iteration termination criterion , the. T value of ε is usually 0.001; if the termination criterion is not satisfied, go back to step (6) and obtain , if. I the criterion is satisfied, terminate the iteration and accomplish the whole decomposition process.
  3. Simulation Analysis
The PRSGMD is more accurate in signal decomposition compared to other methods. To compare and analyze, the SGMD, Variational Mode Decompositionvmd (VMD), and Ensemble Empirical Mode Decompositioneemd (EEMD) were used. First, construct the following simulated signal 
		where 
 consists of an AM-FM signal 
 and a vibration attenuation signal 
, 
Figure 2a–c represents 
,
, and 
, respectively. Use the four methods to decompose the simulated signal and compare the similarity between the decomposed components and 
 and 
 to verify the excellence of the PRSGMD. 
Figure 3a–d illustrates the components and the residue decomposed by the four methods. In 
Figure 3a, the mode mixing problem of 
 is visually evident, indicating that the EEMD does not perform well in decomposing this signal. 
Figure 3b,c show the VMD and SGMD, respectively, both of which have better results compared to the EEMD. However, when comparing the amplitudes with 
 and 
, both 
 and 
 lose some information. Compared to the previous three methods, the SGC of PRSGMD closely matches 
 and 
 of 
.
A Hilbert transform was applied to IMF and SGC to obtain their instantaneous characteristics, namely, Instantaneous Amplitudeia (IA) and Instantaneous Frequencyif (IF). They were then compared with the IA and IF of 
 and 
, resulting in 
Figure 4. After comparison, the absolute difference between the IA and IF of 
 and 
 was taken, resulting in 
Figure 5, which represents the IA errors and IF errors. Based on 
Figure 4 and 
Figure 5, it can be observed that the IA and IF of the 
 component obtained by the EEMD exhibit significant deviations from the actual values, indicating severe mode mixing. Although there are fluctuations in the IA and IF of the components decomposed by the VMD and SGMD compared to the actual values, they outperform the EEMD. Meanwhile, the 
 component obtained by the PRSGMD exhibits more accurate instantaneous amplitude and frequency, with smaller fluctuations and proximity to the actual values. It can be inferred that the component obtained by PRSGMD decomposition has higher accuracy, and the PRSGMD has a better decomposition ability.
Finally, the comprehensive performance of the four methods was further compared using metrics such as energy error (
), correlation coefficient (
), orthogonality index (IO), and computation time (T). 
Table 1 was created to summarize the results and compare the overall performance of the four methods. According to the comprehensive analysis in 
Table 1, the components obtained by the PRSGMD exhibit higher correlation coefficients and smaller energy errors with the actual values, indicating closer proximity to the actual values. Additionally, the orthogonality index of the PRSGMD decomposition result is significantly lower than that of the SGMD, indicating good orthogonality of the PRSGMD method. It is worth noting that while the decomposition speed of the PRSGMD is better than the SGMD, it is still lower than that of the EEMD and VMD.
To validate the proposed method’s resistance to noise, a signal presented in Equation (12), 
, was designed that consists of two parts, namely, the vibration attenuation signal and Gaussian white noise signal generated during the simulation of actual faults. The simulation signal and its component time domain waveform are depicted in 
Figure 6. The signal-to-noise ratios of the Gaussian white noise signal are 5dB, −10 dB–10dB, and −20 dB, respectively. Further, PRSGMD, SGMD, VMD, and EEMD were decomposed for 
.
      
The results of the signal 
 decomposition by the PRSGMD, SGMD, VMD, and EEMD are depicted in 
Figure 7, 
Figure 8 and 
Figure 9 accordingly. Figure (a), (b), (c), and (d) are the PRSGMD, SGMD, VMD, and EEMD decomposition results of thesimulation signals, respectively. At the same time, to quantify the resistance to noise, the corresponding evaluation metrics are shown in 
Table 2. As can be seen from 
Figure 7, 
Figure 8 and 
Figure 9, when the signal-to-noise ratio is 5 dB, that is, when the noise is relatively weak, the four signal decomposition methods can effectively distinguish the vibration attenuation signal component from the noise, achieving the effect of signal and noise separation. Although the waveform of the effective component 
 from the VMD contains burr and is not smooth enough, it is basically consistent with the waveform trend of the vibration attenuation signal in the simulation signal.
In terms of the accuracy of decomposition, according to the corresponding correlation coefficients in 
Table 2, it was concluded that the decomposition results of the SGMD, VMD, and EEMD are superior to PRSGMD, which is also reflected in the waveform of residual errors. The residuals of the VMD and EEMD are more consistent with the added noise from the perspective of time-domain averaging. However, the signal and noise separation of the PRSGMD is not thorough enough, so the waveform trend of the residual component contains weak AM characteristics. With the Gaussian white noise being strengthened, noise disturbance decreases the effectiveness and accuracy rate of the decomposition methods to varying degrees. When the signal-to-noise ratio is −10 dB, although the waveform of 
 component of the VMD still shows a general trend of oscillation, the sawtooth fluctuation in the entire time domain seriously inhibits the attenuation characteristics of the component. At the same time, the curve of the 
 component of EEMD has obvious waveform loss. The effective components of the PRSGMD and SGMD still maintain relatively high waveform similarity, and the corresponding correlation coefficient and energy error index also provide data support from the side. Although the accuracy of decomposition is slightly reduced, the effectiveness of decomposition is guaranteed, and the signal and noise separation are realized. When the SNR is −20 dB, the amplitude fluctuation of 
 component of the VMD obviously exceeds the amplitude limit of 
 of the raw signal, and signal-noise aliasing is serious. The amplitude of the EEMD’s 
 component varies in acceptable bounds, but the waveform is clearly missing. The smooth property of 
 component of the SGMD is corrupted by noise; at the same time, its reliability declines with the end effect. In contrast, the 
 component of the PRSGMD with slight distortion in some of the peaks and valleys and slightly diminished AM/FM characteristics still retains the waveform of the raw signal 
, generally. Its r1 (correlation coefficient) index is approximately 0.8, and the E1 (energy error) is the minimum among all methods. This method still achieves effective decomposition results despite the disturbance of intense noise.
The results of the above simulation analysis show that although the accuracy of the PRSGMD method is slightly lower than the other three methods when the noise intensity is weak, the effectiveness of the VMD and EEMD methods is significantly reduced or even ineffective when the noise intensity is increased. Although the SGMD has specific anti-noise performance, the decomposition performance cannot meet the requirements in the environment of high-intensity noise. The PRSGMD method is suitable for separating signal and noise, achieves a superior decomposition effect under the noise disturbance of different signal-to-noise ratios, and has favorable anti-noise performance. In terms of computational efficiency, although the decomposition time of the PRSGMD method is less than that of the SGMD method, the decomposition time is still higher than that of the VMD and EEMD methods. There is a need to optimize the filter parameters in the PRSGMD method to reduce decomposition time and improve decomposition efficiency.
  4. The Application of the PRSGMD Method in the Diagnosis of Rolling Bearing Faults
To further explain the excellencies and practicality of the PRSGMD method, it was applied to rolling bearing vibration signals analysis. The bearing failure experimental equipment is given in 
Figure 10. The main constituent parts of the experimental equipment are the AC motor, frequency changer, gearbox, support frame, rotation shaft, coupling, acceleration sensor, load pressurization device, experimental bearing, acquisition card, VK702 signal acquisition system, etc. The bearings used in the experimental equipment are SKF 22238-MB spherical roller bearings. Before the experiment, the cage fault was set using the EDM wire-cutting machining technique, and the acceleration sensor was installed in the motor drive end housing. The frequency of sampling of the vibration signals was fixed at 1000 Hz, and each data sample selected in this paper contains 20,000 data points. The speed set for experiments was 40 rpm.
Using the Envelope Spectrumes (ES) analysis method to diagnose all the data in this dataset, the data were categorized as diagnosable (Y), with an ambiguous diagnosis (A), and completely undiagnosable (U). Therefore, a new signal analysis method can be applied to data that are completely undiagnosable (U) if it is necessary to test their practicality for rolling bearing fault diagnosis. This method can be applicationed to data that are completely undiagnosticable (U) if a good diagnostic result is achieved. The parameters associated with the data are given in 
Table 3. The rotation frequency was calculated to be 0.67 Hz, and the fundamental train frequency (FTF) was 0.288 Hz.
For comparison, five methods are used for fault diagnosis: ES analysis, EEMD decomposition followed by ES analysis, VMD decomposition followed by ES analysis, SGMD decomposition of the raw signal followed by ES analysis, and PRSGMD decomposition followed by ES analysis. 
Figure 11 presents the raw signal together with its ES. 
Figure 12, 
Figure 13, 
Figure 14, 
Figure 15, 
Figure 16, 
Figure 17, 
Figure 18 and 
Figure 19 present the results using four various decomposition methods together with their ES. A more apparent impulse response sequence was obtained using the PRSGMD method, as shown from SGC
4 in 
Figure 12. 
Figure 11b’s dashed line shows the harmonics of 
 and 
Figure 11b’s dash-dotted line shows the harmonics of FTF. Furthermore, 
Figure 11b’ES also indicates that the rotation frequency and fault information are masked.
The ES of the PRSGMD decomposition results in 
Figure 16 can clearly distinguish the harmonics of the rotation frequency (
, 
) and the first harmonic of the FTF. The ES of the SGMD and VMD decomposition results in 
Figure 17 and 
Figure 18 do not have obvious FTF harmonics, but the first harmonic of the rotation frequency can be clearly distinguished in 
Figure 17b,c and 
Figure 18b. However, inthe ES of the EEMD decomposition results in 
Figure 19, it is difficult to distinguish the obvious FTF harmonics. As a result, the PRSGMD method effectively diagnoses cage faults. According to the above classification of diagnostic effects, the diagnosis effects of the five methods of ES analysis, the EEMD decomposition followed by ES analysis, the VMD decomposition followed by ES analysis, the SGMD decomposition followed by ES analysis, and the PRSGMD decomposition followed by ES analysis were classified as U, U, U, U, and Y, respectively.
The decomposition times of the raw vibration signal using the four decomposition methods are recorded in 
Table 4. During the decomposition of the actual vibration signal, the PRSGMD method has a shorter decomposition time compared to the SGMD method, which improves the efficiency of the decomposition of the raw vibration signal, but the decomposition time is still higher than that of the VMD and EEMD methods.
From the above analysis results, the PRSGMD method can effectively diagnose the faults of rolling bearings. Moreover, compared with the direct ES of the raw signal or using EEMD, VMD, and SGMD methods to decompose, followed by the ES, the PRSGMD method can more accurately extract the fault characteristics of rolling bearings from noise signals.