PDRNet: A Novel Physical Feature-Driven Residual Network for Motor Vibration Signal Denoising

Kaijie Yu; Xiongying Wu; Meng Yang; Fanqiang Lin; Zhuobang Liang

doi:10.3390/s25237213

,

and

¹

College of Mechanical and Electrical Engineering, Chengdu University of Technology, Chengdu 610059, China

²

School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 610059, China

³

College of Computer Science and Cyber Security, Chengdu University of Technology, Chengdu 610059, China

^*

Author to whom correspondence should be addressed.

Sensors2025, 25(23), 7213;https://doi.org/10.3390/s25237213

This article belongs to the Section Fault Diagnosis & Sensors

Version Notes

Order Reprints

Abstract

Bearing signal denoising is a pivotal task in predictive maintenance and condition monitoring of industrial machinery. However, conventional denoising methods often face difficulties in simultaneously suppressing noise and preserving essential physical features. To address this challenge, we propose a novel denoising framework that incorporates physical feature priors derived from low-dimensional manifold-based simulated data. Specifically, a regression branch—constructed using convolutional and residual neural networks—is integrated into the main denoising model to exploit the intrinsic structure of bearing signals. By embedding manifold-informed priors, the regression branch enhances denoising performance and ensures the retention of critical physical features. Experimental results demonstrate that the proposed approach surpasses traditional methods in both signal denoising and bearing fault diagnosis. Notably, the incorporation of manifold-derived priors improves the model’s capability to capture the underlying physical characteristics of the signals, indicating that the network has learned their inherent features rather than simply minimizing the loss function. Overall, this study introduces a robust denoising paradigm for complex industrial environments where bearing signals exhibit significant variability.

Keywords:

vibration signal denoising; physics-driven; deep learning; manifold learning

1. Introduction

Rolling bearings are among the most critical components in rotating machinery, and their health condition directly determines system safety, reliability, and efficiency. Undetected faults not only degrade performance but may also trigger unexpected shutdowns or even catastrophic accidents. Consequently, accurate monitoring and timely bearing fault diagnosis have long been central issues in mechanical engineering and intelligent maintenance.

In practice, vibration signals are the most widely used indicators for bearing fault diagnosis []. However, they are typically collected under complex operating conditions and are unavoidably contaminated by strong background noise [,]. Such noise can readily obscure weak but diagnostically significant features: periodic impacts, resonance responses, and modulation effects, thereby complicating a reliable diagnosis. Over the past few decades, widely adopted approaches, including wavelet transform (WT) [,,], empirical mode decomposition (EMD) [], and singular value decomposition (SVD) [,], have enabled multiscale time–frequency analysis and shown utility for nonstationary motor vibration signals. Nevertheless, time–frequency methods generally require careful choices of basis functions, decomposition scales, and thresholds, and their computational burden can be substantial. Suboptimal parameterization often compromises denoising performance in the presence of complex, heterogeneous noise and typically demands considerable engineering experience. As a result, conventional techniques, such as filtering and wavelet thresholding, may suppress noise to some extent, but frequently attenuate or distort fault-related information, leading to unstable and unreliable diagnostic results [].

Recent years have witnessed growing interest in deep learning for bearing-signal denoising and fault diagnosis, owing to its superior capacity for nonlinear modeling and hierarchical feature extraction [,,,,]. Some methods involve improving the fault detection model to adapt to the motor signals in noisy environments [,]. Yet most existing methods are purely data-driven and lack explicit characterization of the signal’s physical structure []. In the absence of physics-based constraints, models may overfit noise patterns or inadvertently remove fault-related components during denoising, yielding signals that appear clean but lack physical interpretability [,]. This limitation fundamentally undermines robustness and trustworthiness in real-world diagnostic scenarios.

To overcome these limitations, we propose a denoising framework that integrates a physics-informed regression branch into a deep residual denoiser, which has been used in some studies for classification and recognition tasks [,]. The branch is explicitly designed to preserve and enhance physically meaningful features in vibration signals. Its design is grounded in the known dynamics of rolling element bearings and the characteristic signatures produced under fault conditions [,]. For instance, localized defects generate periodic impulsive forces as the flaw repeatedly contacts mating surfaces; these excitations in turn drive high-frequency structural resonances and yield amplitude-modulated responses governed by bearing kinematics []. Rather than treating the input as a generic time series, the physics-based branch is trained to learn and regenerate these canonical structures—transient impulses, resonance bands, and modulation sidebands—thereby steering the main denoiser toward reconstructions that respect the underlying system dynamics. Preserving such features is crucial for two reasons. First, they carry essential diagnostic information for fault localization and severity assessment; for example, the impulse repetition rate—determined jointly by fault type and shaft speed—is a direct indicator of fault location []. Distorting impulse shape or interval can thus cause misdiagnosis or misestimation of remaining useful life. Second, resonance frequencies and modulation patterns reflect structural properties of the mechanical system [,]; maintaining their integrity ensures interpretability and consistency with domain knowledge and facilitates downstream tasks such as root-cause analysis. The novelty of our proposed method compared to previous works has been shown in Table 1.

Table 1. Comparison between existing vibration signal denoising methods and the proposed PDRNet.

We validate the proposed approach on the widely used Case Western Reserve University (CWRU) bearing dataset, which spans multiple operating conditions and fault types and has become a de facto benchmark for fault diagnosis [,]. Experimental results show that our method achieves superior noise suppression relative to both traditional and state-of-the-art baselines, while substantially improving the accuracy and robustness of subsequent fault diagnosis. These findings confirm that combining low-dimensional manifold priors with deep neural architectures constitutes an effective strategy for bearing-signal denoising and diagnosis under complex operating conditions.

Our main contributions are threefold.

We introduce a physics-informed regression branch that encodes kinematics-consistent priors (impulse trains, resonance bands, modulation sidebands) to guide denoising.
We couple these priors with a residual denoiser and manifold learning to preserve diagnostically critical structures while suppressing diverse noise.
We provide comprehensive validation on CWRU, demonstrating consistent gains in both denoising quality and fault-diagnosis performance over strong baselines.

The remaining part of the paper is organized as follows. Section 2 details the simulation of physical characteristics in motor vibration signals and presents the architecture of the proposed PDRNet, including the residual main network and the regression branch, together with the quantitative evaluation protocol. Section 3 reports experimental setups and results, including denoising performance, periodicity score analysis, and fault-diagnosis accuracy. Section 4 concludes with a summary and discussion.

2. Methodology

The theoretical foundation of this work lies in the premise that vibration signals originate from the periodic kinematic interactions of gears under real-world operating conditions, thereby exhibiting distinct and physically interpretable patterns []. These patterns—including meshing harmonics, amplitude modulation, sideband structures, and transient impulses—are intrinsically governed by the mechanical properties and dynamic behaviors of the rotating components []. Consequently, it is both feasible and advantageous to exploit these inherent physical characteristics as prior knowledge to guide, regularize, and correct the learning process of denoising models. In this way, the denoising objective transcends the conventional goals of minimizing loss functions or improving signal-to-noise ratio (SNR); instead, it emphasizes the reconstruction of signals that are physically consistent and diagnostically meaningful.

To implement this concept, we propose a modified denoising architecture based on an encoder–decoder framework, augmented with a dedicated regression branch explicitly designed to capture and preserve fault-related features. The effectiveness of the proposed approach is rigorously assessed using multiple complementary metrics, including SNR, periodicity retention rate, and fault-diagnosis accuracy, thereby ensuring that both quantitative performance and functional diagnostic value are comprehensively validated.

2.1. Simulation of Motor Vibration Signals

Based on the theoretical foundation, we develop a comprehensive simulation framework specifically designed for motor vibration signals. In real bearing systems, clean vibration signals are fundamentally composed of two distinct components: normal operational vibrations and fault-induced vibrations. The clean signal can be mathematically expressed as

x_{c l e a n} (t) = x_{n o r m a l} (t) + x_{f a u l t} (t)

(1)

where

x_{n o r m a l} (t)

represents normal operational vibrations arising from periodic rotational motion, and

x_{f a u l t} (t)

denotes vibrations generated by bearing defects. Normal operational vibrations in rotating machinery originate from the fundamental periodic motion of the motor shaft and its mechanical components. During rotation, various mechanical phenomena including shaft rotation, mechanical imbalances, electromagnetic forces contribute to the vibration spectrum. These characteristics are modeled using sinusoidal functions.

x_{n o r m a l} (t) = A sin (2 π \cdot f_{r} \cdot t)

(2)

where A represent the amplitudes of the fundamental,

f_{r} = R P M / 60

Hz represent the base rotational frequency. Bearing faults create distinctive vibration signatures through periodic contact between defective surfaces and rolling elements, which can typically be categorized into three main types: inner race faults (BPFI), outer race faults (BPFO), and rolling element faults (BSF) []. Each fault type exhibits unique vibration characteristics, typically manifested as periodic impulses and their corresponding frequency components in the vibration spectrum. Specifically, these fault-related frequencies—BPFI, BPFO, and BSF—are functions of the bearing geometry and rotational speed, and are commonly used as diagnostic indicators in both time and frequency domain analyses.

Each fault location’s vibration frequency is determined by bearing geometry and operating speed, enabling fault identification through frequency analysis. To accurately capture these transient impact characteristics, we employ exponentially decaying sinusoidal functions that model the physical dissipation of impact energy:

x_{f a u l t} (t) = A_{f a u l t} \cdot e^{- λ (t mod T)} \cdot sin (2 π \cdot f_{f a u l t} \cdot t)

(3)

where indexes different fault types, represents harmonic order,

A_{f a u l t}

and

f_{f a u l t}

denotes the fault severity amplitude and frequency respectively,

λ

is the exponential decay constant modeling impact dissipation,

T = 1 / f_{f a u l t}

represents the period. The exponential decay term

e^{- λ (t mod T)}

simulates the transient nature of bearing impacts, where each impact rapidly dissipates before the subsequent occurrence. In practical bearing condition monitoring applications, measured vibration signals inevitably consist of clean signal components contaminated by various noise sources. The complete measured signal can be expressed as

x_{r e a l} (t) = x_{c l e a n} (t) + x_{n o i s e} (t)

(4)

The noise component

x_{r e a l} (t)

encompasses multiple sources of interference commonly encountered in industrial environments: measurement noise from sensors and data acquisition systems introduces electronic artifacts, environmental interference from adjacent machinery and structural vibrations creates background contamination, and process variations in motor loading and operating conditions contribute to stochastic fluctuations. To model these diverse noise sources, additive white Gaussian noise (AWGN) is employed:

x_{n o i s e} (t) \sim N (0, σ^{2})

(5)

2.2. Proposed Architecture

We propose a novel signal denoising framework that augments a conventional encoder–decoder architecture with an auxiliary regression branch, thereby enhancing denoising performance through explicit clean-signal simulation. The key components of the proposed approach are summarized as follows:

Data Pre-Processing. The raw dataset is segmented into non-overlapping windows of 900 sampling points per data sample. To exploit spatial convolution operations and improve contextual understanding, the one-dimensional time-series signal is reshaped into a two-dimensional representation. Specifically, each 900-point sequence is rearranged into a serpentine pattern, forming a $30 \times 30$ matrix in which temporal adjacency is preserved, as illustrated in Figure 1. This data preprocessing method has been proven to have a significant effect on TEM signals []. This mapping strategy maintains the temporal continuity of the original signal while enabling convolutional layers to capture both local and global contextual dependencies. Following the forward pass, the resulting $30 \times 30$ feature map is flattened back into the one-dimensional format.

Figure 1. The schematic diagram of data preproccessing.
Dilated Convolutions and Residual Learning. To enlarge the receptive field without increasing the number of parameters or reducing spatial resolution, dilated convolutions are employed at both the encoder’s input stage and the decoder’s output stage [], thereby capturing multi-scale contextual information efficiently (Figure 2a). In parallel, we adopt a deep residual learning framework comprising three types of residual blocks, which facilitate easier training and improved performance by allowing gradients to flow more effectively through the network, mitigating the vanishing gradient problem []. Each block integrates three convolutional layers followed by a residual connection. This design facilitates the modeling of complex noise structures and signal characteristics that require multi-level abstraction, ultimately improving the network’s representational capacity in challenging denoising scenarios.

Figure 2. PDRNet. (a) Description of the overall model structure consists of an encoder, a decoder and a regression branch. (b) The structural details of the regression branch.
Multi-Level Skip Connections. To preserve fine-grained information during reconstruction, skip connections are established at three hierarchical levels []. Specifically, intermediate features are extracted after Dilated-Conv1 (32 channels), Dilated-Conv2 (64 channels), and the first ResBlockV1 (128 channels) in the encoder. These features are progressively fused in the decoder through concatenation operations, followed by $1 \times 1$ convolutional layers (skip-conv) that adaptively integrate multi-resolution information. By retaining high-resolution details lost in conventional encoder–decoder structures, this design substantially improves reconstruction fidelity.
Regression Branch Innovation. An auxiliary regression branch is introduced to explicitly model the underlying clean-signal characteristics. Unlike conventional methods that rely solely on reconstruction loss between noisy input and denoised output, the regression branch extracts intermediate encoder features and processes them to generate a simulated clean-signal representation. The branch consists of a residual block, a convolutional layer with ELU activation, and a fully connected network that outputs physical feature parameters (Figure 2b). This additional pathway provides strong physical priors, ensuring that the denoising process respects the inherent dynamics of the original signal.

2.3. Performance Evaluation Methodology

To comprehensively evaluate the effectiveness of the proposed physical feature prior–guided denoising model, we adopt a multi-dimensional assessment framework that encompasses signal quality indicators, physical feature preservation analysis, and practical diagnostic performance.

2.3.1. Signal Quality Assessment

The primary metric for evaluating denoising performance is the Signal-to-Noise Ratio (

S N R

), which quantifies the improvement in signal clarity achieved after denoising. It is defined as

S N R = 10 {log}_{10} (\frac{P_{s i g n a l}}{P_{n o i s e}})

(6)

where

P_{s i g n a l}

denotes the power of the clean signal, and

P_{n o i s e}

represents the power of the residual noise.

2.3.2. Physical Feature Preservation Assessment

In addition to signal quality, it is critical to evaluate how effectively the denoising process preserves the intrinsic physical characteristics of bearing vibration signals. To this end, we adopt a comprehensive periodicity analysis framework that integrates multiple complementary indicators:

Autocorrelation Peak Analysis. This metric measures the strength of periodic components by computing the maximum normalized autocorrelation coefficient:

A_{s c o r e} = max (\frac{R_{x x} (τ)}{R_{x x} (0)})

(7)

where

R_{x x} (τ)

is the autocorrelation function at lag

τ

.

Spectrum Peak Ratio. This indicator quantifies the dominance of the fundamental frequency component relative to the average spectral energy:

S P_{s c o r e} = \frac{|X (f_{m a i n})|}{\frac{1}{N} \sum_{k = 1}^{N / 2} |X (f_{k})|}

(8)

where

|X (f_{m a i n})|

denotes the amplitude at the main frequency, and N is the number of frequency bins.

Spectral Entropy. This metric evaluates the concentration of spectral energy, with lower values indicating stronger periodicity:

S E_{s c o r e} = - \sum_{k = 1}^{N / 2} P (f_{k}) {log}_{2} P (f_{k})

(9)

where

P (f_{k})

is the normalized power spectral density.

Periodicity Consistency. This indicator measures waveform similarity across consecutive periods using Pearson correlation coefficients:

P C_{s c o r e} = \frac{1}{M - 1} \sum_{i = 1}^{M - 1} ρ (T_{i}, T_{i + 1})

(10)

where M is the number of complete signal periods,

T_{i}

and

T_{i + 1}

denote adjacent periods, and

ρ (T_{i}, T_{i + 1})

is their Pearson correlation coefficient.

Finally, an Overall Periodicity Score is computed as a weighted combination of the above indicators:

P e r i o d i c i t y - S c o r e = 0.4 \times A_{s c o r e} + 0.3 \times S P_{s c o r e} + 0.2 \times S E_{s c o r e} + 0.1 \times P C_{s c o r e}

(11)

This comprehensive evaluation system ensures that the denoising process not only suppresses noise but also preserves the inherent periodic features that are fundamental for accurate bearing fault diagnosis.

2.3.3. Evaluation of Diagnostic Performance

To evaluate the practical utility of denoised signals, we primarily use the F1 score to assess fault detection performance. The F1 score is particularly suitable for bearing fault diagnosis due to its robustness against class imbalance, which is common in industrial datasets where normal operating conditions typically outnumber fault conditions. For each fault class, the F1 score is calculated as

F 1_s c o r e = \frac{2 \cdot p r e c i s i o n \cdot r e c a l l}{p r e c i s i o n + r e c a l l}

(12)

where

\{\begin{matrix} p r e c i s i o n = \frac{T P}{T P + F P} \\ r e c a l l = \frac{T P}{T P + F N} \end{matrix}

(13)

and

T P

,

F P

,

F N

represent true positives, false positives, and false negatives, respectively.

3. Results

3.1. Dataset and Experiment Setup

The motor vibration data used in this study are sourced from the well-established CWRU Bearing Dataset. Data were collected using accelerometers mounted at the drive end and the fan end of the motor, with sampling frequencies of 12 kHz and 48 kHz, respectively. In this work, we specifically utilize the drive-end signals sampled at 48 kHz. Following the segmentation strategy described in Section 2.2, a total of 25,798 samples were obtained and subsequently divided into training and validation sets with a ratio of 9:1 for the main denoising experiments. When evaluating fault diagnosis performance, the dataset contained vibration signals corresponding to four health conditions: normal, inner race fault, outer race fault, and ball fault, which were approximately equally represented in the diagnostic dataset. After deleting and padding zeros for the data that is not a multiple of signal length we set, which is 900, the total number of samples for each class used in the diagnostic experiments is: 6693 samples for ball faults, 6490 samples for inner race faults, 8740 samples for outer race faults and 3886 samples for normal. Each fault type was induced with three defect sizes (0.007, 0.014, and 0.021 inches in diameter), and vibration data were recorded under motor loads of 0, 1, 2, and 3 horsepower. Based on different operating conditions and defect sizes, the dataset is categorized into ten distinct classes, as summarized in Table 2.

Table 2. The composition of our dataset, in which the data is derived from the 48k sampling rate data in the CWRU dataset. Each class number corresponds to a unique fault type and severity level used in this study.

3.2. Experiments

To assess the generalization capability of the proposed PDRNet in motor vibration signal denoising tasks, we design a series of experiments targeting different influencing factors. Specifically, these experiments verify (i) the superior performance of PDRNet compared with existing denoising models, (ii) the contribution of the physics-informed regression branch to the model’s denoising ability, and (iii) the critical role of the regression branch in preserving the physical features of the denoised signals.

To better quantify denoising performance, we construct a composite loss function comprising the mean squared error (MSE) from the decoder, denoted as

L o s s_{d e c o d e r}

, and that from the regression branch, denoted as

L o s s_{e x p}

. The MSE is defined as

M S E = \frac{1}{N} \sum_{i = 1}^{N} {(y_{i} - {\hat{y}}_{i})}^{2}

(14)

where

y_{i}

denotes the original clean signal,

{\hat{y}}_{i}

represents the denoised signal, and N is the total number of samples. The overall loss is then obtained as a weighted combination:

L o s s_{t o t a l} = α \cdot L o s s_{d e c o d e r} + β \cdot L o s s_{e x p}

(15)

where

α

and

β

are weighting coefficients.

The experiments are conducted on a 64-bit Windows 11 operating system, using Python 3.7.1 as the programming language and PyTorch 1.10.0+cu113 as the deep learning framework. Visual Studio Code 1.89.1 serves as the integrated development environment (IDE), while the hardware platform is a laptop equipped with an NVIDIA GeForce RTX 3060 GPU. After multiple rounds of training and evaluation, the final set of hyperparameters is determined, as summarized in Table 3.

Table 3. Model hyperparameter configuration.

To achieve optimal model performance, we adopted an empirical tuning strategy. Initial hyperparameter values were selected based on commonly used configurations in deep learning for signal processing, and subsequently refined by monitoring the model’s performance on the validation set. The number of training epochs was set to 200, as we observed that the validation loss and performance metrics plateaued around this value. In particular, the weighting coefficients for the loss terms were set to

α = 0.7

and

β = 0.3

. This configuration emphasizes the contribution of the regression branch, which captures essential physical features, while still preserving the complex structural components of real-world signals through the encoder–decoder pathway.

Under the full dataset and standard training settings, the total training time for PDRNet was approximately 5 h, with each epoch requiring around 90 s on average.

It’s worth noting that we adopted Cosine Annealing learning rate scheduler to ensure stable and efficient convergence during training []. This strategy gradually reduces the learning rate following a cosine curve from an initial maximum value to a minimum, enabling smoother convergence and helping the model escape shallow local minima in the early stages. Compared to traditional step decay or exponential decay, cosine annealing provides non-monotonic and continuous decay, which often improves convergence stability and prevents premature stagnation []. Step decay reduces the learning rate at fixed intervals, which can cause abrupt changes in training dynamics. Exponential decay decreases the learning rate continuously but lacks the restart capability offered by cosine annealing. In our experiments, cosine annealing was proved to yield more stable training and better generalization across various noise levels without requiring extensive manual tuning of hyperparameters.

3.2.1. Denoising Capability Assessment

Based on the results summarized in Table 4, PDRNet consistently outperforms all competing models across different noise intensity levels (−3 dB, 0 dB, and 3 dB). The improvement is particularly pronounced when the regression branch (Reg-branch) is incorporated, which markedly enhances the Signal-to-Noise Ratio (SNR). Specifically, PDRNet achieves an SNR of 7.817 at −3 dB, 9.401 at 0 dB, and 12.51 at 3 dB, all of which clearly surpass the corresponding values of other models under identical conditions. Even without the regression branch, PDRNet still demonstrates competitive performance, although the SNR values are reduced compared with the Reg-branch configuration.

Table 4. The noise reduction capabilities of different models for signals with different noise intensities, with or without the regression branch.

In contrast, DnCNN, while effective in mitigating noise, fails to reach the performance level of PDRNet, especially when the regression branch is applied. FFDNet exhibits relatively stable performance across different noise levels but remains inferior to PDRNet. U_Net struggles under low signal-to-noise ratio conditions, resulting in unsatisfactory denoising outcomes. RND maintains stable performance across varying noise intensities, yet it also falls short of PDRNet in overall effectiveness. Collectively, these results confirm that PDRNet, particularly with the regression branch, represents the most effective solution for denoising motor vibration signals under diverse noise conditions.

To further examine the role of the regression branch, we incorporated it into encoder–decoder architectures based on DnCNN and FFDNet, as shown in Figure 3. Interestingly, the results show that the regression branch actually degraded their performance. For example, at 0 dB, DnCNN achieved an SNR of 8.939 without the regression branch but only 8.768 with it, while FFDNet produced an SNR of 8.961 without the branch compared with 8.940 when it was added. This observation suggests that the regression branch, grounded in physical feature priors, is not universally compatible with all model structures. In these modified networks, the encoding process appears unable to adequately capture the underlying data features, thereby preventing the regression branch from effectively extracting the relevant physical characteristics.

Figure 3. The schematic diagram of modified architecture. The symbol ⊗ represents the merge block.

Figure 4 provides a visual comparison of the denoising results across different working conditions obtained using different denoising models, further illustrating the advantages of PDRNet with the regression branch over the benchmark methods.

Figure 4. The visualized results of different denoising models, including DnCNN, FFDNet and PDRNet when motor operating under different fault conditions. Row (a), row (b), row (c), row (d) represent Ball fault, InnerRace fault, OuterRace fault and Normal condition respectively. The signals shown in the figure are data collected from our divided dataset, specifically at the 500–625 sampling points. The x-axis represents the sampling points, while the y-axis represents the signal amplitude.

3.2.2. Periodic Assessment of Noise-Reduced Signals

To objectively evaluate the ability of different denoising models to restore the underlying rotational regularity of motor vibration signals, we adopt the periodicity score as a key performance metric. This score ranges from 0 to 1, with higher values indicating stronger and more coherent periodic structures within the signal. By integrating features from both the time and frequency domains, the periodicity score provides a comprehensive measure of signal integrity.

To illustrate the relationship between periodicity and autocorrelation peaks, representative autocorrelation functions from four operating conditions (Ball fault, Inner Race fault, Outer Race fault, and Normal condition) are presented in Figure 5. The results clearly show that signals processed by the proposed method exhibit substantially higher autocorrelation peak values in all working cinditions compared with those produced by competing models. A higher peak reflects better preservation and enhancement of the intrinsic periodic structure, with our approach achieving a peak value of 0.694 in Ball fault condition. In contrast, the other models yield lower peak values, suggesting that residual noise or distortion persists in their outputs, thereby disrupting periodicity and weakening alignment across signal cycles. This finding underscores the superior capability of our method in recovering the cyclic patterns of vibration signals, which is essential for reliable condition monitoring and accurate fault diagnosis.

Figure 5. The autocorrelation function graph of the signal through different denoising model. Four rows represent Ball fault, Inner Race fault, Outer Race fault and Normal condition respectively. Column (a) represent clean signal, while column (b–d) represent noise signal processed by denoising model PDRNet, RND and DnCNN respectively. The red dotted line indicates the point where the signal reaches the peak of autocorrelation when the period is 18 or 19.

To further assess performance under varying noise intensities, we compute the periodicity score for signals processed by each model, as summarized in Figure 6. The results consistently highlight the significant advantage of PDRNet in preserving the core characteristics of motor vibration signals. For instance, at a noise level of 3 dB, our model achieves a periodicity score of 0.681, compared with 0.650 for DnCNN, 0.650 for FFDNet, 0.637 for U_Net, and 0.649 for RND. Importantly, the physical feature regression branch plays a pivotal role in maintaining periodic structure, as evidenced by the reduced score of 0.662 for PDRNet_brief, the variant without this branch. This result emphasizes the necessity of incorporating physical feature priors to effectively preserve the periodic structure and overall integrity of motor vibration signals, thereby demonstrating the substantial benefits of the proposed approach for real-world industrial applications.

Figure 6. Periodic scores of different models under different SNR. PDRNet_brief is PDRNet without regression branch.

3.2.3. Bearing Fault Diagnosis

To examine whether the proposed denoising model improves bearing fault detection under realistic working conditions, we used three widely used diagnostic models: WDCNN [], CNN-LSTM [], and ResNet [] to evaluate the classification performance of signals processed by different denoising methods. The F1-scores of signals with varying noise levels under these diagnostic frameworks are reported in Table 5. Overall, PDRNet consistently outperforms competing denoising models across multiple noise levels and diagnostic architectures.

Table 5. Under different fault diagnosis models, a comparison of the bearing fault diagnosis performances of the signals after going through different denoising models.

When WDCNN was used as the diagnostic model, PDRNet achieved accuracies of 86.87% at −3 dB, 94.69% at 0 dB, and 97.33% at 3 dB. With CNN-LSTM, the corresponding results were 69.93%, 87.14%, and 94.54%. Using ResNet, PDRNet reached 87.06% at −3 dB, 94.46% at 0 dB, and 97.13% at 3 dB. Notably, when ResNet was used as the diagnostic model and the input contained 0 dB noise, DnCNN without the regression branch slightly outperformed PDRNet, achieving an accuracy of 94.54%. Furthermore, as shown in Table 5, the denoising performance of DnCNN and FFDNet actually deteriorated after introducing the regression branch, and the same trend was observed in their diagnostic results. This finding indicates that the regression branch requires input signals containing meaningful feature content to accurately capture and reconstruct the physical characteristics of vibration signals.

Among the three diagnostic models, ResNet demonstrated the best overall performance, achieving 97.13% accuracy at 3 dB and maintaining robust fault detection even under severe noise conditions, with 87.06% accuracy at −3 dB. By contrast, CNN-LSTM exhibited the weakest diagnostic capability, particularly in the presence of substantial noise. For instance, when combined with PDRNet for signals at −3 dB, the highest accuracy achieved was only 69.93%.

4. Conclusions

In conclusion, this study presents a novel denoising framework that seamlessly integrates physical feature priors, offering a robust approach to noise suppression while preserving the integrity of diagnostically critical signal characteristics. The framework incorporates a regression branch constructed upon convolutional and residual neural networks, which guides the denoising process to ensure that key physical features are retained while noise is effectively eliminated. Experimental evaluations demonstrate that the proposed method not only outperforms conventional denoising techniques in terms of accuracy but also substantially enhances the performance of bearing fault diagnosis systems. By leveraging physical feature priors, the model transcends the traditional objective of minimizing loss functions, enabling it to capture the inherent physical properties of bearing signals. Specifically, the proposed PDRNet, which integrates the physical feature regression branch and residual network, achieves notable improvements in signal-to-noise ratio (SNR), with gains of 7.817, 9.401, and 12.51 dB at −3 dB, 0 dB, and 3 dB noise levels, respectively. These results consistently surpass those of established denoising networks such as DnCNN, FFDNet, U_Net, and RND. Among these, RND demonstrates relatively close performance, with SNR values of 7.398, 9.305, and 12.432 dB under the same noise conditions. However, when the regression branch is removed from PDRNet, the denoising performance declines, yielding SNRs of only 7.032, 9.176, and 12.257 dB.

In addition, periodicity analysis further highlights the role of the regression branch. Under a 3 dB noise level, PDRNet with the regression branch achieves a periodicity score of 0.681, whereas its counterpart without the branch yields a slightly lower score of 0.662. This difference underscores the critical importance of incorporating physical feature priors in preserving the intrinsic cyclic structures of bearing signals. Overall, the proposed framework provides a reliable and effective solution for denoising bearing signals in industrial environments characterized by high variability and complex noise. Beyond improving denoising accuracy, this advancement holds significant potential for enhancing predictive maintenance strategies, improving condition monitoring reliability, and deepening the understanding of machine health, thereby contributing to more efficient and proactive industrial operations.

Nevertheless, this study also has certain limitations. The proposed model has been validated only on the CWRU benchmark dataset, which may not fully capture the complexity of real-world industrial environments. Moreover, the computational cost associated with deep model training also limits its direct deployment in resource-constrained embedded systems. Future research will focus on expanding the evaluation to field-collected vibration datasets under various conditions, introducing domain adaptation to enhance generalization, and exploring lightweight model designs for real-time industrial applications and deployment on edge devices. Furthermore, it would be beneficial to integrate multimodal sensory data, such as acoustic, current, and temperature signals, to further improve diagnostic robustness and interpretability.

Author Contributions

Conceptualization, K.Y., Z.L. and F.L.; methodology, K.Y., M.Y. and Z.L.; software, K.Y. and X.W.; validation, K.Y.; formal analysis, M.Y. and F.L.; investigation, X.W.; resources, X.W. and F.L.; data curation, K.Y. and Z.L.; writing—original draft preparation, K.Y.; writing—review and editing, K.Y., X.W. and F.L.; visualization, K.Y. and Z.L.; supervision, M.Y. and F.L.; project administration, X.W. and F.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data and source code used in the paper can be accessed by contacting the authors.

Acknowledgments

During the preparation of this manuscript, the authors used ChatGPT-5 for the purposes of editing certain paragraphs and correcting grammatical errors and inappropriate words in Section 2. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflicts of interest.

References

McInerny, S.A.; Dai, Y. Basic vibration signal processing for bearing fault detection. IEEE Trans. Educ. 2003, 46, 149–156. [Google Scholar] [CrossRef]
Li, Y.; Cheng, G.; Liu, C. Research on bearing fault diagnosis based on spectrum characteristics under strong noise interference. Measurement 2021, 169, 108509. [Google Scholar] [CrossRef]
Bučinskas, V.; Mirzaei, S.; Kirchner, K. Some aspects of bearing noise generation. Solid State Phenom. 2010, 164, 278–284. [Google Scholar] [CrossRef]
Lou, X.; Loparo, K.A. Bearing fault diagnosis based on wavelet transform and fuzzy inference. Mech. Syst. Signal Process. 2004, 18, 1077–1095. [Google Scholar] [CrossRef]
Liang, P.; Wang, W.; Yuan, X.; Liu, S.; Zhang, L.; Cheng, Y. Intelligent fault diagnosis of rolling bearing based on wavelet transform and improved ResNet under noisy labels and environment. Eng. Appl. Artif. Intell. 2022, 115, 105269. [Google Scholar] [CrossRef]
Lu, J.; Jia, B.; Li, S.; Gong, S. A noise reduction method of rolling bearing based on empirical wavelet transform and adaptive time frequency peak filtering. Meas. Sci. Technol. 2023, 34, 125146. [Google Scholar] [CrossRef]
Faysal, A.; Ngui, W.K.; Lim, M. Noise eliminated ensemble empirical mode decomposition for bearing fault diagnosis. J. Vib. Eng. Technol. 2021, 9, 2229–2245. [Google Scholar] [CrossRef]
Golafshan, R.; Sanliturk, K.Y. SVD and Hankel matrix based de-noising approach for ball bearing fault detection and its assessment using artificial faults. Mech. Syst. Signal Process. 2016, 70, 36–50. [Google Scholar] [CrossRef]
Cui, L.; Liu, Y.; Zhao, D. Adaptive singular value decomposition for bearing fault diagnosis under strong noise interference. Meas. Sci. Technol. 2022, 33, 095002. [Google Scholar] [CrossRef]
Hoang, D.T.; Kang, H.J. A survey on deep learning based bearing fault diagnosis. Neurocomputing 2019, 335, 327–335. [Google Scholar] [CrossRef]
Fan, W.; Chen, Z.; Li, Y.; Zhu, F.; Xie, M. A reinforced noise resistant correlation method for bearing condition monitoring. IEEE Trans. Autom. Sci. Eng. 2022, 20, 995–1006. [Google Scholar] [CrossRef]
Dong, G.; Chen, J. Noise resistant time frequency analysis and application in fault diagnosis of rolling element bearings. Mech. Syst. Signal Process. 2012, 33, 212–236. [Google Scholar] [CrossRef]
MacDonald, V.H.; Schultheiss, P.M. Optimum passive bearing estimation in a spatially incoherent noise environment. J. Acoust. Soc. Am. 1969, 46, 37–43. [Google Scholar] [CrossRef]
Li, B.; Chow, M.Y.; Tipsuwan, Y.; Hung, J.C. Neural-network-based motor rolling bearing fault diagnosis. IEEE Trans. Ind. Electron. 2002, 47, 1060–1069. [Google Scholar] [CrossRef]
Wang, B.; Ding, C. An adaptive signal denoising method based on reweighted SVD for the fault diagnosis of rolling bearings. Sensors 2025, 25, 2470. [Google Scholar] [CrossRef]
Jiang, Q.; Chang, F.; Sheng, B. Bearing fault classification based on convolutional neural network in noise environment. IEEE Access 2019, 7, 69795–69807. [Google Scholar] [CrossRef]
Pancaldi, F.; Dibiase, L.; Cocconcelli, M. Impact of noise model on the performance of algorithms for fault diagnosis in rolling bearings. Mech. Syst. Signal Process. 2023, 188, 109975. [Google Scholar] [CrossRef]
Shutin, D.; Kazakov, Y.; Stebakov, I.; Savin, L. Data-driven and physics-informed approaches for improving the performance of dynamic models of fluid film bearings. Tribol. Int. 2024, 191, 109136. [Google Scholar] [CrossRef]
Zhang, X.; Fu, X.; Teng, D.; Dong, C.; Vijayakumar, K.; Zhang, J.; Chowdhury, R.R.; Han, J.; Hong, D.; Kulkarni, R.; et al. Physics-informed data denoising for real-life sensing systems. In Proceedings of the 21st ACM Conference on Embedded Networked Sensor Systems, Istanbul, Turkiye, 12–17 November 2023; pp. 83–96. [Google Scholar]
Peng, D.; Yazdanianasr, M.; Mauricio, A.; Verwimp, T.; Desmet, W.; Gryllias, K. Physics-driven cross domain digital twin framework for bearing fault diagnosis in non-stationary conditions. Mech. Syst. Signal Process. 2025, 228, 112266. [Google Scholar] [CrossRef]
Yuan, H.S.; Chen, S.B.; Luo, B.; Huang, H.; Li, Q. Multi-branch bounding box regression for object detection. Cogn. Comput. 2023, 15, 1300–1307. [Google Scholar] [CrossRef]
Li, C.; Tan, Y.Y.; He, Y.; Ren, J.; Bai, R.; Zhao, Y.; Yu, H.; Jiang, X. DARR: A dual-branch arithmetic regression reasoning framework for solving machine number reasoning. In Proceedings of the AAAI Conference on Artificial Intelligence, Philadelphia, PA, USA, 25 February–4 March 2025; Volume 39, pp. 1373–1382. [Google Scholar]
Shao, H.; Jiang, H.; Wang, F.; Zhao, H. An enhancement deep feature fusion method for rotating machinery fault diagnosis. Knowl.-Based Syst. 2017, 119, 200–220. [Google Scholar] [CrossRef]
Tamilselvan, P.; Wang, P. Failure diagnosis using deep belief learning based health state classification. Reliab. Eng. Syst. Saf. 2013, 115, 124–135. [Google Scholar] [CrossRef]
Zhang, S.; Zhang, S.; Wang, B.; Habetler, T.G. Deep learning algorithms for bearing fault diagnostics—A comprehensive review. IEEE Access 2020, 8, 29857–29881. [Google Scholar] [CrossRef]
Qin, Y. A new family of model-based impulsive wavelets and their sparse representation for rolling bearing fault diagnosis. IEEE Trans. Ind. Electron. 2017, 65, 2716–2726. [Google Scholar] [CrossRef]
Shoshani, O.; Shaw, S.W. Resonant modal interactions in micro/nano-mechanical structures. Nonlinear Dyn. 2021, 104, 1801–1828. [Google Scholar] [CrossRef]
Krylov, S.; Gerson, Y.; Nachmias, T.; Keren, U. Excitation of large-amplitude parametric resonance by the mechanical stiffness modulation of a microstructure. J. Micromech. Microeng. 2009, 20, 015041. [Google Scholar] [CrossRef]
Neupane, D.; Seok, J. Bearing fault detection and diagnosis using case western reserve university dataset with deep learning approaches: A review. IEEE Access 2020, 8, 93155–93178. [Google Scholar] [CrossRef]
Hendriks, J.; Dumond, P.; Knox, D. Towards better benchmarking using the CWRU bearing fault dataset. Mech. Syst. Signal Process. 2022, 169, 108732. [Google Scholar] [CrossRef]
Goyal, D.; Pabla, B. The vibration monitoring methods and signal processing techniques for structural health monitoring: A review. Arch. Comput. Methods Eng. 2016, 23, 585–594. [Google Scholar] [CrossRef]
Feng, Z.; Zuo, M.J. Vibration signal models for fault diagnosis of planetary gearboxes. J. Sound Vib. 2012, 331, 4919–4939. [Google Scholar] [CrossRef]
Chen, K.; Pu, X.; Ren, Y.; Qiu, H.; Lin, F.; Zhang, S. TEMDNet: A novel deep denoising network for transient electromagnetic signal with signal-to-image transformation. IEEE Trans. Geosci. Remote Sens. 2020, 60, 1–18. [Google Scholar] [CrossRef]
Yu, F.; Koltun, V. Multi-scale context aggregation by dilated convolutions. arXiv 2015, arXiv:1511.07122. [Google Scholar]
He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
Zhang, L.; Zhang, J.; Shen, P.; Zhu, G.; Li, P.; Lu, X.; Zhang, H.; Shah, S.A.; Bennamoun, M. Block level skip connections across cascaded V-Net for multi-organ segmentation. IEEE Trans. Med Imaging 2020, 39, 2782–2793. [Google Scholar] [CrossRef] [PubMed]
Loshchilov, I.; Hutter, F. Sgdr: Stochastic gradient descent with warm restarts. arXiv 2016, arXiv:1608.03983. [Google Scholar]
Ge, R.; Kakade, S.M.; Kidambi, R.; Netrapalli, P. The step decay schedule: A near optimal, geometrically decaying learning rate procedure for least squares. In Proceedings of the 33rd International Conference on Neural Information Processing Systems, Vancouver, BC, Canada, 8–14 December 2019; Volume 32. [Google Scholar]
Gao, Y.; Kim, C.H.; Kim, J.M. A novel hybrid deep learning method for fault diagnosis of rotating machinery based on extended WDCNN and long short-term memory. Sensors 2021, 21, 6614. [Google Scholar] [CrossRef]
Chen, X.; Zhang, B.; Gao, D. Bearing fault diagnosis base on multi-scale CNN and LSTM model. J. Intell. Manuf. 2021, 32, 971–987. [Google Scholar] [CrossRef]
Wen, L.; Li, X.; Gao, L. A transfer convolutional neural network for fault diagnosis based on ResNet-50. Neural Comput. Appl. 2020, 32, 6111–6124. [Google Scholar] [CrossRef]

Figure 1. The schematic diagram of data preproccessing.

Figure 2. PDRNet. (a) Description of the overall model structure consists of an encoder, a decoder and a regression branch. (b) The structural details of the regression branch.

Figure 3. The schematic diagram of modified architecture. The symbol ⊗ represents the merge block.

Figure 4. The visualized results of different denoising models, including DnCNN, FFDNet and PDRNet when motor operating under different fault conditions. Row (a), row (b), row (c), row (d) represent Ball fault, InnerRace fault, OuterRace fault and Normal condition respectively. The signals shown in the figure are data collected from our divided dataset, specifically at the 500–625 sampling points. The x-axis represents the sampling points, while the y-axis represents the signal amplitude.

Figure 5. The autocorrelation function graph of the signal through different denoising model. Four rows represent Ball fault, Inner Race fault, Outer Race fault and Normal condition respectively. Column (a) represent clean signal, while column (b–d) represent noise signal processed by denoising model PDRNet, RND and DnCNN respectively. The red dotted line indicates the point where the signal reaches the peak of autocorrelation when the period is 18 or 19.

Figure 6. Periodic scores of different models under different SNR. PDRNet_brief is PDRNet without regression branch.

Table 1. Comparison between existing vibration signal denoising methods and the proposed PDRNet.

Method	Physics-Driven	Residual Net	Signal Feature Preservation
Traditional methods	✗	✗	✗
DnCNN	✗	✗	Partial
FFDNet	✗	✗	Partial
RND	✗	✓	Partial
PDRNet (our model)	✓	✓	✓

Table 2. The composition of our dataset, in which the data is derived from the 48k sampling rate data in the CWRU dataset. Each class number corresponds to a unique fault type and severity level used in this study.

Class No.	Defect Size (Inch)	Fault Type	Number
0	0.007	Ball	2231
1	0.007	Inner-race	2161
2	0.007	Outer-race	2913
3	0.014	Ball	2233
4	0.014	Inner-race	2163
5	0.014	Outer-race	2913
6	0.021	Ball	2229
7	0.021	Inner-race	2166
8	0.021	Outer-race	2914
9	0	Normal	3886

Table 3. Model hyperparameter configuration.

Hyperparameter	Value
Batch size	128
Number of epochs	200
Learning rate scheduler	Cosine Annealing
Optimizer	Adam
Initial learning rate	0.001
$α$	0.7
$β$	0.3

Table 4. The noise reduction capabilities of different models for signals with different noise intensities, with or without the regression branch.

Model	If Reg-Branch	SNR (dB)
Model	If Reg-Branch	−3 dB	0 dB	3 dB
PDRNet	✓	7.817	9.401	12.51
PDRNet	✗	7.032	9.176	12.257
DnCNN	✓	6.724	8.768	11.48
DnCNN	✗	6.81	8.939	12.081
FFDNet	✓	7.136	8.94	11.903
FFDNet	✗	7.463	8.961	11.97
U_net	✓	6.38	8.093	11.02
U_net	✗	6.491	8.354	11.337
RND	✓	7.549	8.98	12.491
RND	✗	7.398	9.305	12.432

Note: The bold numbers represent the best performance for each SNR level. The meaning is consistent across all tables.

Table 5. Under different fault diagnosis models, a comparison of the bearing fault diagnosis performances of the signals after going through different denoising models.

Model		WDCNN			CNN-LSTM			ResNet
SNR (dB)		−3	0	3	−3	0	3	−3	0	3
Denoising Model	If Reg-Branch	F1-Score (%)
PDRNet	✓	86.87 ± 0.22	94.69 ± 0.44	97.33 ± 0.37	69.93 ± 0.36	87.14 ± 0.11	94.54 ± 0.43	87.06 ± 0.15	94.46 ± 0.26	97.63 ± 0.17
PDRNet	✗	84.11 ± 0.22	89.35 ± 0.23	93.79 ± 0.41	62.47 ± 0.35	85.68 ± 0.47	90.53 ± 0.24	86.92 ± 0.47	92.24 ± 0.19	95.16 ± 0.25
DnCNN	✓	77.40 ± 0.28	92.84 ± 0.37	93.65 ± 0.11	50.72 ± 0.42	85.57 ± 0.34	90.99 ± 0.27	79.19 ± 0.25	93.31 ± 0.13	95.22 ± 0.49
DnCNN	✗	77.22 ± 0.14	93.18 ± 0.17	96.82 ± 0.38	51.49 ± 0.46	86.90 ± 0.32	93.68 ± 0.49	81.94 ± 0.12	94.54 ± 0.22	96.47 ± 0.11
FFDNet	✓	77.76 ± 0.44	92.91 ± 0.18	94.54 ± 0.20	50.13 ± 0.37	68.26 ± 0.05	83.87 ± 0.46	76.84 ± 0.14	90.70 ± 0.39	96.33 ± 0.13
FFDNet	✗	78.54 ± 0.18	92.95 ± 0.43	96.44 ± 0.15	52.05 ± 0.35	69.53 ± 0.32	84.56 ± 0.27	77.41 ± 0.19	92.13 ± 0.21	96.40 ± 0.28
U_net	✓	72.46 ± 0.23	90.28 ± 0.34	95.31 ± 0.13	47.38 ± 0.47	59.51 ± 0.43	89.65 ± 0.27	75.40 ± 0.39	89.50 ± 0.19	95.58 ± 0.19
U_net	✗	73.60 ± 0.16	91.42 ± 0.38	96.37 ± 0.12	52.95 ± 0.33	64.50 ± 0.20	92.69 ± 0.46	77.18 ± 0.49	91.86 ± 0.31	96.42 ± 0.28
RND	✓	85.66 ± 0.14	90.20 ± 0.28	96.98 ± 0.44	53.74 ± 0.43	64.10 ± 0.31	94.27 ± 0.37	86.13 ± 0.27	91.24 ± 0.10	96.93 ± 0.38
RND	✗	85.67 ± 0.22	89.83 ± 0.19	94.28 ± 0.38	51.12 ± 0.36	62.25 ± 0.45	93.73 ± 0.23	83.49 ± 0.22	89.62 ± 0.27	94.41 ± 0.11

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

PDRNet: A Novel Physical Feature-Driven Residual Network for Motor Vibration Signal Denoising

Abstract

1. Introduction

2. Methodology

2.1. Simulation of Motor Vibration Signals

2.2. Proposed Architecture

2.3. Performance Evaluation Methodology

2.3.1. Signal Quality Assessment

2.3.2. Physical Feature Preservation Assessment

2.3.3. Evaluation of Diagnostic Performance

3. Results

3.1. Dataset and Experiment Setup

3.2. Experiments

3.2.1. Denoising Capability Assessment

3.2.2. Periodic Assessment of Noise-Reduced Signals

3.2.3. Bearing Fault Diagnosis

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics