1. Introduction
Rolling element bearings serve as critical components within rotating machinery, exerting significant influence over the overall performance of the machine. Modeling techniques are commonly employed to simulate the dynamic behavior of the bearings. Technically, mechanism-based and signal-based methods find extensive application for simulating the vibration response of rolling element bearings under both normal and fault operating conditions [
1,
2].
The mechanism-based model is formulated through the derivation of force and moment balance equations based on Hertz contact theory. In 1985, S. Fukata introduced an initial dynamic model featuring a two degrees of freedom (2-DoF) configuration, rooted in Hertz contact theory [
3]. The research undertaken by the Tiwari group expanded upon this concept, exploring the dynamics of balanced and unbalanced rotors supported by rolling element bearings, thereby characterizing them as nonlinear systems [
4,
5]. Subsequently, S. Sopanen presented a more comprehensive bearing dynamics model that encompasses the influence of various geometric defects, such as surface roughness, surface waviness, as well as partial and distributed defects, thereby enhancing the fidelity of the bearing model [
6,
7]. Nevertheless, it is worth noting that the mechanism-based model exhibits certain limitations. This endeavor necessitates a profound understanding of both kinematics and dynamics, resulting in the construction of a notably intricate model. Furthermore, this complexity is compounded by the presence of numerous unmeasured parameters that demand identification. This not only hinders modeling efficiency but also elevates the complexity of real-time parameter identification.
In contrast to the mechanism-based model, the signal-based model boasts a simpler structure and a reduced number of parameters requiring identification. The signal-based modeling places its emphasis on the representation of vibration signals. McFadden and Smith have previously developed a model to describe the high-frequency vibration generated by a single point defect on the inner race under radial load [
8], and subsequently extended this model to encompass the vibration produced by multiple point defects [
9]. Additionally, Mohammadi introduced a method for detecting multiple defects in bearings based on the time constant within the envelope detector. This technique is employed to discern the characteristic pattern of amplitude variations in defect frequency harmonics within the frequency domain [
10]. Various bearing fault characteristics can theoretically be simulated with the same model structure, while the signal-based model exhibits a marginally lower level of accuracy when compared to the mechanism-based model, it compensates with its remarkable modeling efficiency. This heightened efficiency enables the real-time identification of parameters.
As a burgeoning modeling approach, the digital twin possesses the capability to replicate a system using physical information and data gathered from sensors. This capability renders it suitable for application in signal-based modeling. Digital twin applications in bearings have yielded notable outcomes, encompassing three primary avenues: dynamics modeling, fault classification, and remaining useful life prediction. Qin [
11] integrated the back-propagation neural network with the digital twin framework to construct a bearing model capable of simulating the life cycle vibration signals. Addressing the challenge of limited data in bearing fault diagnosis, Zhang [
12] proposed an innovative digital-twin-driven approach featuring a transformer-based network and a selective adversarial strategy. This approach achieved an 80% accuracy in identifying various types of rolling bearing faults. Xiao [
13] introduced a pioneering joint transfer network designed for unsupervised bearing fault diagnosis. This network facilitates knowledge transfer from the simulation domain to the experimental domain. Feng [
14] devised an innovative digital-twin-enabled domain adversarial graph network (DTDAGN) that exclusively relies on the structural parameters of bearing dynamics. This novel approach is complemented by a transfer learning framework based on graph convolutional networks. Piltan [
15] harnessed machine learning and intelligent digital twins to classify bearing faults and determine crack sizes. Remarkably, accuracy rates of 99.5% and 99.6% were achieved, respectively. Zhang [
16] employed the integrated learning CatBoost method to construct a digital twin dataset and utilized fusion features for the life prediction of rolling bearings. Zhao [
17] introduced a hybrid approach that combines virtual and real aspects of a bearing digital twin. This approach is based on a modified CycleGAN (generative adversarial network) and Wasserstein distance. It demonstrates the ability to predict the life of rolling bearings with a minimal mean absolute error (MAE) of 0.13.
Updating the model and parameters represents a crucial aspect in the research of digital twin models for diagnosis and prognosis. In the realm of applying digital twin to diagnose faults in rotating machinery, Wang presented a fault diagnosis framework empowered by digital twin technology [
18]. In this framework, model updating is conceptualized as an optimization challenge, and it is addressed by employing the particle swarm optimization technique to achieve real-time updates. Aivaliotis [
19] employed periodic estimation of modeling parameters utilizing the nonlinear least squares method. Similarly, Xu [
20] introduced a novel machine learning technique known as deep transfer learning. This approach exhibits the capability to accurately forecast the evolution of performance during the initial stages of actual manufacturing and to adapt to new working conditions swiftly. Inspired by this intriguing concept, this paper contemplates the utilization of a reinforcement learning algorithm for model updating.
Drawing insights from the aforementioned literature, it becomes apparent that conventional physics-based modeling approaches can leverage intricate mechanisms and structures to establish highly precise models with commendable interpretability and generalizability. Nonetheless, the drawback of these methods lies in their inefficiency due to the requirement of identifying a substantial number of parameters. In contrast, models founded on signal response remedy this inefficiency by adopting a simpler structure and necessitating the identification of fewer parameters. Recognizing this advantage, this study introduces a lightweight bearing digital twin centered around the signal response. The primary objective of this digital twin is to strike a balance between modeling accuracy and efficiency. This is achieved by swiftly identifying a limited number of essential parameters while maintaining interpretability by applying straightforward mechanisms.
A bearing digital twin is created by fusing a signal-based model with reinforcement learning techniques. The signal-based model is specifically designed to discern and characterize the bearing’s vibration response, encompassing both normal operating conditions and fault scenarios. DDPG is employed to acquire proficiency in learning two crucial parameters integral to the signal-based model. This framework enables the continuous refinement and enhancement of the bearing digital twin by incorporating real-world measurement data.
The effectiveness of the digital twin is assessed using experimental data obtained from the three datasets. The envelope spectrum error is employed as a metric to compare the acceleration data derived from the physical test bench with that generated by the digital twin. The outcomes of these comparisons unequivocally affirm the viability and soundness of the proposed framework.
The subsequent sections of this paper are organized as follows:
Section 2 elucidates the signal-based model devised for capturing bearing vibration responses. In
Section 3, the intricate process of bearing digital twin construction is explored, leveraging the signal-based model and incorporating reinforcement learning techniques.
Section 4 provides comprehensive insights into the test bench setup and the acquisition of experimental data, which are subsequently used for validation.
Section 5 conducts a meticulous analysis of the obtained results. Ultimately, in
Section 6, this paper concludes by summarizing the research’s key findings and contributions.
2. Vibration Response Modeling for Bearing with Defects
In this section, a signal-based model will be constructed to analyze bearing responses under fault conditions, serving as the basis of the bearing digital twin. Broadly, the signal-based model integrates various functions to depict the acceleration response characteristics and dynamics of the bearing. These functions encompass load distribution, fault impulse decay, defect-induced vibration, defect localization, and defect width. Subsequently, the modeling theory and process will be introduced in the following sections.
2.1. Modeling for Load Distribution
The load directly determines the bearing’s vibration response. According to Stribeck, the load around the circumference of a rolling element bearing under radial load can be defined as [
21]:
where
a is the load proportional factor,
is the maximum load intensity,
is the load distribution factor,
is the angle between the defect and the line of application of load.
n is the bearing type factor, with
for ball bearings and
for roller bearings.
2.2. Modeling for Fault Impulse Decay
When a bearing exhibits a defect, a sequence of impulses arises as the rolling element traverses the affected area. These impulses undergo continuous oscillation and attenuation owing to the inherent spring and damping characteristics. To capture the vibration response characteristics of the bearing, an approach employing a unit fault impulse function along with a corresponding decay oscillation function is utilized.
2.2.1. Unit Fault Impulse
The assumptions of building unit fault impulse under a certain radial load are as follows. Firstly, it assumes that at
, the defect is positioned at
, and one of the rolling elements enters into the defect zone, which means an impulse occurs exactly at
. Secondly, it assumes that the impacts are produced under a unit load distributed uniformly around the bearing. Hence, the vibration produced by the defect can be modeled as an infinite series of impulses with equal amplitude. The unit fault impulse function
is given by:
where
stands for the Dirac delta unit impulse function,
represents the severity of the defect, and
is the time period between the fault impulses. The number of repeated cycles
k within one measurement sample can be derived as follows:
where
is the measurement duration of one sample, and
represents the defect frequency, which can be substituted by different defect frequencies, such as for the outer race (
), inner race (
), cage (
), and ball fault (
).
2.2.2. Decay Oscillation of Fault Impulse
The bearing can be conceptualized as a mass-spring-damping system, where the vibrational impulse resulting from an impact within the defect zone gradually diminishes over time. Hence, the oscillation of the fault impulse can be effectively modeled by the integration of an amplitude function
and an exponential decay function
. The amplitude function is given by a sinusoidal function, as shown in Equation (
4):
in which
represents the actual load applied onto the bearing, and
is the bearing system resonance frequency. The exponential function is given by
where
B is the decaying parameter. This determines the decay rate of the impulse and also represents the bearing fault dynamics. Finally, the decay oscillation
of fault impulse can be defined as the product of an amplitude function
and an exponential decay function
, as follows:
2.3. Modeling for Bearing Defect Vibration
By incorporating the effects of bearing load distribution
, unit fault impulse
, and decay oscillation of the fault impulse
and
, the vibration signal produced by bearing with defects can be reconstructed, as shown in Equation (
7):
Since
can be substituted by
, the response model can be rewritten, as shown in Equation (
8):
in which all the parameters can be expressed in the time domain. Besides the general model for vibration signals produced by bearing with defect, this study will also address the modeling of fault position and fault size.
2.4. Modeling for Defect Position
Regarding the modeling of fault positions, it is essential to address distinct scenarios for different bearing components. In the case of the outer ring, typically fixed within the housing, the fault position remains stationary at its initial location, as denoted by Equation (
9). Conversely, the inner race, which rotates with the shaft, introduces a variable fault position, as described in Equation (
10). Similarly, when a fault occurs on the rolling elements, their positions also change due to their rotation, albeit at different frequencies. This variation is expressed in Equation (
11), where
represents the shaft rotation frequency, and
denotes the revolution frequency of the rolling elements.
2.5. Modeling for Defect Length
When the ball goes through the entry and exit points, impulses will be generated. As a result, a time lag between these two impulses can be observed when the defect length is noticeable. In this subsection, the time lag will be modeled.
As shown in
Figure 1, the angle formed by entry point
A, the center of ball
O, and exit point
B are defined as
. The distance between entry point
A and exit point
B is the length of the defect and is approximately a straight line when
is very small. The angle between
and
is
. When the angle is small enough, the relationship between the angle, radius, and the length of defect can be formulated as Equation (
12):
Consequently, the period between the rolling balls entering and leaving the defect zone can be calculated as follows:
where
is the defect frequency of outer race. With the combination of Equations (
12) and (
13), the time lag of going through the defect can be generally identified as
where
L is the defect length,
is the defect frequency, and
r is the radius. For inner race fault,
r can be replaced by
and
by
. Likewise, for ball fault, the
r and
should be updated with
and
, respectively.
5. Results and Analysis
This section presents the training results of the bearing digital twin. The test data consist of measurement samples obtained from the CWRU dataset, encompassing defects on the outer ring, inner ring, and ball components. These data were acquired with a sampling frequency of 12,000 Hz from the drive end under a load of 745.7 W, while the motor operated at a speed of 1772 rpm.
The selected actions consist of the load proportional factor a and the decaying parameter B. The allowable ranges for these two parameters are defined through the output layer of the actor network, specifically set as [0.001, 0.1] and [100, 400], respectively. The training process commences with the dataset sample containing an outer ring defect. At each step, a random value within the same data sample is initiated, and subsequently, the error between the test data and their corresponding simulated data is calculated. Each episode encompasses 80 such steps. The resulting agent undergoes training across 120 episodes and can be effectively applied in diverse environments. To ascertain the agent’s ability to make accurate selections in alternative scenarios, the agent, initially trained on data involving outer ring faults, will be evaluated within environments featuring inner ring and ball defects.
Based on the training results involving states, which represent errors calculated using the cosine similarity cost function, the process entails selecting the smallest state value and subsequently identifying its corresponding action values. These action values are then integrated into the digital twin model.
Table 6 displays the step numbers associated with the minimum states and their corresponding values, which have been acquired through reinforcement learning. Concurrently,
Table 7 provides an overview of the corresponding actions.
Figure 9 depicts the learning trajectory of the load proportional factor, denoted as ’a,’ and the decaying parameter
B. Meanwhile,
Table 8 illustrates the minimum mean error achieved through digital twin training and the calculated mean error resulting from the trained actions. Notably, it is evident that following the identification of the load proportional factor
a and decaying parameter
B through the DDPG algorithm, the disparity between the simulation data and accurate measurements is greatly reduced.
Upon careful examination of
Table 6 and
Table 8, it becomes apparent that the errors obtained through digital twin training do not precisely align with the theoretical errors. This discrepancy may be attributed to two key factors. Firstly, an agent trained within a specific environment excels within that environment, resulting in the lowest error percentage when applied to outer ring defect data. Introducing a new environment may potentially impact the performance of the pre-trained agent. Secondly, the initial point of each episode and each step within an episode is characterized by random, unknown values. Consequently, the data employed for comparison and validation differ from those utilized in the DDPG process.
After training, the digital twin model can generate the acceleration response of the bearing under various fault conditions.
Figure 10 compares the acceleration data produced by the digital twin and that obtained from the CWRU test bench in both the time and frequency domains. In
Figure 10a, “test data” refers to the actual measurement data from the bearing test bench, while “simulated data” signifies the data generated by the digital twin. It is evident that the data generated by the digital twin closely resemble the actual measurement data in terms of overall trends and local features.
Figure 10b compares the frequency domain. Notably, the digital twin accurately captures the fault frequency of the actual measurement sample. However, significant disparities are observed in the time domain of simulated signals. Several factors may contribute to amplitude deviations. Firstly, noise in the actual measurements can influence the envelope spectrum, a factor not addressed in the digital twin model. Secondly, discrepancies in the identification of parameters
a and
B can lead to variations in response amplitude. Lastly, the simplified signal-based model may not fully represent the intricate dynamics of a real bearing test bench. These same observations and conclusions apply to the acceleration data generated by the digital twin for inner race faults and ball faults.
We conducted ablation experiments using the hyperparameters listed in
Table 9 as the baseline configuration. The impact of these hyperparameters on the digital twin can be assessed through the experimental outcomes presented in
Table 10. Among the four parameters, the spectrum error exhibits the highest sensitivity to variations in the discount factor, followed by the mini-batch size. Specifically, their respective average errors reach between 0.5904 and 0.5633, while the average errors observed in other models fall within the range of 0.5503–0.5576.
As the discount factor gradually approaches unity, the standard deviation (SD) of the spectrum error increases, aligning with the trend of error changes associated with the maximum step number. Notably, the samples generated by the baseline model exhibit minimal mean error and relatively low standard deviation compared to the actual samples. This finding substantiates the optimality of the parameters listed in
Table 9 for the proposed bearing digital twin.
The MFPT dataset and the PU dataset are used for comparative experiments. The results are shown in
Table 11. In the case of the MFPT dataset, the average error consistently remains below 0.9250. Notably, the SD of the error for the outer race is markedly lower than that for the inner race, a pattern analogous to the findings in the CWRU dataset. When employing the PU dataset, the average error consistently remains below 0.1741, surpassing the performance observed with the CWRU dataset. Moreover, the SD of the errors exhibits a comparable range. These results collectively indicate that the utilization of the PU and MFPT datasets leads to reductions in both the mean and standard deviation of errors. It is reasonable to infer that this improvement can be attributed to the higher sampling frequency, enabling a more faithful representation of the bearing’s true dynamic characteristics. This observation serves to validate the effectiveness of the proposed methodology across diverse bearing datasets.