1. Introduction
Bearings are critical components in EMUs, whose performance directly affects the operational safety and reliability of trains. In recent years, with the continuous expansion of EMU fleet size and increased operational frequency, the load and working intensity borne by bearings have significantly increased. According to fault statistics reported by Song et al. [
1] over five consecutive years, bearing failures in the transmission system are primarily distributed in axle box bearings and gearbox bearings, which together account for more than 90% of all faults. Although motor bearings account for only 6% of total failures, their abnormal vibrations can still directly impact the stability of the entire transmission system and its operational safety. Therefore, research on anomaly detection in motor bearing vibration signals is of great importance for early fault diagnosis and prevention. Analyzing non-real-time vibration data collected by ground monitoring systems can help identify potential anomalies, providing an effective means for early fault detection and condition trend assessment [
2,
3]. However, in engineering practice, anomaly samples representing typical fault states are extremely scarce or even entirely absent [
4,
5]. In this context, unsupervised anomaly detection methods based solely on normal operational data have become a crucial technical approach for achieving early fault identification in bearings.
Hiruta et al. [
6] propose an unsupervised learning method based on Gaussian Mixture Models. By grouping power spectral features and estimating model parameters using the Expectation-Maximization algorithm, it achieves detection of abnormal conditions in motor bearings. Evaluation via area under the curve (AUC) demonstrates the method’s effectiveness in distinguishing between normal and abnormal states. Several references [
7,
8,
9] have employed support vector machine techniques, combined with various signal processing and feature extraction methods, to achieve high-accuracy rolling bearing fault diagnosis. Other works [
10,
11,
12,
13] have focused on autoencoder-based methods. For example, Shen et al. [
10] proposed a dynamically loss-guided autoencoder (AE) to suppress randomness and drift in feature extraction, thus improving diagnostic performance. Diez et al. [
11] demonstrated that autoencoders outperform traditional principal component analysis (PCA) in detecting fully rotating and oscillating bearings, with superior sensitivity, accuracy, and recognition capabilities of early degradation. Dai et al. [
12] propose a dual-path self-supervised learning-based method for bearing anomaly detection. This approach achieves precise early detection of weak faults across operating conditions using only normal vibration data by capturing global features through contrastive learning and extracting local details via a reconstruction mechanism. Kang et al. [
13] propose a dual-input deep anomaly detection method that achieves early fault warning for rolling bearings using only normal vibration data through the integration of dual-channel high-frequency signal inputs and an experience replay mechanism. Liu et al. [
14] combined autoencoders with Wasserstein generative adversarial networks (WGANs) to efficiently detect faults in unlabeled data. König et al. [
15] pioneer an unsupervised approach using autoencoders to detect sliding bearing anomalies. Trained solely on normal-condition acoustic-emission features, the model identifies abnormalities through reconstruction error thresholds. Temporal prediction approaches have also been explored in the literature [
16,
17]. Lee et al. [
16] used CNN to extract spatial features, coupled with BiLSTM networks and regression layers to optimize temporal learning. Xu et al. [
17] developed a hybrid model that integrates long- and short-term memory (LSTM) networks, generative adversarial networks (GAN) and extreme gradient boost (XGBoost) to extract deep temporal characteristics and improve the efficiency of rolling bearing fault detection.
Furthermore, recent advancements have integrated VMD with deep learning to enhance feature extraction from noisy and non-stationary vibration signals. Habbouche et al. [
18] proposed a VMD-based notch filter to cancel the dominant mode in vibration signals, combined with a 1D-CNN, achieving high-accuracy bearing fault detection and diagnosis under noisy conditions on the CWRU dataset. Xiong et al. [
19] developed an optimized VMD using an enhanced Sparrow Search Algorithm (OCSSA) for parameter selection and integrated it with a CNN-BiLSTM network, significantly improving the diagnosis accuracy of rolling bearings under strong noise interference. Qiu et al. [
20] introduced a War Strategy Optimization (WSO) algorithm to adaptively optimize VMD parameters and proposed a ResNet-SWIN model, effectively enhancing the diagnosis capability for weak faults in rotation vector reducer crankshaft bearings under variable-speed conditions.
In the EMU domain, Ding [
21] proposes an impact-response-based convolutional sparse coding method that effectively detects anomalies by resolving nonlinear and non-stationary modulation issues in wheelset bearings using test-rig data. Deng et al. [
22] enhance detection accuracy for high-speed train axle bearings through Singular Value Decomposition (SVD) denoising combined with Adaptive Time-Reassignment Multisynchrosqueezing Transformer, utilizing operational parameters such as temperature and speed. For real-time anomaly detection, Jin et al. [
23] develop Time Density-Weighted Incremental Support Vector Data Description to improve the timeliness and accuracy of online roller bearing monitoring; Liu et al. [
24] construct a hybrid multi-layer LSTM and Isolation Forest model that reduces early warning response latency for high-speed EMU axle box bearings through temperature time-series prediction and deviation-index dynamic monitoring.
This study proposes an innovative anomaly detection method for the non-real-time vibration signals of the motor bearings in the CR400AF high-speed train. The proposed VMD-CBR-OCSVM model establishes a cohesive analytical pipeline. First, the raw vibration signal is fed into the VMD module, which adaptively decomposes the complex signal into a series of IMFs through power-spectrum-guided parameter optimization. These IMFs are then processed by a hybrid CNN-BiLSTM-ResNet (CBR) network for temporal prediction. By leveraging the combined strengths of convolution, bidirectional long short-term memory, and residual connections, the CBR network accurately forecasts the future values of each component and reconstructs the full signal by summing them. Subsequently, the residual between the predicted and original signals is computed, and its RMS value is derived as a discriminative feature representing state deviations. Finally, this RMS feature is input into OC-SVM, where unsupervised anomaly identification is achieved by comparing it against the decision boundary learned solely from normal data. The research framework is illustrated in
Figure 1.
Unlike previous research that relies on laboratory simulations or publicly available datasets, this integrated framework is validated using data collected from real operational environments. This ensures both a large dataset and high sampling frequency, while more accurately capturing the complexity and variability of actual operating conditions—providing essential, realistic data for training each stage of the proposed model. Laboratory simulation data are typically idealized, making it difficult to account for random factors and subtle faults present in real-world environments, while publicly available datasets often suffer from issues such as insufficient sample size and inadequate sampling frequency. By utilizing this unique dataset to validate a complete, end-to-end detection framework, this study not only fills a gap in this field but also provides solid technical support for the application of deep learning in anomaly detection for EMUs. The proposed VMD-CBR-OCSVM framework constitutes an integrated analytical pipeline, wherein each algorithmic component is meticulously engineered to address a specific challenge inherent in the unsupervised anomaly detection of non-stationary vibration signals. The functional roles and synergistic interactions of these components are delineated as follows:
VMD functions as an adaptive signal preprocessor. Its primary role is to mitigate the analytical challenges posed by strong non-stationarity and multi-component coupling in the raw vibration signals. By leveraging a power spectrum-guided mechanism to autonomously optimize its key parameters, VMD decomposes the complex source signal into a finite set of band-limited, quasi-orthogonal IMFs. This decomposition effectively disentangles the complex signal into simpler, more structured sub-components, thereby enhancing the signal-to-noise ratio and providing a refined input for the subsequent temporal modeling stage.
The CBR hybrid network serves as the core spatio-temporal predictor. This module is designed to learn the complex underlying patterns representative of normal bearing operation. The architecture synergistically combines the strengths of its constituent deep learning models: the CNN layers extract salient local spatial features and patterns from the input IMFs; the BiLSTM layers capture long-range temporal dependencies and contextual information both forwards and backwards in time; and the ResNet components, through skip connections, facilitate the training of a deeper network by alleviating the vanishing gradient problem and enabling robust feature fusion. The objective of this network is to learn a high-fidelity predictive model of normal system dynamics. Consequently, any significant deviation from the expected behavior, manifested as a substantial prediction error, serves as a potent indicator of an emerging anomaly.
The OC-SVM operates as the final unsupervised anomaly detector. This component is tasked with interpreting the prediction residuals generated by the CBR network. The RMS value of the prediction error sequence is calculated as a discriminative feature, chosen for its sensitivity to changes in signal energy. The OC-SVM is trained exclusively on the RMS values derived from normal operation data. Utilizing a Radial Basis Function (RBF) kernel, it learns a tight decision boundary that encapsulates the intrinsic distribution of normal condition features. Any subsequent observation that falls outside this learned boundary is classified as an anomaly, thus enabling fully unsupervised detection without any requirement for labeled fault data.
This integrated framework establishes a coherent workflow tailored for EMU motor bearing monitoring, combining adaptive signal decomposition, temporal modeling and statistical decision-making to form a robust solution for early fault detection in high-speed railway traction systems. The main contributions of this study are as follows:
A novel adaptive VMD preprocessing method: We introduce a power spectrum-guided mechanism that autonomously optimizes the critical decomposition parameters (mode number and penalty factor). This physics-informed approach eliminates the reliance on empirical tuning, effectively addressing the challenges of strong non-stationarity and low signal-to-noise ratio in raw vibration signals, thereby ensuring robust and reliable signal decomposition.
An advanced spatio-temporal prediction network: We design a hybrid CBR architecture as a core predictor. This network innovatively combines convolutional layers for spatial feature extraction, bidirectional LSTM layers for capturing long-range temporal dependencies, and residual connections to facilitate the training of a deep network and enable effective feature fusion. This design allows for high-fidelity modeling of the complex dynamics inherent in normal bearing vibration signals.
A practical unsupervised detection framework: We establish a complete anomaly-detection pipeline that requires only normal operational data for training. By using the RMS of the prediction errors from the CBR network as a sensitive feature—selected for its proven effectiveness as a stable time-domain statistic that is highly sensitive to changes in signal energy caused by bearing faults [
25,
26,
27]—and training an OC-SVM model, the framework learns the intrinsic distribution of normal conditions. This approach provides a viable and practical solution for real-world applications where labeled fault data are scarce or unavailable.
2. Power-Spectrum-Guided Variational Mode Decomposition Parameter Optimization
VMD is a variational optimization-based adaptive signal decomposition method with significant application value in non-stationary signal processing. High-speed-train motor bearing vibration signals are characterized by strong non-stationarity, multi-component coupling, and low signal-to-noise ratio. Traditional Empirical Mode Decomposition (EMD) methods exhibit inherent limitations, notably mode mixing, under these conditions. In contrast, VMD mitigates the noise sensitivity inherent in recursive decomposition via its frequency-domain energy-adaptive partition mechanism. The decomposition dimension is governed by the number of modes K, while the bandwidth penalty factor regulates the bandwidth of the extracted modes, establishing a dual-parameter optimization framework. Simultaneously, the iterative update strategy for the center frequencies effectively prevents spectral overlap. This combination provides a robust analytical approach for signal decoupling under complex operating conditions.
2.1. Determination of the Number of Modes
Within the VMD algorithm, the number of decomposition modes, denoted as K, determines the quantity of IMFs generated. Each IMF represents the frequency characteristics inherent in the original vibration signal. For performance degradation analysis of high-speed-train motor bearings using VMD, the selection of parameters K and is critical. An appropriate K value ensures accurate feature extraction, avoiding both over-decomposition and under-decomposition. Similarly, a suitable value facilitates accelerated convergence and enhanced noise resistance, thus enabling the effective detection of motor bearing abnormalities. Consequently, optimizing these two parameters significantly improves the accuracy of motor bearing anomaly detection under non-real-time conditions.
Significant peaks in the PSD typically reveal the main components of the signal. If the PSD, calculated using the Welch method [
28], displays multiple distinct peaks, it suggests that the signal may contain multiple different vibration sources. In such cases, selecting too small a value for “
K” may result in mode mixing, where different vibration sources are incorrectly decomposed into the same IMF. Conversely, if the PSD shows fewer peaks, selecting an excessively large “
K” may lead to redundant decomposition, generating unnecessary IMFs that often only contain noise or secondary components. By analyzing the distribution characteristics of the PSD, an appropriate value for “
K” can be chosen, ensuring sufficient decomposition of the signal while avoiding noise interference. The optimization procedure for the modal count
K comprises the following steps:
Step 1: Define the original signal. Assume that the vibration signal from the motor bearings of EMU, after denoising and smoothing, is
. This is a continuous signal in the time domain. However, in practical processing, it is typically represented by a discrete sampled signal. Given the sampling rate
, the sampling interval is
and the number of sampling points is
N. The discrete signal can be expressed as:
Step 2: Compute the power spectral density using Welch’s method.
In this context,
denotes the PSD;
f represents the frequency vector defined over
;
corresponds to the
m-th signal segment with length
;
signifies the window function of length
;
M indicates the total number of signal segments; and
specifies the length of each signal segment.
Step 3: Detect significant peaks. The peak detection formula is defined as:
where
is the threshold coefficient;
denotes the peak frequency satisfying the condition;
represents the number of qualified peak frequencies; and the initial number of modes
is estimated as:
Define the candidate range
:
Step 3: Compute modal separability and reconstruction error. For every
K within the candidate range, decompose the signal into
K modes
, with the optimization objective formulated as:
subject to the constraint:
where
denotes the unit impulse function,
j is the imaginary unit, ∗ represents the convolution operator,
signifies the time derivative,
is the center frequency of the
k-th IMF, and
indicates the
norm. To solve this constrained optimization problem, we introduce the Lagrange multiplier
and penalty factor
, constructing the augmented Lagrangian function as follows:
where
denotes the inner product. The modes
and center frequencies
are iteratively updated and finally output as their discrete representations
and
. For each mode
, its center frequency is computed as:
where
is the power spectral density of
. The modal separability
is defined as the minimum interval between adjacent center frequencies; a larger
indicates less mode mixing.
If
, then
. The reconstructed signal is:
The reconstruction error
is defined as the normalized mean square error between the original signal and the reconstructed signal; the smaller
, the more complete the original information preserved by the modal components.
Step 5: Optimize the objective function to obtain the optimal number of decomposition modes. Select the optimal
K for EMU motor bearing signals, defining the objective function as:
where
is the weight coefficient. Optimize the objective function to balance separability and error, preserving information of the original vibration signal without over-decomposition or under-decomposition. The optimal number of decomposition modes
is:
2.2. Determination Method for the Penalty Factor
In EMU motor bearing anomaly detection, the bandwidth of the power spectral density reflects the frequency dispersion degree of the signal. If the power spectral density exhibits broadband distribution, it indicates that the signal contains rich frequency components. In this case, a smaller value is recommended to relax the bandwidth constraints of IMFs, effectively capturing dynamic characteristics of broadband signals. If the power spectral density shows power concentrated in specific narrow frequency bands, potential local fault features may exist. Under such conditions, selecting a larger value tightens the bandwidth of IMFs, enhancing frequency locality of modes to improve identification accuracy of narrowband fault features. The determination steps for penalty factor are as follows: In EMU motor bearing anomaly detection, the bandwidth of the power spectral density reflects the frequency dispersion degree of the signal. If the power spectral density exhibits broadband distribution, it indicates that the signal contains rich frequency components. In this case, a smaller value is recommended to relax the bandwidth constraints of IMFs, effectively capturing dynamic characteristics of broadband signals. If the power spectral density shows power concentrated in specific narrow frequency bands, potential local fault features may exist. Under such conditions, selecting a larger value tightens the bandwidth of IMFs, enhancing frequency locality of modes to improve identification accuracy of narrowband fault features. The determination steps for penalty factor are as follows:
Step 1: After determining the optimal number of modes , systematically adjust the value of within the test range from 100 to 5000 with a step size of 100 to optimize anomalous signal extraction.
Step 2: For each value, perform VMD decomposition using Equations (6)–(8). Output modes .
Step 3: Calculate bandwidth stability. For each mode
, compute its bandwidth and evaluate stability. First, calculate the power spectral density for each mode:
Then calculate its peak frequency, half-power points, lower/upper cutoff frequencies, and bandwidth using Equations (16)–(20).
Finally, compute the bandwidth stability
:
where
prevents division by zero.
and
denote calculating the mean and standard deviation of different bandwidths, respectively. A higher
value indicates smaller variation in bandwidths
relative to their mean (i.e., smaller standard deviation). Therefore, a higher
value signifies better consistency or stability of bandwidths, meaning the bandwidth remains relatively unchanged under different measurement conditions. A lower
value indicates larger variation in bandwidths
relative to their mean (i.e., larger standard deviation). This shows significant bandwidth variation across conditions, lacking stability. Thus, a higher
value indicates better stability of modal bandwidths and more consistent decomposition results. Directly select the
that maximizes
, i.e.,
The overall procedure for the power-spectrum-guided variational mode decomposition parameter optimization method is described in Algorithm 1.
Algorithm 1. Adaptive VMD parameter optimization. |
1: | Input: Discrete vibration signal , sampling rate , window function , segment parameters , threshold , weight |
2: | Output: Optimal modes , optimal parameters , |
3: | Compute Welch power spectrum density using Equation (2) |
4: | Detect significant peaks: |
5: | |
6: | Define candidate range |
7: | for each do |
8: | Perform VMD decomposition with current K using Equations (6)–(8) |
9: | Calculate center frequencies using Equation (9) |
10: | Compute separation |
11: | Compute reconstruction |
12: | Calculate error |
13: | Evaluate |
14: | end for |
15: | |
16: | Define range |
17: | for each do |
18: | Perform VMD with and current |
19: | for each mode do |
20: | Compute using Equation (15) |
21: | Find , , using Equations (16)–(19) |
22: | Calculate bandwidth |
23: | end for |
24: | Compute stability |
25: | end for |
26: | |
27: | Perform final VMD with and to get |
28: | Return, , |
3. Time-Series Forecasting and Anomaly Detection Based on Deep Learning Networks
3.1. IMF Time-Series Forecasting Based on Deep Learning Networks
Since motor bearing vibration signals in high-speed train sets typically contain components of different frequencies, processing these signals requires models capable of distinguishing various frequency features. Traditional time-series models often struggle to directly handle such complex signals. By decomposing the signal using the VMD method, K intrinsic mode functions
are obtained, effectively extracting different frequency components from the signal. To improve prediction accuracy, we employ a deep learning architecture termed CBR, which integrates CNN, BiLSTM, and ResNet. The overall structure of this CBR time-series prediction framework is depicted in
Figure 2, and its processing workflow operates as follows:
The input layer of the network receives K intrinsic mode functions derived from VMD. First, the signals are processed through a residual convolutional encoding module composed of multiple residual blocks. This module progressively expands the input channels from 1 to 64 via convolutional operations, then increases to 128, further to 256, and finally to a 512-dimensional feature representation. Each residual block incorporates standardized convolution and ReLU activation functions, effectively mitigating the vanishing gradient problem through residual connections. During feature extraction, the temporal dimension is gradually compressed via pooling operations, significantly reducing computational complexity while preserving critical spatial features of the signals.
Subsequently, the network employs a bidirectional long short-term memory network for temporal modeling. This module comprises two BiLSTM layers: the first layer receives 512-dimensional input features and performs bidirectional temporal modeling with 256 hidden units; the second layer further refines the features into 128-dimensional hidden states. By processing sequences bidirectionally (forward and backward), the BiLSTM comprehensively captures long-term dependencies and complex dynamic characteristics in vibration signals, making it particularly suitable for identifying periodic patterns and anomalous evolution trends in motor bearing vibration signals.
During feature propagation, the network introduces an innovative residual connection mechanism. The original input signals undergo channel dimension adjustment via convolutional layers and are then aligned with features from the main path through adaptive pooling, enabling cross-layer feature fusion. This design significantly enhances the retention of original signal characteristics, effectively prevents information degradation in deep networks, and improves model robustness.
Finally, temporal features are mapped to predictions through a fully connected layer module. This module employs a two-stage linear transformation: first reducing the 128-dimensional features to 64 dimensions, then compressing them to a 1-dimensional output. ReLU activation functions are applied between stages to strengthen nonlinear expressive capabilities. The prediction results for each IMF are superimposed to reconstruct the complete vibration signal. By analyzing the deviation patterns between predicted and actual signals, precise monitoring of motor bearing operational states and early fault diagnosis are achieved.
This architecture synergizes CNN’s spatial feature extraction, BiLSTM’s temporal dynamic modeling, and ResNet’s deep optimization mechanisms. It significantly enhances the accuracy and robustness of vibration signal prediction while preserving critical signal characteristics.The detailed hyperparameter configurations and training details are provided in
Table 1.
3.2. Data Reconstruction and Anomaly Detection
3.2.1. Data Reconstruction
Based on the proposed prediction model, time-series forecasting is performed on the
K modal functions obtained from VMD, yielding their corresponding predicted values
. The data reconstruction process is achieved by linearly superimposing all predicted
, with the expression given as follows:
The predicted signal
is obtained by summing all the
components. This effectively preserves the multi-scale frequency characteristics of the original signal and integrates the dynamic properties of each component, thereby achieving prediction through multi-band information fusion.
3.2.2. Anomaly Detection
Given the practical challenge of acquiring labeled abnormal vibration signals in electric multiple-unit motors, which aligns with the objective of detecting unknown anomalies, this study employs an unsupervised approach by training exclusively on normal signals. This strategy ensures the model learns a robust representation of normal operational patterns without being biased by specific, known faults, thereby enhancing its sensitivity to general deviations. A reserved independent test dataset (containing both normal and abnormal samples) is used to evaluate the generalization performance of the VMD-CNN-BiLSTM-ResNet hybrid model in distinguishing anomalous states. (This process refers to decomposing signals via VMD and applying the CBR time-series prediction framework).
The processing workflow involves first calculating residuals between actual and predicted values of the original vibration signals, then capturing local degradation characteristics, segmenting the residual signals into multiple sliding windows of length
, subsequently computing the RMS value for each window, and ultimately training to establish a binary classifier through the OC-SVM algorithm based on these RMS values, where each window’s RMS value is treated as a single training sample for OC-SVM. During the testing phase, anomaly detection is implemented via this trained binary classifier, where the RMS is calculated as:
This feature quantifies the cumulative deviation of the predictive model in the time domain, effectively characterizing bearing state degradation.
The core of the algorithm lies in solving for an optimal separating hyperplane in the feature space, formulated as the optimization problem:
subject to:
Here,
w represents the normal vector of the hyperplane;
denotes the slack variable;
indicates the offset of the hyperplane;
n signifies the number of training samples;
corresponds to the RMS feature value of the
i-th sample; and
represents a nonlinear mapping implemented through the RBF kernel
, where
automatically adapts to the data distribution. The hyperparameter
, which controls the model’s sensitivity to outliers, was set to 0.05. This value was selected through a cross-validation process designed to minimize false alarms, with the detailed empirical justification provided in
Section 4.4.1.
The decision function
defines the signed distance in the feature space. The output threshold
is dynamically determined by the
-quantile of the decision values from the training data:
The final state determination rule is given by:
In this way, it can effectively reduce false positives while maintaining detection sensitivity, thereby enhancing the accuracy and practical applicability of abnormal motor bearing detection in EMUs.
5. Conclusions and Future Work
This paper proposed an integrated vibration anomaly detection method for motor bearings in EMU by fusing VMD with deep learning. The main findings and contributions of this study are three-fold:
First, a power-spectrum-guided adaptive VMD parameter optimization mechanism was established, which effectively overcame the mode-mixing problem and enabled adaptive decomposition of highly non-stationary vibration signals.
Second, a hybrid CNN-BiLSTM-ResNet model was developed, which achieved near-perfect prediction performance (, ) on normal operational data by leveraging multi-scale feature extraction and temporal dependency modeling.
Third, an unsupervised anomaly detection framework based on prediction-error RMS and OC-SVM was constructed. Most significantly, this framework demonstrated superior performance on real-world operational data, achieving an accuracy of 95.2% and an F1-score of 0.909 in detecting anomalous states. This represents a substantial improvement (e.g., a 44.4% relative increase in F1-score) over conventional methods like Isolation Forest, highlighting its practical efficacy. Despite the demonstrated effectiveness of the RMS feature for trend-based anomaly detection in this non-real-time monitoring context, it is noteworthy that feature engineering offers avenues for further enhancement. The RMS value was chosen for its robustness in capturing energy-based deviations indicative of gradual degradation. However, other feature domains, such as kurtosis for impulsivity or entropy for signal complexity, could offer complementary benefits for detecting a wider variety of fault types, particularly incipient localized faults.
The significance of these findings lies in providing a reliable, annotation-free, technical solution for early fault detection and warning in EMU motor bearings under complex operational environments. This approach effectively addresses the industrial challenge of scarce fault data and enables trend analysis using readily available normal vibration data.
Future research will focus on two primary directions to translate these findings into practical applications:
First, methodological advancements through multi-source data fusion (e.g., integrating temperature and rotational speed) employing ensemble and transfer learning to further enhance detection precision and robustness. Building on the feature discussion above, a core aspect will be the exploration of multi-feature fusion strategies within this framework, combining time-domain, frequency-domain, and information-theoretic features to further improve detection sensitivity, specificity, and overall robustness.
Second, we will focus on the development and deployment of a real-time online monitoring and early-warning system for EMU motor bearing health management. As a direct application of this framework, the system will incorporate multi-dimensional features such as RMS, kurtosis, and entropy to enhance the sensitivity and reliability of condition monitoring, thereby ensuring safe and stable train operations.