Next Article in Journal
Optimizing Multitenancy: Adaptive Resource Allocation in Serverless Cloud Environments Using Reinforcement Learning
Previous Article in Journal
Privacy-Preserving Byzantine-Tolerant Federated Learning Scheme in Vehicular Networks
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Determination of the Condition of Railway Rolling Stock Using Automatic Classifiers

1
MAQLAB Research Group, Department of Mechanical Engineering, Universidad Carlos III de Madrid, Av. de la Universidad, 30, 28911 Leganés, Spain
2
MAQLAB Research Group, Department of Mechanics, Universidad Nacional de Educación a Distancia, C/Juan del Rosal, 12, 28040 Madrid, Spain
*
Author to whom correspondence should be addressed.
Electronics 2025, 14(15), 3006; https://doi.org/10.3390/electronics14153006
Submission received: 12 June 2025 / Revised: 11 July 2025 / Accepted: 15 July 2025 / Published: 28 July 2025

Abstract

Efficient maintenance is paramount for rail transport systems to avoid catastrophic accidents. Therefore, a method that enables the early detection of defects in critical components is crucial for increasing the availability of rolling stock and reducing maintenance costs. This work’s main contribution is the proposal of a methodology for analyzing vibration signals. The vibration signals, obtained from a bogie axle on a test bench, are decomposed into intrinsic functions, to which classical signal processing techniques are then applied. Finally, decision trees are employed to characterize the axle’s state, yielding excellent results.

1. Introduction

It is an acknowledged fact within the field of rail operation that “the economic efficiency and competitiveness of railway transport depends on the safety, availability and maintenance” [1], so researchers, manufacturers, and operators keep in mind that a failure of one the structural elements that suffers high stress such as bogies or wheelsets may result in catastrophic effects both for rolling stock and people. The Eschede rail accident [2], a serious accident in Germany in 1998 that went to court for negligent homicide, was a case in point. The accident was caused by a broken wheel, and both the wheel manufacturer and the railway company were charged. Considering the high standards of maintenance, the authors aim to integrate new procedures and equipment guided by Industry 4.0 [3], in order to increase both security and maintenance efficiency.
Advancements in computing technology have enabled the emulation of physical phenomena, as initially proposed by Feynman in 1982 [4], as well as the reproduction of elements and entities in a completely digital way. Examples are the digital twins used to replicate and test blocks and mechanical assemblies such as bearings [5] or to monitor fruit quality evolution [6].
Advances in computer hardware and software have made it possible to process and store large data more efficiently and quickly. This has also made it easier to handle large datasets in a shorter period of time [7,8], allowing complex calculations to be executed with decreasing time consumption. This makes Artificial Intelligence (AI) viable. That is the name given to methods created to copy how the human brain functions [9], describing the creation of artificial models and computational algorithms that resemble human learning and reproduce human skills.

1.1. Machine Learning

In 1955, Artificial Intelligence (AI) was first introduced as a concept by McCarthy et al. [10], who stated that “If a machine can do a job, then an automatic calculator can be programmed to simulate the machine”, so the basic concepts of AI, such as the use of language and neural networks, were introduced for the first time for computer science.
Once the concept of AI has been defined, its functioning is described [11], and both the objectives and the methodologies used to achieve them are identified, from problem solving to communication, so that the concept of learning is introduced as part of any developed and recognized intelligence, which then describes the concept of machine learning.
Having described machine learning as a concept and as a method, the concept of learning is considered to take into account the fact that machine learning often requires large amounts of data and computational power to achieve impressive results in areas such as autonomous cars and image recognition [12]. However, this approach has its limitations, as it lacks the ability to reason logically and identify causal relationships. The development of machine learning systems and methodologies usually starts with data collection and labeling. The next steps are analysis and algorithm selection until a trained model can be developed and implemented [13].
It is necessary to address the three main points to be taken into account, machine learning, machine teaching, and how the human being is involved in the whole process; therefore, whether the human being is involved in the machine learning process [14] is considered to define different approaches [12], such as active learning [15], in which the system has full control over teaching and learning processes, interactive machine learning [16], in which humans interactively deliver information in a more focused, frequent and incremental way compared to traditional machine learning, and a combination of machine learning and machine teaching [13].

1.2. Vibration Analysis: Condition Monitoring Tool

Vibration signal analysis is one of the most widely used techniques for inspecting mechanical components under operational conditions [17], from the first models [18], as it allows testing in a wide range of elements and situations [19], such as railway infrastructure, general-purpose equipment, or rolling stock.
Many authors studying vibration signals evolution for condition monitoring apply their techniques to rolling elements to analyze bearing faults [20]. Some of them have dealt with vibration analysis performed on railway systems [21]; others focus on ground or track disturbance induced by rolling stock traffic [22].
Vibration signal analysis is a common testing ground for a wide range of mechanical analyses used to determine bearing failures [23], and other authors perform analysis for axle components [24,25].

1.3. Empirical Mode Decomposition (EMD)

In light of the necessity to develop effective diagnostic techniques, considerable reliance has been placed on signal modeling [25]; thus, signal analytical models are considered the optimal choice compared to full dynamic simulations [26]. It is therefore imperative to develop models that have been empirically tested with real signals. There is limited knowledge about the quantitative effect of their parameters.
The use of envelope analysis for diagnostic purposes has been a customary practice for a long time, owing to the fact that the spectrum of the raw signal frequently exhibits an insufficiency of diagnostic information concerning faults [27]; consequently, novel techniques for enhancing the efficiency of analysis have been developed.
Empirical Mode Decomposition (EMD) was first proposed by Huang et al. in 1998 [28]. The methodology employed is to use envelopes delimited by local maxima and minima, and these envelopes are connected by a cubic spline line. The local maxima are designated as the upper envelope, while the local minima are assigned to the lower envelope. This ensures that all data is enclosed between these envelopes. Subsequent to the definition of the envelopes, the next step is to determine the conditions to develop the so-called intrinsic mode function:
“An intrinsic mode function (IMF) is a function that satisfies two conditions: (1) in the whole data set, the number of extrema and the number of zero crossings must either equal or differ at most by one; and (2) at any point, the mean value of the envelope defined by the local maxima and the envelope defined by the local minima is zero.”
The aim of the EMD technique is to decompose the vibration signals into subsignals (IMFs). Theoretically, each IMF is directly related to the sources that generated it [28]. The purpose of the application of the EMD technique in this study is to ease the identification of the condition of rolling stock, the failure of which can lead to catastrophic accidents with a high probability of human casualties. This is done by decomposing the vibration signals into several IMFs, which can then be measured to obtain a reliable estimate. A method called “bivariate modal decomposition” has been developed [29]. The use of this instrument in this project enables the decomposition of complex signals, thus providing a sophisticated analytical framework.
Since its introduction in 1998, the EMD technique has undergone significant development in a number of disciplines, with widespread use in a variety of applications, especially in the field of prediction [30], although its utilization is extensive in the identification of faults [31], satellite navigation [32], river stages [33], biomedicine [34], or dam deformation [35].
As mentioned above, the EMD technique is based on signal decomposition [36], which allows features to be extracted for analysis [22]. Once a guide to bearing diagnostics has been established [27], the EMD technique has been widely used, for example, in the analysis of wind turbine generator bearings [37].
The complete EMD process is depicted in Figure 1. The upper figure shows the original signal, the middle figure represents the location of the maxima of both the upper (blue) and lower (red) envelopes, and the lower figure represents the first IMF.

1.4. Decision Trees

Machine learning models can be classified into four categories: supervised, semi-supervised, unsupervised, and reinforcement learning, each of which includes several subcategories [38], and among the supervised models are those used for classification and regression [39].
Decision trees have been widely used for classification and regression analysis. While these models are effective, they often suffer from over-complexity and over-fitting and therefore need some degree of simplification [40].
As examples, among the uses of decision trees is fault diagnosis using discrete wavelets [41], and the use of decision trees for fault diagnosis using discrete wavelets and classification combined [42].
The extent to which decision trees are still suitable for classification purposes has been explored, and the feasibility of their use for railway rolling stock characterization has been tested in recent years; the present work is an adaptation of those results combined with the EMD methodology [43].

1.5. Approaches and Challenges

Regarding the other approaches that have been taken to achieve the goal of optimizing the analysis of incoming data, Gafni et al. [44] identify federated learning as a tool for the future, for, among other purposes, using information received from different sources, which, in many cases, may not be fully reliable and/or significant, to a certain extent; Hu et al. [45] address complexity not only with different data sources, but also with different research objectives and data improvement. The amount of data gives rise to a number of challenges, but it also creates an area of interest in the form of decentralized data, as has been pointed out by Yang et al. [46], but the quality of the information, and thus the effectiveness of the analysis, is influenced by the use of data from different sources and the methods of data collection.

1.6. Aim of the Study

The present work explores the use of decision trees combined with EMD as an alternative method to classify and categorize the condition of railway rolling stock. This is achieved by analyzing the different stages of component deterioration, with the aim of determining the initial stage of mechanical fault and improving overall predictive maintenance, which offers results comparable to those obtained by Ruiz et al. [47], whose work was based on the use of Support Vector Machine, and Bustos et al. [48], who used EMD and K-Nearest Neighbor to perform classification under similar conditions.
The methodology applied in this work is frequently used in bearing condition analysis, as is demonstrated by the extensive literature referred to in this introduction. The literature mainly refers to the methodology used for rolling bearings and has been presented more broadly in terms of both elapsed time and the range of methodologies.

1.7. Structure of the Paper

The structure of the document is as follows: The following section details the methodology and techniques applied throughout the research process. Section 3 details the experimental system, the test bed, and the signal processing. Section 4 shows the results of the model training. Section 5 contains the conclusions of this work, as well as suggestions for future research.

2. Proposed Methodology

This section describes the methodology and techniques applied in the treatment of vibration signals in detail. The analysis process starts from two considerations or starting points: firstly, the definition of the bogie under study and its characteristics as a specific mechanical system to be analyzed, and secondly, the working conditions during the study process and the description of the test bench. The second starting point is the measurement process, which covers the equipment used, the sensors used, their description and positioning, and the speed at which the tests were performed.
The next stage of the process was the extraction of the vibration data and its conversion into a MATLAB® R2024b format for signal processing. The selection and extraction of the raw data from the database were performed according to the measurement conditions. The vibration data in MATLAB® format was pre-processed for each accelerometer. Finally, the consolidated data were processed in both the time and frequency domains, and the results obtained were analyzed.
Once the collection of vibration signals was completed and prior to their conversion, the EMD technique was applied to each signal. The main aim of this technique is to identify the intrinsic oscillatory modes of any given dynamic signal by its characteristic time scales. The data was then decomposed according to these modes. This is an iterative method for decomposing a dynamic signal into a set of IMFs based on three assumptions [28]:
  • For any analyzed signal, the number of extrema is at least two: one maximum and one minimum.
  • The characteristic time scale is defined by the time lapse between the extrema.
  • If the given signal does not have extrema but does have inflection points, the data can be differentiated to disclose the extrema.
It is important to note that the execution of the algorithm results in the extraction of two loops. The first loop is known as the sieving process, which is executed iteratively until the extracted signal meets the two IMF conditions. The function of the outer loop is to extract all the IMFs from the input signal. The algorithm works as follows:
  • For any given signal x(t), identify all extrema.
  • Connect all related extrema with a cubic spline curve, so an upper envelope emax(t) is obtained by connecting all maxima and a lower envelope emin(t) by connecting all minima. Computing the mean between envelopes results in m1(t), as shown in Equation (1):
m 1 t = e m a x t + e m i n t 2
3.
To obtain the first IMF candidate h1(t), the mean is subtracted from the input signal, as shown in Equation (2):
h 1 t = x t m 1 ( t )
4.
When the obtained h1(t) does not meet IMF conditions, h1(t) is taken as the input data, and Steps 1–3 are repeated until h1n(t) meets IMF conditions, as shown in Equation (3):
h 1 n t = h 1 n 1 t m 1 ( t )
5.
Then h1n(t) is renamed c1, becoming the first IMF component.
6.
The residue is calculated, as shown in Equation (4):
r 1 t = x t c 1 ( t )
7.
The residue is used as the new input signal, and steps 1–6 are repeated until no more IMFs can be extracted, following the procedure specified by Equations (5) and (6):
c k t = r k 1 t m k t
r k t = r k 1 t c k t r 0 t = x t
Finally, the input signal given x(t) is decomposed into a set of NE components ck(t) that are the IMFs and residue as shown in Equation (7):
x t =   k = 1 N E c k t + r N ( t )
As said in the Introduction, in this research, the bivariate EMD algorithm developed by Rilling et al. [29] is used to obtain IMFs.
Once IMFs are obtained from all input signals, a set of EMD-based parameters is computed, so for any extracted IMF, the average PSD and spectral power of each accelerometer are calculated, using Equation (8):
S C k f = t N X C k ( f ) 2 ;   P S D C k = S C k ( f )
where ∆t is the sample time, N is the number of data points in the signal, X C k (f) is the Fourier transform of the IMF c k t , and P S D C k   or   S C k is the power spectral density of IMF c k t .
Equation (9) is used to compute the average PSD of every set of IMFs:
P S D ¯ C k = j = 1 n s P S D j C k n s
where P S D ¯ C k is the average PSD of the processed IMF, P S D C k is the power spectral density of the IMF ck(t) extracted from the jth signal x(t), and ns is the number of recorded signals.
Then, the spectral or signal power is computed. According to [17], the signal power P equals the integral (or sum, for the discrete case) over the frequency range of the general distribution function S or PSD, as shown in Equations (10) and (11):
P = f = 0 N 1 S f f = f = 0 N / 2 S o n e f f
S o n e f = 2 S f f = 1 N 2 1 S f f = 0 ,   f = N 2
where ∆f is the frequency interval, N is number of data points in the signal, and S(f) is the signal PSD.

3. Experimental System

To perform the project, a series of vibration signals were obtained from a railway axle, with the following conditions: (i) no faults and (ii) three levels of faults. Subsequent to the testing of the system, the vibration signals were obtained and stored for further processing using two common analysis techniques: signal envelope and power spectral density (PSD). Afterwards, the signal characteristics in the time and frequency domains were extracted using MATLAB® R2024b software.
This project research is based on Empirical Mode Decomposition (EMD), so each vibration signal is decomposed into its significant components, the intrinsic mode functions (IMFs), using MATLAB®, and for each IMF, for both the time and frequency domains, its characteristics were extracted.
Finally, decision tree algorithms were tested, with the aforementioned features being utilized as predictors. The subsequent details provide a comprehensive overview of this process.

3.1. Obtaining Vibration Signals: Equipment

A test bench for bogies was utilized in order to acquire vibration signals (see Figure 2). In this instance, a type Y-21 bogie was subjected to testing, with the rear axle disregarded in the context of travel direction.
The following points constitute the primary aspects of the test bench:
The loading system is operated by hydraulic cylinders. These are installed on the bogie over the front axle, which is the moving axle. The system is used to assess the movement of the axle using two rollers positioned on each of the axle wheels.
The arrangement of three uniaxial accelerometers was implemented to facilitate the measurement of acceleration in the three spatial directions: longitudinal, axial, and vertical. These components were installed on the axle box cover in order to acquire the vibration signals from the bearing and the wheelset.
The scheme shown in Figure 3 represents the directions of space in which the sensors are arranged and the axle defect determined by the depth e in mm.

3.2. Obtaining Vibration Signals: Experiment Procedures

The sensors used in this work were CMSS-RAIL-9100 piezoelectric accelerometers manufactured by SKF (AB SKF, Göteborg, Sweden) for industrial use, which are insensitive to electromagnetic fields and radiation. The measurements could be enabled under harsh conditions, and accelerometer details are listed below (see Table 1).
Four type of test conditions, which were characterized by the parameter e, failure depth in mm, were arranged (see Figure 3, right scheme): undamaged axle (D0, 0.0 mm), damaged Axle Defect 1 (D1, 5.7 mm), Defect 2 (D2, 10.9 mm) and Defect 3 (D3, 15.0 mm).
Tests were performed at 50 km/h and 10 Tm (50% of maximum axle load), the sampling frequency was set at 12,800 Hz, and the sampling time was set at 1.28 s, with 16,384 data points per measure.

3.3. Testing Development

The database located in the MAQLAB Laboratory was managed by condition monitoring software that stored the acceleration and speed data received from the measurement system, as well as the date, the accelerometer identifier, and other useful information, in the database.
Once vibration signals were obtained, they were processed using the EMD technique so that each signal was decomposed into its IMFs using MATLAB®, and then the following parameters were extracted for each IMF:
Time domain:
  • RMS (RMS)
  • Peak value (PK)
  • Kurtosis (KUR)
  • Crest factor (FCR)
  • Standard deviation (DST)
  • Skewness (SKW)
  • Shape factor (SFR)
Frequency domain and PSD:
  • RMS (RMS-f)
  • Peak value (PF-f)
  • Peak value frequency (FR)
  • Spectral power (P)
One of the signals used in this work is represented in the time domain in Figure 4.

3.4. Signal Processing

As outlined in Section 3.3, the first step was to obtain the primary hyperparameters for each IMF. These parameters were subsequently used as predictors to train models using decision tree algorithms.
As described in Section 2, the processing of the vibration signals used the EMD technique, which ensures the decomposition of each signal into its constituent IMFs employing MATLAB® software.
This process was completed by obtaining up to nine IMFs for each signal. It should be noted that some signals can be further decomposed into ten IMFs; however, for the purposes of this study, nine IMFs were used to preserve a homogeneous analysis.
The process of signal analysis is described below:
  • For any given signal x(t), all IMFs are obtained.
  • Up to 9 IMFs are obtained for every signal.
  • The following groups are considered according to the IMF amount used in each case: 3 IMF, 4 IMF, 5 IMF, 6 IMF, 7 IMF, 8 IMF, and 9 IMF.
  • For each IMF, the hyperparameters referred to were obtained.
  • For each group, the model was trained.
During the process of EMD decomposing the signals into their IMF components, using MATLAB® software, it was found that the maximum number of components common to all signals was nine, although some signals reached up to ten components. In this study, the first nine IMFs of all signals were considered to homogenize the study.
Figure 5 below illustrates a single signal, obtained from the time domain and decomposed into its nine IMFs, as previously described.
Decision trees were used in the present work using MATLAB®, which provides three types, Fine Tree, Medium Tree, and Coarse Tree. Fine Tree was selected with characteristics described below:
  • Gini index, to ensure that only one class exists on any node on every split.
  • Up to 100 divisions.
  • Cross validation: all gathered data were distributed over five packs, one used for training, one for analysis, and one for validation.
The training process was as follows:
  • The model was trained for every hyperparameter. The obtained results were registered.
  • The hyperparameters were combined in groups of two, three, and so on, until all hyperparameters were combined for each class, time and frequency, at each end of the axle.
  • Different combinations of hyperparameters belonging to the two classes, time and frequency, were achieved by varying the composition of the constituent groups of hyperparameters undergoing training. This process was continued until all the hyperparameters of the two classes were used for each end of the axle.
  • Steps 1 to 3 were performed by taking all the hyperparameters for the axis as a whole without distinguishing the ends.
After the generation of each training-related confusion matrix and ROC curve, subsequent analytical procedures could be initiated.
In this paper, we have considered the combinations of hyperparameters shown, selected because they have given significant results. These combinations are presented for comparison purposes, with the aim of assessing the effectiveness of the method as a whole.

4. Results

As outlined in the preceding sections, the algorithms have been subjected to rigorous testing. The effectiveness of each test has been documented, providing a comprehensive evaluation of the performance of the algorithms, where for each hyperparameter used as predictor, the model is tested for both axle ends, and each of the aforementioned hyperparameters is utilized as a predictor, thereby enabling the model to undergo testing for both the left and right axle ends, in accordance with the direction of travel. This comprehensive evaluation covers a total number of IMFs ranging from three to nine, ensuring a thorough assessment of the model’s capabilities. Any result is an effectiveness percentage.
The first tests have been performed with a single hyperparameter as a predictor. So far, no substantial results have been obtained, as most of the values do not reach 80% efficiency, both in time and frequency.
The initial approximation was conducted by integrating the hyperparameters that functioned as predictors and attained the best efficiency in each instance.
The first combination was performed by taking the predictors with the least effective results in the time domain, which are peak value (PK), skewness (SKW), and shape factor (SFR).
The findings from these experiments have not resulted in a substantial improvement in the efficiency of the model. Notably, the attained efficiency has remained below 70%, indicating a clear deficiency in the process.
The subsequent stage of the experiment was the combination of hyperparameters. The use of these metrics as predictors is indicated in the following efficiency results. These results were obtained by taking the RMS and kurtosis of those combinations that have been referred to. It should be noted that this was done individually for each combination.
As shown in Table 2, a range of percentages has been obtained for the various numbers of IMFs (three to nine) on either side of the bogie (left or right), utilizing two distinct combinations of hyperparameters. Each combination corresponds to the following hyperparameters:
(a)
RMS (RMS) and kurtosis (KUR)
(b)
RMS (RMS), kurtosis (KUR), crest factor (FCR), and standard deviation (DST).
  • For RMS and kurtosis combined, the results are higher for higher numbers of IMFs used, mostly over 80% overall efficiency, although still inconclusive.
  • For RMS, kurtosis, crest factor, and standard deviation, the results are in the 70% range mostly, although some are just below 70%.
The subsequent execution of the testing procedure involves the integration of the time and frequency hyperparameters as predictors, considering these combinations at both axle endings.
Following the structure set out in Table 2, Table 3 shows the results from combinations of parameters from both the time and frequency domains:
(c)
RMS, kurtosis, standard deviation, RMS-f, peak value frequency (FR), and spectral Power (P).
(d)
RMS, kurtosis, crest factor, standard deviation, and all considered PSD parameters.
The results obtained demonstrate an overall effectiveness of more than 80%, with only a few exceptions where the value is slightly below this threshold; however, taken as a whole, they do not constitute significantly high values that can be considered acceptable for practical use.
The ensuing phases of the experiment were performed with all the hyperparameters hitherto considered in this work serving as predictors, initially for each end of the axle, both in the time and the frequency domains; the obtained results are shown in Table 4, where (t) represents hyperparameters in time domain, (fr) in the frequency domain, and (t + fr) all combined.
  • The overall efficiency using all hyperparameters considered in this work ranges from 81.3% to as high as over 96%, so the results obtained in these trials are significantly superior; however, it should be noted that these results do not constitute conclusive evidence.
It is important to note at this point that the initial observation is considered in conjunction with the results obtained for the case of five IMFs. This is carried out in order to observe the confusion matrices that have been obtained at both ends: the model accuracy at the left axle end is 96.2% and at the right axle end is 85.4%; both confusion matrices are shown below (see Figure 6). As it was previously mentioned in Section 3.2, four conditions are defined: undamaged axle (D0, 0.0 mm), damaged Axle Defect 1 (D1, 5.7 mm), damaged Axle Defect 2 (D2, 10.9 mm), and damaged Axle Defect 3 (D3, 15.0 mm).
Subsequently, a series of tests were conducted, incorporating a range of combinations of hyperparameters as predictors. However, it is important to note that the distinction between the axle ends is not recognized during this process. Therefore, the combination of predictors was derived by merging the values from both ends of the axle (see Table 5):
(I)
RMS (RMS), kurtosis (KUR), crest factor (FCR), and standard deviation (DST);
(II)
RMS (RMS), kurtosis (KUR), standard deviation (DST), RMS (PSD), peak value frequency (FR), and power (P);
(III)
RMS (RMS), kurtosis (KUR), crest factor (FCR), and standard deviation (DST) plus all PSD parameters.
  • Considering RMS, kurtosis, crest factor, and standard deviation, efficiency improves with an increase in the number of IMFs considered for the study, reaching 99% efficiency.
  • The subsequent case examined in this study employs the root mean square (RMS) in both the time and frequency domains, alongside kurtosis, standard deviation, peak value frequency, and power. The outcomes of this analysis are analogous to those previously obtained.
  • It is evident from the findings of this study that the results obtained from the hyperparameters in the frequency domain, which include RMS, kurtosis, crest factor, and standard deviation, are marginally higher. The efficiency levels recorded were 88.9% at the lowest for three IMFs, with all others exceeding 90%.
The final stage of the model training process involves the consideration of all the hyperparameters utilized in this study. In this stage, values of both axle ends are taken into account, and results are gathered and displayed in Table 6 below.
Subsequent to the conclusion of the final test, it became apparent that the outcomes demonstrated exceedingly elevated figures. As demonstrated in Figure 1, at the lowest number of IMFs, an efficiency approaching 90% was attained, which signifies a high success rate, as illustrated in Figure 7 below.
For comparison purposes, Figure 8 shows confusion matrices for the cases of five IMFs and nine IMFs, which both hold very similar accuracy.
The confusion matrices shown in Figure 8 above graphically show the approximation of the model’s efficiency for both cases considered (five and nine IMFs). The values of precision (see Figure 9) and accuracy (see Figure 10) for both cases are shown below so that both models can be compared. The ROC curves associated with both cases are also presented.
For the purpose of comparison, the precision and accuracy values for the analyses conducted for five and nine IMFs are presented. On the one hand, it is evident that the precision of the model exceeds 99% in both cases, with a marginal increase observed in the case of nine IMFs.
Finally, the ROC curves associated with each of the two cases compared, for five and nine IMFs, are plotted, showing that, in both cases, the results are close to 100% overall effectiveness (see Figure 11).

5. Conclusions

In an analysis of the results obtained from training decision tree algorithms using vibration signals decomposed by the Empirical Mode Decomposition (EMD) technique into the so-called intrinsic mode function (IMF), it can be concluded that the resulting technique is effective enough to characterize rolling stock equipment, such as axles, which were used for testing in this work.
The first approach was carried out considering the first three IMFs so that a comprehensive analysis could be performed; using only the first two IMFs was discarded as those two subsignals could be considered too close to the original signal, so the first three IMFs were selected as the starting point, taking that consideration into account.
Despite the fact that the 87.8% model accuracy rate does not meet the necessary criteria for practical considerations, this value could potentially be considered for further analysis. The classification of a large number of elements, as shown in the related confusion matrix, is indicative of an accurate and precise analysis (see Figure 7).
When operating a system based on five IMFs, the model accuracy escalates to 99.3% which means that it is a highly reliable and accurate classification methodology; the accuracy ranges from 99.3% for five IMFs to 99.8% for nine IMFs.
Considering how similar the accuracy values of both classes are, we could make some considerations to meet any requirement:
  • A classification model based on the analysis of the first five IMFs is sufficiently accurate for operational use and is therefore open to further development. For rapid response, this technique is currently recommended.
  • The nine-IMF case offers higher accuracy, so it is recommended for high-precision use. However, the close accuracy of the other classes analyzed in this work could mean that a lower computing effort could be recommended to obtain a high-precision model with less computational expense.
Further consideration of the requirements for the best model should be given, taking into account not only the model results but also the preparation and computation timing costs. However, the high accuracy obtained by the five-IMF class could be considered without any other considerations.
On the basis of this study, further work could be carried out, taking into account other combinations of IMFs, the application of the methodology using different signal types and axle conditions, and tests under operational conditions of rolling stock.

Author Contributions

Conceptualization, E.J., H.R. and A.B.; methodology, E.J. and A.B.; software, A.B.; validation, H.R. and A.B.; formal analysis, E.J.; investigation, E.J., H.R. and A.B.; resources, H.R. and A.B.; data curation, A.B.; writing—original draft preparation, E.J.; writing—review and editing, E.J., H.R. and A.B.; visualization, E.J.; supervision, H.R. and A.B.; project administration, H.R.; funding acquisition, H.R. and A.B. All authors have read and agreed to the published version of the manuscript.

Funding

The research work described in this paper was supported by the Spanish State Research Agency through the projects TED2021-131372A-I00 (AEI) and MCIN/AEI/10.13039/501100011033 (grant numbers MC4.0 PID2020-116984RB-C21-C22) and the project MEMRIAAP-CM-UC3M, supported by Comunidad de Madrid.

Data Availability Statement

The data presented in this study are available on request from the corresponding author due to privacy and legal reasons.

Acknowledgments

The authors are thankful for the support provided by the participating companies (Renfe, Alstom Spain, SKF Spain, and Dano-Rail-Danobatgroup Railway) in this project. During the preparation of this work, the authors used online language IA editors for English editing. The authors reviewed and edited the content as needed after using this tool/service and take full responsibility for the content of the publication.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
AIArtificial Intelligence
EMDEmpirical Mode Decomposition
IMFIntrinsic Mode Function
RMSRoot Mean Square
PSDPower Spectral Density
PKPeak Value
KURKurtosis
FCRCrest Factor
DSTStandard Deviation
SKWSkewness
SFRShape Factor
RMS-fRoot Mean Square (Frequency)
PK-fPeak Value
FRPeak Value Frequency
PSpectral Power
ROCReceiver Operating Characteristic

References

  1. WIDEM. Available online: http://www.widem.org/objectives.shtml (accessed on 19 March 2025).
  2. Esslinger, V.; Kieselbach, R.; Koller, R.; Weisse, B. The Railway Accident of Eschede—Technical Background. Eng. Fail. Anal. 2004, 11, 515–535. [Google Scholar] [CrossRef]
  3. Bustos, A.; Rubio, H.; Soriano-Heras, E.; Castejon, C. Methodology for the Integration of a High-Speed Train in Maintenance 4.0. J. Comput. Des. Eng. 2021, 8, 1605–1621. [Google Scholar] [CrossRef]
  4. Feynman, R.P. Simulating physics with computers. Int. J. Theor. Phys. 1982, 21, 467–488. [Google Scholar] [CrossRef]
  5. Lingli, C.; Yongchang, X.; Dongdong, L.; Honggui, H. Digital twin-driven graph domain adaptation neural network for remaining useful life prediction of rolling bearing. Reliab. Eng. Syst. Saf. 2024, 245, 109991. [Google Scholar] [CrossRef]
  6. Melesse, T.Y.; Bollo, M.; Di Pasquale, V.; Centro, F.; Riemma, S. Machine Learning-Based Digital Twin for Monitoring Fruit Quality Evolution. Procedia Comput. Sci. 2022, 200, 13–20. [Google Scholar] [CrossRef]
  7. What Is Quantum Computing?—IBM. Available online: https://www.ibm.com/think/topics/quantum-computing (accessed on 26 January 2025).
  8. Prati, E. Quantum Neuromorphic Hardware for Quantum Artificial Intelligence. In Proceedings of the 8th International Workshop DICE2016: Spacetime—Matter—Quantum Mechanics, Castiglioncello, Italy, 12–16 September 2016; IOP Publishing Ltd.: Bristol, UK; Volume 880, p. 012018. [Google Scholar]
  9. Ahmad, I.; Liu, Y.; Javeed, D.; Shamshad, N.; Sarwr, D.; Ahmad, S. A Review of Artificial Intelligence Techniques for Selection & Evaluation. IOP Conf. Ser. Mater. Sci. Eng. 2020, 853, 012055. [Google Scholar] [CrossRef]
  10. McCarthy, J.; Minsky, M.L.; Shannon, C.E. A Proposal for the Dartmouth Summer Research Project on Artificial Intelligence, August 31, 1955. AI Mag. 2006, 27, 12–14. [Google Scholar]
  11. Russel, S.J.; Peter, N. Artificial Intelligence: A Modern Approach, 4th ed.; Pearson Education Limited: Edinburgh, UK, 2010; ISBN 978-0-13-461099-3. [Google Scholar]
  12. Holmberg, L.; Davidsson, P.; Linde, P. A Feature Space Focus in Machine Teaching. In Proceedings of the 2020 IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops), Austin, TX, USA, 23–27 March 2020; pp. 1–2. [Google Scholar]
  13. Simard, P.Y.; Amershi, S.; Chickering, D.M.; Pelton, A.E.; Ghorashi, S.; Meek, C.; Ramos, G.; Suh, J.; Verwey, J.; Wang, M.; et al. Machine teaching: A new paradigm for building machine learning systems. arXiv 2017, arXiv:1707.06742. [Google Scholar] [CrossRef]
  14. Mosqueira-Rey, E.; Hernández-Pereira, E.; Alonso-Ríos, D.; Bobes-Bascarán, J.; Fernández-Leal, Á. Human-in-the-Loop Machine Learning: A State of the Art. Artif. Intell. Rev. 2023, 56, 3005–3054. [Google Scholar] [CrossRef]
  15. Available online: https://minds.wisconsin.edu/handle/1793/60660 (accessed on 20 March 2025).
  16. Power to the People: The Role of Humans in Interactive Machine Learning—AI Magazine. Available online: https://ojs.aaai.org/aimagazine/index.php/aimagazine/article/view/2513 (accessed on 20 March 2025).
  17. Braun, S. Discover Signal Processing: An Interactive Guide for Engineers; Wiley: Chichester, UK, 2008; ISBN 978-0-470-51970-7. [Google Scholar]
  18. McFadden, P.D.; Smith, J.D. Model for the Vibration Produced by a Single Point Defect in a Rolling Element Bearing. J. Sound Vib. 1984, 96, 69–82. [Google Scholar] [CrossRef]
  19. Vives, J. Incorporating Machine Learning into Vibration Detection for Wind Turbines. Model. Simul. Eng. 2022, 234, 1–8. [Google Scholar] [CrossRef]
  20. Antoni, J.; Randall, R.B. A Stochastic Model for Simulation and Diagnostics of Rolling Element Bearings with Localized Faults. J. Vib. Acoust. Trans. ASME 2003, 125, 282–289. [Google Scholar] [CrossRef]
  21. Mani, E.; Clive, R.; Paul, W.; Edward, S.; Arash, A.; Mayorkinos, P. Perspectives on railway axle bearing condition monitoring. Proc. Inst. Mech. Eng. Part F J. Rail Rapid Transit. 2020, 234, 17–31. [Google Scholar]
  22. Bustos, A.; Rubio, H.; Castejón, C.; García-Prada, J.C. EMD-Based Methodology for the Identification of a High-Speed Train Running in a Gear Operating State. Sensors 2018, 18, 793. [Google Scholar] [CrossRef] [PubMed]
  23. Guo, J.; Wang, Z.; Li, H.; Yang, Y.; Huang, C.-G.; Yazdi, M.; Kang, H.S. A Hybrid Prognosis Scheme for Rolling Bearings Based on a Novel Health Indicator and Nonlinear Wiener Process. Reliab. Eng. Syst. Saf. 2024, 245, 110014. [Google Scholar] [CrossRef]
  24. Luo, H.; Bo, L.; Peng, C.; Hou, D. Fault Diagnosis for High-Speed Train Axle-Box Bearing Using Simplified Shallow Information Fusion Convolutional Neural Network. Sensors 2020, 20, 4930. [Google Scholar] [CrossRef]
  25. Borghesani, P.; Smith, W.A.; Randall, R.B.; Antoni, J.; El Badaoui, M.; Peng, Z. Bearing Signal Models and Their Effect on Bearing Diagnostics. Mech. Syst. Signal Process. 2022, 174, 109077. [Google Scholar] [CrossRef]
  26. Cao, H.; Niu, L.; Xi, S.; Chen, X. Mechanical Model Development of Rolling Bearing-Rotor Systems: A Review. Mech. Syst. Signal Process. 2018, 102, 37–58. [Google Scholar] [CrossRef]
  27. Randall, R.B.; Antoni, J. Rolling Element Bearing Diagnostics—A Tutorial. Mech. Syst. Signal Process. 2011, 25, 485–520. [Google Scholar] [CrossRef]
  28. Huang, N.E.; Shen, Z.; Long, S.R.; Wu, M.C.; Shih, H.H.; Zheng, Q.; Yen, N.-C.; Tung, C.C.; Liu, H.H. The Empirical Mode Decomposition and the Hilbert Spectrum for Nonlinear and Non-Stationary Time Series Analysis. Proc. R. Soc. Lond. A 1998, 454, 903–995. [Google Scholar] [CrossRef]
  29. Rilling, G.; Flandrin, P.; Goncalves, P.; Lilly, J.M. Bivariate Empirical Mode Decomposition. IEEE Signal Process. Lett. 2007, 14, 936–939. [Google Scholar] [CrossRef]
  30. Looney, D.; Mandic, D.P. A Machine Learning Enhanced Empirical Mode Decomposition. In Proceedings of the 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, Las Vegas, NV, USA, 31 March–4 April 2008; pp. 1897–1900. [Google Scholar]
  31. Sun, G.; Zhang, R.; Liu, Z.; Wu, L.; Yu, Q.; Tan, X. EMD-Based Noise Reduction Study of Steel Cored Conveyor Belt Containing Slag Signal. Alex. Eng. J. 2024, 98, 56–67. [Google Scholar] [CrossRef]
  32. Li, W.; Guo, J. Extraction of Periodic Signals in Global Navigation Satellite System (GNSS) Vertical Coordinate Time Series Using the Adaptive Ensemble Empirical Modal Decomposition Method. Nonlinear Process. Geophys. 2024, 31, 99–113. [Google Scholar] [CrossRef]
  33. Heddam, S.; Vishwakarma, D.K.; Abed, S.A.; Sharma, P.; Al-Ansari, N.; Alataway, A.; Dewidar, A.Z.; Mattar, M.A. Hybrid River Stage Forecasting Based on Machine Learning with Empirical Mode Decomposition. Appl. Water Sci. 2024, 14, 46. [Google Scholar] [CrossRef]
  34. Colominas, M.A.; Schlotthauer, G.; Torres, M.E. Improved Complete Ensemble EMD: A Suitable Tool for Biomedical Signal Processing. Biomed. Signal Process. Control 2014, 14, 19–29. [Google Scholar] [CrossRef]
  35. Zhang, C.; Fu, S.; Ou, B.; Liu, Z.; Hu, M. Prediction of Dam Deformation Using SSA-LSTM Model Based on Empirical Mode Decomposition Method and Wavelet Threshold Noise Reduction. Water 2022, 14, 3380. [Google Scholar] [CrossRef]
  36. Zeiler, A.; Faltermeier, R.; Keck, I.R.; Tomé, A.M.; Puntonet, C.G.; Lang, E.W. Empirical Mode Decomposition—An Introduction. In Proceedings of the 2010 International Joint Conference on Neural Networks (IJCNN), Barcelona, Spain, 18–23 July 2010; pp. 1–8. [Google Scholar]
  37. Meng, D.; Wang, H.; Yang, S.; Lv, Z.; Hu, Z.; Wang, Z. Fault Analysis of Wind Power Rolling Bearing Based on EMD Feature Extraction. CMES 2021, 130, 543–558. [Google Scholar] [CrossRef]
  38. Talaei Khoei, T.; Kaabouch, N. Machine Learning: Models, Challenges, and Research Directions. Future Internet 2023, 15, 332. [Google Scholar] [CrossRef]
  39. Bartz-Beielstein, T. Supervised Learning: Classification and Regression. In Online Machine Learning: A Practical Guide with Examples in Python; Bartz, E., Bartz-Beielstein, T., Eds.; Machine Learning: Foundations, Methodologies, and Applications; Springer Nature: Singapore, 2024; pp. 13–22. ISBN 978-981-9970-07-0. [Google Scholar]
  40. Quinlan, J.R. Simplifying Decision Trees. Int. J. Man-Mach. Stud. 1987, 27, 221–234. [Google Scholar] [CrossRef]
  41. Saravanan, N.; Ramachandran, K.I. Fault Diagnosis of Spur Bevel Gear Box Using Discrete Wavelet Features and Decision Tree Classification. Expert Syst. Appl. 2009, 36, 9564–9573. [Google Scholar] [CrossRef]
  42. Cañete-Sifuentes, L.; Monroy, R.; Medina-Pérez, M.A.; Loyola-González, O.; Vera Voronisky, F. Classification Based on Multivariate Contrast Patterns. IEEE Access 2019, 7, 55744–55762. [Google Scholar] [CrossRef]
  43. Junquera, E.; Rubio, H.; Bustos, A.; Soriano, E. Categorización Del Estado Operativo de Material Rodante Ferroviario Mediante El Uso de Clasificadores Automáticos. In Proceedings of the XVI Congreso Iberoamericano de Ingeniería Mecánica, Concepción, Chile, 22–24 October 2024. [Google Scholar]
  44. Gafni, T.; Shlezinger, N.; Cohen, K.; Eldar, Y.C.; Poor, H.V. Federated Learning: A Signal Processing Perspective. IEEE Signal Process. Mag. 2022, 39, 14–41. [Google Scholar] [CrossRef]
  45. Hu, Z.; Shaloudegi, K.; Zhang, G.; Yu, Y. Federated Learning Meets Multi-Objective Optimization. IEEE Trans. Netw. Sci. Eng. 2022, 9, 2039–2051. [Google Scholar] [CrossRef]
  46. Yang, B.; Lei, Y.; Li, X.; Li, N.; Si, X.; Chen, C. A Dynamic Barycenter Bridging Network for Federated Transfer Fault Diagnosis in Machine Groups. Mech. Syst. Signal Process. 2025, 230, 112605. [Google Scholar] [CrossRef]
  47. Ruiz, E.; Bustos, A.; Rubio, H.; Castejón, C. Application of automatic classifiers for condition monitoring of railway rolling stock. Técnica Ind. 2023, 336, 38–45. [Google Scholar] [CrossRef]
  48. Bustos, A.; Rubio, H.; Peláez, G.; García-Prada, J.C. Clasificación del estado de un eje ferroviario mediante EMD y clasificadores automáticos. In Proceedings of the XXIV Congreso Nacional de Ingeniería Mecánica (CNIM-2023), Las Palmas Gran Canaria, Spain, 25–27 October 2023. (In Spanish). [Google Scholar]
Figure 1. EMD process.
Figure 1. EMD process.
Electronics 14 03006 g001
Figure 2. Test bench: bogie, transmission mechanism, load system, and measurement equipment.
Figure 2. Test bench: bogie, transmission mechanism, load system, and measurement equipment.
Electronics 14 03006 g002
Figure 3. Schematic representation: axle and defects.
Figure 3. Schematic representation: axle and defects.
Electronics 14 03006 g003
Figure 4. Time domain signal.
Figure 4. Time domain signal.
Electronics 14 03006 g004
Figure 5. Signal decomposed into its 9 IMFs.
Figure 5. Signal decomposed into its 9 IMFs.
Electronics 14 03006 g005
Figure 6. Confusion matrices for all hyperparameters combined in the case of 5 IMFs. (a) Left axle end; (b) right axle end.
Figure 6. Confusion matrices for all hyperparameters combined in the case of 5 IMFs. (a) Left axle end; (b) right axle end.
Electronics 14 03006 g006
Figure 7. Confusion matrix for global test, case of 3 IMFs.
Figure 7. Confusion matrix for global test, case of 3 IMFs.
Electronics 14 03006 g007
Figure 8. Confusion matrix for global test: (a) case of 5 IMFs; (b) case of 9 IMFs.
Figure 8. Confusion matrix for global test: (a) case of 5 IMFs; (b) case of 9 IMFs.
Electronics 14 03006 g008
Figure 9. Confusion matrix, precision: (a) for 5 IMFs; (b) for 9 IMFs.
Figure 9. Confusion matrix, precision: (a) for 5 IMFs; (b) for 9 IMFs.
Electronics 14 03006 g009
Figure 10. Confusion matrix, accuracy: (a) for 5 IMFs; (b) for 9 IMFs.
Figure 10. Confusion matrix, accuracy: (a) for 5 IMFs; (b) for 9 IMFs.
Electronics 14 03006 g010
Figure 11. ROC curves: (a) for 5 IMFs; (b) for 9 IMFs.
Figure 11. ROC curves: (a) for 5 IMFs; (b) for 9 IMFs.
Electronics 14 03006 g011
Table 1. Accelerometer technical details.
Table 1. Accelerometer technical details.
ParameterValue
Sensitivity (±20%)10.2 mV (m/s2)
Acceleration range±490 m/s2
Frequency range (±3 dB)0.52 Hz to 8 kHz
Temperature range−54 °C to 121 °C
Resonance frequency25 kHz
Amplitude linearity±1%
Transverse sensitivity≤7%
Table 2. Ratio of percentages obtained for the different IMFs, for combinations (a) and (b).
Table 2. Ratio of percentages obtained for the different IMFs, for combinations (a) and (b).
IMF3456789
SideLRLRLRLRLRLRLR
c. (a)69.468.674.078.394.176.389.974.293.375.195.583.595.586.9
c. (b)69.770.769.475.173.971.069.972.478.671.286.674.487.476.0
Table 3. Ratio of percentages obtained for the different IMFs, for combinations (c) and (d).
Table 3. Ratio of percentages obtained for the different IMFs, for combinations (c) and (d).
IMF3456789
SideLRLRLRLRLRLRLR
c. (c)84.285.881.887.780.176.479.679.280.696.784.678.484.579.1
c. (d)83.083.982.888.491.778.686.278.591.180.895.184.595.587.3
Table 4. Model results. All hyperparameters combined.
Table 4. Model results. All hyperparameters combined.
IMF3456789
SideLRLRLRLRLRLRLR
(t)74.874.880.280.394.084.292.682.295.986.595.490.395.892.5
(fr)81.284.376.485.467.771.269.975.164.368.359.663.157.461.4
(t + fr)82.381.381.884.396.285.490.782.996.087.799.690.299.492.6
Table 5. Ratio of percentages obtained for the different IMFs, for combinations (I) to (III).
Table 5. Ratio of percentages obtained for the different IMFs, for combinations (I) to (III).
IMF3456789
(I)87.387.798.591.296.799.099.1
(II)89.488.995.991.793.699.099.0
(III)88.990.598.195.896.699.399.4
Table 6. Model results. Global test.
Table 6. Model results. Global test.
IMF3456789
c. global87.890.499.399.299.699.799.8
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Junquera, E.; Rubio, H.; Bustos, A. Determination of the Condition of Railway Rolling Stock Using Automatic Classifiers. Electronics 2025, 14, 3006. https://doi.org/10.3390/electronics14153006

AMA Style

Junquera E, Rubio H, Bustos A. Determination of the Condition of Railway Rolling Stock Using Automatic Classifiers. Electronics. 2025; 14(15):3006. https://doi.org/10.3390/electronics14153006

Chicago/Turabian Style

Junquera, Enrique, Higinio Rubio, and Alejandro Bustos. 2025. "Determination of the Condition of Railway Rolling Stock Using Automatic Classifiers" Electronics 14, no. 15: 3006. https://doi.org/10.3390/electronics14153006

APA Style

Junquera, E., Rubio, H., & Bustos, A. (2025). Determination of the Condition of Railway Rolling Stock Using Automatic Classifiers. Electronics, 14(15), 3006. https://doi.org/10.3390/electronics14153006

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop