Support Vector Machine and k-Means Clustering for Advanced Wheel Flat Identification: A Comparison of Supervised and Unsupervised Methods

Chegini, Alireza; Mohammadi, Mohammadreza; Mosleh, Araliya; Vale, Cecilia; Ghiasi, Ramin; Silva, Ruben; Guedes, Antonio; Meixedo, Andreia; Malekjafarian, Abdollah

doi:10.3390/machines14030286

Open AccessArticle

Support Vector Machine and k-Means Clustering for Advanced Wheel Flat Identification: A Comparison of Supervised and Unsupervised Methods

by

Alireza Chegini

¹,

Mohammadreza Mohammadi

²

,

Araliya Mosleh

^2,*

,

Cecilia Vale

²

,

Ramin Ghiasi

³

,

Ruben Silva

²

,

Antonio Guedes

²

,

Andreia Meixedo

²

and

Abdollah Malekjafarian

³

¹

Department of Civil Engineering, Faculty of Engineering, Islamic Azad University, Qazvin 1477893855, Iran

²

CONSTRUCT, Faculty of Engineering, University of Porto, 4200-465 Porto, Portugal

³

Structural Dynamics and Assessment Laboratory, School of Civil Engineering, University College Dublin, D04 V1W8 Dublin, Ireland

^*

Author to whom correspondence should be addressed.

Machines 2026, 14(3), 286; https://doi.org/10.3390/machines14030286

Submission received: 9 January 2026 / Revised: 23 February 2026 / Accepted: 25 February 2026 / Published: 3 March 2026

(This article belongs to the Special Issue Rolling Contact Fatigue and Wear of Rails and Wheels)

Download

Browse Figures

Versions Notes

Abstract

Artificial-intelligence-driven wayside monitoring has become a promising solution for early identification of railway wheel flats, enabling safer operations and more efficient maintenance planning. This study introduces a comparative investigation of supervised and unsupervised machine learning strategies for wheel flat identification, with particular emphasis on real-time applicability and sensor cost reduction. Support Vector Machines (SVMs) and k-means clustering are evaluated as representative supervised and unsupervised approaches using vibration data obtained from numerically simulated train–track interactions under realistic operating conditions, including train speeds of 120 km/h and 200 km/h and multiple wheel flat severities. A key contribution of this work is the proposal of a simplified supervised classification framework that directly exploits Auto-Regressive features extracted from rail-mounted accelerometers, eliminating the need for feature normalization and multi-sensor data fusion. This simplification significantly reduces computational effort, making the approach suitable for real-time deployment in operational railway environments. In parallel, a systematic sensitivity analysis is conducted to assess the influence of sensor placement and to identify the minimum sensor configuration required to achieve reliable damage classification. The outputs from the current study show that an SVM emerges with more accurate defect classification than the k-means clustering, allowing a wayside system with fewer sensors.

Keywords:

wayside condition monitoring; wheel flat identification; clustering method; SVM; unsupervised method

1. Introduction

The rail sector plays a crucial role in modern transportation systems, providing an efficient, reliable, and sustainable means of moving passengers and freight over long distances. To ensure the safety and efficiency of rail operations, it is vital to prioritize the integrity of railway infrastructure, particularly the wheels and tracks. Defects in the wheels, like flats, can significantly degrade vehicle–track interaction, resulting in premature component wear, increased vibration levels, and reduced ride comfort. Therefore, early detection and classification of wheel defects are critical for enabling timely maintenance actions and preventing potential derailments and service disruptions.

Conventionally, railway operators have relied on regular manual inspections; however, advances in sensing technologies have led to the development of more comprehensive condition monitoring systems. Two primary monitoring strategies are currently employed: wayside and onboard monitoring. Wayside systems [1], which involve sensors positioned along the track, allow continuous monitoring of all trains passing through a given location, with relatively low maintenance costs and no need for train retrofitting. In contrast, onboard systems, which use sensors placed on the train itself [2], provide detailed and real-time information about the condition of the train. However, these systems are generally more expensive to install and maintain, and typically monitor only a single train at a time rather than multiple trains.

Both wayside and onboard monitoring systems are increasingly benefiting from the introduction of Artificial Intelligence (AI), which is revolutionizing the way rail networks ensure safety, efficiency, and reliability [3]. AI-based systems can process and analyze vast amounts of sensor data in real time, enabling immediate decision-making and action, while helping to maintain system integrity with minimal disruption.

Several studies in the literature report the application of machine learning (ML) methods applied to the identification of out-of-roundness (OOR) wheel defects, in particular wheel flats. This type of methodology usually involves feature extraction techniques using data from acceleration or strain measurements and classifier methods to identify the defects [4]. Common feature extraction techniques include Auto-Regressive (AR) models [5] and wavelet transforms [6]. For feature classification, recent studies have increasingly applied machine learning methods to distinguish between healthy and defective wheels [7,8]. Typically, two different approaches are used: (i) supervised methods, where the algorithm learns from labeled training data [9,10,11,12], and (ii) unsupervised methods, where the model is trained using data without labeled outputs [4]. Partitional and density-based clustering methods, such as k-means [13] and DBSCAN [14], are widely used for classification and pattern identification in complex datasets. While supervised learning typically offers higher accuracy due to the availability of labeled data, it requires significant effort for data annotation. In contrast, unsupervised learning avoids labeling costs and is more scalable, though often at the expense of classification accuracy and interpretability.

Advances in computing power have driven increased use of deep learning techniques for complex pattern recognition tasks. In particular, Convolutional Neural Networks (CNNs) have been widely adopted for classification and image-based applications [15,16] and have recently been applied to wheel failure detection using wheel tread images [17,18,19].

Xing et al. [19] developed a machine vision method based on an enhanced YOLOv3 framework, incorporating convolution layers for wheel flat detection. Trilla et al. [18] developed a wayside system based on an image-processing technique using CNN to detect and diagnose tread defects, including wheel flats. Zhang et al. [17] proposed a deep learning algorithm to improve upon existing YOLO models by incorporating optimized convolutional layers for detecting small defects. Liao et al. [20] present an effective application of evolutionary optimization to balance multiple performance metrics.

Therefore, these reported integrated AI techniques in wayside systems have contributed to enhancing the accuracy of wheel flat detection. In the studies by Mosleh et al. [21] and Mohammadi et al. [5], which employed unsupervised methodologies based on outlier analysis, no misclassifications were observed in the simulated dataset using only one pair of accelerometers installed on the rail. Ghiasi et al. [14], using a multistage clustering framework (M-CLUSTER) for unsupervised monitoring, achieved 98% accuracy in detecting train wheel flats using accelerometers. Despite the great effectiveness of ML-based methods in damage detection, only a few can classify wheel flat severities and not very efficiently. Jorge et al. [22] employed a sparse autoencoder and k-means clustering technique to classify wheel flats. Subsequently, Mohammadi et al. [23] extended this work by localizing the defects and classifying the length of flats in trains with multiple defective wheels. In another study, Guo et al. [24] developed an image-based inspection technique for monitoring wheel tread defects, such as flats, using SVM and a supervised approach for defect classification.

While previous studies have successfully implemented machine learning techniques for wheel flat detection, most methods focus on either unsupervised clustering approaches or supervised classification techniques, often requiring extensive labeled datasets or complex feature extraction processes. For example, Mosleh et al. [25] used supervised classification on acceleration signals, while Ghiasi et al. [14] and Mosleh et al. [4] applied unsupervised clustering. Despite their effectiveness, several critical gaps remain. Few studies provide a direct and quantitative comparison between supervised and unsupervised approaches under identical operational conditions, feature sets, and sensor layouts, making it difficult to objectively assess their relative suitability for wayside deployment. Moreover, the majority of reported systems are oriented toward binary defect detection, while explicit multi-class classification of wheel-flat severity, which is essential for maintenance prioritization, is rarely addressed. Most methods are validated under specific speeds, loads, or environmental conditions, leaving uncertainty about performance across variable railway operations. Prior research emphasizes defect detection but often ignores the classification of wheel flat severity, which is essential for prioritizing maintenance actions. Furthermore, many approaches rely on complex feature extraction, data fusion, or extensive sensor arrays, limiting practical implementation in real-world systems. Table 1 provides a quantitative benchmarking of representative studies and reported approaches for wheel-flat monitoring. The comparison summarizes key implementation aspects, including the number and type of sensors, investigated speed range, minimum reported detectable flat length (when available), and the methodological capability (defect detection versus severity classification). This overview allows the proposed framework to be positioned with respect to existing state-of-the-art methodologies and typical wayside monitoring configurations.

The benchmarking indicates that most reported approaches rely on multi-sensor configurations and are mainly oriented toward wheel flat detection rather than explicit severity classification. Moreover, a large portion of the existing studies are validated under limited operational conditions, such as restricted speed ranges or controlled loading scenarios, and in several cases focus on relatively large wheel flat sizes. In contrast, the proposed framework demonstrates reliable performance using a reduced number of rail-mounted accelerometers, supports multi-class wheel-flat severity classification, and maintains stable performance at higher vehicle speed (200 km/h). These characteristics indicate a favorable balance between classification capability, sensor economy, and practical applicability for wayside condition monitoring systems.

To address the gaps identified above and evidenced through the benchmarking in Table 1, this study presents a systematic comparison between supervised (SVM) and unsupervised (k-means clustering) approaches for wheel-flat identification using identical datasets, features, and sensor configurations. In addition to defect detection, the investigation focuses on multi-class severity classification, which is essential for maintenance prioritization and is rarely addressed in existing wayside systems. Furthermore, a simplified supervised framework is adopted that removes feature normalization and data fusion steps, aiming to reduce computational complexity and enable real-time implementation with a limited number of sensors.

To address these challenges, this study proposes:

A comparative analysis of supervised and unsupervised methods to assess their strengths and weaknesses in wheel flat identification, at train speeds of 120 and 200 km/h;
A sensitivity analysis on sensor placement to optimize the number of sensors while maintaining accuracy;
An implementation-oriented simplified supervised approach that eliminates the need for feature normalization and data fusion, reducing computational complexity while maintaining robust classification performance.

2. Application of Machine Learning Algorithms for Identifying Railway Wheel Defects

Detecting and classifying wheel flats requires a structured approach that combines data collection, preprocessing, and machine learning algorithms. By organizing the process into clear stages, the proposed methodology ensures precise defect identification while optimizing computational efficiency. The following subsection outlines the detailed methodology for wheel flat detection, covering the steps from data acquisition through wayside sensors to the evaluation of machine learning algorithms for defect classification.

2.1. Overview of the Methodology

Figure 1 represents the schematic of the proposed methodology for detecting wheel flats and classifying their length into different severity levels. The proposed methodology for wheel flat identification starts with collecting acceleration data from wayside sensors mounted on the rail as trains pass. These sensors capture time series data that reflect the vibration patterns caused by the wheels, which can indicate the presence of defects. To obtain the most similar and reliable reproduction of the rail response, an artificial noise is generated based on the maximum response of the signal and added to the original signal. Therefore, new signals are generated considering a noise amplitude level of 5% of the maximum signal amplitude. Following the approach adopted in the literature [36], additionally, a low-pass Chebyshev Type II digital filter with a cutoff frequency of 1500 Hz is applied to filter all time-series data, and a sampling frequency of 10 kHz is used to analyze acceleration signals for both baseline and damaged scenarios. Afterwards, to convert the sensor data into damage-sensitive features, the Auto-Regressive (AR) model is applied. The AR model order is selected based on the stabilization of the Akaike Information Criterion (AIC), which occurs for model orders above 40 [5]. This model helps reduce the size of the input data, making it more manageable for subsequent analysis. The AR model extracts significant features that are sensitive to damage, which are essential for identifying wheel flats [4]. To suppress environmental and operational effects, PCA (Principal Component Analysis) is used, allowing damage-sensitive features to remain dominant and improving the robustness of wheel flat identification. PCA also enhances the sensitivity to damage by reducing the dimensionality of the data while retaining the most relevant information. During normalization, principal components explaining more than 80% of the cumulative variance are removed, as they mainly represent environmental and operational effects [37]. The normalized features are combined into a single damage index (DI) using Mahalanobis Distance (MD), a method that evaluates the similarity between feature sets from undamaged and damaged wheels; shorter MD values indicate higher similarity [5]. MD effectively reduces the complexity of the data by consolidating multiple features into one indicator for easier analysis and comparison. A wheel defect detection algorithm is applied to define a confidence boundary (CB) for each accelerometer through outlier analysis. Generally, the literature states that the Mahalanobis squared distance can be approximated by a chi-squared distribution in n-dimensional space. Thus, since the chi-squared distribution models the sum of squares of independent random variables (Gaussians), it can be considered that the Mahalanobis Distance can also be approximated by a Gaussian statistical distribution [38,39,40,41]. The inverse cumulative distribution function (ICDF) of the Gaussian distribution is used to estimate the CB. If the damage indicator exceeds the CB threshold (typically 1%), the feature is considered an outlier, indicating a potential defect [5,21]. The identified defective scenarios are then classified based on the severity of the damage, specifically the flat length. This classification is carried out by comparing the damage indicator against predefined thresholds and analyzing the severity based on the outlier detection results. More detailed descriptions of the individual steps of the proposed unsupervised methodology for defective wheel identification are provided in the following references. To evaluate the effectiveness of different approaches, a comparative analysis is conducted between unsupervised methods (e.g., k-means clustering) and supervised methods (e.g., SVM). This analysis helps determine the strengths and weaknesses of each method for wheel flat identification. Finally, a simplified framework is proposed for supervised classification, eliminating the need for feature normalization and data fusion. This reduction in complexity maintains robust classification performance while decreasing the computational burden, making the methodology more efficient for real-time applications.

2.2. Unsupervised Classification Technique Using k-Means

The k-means clustering technique is used for the unsupervised defect classification step, which requires a priori definition of the number of clusters. To determine the optimal number of clusters, the global silhouette index (SIL) is utilized, while the centroid (CD) of each cluster is computed using k-means. Clusters’ dissimilarities are generally defined as distance metrics. Among these, the Euclidean (square root of the sum-of-squares) is, by far, the most used. Several studies have shown that as wheel flat size or severity increases, these features exhibit monotonic and near-linear changes in magnitude, making Euclidean distance a natural measure of dissimilarity in this space [42,43]. Under such conditions, Euclidean distance effectively captures differences in overall feature magnitude and distribution associated with increasing defect severity. Moreover, k-means clustering with Euclidean distance has been widely adopted in railway fault diagnosis and severity classification studies, where it has been demonstrated to produce clusters consistent with known defect severity levels [4,14]. It is assumed that the features assigned to a cluster are undamaged if the centroid (CD) of that cluster is lower than the confidence boundary (CB). Otherwise, they are assumed to be damaged. Subsequently, the centroids of damaged clusters are listed in ascending order. There is a correlation (Equation (1)) between different centroids and different levels of severity, with the highest value coming from the cluster with the most severe damage. k-means clustering partitions the damage indices

x

into

k

clusters, minimizing the within-cluster variance:

J = \sum_{i = 1}^{k} \sum_{X_{j} \in C_{i}} ‖x_{j} - μ_{i}‖

(1)

in which

C_{i}

represents a set of data points assigned to the cluster

i

,

x_{j}

is a data point in the cluster

i

, and

μ_{i}

represents the centroid of the cluster

i

, computed through Equation (2) as

μ_{i} = \frac{1}{|C_{i}|} \sum_{X_{j} \in C_{i}} x_{j}

(2)

The clustering quality is assessed through the silhouette coefficient

S (j)

computed for each data point

x_{j}

using Equation (3) as follows:

S (j) = \frac{b (j) - a (j)}{m a x (a (j), b (j))}

(3)

where

a (j)

is the average distance from

x_{j}

to all other points in the same cluster,

b (j)

is the minimum average distance from

x_{j}

to points in the nearest different cluster. Afterward, the mean silhouette score is then computed for each

k

clusters and the optimal number of clusters is selected through Equation (4):

k^{*} = \arg \max_{k} S_{k}

(4)

in which the

S_{k}

is the mean silhouette score for the

k

clusters. After determining the optimal

k

, k-means clustering is applied again with the

k^{*}

, using the city-block distance metric, ensuring that clustering is based on absolute deviations, which is robust to feature scaling variations.

2.3. Supervised Classification Technique Using SVM

The SVM technique is widely used for classification problems due to its ability to find an optimal hyperplane that separates data points with a maximum margin [44]. However, classical SVM is inherently a binary classifier, meaning it can only distinguish between two classes. To extend SVM to multi-class problems, Error-Correcting Output Codes (ECOCs) are employed, which decompose a multi-class problem into multiple binary classification problems [45]. An ECOC creates a coding matrix that maps multi-class labels into multiple binary problems, ensuring better error correction and improved classification accuracy. For a dataset with

k

classes, the ECOC constructs multiple binary classifiers, where each classifier is trained to distinguish between a subset of classes. A one-vs.-one (OvO) approach is implemented, in which a separate classifier is trained for each pair of classes. For problems with K classes,

(\binom{k}{2}) = (\frac{k (k - 1)}{2})

binary classifiers are trained. During the prediction process, the outputs of these binary classifiers are aggregated to determine the final class label. In the current study, K = 3 classes are considered and a coding matrix

C \in {\{+ 1, - 1\}}^{3 * M}

is designed, where

M

is the number of binary SVMs. Each class is assigned a unique codeword (row of

C

), and each column corresponds to a binary SVM trained to separate a subset of classes [46].

Each binary SVM solves a constrained optimization problem to maximize the margin between classes. The primal formulation of the SVM aims to find the weight vector

w_{m}

and bias

b_{m}

that define the separating hyperplane. The optimization problem is described by Equation (5) as follows:

\min_{w_{m}, b_{m}, ξ_{m}} \frac{1}{2} {‖w_{m}‖}^{2} + C \sum_{i = 1}^{n} ξ_{m, i}

(5)

which is subject to the following constraints (Equation (6)):

y_{m, i} (w_{m}^{T} ϕ (x_{i}) + b_{m}) \geq 1 - ξ_{m, i}, ξ_{m, i} \geq 0, \forall i,

(6)

where

w_{m}

and

b_{m}

are the weight vector defining the hyperplane and bias term for the

m

th binary SVM.

Function

ξ

(

m

,

i

) represents the slack variables, which allow for misclassifications (soft margin), while

C

is the regularization parameter that controls the trade-off between maximizing the margin and minimizing classification errors. The function

ϕ (x_{i})

is the feature mapping function that projects the input

ϕ (x_{i})

into a higher-dimensional space. As described in Figure 1, the calculated damage indices from data fusion are selected as the input data for SVM, while for the simplified method, the extracted features are considered as input features. In this framework, the input to the SVM consists of 40 features extracted from each sensor for each train passage, eliminating the need for additional preprocessing steps such as feature normalization and data fusion.

y_{m, i}

is the label of the

i

^th sample for the

m

^th binary problem

(y_{m, i} \in \{+ 1, - 1\})

. Moreover, it can be mentioned that the term

\frac{1}{2} {‖w_{m}‖}^{2}

maximizes the margin between the two classes and the term

C \sum_{i = 1}^{n} ξ_{m, i}

misclassifications. The constraints ensure that samples are on the correct side of the margin, with some flexibility allowed by

ξ_{m, i}

.

The primal problem is converted to its dual form for efficient optimization using kernel functions Radial Basis (R), Polynomial (P), Linear (L), and Gaussian (G). These kernels enable SVM to handle non-linearly separable data by implicitly mapping it to a higher-dimensional space, allowing for non-linear decision boundaries. The dual problem is solved using Equation (7):

\max_{\propto_{m}} \sum_{i = 1}^{n} \propto_{m, i} - \frac{1}{2} \sum_{i, j} \propto_{m, i} \propto_{m, j} y_{m, i} y_{m, j} K (x_{i}, x_{j})

(7)

subjected to Equation (8):

0 \leq \propto_{m, i} \leq C, \sum_{i = 1}^{n} \propto_{m, i} y_{m, i} = 0

(8)

where

\propto_{m, i}

and

\propto_{m, i}

are Lagrange multipliers associated with the

i

^th and

j

^th sample for the

m

^th binary SVM and

K (x_{i}, x_{j})

is the Kernel function that computes the inner product

ϕ {(x_{i})}^{T} ϕ (x_{j})

in the feature space. Here,

x_{i}, x_{j}

represent the feature vectors of the

i

^th and

j

^th samples, respectively. Once the SVM is trained, the decision function for a test sample

x

in the ECOC framework will be defined through Equation (9):

f (x) = \sum_{i = 1}^{n} \propto_{m, i} y_{m, i} K (x_{i}, x_{j}) + b_{m}

(9)

Then the vector of binary prediction (

s_{m}

) is defined through the sign of

f (x)

. Afterward, the ECOC framework combines the binary predictions to produce the final multi-class label. The final decision function selects the class

k^{*}

whose codeword

c_{k}

is closest to

s

according to a loss function (Hamming loss), which is described by employing Equation (10) as follows:

k^{*} = a r g \min_{k} \frac{1}{M} \sum_{m = 1}^{M} Ι (s_{m} \neq c_{k, m})

(10)

where

Ι (.)

is the indicator function, and for

s_{m} \neq c_{k, m}

, it is equal to 1; otherwise, it is 0.

3. Numerical Modeling

This section introduces the virtual track-side monitoring system and presents numerical simulations of train-track interactions, wheel profiles, and track profiles. The numerical simulations of dynamic interactions between trains and tracks are conducted using the in-house software Vehicle–Structure Interaction (VSI), enabling the generation of synthetic measurement responses [47]. Developed in MATLAB version R2018a [48], the VSI tool integrates the structural matrices of the track and vehicle, which are initially modeled independently using a finite element (FE) package. Although these subsystems are modeled separately, VSI employs a fully coupled approach to integrate them. The train-to-track coupling is achieved through a 3D wheel-rail contact model based on Hertzian theory [49], with normal and tangential contact forces including rolling friction creep calculated via the USETAB routine [50].

The track is modeled through a finite element environment implemented in ANSYS release 19.2 [51], in which the rail, fasteners/pad, sleeper, ballast, and foundation components are represented through a layered modeling approach that defines their interactions within the track system. The rails are simulated using BEAM181 elements, and the sleepers are also modeled using the same element type, with the rail–sleeper connection defined through rail pads and fastening systems that link the two components. The mechanical behavior of the rail pads, ballast, and foundation is introduced through linear spring-damper elements modeled using COMBIN14, which are defined in the longitudinal, lateral, and vertical directions to represent the force-displacement relations between the rail-sleeper assembly and the supporting layers. The mass associated with the ballast is incorporated through concentrated mass elements modeled using MASS21, which are connected to the sleeper nodes to account for the inertial contribution of the ballast within the numerical model. The coupling between the different layers of the track system is enabled by the COMBIN14 elements, which define the interaction between the rail, sleepers, ballast, and foundation. The mechanical properties assigned to the rail, sleepers, rail pads, ballast, and foundation are summarized in Table 2, which provides the mechanical parameters used in the numerical simulations.

The railway vehicle considered in this study corresponds to an Alfa Pendular train (Portuguese passenger train) composed of six wagons and is numerically simulated using ANSYS [51]. The kinematic configuration of the vehicle is defined using rigid beam finite elements, which establish the geometric connectivity between the main structural components without introducing structural flexibility. The car body, bogies, and wheelsets are connected through these rigid elements to reproduce the overall vehicle layout. The inertial properties of the vehicle components are introduced through concentrated mass elements (MASS21) located at the respective centers of gravity of the car body, bogies, and wheelsets. These mass elements account for both translational mass and rotary inertia, allowing the representation of roll, pitch, and yaw motions of each component. The primary and secondary suspension systems are modeled using linear spring–damper elements (COMBIN14), which connect the wheelsets to the bogies and the bogies to the car body, respectively. The suspension elements are defined independently in the longitudinal, lateral, and vertical directions, allowing the directional stiffness and damping characteristics of the suspensions to be applied. Moreover, the stiffness, damping, concentrated mass, and rotational inertia parameters are denoted by

k, c, m

and

I

, respectively, while subscripts

c b, b

and

w

refer to the car body, bogies and, wheelsets. The geometric configuration of the vehicle is described through a set of characteristic distances defining the relative positioning of the main components. The longitudinal, transversal, and vertical distances between the car body, bogies, and wheelsets are represented by the parameters

a, b

and

h

, respectively. The wheel–rail interface geometry is further characterized by the track gauge

s

and the nominal wheel rolling radius

R_{0}

. The values considered for each parameter in the present study are summarized in Table 3, which provides the mechanical and geometric parameters used in the numerical simulations.

A comprehensive description of the numerical scheme of the train and track components is available in the study performed by Mosleh et al. [36]. A schematic of the numerical simulation for the train–track interaction system is represented in Figure 2a.

The proposed simulated wayside monitoring system’s sensor placement is illustrated in Figure 2b. The wheel flat identification setup consists of eight accelerometers, located at the midspan of the right and left rails. Measurement points 1 through 4 correspond to the sensors on the right rail (opposite side of the defective wheel), while measurement points 5 through 8 are located on the left rail (side of the damage). Each sensor is spaced 0.6 m apart.

From the literature [59], it is possible to consider two main shapes for wheel flat geometries. A newly formed wheel flat has sharp edges and appears immediately after the defect is created (Figure 3a). During operation, continued wheel rolling and tread wear gradually smooth these edges; therefore, the ends of the flat become rounded over time, while the middle part of the flat remains largely unchanged (Figure 3b).

Different intervals of wheel flat lengths are considered across previous studies on wheel flat identification [21,60,61]. This study simulates defective train wheels by considering two levels of wheel flat severity, classified as Damage 1 and Damage 2, based on the flat length (

L_{f}

). To consider the variability of wheel flats observed in operation,

L_{f}

is modeled as a random variable following a uniform distribution, denoted by

U (a, b)

. Following this notation,

L_{f}

is randomly generated between the lower

a

and upper

b

boundary, while all values within this interval are equally likely to occur. Accordingly, the flat lengths corresponding to Damage 1 ranged between 10 and 25 mm and are distributed as

L_{f} ~ U (10, 25)

. On the other hand, the flats regarding Damage 2 ranged between 28 and 50 mm and are distributed as

L_{f} ~ U (28, 50)

. The depth of the wheel flat (

D_{f}

) is determined using Equation (11), where

R_{w}

denotes the radius of the wheel.

D_{f} = \frac{{L_{f}}^{2}}{16 R_{w}}

(11)

Additionally, the vertical profile of a wheel flat is determined using expression 12:

Z = - \frac{D_{f}}{2} (1 - c o s \frac{2 π x}{L_{f}}) \cdot H (x - (2 π R_{w} - L_{f})), 0 \leq x \leq 2 π R_{w}

(12)

Here,

x

refers to the coordinates that are aligned with the track’s longitudinal direction, and H denotes the periodic Heaviside function.

Track irregularities, although minor, significantly affect wheel–rail contact forces because the rails are not perfectly smooth. Therefore, it is crucial to account for these irregularities, even in numerical simulations. Power spectral density (PSD) curves are generated [62] to create artificial rail unevenness profiles in both vertical and transverse directions using MATLAB [48]. In accordance with the European Standard EN 13848-2 [63], consequently, rail unevenness profiles are generated for spatial wavelengths, covering the D1 (3 m to 25 m) and D2 (25 m to 70 m) wavelength ranges. Since the wavelengths applied to construct the unevenness profile are relatively long in comparison to wheel flats, the excited frequencies due to track unevenness are considerably lower than the dominant frequencies of wheel flat impacts.

4. Simulation Scenarios

For the baseline scenario, a total of 120 simulations are conducted for the unsupervised approach and 50 simulations for the supervised approach, considering four rail unevenness profiles, train speeds ranging from 40 to 220 km/h in increments of 20 km/h, and three loading configurations: (i) empty train, (ii) half-loaded train, (iii) fully loaded train. In this context, the empty train condition corresponds to a vehicle operating without passengers, where only the self-weight of the vehicle components is considered in the numerical analyses. The fully loaded train represents the maximum passenger occupancy, leading to the highest static wheel–rail contact forces, while the half-loaded train corresponds to an intermediate loading condition. These loading scenarios directly influence the vertical forces transmitted to the rails and are therefore included to account for realistic operational variability. Dynamic analyses for damage scenarios are conducted considering the last wheel of the third wagon as a defective wheel. In the unsupervised approach, simulations are performed at train speeds of 120 km/h and 200 km/h, with 130 scenarios per speed (260 total). For each speed, 130 simulations are conducted, including 50 and 80 analyses with varying flat geometries corresponding to damages 1 and 2, respectively. For the supervised method, the same speeds are considered, with 100 scenarios per speed (200 total), evenly distributing 50 analyses across damages 1 and 2. The summary of the damaged and undamaged scenarios for both machine learning approaches is described in Table 4.

5. Time-Series Representation for Defect Detection

Figure 4 illustrates the vertical acceleration responses for train passages across a healthy wheel and two flat-severity levels: L1 = 10.9 mm, and L2 = 49.4 mm. These measurements are acquired by accelerometer 3 while the train speed is considered 120 km/h. The results represent that acceleration signals for a minor defect (L = 10.9 mm, red line) closely resemble the baseline (green line), complicating early identification of the wheel flat. In contrast, the defect severity correlates with markedly amplified acceleration amplitudes. As shown in Figure 4, peak acceleration ranges from −9.67 m/s² to 7.48 m/s² for L1 = 10.9 mm, intensifying to −127.35 m/s² and 227.98 m/s² for L2 = 49.4 mm. As shown in this figure, the time series signal represents how increasing the length of the flat causes a more significant acceleration amplitude, emphasizing the need for advanced identification strategies to identify defective wheels.

6. Wheel Flat Identification Application: Results and Discussion

This section presents a comprehensive analysis of wheel flat identification using both supervised and unsupervised machine learning approaches. A comparative study between the SVM method and k-means clustering is conducted to evaluate their respective strengths and limitations. Additionally, a sensitivity analysis on sensor placement is performed to optimize the number of sensors required while ensuring high classification accuracy. Furthermore, a simplified preprocessing framework is introduced for supervised classification, which eliminates the need for feature normalization and data fusion. This method significantly reduces computational complexity while maintaining reliable classification performance, offering a more efficient solution for wheel flat identification. To address the need for a clear and reproducible description of the entire workflow, Table 5 summarizes all preprocessing and classification stages together with their governing parameters and outputs. This unified representation complements Figure 1 by providing an explicit parameter-level view of the pipeline. All parameters reported in Table 5 are fully defined and described in the corresponding sections of the manuscript.

6.1. Damage Detection

The process of damage detection in wheel flats involves four key steps, feature extraction, feature normalization, data fusion, and outlier analysis, as illustrated in Figure 1. These techniques have been extensively studied in previous works by the authors [5,21] and are briefly elaborated in this section. The methodology ensures accurate detection of wheel flat damage while accounting for various operational conditions and train speeds.

Feature extraction is the first step in damage detection, converting raw acceleration data from wayside sensors into damage-sensitive indicators. Time-series data are collected from eight accelerometers installed at mid-span positions on both rails. For each recorded passage, 40 statistical features per sensor are extracted using an Auto-Regressive model of order 40, forming a structured three-dimensional dataset of size 250 × 40 × 8 (passages × features × sensors).

Figure 5a and Figure 6a display the extracted AR model features at sensor position 3 for train speeds of 120 km/h and 200 km/h, respectively. The results show clear ascending and descending patterns for damaged scenarios, with noticeable differences in amplitude. However, at higher speeds (200 km/h), the distinction between baseline and low-severity damage becomes less evident due to increased environmental and operational variations. This highlights the necessity of feature normalization to enhance robustness. Data normalization is a crucial step for mitigating the influence of environmental and operational effects on the extracted features. As illustrated in Figure 4, distinguishing between defective and healthy wheels directly from time-series signals is a challenging task. This difficulty arises primarily from variations induced by environmental and operational conditions, which introduce additional variability into the feature space. To reduce the impact of these effects and enhance damage sensitivity, the extracted features must be appropriately modeled. In this study, feature normalization is carried out using a latent-variable approach based on Principal Component Analysis (PCA). By applying PCA to the dataset, external variations are effectively suppressed, allowing damage-sensitive features to remain dominant and improving the robustness of wheel flat identification. A feature matrix of size 8 × 40 is constructed for each passage, and the first two principal components (accounting for over 80% of the variance) are removed to enhance sensitivity to damage.

Figure 5b and Figure 6b illustrate the impact of PCA-based feature normalization, showing a significant reduction in variability caused by environmental and operational conditions. However, despite this improvement, the separation between damaged and undamaged wheels remains ambiguous, necessitating an additional data fusion step.

To further enhance damage detection accuracy, MD is employed as a damage index (DI). The MD calculation consolidates 40 PCA-normalized features per sensor into a single indicator that quantifies the deviation of a passage from the baseline condition. The output is an 8 × 250 matrix, where each entry represents the MD-based damage index for a specific sensor and train passage. Figure 5c and Figure 6c illustrate the effectiveness of this approach, providing a clearer visual separation between baseline and damaged scenarios. While this step successfully captures variations between damaged and undamaged wheels, the differences in MD amplitudes across sensors necessitate further validation through confidence boundary analysis to ensure consistent and reliable damage detection.

To classify a passage as healthy or damaged, an outlier detection algorithm is applied based on CB estimation. The squared Mahalanobis Distance values are approximated using a chi-squared distribution, allowing the damage indices to be modeled as a Gaussian distribution. The inverse cumulative distribution function (ICDF) is then used to determine a 1% CB threshold. Figure 5d and Figure 6d illustrate the final damage detection results, comparing the computed damage indices with the established confidence boundaries. The analysis shows that baseline scenarios (the first 120 passages) remain within the confidence boundary, confirming healthy wheel conditions, while damaged scenarios (passages 121–250) exceed the confidence boundary, accurately detecting wheel flats. The green dots in Figure 5d and Figure 6d represent damaged wheel scenarios, and the proposed methodology can distinguish between damaged scenarios (green dots in Figure 5d and Figure 6d) and baseline conditions (black dots in Figure 5d and Figure 6d). All damage scenarios are correctly identified without false positives or negatives, demonstrating the effectiveness of the methodology. Furthermore, it is observed that damage detection is reliable with a single sensor, regardless of defect severity or position. This highlights the potential for sensor optimization, which is further investigated in the following sections.

6.2. Classification Approach Using k-Means

After successfully detecting wheel flat damage, the next step is to classify the severity of the defect. This section presents the classification results obtained using the k-means clustering technique, an unsupervised learning method. The analysis is performed for vehicle speeds of 120 km/h and 200 km/h, using eight accelerometers installed along the track. The goal is to evaluate the performance of k-means clustering in identifying different levels of wheel flat severity based on sensor data.

Figure 7a shows the classification results for accelerometer 3 for the vehicle speed of 120 km/h, while the three damage classes are well-separated, with clear boundaries between the baseline and both levels of damage. The clustering process successfully distinguishes between Damage 1 and Damage 2, indicating that the extracted features are highly sensitive to defect severity. These results confirm that at 120 km/h, k-means clustering provides highly accurate classification without any misclassification.

Figure 7b illustrates the classification results when the vehicle passes at 200 km/h. The performance of the clustering method declines at this higher speed due to increased noise and external variations. Damage classification for accelerometer 3 (Figure 7b) at 200 km/h show a significant overlap between baseline and damaged scenarios. The presence of misclassified points suggests that the extracted features become less reliable at higher speeds. The overlapping regions indicate that, at higher train speeds, the damage sensitivity of individual sensors decreases, leading to lower clustering accuracy. The main challenge at 200 km/h is the increased environmental and operational variability, leading to overlapping damage index distributions, and since the k-means clustering is an unsupervised approach, the pattern in the amplitude variation in the damage indices is not clearly captured by the algorithm, which affects the feature distribution and reduces classification accuracy. As a result, a sensor fusion approach is applied in the next step to enhance the robustness of the classification process.

To improve classification accuracy at 200 km/h, a sensor fusion technique is applied, merging the information from all eight accelerometers. The results are presented in Figure 7c. The clustering performance does not show a substantial improvement. The distinction between different damage severities remains unclear, indicating that external noise and operational variability continue to impact classification accuracy. Since sensor fusion alone does not significantly enhance classification performance, an additional step is introduced to refine the analysis. To further improve clustering accuracy, baseline (healthy) scenarios are removed from the dataset in the next stage of analysis. The rationale behind this approach is that eliminating non-damaged cases will allow the clustering algorithm to focus exclusively on distinguishing between different levels of wheel flat severity, thereby minimizing the influence of external variations. The results of this refined approach are discussed in the following section.

Figure 8 presents the classification results for wheel flat severities considering feature fusion for accelerometers 3 (Figure 8a), while the outputs for sensor fusion are described through Figure 8b as representative examples at a train speed of 200 km/h. In this analysis, only damaged scenarios are considered, while the features from all eight sensors are merged through feature fusion. Despite the expectation that removing baseline (healthy) scenarios would improve classification accuracy, the results indicate no significant improvement. Note that damage 1 and damage 2 are separated by a vertical red line in the figure. As shown in the figure, the overlap between different damage severity levels persists, suggesting that even with feature fusion and baseline exclusion, the k-means clustering approach continues to struggle to achieve precise classification at higher speeds.

To further enhance the sensitivity of the extracted features to wheel flat damage, the defect classification at 200 km/h is re-evaluated using merged data after sensor fusion, considering only damaged scenarios. As illustrated in Figure 8b, sensor fusion leads to an increase in classification accuracy. However, despite this improvement, the defect classification remains imprecise, with some degree of misclassification still present. These results indicate that while merging sensor data improves overall feature sensitivity, it does not completely resolve the challenges associated with high-speed classification using k-means clustering.

6.3. Classification Approach Using SVM

The results from the previous section demonstrated a clear distinction between damaged and healthy wheel scenarios at a train speed of 120 km/h. However, at higher speeds (200 km/h), the separation between baseline and damaged cases became less evident due to environmental and operational variations. To address this limitation and improve classification accuracy, this section presents a supervised SVM-based approach. Different kernel functions, including Radial Basis Function (RBF), Polynomial (P), Linear (L), and Gaussian (G), are tested to optimize classification performance [45]. The dataset used for training and testing consists of 50 simulations each for healthy wheels, Damage 1 (mild severity), and Damage 2 (severe flat length). Furthermore, 10-fold cross-validation is used to evaluate the performance of the SVM classifiers. This approach allows each sample to be used for both training and testing, reducing variability and providing a more robust estimate compared to a single random split, such as 80% training and 20% testing. The dataset is divided into 10 equal segments, where in each iteration, nine folds are used for training and the remaining fold is used for validation. This process is repeated 10 times, with a different fold used for validation in each iteration. The confusion matrices reflect the testing results across all folds. It should also mention that default SVM kernel parameters were employed (BoxConstraint = 1, KernelScale = ‘auto’, PolynomialOrder = 3 for the Polynomial kernel, and KernelOffset = 0 where applicable), as they provided satisfactory performance without the need for further hyperparameter optimization.

Figure 9 presents the classification results obtained using SVM for both 120 km/h and 200 km/h train speeds in the form of confusion matrices. In addition to the class-wise prediction counts, the last column and last row report precision, false discovery rate (FDR), recall, and false negative rate (FNR), while the final cell shows the overall accuracy and error rate. It should be noted that, since the dataset is class-balanced and the task is a single-label multi-class classification problem, the micro-averaged F1-score is numerically equal to the overall accuracy. The analysis considers feature fusion from all eight installed accelerometers along the track. SVM provides accurate classification across all kernel functions, with no misclassifications observed at a training speed of 120 km/h. The method successfully distinguishes between healthy wheels and different levels of damage severity. The classification accuracy remains high when the train passes at 200 km/h, achieving 99.3% across all kernel functions. Despite the increased noise and external variations at higher speeds, the classifier is trained using labeled data and learned decision boundaries that effectively separate damage-related features from variations induced by speed, noise, and sensor-specific responses, resulting in significantly fewer misclassifications than the unsupervised k-means clustering (Figure 8). These results indicate that SVM, when applied with multiple sensors, ensures accurate classification regardless of train speed. Given its superior performance, the R kernel is selected for further analyses in subsequent sections.

6.4. Comparison of Unsupervised and Supervised Methods: Influence of Sensor Position on Wheel Flat Identification Accuracy

This section evaluates the performance of unsupervised (k-means clustering) and supervised SVM methods for classifying wheel-flat severities across different sensor layouts. The primary objective is to determine the influence of accelerometer positioning on classification accuracy and assess the feasibility of reducing the number of sensors while maintaining reliable defect identification. To analyze the impact of sensor positioning, classification results are first examined separately for sensors located on either side of the damage. Figure 10 and Figure 11 illustrate the accuracy of k-means clustering for train speeds of 120 km/h and 200 km/h, respectively. Note that in Figure 10, the baseline scenarios are separated from the defective scenarios by a vertical red line, whereas in Figure 11, the same marker is used to distinguish Damage 1 from Damage 2. For the speed of 120 km/h, all train passages are included (Figure 10), while for the speed of 200 km/h, the healthy train passages are excluded (Figure 11).

As illustrated in these figures, damage clustering analysis proceeds without misclassification when using sensors positioned on the opposite side of the damaged wheel, for both 120 km/h and 200 km/h (Figure 10a and Figure 11a). This suggests that placing sensors on the opposite rail provides more stable and accurate damage classification by capturing a clearer response to the dynamic interaction between the wheel and rail. However, when sensors are positioned on the same side as the damage, classification performance declines, particularly at higher speeds. At 120 km/h, the classification still maintains reasonable accuracy, but a limited number of misclassifications appear (Figure 10b). As the train speed increases to 200 km/h, the number of misclassified damage cases increases significantly (Figure 11b), indicating that at higher speeds, local vibrations and external noise interfere with the clustering process, making accurate damage classification more challenging. These findings highlight the sensitivity of the unsupervised k-means method to sensor placement, where sensors on the opposite side of the damage produce more reliable results, while those on the same side suffer from higher misclassification rates, particularly at elevated train speeds. The superior performance of sensors located on the opposite side of the damaged wheel is attributed to the spatial filtering effect of the rail–sleeper system, which captures a more stable structural response, whereas sensors on the same side are more influenced by localized high-frequency impact forces that increase feature variability and reduce classification separability.

Figure 12 presents the defect classification results using the SVM method with the R kernel function, analyzing the impact of sensor placement on classification accuracy. The classification is performed separately for sensors on either side of the track to assess whether sensor location affects the robustness of the supervised approach.

Figure 12a,b illustrate the damage classification results using sensors positioned on the opposite side of the damaged wheel (sensors 1–4) for train speeds of 120 km/h and 200 km/h, respectively. The results demonstrate high classification accuracy, with a clear distinction between baseline (healthy) wheels and different levels of wheel flat severity. The ability of the SVM method to maintain precise classification for both speed conditions indicates its robustness and reliability in damage detection, even when sensors are positioned away from the defect location.

On the other hand, Figure 12c,d display the classification results using sensors located on the same side as the damaged wheel (sensors 5–8), considering both train speeds. The results indicate that the classification accuracy remains consistent with the previous cases, showing that the supervised method is unaffected by sensor placement. Unlike the unsupervised approach, which exhibited a strong dependency on sensor location, the SVM method provides stable and reliable classification performance, regardless of whether the sensors are positioned on the same or opposite side of the defect.

These findings confirm that, as the classifier is trained using labeled data and learned decision boundaries that remain consistent across different sensor locations, the supervised method delivers robust results for flat severity classification across different sensor layouts and speeds when four sensors are installed on each side of the track. In contrast, the unsupervised method demonstrated sensitivity to sensor positioning, particularly at higher speeds, where misclassifications increased significantly when sensors were placed on the same side as the defect. This highlights the advantage of the supervised approach, as it ensures accurate classification without being influenced by sensor placement or train speed.

To further optimize the number of sensors required for accurate damage classification, a single-sensor approach is evaluated to determine if it is possible to reduce sensor usage while maintaining classification accuracy. As discussed earlier in this section, two different sensor layouts were initially tested: one configuration with eight sensors, four on each side of the track, and another setup considering four sensors on each side separately. Following these evaluations, the setup is further reduced to a single sensor, leading to the selection of sensors 3 and 7 for damage classification using SVM. This step aims to analyze the feasibility of achieving reliable classification performance with minimal sensor deployment.

Figure 13 illustrates the damage classification results using accelerometers 3 and 7 separately for both 120 km/h and 200 km/h train speeds. As observed in Figure 13a,c, the classification accuracy is higher at 120 km/h, whereas at 200 km/h (Figure 13b,d), the accuracy decreases due to increased noise and environmental variability. Moreover, a key observation from Figure 13a,b is that when the single sensor is positioned on the opposite side of the damage, the flat severity classification achieves higher accuracy compared to when the sensor is installed on the same side as the defect (Figure 13c,d). This trend holds regardless of vehicle speed, confirming that sensor placement significantly influences classification performance. Additionally, the sensitivity to the sensor location becomes more pronounced at higher speeds (200 km/h), further emphasizing the challenges associated with reduced sensor configurations.

Despite these variations, the results indicate that even with a single sensor, damage severity can still be classified with robust accuracy using the SVM method, achieving a minimum accuracy of 92% in the worst-case scenario. This finding highlights the feasibility of a cost-effective monitoring system that minimizes the number of required sensors while maintaining reliable classification capabilities. However, as noted earlier, using four sensors on each side of the track provides consistent classification performance without sensitivity to sensor placement or train speed, whereas reducing the number of sensors to one introduces sensitivity to the side of the track, particularly at higher speeds. This trade-off between sensor optimization and classification stability must be carefully considered when designing wayside monitoring systems for railway applications.

To verify that the performance of the single-sensor configuration is not caused by random cross-validation variability, a paired statistical comparison was conducted using fold-wise accuracies obtained from identical 10-fold partitions for both single-sensor and multi-sensor configurations. The Wilcoxon signed-rank test indicated a statistically significant difference (p = 0.0078 < 0.05), which was further confirmed by the paired t-test (p = 0.0032 < 0.05). Since both models were evaluated on the same folds, the observed performance gap cannot be attributed to random sampling variation. The results therefore demonstrate that the single-sensor configuration consistently achieves accuracies above 92% while maintaining statistically reliable classification performance.

Table 6 and Table 7 present a comparative analysis of SVM (supervised) and k-means clustering (unsupervised) methods, evaluating their classification accuracy, sensitivity to sensor placement, and misclassification rates under different sensor configurations and train speeds. Moreover, the accuracy, precision, recall, and F1 score for different fault levels are evaluated to reveal class-specific detection capability. The results show that SVM consistently outperforms k-means clustering, achieving higher accuracy and greater robustness across sensor placements and speeds. Additionally, the table demonstrates that reducing the number of sensors to a single unit remains feasible with SVM. These findings reinforce that SVM is the more reliable approach for damage classification in railway condition monitoring systems. The decrease in k-means accuracy at 200 km/h is mainly due to increased dynamic noise and vibration levels, which reduce the stability of the extracted AR features and lead to greater overlap between damage classes. In addition, speed-dependent shifts in dominant excitation frequencies alter the signal content, decreasing the separability of clusters in the distance-based algorithm. These effects, combined with the larger variability of high-speed responses, explain the reduced performance of the unsupervised method compared to the supervised SVM approach.

6.5. Simplified Preprocessing Framework for SVM-Based Classification

In the original unsupervised framework, preprocessing steps such as PCA-based feature normalization and data fusion using Mahalanobis Distance were primarily introduced to reduce feature dimensionality and mitigate redundancy, which is essential for stabilizing outlier analysis and k-means-clustering-based damage detection. In contrast, in the supervised framework adopted in this study, these steps are not strictly required. SVMs are inherently capable of handling moderately high-dimensional feature spaces and identifying optimal separating hyperplanes without the need for prior dimensionality reduction, provided that the input features are properly scaled. Moreover, the AR-model-based features used in this work are already compact and damage-sensitive; therefore, additional PCA or distance-based data fusion did not lead to measurable improvements in classification performance within the supervised scheme. Removing these steps simplifies the classification pipeline while maintaining comparable accuracy and interpretability. To enhance damage classification efficiency while reducing computational complexity, a simplified preprocessing framework is introduced for supervised SVM classification. Classical SVM models often require extensive preprocessing steps, including feature normalization and data fusion, to optimize classification accuracy. However, these additional steps increase the computational load and may not be practical for real-time railway monitoring applications. The proposed simplified framework eliminates the need for feature normalization and data fusion, significantly streamlining the classification process without compromising accuracy.

The key advantage of this approach lies in its ability to directly utilize raw extracted features for classification. By removing normalization steps, the model maintains high accuracy while reducing processing time. Additionally, eliminating data fusion minimizes dependencies on multiple sensors, making the system more adaptable to real-world implementations where a reduced number of sensors is desirable.

The results demonstrate that the simplified framework maintains robust classification performance through SVM, achieving comparable accuracy to the classical SVM approach. Even with minimal preprocessing, the model effectively distinguishes between different levels of damage severity, confirming the feasibility of this optimization. Furthermore, the simplified approach retains its effectiveness across varying train speeds, ensuring consistent performance under different operational conditions.

Figure 14 and Figure 15 illustrate the defect classification results obtained using the simplified framework with accelerometers 3 and 7, both separately and simultaneously, under different vehicle speeds. Figure 14 presents the classification results when sensors 3 and 7 are used simultaneously for vehicle speeds of 120 km/h and 200 km/h. The results demonstrate that flat severity classification achieves high accuracy regardless of the vehicle speed. Furthermore, classification precision improves at 200 km/h, suggesting that the model benefits from higher dynamic interactions at higher velocities.

Figure 15 provides a detailed comparison of the classification performance when sensors 3 and 7 are analyzed separately. The findings indicate that at 120 km/h, the defect classification is not sensitive to the side of the damage (Figure 15a,c). However, at 200 km/h, classification accuracy varies with sensor placement, with sensor 3 performing better than sensor 7 (Figure 15b,d). This suggests that sensor location becomes more critical at higher speeds, affecting classification precision in the simplified framework.

The comparison between the classical SVM and the simplified framework, as illustrated in Figure 13 and Figure 15, highlights differences in classification accuracy. For sensor 3, at a train speed of 120 km/h, the classical SVM (Figure 13a) achieves 100% accuracy, whereas the simplified approach (Figure 15a) slightly decreases to 96%. At 200 km/h, the accuracy of the classical SVM (Figure 13b) remains at 96%, whereas the simplified framework (Figure 15b) reaches 98.7%. Similarly, for sensor 7, the classical SVM method (Figure 13c,d) attains 97.3 and 92% accuracy at 120 km/h and 200 km/h, respectively, while the simplified approach (Figure 15c,d) shows a slight drop to 96% and 91.3%. These results indicate that while the simplified method slightly reduces classification accuracy, the impact remains minimal. Given its reduced computational complexity and faster processing time, the simplified framework remains a viable alternative, particularly for real-time railway monitoring applications where efficiency is a priority. It should be noted that the reported classification results are obtained from numerical simulations conducted under controlled and class-balanced conditions, in which noise levels were explicitly introduced to better reflect realistic system behavior. In this context, the micro-averaged F1-score is equivalent to the overall classification accuracy, and the presented confusion matrices provide class-wise precision, recall, and error rates, which implicitly reflect variability in the model predictions. While these results demonstrate the effectiveness of the proposed SVM-based framework under controlled numerical conditions, performance degradation may be expected when applying the methodology to real experimental data, due to additional environmental variability, loading uncertainties, and other sources of operational noise that are not fully captured in the numerical model. Therefore, caution should be exercised when interpreting near-perfect classification accuracy obtained from simulated data, and further validation using real measurement datasets is required to assess the generalization capability and robustness of the proposed approach.

7. Conclusions

This study evaluated supervised (SVM) and unsupervised (k-means) machine learning approaches for wayside detection and severity classification of railway wheel flats at train speeds of 120 and 200 km/h. The results show that the classical SVM approach provides consistent and reliable classification across the investigated scenarios, achieving accuracies above 99% with four sensors and above 92% using a single-sensor configuration. In contrast, the k-means clustering method exhibited pronounced sensitivity to train speed and sensor placement, with its accuracy decreasing to 75.2% at 200 km/h even when eight sensors were employed. It should be emphasized that these conclusions are derived from high-fidelity synthetic data generated through numerical simulations, and performance levels may differ under real-world operational conditions.

A sensitivity analysis on sensor placement confirmed that reliable detection can be achieved with a reduced number of accelerometers, particularly when sensors are positioned on the opposite side of the defect. This finding supports significant sensor optimization and associated cost reduction for practical wayside deployments.

A simplified preprocessing framework was introduced, eliminating feature normalization and data fusion while maintaining high accuracy (91.3–98.7%). The reduced computational burden makes the approach suitable for real-time edge implementation in existing Wheel Condition Monitoring System infrastructures. The proposed workflow can be directly embedded in current wayside condition monitoring systems, where vibration signals from rail-mounted accelerometers are processed on-site to extract AR features, followed by real-time SVM classification and transmission of severity alerts to the maintenance platform.

Overall, supervised learning—especially SVM and the simplified framework—proved superior for wheel-flat severity classification, enabling accurate and cost-effective monitoring with minimal sensor configurations. Compared with typical multi-sensor wayside installations, the optimized configuration offers substantial reductions in hardware, cabling, installation effort, and long-term maintenance costs per monitoring site.

The proposed methodology will be tested and validated in an upcoming field trial through ongoing projects involving different types of trains (passenger and freight), where its performance will be rigorously assessed under real operational conditions. Further developments will incorporate modeling uncertainties, sensor errors, and environmental effects to enhance robustness.

Author Contributions

Conceptualization, A.M. (Araliya Mosleh) and C.V.; methodology, A.M. (Araliya Mosleh), A.M. (Andreia Meixedo), R.G. and A.M. (Abdollah Malekjafarian); software, A.M. (Andreia Meixedo) and R.G.; validation, A.C., M.M. and A.M. (Araliya Mosleh); formal analysis, A.C., M.M. and A.M. (Araliya Mosleh); investigation, A.C., R.S., A.G., A.M. (Araliya Mosleh) and M.M.; resources, C.V. and A.M. (Araliya Mosleh); data curation, A.C. and M.M.; writing—original draft preparation, A.C., A.M. (Araliya Mosleh), M.M., R.S. and A.G.; writing—review and editing, A.M. (Araliya Mosleh), C.V., A.M. (Abdollah Malekjafarian) and R.G.; visualization, A.C.; supervision, C.V. and A.M. (Araliya Mosleh); project administration, A.M. (Araliya Mosleh) and C.V.; funding acquisition, A.M. (Araliya Mosleh) and M.M. All authors have read and agreed to the published version of the manuscript.

Funding

We gratefully appreciate the financial support from the Base Funding UIDB/04708/2020 with DOI 10.54499/UIDB/04708/2020 (https://doi.org/10.54499/UIDB/04708/2020) and Programmatic Funding—UIDP/04708/2020 with DOI 10.54499/UIDP/04708/2020 (https://doi.org/10.54499/UIDP/04708/2020) of the CONSTRUCT—Instituto de I&D em Estruturas e Construções—funded by national funds through the FCT/MCTES (PIDDAC); and Grant no. 2021.04272.CEECIND with DOI: https://doi.org/10.54499/2021.04272.CEECIND/CP1679/CT0003, from the Stimulus of Scientific Employment, Individual Support (CEECIND)—4th Edition provided by FCT (Fundação para a Ciência e Tecnologia). The authors also acknowledge the financial support from the FCT scholarship no. 2024.02919.BD. This work was supported by the NRPCES project—“New Replacement Policy Considering Environmental Sustainability”, funded under the EUROGIA2030 Programme of the EUREKA Clusters Sustainability Call 2022 (SUS2022-039), with national co-financing from FEDER through the Norte 2030 program, under the scope of Portugal 2030 with national project number: 19217.

Data Availability Statement

The original contributions presented in this study are included in the article.

Conflicts of Interest

The authors declare no conflict of interest.

References

Shaikh, M.Z.; Ahmed, Z.; Chowdhry, B.S.; Baro, E.N.; Hussain, T.; Uqaili, M.A.; Mehran, S.; Kumar, D.; Shah, A.A. State-of-the-art wayside condition monitoring systems for railway wheels: A comprehensive review. IEEE Access 2023, 11, 13257–13279. [Google Scholar] [CrossRef]
Bragança, C.; Souza, E.F.; Ribeiro, D.; Meixedo, A.; Bittencourt, T.N.; Carvalho, H. Drive-by Methodologies Applied to Railway Infrastructure Subsystems: A Literature Review—Part II: Track and Vehicle. Appl. Sci. 2023, 13, 6982. [Google Scholar] [CrossRef]
Fu, W.; He, Q.; Feng, Q.; Li, J.; Zheng, F.; Zhang, B. Recent advances in wayside railway wheel flat detection techniques: A review. Sensors 2023, 23, 3916. [Google Scholar] [CrossRef] [PubMed]
Mosleh, A.; Meixedo, A.; Ribeiro, D.; Montenegro, P.; Calçada, R. Automatic clustering-based approach for train wheels condition monitoring. Int. J. Rail Transp. 2023, 11, 639–664. [Google Scholar] [CrossRef]
Mohammadi, M.; Mosleh, A.; Vale, C.; Ribeiro, D.; Montenegro, P.; Meixedo, A. An Unsupervised Learning Approach for Wayside Train Wheel Flat Detection. Sensors 2023, 23, 1910. [Google Scholar] [CrossRef]
Belotti, V.; Crenna, F.; Michelini, R.C.; Rossi, G.B. Wheel-flat diagnostic tool via wavelet transform. Mech. Syst. Signal Process. 2006, 20, 1953–1966. [Google Scholar] [CrossRef]
Ni, Y.-Q.; Zhang, Q.-H. A Bayesian machine learning approach for online detection of railway wheel defects using track-side monitoring. Struct. Health Monit. 2021, 20, 1536–1550. [Google Scholar] [CrossRef]
Li, Y.; Zuo, M.J.; Lin, J.; Liu, J. Fault detection method for railway wheel flat using an adaptive multiscale morphological filter. Mech. Syst. Signal Process. 2017, 84, 642–658. [Google Scholar] [CrossRef]
Krummenacher, G.; Ong, C.S.; Koller, S.; Kobayashi, S.; Buhmann, J.M. Wheel defect detection with machine learning. IEEE Trans. Intell. Transp. Syst. 2017, 19, 1176–1187. [Google Scholar] [CrossRef]
Vitola, J.; Pozo, F.; Tibaduiza, D.; Anaya, M. Distributed Piezoelectric Sensor System for Damage Identification in Structures Subjected to Temperature Changes. Sensors 2017, 17, 1252. [Google Scholar] [CrossRef]
Shafique, R.; Siddiqui, H.; Rustam, F.; Ullah, S.; Siddique, M.; Lee, E.; Ashraf, I.; Dudley, S. A Novel Approach to Railway Track Faults Detection Using Acoustic Analysis. Sensors 2021, 21, 6221. [Google Scholar] [CrossRef]
Addin, O.; Sapuan, S.; Mahdi, E.; Othman, M. A Naïve-Bayes classifier for damage detection in engineering materials. Mater. Des. 2007, 28, 2379–2386. [Google Scholar] [CrossRef]
Arthur, D.; Vassilvitskii, S. k-Means++: The Advantages of Careful Seeding; Stanford University: Stanford, CA, USA, 2006. [Google Scholar]
Ghiasi, R.; Gordan, M.; Mosleh, A.; Ribeiro, D.; Malekjafarian, A. M-CLUSTER: Multistage clustering for unsupervised train wheel condition monitoring. Veh. Syst. Dyn. 2024, 64, 177–202. [Google Scholar] [CrossRef]
Li, Z.; Liu, F.; Yang, W.; Peng, S.; Zhou, J. A survey of convolutional neural networks: Analysis, applications, and prospects. IEEE Trans. Neural Netw. Learn. Syst. 2021, 33, 6999–7019. [Google Scholar] [CrossRef] [PubMed]
Jamshidi, A.; Hajizadeh, S.; Su, Z.; Naeimi, M.; Núñez, A.; Dollevoet, R.; De Schutter, B.; Li, Z. A decision support approach for condition-based maintenance of rails based on big data analysis. Transp. Res. Part C Emerg. Technol. 2018, 95, 185–206. [Google Scholar] [CrossRef]
Zhang, Q.; Peng, J.; Tian, K.; Wang, A.; Li, J.; Gao, X. Advancing Ultrasonic Defect Detection in High-Speed Wheels via UT-YOLO. Sensors 2024, 24, 1555. [Google Scholar] [CrossRef]
Trilla, A.; Bob-Manuel, J.; Lamoureux, B.; Vilasis-Cardona, X. Integrated Multiple-Defect Detection and Evaluation of Rail Wheel Tread Images using Convolutional Neural Networks. Int. J. Progn. Health Manag. 2021, 12, 2906. [Google Scholar] [CrossRef]
Xing, Z.; Zhang, Z.; Yao, X.; Qin, Y.; Jia, L. Rail wheel tread defect detection using improved YOLOv3. Measurement 2022, 203, 111959. [Google Scholar] [CrossRef]
Liao, R.; Zhang, Y.; Wang, H.; Zhao, T.; Wang, X. Multi-objective optimisation of surveillance camera placement for bridge–ship collision early-warning using an improved non-dominated sorting genetic algorithm. Adv. Eng. Inform. 2026, 69, 103918. [Google Scholar] [CrossRef]
Mosleh, A.; Meixedo, A.; Ribeiro, D.; Montenegro, P.; Calçada, R. Early wheel flat detection: An automatic data-driven wavelet-based approach for railways. Veh. Syst. Dyn. 2023, 61, 1644–1673. [Google Scholar] [CrossRef]
Jorge, T.; Magalhães, J.; Silva, R.; Guedes, A.; Ribeiro, D.; Vale, C.; Meixedo, A.; Mosleh, A.; Montenegro, P.; Cury, A. Early identification of out-of-roundness damage wheels in railway freight vehicles using a wayside system and a stacked sparse autoencoder. Veh. Syst. Dyn. 2025, 63, 232–257. [Google Scholar] [CrossRef]
Mohammadi, M.; Mosleh, A.; Vale, C.; Ribeiro, D.; Montenegro, P.; Meixedo, A. Smart railways: AI-based track-side monitoring for wheel flat identification. Proc. Inst. Mech. Eng. Part F J. Rail Rapid Transit 2025, 239, 272–289. [Google Scholar] [CrossRef]
Guo, G.; Peng, J.; Yang, K.; Xie, L.; Song, W. Wheel Tread Defects Inspection Based on SVM. In Proceedings of the 2017 Far East NDT New Technology & Application Forum (FENDT), Xi’an, China, 22–24 June 2017. [Google Scholar]
Mosleh, A.; Ghiasi, R.; Gordan, M.; Ribeiro, D.; Malekjafarian, A. Wayside monitoring for railway wheel flat identification: A multiclass supervised learning approach. Proc. Inst. Mech. Eng. Part F J. Rail Rapid Transit 2026, 240, 112–129. [Google Scholar] [CrossRef]
Nowakowski, T.; Komorski, P.; Szymański, G.M.; Tomaszewski, F. Wheel-flat detection on trams using envelope analysis with Hilbert transform. Lat. Am. J. Solids Struct. 2019, 16, e148. [Google Scholar] [CrossRef]
Barman, J.; Hazarika, D. Linear and quadratic time–frequency analysis of vibration for fault detection and identification of NFR trains. IEEE Trans. Instrum. Meas. 2020, 69, 8902–8909. [Google Scholar] [CrossRef]
Salehi, M.; Bagherzadeh, S.A.; Fakhari, M. Experimental detection of train wheel defects using wayside vibration signal processing. Struct. Health Monit. 2023, 22, 3286–3301. [Google Scholar] [CrossRef]
Lourenço, A.; Ferraz, C.; Ribeiro, D.; Mosleh, A.; Montenegro, P.; Vale, C.; Meixedo, A.; Marreiros, G. Adaptive time series representation for out-of-round railway wheels fault diagnosis in wayside monitoring. Eng. Fail. Anal. 2023, 152, 107433. [Google Scholar] [CrossRef]
Gao, R.; He, Q.; Feng, Q. Railway wheel flat detection system based on a parallelogram mechanism. Sensors 2019, 19, 3614. [Google Scholar] [CrossRef]
Zhou, C.; Gao, L.; Xiao, H.; Hou, B. Railway wheel flat recognition and precise positioning method based on multisensor arrays. Appl. Sci. 2020, 10, 1297. [Google Scholar] [CrossRef]
Filograno, M.L.; Guillén, P.C.; Rodríguez-Barrios, A.; Martín-López, S.; Rodríguez-Plaza, M.; Andrés-Alguacil, Á.; González-Herráez, M. Real-time monitoring of railway traffic using fiber Bragg grating sensors. IEEE Sens. J. 2011, 12, 85–92. [Google Scholar] [CrossRef]
Mishra, S.; Sharan, P.; Saara, K. Real time implementation of fiber Bragg grating sensor in monitoring flat wheel detection for railways. Eng. Fail. Anal. 2022, 138, 106376. [Google Scholar] [CrossRef]
Komorski, P.; Szymanski, G.M.; Nowakowski, T.; Orczyk, M. Advanced acoustic signal analysis used for wheel-flat detection. Lat. Am. J. Solids Struct. 2021, 18, e338. [Google Scholar] [CrossRef]
Deilamsalehy, H.; Havens, T.C.; Lautala, P. Sensor fusion of wayside visible and thermal imagery for rail car wheel and bearing damage detection. In Proceedings of the ASME/IEEE Joint Rail Conference, Philadelphia, PA, USA, 4–7 April 2017. [Google Scholar]
Mosleh, A.; Montenegro, P.A.; Costa, P.A.; Calçada, R. Railway vehicle wheel flat detection with multiple records using spectral kurtosis analysis. Appl. Sci. 2021, 11, 4002. [Google Scholar] [CrossRef]
Kerschen, G.; Feeny, B.F.; Golinval, J.C. On the exploitation of chaos to build reduced-order models. Comput. Methods Appl. Mech. Eng. 2003, 192, 1785–1795. [Google Scholar] [CrossRef][Green Version]
Santos, J.P.; Crémona, C.; Calado, L.; Silveira, P.; Orcesi, A.D. On-line unsupervised detection of early damage. Struct. Control Health Monit. 2016, 23, 1047–1069. [Google Scholar] [CrossRef]
de Oliveira Dias Prudente dos Santos, J.P.; Crémona, C.; da Silveira, A.P.C.; de Oliveira Martins, L.C. Real-time damage detection based on pattern recognition. Struct. Concr. 2016, 17, 338–354. [Google Scholar] [CrossRef]
Worden, K.; Sohn, H.; Farrar, C.R. Novelty Detection in a Changing Environment: Regression and Interpolation Approaches. J. Sound Vib. 2002, 258, 741–761. [Google Scholar] [CrossRef]
Datteo, A.; Busca, G.; Quattromani, G.; Cigada, A. On the use of AR models for SHM: A global sensitivity and uncertainty analysis framework. Reliab. Eng. Syst. Saf. 2018, 170, 99–115. [Google Scholar] [CrossRef]
Nielsen, J.C.O.; Pieringer, A.; Thompson, D.J.; Torstensson, P.T. Wheel–Rail Impact Loads, Noise and Vibration: A Review of Excitation Mechanisms, Prediction Methods and Mitigation Measures. In Noise and Vibration Mitigation for Rail Transportation Systems; Springer International Publishing: Cham, Switzerland, 2021. [Google Scholar]
Zhang, M.; Cavallo, A.; Tomasini, G. Data-driven diagnosis of railway wheel flat based on multi-channel vibration data fusion. Measurement 2026, 264, 120191. [Google Scholar] [CrossRef]
Vladimir, V. Statistical Learning Theory; Wiley: New York, NY, USA, 1998; Volume 1, p. 2. [Google Scholar]
Harris, D.; Christopher, B.; Linda, K.; Alex, S.; Vladimir, V. Support vector regression machines. Adv. Neural Inf. Process. Syst. 1997, 28, 779–784. [Google Scholar]
Roshan, K.; Saurabh, S. Machine Learning: A Review on Binary Classification. Int. J. Comput. Appl. 2017, 160, 11–15. [Google Scholar] [CrossRef]
Montenegro, P.A.; Calçada, R. Wheel–rail contact model for railway vehicle–structure interaction applications: Development and validation. Railw. Eng. Sci. 2023, 31, 181–206. [Google Scholar] [CrossRef]
MATLAB^®, version R2018a; The MathWorks Inc.: Natick, MA, USA, 2018.
Hertz, H. Ueber die Berührung fester elastischer Körper. J. Für Die Reine Und Angew. Math. 1882, 1882, 156–171. [Google Scholar] [CrossRef]
Kalker, J.J. Book of Tables for the Herzian Creep-Force Law; Delft University of Technology, Faculty of Technical Mathematics and Informatics: Delft, The Netherlands, 1996. [Google Scholar]
ANSYS^®, release 19.2; Academic Research; ANSYS: Canonsburg, PA, USA, 2018.
EN 13674–1; European Standard. Railway Applications Railway Applications—Track-Rail-Part1, Final Draft. European Union: Brussels, Belgium, 2002.
Zhai, W.; Wang, K.; Cai, C. Fundamentals of vehicle–track coupled dynamics. Veh. Syst. Dyn. 2009, 47, 1349–1376. [Google Scholar] [CrossRef]
ERRI D 214/RP 5; Rail Bridges for Speeds > 200 km/h: NUMERICAL Investigation of the Effect of Track Irregularities at Bridge Resonance. European Rail Research Institute: Utrecht, The Netherlands, 1999.
UIC 774-3-R; Track/Bridge Interaction—Recommendations for Calculation, 2nd Edition. International Union of Railways (UIC): Paris, France, 2001.
Wu, Y.-S.; Yang, Y.-B. Steady-state response and riding comfort of trains moving over a series of simply supported bridges. Eng. Struct. 2003, 25, 251–265. [Google Scholar] [CrossRef]
ERRI D 202/RP 11; Improved Knowledge of Forces in CWR Track (Including Switches): Parametric Study and Sensivity Analysis of CWERRI. European Rail Research Institute: Utrecht, The Netherlands, 1999.
Auersch, L. Dynamic interaction of various beams with the underlying soil—Finite and infinite, half-space and Winkler models. Eur. J. Mech.—A/Solids 2008, 27, 933–958. [Google Scholar] [CrossRef]
Maglio, M.; Vernersson, T.; Nielsen, J.C.O.; Ekberg, A.; Kabo, E. Influence of railway wheel tread damage on wheel–rail impact loads and the durability of wheelsets. Railw. Eng. Sci. 2024, 32, 20–35. [Google Scholar] [CrossRef]
Alemi, A.; Corman, F.; Pang, Y.; Lodewijks, G. Evaluation of the influential parameters contributing to the reconstruction of railway wheel defect signals. Proc. Inst. Mech. Eng. Part F J. Rail Rapid Transit 2019, 234, 1005–1016. [Google Scholar] [CrossRef]
Alemi, A.; Corman, F.; Pang, Y.; Lodewijks, G. Reconstruction of an informative railway wheel defect signal from wheel–rail contact signals measured by multiple wayside sensors. Proc. Inst. Mech. Eng. Part F J. Rail Rapid Transit 2018, 233, 49–62. [Google Scholar] [CrossRef]
Mosleh, A.; Costa, P.A.; Calçada, R. A new strategy to estimate static loads for the dynamic weighing in motion of railway vehicles. Proc. Inst. Mech. Eng. Part F 2020, 234, 183–200. [Google Scholar] [CrossRef]
BS EN 13848-1; Railway Applications/Track-Track Geometry Quality. BS EN Standard: London, UK, 2003.

Figure 1. Illustration of the proposed methodology.

Figure 2. Simulated trackside monitoring system: (a) numerical modeling of train–track system; (b) virtual wayside monitoring system.

Figure 3. Schematic geometry of (a) a new wheel flat and (b) a wheel flat with rounded edges [59].

Figure 4. Vertical acceleration in the time domain for three different wheel conditions.

Figure 5. Damage detection through the unsupervised method for the vehicle speed of 120 km/h using sensor 3: (a) feature extraction; (b) feature normalization; (c) data fusion; (d) damage detection.

Figure 6. Damage detection through the unsupervised method for the vehicle speed of 200 km/h using sensor 3: (a) feature extraction; (b) feature normalization; (c) data fusion; (d) damage detection.

Figure 7. Unsupervised classification with 8 sensors: (a) feature fusion-accelerometer 3, speed of 120 km/h; (b) feature fusion, accelerometer 3, speed of 200 km/h; (c) sensor fusion, speed of 200 km/h.

Figure 8. Unsupervised classification with 8 sensors without baseline, considering vehicle speed at 200 km/h: (a) feature fusion, accelerometer 3; (b) sensor fusion.

Figure 9. Supervised classification with 8 sensors with baseline (feature fusion speed 120 and 200) for all functions: (a) function R; (b) function P; (c) function L, (d) function G).

Figure 10. Unsupervised classification with baseline using feature fusion for each side of the track separately for speed 120: (a) other side of the damage (1–4); (b) side of the damage (5–8).

Figure 11. Unsupervised classification without baseline using feature fusion for each side of the track separately for a speed of 200: (a) other side of the damage (1–4); (b) side of the damage (5–8).

Figure 12. Supervised classification with baseline using feature fusion for each side of the track separately for speeds of 120 and 200 through function R using 4 sensors: (a) other side of the damage, speed 120 km/h; (b) other side of the damage, speed of 200 km/h; (c) side of the damage, speed of 120 km/h; (d) side of the damage, speed of 200 km/h.

Figure 13. Supervised classification with baseline using feature fusion for each side of the track separately for speeds of 120 and 200 through function R using 1 sensor: (a) sensor 3, speed of 120 km/h; (b) sensor 3, speed of 200 km/h; (c) sensor 7, speed of 120 km/h; (d) sensor 7, speed of 200 km/h.

Figure 14. Supervised classification using feature extraction through the simplified framework using both sensors 3 and 7 (function L): (a) speeds of 120 km/h (b) and 200 km/h.

Figure 15. Supervised classification using feature extraction for sensors 3 and 7 separately: (a) sensor 3, speed of 120 km/h, function r; (b) sensor 3, speed of 200 km/h, function r; (c) sensor 7, speed of 120 km/h, function P; (d) sensor 7, speed of 200 km/h, function P.

Table 1. Indicative comparison with reported operational systems.

References	Accuracy	Sensor Number	Min Flat Length	Train Speed	Techniques/Methods
Belotti et al. 2006 [6]	Accurate detection within the investigated scenarios	4 accelerometers	15 mm	10–100 km/h	Detection and localization: power spectral density
Nowakowski et al. 2017 [26]	Qualitative successful separation using threshold/envelope indicators	4 accelerometers	66 mm	≤70 km/h	Detection: envelope spectrum analysis
Barman et al. 2020 [27]	Qualitative successful detection	1 accelerometer (ADXL335 MEMS)	-	32–65 km/h	Detection: WT and Wigner–Ville Transform
Mohammadi et al. 2023 [5]	Accurate detection within the investigated scenarios	8 accelerometers	50 mm	40–120 km/h baseline; 80 km/h damaged	Unsupervised detection: outlier analysis
Mosleh et al. 2023 [21]	Accurate detection within the investigated scenarios	19 measurement points (SG and accelerometer)	5 mm	20–60 m/s baseline; 60/20 m/s damaged	Unsupervised detection: outlier analysis
Mosleh et al. 2022 [4]	Accurate detection within the investigated scenarios	19 accelerometers	10 mm	20–60 m/s baseline; 60 m/s damaged	Unsupervised detection: outlier analysis Unsupervised classification: k-means
Ghiasi et al. 2024 [14]	classification (k-means stage): Accurate detection within the investigated scenarios with 12 sensors	12 accelerometers	10 mm	20–60 m/s (baseline); 60 m/s (damaged scenarios)	Unsupervised multistage clustering, including k-means and DBSCAN
Mohammadi et al. 2025 [23]	Accurate detection within the investigated scenarios	9 sensors total (8 accelerometers + 1 strain gauge)	10 mm	40–120 km/h baseline; 80 km/h damaged	Unsupervised detection: outlier analysis Unsupervised localization: HMM Unsupervised classification: autoencoder and k-means
Salehi et al. 2023 [28]	Repeatable MSSA/crest-factor defect signatures	5 accelerometers (piezoelectric)	25 mm	10/20/30/40 km/h	Detection: multi-channel singular spectrum analysis (MSSA)
Lourenço et al. 2023 [29]	Accurate detection within the investigated scenarios	24 accelerometers + strain gauges	25 mm	60–100 km/h baseline; 80 km/h damaged	Unsupervised diagnosis: Isolation Forest and HMM
Jorge et al. 2024 [22]	Accurate detection within the investigated scenarios	4 accelerometers	10 mm	40–120 km/h	Detection: outlier analysis diagnosis and classification: k-means
Gao et al. 2019 [30]	Estimation error = +/−0.05 mm)	6 displacement sensors (eddy-current)	32 mm	15 km/h	Detection: parallelogram mechanism (waveform analysis)
Zhou et al. 2020 [31]	Not reported (accurate detection-localization; matched offline inspection)	20 sensors	-	20 m/s	Detection and localization: data fusion
Filograno et al. 2011 [32]	Imperfection detection in traces	20 FBG strain sensors	-	200–300 km/h	Detection: wavelength variation analysis
Mishra et al. 2022 [33]	Detection via FFT/spectrogram and strain contrast)	12 FBG sensors (+4 trigger FBGs)	-	70 km/h study and 35–90 km/h field	Detection: FFT
Komorski et al. 2021 [34]	Diagnostic parameters/envelope spectrum separate defect scenarios	3 Microphones	66 mm	20–40 km/h	Detection: envelope spectrum analysis
Deilamsalehy et al. 2017 [35]	>30% Improvement (segmentation accuracy vs. thermal-only)	2 cameras (1 thermal + 1 visible)	-	-	Detection: Hough transform

Table 2. Mechanical characteristics of the track [36].

Parameter		Value
Rail	$A_{r} (m^{2})$	7.67 × 10⁻⁴	[52]
	$ρ_{r} (k g / m^{3})$	7850	[52]
	$I_{r} (m^{4})$	30.38 × 10⁻⁶	[52]
	$ν_{r}$	0.28	[52]
	$E_{r} (N / m^{3})$	210 × 10⁹	[52]
Rail pad, longitudinal	$K_{r} (N / m)$	20 × 10⁶	[53]
	$C_{p} (N \cdot s / m)$	50 × 10³	[53]
Rail pad, lateral	$K_{p} (N / m)$	20 × 10⁶	[53]
	$C_{p} (N \cdot s / m)$	50 × 10³	[53]
Rail pad, vertical	$K_{p} (N / m)$	500 × 10⁶	[54]
	$C_{p} (N \cdot s / m)$	200 × 10³	[54]
Sleeper	$ρ_{s} (k g / m^{3})$	2590
	$ν_{s}$	0.2
	$E_{s} (N / m^{3})$	40.9 × 10⁹
Ballast, longitudinal	$K_{b, x} (N / m)$	9000 × 10³	[55]
	$C_{b, x} (N \cdot s / m)$	15 × 10³	[56]
Ballast, lateral	$K_{b, y} (N / m)$	2250 × 10³	[57]
	$C_{b, y} (N \cdot s / m)$	15 × 10³	[56]
Ballast, vertical	$K_{b, z} (N / m)$	30 × 10⁶	[57]
	$C_{b, z} (N \cdot s / m)$	15 × 10³	[56]
Foundation, longitudinal	$K_{f, x} (N / m)$	20 × 10⁶	[58]
	$C_{f, x} (N \cdot s / m)$	5.01 × 10²	[58]
Foundation, lateral	$K_{f, y} (N / m)$	20 × 10⁶	[58]
	$C_{f, y} (N \cdot s / m)$	5.01 × 10²	[58]
Foundation, vertical	$K_{f, z} (N / m)$	20 × 10⁶	[58]
	$C_{f, z} (N \cdot s / m)$	5.01 × 10²	[58]

Table 3. Mechanical properties of the vehicle [36].

Parameter	Value
Car body mass, $m_{c b}$	35,640 kg
Car body roll moment of inertia, $I_{c b, x}$	55,120 kg·m²
Car body pitch moment of inertia, $I_{c b, y}$	1,475,000 kg·m²
Car body yaw moment of inertia, $I_{c b, z}$	1,477,000 kg·m²
Bogie mass, $m_{b}$	2829 kg
Bogie roll moment of inertia, $I_{b, x}$	2700 kg·m²
Bogie pitch moment of inertia, $I_{b, y}$	1931.4 kg·m²
Bogie yaw moment of inertia, $I_{b, z}$	3878.7 kg·m²
Wheelset mass, $m_{w}$	1711 kg
Wheelset roll moment of inertia, $I_{w, x}$	733.4303 kg·m²
Wheelset yaw moment of inertia, $I_{w, y}$	733.4303 kg·m²
Stiffness of the primary longitudinal suspension, $k_{1, x}$	34,981,000 N/m
Stiffness of the primary transversal suspension, $k_{1, y}$	30,948,200 N/m
Stiffness of the primary vertical suspension, $k_{1, z}$	1,652,820 N/m
Damping of the primary vertical suspension, $c_{1, z}$	16,739 N·s/m
Stiffness of the secondary longitudinal suspension, $k_{2, x}$	4,905,000 N/m
Stiffness of the secondary transversal suspension, $k_{2, y}$	2,500,000 N/m
Stiffness of the secondary vertical suspension, $k_{1, z}$	734,832 N/m
Damping of the secondary longitudinal suspension, $c_{2, x}$	400,000 N·s/m
Damping of the secondary transversal suspension, $c_{2, y}$	17,500 N·s/m
Damping of the secondary vertical suspension, $c_{2, z}$	35,000 N·s/m
The static load transmitted by each wheel	64,000 N
Longitudinal distance between bogies, $a_{1}$	19 m
Longitudinal distance between wheelsets, $a_{2}$	2.7 m
Transversal distance between vertical secondary suspensions, $b_{1}$	2.144 m
Transversal distance between longitudinal secondary suspensions, $b_{2}$	2.846 m
Transversal distance between primary suspensions, $b_{3}$	2.144 m
Vertical distance between car body center and secondary suspension, $h_{1}$	0.936 m
Vertical distance between bogie center and secondary suspension, $h_{2}$	0.142 m
Vertical distance between bogie center and wheelset center, $h_{3}$	0.065 m
Nominal rolling radius, $R_{0}$	0.43 m
Gauge, $s$	1.67 m

Table 4. Damage and undamaged scenarios.

		Baseline Scenario	Damaged Scenario
Train		Alfa pendular	Alfa pendular
Number of loading schemes		3	1 (full capacity)
Unevenness profiles		4	1
Speeds (km/h)		40–220 Interval: 20 km/h	120 and 200
Noise ratio		5%	5%
Flat lengths (mm)		-	10–25 mm (Damage 1) 28–50 mm (Damage 2)
Number of numerical analyses	Unsupervised approach	120	130
Number of numerical analyses	Supervised	50	100

Table 5. Unified summary of preprocessing and classification pipeline.

Stage	Method/Tools	Key Parameters	Output
Data acquisition	Wayside accelerometers	8 sensors, fs = 10 kHz	Raw time series
Filtering	low-pass Chebyshev Type II	fc = 1500 Hz	Filtered signals
Feature extraction	AR model	Order = 40	Feature matrix (250 × 40 × 8)
Normalization	PC	Remove first 2 PCs (>80% variance)	Normalized features
Data fusion	Mahalanobis Distance	Baseline mean and covariance	Damage Index
Detection	Confidence Boundary	1%	Healthy/Damaged wheels
Classification—unsupervised method	k-means	-	Severity clusters
Classification—supervised method	SVM	Kfold = 10	Severity labels

Table 6. Clustering performance for damage classification through the unsupervised approach under different sensor configurations and train speeds.

Method	Number of Sensors	Sensor Location	Train Speed (km/h)	Error	Accuracy (%)	Note
k-means (Unsupervised)	8 (4 per side)	Both sides	120	0	100	Works well with sufficient sensors
k-means (Unsupervised)	8 (4 per side)	Both sides	200	62	75.20	Accuracy drops due to noise
k-means (Unsupervised)	4 (one side only)	Opposite side of damage	120	0	100	Better than same-side placement
k-means (Unsupervised)	4 (one side only)	Same side as damage	120	3	98.80	Performance degrades significantly
k-means (Unsupervised)	4 (one side only)	Opposite side of damage	200	1	99.23	Highly affected by train speed
k-means (Unsupervised)	4 (one side only)	Same side as damage	200	63	51.53	Poor classification performance

Table 7. Clustering performance for damage classification through the supervised approach under different sensor configurations and train speeds.

Method	Number of Sensors	Sensor Location	Train Speed (km/h)	Error	Accuracy (%)	F1 Score (%)	Recall (%)	Precision (%)	Note
SVM (Supervised)	8 (4 per side)	Both sides	120	0	100	100	100	100	Robust classification
SVM (Supervised)	8 (4 per side)	Both sides	200	1	99.30	99.33	99.34	99.33	Performs well even at high speed
SVM (Supervised)	4 (one side only)	Opposite side of damage	120	0	100	100	100	100	Still effective with fewer sensors
SVM (Supervised)	4 (one side only)	Same side as damage	120	0	100	100	100	100	Little impact of sensor placement
SVM (Supervised)	4 (one side only)	Opposite side of damage	200	1	99.30	99.33	99.34	99.33	Slight accuracy drop
SVM (Supervised)	4 (one side only)	Same side as damage	200	1	99.30	99.33	99.34	99.33	Still better than k-means
SVM (Supervised)	1 (single sensor)	Opposite side of damage	120	0	100	100	100	100	Reliable with minimal setup
SVM (Supervised)	1 (single sensor)	Same side as damage	120	4	97.30	97.35	97.38	97.33	Slight sensitivity to placement
SVM (Supervised)	1 (single sensor)	Opposite side of damage	200	6	96	95.99	95.98	96	Accuracy is affected by speed
SVM (Supervised)	1 (single sensor)	Same side as damage	200	12	92	92.18	92.37	92	Less effective at high-speed

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chegini, A.; Mohammadi, M.; Mosleh, A.; Vale, C.; Ghiasi, R.; Silva, R.; Guedes, A.; Meixedo, A.; Malekjafarian, A. Support Vector Machine and k-Means Clustering for Advanced Wheel Flat Identification: A Comparison of Supervised and Unsupervised Methods. Machines 2026, 14, 286. https://doi.org/10.3390/machines14030286

AMA Style

Chegini A, Mohammadi M, Mosleh A, Vale C, Ghiasi R, Silva R, Guedes A, Meixedo A, Malekjafarian A. Support Vector Machine and k-Means Clustering for Advanced Wheel Flat Identification: A Comparison of Supervised and Unsupervised Methods. Machines. 2026; 14(3):286. https://doi.org/10.3390/machines14030286

Chicago/Turabian Style

Chegini, Alireza, Mohammadreza Mohammadi, Araliya Mosleh, Cecilia Vale, Ramin Ghiasi, Ruben Silva, Antonio Guedes, Andreia Meixedo, and Abdollah Malekjafarian. 2026. "Support Vector Machine and k-Means Clustering for Advanced Wheel Flat Identification: A Comparison of Supervised and Unsupervised Methods" Machines 14, no. 3: 286. https://doi.org/10.3390/machines14030286

APA Style

Chegini, A., Mohammadi, M., Mosleh, A., Vale, C., Ghiasi, R., Silva, R., Guedes, A., Meixedo, A., & Malekjafarian, A. (2026). Support Vector Machine and k-Means Clustering for Advanced Wheel Flat Identification: A Comparison of Supervised and Unsupervised Methods. Machines, 14(3), 286. https://doi.org/10.3390/machines14030286

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Support Vector Machine and k-Means Clustering for Advanced Wheel Flat Identification: A Comparison of Supervised and Unsupervised Methods

Abstract

1. Introduction

2. Application of Machine Learning Algorithms for Identifying Railway Wheel Defects

2.1. Overview of the Methodology

2.2. Unsupervised Classification Technique Using k-Means

2.3. Supervised Classification Technique Using SVM

3. Numerical Modeling

4. Simulation Scenarios

5. Time-Series Representation for Defect Detection

6. Wheel Flat Identification Application: Results and Discussion

6.1. Damage Detection

6.2. Classification Approach Using k-Means

6.3. Classification Approach Using SVM

6.4. Comparison of Unsupervised and Supervised Methods: Influence of Sensor Position on Wheel Flat Identification Accuracy

6.5. Simplified Preprocessing Framework for SVM-Based Classification

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI