A Novel Statistical Method for Spectral Analysis of A Short-Duration Signal and Its Application to Current Data for Stator Fault Diagnosis

Hebda-Sobkowicz, Justyna; Michalak, Anna; Wodecki, Jacek; Zimroz, Radosław; Wolkiewicz, Marcin; Szabat, Krzysztof

doi:10.3390/en19051351

Open AccessArticle

A Novel Statistical Method for Spectral Analysis of A Short-Duration Signal and Its Application to Current Data for Stator Fault Diagnosis

by

Justyna Hebda-Sobkowicz

¹

,

Anna Michalak

¹

,

Jacek Wodecki

¹

,

Radosław Zimroz

¹

,

Marcin Wolkiewicz

²

and

Krzysztof Szabat

^2,*

¹

Faculty of Geoengineering, Mining and Geology, Wroclaw University of Science and Technology, Na Grobli 15, 50-421 Wroclaw, Poland

²

Faculty of Electrical Engineering, Wroclaw University of Science and Technology, Wybrzeże Wyspiańskiego 27, 50-370 Wroclaw, Poland

^*

Author to whom correspondence should be addressed.

Energies 2026, 19(5), 1351; https://doi.org/10.3390/en19051351

Submission received: 7 January 2026 / Revised: 28 January 2026 / Accepted: 19 February 2026 / Published: 6 March 2026

(This article belongs to the Special Issue Electric Machinery, Transformers, and Modern Drives—4th Edition)

Download

Browse Figures

Versions Notes

Abstract

In this paper, a novel approach for fault detection in the stator windings of induction motors is presented. The procedure is based on spectral analysis of the current signal. However, due to the specific target application, short duration signals (0.2 s) are utilized, which results in poor spectral resolution. To address this issue, a statistical methodology is developed to minimize uncertainty in decision-making. To construct a health indicator (HI), a statistical analysis is performed to identify spectral components that are both informative and robust. For the selected fault-related frequencies, the HI was created. Using confidence intervals and statistical testing, a fault detection scheme was proposed. The method was validated on an experimental dataset, including both healthy and faulty conditions. The method has been tested on current signals with five levels of fault severity and seven load conditions. Experimental studies on a dedicated test rig demonstrated the high efficiency of the proposed approach for such specific constraints.

Keywords:

induction electric motors; fault detection; spectral analysis; current signal; statistical testing; short signal; varying load

1. Introduction

Induction motors are widely used in industrial applications due to their robustness, efficiency, and cost-effectiveness. However, like all rotating machinery, they are subject to various electrical and mechanical faults that can compromise reliability and shorten service life. Among these, stator winding faults—particularly inter-turn short circuits (ITSCs)—are considered especially critical because they can quickly evolve into severe failures if not detected at an early stage. Therefore, accurate and timely fault diagnosis is of great importance for predictive maintenance strategies and Industry 4.0-oriented smart monitoring systems. The adoption of condition-based monitoring and predictive maintenance strategies has become a cornerstone of Industry 4.0-oriented manufacturing systems [1,2]. Modern industrial facilities increasingly rely on intelligent monitoring systems that integrate Internet of Things (IoT) sensors and advanced analytics to enable real-time fault detection and proactive maintenance scheduling [3,4]. These data-driven approaches not only reduce unplanned downtime but also optimize maintenance costs and extend equipment service life, making reliable fault detection algorithms essential for competitive industrial operations. Early and reliable ITSC detection has long been a priority, with foundational studies analyzing turn-to-turn short circuits in operating motors and during switch-off transients directly in the frequency domain [5,6].

Traditionally, motor current signature analysis (MCSA) has served as the cornerstone of fault diagnosis in induction machines, relying on the identification of characteristic frequency components in the stator current spectrum [7,8]. Over the years, various signal processing enhancements have been introduced to improve the sensitivity of MCSA, including empirical mode decomposition (EMD), wavelet transform, and principal component analysis (PCA) [9]. These methods have proven effective for rotor bar faults, eccentricities, and bearing anomalies. For stator faults, spectral strategies are particularly effective: parametric spectral estimation sharpens fault-related lines under short records and varying loads [10]. Advanced parametric spectral estimation methods have demonstrated superior frequency resolution compared to non-parametric approaches when signal duration is restricted [10]. These high-resolution techniques address the fundamental trade-off between time and frequency resolution inherent in short-duration signal analysis. Rotor slot harmonics (RSH) have emerged as particularly sensitive fault indicators for stator winding defects, with the matrix pencil method specifically designed to extract RSH components with high precision [11]. Further contributions underline frequency-domain diagnostics under short records and inverter-fed conditions, including harmonic/sideband modeling and high-resolution estimation [12,13].

Short-Time Fourier Transform (STFT) has been widely used to generate spectrograms that reveal transient fault signatures requiring time–frequency analyses [14,15]. When combined with deep learning, STFT-based approaches have achieved high accuracy in fault classification, with some studies reporting accuracies above

97 %

in rotor bar fault detection [16]. In [17], authors show that time–frequency trajectory tracking methods can support interpretation under variable speeds. More advanced spectral estimation methods, such as minimum-norm time–frequency analysis, have demonstrated improved resolution for multiple-fault scenarios under variable operating conditions [18]. Unlike STFT, these subspace-based approaches can achieve frequency resolution that is not strictly constrained by signal duration, making them particularly suitable for the analysis of brief measurement windows encountered in rapid industrial inspection scenarios. In parallel, statistics such as spectral kurtosis have proven effective in detecting impulsive or transient fault patterns [16]. Related cyclostationary analysis can assist in the demodulation and reinforcement of sidebands in the spectrum [19,20]. Cyclostationary signal analysis offers a powerful framework for extracting periodic modulation patterns characteristic of rotating machinery faults. Vibration and current signals from faulty motors exhibit cyclostationary behavior due to the periodic interaction between stationary and rotating components [21]. Spectral correlation and cyclic coherence maps can reveal fault-related modulation sidebands even when they are masked by noise or other cyclic components in conventional power spectra. Improved cyclostationary methods combining Teager–Kaiser energy operator (TKEO) demodulation with fast spectral correlation have successfully diagnosed broken rotor bar and bearing faults under various operating conditions [22,23]. Recent surveys synthesize early-detection strategies and data-driven pipelines for induction machines, emphasizing the practical role of spectral indicators and sidebands [16,24].

Recent years have witnessed remarkable progress in the integration of artificial intelligence (AI) with motor fault diagnosis. Deep learning models leveraging time–frequency images or raw signal data have outperformed traditional approaches in terms of accuracy and generalization. For instance, convolutional neural networks (CNNs) trained on STFT-based spectral representations, as well as lightweight models such as ShuffleNetV2, have achieved nearly

99 %

accuracy while remaining computationally efficient [16]. Ensemble methods, such as the weighted probability ensemble deep learning (WPEDL) framework, combine features from both current and vibration signals and achieve classification rates exceeding

99 %

across multiple fault types [16]. Additionally, graph neural networks (GNNs) have been introduced for the direct analysis of raw current and vibration signals, showing promising results in capturing complex dependencies between sensor modalities [16]. Unlike traditional CNNs that process signals as independent samples, GNNs construct graph structures that explicitly model dependencies between data points, potentially improving generalization to unseen operating conditions. In parallel, recent works report AR/Prony-based spectral MCSA and ITSC-oriented indicators under inverter-fed or variable-load conditions [25]. However, deep learning approaches often require extensive labeled datasets and lack the interpretability necessary for safety-critical industrial applications. The black-box nature of neural networks makes it difficult to validate their decisions based on a physical understanding of fault mechanisms.

With respect to stator winding faults, several recent contributions have specifically addressed ITSC detection [26]. Other works have developed indicators, such as the complex current unbalance coefficient (CCUC), to discriminate between ITSC and voltage unbalance, achieving high robustness under realistic operating conditions (see related negative-sequence compensation in [27]). Inter-turn short circuits create an imbalance in the three-phase stator windings, generating negative-sequence components that are nominally absent in healthy motors under balanced supply conditions [28,29]. Negative-sequence current analysis can distinguish ITSC-induced asymmetry from supply voltage unbalance, achieving high robustness under realistic operating conditions [29]. Space-vector trajectory analysis during start-up transients can reveal ITSC signatures that are obscured during steady-state operation [30]. Envelope energy methods applied to start-up current signals have achieved high accuracy in early ITSC fault detection, with reported detection rates of 96.9% using machine learning classification [31]. The amplification of fault signatures during transient operation, when currents reach several times their rated values, makes start-up analysis particularly attractive for detecting incipient faults. The fusion of multi-sensor data and advanced deep learning architectures has further improved early-stage fault detection capabilities [32]. While these methods represent important advances, they often rely on advanced feature engineering, extensive training datasets, or access to multiple sensing modalities, which may restrict their adoption in industrial environments. By contrast, spectral techniques that target physics-informed sidebands—such as stator-fault components and rotor-slot harmonics—offer explainability and low sensor overhead [5,6,11].

Despite these technological advances, important limitations persist: application to short-duration signals is often hindered by the fundamental trade-off between time and frequency resolution, and most of the aforementioned methods lack a rigorous statistical framework for decision-making. Specifically, the significance of characteristic fault frequencies is rarely tested using statistical inference, such as confidence intervals, hypothesis testing, or test power analysis. This gap becomes particularly evident when analyzing short-duration signals (e.g., 0.2 s, 10,000 samples), where poor spectral resolution introduces high uncertainty. Although some research has proposed statistic-based spectral indicators for bearing faults [33,34], systematic statistical methodologies for short-time current signals remain scarce. Recent contributions focused on start-up features, space-vector indicators, and sequence compensation highlight the domain need but typically do not attach formal confidence measures [27,30,31]. Furthermore, deep learning approaches typically do not provide probabilistic outputs or uncertainty quantification, limiting their trustworthiness in safety-critical applications [35]. This fundamental limitation undermines the development of objective detection thresholds that balance sensitivity against false alarm rates, which is essential in industrial contexts where maintenance decisions carry significant economic consequences and regulatory compliance may require statistically justified diagnostic criteria.

The present work introduces a novel statistical approach for the spectral analysis of short-duration signals and applies it to stator fault diagnosis. The proposed approach employs confidence intervals and hypothesis testing [36,37,38] to statistically evaluate informative spectral components, enabling the construction of a health indicator (HI) that captures significant amplitude variations associated with inter-turn short circuits. Unlike purely data-driven techniques, the method provides interpretable results with quantified uncertainty, thus enhancing its trustworthiness. The contribution is further validated through experimental studies on a dedicated test rig, encompassing five levels of fault severity and seven load conditions. It is tested on previously unseen data to demonstrate robustness. The novelty of the study is a statistically grounded framework for stator winding fault detection based on a spectral health indicator (HI) constructed from very short signal records (e.g.,

0.2

s). The selection of a 0.2 s time window is motivated by the requirement to monitor machines operating under dynamic load conditions. In such applications, a longer acquisition time (e.g., 1 s) would violate the stationarity assumption due to supply frequency fluctuations, leading to spectral smearing. Furthermore, the short window aligns with the capabilities of low-cost embedded sensors designed for rapid condition assessment during short duty cycles. The proposed approach operates robustly across multiple fault severities (including a single shorted turn) and a wide range of load conditions. By explicitly modeling the sampling distribution of spectral amplitude estimates, the method forms confidence bounds and hypothesis tests at physics-informed sidebands, enabling reliable decisions even when frequency resolution is coarse and spectral estimates are biased. Experimental validation on previously unseen data indicates a high probability of detection at controlled false-alarm rates. In summary, the contribution advances spectral and statistical diagnostics of ITSC and strengthens decision-making by providing a statistically validated HI tailored to very short records.

The remainder of this paper is organized as follows. Section 2 presents the proposed methodology, including the diagnostic framework and data processing strategy. Key implementation details and statistical assumptions for the spectral analysis are provided to facilitate reproducibility and industrial transfer. Section 3 describes the experimental setup and the measurement procedure used to collect current signals at different fault stages under various operating conditions. Section 4 summarizes and discusses the aggregated diagnostic results obtained for all fault types and load levels. Finally, Section 5 provides the main conclusions and outlines directions for future research.

2. Methodology

The proposed methodology aims to diagnose inter-turn short circuits in stator windings based on the statistical analysis of amplitudes at characteristic fault frequencies (FFs) extracted from short-duration current signals. The approach combines classical spectral analysis with statistical hypothesis testing to obtain a scalar health indicator, i.e., HI value, which is subsequently used for decision-making.

The proposed diagnostic approach is organized into three main stages: dataset construction, feature-based health modeling, and validation.

1.

Dataset construction. Raw three-phase stator current signals—denoted as IsA, IsB, and IsC—were acquired under both healthy and faulty operating conditions, comprising five fault severity levels and seven load conditions. Each signal was segmented into short, fixed-length intervals to standardize the input size and augment the number of samples for analysis. The resulting segments were labeled and divided into separate datasets for subsequent processing, ensuring that identical signals did not appear in both sets. The overall dataset construction procedure is described in detail in Section 2.3.

2.

Feature-based health modeling. To construct the health indicator, it is first necessary to identify the fault-related informative frequencies. Therefore, statistical testing is employed to determine which frequencies are statistically significant. Subsequently, the health indicator and its confidence interval are computed, both of which are essential for decision-making regarding fault diagnosis. This step consists of two main stages:

(a): Feature selection based on statistical testing. Spectral features are extracted from the current signals to characterize both healthy and faulty operating conditions. Statistical analysis is then applied to identify the most informative features for distinguishing between these states (see Section 2.4.1).
(b): Health indicator and confidence interval construction. Using the healthy reference data and the selected informative features, the health indicator and its confidence interval are established, providing a quantitative reference for subsequent fault detection (see Section 2.4.2).

The step-by-step procedure of the feature-based health modeling is described in detail in Section 2.4.

3.: Validation procedure. During validation, new unseen three-phase stator current signals—representing both healthy and faulty conditions with five fault-severity levels at various load settings—are employed. For each tested signal, the computed health indicator is compared against the reference confidence range. If the indicator falls within this range, the signal is classified as healthy; otherwise, it is identified as faulty. Finally, the overall diagnostic performance of the proposed method is evaluated. The step-by-step statistical testing procedure is described in detail in Section 2.5.

2.1. Spectral Representation of the Signal

Let

x (t)

,

t = 0, \dots, T - 1

, be a current signal of length T. The first stage of the method involves the computation of the signal spectrum. A one-sided periodogram (non-parametric power spectral density, PSD) is used with a Hanning window

w (\cdot)

, and the number of FFT points is set to

N = 2^{⌊ {log}_{2} T ⌋}

. Zero padding is applied to improve frequency resolution.

The one-sided periodogram of

x (t)

is defined as [39]:

{\hat{P}}_{x} (f) = \frac{Δ t}{N} | \sum_{n = 0}^{N - 1} w (n) x (n) e^{- i 2 π f Δ t n} |^{2}, f \in [0, \frac{f s}{2} - 1],

(1)

with the frequency

f (k) = \frac{k f s}{N}

for

k = 0, \dots, \frac{N}{2}

, and the

Δ f = \frac{f s}{N}

and

Δ t = t_{2} - t_{1}

. The one-sided amplitude spectral density is defined as

{\hat{SD}}_{x} (f) = \sqrt{\frac{2 P_{x} {∥ w ∥}^{2}}{{(\sum_{n = 0}^{N} w (n))}^{2}}},

(2)

where

∥ w ∥

is the Euclidean norm of vector w.

2.2. Theoretical Foundations of Stator Winding Fault Detection Based on Current Spectrum Analysis

The primary cause of characteristic harmonics in the stator current during an inter-turn short circuit (ITSC) is the disturbance of the magnetic circuit’s symmetry. In a healthy state, the stator windings produce a balanced magnetomotive force (MMF), resulting in an approximately sinusoidal flux distribution in the air gap. The occurrence of an ITSC reduces the effective number of turns in the faulty phase, leading to a significant short-circuit current that generates a locally opposing magnetic field. This negative MMF acts against the main MMF of the stator winding, causing its asymmetry, which weakens and distorts the total MMF. This disturbance creates an uneven flux distribution in the air gap, and this disturbed magnetic field subsequently interacts with the rotor. This interaction modulates the stator current with frequencies intrinsically linked to the rotor’s construction, particularly the rotor slot harmonics (RSH). The severity of the fault is directly correlated with the amplitude of these induced harmonic components; a larger fault, characterized by a higher short-circuit current and greater MMF distortion, results in a more pronounced increase in their magnitudes in the current spectrum.

As a result of the FFT transformation of the stator current signal, an ITSC fault manifests as an increase in the amplitudes of several characteristic frequency groups of components. The first group comprises the basic fault harmonics in the low-frequency range, which are a direct consequence of the MMF disturbance and depend on the rotor speed or slip. These components are calculated using the following formula:

f_{s h}^{(1)} (k, m) = f_{s} (k \frac{(1 - s)}{p_{p}} + m),

(3)

where

f_{s}

—supply voltage frequency,

p_{p}

—number of pairs of poles, s—slip,

k = 1, 2, 3, \dots

,

m \in Z_{odd}

, where m belongs to the odd integer values.

The second, and particularly sensitive, group consists of the rotor slot harmonics (RSH) in the medium-frequency range, which are related to the physical construction of the rotor. The amplification of RSH during ITSC occurs because the stator asymmetry created by the shorted turns distorts the air-gap magnetomotive force (MMF), causing uneven flux distribution. This disturbed field interacts with the rotor’s physical structure (rotor slot spacing), thereby modulating the stator current with frequencies intrinsically linked to rotor geometry. Unlike low-frequency fault harmonics, RSH are less affected by slip variations at varying loads, making them more robust indicators. While these harmonics exist under normal conditions with small amplitudes, a stator fault significantly amplifies them. The general formula for these components is

f_{s h}^{(2)} (k, m) = f_{s} (k N_{r} \frac{(1 - s)}{p_{p}} + m),

(4)

where

N_{r}

—number of rotor bars.

The most diagnostically valuable is typically the Lower Rotor Slot Harmonic (LRSH) given by

f_{s h}^{(2)} (1, 1) = f_{s} (N_{r} \frac{(1 - s)}{p_{p}} - 1) .

(5)

The third group includes supply harmonics, calculated as

s f_{n} = n f_{s},

(6)

which are multiples of the fundamental frequency; however, these are less specific indicators compared to the first two groups.

It is essential to consider practical aspects, such as load dependence, as the frequency position of

f_{s h}

components varies with slip. Under very light loads, they may approach supply-related harmonics, complicating detection. Furthermore, low-amplitude fault harmonics in the early stages can be masked by noise in conventional FFT analysis. For a measured rotor speed of n, the slip is determined as

s = \frac{n_{s} - n}{n_{s}} .

(7)

The synchronous speed

n_{s}

can be calculated using the following formula:

n_{s} = \frac{60 \cdot f_{s}}{p_{p}},

(8)

where

p_{p}

denotes the number of pole pairs.

In the presented approach, seven fault-related frequencies, denoted as

F F_{l} \in FF

, are investigated. They consist of the third supply harmonic, i.e.,

s f_{3}

, and the rotor slot harmonics for

k = 1

and

m \in {- 5, - 3, - 1, 1, 3, 5}

. The resulting vector of potential fault frequencies

FF

considered in this study is defined as follows:

FF : = [F F_{1}, F F_{2}, \dots, F F_{7}] = [s f_{3}, f_{s h}^{(2)} (1, - 5), f_{s h}^{(2)} (1, - 3), f_{s h}^{(2)} (1, - 1), f_{s h}^{(2)} (1, 1), f_{s h}^{(2)} (1, 3), f_{s h}^{(2)} (1, 5)] .

(9)

Each empirical frequency estimate

{\hat{F F}}_{l}

is obtained as

{\hat{F F}}_{l} = arg max_{f \in [F F_{l} - 2 Δ f, F F_{l} + 2 Δ f]} {\hat{SD}}_{x} (f),

(10)

where

{\hat{SD}}_{x} (f)

denotes the estimated spectral density of the signal.

The selection of these components was determined experimentally by comparing healthy and faulty operating conditions. The chosen spectral components exhibit the largest differences in spectral amplitudes, making them the most informative for fault characterization.

The graphical illustration of

\hat{F F_{l}}

detection for different fault stages, i.e., healthy signal (upper panel), fault 1 (middle panel), and fault 5 (bottom panel), is presented in Figure 1. The

{\hat{F F}}_{l}

is marked with a black dashed line, with the region of local maximum exploration, i.e.,

[F F_{l} - 2 Δ f, F F_{l} + 2 Δ f]

marked with a shaded gray area.

The exemplary theoretical values of

F F_{l}

used in this study (for load 6) for the analyzed motor, with the number of rotor slots equal to

N_{r} = 26

, a measured rotor speed of

n = 1407 rpm

, and a current supply of

f_{s} = 50

Hz are listed in Table 1.

Using the formula for RSH, see Equation (4), and for the formula for the slip and synchronous speed, see Equations (7) and (8); one can note that

1 - s = 1 - \frac{n_{s} - n}{n_{s}} = \frac{n}{n_{s}} .

(11)

Substituting into the fault frequency model from Equation (4) gives

f_{s h}^{(2)} (k, m) = f_{s} (\frac{k N_{r}}{P_{p}} \frac{n}{n_{s}} + m) .

(12)

Using

n_{s} = \frac{60 f_{s}}{P_{p}}

(hence

\frac{1}{n_{s}} = \frac{P_{p}}{60 f_{s}}

), the expression simplifies to a linear form

f_{s h}^{(2)} (k, m) = \frac{k N_{r}}{60} n + f_{s} m .

(13)

This linear dependence yields a direct sensitivity of the predicted fault frequency to rotor-speed estimation errors. For the parameters

N_{r} = 26

,

f_{s} = 50 Hz

,

k = 1

, and

m = 1

, the expression becomes

f_{s h}^{(2)} (1, 1) = \frac{26}{60} n + 50 .

(14)

A rotor-speed estimation error

δ n

produces a frequency prediction error

δ f \approx \frac{d f_{s h}^{(2)} (1, 1)}{d n} δ n = \frac{26}{60} δ n .

(15)

To ensure that the true spectral peak remains inside the search window of

\pm 2 Δ f

, the condition

|δ f| \leq 2 Δ f

(16)

must hold, which yields the admissible speed error bound

|δ n| \leq \frac{2 Δ f}{26 / 60} \approx \frac{2 Δ f}{0.4333} rpm .

(17)

Summarizing, for the

Δ f \approx 6 Hz

, small speed/load variations within

|δ n| \leq \frac{2 \cdot 6}{0.4333} \approx 27 rpm

will not affect the effectiveness of the proposed method, as the spectral peak of empirical frequency estimate

{\hat{F F}}_{l}

in Equation (10) will be properly assigned.

2.3. Dataset Construction

Both healthy and faulty three-phase stator current signals were used to build the database for analysis, including different load levels. Each raw signal was divided into fixed-length windows (in the presented analysis,

0.2

s) to ensure that uniform data frames were generated from the input data. Based on this segmentation, the obtained signals were organized into datasets. Three datasets were prepared from healthy signals, i.e.,

A_{0}

,

A_{1}

, and

A_{2}

, and two from faulty ones, i.e.,

B_{1}

and

B_{2}

, with all datasets containing an equal number of segments. The desired feature of the datasets is their independence (the datasets come from the same machine but from different experiments run at different times). Spectra are computed using fixed parameters and reveal stable, i.e., spectral estimates and their confidence intervals do not vary systematically across adjacent windows. The resulting datasets are summarized in Table 2.

This division ensures a strict separation between datasets of equal size used for different purposes, thereby facilitating a reliable evaluation of diagnostic performance. In the present analysis, each subset contains

M = 100

segments of

0.2

s. The algorithm of the dataset construction (for each current phase, fault type, and load) is presented in Algorithm 1.

Algorithm 1: Dataset construction (for each current phase, fault type, and load)

Data: Current-signal datasets: healthy

A

and faulty

B

Result: Datasets

A_{0}, A_{1}, A_{2} \in A

and

B_{1}, B_{2} \in B

of healthy and faulty data

Dataset construction:

1.

Collect raw current signals under healthy (

A

) and faulty (

B

) conditions.

2.

Divide each raw signal into fixed-length windows to ensure uniform input size.

3.

Label the obtained segments and organize them into disjoint datasets.

4.

Select M segments for each subset (

A_{0}

,

A_{1}

,

B_{1}

,

A_{2}

,

B_{2}

).

Dataset for $\hat{F F C I}$ and $\hat{H I C I}$ construction: $x_{i}^{*} \in A_{0}$ (reference healthy),
Datasets for feature selection: $x_{i} \in A_{1}$ (healthy), $y_{i} \in B_{1}$ (faulty),
Datasets used for validation procedure: $s_{i} \in A_{2}$ (healthy), $u_{i} \in B_{2}$ (faulty).

5.

Save the prepared datasets with corresponding labels and segment indices for further diagnostic analysis.

In the proposed methodology, the construction of the health indicator involves an internal statistical testing procedure, namely a feature selection approach, which is independent of the final validation stage of the entire diagnostic framework. This separation is required since the algorithm for selecting informative fault frequencies (

F F_{l}

) relies on an internally defined statistical test, which is explicitly described in the next section.

2.4. Feature-Based Health Modeling

The feature-based health modeling stage employs three datasets: two healthy datasets,

A_{0}

and

A_{1}

, and one faulty dataset,

B_{1}

. The signals

x_{i}^{*} \in A_{0}

are used to establish the confidence intervals

{\hat{F F C I}}_{l}

for the spectral amplitudes at each characteristic frequency

{\hat{F F}}_{l}

, as well as the confidence interval for the health indicator

\hat{H I C I}

. The signals

x_{i} \in A_{1}

and

y_{i} \in B_{1}

are then used to estimate the empirical size (Type I error) and empirical power of the statistical test defined below, enabling the selection of informative fault frequencies. The two main stages include:

the selection of informative ${\hat{F F}}_{l}$ features, described in detail in Section 2.4.1;
the construction of the confidence interval for the decision-maker health indicator discussed in Section 2.4.2.

2.4.1. Feature Selection Based on Statistical Testing

For each fault frequency

F F_{l}

from healthy signals

x_{i}^{*} \in A_{0}

, where

l = 1, 2, \dots, 7

, and

i = 1, \dots, M

, a confidence interval

{\hat{F F C I}}_{l}

is estimated as follows:

{\hat{F F C I}}_{l} = [q_{α / 2} ({\hat{SD}}_{x_{1}^{*}} ({\hat{F F}}_{l}), \dots, {\hat{SD}}_{x_{M}^{*}} ({\hat{F F}}_{l})), q_{1 - α / 2} ({\hat{SD}}_{x_{1}^{*}} ({\hat{F F}}_{l}), \dots, {\hat{SD}}_{x_{M}^{*}} ({\hat{F F}}_{l})], x_{i}^{*} \in A_{0},

(18)

where

q_{α / 2} (\cdot)

is the empirical quantile of order

α / 2

with

α = 0.01

. The confidence intervals

{\hat{F F C I}}_{l}

are used to assess which of the

\hat{F F}

are informative. For this purpose, a statistical test was defined that involved the verification of two criteria. For a signal z, the statistic

T_{z}

and the test

φ_{z}

are defined as follows:

T_{z} (l) : = {\hat{SD}}_{z} ({\hat{F F}}_{l}), φ_{z} (l) = \{\begin{matrix} 1, & T_{z} (l) \notin {\hat{F F C I}}_{l}, \\ 0, & T_{z} (l) \in {\hat{F F C I}}_{l}, \end{matrix}

(19)

with the following testing hypotheses:

H_{0, l} : T_{z} (l) \in {\hat{F F C I}}_{l} (healthy), H_{1, l} : T_{z} (l) \notin {\hat{F F C I}}_{l} (faulty) .

(20)

For healthy signals

x_{i} \in A_{1}

and faulty signals

y_{i} \in B_{1}

, where

i = 1, \dots, M

the empirical size and power of the test [37,38] denoted

φ_{z}

at

F F_{l}

respectively are defined as follows:

Criteria 1 : {\hat{κ}}_{φ_{x}} (l) = \frac{1}{M} \sum_{i = 1}^{M} φ_{x_{i}} (l), Criteria 2 : {\hat{π}}_{φ_{y}} (l) = \frac{1}{M} \sum_{i = 1}^{M} φ_{y_{i}} (l), l = 1, \dots, 7 .

(21)

The selected informative set of fault frequency indexes is defined as follows:

L_{F F} : = \{l \in {1, \dots, 7} : {\hat{κ}}_{φ_{x}} (l) \leq 0.01 \land {\hat{π}}_{φ_{y}} (l) \geq 0.99\} .

(22)

This means that the informative frequencies are those for which, under healthy conditions, the corresponding amplitudes fall within the confidence interval

{\hat{F F C I}}_{l}

with a Type I error probability not exceeding

1 %

, and under faulty conditions, they fall outside this interval with a confidence level of

99 %

.

2.4.2. Health Indicator and Confidence Interval Construction

For a given

L_{F F}

, the spectrum-based health indicator, HI, for signal z is defined as the sum of spectrum amplitudes at the informative

{\hat{F F}}_{l}

, i.e.:

{\hat{H I}}_{z} = \sum_{l \in L_{F F}} {\hat{SD}}_{z} (F F_{l}) .

(23)

For healthy signals

x_{i}^{*} \in A_{0}

,

i = 1, \dots, M

, a confidence interval of the indicator

\hat{H I}

is estimated as follows:

\hat{H I C I} = [q_{α / 2} ({\hat{H I}}_{x_{1}^{*}}, \dots, {\hat{H I}}_{x_{M}^{*}}), q_{1 - α / 2} ({\hat{H I}}_{x_{1}^{*}}, \dots, {\hat{H I}}_{x_{M}^{*}})], x_{i}^{*} \in A_{0}, α = 0.01 .

(24)

The

\hat{H I C I}

serves as a quantitative reference for subsequent fault detection and decision-making. It reflects the deviation of the current signal from the healthy reference state, enabling the assessment of the machine’s operational health and facilitating the identification of emerging faults with statistical confidence.

The algorithm for feature-based health modeling is summarized in Algorithm 2.

Algorithm 2: Construction of the health indicator confidence interval (for each current phase, fault type, and load)

Data: Current-signal datasets: healthy

A_{0}, A_{1}

and faulty

B_{1}

Result: The confidence interval

\hat{H I C I}

of healthy dataset

Feature-based health modeling:

1.: Import signals from $A_{0}$ , $A_{1}$ , and $B_{1}$ .
Feature selection
2.: For each $x_{i}^{*} \in A_{0}$ , $x_{i} \in A_{1}$ , and $y_{i} \in B_{1}$ , compute the spectrum and estimate ${\hat{F F}}_{l}$ using Equation (10).
3.: Determine confidence intervals ${\hat{F F C I}}_{l}$ from dataset $A_{0}$ , using Equation (18).
4.: Compute empirical size and test power: ${\hat{κ}}_{φ_{x}} (l)$ and ${\hat{π}}_{φ_{y}} (l)$ , using Equation (21).
5.: Select informative ${\hat{F F}}_{l}$ , i.e., $L_{F F}$ according to Equation (22).
Confidence interval construction
6.: Compute the health indicator ${\hat{H I}}_{x_{i}^{*}}$ from signals $x_{i}^{*} \in A_{0}$ using the selected $\hat{F F}$ Equation (23).
7.: Establish the indicator confidence interval $\hat{H I C I}$ from healthy dataset $A_{0}$ using Equation (24).

2.5. Validation Procedure

For a signal z with the HI-based statistic

{\hat{H I}}_{z}

, the test

ϕ_{z} (H I)

is defined as follows:

ϕ_{z} (HI) = \{\begin{matrix} 1, & {\hat{H I}}_{z} \notin \hat{H I C I}, \\ 0, & {\hat{H I}}_{z} \in \hat{H I C I}, \end{matrix}

(25)

with the following testing hypotheses:

H_{0} : {\hat{H I}}_{z} \in \hat{H I C I} (healthy) vs . H_{1} : {\hat{H I}}_{z} \notin \hat{H I C I} (faulty) .

(26)

For the signals

s_{i} \in A_{2}

(healthy) and

u_{i} \in B_{2}

(faulty), where

i = 1, \dots, M

, the indicator values

{\hat{H I}}_{s_{i}}

and

{\hat{H I}}_{u_{i}}

are computed using Equation (23), and each signal is classified according to the HI-based decision rule defined in Equation (25). The count of correct decisions (CCD) is then defined as

{\hat{CCD}}_{H} = \sum_{s_{i} \in A_{2}} ϕ_{s_{i}} (HI) (healthy), {\hat{CCD}}_{F} = \sum_{u_{i} \in B_{2}} ϕ_{u_{i}} (HI) (faulty),

(27)

where

ϕ_{z} (HI)

is the decision function defined in Equation (25), returning 1 for faulty signals and 0 for healthy ones.

The diagnostic performance is evaluated using the fault detection rates (FDRs):

{FDR}_{H} = \frac{{\hat{CCD}}_{H}}{M} (healthy), {FDR}_{F} = \frac{{\hat{CCD}}_{F}}{M} (faulty) .

(28)

A higher

{FDR}_{F}

value (closer to 1) indicates superior fault detection performance;

{FDR}_{F} = 1

means that 100% of faulty signals are correctly identified as faulty. Conversely, a lower

{FDR}_{H}

value (closer to 0) reflects better acceptance of healthy signals;

{FDR}_{H} = 0

implies that none of the healthy signals were incorrectly classified as faulty, i.e., 100% of healthy signals were correctly identified as healthy. The desired diagnostic property is to achieve

{FDR}_{H}

close to 0 and

{FDR}_{F}

close to 1. The algorithm of the validation procedure is summarized in Algorithm 3.

Algorithm 3: Validation procedure (for each current phase, fault type and load)

Data: Current-signal datasets: healthy

A_{2} \in A

and faulty

B_{2} \in B

;

The confidence interval health indicator of healthy dataset:

\hat{H I C I}

Result: Classification of the tested signal as healthy or faulty

Validation procedure:

1.

Import signals from

A_{2}

and

B_{2}

.

2.

For each

s_{i} \in A_{2}

,

u_{i} \in B_{2}

, compute the spectrum and estimate informative

{\hat{F F}}_{l}, l \in L_{F F}

using Equation (10).

3.

For each signal z, compute

\hat{H I} (z)

using Equation (23).

4.

Apply the HI-based decision rule Equation (25):

If ${\hat{H I}}_{z} \in \hat{H I C I}$ : classify as healthy.
If ${\hat{H I}}_{z} \notin \hat{H I C I}$ : classify as faulty.

5.

Calculate

{FDR}_{H}

and

{FDR}_{F}

using Equation (28) and report the overall diagnostic performance.

Figure 2 depicts the schematic block flowchart of the proposed methodology, covering dataset construction (Stage 1), feature extraction and selection (Stage 2.1), health-indicator and confidence intervals calculation (Stage 2.2), and the final decision and validation procedure (Stage 3). The figure provides a compact overview that is referenced throughout the section.

3. Experiments

Experimental research aimed at detecting and analyzing inter-turn short circuits in the stator winding of an induction motor was conducted on a specially designed laboratory stand. This chapter details the stand’s configuration, the measuring components used, and the adopted methodology for fault modelling.

3.1. Research Stand and Experimental Methodology

The research stand consisted of two fundamental subsystems: a drive system with the motor under test and a data acquisition and measurement system (see Figure 3). The main component was an Sh90L-4 squirrel-cage induction motor (Cantoni Group, Cieszyn, Poland) with a rated power of 1.5 kW, which served as the test object. This motor was powered via a frequency converter operating in scalar control mode (U/f = const) with an open speed loop. The output frequency range was set from 10 to 50 Hz to cover typical industrial variable-speed drive scenarios, from light-load (10 Hz) to rated-speed (50 Hz @ 1500 rpm) operation.

The load torque was generated in a controlled manner by a second drive unit, which utilized a Permanent Magnet Synchronous Motor (PMSM). This configuration, with the PMSM acting as a load motor, allowed for the precise application of a variable mechanical torque to the shaft of the motor under test, enabling the observation of its behaviour across different operating points.

3.2. Modelling Stator Winding Faults

A crucial aspect of the stand was the physical capability to simulate inter-turn short circuits. For this purpose, the tested Sh90L-4 motor was specially modified. A series of taps were brought out from the original stator winding, corresponding to a specific number of turns in each phase (in the range from 0 to 10 turns). These taps were connected to an external terminal board, as illustrated in Figure 4.

A short circuit between selected turns was realized by directly connecting (shorting) the corresponding terminals on this board (Figure 5). It is important to emphasize that no additional current-limiting resistance was introduced into the short-circuit loop. This approach enabled the replication of conditions similar to a real, sudden fault, characterized by significant currents flowing through the shorted loop.

Following this approach, we explicitly define five fault-severity levels by the number of shorted turns (

N_{s h}

), which we use as an unambiguous measure of severity (independent of operating-point-dependent short-circuit current). The levels are summarized in Table 3.

From a diagnostic perspective, fault 1 is the least severe in terms of operational disruption (the motor can still run), but it is the most challenging to detect due to the weak signal signature. However, early detection at this stage is critical because the high circulating current in the shorted loop creates a localized hotspot. Without intervention, this leads to rapid insulation degradation, propagating the fault to more turns and eventually causing a catastrophic phase-to-ground or phase-to-phase failure.

3.3. Data Acquisition System

The measurement of current signals was fundamental to the diagnostic process. For recording the three-phase stator currents, three LEM LA 25-NP current transducers (LEM International SA, Meyrin, Switzerland) were used. The main metrological parameters of these transducers are summarized in Table 4.

The voltage signals from the outputs of the LEM transducers were fed to a high-quality National Instruments NI PXIe-4492 data acquisition card, (National Instruments NI, Austin, TX, USA) mounted in a PXI system. This card, with key parameters presented in Table 5, ensured signal recording with very high resolution.

The entire measurement process, including card control, data collection, preliminary processing, and visualization, was managed by a proprietary application developed in the LabVIEW environment.

4. Results

In this section, the analyzed signals are presented. The signals correspond to different fault stages, including the healthy condition and the most severe fault (fault 5). The datasets

A

and

B

are independent (the datasets come from the same machine but from different experiments run at different times). Spectra are computed using a Hann-windowed periodogram with fixed parameters, and stability is verified by confirming that spectral estimates and their confidence intervals do not vary systematically across adjacent windows. The measurements were conducted under different levels of load conditions (from level 0 to 6), with the load gradually increasing over time. The recorded signals (with each of the three phase currents overlapping) are shown in Figure 6 with the zoomed in view presented in Figure 7. There is a visible increase in the signal amplitude for fault 2–fault 5, corresponding to the increasing load over time for load levels greater than 2. The load profiles estimated from the signals are illustrated in Figure 8, with the zoomed in view presented in Figure 9. Estimated load profiles exhibit slight variations between fault categories, resulting from fault-induced changes in current amplitude and waveform distortion. For higher fault severities (faults 2–5), phase divergence becomes more pronounced due to increased stator current asymmetry. The load level was estimated from the RMS value of the stator current.

4.1. Dataset Construction for Analyzed Current Data

For each load level, 5 s current signals were extracted for further analysis (highlighted in the green shaded areas in Figure 6, Figure 7 and Figure 8). The sampling frequency of the measurements was 50 kHz. Considering the requirements of the intended industrial application, the maximum signal length was limited to 0.2 s (10,000 samples). In the spectral analysis, the FFT length was set to 8192 points, which resulted in a frequency resolution of approximately 6 Hz. Each 5 s signal (for every load and current phase) was further divided into 0.2 s segments to create equal-sized datasets:

A_{0}

,

A_{1}

, and

A_{2}

for healthy signals, and

B_{1}

and

B_{2}

for faulty signals. Each dataset contained 100 segments that were used for statistical testing. The indices of the selected signals in each dataset were determined using a pseudo-random number generator that produces a deterministic sequence of numbers that appear random. For example, set

A_{0}

contains indices starting with

[73, 229, 92, 220, 224, 180, \dots]

, whereas set

A_{1}

starts with

[74, 93, 96, 44, 211, 153, \dots]

. This segmentation procedure ensures a balanced and statistically representative dataset for evaluating the proposed method.

4.2. Partial Results—Feature Selection Based on Statistical Testing

In this section, the results of the proposed methodology are presented for the data corresponding to the maximum load applied during the experiment (load 6). As discussed in Section 2.2, fault detection under very light loads can be challenging. Therefore, load 6 was selected as the most representative operating condition for demonstrating the analysis results. The following subsections present the obtained health indicators, confidence intervals, and diagnostic performance measures.

Exemplary current signals for healthy and faulty conditions are presented in Figure 10. Each subplot contains six different signals, although some of them overlap and are therefore not fully visible. A slight phase shift between the signals can be observed when comparing healthy and faulty cases. There is also a small difference in the maximum amplitudes among the current types (IsA, IsB, and IsC), but it is difficult to distinguish visually.

In Figure 11, the signal spectra are presented. The 0.2 s signal duration yields a frequency resolution of approximately 6 Hz (with 8192-point FFT), which constrains the separation of closely spaced harmonics. Despite this limitation, the proposed statistical framework successfully identifies informative fault frequencies.

The fault frequencies, denoted as

{\hat{F F}}_{l}

, are identified as local maxima within the theoretical fault frequency range and a window of

\pm 2

samples (according to Equation (10)). This range is indicated by the gray shaded area.

As shown in Figure 11, certain fault frequencies, such as

F F_{4}

(559.7 Hz) and

F F_{5}

(659.7 Hz), exhibit noticeable variations in spectral amplitude as the damage level increases. To verify this observation, the amplitudes of the spectrum at

F F_{l}

are analyzed. The corresponding

{\hat{F F}}_{l}

values are summarized in Table 6.

Figure 12, Figure 13, Figure 14, Figure 15, Figure 16, Figure 17, Figure 18, Figure 19, Figure 20, Figure 21, Figure 22, Figure 23, Figure 24 and Figure 25 present the results of testing the

{\hat{F F}}_{l}

amplitudes for each fault stage and healthy data.

Spectrum amplitudes at the fault frequencies

{\hat{F F}}_{l}

, obtained from the healthy dataset

A_{0}

(shown as blue dots), are used to construct the

99 %

confidence intervals

{\hat{F F C I}}_{l}

(blue shaded areas) and to perform the corresponding hypothesis testing. Figure 12, Figure 13, Figure 14, Figure 15, Figure 16, Figure 17 and Figure 18 present the testing results for the healthy data

{\hat{F F}}_{l} \in A_{1}

(red dots). Each figure includes results for the three current phases—IsA, IsB, and IsC—displayed in separate subplots, with the titles indicating the empirical size

{\hat{κ}}_{φ_{x}}

of the test. The desired outcome is

{\hat{κ}}_{φ_{x}} \leq 0.01

. As observed, four frequencies

l \in L

do not meet this criterion, with the following results:

{\hat{κ}}_{φ_{x} (2)} = 0.04

(IsB, IsC),

{\hat{κ}}_{φ_{x} (3)} = 0.02

(IsA),

{\hat{κ}}_{φ_{x} (4)} = 0.02

(IsA), and

{\hat{κ}}_{φ_{x} (7)} = 0.02

(IsB). Consequently,

F F_{2}

for IsB and IsC,

F F_{3}

for IsA,

F F_{4}

for IsA, and

F F_{7}

for IsB are excluded from the set of informative fault frequencies, as they do not satisfy the expected result of Criterion 1.

Figure 19, Figure 20, Figure 21, Figure 22, Figure 23, Figure 24 and Figure 25 present the results of testing the faulty data

{\hat{F F}}_{l} \in B_{1}

(red dots). Each figure shows the results for individual current phases—IsA, IsB, and IsC—in separate subplots, with titles indicating the test power

{\hat{π}}_{φ_{y}}

, for which the desired outcome is

{\hat{π}}_{φ_{y}} \geq 0.99

.

For

{\hat{F F}}_{5}

, the value of

{\hat{π}}_{φ_{y}} = 1

is achieved for all fault stages and current phases. In contrast, for

{\hat{F F}}_{7}

, none of the analyzed cases reaches the expected value. Consequently,

F F_{7}

is excluded from the set of informative fault frequencies, as it does not satisfy the expected result of Criterion 2.

Among all analyzed frequencies,

F F_{4}

and

F F_{5}

most consistently meet the testing criteria, indicating a strong sensitivity of their spectral components to fault-related variations in the current signal.

The summarized testing results are presented in Figure 26 and Figure 27. Figure 26 presents the aggregate results for the healthy-case test (test on healthy data) across loads and fault frequency indexes for all currents. Figure 27 presents the aggregate results for the faulty-case test (test on faulty data) across loads and fault frequency indexes for all currents.

In Table 7, the summarized results of feature selection based on statistical testing are presented for all load levels and current phases.

Since, for some analyzed cases, it was not possible to fully satisfy the statistical test assumptions and extract an informative set

L_{F F}

of fault frequency indexes (as even a minor deviation in the test power or the empirical size exceeding, e.g., 0.005 leads to rejection), to ensure consistency and robustness, the informative set

L_{F F}

of fault frequency indexes identified at maximum load (load 6) for each current separately is adopted across all load levels. This choice is justified by the superior statistical separability between healthy and faulty conditions observed at higher loads.

Therefore, without a significant loss of generality, the set of fault frequency indexes identified as informative for load 6 is adopted for all load levels, namely, for IsA—

L_{F F} = 5

, for IsB—

L_{F F} = {4, 5}

, and for IsC—

L_{F F} = 5

.

4.3. Partial Results—Health Indicator and Confidence Interval Construction

The informative fault frequencies (IsA—

F F_{5}

, IsB—

F F_{4}

,

F F_{5}

, and IsC—

F F_{5}

) are used to calculate the

\hat{H I}

values. The 99% confidence intervals

\hat{H I C I}

, computed based on the healthy dataset

A_{0}

for each current phase, are represented by the blue shaded areas and are used in the decision-making process.

Figure 28 presents the results of testing the healthy dataset

A_{2}

(red dots), with the corresponding fault detection rate

{\hat{F D R}}_{H}

values indicated in the subplot titles. The expected outcome is

{\hat{F D R}}_{H} = 0

, meaning that all healthy data samples are correctly classified as healthy. Each figure shows the results for the individual current phases—IsA, IsB, and IsC—in separate subplots.

As observed, the

{\hat{F D R}}_{H}

values for all current phases remain close to zero, confirming the high reliability of the proposed approach in correctly identifying healthy operating conditions.

Figure 29 presents the results of testing the faulty dataset

B_{2}

(red dots), with the corresponding fault detection rate

{\hat{F D R}}_{F}

values indicated in the subplot titles. The expected outcome is

{\hat{F D R}}_{F} = 1

, which means that all

\hat{H I}

values corresponding to faulty data are correctly classified as faulty. Each figure shows the results for the individual current phases—IsA, IsB, and IsC—in separate subplots.

As observed, the

{\hat{F D R}}_{F}

values for all current phases are equal to 1, confirming the full detection of faulty conditions by the proposed method.

4.4. Validation Procedure for Analyzed Current Data

Figure 30 presents the summarized results of the validation procedure for all load conditions, current phases, and fault categories, using the testing datasets

A_{2}

and

B_{2}

. As observed, the fault detection effectiveness for the faulty datasets reaches 100% across all analyzed cases. For the healthy datasets, the effectiveness remains nearly 100%, with

{\hat{F D R}}_{H}

values close to zero, confirming the high reliability of the proposed diagnostic methodology.

The data from the same test rig, acquired during a separate experimental session at load 0 (the most challenging case for diagnosis), are used for testing (10 signals of 0.2 s). The exemplary signals of currents A, B, and C for healthy and faulty data are presented in Figure 31. The corresponding spectra are presented in Figure 32. These separate experiments ensure that the thermal conditions, background noise, and mechanical transients are uncorrelated with the training data. The proposed statistical method was applied to this new dataset without re-training the baseline parameters. The results of health indicator testing are presented in Figure 33, Figure 34 and Figure 35.

The summarized results obtained for this independent validation scenario are presented in Figure 36 (tested healthy data) and Figure 37 (tested faulty data). This additional validation confirms that the high accuracy reported is not a result of data leakage but demonstrates the method’s capability to generalize across different operating cycles.

4.5. Influence of the Significance Level on the HI Outcome and Detection Performance

In the proposed methodology, the significance level

α

governs the strictness of the decision rule through the width of the two-sided acceptance band defined by the

(1 - α)

confidence interval (CI). Decreasing

α

widens the CI (i.e., produces a more conservative acceptance band), which reduces the likelihood of spurious detections on healthy signals. Conversely, increasing

α

leads to narrower CIs and therefore a tighter acceptance band, enhancing sensitivity to deviations while potentially increasing the false-alarm rate in healthy data.

This behavior is consistent with the nominal size of a two-sided test: for a

(1 - α)

confidence band, approximately

α

of healthy observations are expected to fall outside the band, with about

α / 2

in each tail. As a result, a larger

α

naturally tends to increase the false discovery rate computed on healthy data, denoted as

{FDR}_{H}

.

Figure 38 empirically confirms this relationship by showing an almost linear trend:

{FDR}_{H}

increases with

α

over the considered range. To evaluate whether this effect is consistent across operating conditions, Figure 39 summarizes

{FDR}_{H}

for all tested loads and currents. The results indicate that the increase in

{FDR}_{H}

with

α

is systematic across the full set of conditions, which further supports using a small

α

when false-alarm control on healthy signals is prioritized.

At the same time, selecting a too small

α

may reduce statistical power (increase Type II errors), potentially delaying fault detection. Importantly, this trade-off does not manifest as a loss of fault detection effectiveness in our experiments. Figure 40 aggregates

{FDR}_{F}

(computed on faulty data) across all loads and currents and shows no degradation of performance for the tested values of

α

.

Therefore, while

α

primarily affects

{FDR}_{H}

through CI tightening/loosening, the fault-related metric

{FDR}_{F}

remains stable, indicating no loss in efficiency.

Based on these observations, we adopt

α = 0.01

as a deliberate compromise that emphasizes robustness against false alarms while preserving fault detection performance across the full range of operating conditions.

The average processing time per 0.2 s window for the entire online pipeline (three phases, and all fault/healthy states) was 13.43 ms, i.e., about

6.7 %

of the 200 ms window duration, leaving a

93.3

timing margin. This confirms real-time feasibility of the online stage.

5. Conclusions

A feature-based methodology for fault detection in induction motor systems has been presented, utilizing three-phase stator current signals. In the proposed approach, frequency-domain analysis was combined with statistical testing to identify informative fault-related spectral components, construct confidence intervals, and develop a quantitative health indicator HI along with its confidence range HICI. Most existing MCSA/envelope-based approaches rely on very similar frequency-domain indicators (e.g., amplitudes at fault-related sidebands). The difference lies in how the final decision is made from these features. In standard MCSA/envelope practice, fault detection typically depends on selecting a fixed threshold, which is often chosen heuristically, tuned separately for different operating conditions, or requires expert interpretation. As a result, the decision boundary may be difficult to reproduce, and the achieved false-alarm rate is not explicitly controlled, especially when load or current conditions vary.

The proposed methodology replaces heuristic thresholding with an automated, data-driven statistical test. Empirical confidence bands are estimated from healthy data offline, and online detection reduces to verifying whether the indicator falls outside the

(1 - α)

acceptance band. This yields explicit and interpretable control of the nominal false-alarm rate via

α

and provides a consistent, automatically constructed decision boundary without manual threshold selection.

It was demonstrated that the informative fault frequencies—

F F_{5}

for IsA,

F F_{4}

and

F F_{5}

for IsB, and

F F_{5}

for IsC—exhibit the highest sensitivity to fault-induced variations in the current spectrum. Through validation under multiple load conditions, it was confirmed that using data from load 6 as a reference provides robust generalization and high diagnostic accuracy across all operating conditions. For the faulty datasets, the fault detection rate

{\hat{F D R}}_{F}

was found to reach

100 %

in all analyzed cases, whereas for the healthy datasets,

{\hat{F D R}}_{H}

values remained close to zero, indicating the absence of false alarms. These results demonstrate that the proposed health indicator and statistical testing framework offer high reliability, well-defined decision boundaries, and robustness to load variations.

Although the experimental validation was conducted on an Sh90L-4 motor, the proposed method is adaptable to induction motors with different structural parameters. Since the fault-related frequencies (such as RSH) are determined analytically based on the number of rotor bars (

N_{r}

) and pole pairs (

p_{p}

), the algorithm can be reconfigured for any motor by updating these parameters. The statistical main of the method operates independently of the specific frequency location, making it applicable to a wide range of AC machinery, provided that the characteristic frequencies do not overlap significantly with the fundamental supply frequency or its low-order harmonics.

Obtaining a guaranteed “perfectly healthy” baseline is a common challenge in industrial condition monitoring. The standard procedure is to perform the healthy state acquisition (dataset

A_{0}

) immediately after motor installation or maintenance. This ensures that the baseline reflects the optimal machine state. In a bad scenario, when the system is installed on a motor that is already in operation (with unknown health status), the current state is adopted as the baseline

A_{0}

. The proposed method functions as an anomaly detector. It detects statistically significant deviations from the reference state, rather than measuring absolute health against a theoretical ideal. A key advantage of our statistical approach is its adaptability. If the “baseline” motor already exhibits slight wear or noise, the variance in the spectral components in the dataset

A_{0}

will be naturally higher. Consequently, the calculated confidence interval automatically widens. This mechanism naturally desensitizes the health indicator to the pre-existing condition, ensuring that the system detects only significant future degradation (trend changes) rather than triggering false positives based on the initial state.

The proposed methodology is particularly tailored for two specific industrial scenarios where traditional MCSA is often inapplicable due to signal duration requirements. The first scenario involves automated manufacturing and robotics (e.g., pick-and-place operations), where motors operate under short duty cycles with rapid speed changes. In such regimes, acquiring a continuous 1 s steady-state signal is unfeasible, whereas the proposed method effectively utilizes brief 0.2 s stable windows for reliable diagnosis. The second scenario concerns Edge AI and IoT monitoring systems implemented on low-cost microcontrollers. The capability to achieve high diagnostic accuracy using short data buffers minimizes memory usage and computational latency, enabling cost-effective, real-time condition monitoring on embedded platforms.

In conclusion, the developed methodology can be regarded as an interpretable and statistically grounded framework for fault detection in electric machines. Future work will focus on extending the method to enable fault classification and validating the approach in real industrial environments with time-varying load profiles.

Author Contributions

Conceptualization, J.H.-S.; methodology, J.H.-S.; software, J.H.-S., M.W. and K.S.; validation, J.H.-S., A.M. and J.W.; formal analysis, J.H.-S.; investigation, J.H-S.; resources, J.H.-S. and R.Z.; data curation, M.W.; writing—J.H.-S., M.W. and A.M.; writing—review and editing, J.H.-S., A.M., M.W. and R.Z.; visualization, J.H.-S. and M.W.; supervision, R.Z.; project administration, K.S.; funding acquisition, K.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work is funded by the European Union under GA no 101101961—HECATE. Views and opinions expressed are, however, those of the author(s) only and do not necessarily reflect those of the European Union or Clean Aviation Joint Undertaking. Neither the European Union nor the granting authority can be held responsible for them. The project is supported by the Clean Aviation Joint Undertaking and its Members.

Data Availability Statement

The data presented in this study are not publicly available due to confidentiality restrictions.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Drakaki, M.; Karnavas, Y.L.; Tziafettas, I.A.; Linardos, V.; Tzionas, P. Machine learning and deep learning based methods toward industry 4.0 predictive maintenance in induction motors: State of the art survey. J. Ind. Eng. Manag. 2022, 15, 31–57. [Google Scholar] [CrossRef]
Achouch, M.; Dimitrova, M.; Ziane, K.; Sattarpanah Karganroudi, S.; Dhouib, R.; Ibrahim, H.; Adda, M. On Predictive Maintenance in Industry 4.0: Overview, Models, and Challenges. Appl. Sci. 2022, 12, 8081. [Google Scholar] [CrossRef]
Singh, R.R.; Bhatti, G.; Kalel, D.; Vairavasundaram, I.; Alsaif, F. Building a Digital Twin Powered Intelligent Predictive Maintenance System for Industrial AC Machines. Machines 2023, 11, 796. [Google Scholar] [CrossRef]
Ayvaz, S.; Alpay, K. Predictive maintenance system for production lines in manufacturing: A machine learning approach using IoT data in real-time. Expert Syst. Appl. 2021, 173, 114598. [Google Scholar] [CrossRef]
Joksimovic, G.M.; Penman, J. The detection of inter-turn short circuits in the stator windings of operating motors. IEEE Trans. Ind. Electron. 2000, 47, 1078–1084. [Google Scholar] [CrossRef]
Nandi, S.; Toliyat, H.A. Novel frequency-domain-based technique to detect stator interturn faults in induction machines using stator-induced voltages after switch-off. IEEE Trans. Ind. Appl. 2002, 38, 101–109. [Google Scholar] [CrossRef]
Li, W.; Mechefske, C.K. Detection of induction motor faults: A comparison of stator current, vibration, and acoustic methods. J. Vib. Control 2006, 12, 165–188. [Google Scholar] [CrossRef]
Niu, G.; Dong, X.; Chen, Y. Motor fault diagnostics based on current signatures: A review. IEEE Trans. Instrum. Meas. 2023, 72, 3520919. [Google Scholar] [CrossRef]
Benbouzid, M.E.H.; Kliman, G.B. What stator current processing-based technique to use for induction motor rotor faults diagnosis? IEEE Trans. Energy Convers. 2003, 18, 238–244. [Google Scholar] [CrossRef]
El Bouchikhi, E.H.; Choqueuse, V.; Benbouzid, M. Induction machine faults detection using stator current parametric spectral estimation. Mech. Syst. Signal Process. 2015, 52–53, 447–464. [Google Scholar] [CrossRef]
Kouadria, M.; Chedjara, Z.; Su, C.L.; Benbouzid, M.; Guerrero, J.M.; Ibrahim, B.S.K.K.; Ahmed, H. Diagnosis of induction motor stator faults around rotor slot harmonics using the Matrix Pencil method. Results Eng. 2025, 25, 104240. [Google Scholar] [CrossRef]
Garbiec, T.; Jagiela, M. Accounting for Slot Harmonics and Nonsinusoidal Unbalanced Voltage Supply in High-Speed Solid-Rotor Induction Motor Using Complex Multi-Harmonic Finite Element Analysis. Energies 2021, 14, 5404. [Google Scholar] [CrossRef]
Elbouchikhi, E.; Benbouzid, M. Parametric signal processing approach. In Signal Processing for Fault Detection and Diagnosis in Electric Machines and Systems; The Institution of Engineering and Technology: Stevenage, UK, 2020; Chapter 1; pp. 3–50. [Google Scholar] [CrossRef]
Huillery, J.; Millioz, F.; Martin, N. On the description of spectrogram probabilities with a chi-squared law. IEEE Trans. Signal Process. 2008, 56, 2249–2258. [Google Scholar] [CrossRef]
Hory, C.; Martin, N.; Chehikian, A. Spectrogram segmentation by means of statistical features for non-stationary signal interpretation. IEEE Trans. Signal Process. 2002, 50, 2915–2925. [Google Scholar] [CrossRef]
Kumar, R.R.; Andriollo, M.; Cirrincione, G.; Cirrincione, M.; Tortella, A. A Comprehensive Review of Conventional and Intelligence-Based Approaches for the Fault Diagnosis and Condition Monitoring of Induction Motors. Energies 2022, 15, 8938. [Google Scholar] [CrossRef]
Gerber, T.; Martin, N.; Mailhes, C. Time-frequency tracking of spectral structures estimated by a data-driven method. IEEE Trans. Ind. Electron. 2015, 62, 6616–6626. [Google Scholar] [CrossRef]
Percival, D.B.; Walden, A.T. Spectral Analysis for Physical Applications: Multitaper and Conventional Univariate Techniques; Cambridge University Press: Cambridge, UK, 1993. [Google Scholar]
Cioch, W.; Knapik, O.; Leśkow, J. Finding a frequency signature for a cyclostationary signal with applications to wheel bearing diagnostics. Mech. Syst. Signal Process. 2013, 38, 55–64. [Google Scholar] [CrossRef]
Firla, M.; Li, Z.Y.; Martin, N.; Pachaud, C.; Barszcz, T. Automatic characteristic frequency association and all-sideband demodulation for the detection of a bearing fault. Mech. Syst. Signal Process. 2016, 80, 335–348. [Google Scholar] [CrossRef]
Wodecki, J.; Michalak, A.; Hebda-Sobkowicz, J.; Wyłomańska, A.; Szabat, K.; Wolkiewicz, M.; Pawlak, M.; Zimroz, R. Cyclostationary analysis of vibration signals from electric motor—Understanding of bi-frequency map. Mech. Sci. 2025, 16, 597–614. [Google Scholar] [CrossRef]
Peng, Z.; Chu, F. Application of the wavelet transform in machine condition monitoring and fault diagnostics: A review with bibliography. Mech. Syst. Signal Process. 2004, 18, 199–221. [Google Scholar] [CrossRef]
Feng, Z.; Chu, F. Cyclostationary Analysis for Gearbox and Bearing Fault Diagnosis. Shock Vib. 2015, 2015, 542472. [Google Scholar] [CrossRef]
Dong, X.; Yuan, J.; Xiong, L.; Niu, G. Fault Detection of Interturn Short Circuit in Induction Motors Under Nonstationary Conditions and Unbalanced Supply Voltage. IEEE Trans. Instrum. Meas. 2024, 73, 3527410. [Google Scholar] [CrossRef]
Diversi, R.; Guidorzi, R.; Soverini, U.; Tondini, A.; Biagiotti, L.; Papi, M.; Saponara, S. An Autoregressive-Based Motor Current Signature Approach for Fault Diagnosis. Sensors 2025, 25, 1130. [Google Scholar] [CrossRef]
Tomczyk, M.; Szeląg, W.; Grzebyk, T.; Klimczak, A. Identification of Inter-Turn Short-Circuits in Induction Motor Stator Windings under Load Variations. Energies 2021, 15, 117. [Google Scholar] [CrossRef]
Bakhri, S.; Ertugrul, N. A Negative Sequence Current Phasor Compensation Technique for the Accurate Detection of Stator Shorted Turn Faults in Induction Motors. Energies 2022, 15, 3100. [Google Scholar] [CrossRef]
Oviedo, S.; Borras-Morell, C. Motor current signature analysis and negative sequence current based stator winding short fault detection in an induction motor. Dyna 2011, 170, 217–226. [Google Scholar]
Ruzimov, S.; Zhang, J.; Huang, X.; Aziz, M.S. Detection of Inter-Turn Short-Circuit Faults for Inverter-Fed Induction Motors Based on Negative-Sequence Current Analysis. Sensors 2025, 25, 4844. [Google Scholar] [CrossRef]
Rengifo, J.; Moreira, J.; Vaca-Urbano, F.; Alvarez-Alvarado, M.S. Detection of Inter-Turn Short Circuits in Induction Motors Using the Current Space Vector and Machine Learning Classifiers. Energies 2024, 17, 2241. [Google Scholar] [CrossRef]
Chen, L.; Shen, J.; Xu, G.; Chi, C.; Feng, Q.; Zhou, Y.; Deng, Y.; Wen, H. Induction motor stator winding inter-turn short circuit fault detection based on start-up current envelope energy. Sensors 2023, 23, 8581. [Google Scholar] [CrossRef]
Guedidi, A.; Laala, W.; Guettaf, A.; Arif, A. Early detection and localization of stator inter-turn short circuit fault in induction motor using deep learning-based multi-sensor fusion. Diagnostyka 2023, 24, 2023401. [Google Scholar] [CrossRef]
Picot, A.; Obeid, Z.; Régnier, J.; Poignant, S.; Darnis, O.; Maussion, P. Statistic-based spectral indicator for bearing fault detection in permanent-magnet synchronous machines using the stator current. Mech. Syst. Signal Process. 2014, 46, 424–441. [Google Scholar] [CrossRef]
Harmouche, J.; Delpha, C.; Diallo, D. Improved fault diagnosis of ball bearings based on the global spectrum of vibration signals. IEEE Trans. Energy Convers. 2015, 30, 376–383. [Google Scholar] [CrossRef]
Mari, S.; Bucci, G.; Ciancetta, F.; Fiorucci, E.; Fioravanti, A. Impact of Measurement Uncertainty on Fault Diagnosis Systems: A Case Study on Electrical Faults in Induction Motors. Sensors 2024, 24, 5263. [Google Scholar] [CrossRef]
Wasserman, L. All of Statistics: A Concise Course in Statistical Inference; Springer Texts in Statistics; Springer: New York, NY, USA, 2004. [Google Scholar] [CrossRef]
Lehmann, E.L.; Romano, J.P. Testing Statistical Hypotheses, 3rd ed.; Springer: New York, NY, USA, 2005. [Google Scholar]
Casella, G.; Berger, R.L. Statistical Inference, 2nd ed.; Duxbury/Thomson Learning: Pacific Grove, CA, USA, 2002. [Google Scholar]
Rivera, M.; Faiz, J. Discrimination of stator inter-turn short-circuit and voltage unbalance using current unbalance indicators. IEEE Trans. Ind. Electron. 2020, 67, 6420–6429. [Google Scholar]

Figure 1. Exemplary results of

{\hat{F F}}_{l}

selection on

\hat{S D}

of the current A (IsA) of healthy signal and signals at different fault stages.

Figure 1. Exemplary results of

{\hat{F F}}_{l}

selection on

\hat{S D}

of the current A (IsA) of healthy signal and signals at different fault stages.

Figure 2. Flowchart of the proposed methodology. Stage 1: dataset construction; Stage 2.1: feature modeling and selection; Stage 2.2: health indicator and confidence interval; Stage 3: testing and validation.

Figure 3. Experiment station designed for the study on stator winding faults.

Figure 4. Specially prepared induction motor for stator winding faults.

Figure 5. Terminal board of the modified induction motor with winding taps.

Figure 6. Current signals with different stages of fault and load.

Figure 7. Current signals with different stages of fault and load—zoomed in view.

Figure 8. Load profiles of healthy signal and signals with different stages of fault.

Figure 9. Load profiles of healthy signal and signals with different stages of fault—zoomed in view.

Figure 10. Current signals for 0.2-s length signals (load 6).

Figure 11. Comparison of spectra for 0.2-s current signals (load 6).