An Advanced Diagnostic Approach for Broken Rotor Bar Detection and Classification in DTC Controlled Induction Motors by Leveraging Dynamic SHAP Interaction Feature Selection (DSHAP-IFS) GBDT Methodology

Khan, Muhammad Amir; Asad, Bilal; Vaimann, Toomas; Kallaste, Ants

doi:10.3390/machines12070495

Open AccessArticle

An Advanced Diagnostic Approach for Broken Rotor Bar Detection and Classification in DTC Controlled Induction Motors by Leveraging Dynamic SHAP Interaction Feature Selection (DSHAP-IFS) GBDT Methodology

¹

Department of Electrical (Power) Engineering, The Islamia University of Bahawalpur, Bahawalpur 6300, Pakistan

²

Department of Electrical Power Engineering and Mechatronics, Tallinn University of Technology, 12616 Tallinn, Estonia

^*

Author to whom correspondence should be addressed.

Machines 2024, 12(7), 495; https://doi.org/10.3390/machines12070495

Submission received: 21 June 2024 / Revised: 15 July 2024 / Accepted: 19 July 2024 / Published: 22 July 2024

(This article belongs to the Special Issue Machine Learning Based Predictive Maintenance and Condition Monitoring)

Download

Browse Figures

Versions Notes

Abstract

This paper introduces a sophisticated approach for identifying and categorizing broken rotor bars in direct torque-controlled (DTC) induction motors. DTC is implemented in industrial drive systems as a suitable control method to preserve torque control performance, which sometimes shows its impact on fault-representing frequencies. This is because of the DTC’s closed-loop control nature, whichtriesto reduce speed and torque ripples by changing the voltage profile. The proposed model utilizes the modified Shapley Additive exPlanations (SHAP) technique in combination with gradient-boosting decision trees (GBDT) to detect and classify the abnormalities in BRBs at diverse (0%, 25%, 50%, 75%, and 100%) loading conditions. To prevent overfitting of the proposed model, we used the adaptive fold cross-validation (AF-CV) technique, which can dynamically adjust the number of folds during the optimization process. By employing extensive feature engineering in the original dataset and then applying Shapely Additive exPlanations(SHAP)-based feature selection, our methodology effectively identifies informative features from signals (three-phase current, three-phase voltage, torque, and speed) and motor characteristics. The gradient-boosting decision tree (GBDT) classifier, trained using the given characteristics, extracts consistent and reliable classification performance under different loading circumstances and enables precise and accurate detection and classification of broken rotor bars. The proposed approach (SHAP-Fusion GBDT with AF-CV) is a major advancement in the field of machine learning in detecting motor anomalies at varying loading conditions and proved to be an effective mechanism for preventative maintenance and preventing faults in DTC-controlled induction motors byattaining an accuracy rate of 99% for all loading conditions.

Keywords:

broken rotor bars; artificial intelligence; fault classification; feature extraction; induction motors; condition monitoring; machine learning; frequency domain analysis; supervised learning; gradient-boosting trees; variable speed drives; time domain analysis

1. Introduction

Induction motors (IMs) are considered extremely important pieces of electrical equipment because of their variety of applications, such as pumps, compressors, fans, conveyors, traction applications, etc. Moreover, their simple, cheap, reliable, and efficient construction makes them suitable for rough industrial environments with extensive duty cycles. Induction motors (IMs) experience various stresses due to mechanical, electrical, thermal, and environmental factors, each of which can contribute to the degradation of motor performance and increase the likelihood of faults such as broken rotor bars [1]. Many issues can arise in the motors due to these stresses, and the lack of clarity regarding these problems may result in a disastrous motor breakdown. To mitigate these issues, it is imperative to provide early detection and diagnosis techniques for identifying potential faults in the components of the IM. This will provide sufficient warning time before they reach the point of failure [2]. Usually, broken rotor bars account for 10% of induction motor defects, making this one of the most frequently occurring problems. In closed-loop control systems, detecting faults may present difficulties because of several interlinked state variables. Field-oriented control (FOC) and direct torque control (DTC), two prevalent control methods, are renowned for their efficacy and feasibility. The occurrence of a high-frequency factor in the stator current spectrum of DTC-fed squirrel cage induction motors (SCIM) is due to the influence of DTC’s control settings and switching frequencies [3,4]. Additionally, the amplitude of the phase current’s sideband near the source frequency is influenced by the control structure. The FEM analysis and cross-sectional view of the induction motor utilized in this investigation to analyze BRBs are depicted in Figure 1. It is evident that the flux density around the broken bars increases and puts the neighboring bars under increased magnetic stress, making them vulnerable too.

Approximately 85% of all electrical machines are squirrel cage induction motors (SQIMs) due to their reliability, robustness, and cost-effectiveness. Typically, variable-frequency drives (VFDs) or AC drives with adjustable speeds are the preferred methods for powering these induction motors, providing efficient and flexible control over their operation [5]. Overall, faults in induction motors can be broadly classified into electrical and mechanical categories. On the electrical side, approximately 30–40% of faults are associated with the stator, while 5–10% occur on the rotor side. Mechanical faults, which constitute about 40–50% of the total, often involve issues with components such as bearings and air gaps [6,7]. Rotor failures are caused by several different forces acting on them. The rotor bars can be affected due to thermal, electric, mechanical, and environmental factors. When there is a broken rotor bar (BRB), the torque and speed ripples increase, and the effective value is decreased. This may increase the vibrations affecting the other parts of the machine as well [8]. How to find and fix faults is an important part of protecting AC drives so that the problems mentioned above don’t happen. The MCSA method depends on specific frequencies in the current spectrum of the stator that serve as indicators of fault, such as broken rotor bars indicating frequencies described by Equation (1).

f_{b r b} = (1 \pm 2 K s) f_{s}

(1)

where

f_{s}

is the supply frequency, K is an integer, and s denotes the rotor slip.

The authors in [9] introduced a third-order energy operator (TOEO) that utilizes a demodulated current signal to detect damaged rotor bars in low-load induction machines. In [10], the authors suggested an approach based on time domain-based analysis of incipient faults that occurred in broken rotor bars byemploying electromagnetic signatures. In [11], the authors proposed artificial neural networks (ANNs) with Hilbert transform to diagnose bearing faults in induction machines through motor current signature analysis based on theirunique spectral signature. In [12], the authors utilized the fast Fourier transform (FFT) for analyzing and understanding patterns in healthy and faulty signals based on single-current acquisition to diagnose bearing faults. In [13,14], the authors suggested an FFT that helps to detect and segregate various faults and their severity by analyzing components near the fundamental frequency. In [15,16], the authors employed FFT with advanced signal processing techniques like STFT and WT to diagnose abnormal conditions in bearings by orthogonal matching. In [17], the authors suggested convolutional neural networks (CNNs) and motor current signature analysis (MCSA) during transient states to diagnose abnormal conditions in bearings in induction motors. In [18], the authors suggested a reliable approach to diagnosing early bearing faults by employing the inverse thresholding technique through non-stationary assessment of fault frequencies in induction machines. Alternatively, multiple studies suggest utilizing continuous or discrete wavelet transforms (CWTs and DWTs) [19,20] to analyze signals that are both stationary and non-stationary in the time-frequency domain. Moreover, techniques like MUSIC transform [21,22], the stator current envelope [23], the maximum covariance [24], and the zoom-FFT techniques [25] are widely available in the literature.

In [26,27], the authors suggested a novel and effective signal processing method for the fast Fourier transform (FFT) to identify broken bar faults by utilizing the Goertzel algorithm. In [28], the authors utilized stator currents, voltages, or stray flux by focusing on specific frequency components for diagnosing broken rotor bars. In [29], the authors utilized stock well transform by employing maximum magnitude and phase angle obtained through S-transform to diagnose bearing conditions of the shaft side and fan side in induction motors. In [30,31], the authors proposed the application of wavelet-based transforms and Wigner–Ville distributions for fault identification in inverter-driven induction motors to diagnose bearing faults. However, their effectiveness in detecting induction motors powered by inverters is uncertain due to the presence of cross-terms and varying time-frequency resolutions, as highlighted in [32]. In [33,34], the authors employed the short-time Fourier transform (STFT), a straightforward approach for time-frequency analysis, to accurately trace harmonics associated with damaged rotor bars in induction machines. The authors in [35] employed variational mode decomposition (VMD) and also the empirical wavelet transform (EWT) for the detection of broken rotor bar faults at different operational conditions. The methodology proposed by the authors [36] uses the adaptive slope transform (AST) combined with the Chirplet transform (CT) to detect and classify abnormal conditions of broken rotor bars (BRBs) in grid-fed (DTC) induction motors. The authors in [37,38] suggested the application of ensemble empirical mode decomposition (EEMD) as a better choice to diagnose broken rotor bars in inverter-driven induction motors. The instant speed during MME was also monitored for the diagnosis of BRB faults in induction motors powered by inverters [39,40]. The authors in [41] suggested the precise identification of BRB faults that are influenced by control dynamics and load variations, although explainability in the field of machine learning is now becoming an important technique for selecting hidden features from datasets.

After the advancements in the fields of machine learning and deep learning, many feature extraction and selection methodologies wereintroduced to search for the best features from a dataset, and SHAP is considered thebest feature selection technique to trace the informative attributes. This technique can handle large volumes of operations in data management to optimize their data center processes. In [42], the author’s introduceda new technique named SHAP as a feature selection mechanism, utilizing a game theoretic approach and attaining better outcomes than existing techniques. In [43], the authors suggested the convolutional neural network CNN to diagnose broken rotor bars using continuous wavelet transform to generate time-frequency images in induction motors. In [44], the proposed convolutional neural network uses thefinite element method FEM to trace incipient bearing faults in DTC control induction machines. In [32], authors proposed multiple machine-learning algorithms named DT, ANN, and deep learning models to diagnose bearing faults based on comparison methodologies. In [45], the authors make use of sparse autoencoders with multi-layer perception to detect bearing faults based on motor current signature analysis. In [46], the authors proposed amachine learning model named random forest to detect failure modes of machines in different sections based on Shapely Additive exPlanations and prove its validity on different experimental datasets. In [47], the authors proposed a hybrid ML algorithm deep neural network DNN with gradient-boosting decision tree GBDT and feature selection based on SHAP values and achieved remarkable breakthroughs in diagnosing problems related to the medical field. In [48], the authors proposed an improved version of the SHAP technique named Kernel SHAP to detect abnormalities in multiple fields through autoencoders. In [49], the authors proposed a novel framework using different boosting algorithms like XGBoost, AdaBoost, and LightGBM for diagnosing compressive strength predictions in concrete using the Shapely Additive exPlanations (SHAP) approach. In [50], the authors utilized Shapely Additive exPlanations on real operations of DC data streams obtained from accelerometers to evaluate their performance through MAE, MPAE, and RMSE assessment metrics and find excellent outcomes.In [51], authors proposed an efficient feature selection technique named Shapely Additive exPlanations for the diagnosis of bearing faults in induction machines using an ML algorithm support vector machine. In [52], the authors employed Shapely Additive exPlanations on the time series SAR dataset for the prediction of spatial landscapes utilizing different machine learning algorithms.

SHapely Additive exPlanations (SHAP)

Shapley Additive exPlanations (SHAP) is an additive approach utilized for interpreting machine learning models by quantifying the contribution of each feature to the model’s predictions. It was first introduced by Lundberg and Lee in 2017 and is specifically built for explainable AI (XAI) [53]. As stated above, SHAP measures the Shapley values of the input features, i.e., the average marginal contribution of feature values across all possible features set in the model. It is the same thing assharing a gift evenly among friends based on our contributions. SHAP values help you determine the rank of importance of the prediction for each feature of your dataset. Looking at these numbers quantifies which variables matter most when you want to know what your model is telling you and which ones do notmatter at all. This enabled better and more streamlined models while also not compromising performance, resulting in abetter choice of features. SHAP values are a statistically applicable way of ranking features based on significance. The Shapley value estimation of the jth feature with i combinations of features, target feature x, j index, data streams D with matrix X, and predictive model f is calculated as follows:

\emptyset_{i j} = f^{*} (x + j) - f^{*} (x - j)

(2)

where

\emptyset_{i j}

denotes the average Shapley value of the j-th feature through the ith feature,

f^{*} (x + j)

is the prediction for target x with a random number, including the j-th feature, and the group of features absent of the j-th feature is denoted as fˆ(xj). The equation is as follows to calculate the Shapley value of the j-th feature, which is the target, x, in general:

\emptyset_{j} (x) = \frac{1}{n} \sum_{k = 1}^{n} (\emptyset_{i j})

(3)

The significance of every attribute is calculated consistently and is ranked according to their Shapley rate in a predetermined arrangement. The model is subsequently trained to utilize the most significant attribute, as demonstrated in the subsequent iterations, starting from the top and progressing until the most favorable attribute subset is recognized.

1st interaction: $k_{1}$ = $f_{1}$
2nd interaction: $k_{2}$ = $f_{1} f_{2}$
3rd interaction: $k_{3}$ = $f_{1} f_{2} f_{3}$ , $\dots \dots \dots f_{n}$
Ultimately, the objective problem can be effectively modeled by identifying and utilizing the optimal feature subset.

In this paper, dynamic SHAP interaction feature selection (DSHAP-IFS) with gradient-boosted decision trees (GBDT) is introduced—an innovative technique for feature selection by exploiting interactions among features to improve the model performance. Like a house built on the foundation of GBDT, DSHAP-IFS increases the importance of feature selection and contributes insight into the importance of the feature in the dataset. Through the iterative refinement of feature selection, by dynamically augmenting SHAP values, DSHAP-IFS can capture the subtle correlations among features, enabling further enhancement of model interpretability and predictive performance. This methodology helps identify the critical features that make a model work so that the learned model can be applied more rigorously to predict output for a variety of different applications. The main objectives and contributions of this article are given below:

Introduction of a sophisticated method leveraging SHAP-fusion GBDT for precise detection and classification of BRBs in DTC-controlled induction motors.
Application of extensive feature engineering and SHAP-based feature selection to extract informative features from electrical signals (current, voltage, torque, speed) and motor characteristics.
Explore the impact of SHAP-based feature selection on model interpretability and understanding of the underlying mechanisms driving BRB detection and classification in DTC-controlled induction motors.
To ensure that the proposed method performs reliably under diverse loading conditions (0%, 25%, 50%, 75%, and 100%) and attains a high accuracy rate (99%) in the detection and classification of broken rotor bars.
Application of adaptive fold cross-validation to reduce the overfitting in which the number of folds is changed during the optimization process.
Demonstration of consistent and reliable classification performance of the GBDT classifier under varying loading conditions, ensuring accurate detection and classification of BRBs across different operational scenarios.
Significance in advancing the field of machine learning for motor anomaly detection by achieving an impressive accuracy rate of 99% for all loading conditions, thereby contributing to the development of preventative maintenance strategies and enhancing the dependability of DTC-controlled induction motors.

2. Detection and Classification of Anomalies in Induction Motors: Analysis of Electrical Signatures in Healthy and Faulty States, with Emphasis on Broken Rotor Bars

This work focuses on the detection of problems in induction motors by analyzing electrical signals, with particular attention on identifying broken rotor bars. The objective was to obtain a precise representation of the motor’s performance, particularly in identifying any damaged rotor bars. In a healthy motor, the phase current, phase voltage, speed, and torque should exhibit a smooth and sinusoidal waveform. Furthermore, when the rotor bars of the motor are broken, an extra harmonic component is generated around the left sideband (LSB) and right side band (RSB)around the fundamental component at 50 Hz in the current, voltage, speed, and torque, which can be used to detect the failure in loaded conditions. Below is a brief explanation of the healthy and faulty states for phase current, voltage, torque, and speed: The following Equations (2)–(9) represent the healthy and faulty equations for current, voltage, speed, and torque and are given below:

2.1. Current Analysis

Takethe healthycondition of a motor as ℎ and the defective state (namely, a broken rotor bar) as brb. The mathematical representations for current (I), voltage (V), speed (N), and torque (T) in both healthy and malfunctioning situations are as follows:

2.1.1. Healthy Condition

In a healthy state, take the phase current

I_{p h}

that is utilized to describe the sinusoidal function:

I_{p h} = I_{m} \sin (ω t + \emptyset)

(4)

2.1.2. Faulty Condition

In a faulty state, take the phase current

I_{p h}

that is used to represent an additional harmonic fault-induced current component:

I_{p h} = I_{m} \sin (ω t + \emptyset) + I_{f} \sin (ω_{f} t + \emptyset_{f})

(5)

The amplitude of the fault-induced current is represented by

I_{f}

, the fault frequency is denoted as

ω_{f}

, and the phase angle of the fault component is indicated by

\emptyset_{f}

. This equation demonstrates that the failure contributes a novel frequency component to the current signal.

2.2. Voltage Analysis

2.2.1. Healthy Condition

Take voltage

V_{r h}

in that can be characterized by the following equation when it is in a healthy state:

V_{r h} = I_{r h} (R_{r} + j X_{r}) + I_{m h} (j X_{m})

(6)

where

I_{m h}

represents the magnetizing current,

R_{r}

represents the rotor resistance,

X_{m}

represents the magnetizing reactance, and

I_{r h}

represents the rotor current.

2.2.2. Faulty Condition

In a faulty state (broken rotor bars), the voltage in the rotor can be described by:

V_{r b r b} = I_{r b r b} (R_{r} + j X_{r}) + I_{m b r b} (j X_{m})

(7)

The rotor current and magnetizing current with broken rotor bars are

I_{r b r b}

and

I_{m b r b}

, respectively.

2.3. Speed Analysis

2.3.1. Healthy Condition

Take

N_{h}

to represent the rotor speed in a healthy state of motor:

N_{h} = (1 - S_{h}) . \frac{120 f}{P}

(8)

The variables in the equation are represented as follows:

S_{h}

represents the slip,

f

for frequency, and

P

for the number of poles. This equation represents the relationship between the speed of the rotor, the slip, the frequency, and the number of poles in an optimal state.

2.3.2. Faulty Condition

In a faulty state (broken rotor bars), the rotor speed

N_{b r b}

can be described as:

N_{b r b} = (1 - S_{b r b}) \frac{120 f_{b r b}}{P} + N_{f} Sin (2 ω_{s} t + \emptyset_{n})

(9)

Under faulty conditions,

f_{b r b}

represents frequency,

S_{b r b}

represents slip,

N_{f}

represents the amplitude of fault-induced speed oscillation,

ω_{s}

represents synchronous speed frequency, t represents time, and

\emptyset_{n}

represents fault component phase angle.

2.4. Torque Analysis

2.4.1. Healthy Condition

In the healthy state,

T_{h}

can be described as:

T_{h} = \frac{3}{2 π} \frac{P}{S_{h}} \frac{V_{s h}^{2}}{R_{s}} \frac{S_{h}}{S_{h}^{2} + {(X_{s} + X_{m})}^{2}}

(10)

In which

P

illustrates the no. of poles,

S_{h}

is the slip,

R_{s}

is the rotor resistance,

X_{s}

is the stator resistance and

X_{m}

shows the magnetizing reactance.

2.4.2. Faulty Condition

In faulty state (broken rotor bars),

T_{b r b}

can be described as:

T_{b r b} = \frac{3}{2 π} \frac{P}{S_{b r b}} \frac{V_{s b r b}^{2}}{R_{s}} \frac{S_{b r b}}{S_{b r b}^{2} + {(X_{s} + X_{m})}^{2}}

(11)

where,

S_{b r b}

is the slip under faulty condition, and

V_{s b r b}^{2}

are the stator voltages under faulty circumstances. This equation shows how the torque is affected by the presence of broken rotor bars, reflecting changes in the motor’s performance due to the fault.

3. Dynamic SHAP Interaction Feature Selection (DSHAP-IFS) with GBDT

Our research introduces DSHAP-IFS, a novel feature selection technique designed for identifying abnormalities in motor systems. DSHAP-IFS combine SHAP (SHapley Additive exPlanations) values with gradient-boosting decision trees (GBDT). DSHAP-IFS is designed to interact with several electrical signal features, such as current, voltage, speed, and torque, and accurately determine the dynamic significance of these aspects. Selection of effectors: The process refines feature selection iteratively by enhancing SHAP values through feature interactions, hence indicating the features chosen for model training. GBDT is utilized by DSHAP-IFS to accurately detect intricate feature interactions and non-linear relationships within electrical signals. This leads to enhanced modeling accuracy and increased interpretability of the model. Hence, the method given offers a comprehensive understanding of the significance of the features necessary for identifying and choosing important features from the dataset. This knowledge might be valuable in improving the effectiveness of using the dataset to detect abnormalities in motor systems. The proposed approach is shown in Algorithm 1.

Algorithm 1. DSHAP-IFS

Input features

▪: Feature matrix X∈ $R^{n * 8}$ containing features [Va, Vb, Vc, Ia, Ib, Ic, torque, speed]
▪: Target vector Y ∈ ${0, 1}^{n}$ representing Fault types
▪: Iterations: number of iterations for the feature selection process
▪: K: top number of features to be selected

Output

▪: $Reduced feature matrix X_S_{k}$ $, Final GBDT model y_f i n a l$

Initialization

▪: $S^{0}$ = {Va, Vb, Vc, Ia, Ib, Ic, torque, speed}
▪: $Train the GBDT model : \hat{y} = f (X;$ Ѳ)
▪: Calculate initial SHAP values:
▪: For each feature j in {Va, Vb, Vc, Ia, Ib, Ic, torque, speed} and each sample i in {1, 2, 3, …, n} through
▪: $\emptyset_{i, j} = S H A P (X_{i, j}; f; θ)$ :

2.: Iterative feature selection

▪: For t = 0 to iterations
▪: Enhance SHAP values with interactions:
▪: $For each feature j in S^{(t - 1)}$

\emptyset_{i, j}^{e n h a n c e m e n t s} = \emptyset_{i, j} + \sum_{k \neq j} i n t e r a c t i o n (\emptyset_{i, j}, \emptyset_{i, k})

(12)

3.: Compute dynamic feature importance

I_{j} = \frac{1}{n} \sum_{i = 1}^{n} | \emptyset_{i, j}^{e n h a n c e d} |

(13)

▪: Calculate the average absolute enhanced values.

4.: Select features for the next iteration

S^{(t + 1)} = {j \in N | I_{j}^{(t)} \geq τ^{(t)}}

(14)

▪: $Define threshold τ^{(t)}$ and update the selected features.
▪: $Combine SHAP values to rank features I_{j}$ and choose the top k attributes with the highest priority values:

S_{k} = {j_{1}, j_{2}, \dots, j_{k}}, where, I_{j 1 >} I_{j 2 >}, \dots, I_{j k}

(15)

▪: $Train the final GBDT model using only the selected features S_{k}$ $and introduce the X_{s k}$ be the reduced feature matrix containing only the selected features.

\hat{y_{f i n a l}} = f (S_{k}; θ_{f i n a l})

(16)

5.: Return

▪: $X_S_{k}$
▪: $y_f i n a l$

End algorithm

By capturing more complex feature correlations, dynamic augmentation of SHAP value via feature interactions makes this method distinctive. This strategy can improve feature selection and model performance by providing a more holistic understanding of feature significance. After finding the faulty features in the instantaneous spectrum, the algorithm for selecting and extracting strong features was implemented in the subsequent steps.

DSHAP-IFS is a novel feature selection technique that leverages the SHAP (Shapley Additive exPlanations) values in combination with gradient-boosting decision trees (GBDT).
DSHAP-IFS dynamically adjusts the importance of features based on their interaction with other features, allowing for more accurate and comprehensive feature selection.
By integrating with GBDT, DSHAP-IFS can effectively handle complex feature interactions and non-linear relationships within the data.
DSHAP-IFS follows an iterative process to enhance feature selection, iteratively updating the importance of features based on their interactions and selecting the top features for model training.
The combination of DSHAP-IFS with GBDT results in improved model performance by selecting the most relevant features and capturing complex feature interactions.
One of the main advantages of DSHAP-IFS is that it provides a comprehensive perspective on feature importance, considering not only individual feature importance but also their interactions, hence leading to more robust and interpretable models. A flowchart of DSHAP-IFS is shown in Figure 2.

4. Proposed Technique DSHAP-IFS Model Performance Evaluation Criteria

In analyseslike regression, the model error refers to the inconsistency between the observed assessment in a sample and the values predicted by the regression model. The model’s performance was assessed using the following evaluation criteria:

Mean Absolute Error (MAE): MAE is just the sum of the difference between the predicted and actual (predicted–actual) values. It gives a clear idea of how our model is performing, irrespective of the bias of the errors.

M A E = \frac{1}{n} \frac{\sum_{i = 1}^{n} | \hat{y_{i}} - y_{i} |}{n}

(17)

RMSE (Root Mean Squared Error): The RMSE calculates the square root of the average of squared differences between predicted and actual analysis. It puts heavy weight on larger errors than the mean absolute error.

R M S E = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(\hat{y_{i}} - y_{i})}^{2}}

(18)

MAPE (Mean Absolute Percentage Error): MAPE is the mean absolute percentage error, which is the simplest, and in the machine learning world, it is the amount of error between the predicted values and the true value. It is a comparative measure and expresses accuracy as a percentage.

M A P E = \frac{100}{n} \sum_{i = 1}^{n} | \frac{\hat{y_{i}} - y_{i}}{y_{i}} |

(19)

Table 1 shows the summary of different feature selection (FS) methods, consisting of Permutation Importance, LIME, BORUTA, MOMI, LASSO, SHAP, and the introduced DSHAP-IFS approach. The feature importance we calculated uses an algorithm called Permutation Importance to measure the importance of a feature by observing the performance of models run after shuffling (randomizing) feature values. LIME approximates complex model predictions with interpretable models, and this offers local interpretability. The importance of other features is determined by BORUTA against random probes. MOMI simply gives greedy information on which features are leading to the model being more complex (MOMI is a model-agnostic algorithm). LASSO promotes sparsenessin the models by penalizing the coefficients of the features. Interpreting model predictions with SHAP values, the RESHAP-IFS method adds SHAP values togradient-boosting decision trees to improve feature selection. It surpasses all competing methods in accuracy as well as execution time, suggesting it is a promising solution for FS tasks.

Figure 3 shows the DSHAP-IFS feature importance plot, which provides a robust approach to interpreting machine learning model predictions by estimating the contribution of each feature to individual predictions. The graph shows Ib as the top feature from the dataset; red values show high-importance features, and blue values show low-importance features. Observations on the plot provide additional information about which features were important, helping understand the contribution of different input variables to model outcomes and supporting decision-making when deploying and improving the model.

5. Gradient-Boosting Decision Trees Approach

Gradient-boosting decision trees (GBDT) is a machine learning architecture used for regression and classification tasks by generating a prediction model by merging multiple weak prediction models, usually decision trees. The forest of trees grows based on the idea that each new tree learns from the mistakes made by the previous trees and makes corrections accordingly. Ensemble learning refers to the utilization of various machine learning techniques to perform predictive modeling. Gradient-boosting decision trees (GBDT) is an ensemble technique that aims to minimize bias and variation by aggregating the predictions of numerous base estimators [42]. For any dataset having d dimensions and n examples

D = \{(x_{i,} y_{i})\},

(

x_{i} ϵ R^{d}, y_{i} ϵ

R, I = 1, 2, 3,…, n), the output F is forecasted as the addition of K additive functions, which is expressed as

F_{x} = \sum_{m = 0}^{M} β_{m} h (x; {{R_{l m}}_{I}^{L}})

(20)

In which h shows a tree by L nodes, and

R_{l m}

denotes the partitioned region defined by the terminal node l of the m-th tree expansion coefficients M0 are jointly fitted to the training data set with

β_{m} h

by minimizing a regularized objective function.

ζ = \sum_{i}^{n} ψ (y_{i}, F (x_{i}))

(21)

where

ψ

a differentiable function for loss that is selected as the squared error this investigation and loss function is minimized by iteratively incorporating leaf nodes that generate the steepest descent, which is mathematically expressed as:

γ_{l m} = a r g \min_{γ} \sum_{x_{i} ε R_{l m}}^{N} ψ (y_{i}, F_{m - 1} (x_{i}) + γ)

(22)

F_{m} (x) = F_{m - 1} (x) + v γ_{l m} (x_{i} ϵ R_{l m})

(23)

where the shrinkage factor is

v

for the range of (0, 1] that regulates the learning rate of the training procedure. Empirically, small attributes of

v

are advantageous for the model’s preservation and, as a result, contribute to its generalization capability. In conclusion, GBDT is an effective and versatile ensemble learning approach that continues the building of models by iteratively building trees that correct the errors made earlier. It tackles the problem as a single optimization problem to be minimized with gradient descent, which makes it applicable to a very large range of regression and classification tasks. At the heart of the mathematical basis of GBDT, one computes the gradients, fits decision trees to residuals (RFT), and iteratively updates the model to improve performance.

6. Optimized Hyperparameters for Gradient-Boosted Decision Trees: Balancing Model Complexity and Overfitting

Table 2 describes the optimal hyperparameters for the gradient-boosted decision trees (GBDT) tuning, which is designed to balance model complexity and reduce overfitting. The number of estimators (n_estimators), the learning rate (learning_rate), the maximum tree depth (max_depth), the minimum number of samples required to split a node (min_samples_split) and to be a leaf (min_samples_leaf), and the fraction of samples to be used for tree fitting (subsample) are all critical parameters. The loss function, criterion, and max_features hyperparameters are also set to specify the number of features to consider the loss function, and criterion for node separation, respectively. In GBDT models, such parameters ensure a balance between the model complexity and generalization performance per iteration.

7. Experimental Setup for Data Collection and Acquisition

Two identical machines were positioned back-to-back on a shared mechanical base to ensure precise measurements. One of these machines functioned as the load motor, outfitted with an industrial inverter to refine slip precision and control. To minimize its impact, the load-side inverter operated in scalar mode rather than DTC mode. Data collection encompassed various control contexts, including grid-fed, scalar, and direct torque control (DTC). DTC, representing an advanced drive control technique, holds promise for replacing conventional pulse width modulation (PWM) drives. Phase currents were meticulously measured at a sampling frequency of 20 KHz to yield detailed insights. Alternatively, power was supplied to the motor through an industrial inverter operating in direct current mode. Data collection involved diverse conditions, encompassing both healthy states and those featuring one, two, and three damaged rotor bars. The dataset comprised information from induction machines, capturing both normal and damaged rotor bar scenarios. This dataset formed the basis for training machine learning models tailored for fault detection that were categorized into healthy (no broken rotor bar) and faulty data, specifying instances with 1 BRB, 2 BRB, and 3 BRB. The training dataset consisted of 1.2 million samples at a frequency of 20 KHz, each representing a distinct scenario. Additionally, 300,000 samples were reserved for validation purposes. Figure 4 illustrates the healthy and faulty broken rotor bars that are artificially created by the drill to collect data. Figure 4 and Figure 5 showcases the rotors with broken rotor bars and the back-to-back connection of test and loading machines. Meanwhile, Figure 6 outlines the step-by-step procedure of the suggested technique for detecting and classifying abnormalities in BRBs. The parameters of the test machine are given in Table 3.

Dataset Statistics

The dataset provides comprehensive statistics for detecting broken rotor bars in direct torque-controlled (DTC) induction motors under varying loading conditions. It is organized by fault type, with each fault type monitored at five different loading conditions: 0%, 25%, 50%, 75%, and 100%. Data collection is consistent across all conditions, with a sampling frequency set at 20 kHz. For the healthy condition, the dataset includes 8250 samples, with 1650 samples collected per loading condition. This category represents the motor’s normal operational state without any faults, providing a baseline for comparison. The dataset consists of 8250 samples of motors operating with a single broken rotor bar and consistently dispersed across five different loading conditions. This guarantees a comprehensive description of the motor’s performance across all operational loads. The dataset for two damaged rotor bars consists of 8250 samples, which are used to analyze the behavior under varying levels of the loading parameter. The 3 BRB category comprises an equivalent number of samples, specifically 8250 samples, taken simultaneously under identical loading conditions and sampling frequencies. The dataset comprises a total of 33,000 samples, with an equal distribution across the four fault types (healthy, 1 BRB, 2 BRB, and 3 BRB) and five loading circumstances. Each fault type and loading condition combination includes 1650 samples, ensuring a balanced dataset for robust training and evaluation of machine learning models. The high-resolution data captured at a sampling frequency of 20 kHz is critical for accurately detecting and diagnosing motor faults, providing valuable insights for preventative maintenance and fault prediction in DTC-controlled induction motors. The details of dataset statistics is shown in Table 4.

8. Results and Discussion

Figure 6, Figure 7, Figure 8 and Figure 9 represent the frequency spectrum of the motor’s current, voltage, speed, and torque underdifferent loading circumstances. As the system load progressively rises from 0% to 100%, many parameters, including current, voltage, torque, and speed, undergo alterations in their spectral characteristics. These changes are evident as a rise in the occurrence of harmonics surrounding the primary frequency component, usually seen at about 50 Hz in the current and voltage signal spectrum. More specifically, these harmonics represent sidebands known as the left-side band (LSB) and right-side band (LSB) around the center frequency. The left-side band is the lower frequency, and the right-side band is the higher frequency. The appearance and strengthening of these harmonics in the signal spectrum demonstrate the impact of load variation on the electrical system, which is of great importance for understanding the system’s behavior and performance in different operating conditions. The fast Fourier transform (FFT) analysis of current, voltage, speed, and torque signals for different conditions (healthy, 100% load, 1 BRB, 2 BRB, and 3 BRB phase unbalance) is an indispensable tool to investigate the operating performance and fault conditions of induction motors. Whenever the motor is healthy and operating at full load, the FFT will also show the highest frequency components for the normal motor mode of operation. Introducing one broken rotor bar (BRB) results in additional frequency components indicative of the fault, with further changes observed in the presence of two or three broken rotor bars. By comparing the FFT results across different conditions, characteristic frequency signatures of rotor bar faults emerge, enabling effective diagnosis and assessment of fault severity in induction motors, thereby facilitating condition monitoring and maintenance strategies for optimal motor performance and reliability. In the context of induction motor operation, as the load increases from 0% to 100%, the amplitude of harmonics around the fundamental component tends to increase. The fundamental component, typically appearing at 50Hz in power systems operating at 50 Hz, represents the primary frequency associated with the motor’s rotational speed. This phenomenon can be attributed to the non-linear behavior of the motor under varying load conditions, which leads to the generation of additional harmonics in the current, voltage, speed, and torque signals. As the load increases, the motor experiences greater magnetic flux variations and saturation effects, resulting in non-sinusoidal waveforms with enhanced harmonic content. This observation is consistent with the principles of electrical machinery and power system analysis, where load variations impact the spectral characteristics of motor signals due to changes in operating conditions and system dynamics. Such findings contribute to the academic understanding of motor behavior and provide insights into the effects of load variations on harmonic distortion in induction motors, offering implications for motor performance assessment and condition monitoring strategies in practical applications. Figure 7, Figure 8, Figure 9 and Figure 10 illustrate the frequency domain analysis of current, voltage, torque, and speed signals for healthy, 1 BRB, 2 BRB, and 3 BRB at different full-load scenarios.

The confusion matrix illustrates the classification performance of a machine learning model in distinguishing between healthy and faulty rotor bar conditions in induction motors operating at 100% load. Therefore, a properly functioning and robust system would be the most precise, as it correctly identified the highest percentage of healthy rotor bars as healthy. The classification accuracy diminishes as the fault severity escalates from 1 BRB to 2 BRB and 3 BRB, indicating greater difficulty in determining the presence and kind of rotor bar faults. The expected performance of the model can be assessed using the confusion matrix, which provides a concise overview of the true positive, true negative, false positive, and false negative classifications of rotor bar circumstances. An in-depth analysis of the confusion matrix will be conducted to gain a comprehensive understanding of the model’s ability to detect and categorize various fault scenarios. This analysis aims to improve fault diagnosis and condition monitoring strategies for induction motors operating under diverse load conditions. Figure 11 shows the confusion matrix analysis for a healthy state and faulty states (1 BRB, 2 BRBs, and 3 BRBs) at 100% loading circumstances.

A ROC curve describes the classification model’s ability to differentiate between normal and defective rotor bar conditions in induction motors. The curves are presented for three fault levels: 1 BRB, 2 BRB, and 3 BRB. A receiver operating characteristic (ROC) curve illustrates the relationship between the sensitivity (true positive rate) and the specificity (1—false positive rate) of a classification model when the threshold for classifying data points varies. The ROC curves in this study assess the model’s capacity to correctly identify true-positive cases while minimizing false-positive detections across various fault scenarios. An ideal classification model will have an ROC curve that roughly aligns with the upper-left corner of the plot, indicating a high level of sensitivity and a low rate of false positives. By analyzing the ROC curves of healthy and faulty rotor bars, we may obtain vital information about the model’s capacity to effectively identify rotor bar faults of different levels of severity in various operating settings. Figure 12 shows the receiver operating characteristic curves for a healthy state: 1 BRB, 2BR, and 3 BRB at 100% loading conditions.

Table 5, Table 6, Table 7 and Table 8 demonstrate the performance metrics for the intact rotor bars and various fault scenarios (1 BRB, 2 BRBs, and 3 BRBs) when the induction motor operates at different load levels ranging from 0% to 100%. The tables collectively illustrate the classification model’s performance metrics—accuracy, precision, recall, and F1 score—under various loading conditions for healthy rotor bars and rotor bars with one, two, and three broken bars (1 BRB, 2 BRBs, and 3 BRBs). For healthy rotor bars, the model demonstrates exceptional performance, achieving perfect accuracy and high precision, recall, and F1 scores across all load levels. The consistent achievement of performance measures with accuracy levels exceeding 99% for healthy rotor bars signifies that the model possesses the capability to accurately and precisely classify these bars amidst varying loads. Specifically, the accuracy falls a bit when looking at the 1 BRB conditions, but the precision, recall, and F1 scores are consistent and high, showing the model is working relatively well. Weight fall of the fault impact increases to 2 BRBs, down accuracy to 2 BRBs, and the minimized precision, recall, and F1 scores alsoshowthat the difficulty of fault detection rises from healthy to faulty conditions. For 4BRBs, the performance of the proposed model drops significantly due to even lower precision, recall, and F1 scores as well as reduced accuracy compared to 3 BRBs, indicating that this model faces more difficulties in identifying and classifying severe faults under stress conditions. These requirements highlight the importance of developing powerful algorithms that not only work effectively for a wide range of fault scenarios but also ensure that high performance is maintained, hence enabling good fault diagnostics and condition monitoring in real industrial machinery. We employ these tables to assess the performance of powerful fault-tracing algorithms on real industrial machinery. Makarov and Goresky highlight the importance of operational conditions and fault severity in the establishment of successful fault diagnosis and condition monitoring schemes.

The performance measures during the detection and classification of 3 BRBs (broken rotor bars) in direct torque-controlled (DTC) induction motors under different loading conditions are illustrated in Figure 13. The rows correspond to a percentage of load from 0% to 100%. Accuracy, precision, recall, and F1 score: These are the metrics that will be used to calculate the performance of the classification model. 0% load: It means that the model indicates the actual condition 98.64% of the time, whetherthe part is healthy or faulty. Consequently, we have obtained high test accuracy, e.g., precision, recall, and F1 score metrics show consistent performance, ranging from 94.11% to 94.38%. In general, the results presented show that the approach using dynamic SHAP feature selection with GBDT can decently identify cracked rotor bars in DTC induction motors at different load conditions. The high accuracy, precision, recall, and F1 score metrics highlight the reliability of the classification model, enlarging itsin predicting maintenance and fault prevention of industrial drive systems.

9. Conclusions and Future Recommendations

This research presents an improved and advanced diagnostic approach for the detection and classification of broken rotor bars in direct torque-controlled (DTC) induction motors by leveraging the dynamic SHAP interaction feature selection (DSHAP-IFS) methodology in conjunction with gradient-boosting decision trees (GBDT). The suggested technique represents significant efficiency in identifying and classifying the rotor bar defects under various loading conditions (0%, 25%, 50%, 75%, and 100%). Through extensive feature engineering and the application of improved SHAP-based feature selection, the methodology effectively isolates informative features from current, voltage, speed, torque, and motor characteristics, leading to high-performance classification. The results indicate a remarkable accuracy rate of 99% across all loading scenarios, showcasing the potential of this approach for reliable fault detection and condition monitoring in industrial machinery. This method not only enhances the precision of fault diagnosis but also contributes to the sustainability and operational efficiency of induction motors by enabling proactive maintenance strategies. The proposed SHAP-fusion GBDT approach marks a significant advancement in machine learning applications for motor anomaly detection, providing a robust solution for ensuring the dependability and longevity of DTC-controlled induction motors. It also improves the reliability of DTC-controlled induction motors. The effective use of the SHAP-Fusion GBDT approach in this study paves the way for its use in other fault diagnosis and predictive maintenance in various process sectors. Machine learning can be utilized alongside advanced feature selection techniques such as SHAP, allowing for the practical implementation of data-driven approaches to improve operational efficiency and system reliability, allowing for future development and expansion. The following list outlines several future recommendations that will help researchers proceed with the work for the proposed methodology:

Investigate the integration of real-time monitoring capabilities into the diagnostic framework to enable proactive fault mitigation strategies.
Explore the scalability and computational efficiency of the SHAP-Fusion GBDT methodology for deployment in large-scale industrial environments.
Conduct further research to enhance the interpretability of SHAP-based feature selection and its implications for understanding the diagnostic process.
Extend the evaluation of the diagnostic approach to different motor types, operating conditions, and fault severities to enhance its versatility and applicability.
Investigate the potential integration of predictive maintenance capabilities to enable predictive fault detection and maintenance scheduling.

Author Contributions

Conceptualization, B.A. and M.A.K.; methodology, M.A.K.; software, M.A.K.; validation, B.A., T.V. and A.K.; formal analysis, M.A.K.; investigation, B.A.; resources, T.V. and A.K.; data curation, B.A.; writing—original draft preparation, M.A.K.; writing—review and editing, B.A.; visualization, T.V.; supervision, B.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflict of interest.

References

Gundewar, S.K.; Kane, P.V. Condition monitoring and fault diagnosis of induction motor. J. Vib. Eng. Technol. 2021, 9, 643–674. [Google Scholar] [CrossRef]
Fu, P.; Wang, J.; Zhang, X.; Zhang, L.; Gao, R.X. Dynamic routing-based multimodal neural network for multi-sensory fault diagnosis of induction motor. J. Manuf. Syst. 2020, 55, 264–272. [Google Scholar] [CrossRef]
Sheikh, M.A.; Bakhsh, S.T.; Irfan, M.; Nor NB, M.; Nowakowski, G. A review to diagnose faults related to three-phase industrial induction motors. J. Fail. Anal. Prev. 2022, 22, 1546–1557. [Google Scholar] [CrossRef]
Gyftakis, K.N.; Cardoso, A.J.M. Reliable detection of stator interturn faults of very low severity level in induction motors. IEEE Trans. Ind. Electron. 2020, 68, 3475–3484. [Google Scholar] [CrossRef]
Yetgin, A.G.; Turan, M.; Cevher, B.; Çanakoğlu, A.İ.; Gün, A. Squirrel cage induction motor design and the effect of specific magnetic and electrical loading coefficient. Int. J. Appl. Math. Electron. Comput. 2019, 7, 1–8. [Google Scholar] [CrossRef]
De Souza, D.F.; Salotti FA, M.; Sauer, I.L.; Tatizawa, H.; de Almeida, A.T.; Kanashiro, A.G. A Performance Evaluation of Three-Phase Induction Electric Motors between 1945 and 2020. Energies 2022, 15, 2002. [Google Scholar] [CrossRef]
Adamou, A.A.; Alaoui, C. Energy efficiency model-based Digital shadow for Induction motors: Towards the implementation of a Digital Twin. Eng. Sci. Technol. Int. J. 2023, 44, 101469. [Google Scholar] [CrossRef]
Fernandez-Cavero, V.; García-Escudero, L.A.; Pons-Llinares, J.; Fernández-Temprano, M.A.; Duque-Perez, O.; Morinigo-Sotelo, D. Diagnosis of broken rotor bars during the startup of inverter-fed induction motors using the dragon transform and functional ANOVA. Appl. Sci. 2021, 11, 3769. [Google Scholar] [CrossRef]
Wang, W.; Song, X.; Liu, G.; Chen, Q.; Zhao, W.; Zhu, H. Induction motor broken rotor bar fault diagnosis based on third-order energy operator demodulated current signal. IEEE Trans. Energy Convers. 2021, 37, 1052–1059. [Google Scholar] [CrossRef]
Eldeeb, H.H.; Secrest, C.; Zhao, H.; Mohammed, O.A. Time-Domain based Diagnosis of Stator Incipient Faults in DTC Driven Induction Motors using External ElectroMagnetic Signatures. In Proceedings of the 2021 IEEE Energy Conversion Congress and Exposition (ECCE), Vancouver, BC, Canada, 10–14 October 2021; pp. 5124–5128. [Google Scholar]
Senthil Kumar, R.; Gerald Christopher Raj, I.; Suresh, K.P.; Leninpugalhanthi, P.; Suresh, M.; Panchal, H.; Meenakumari, R.; Sadasivuni, K.K. A method for broken bar fault diagnosis in three phase induction motor drive system using Artificial Neural Networks. Int. J. Ambient Energy 2022, 43, 5138–5144. [Google Scholar] [CrossRef]
Meira, M.; Bossio, G.R.; Verucchi, C.J.; Ruschetti, C.R.; Bossio, J.M. Speed estimation during the starting transient of induction motors. IEEE Trans. Instrum. Meas. 2020, 70, 9000108. [Google Scholar] [CrossRef]
Goyal, D.; Mongia, C.; Sehgal, S. Applications of digital signal processing in monitoring machining processes and rotary components: A review. IEEE Sens. J. 2021, 21, 8780–8804. [Google Scholar] [CrossRef]
Ramu, S.K.; Irudayaraj GC, R.; Subramani, S.; Subramaniam, U. Broken rotor bar fault detection using Hilbert transform and neural networks applied to direct torque control of induction motor drive. IET Power Electron. 2020, 13, 3328–3338. [Google Scholar] [CrossRef]
Ramu, S.K.; Vairavasundaram, I.; Aljafari, B.; Kareri, T. Rotor Bar Fault Diagnosis in Indirect Field–Oriented Control-Fed Induction Motor Drive Using Hilbert Transform, Discrete Wavelet Transform, and Energy Eigenvalue Computation. Machines 2023, 11, 711. [Google Scholar] [CrossRef]
Abdellah, C.; Mama, C.; Meflah Abderrahmane, M.R.; Mohammed, B. Current Park’s vector pattern technique for diagnosis of broken rotor bars fault in saturated induction motor. J. Electr. Eng. Technol. 2023, 18, 2749–2758. [Google Scholar] [CrossRef]
Valtierra-Rodriguez, M.; Rivera-Guillen, J.R.; Basurto-Hurtado, J.A.; De-Santiago-Perez, J.J.; Granados-Lieberman, D.; Amezquita-Sanchez, J.P. Convolutional neural network and motor current signature analysis during the transient state for detection of broken rotor bars in induction motors. Sensors 2020, 20, 3721. [Google Scholar] [CrossRef] [PubMed]
Halder, S.; Bhat, S.; Dora, B.K. Inverse thresholding to spectrogram for the detection of broken rotor bar in induction motor. Measurement 2022, 198, 111400. [Google Scholar] [CrossRef]
El Idrissi, A.; Derouich, A.; Mahfoud, S.; El Ouanjli, N.; Chantoufi, A.; Al-Sumaiti, A.S.; Mossa, M.A. Bearing fault diagnosis for an induction motor controlled by an artificial neural network—Direct torque control using the Hilbert transform. Mathematics 2022, 10, 4258. [Google Scholar] [CrossRef]
Pietrzak, P.; Wolkiewicz, M. Stator Winding Fault Detection of Permanent Magnet Synchronous Motors Based on the Short-Time Fourier Transform. Power Electron. Drives 2022, 7, 112–133. [Google Scholar] [CrossRef]
Bensaoucha, S.; Bessedik, S.A.; Ameur, A.; Moreau, S.; Teta, A. A Comparative Study for Broken Rotor Bars Fault Detection in Induction Machine using DWT and MUSIC techniques. In Proceedings of the 2020 1st International Conference on Communications, Control Systems and Signal Processing (CCSSP), El Oued, Algeria, 16–17 May 2020; IEEE; pp. 523–528. [Google Scholar]
Zamudio-Ramirez, I.; Ramirez-Núñez, J.A.; Antonino-Daviu, J.; Osornio-Rios, R.A.; Quijano-Lopez, A.; Razik, H.; de Jesus Romero-Troncoso, R. Automatic diagnosis of electromechanical faults in induction motors based on the transient analysis of the stray flux via MUSIC methods. IEEE Trans. Ind. Appl. 2020, 56, 3604–3613. [Google Scholar] [CrossRef]
Dehina, W.; Boumehraz, M.; Kratz, F. Diagnosis and Detection of Rotor Bars Faults in Induction Motor Using HT and DWT Techniques. In Proceedings of the 2021 18th International Multi-Conference on Systems, Signals & Devices (SSD), Monastir, Tunisia, 22–25 March 2021; pp. 109–115. [Google Scholar]
Arabacı, H.; Mohamed, M.A. Detection of induction motor broken rotor bar faults under no load condition by using support vector machines. Int. J. Intell. Eng. Inform. 2021, 9, 470–486. [Google Scholar] [CrossRef]
Guezam, A.; Bessedik, S.A.; Djekidel, R. Fault diagnosis of induction motors rotor using current signature with different signal processing techniques. Diagnostyka 2022, 23, 2022201. [Google Scholar]
Martinez-Roman, J.; Puche-Panadero, R.; Sapena-Bano, A.; Burriel-Valencia, J.; Riera-Guasp, M.; Pineda-Sanchez, M. Locally optimized chirplet spectrogram for condition monitoring of induction machines in transient regime. Measurement 2022, 190, 110690. [Google Scholar] [CrossRef]
Fernandez-Cavero, V.; Pons-Llinares, J.; Duque-Perez, O.; Morinigo-Sotelo, D. Detection and quantification of bar breakage harmonics evolutions in inverter-fed motors through the dragon transform. ISA Trans. 2021, 109, 352–367. [Google Scholar] [CrossRef] [PubMed]
Nishat Toma, R.; Kim, J.M. Bearing fault classification of induction motors using discrete wavelet transform and ensemble machine learning algorithms. Appl. Sci. 2020, 10, 5251. [Google Scholar] [CrossRef]
Singh, M. Fault Diagnosis of a Three-Phase Induction Motor Using Stockwell Transform and Machine Learning Techniques. Ph.D. Thesis, Indian Institute of Technology Jodhpur, Jheepasani, India, 2020. [Google Scholar]
Misra, S.; Kumar, S.; Sayyad, S.; Bongale, A.; Jadhav, P.; Kotecha, K.; Abraham, A.; Gabralla, L.A. Fault detection in induction motor using time domain and spectral imaging-based transfer learning approach on vibration data. Sensors 2022, 22, 8210. [Google Scholar] [CrossRef]
Pietrzak, P.; Wolkiewicz, M. Fault diagnosis of PMSM stator winding based on continuous wavelet transform analysis of stator phase current signal and selected artificial intelligence techniques. Electronics 2023, 12, 1543. [Google Scholar] [CrossRef]
Barrera-Llanga, K.; Burriel-Valencia, J.; Sapena-Bañó, Á.; Martínez-Román, J. A comparative analysis of deep learning convolutional neural network architectures for fault diagnosis of broken rotor bars in induction motors. Sensors 2023, 19, 8196. [Google Scholar] [CrossRef]
Halder, S.; Bhat, S.; Zychma, D.; Sowa, P. Broken rotor bar fault diagnosis techniques based on motor current signature analysis for induction motor—A review. Energies 2022, 15, 8569. [Google Scholar] [CrossRef]
Ferrucho-Alvarez, E.R.; Martinez-Herrera, A.L.; Cabal-Yepez, E.; Rodriguez-Donate, C.; Lopez-Ramirez, M.; Mata-Chavez, R.I. Broken rotor bar detection in induction motors through contrast estimation. Sensors 2021, 21, 7446. [Google Scholar] [CrossRef]
Yan, R.; Shang, Z.; Xu, H.; Wen, J.; Zhao, Z.; Chen, X.; Gao, R.X. Wavelet transform for rotary machine fault diagnosis: 10 years revisited. Mech. Syst. Signal Process. 2023, 200, 110545. [Google Scholar] [CrossRef]
Fernandez-Cavero, V.; Pons-Llinares, J.; Duque-Perez, O.; Morinigo-Sotelo, D. Detection of broken rotor bars in nonlinear startups of inverter-fed induction motors. IEEE Trans. Ind. Appl. 2021, 57, 2559–2568. [Google Scholar] [CrossRef]
Liu, X.; Yan, Y.; Hu, K.; Zhang, S.; Li, H.; Zhang, Z.; Shi, T. Fault diagnosis of rotor broken bar in induction motor based on successive variational mode decomposition. Energies 2022, 15, 1196. [Google Scholar] [CrossRef]
Fares, N.; Aoulmi, Z.; Thelaidjia, T.; Ounnas, D. Learning Machine Based on Optimized Dimensionality Reduction Algorithm for Fault Diagnosis of Rotor Broken Bars in Induction Machine. Eur. J. Electr. Eng. 2022, 24, 171. [Google Scholar] [CrossRef]
Saidi, L.; Fnaiech, F.; Henao, H.; Capolino, G.A.; Cirrincione, G. Diagnosis of broken-bars fault in induction machines using higher order spectral analysis. ISA Trans. 2013, 52, 140–148. [Google Scholar] [CrossRef] [PubMed]
Salomon, C.P.; Santana, W.C.; Lambert-Torres, G.; da Silva, L.E.B.; Bonaldi, E.L.; de Oliveira, L.E.D.L.; Pellicel, A.; Figueiredo, G.C.; Lope, M.A.A. Discrimination of synchronous machines rotor faults in electrical signature analysis based on symmetrical components. IEEE Trans. Ind. Appl. 2016, 53, 3146–3155. [Google Scholar] [CrossRef]
Kumar, R.S.; Raj, I.G.C.; Alhamrouni, I.; Saravanan, S.; Prabaharan, N.; Ishwarya, S.; Gokdag, M.; Salem, M. A combined HT and ANN based early broken bar fault diagnosis approach for IFOC fed induction motor drive. Alex. Eng. J. 2023, 66, 15–30. [Google Scholar] [CrossRef]
Marcílio, W.E.; Eler, D.M. From explanations to feature selection: Assessing SHAP values as feature selection mechanism. In Proceedings of the 2020 33rd SIBGRAPI conference on Graphics, Patterns and Images (SIBGRAPI), Porto de Galinhas, Brazil, 7–10 November 2020; pp. 340–347. [Google Scholar]
Saberi, A.N.; Belahcen, A.; Sobra, J.; Vaimann, T. Lightgbm-based fault diagnosis of rotating machinery under changing working conditions using modified recursive feature elimination. IEEE Access 2022, 10, 81910–81925. [Google Scholar] [CrossRef]
Dişli, F.; Gedikpınar, M.; Sengur, A. Deep transfer learning-based broken rotor fault diagnosis for Induction Motors. Turk. J. Sci. Technol. 2023, 18, 275–290. [Google Scholar] [CrossRef]
Chisedzi, L.P.; Muteba, M. Detection of Broken Rotor Bars in Cage Induction Motors Using Machine Learning Methods. Sensors 2023, 23, 9079. [Google Scholar] [CrossRef]
Mangalathu, S.; Hwang, S.H.; Jeon, J.S. Failure mode and effects analysis of RC members based on machine-learning-based SHapley Additive exPlanations (SHAP) approach. Eng. Struct. 2020, 219, 110927. [Google Scholar] [CrossRef]
Nohara, Y.; Matsumoto, K.; Soejima, H.; Nakashima, N. Explanation of machine learning models using shapley additive explanation and application for real data in hospital. Comput. Methods Programs Biomed. 2022, 214, 106584. [Google Scholar] [CrossRef] [PubMed]
Antwarg, L.; Miller, R.M.; Shapira, B.; Rokach, L. Explaining anomalies detected by autoencoders using Shapley Additive Explanations. Expert Syst. Appl. 2021, 186, 115736. [Google Scholar] [CrossRef]
Ekanayake, I.U.; Meddage DP, P.; Rathnayake, U. A novel approach to explain the black-box nature of machine learning in compressive strength predictions of concrete using Shapley additive explanations (SHAP). Case Stud. Constr. Mater. 2022, 16, e01059. [Google Scholar] [CrossRef]
Gebreyesus, Y.; Dalton, D.; Nixon, S.; De Chiara, D.; Chinnici, M. Machine learning for data center optimizations: Feature selection using Shapley additive exPlanation (SHAP). Future Internet 2023, 15, 88. [Google Scholar] [CrossRef]
Santos, M.R.; Guedes, A.; Sanchez-Gendriz, I. SHapley Additive exPlanations (SHAP) for Efficient Feature Selection in Rolling Bearing Fault Diagnosis. Mach. Learn. Knowl. Extr. 2024, 6, 316–341. [Google Scholar] [CrossRef]
Al-Najjar, H.A.; Pradhan, B.; Beydoun, G.; Sarkar, R.; Park, H.J.; Alamri, A. A novel method using explainable artificial intelligence (XAI)-based Shapley Additive Explanations for spatial landslide prediction using Time-Series SAR dataset. Gondwana Res. 2023, 123, 107–124. [Google Scholar] [CrossRef]
Chen, H.; Lundberg, S.; Lee, S.I. Explaining models by propagating Shapley values of local components. In Explainable AI in Healthcare and Medicine: Building a Culture of Transparency and Accountability; Springer: Cham, Switzerland, 2021; pp. 261–270. [Google Scholar]

Figure 1. Broken rotor bar cross-sectional views of induction motor for (a) healthy, (b) 1 BRB, and (c) 3 BRB.

Figure 2. Flow chart of DSHAP-IFS for feature selection using GBDT.

Figure 3. Proposed DSHAP-IFS feature importance plot.

Figure 4. (a) Healthy BRB (b) one BRB(c) two BRB (d) three BRB.

Figure 5. Practical setup with loading motor (right side) and testing motor (left side).

Figure 6. Proposed fault diagnostic architecture for DTC-controlled induction machine for BRB detection and classification.

Figure 7. Frequency domain analysis of current signals for healthy, 1 BRB, 2 BRB, and 3 BRB at different full-load scenarios.

Figure 8. Frequency domain analysis of voltage signals for healthy, 1 BRB, 2 BRB, and 3 BRB at different full load scenarios.

Figure 9. Frequency domain analysis of torque signals for healthy, 1 BRB, 2 BRB, and 3 BRB at different full load scenarios.

Figure 10. Frequency domain analysis of speed signals for healthy, 1 BRB, 2 BRB, and 3 BRB at different full load scenarios.

Figure 11. Confusion matrix analysis for a healthy state, 1 BRB, 2 BRBs, and 3 BRBs at 100% loading conditions.

Figure 12. Receiver operating characteristic curves for a healthy state, 1 BRB, 2BR, and 3 BRB at 100% loading conditions.

Figure 13. Performance measures for broken rotor bars (BRBs) under various loading conditions.

Table 1. Comparison of feature selection techniques with proposed DSHAP-IFS.

Feature Selection Approach	MAE	RMSE	MAPE	Execution Time (s)
Permutation importance	0.612	0.0453	0.039	106.34
LIME	0.577	0.423	0.032	147.83
BORUTA	0.321	0.243	0.021	132.98
MOMI	0.342	0.369	0.018	111.43
LASSO	0.443	0.345	0.029	157.23
SHAP	0.0387	0.401	0.024	102.86
Proposed DSHAP-IFS	0.0317	0.221	0.0031	82.34

Table 2. Optimized hyperparameters for gradient-boosted decision trees (GBDT).

Parameters	Functions of Parameters	Default Value
n_estimators	The precise number of iterations	150
learning_rate	Max depth of every tree to define the learning rate of gradient boosting	0.01
max_depth	Maximum depth of each tree	5
min_samples_split	Minimum No. of samples required to split a node	2
min_samples_leaf	Minimum No. of samples required at a leaf node	7
max_features	Number of features to consider for each split	sqrt
subsample	The fraction of samples used for fitting each tree	0.9
loss	Loss function	deviance
criterion	When performing branching, calculate the impurity index of a weak estimate.	“Friedman_mse”

Table 3. Main parameters and specifications of the motor under investigation.

Sr.No	Parameter	Symbol	Value
1	Pole Configuration	P	4
2	Phases	$\emptyset$	3
3	Connection	$∆$ /Y	Delta/star
4	Stator slots	$N_{s}$	36 (non-skewed)
5	Rotor slots	$n_{r}$	28 (skewed)
6	Terminal voltage	$V_{s}$	690V/400V @ 50 Hz
7	Rated current	$I$	8.8A/13.5A
8	Rated power	$P_{r}$	7.5 kW @ 50 Hz
9	Rated slip	S	0.0667
10	Rated speed	$N_{r}$	1500rpm@50Hz

Table 4. Class labels and the related conditions for a dataset of DTC-controlled induction motors for BRB diagnosis.

Sr.No	Fault Type	No. of Samples Per Loading Condition	Total Samples	Sampling Frequency (KHz)	Loading Condition (%)	Fault Label
1	Healthy	1650	8250	20	0, 25, 50, 75, 100	Healthy
2	One BRB	1650	8250	20	0, 25, 50, 75, 100	1 BRB
3	Two BRB	1650	8250	20	0, 25, 50, 75, 100	2 BRB
4	Three BRB	1650	8250	20	0, 25, 50, 75, 100	3 BRB

Table 5. Performance measures for healthy bars under various loading conditions.

Load (%)	Accuracy	Precision	Recall	F1 Score
At 0% Load	1.00	0.9638	0.9639	0.9639
At 25% Load	1.00	0.9646	0.9647	0.9647
At 50% Load	1.00	0.966	0.9662	0.9661
At 75% Load	1.00	0.9652	0.9654	0.9653
At 100% Load	1.00	0.9638	0.9639	0.9639

Table 6. Performance measures for 1 BRB under various loading conditions.

Load (%)	Accuracy	Precision	Recall	F1 Score
At 0% Load	0.9978	0.9638	0.9639	0.9639
At 25% Load	0.9971	0.9646	0.9647	0.9647
At 50% Load	0.9961	0.9660	0.9662	0.9661
At 75% Load	0.9968	0.9652	0.9654	0.9653
At 100% Load	0.9965	0.9638	0.9639	0.9639

Table 7. Performance measures for 2 BRBs under various loading conditions.

Load (%)	Accuracy	Precision	Recall	F1 Score
At 0% Load	0.9865	0.9642	0.9639	0.9642
At 25% Load	0.9745	0.9641	0.9637	0.9639
At 50% Load	0.9853	0.9639	0.9639	0.9637
At 75% Load	0.9843	0.9642	0.9636	0.9631
At 100% Load	0.9798	0.9642	0.9631	0.9633

Table 8. Performance measures for 3 BRBs under various loading conditions.

Load (%)	Accuracy	Precision	Recall	F1 Score
At 0% Load	0.9764	0.9444	0.9421	0.9411
At 25% Load	0.9843	0.9442	0.9433	0.9418
At 50% Load	0.9647	0.9443	0.9427	0.9423
At 75% Load	0.9764	0.9441	0.9431	0.9412
At 100% Load	0.9867	0.9438	0.9425	0.9417

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Khan, M.A.; Asad, B.; Vaimann, T.; Kallaste, A. An Advanced Diagnostic Approach for Broken Rotor Bar Detection and Classification in DTC Controlled Induction Motors by Leveraging Dynamic SHAP Interaction Feature Selection (DSHAP-IFS) GBDT Methodology. Machines 2024, 12, 495. https://doi.org/10.3390/machines12070495

AMA Style

Khan MA, Asad B, Vaimann T, Kallaste A. An Advanced Diagnostic Approach for Broken Rotor Bar Detection and Classification in DTC Controlled Induction Motors by Leveraging Dynamic SHAP Interaction Feature Selection (DSHAP-IFS) GBDT Methodology. Machines. 2024; 12(7):495. https://doi.org/10.3390/machines12070495

Chicago/Turabian Style

Khan, Muhammad Amir, Bilal Asad, Toomas Vaimann, and Ants Kallaste. 2024. "An Advanced Diagnostic Approach for Broken Rotor Bar Detection and Classification in DTC Controlled Induction Motors by Leveraging Dynamic SHAP Interaction Feature Selection (DSHAP-IFS) GBDT Methodology" Machines 12, no. 7: 495. https://doi.org/10.3390/machines12070495

APA Style

Khan, M. A., Asad, B., Vaimann, T., & Kallaste, A. (2024). An Advanced Diagnostic Approach for Broken Rotor Bar Detection and Classification in DTC Controlled Induction Motors by Leveraging Dynamic SHAP Interaction Feature Selection (DSHAP-IFS) GBDT Methodology. Machines, 12(7), 495. https://doi.org/10.3390/machines12070495

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

An Advanced Diagnostic Approach for Broken Rotor Bar Detection and Classification in DTC Controlled Induction Motors by Leveraging Dynamic SHAP Interaction Feature Selection (DSHAP-IFS) GBDT Methodology

Abstract

1. Introduction

SHapely Additive exPlanations (SHAP)

2. Detection and Classification of Anomalies in Induction Motors: Analysis of Electrical Signatures in Healthy and Faulty States, with Emphasis on Broken Rotor Bars

2.1. Current Analysis

2.1.1. Healthy Condition

2.1.2. Faulty Condition

2.2. Voltage Analysis

2.2.1. Healthy Condition

2.2.2. Faulty Condition

2.3. Speed Analysis

2.3.1. Healthy Condition

2.3.2. Faulty Condition

2.4. Torque Analysis

2.4.1. Healthy Condition

2.4.2. Faulty Condition

3. Dynamic SHAP Interaction Feature Selection (DSHAP-IFS) with GBDT

4. Proposed Technique DSHAP-IFS Model Performance Evaluation Criteria

5. Gradient-Boosting Decision Trees Approach

6. Optimized Hyperparameters for Gradient-Boosted Decision Trees: Balancing Model Complexity and Overfitting

7. Experimental Setup for Data Collection and Acquisition

Dataset Statistics

8. Results and Discussion

9. Conclusions and Future Recommendations

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI