1. Introduction
The induction motor (IM) is a fundamental component across many industries [
1] due to its robust performance, high efficiency, and favorable power density. IMs are widely deployed in applications ranging from electric vehicles and aerospace drive systems to household appliances [
2,
3,
4]. Nevertheless, components within IM drive systems are vulnerable to failure from elevated electro-thermal and mechanical stresses; such failures undermine system reliability and can cause costly unplanned downtime. Consequently, fault diagnosis (FD) in converter-fed IMs is essential for ensuring the dependable operation of variable speed drive (VSD) systems, and it has therefore attracted substantial research attention, particularly in the contexts of renewable energy systems and smart grids [
5,
6,
7].
In inverter-driven IMs, stator windings are exposed to combined electromagnetic and mechanical stresses [
8] that can precipitate insulation degradation and eventual breakdown if not detected early [
9]. Undetected faults may produce irreversible damage to the motor and the wider drive system [
10]. Typical manifestations include coil-to-coil, phase-to-phase, and phase-to-ground faults, among others [
11].
The stringent performance and reliability demands of modern applications, such as electric vehicles and high-speed rail systems, have elevated fault detection in electric drive systems to a critical research priority [
12]. Historically, FD methods for IMs have fallen into two broad categories: model-based and signal-based approaches [
13]. Model-based techniques use mathematical models to reproduce both normal and faulty behavior, enabling theoretically grounded anomaly detection and precise residual analysis [
14]. In contrast, signal-based methods, including vibration analysis, temperature monitoring, and motor current signature analysis (MCSA), are commonly used to identify bearing defects, misalignment, rotor imbalance, and other mechanical issues [
15]. While effective for many applications, signal-based methods are often sensitive to noise, interference, and signal distortion, factors that can degrade diagnostic reliability in practice [
16].
Infrared thermography [
17] is a widely used technique for detecting abnormal thermal signatures. Similarly, MCSA methods, including Park’s vector [
18], extended Park’s vector [
19], and negative-sequence impedance [
20], have proven effective in diagnosing drive faults in line-fed machines. However, these approaches face significant challenges when applied to IM drives, particularly in detecting early-stage interturn faults and distinguishing them from harmonics generated by the drive system. Despite extensive research on IM fault detection, diagnosing incipient faults and accurately assessing their severity in drive-fed systems remains a complex task. Since fault development is generally gradual [
13], precise fault severity estimation is essential for timely maintenance planning and repair prioritization [
21].
Drive-fed IMs are now integral to modern industrial operations. As line-fed motors are increasingly replaced by voltage-source inverter (VSI)-fed systems, ensuring operational reliability and enabling early fault detection have become critical for minimizing unplanned downtime and mitigating financial losses [
22]. Timely identification of early-stage faults also supports preventive maintenance, reduces the risk of fault escalation, and safeguards the overall drive system [
23].
Inverter-fed motors experience elevated electrical stress on stator windings due to the high harmonic content of the supply voltage, which increases thermal loading [
24]. Consequently, effective condition monitoring systems must be capable of detecting faults regardless of the type of power supply. Fault diagnosis in converter-fed motors is particularly challenging because of electromagnetic interference (EMI), high switching frequencies, and rapid voltage transients inherent to inverter operation [
25]. These factors complicate the distinction between fault-induced anomalies and normal operating patterns, underscoring the need for advanced diagnostic techniques. Despite the importance of this topic, it remains relatively underexplored, especially under variable speed and torque conditions.
Artificial intelligence (AI)-based approaches have shown strong potential for modeling complex systems without requiring extensive prior knowledge or explicit parameter definitions [
26]. Such methods can automatically extract input features, making them particularly suitable for high-dimensional and nonlinear systems. However, conventional machine learning (ML) algorithms often struggle to process large datasets and typically rely on manual feature extraction, limiting their adaptability. In contrast, deep learning (DL), a subfield of ML, has recently achieved significant success in FD applications [
27,
28,
29,
30]. Hierarchical DL architectures are increasingly incorporated into condition monitoring frameworks to improve both diagnostic accuracy and automation [
14,
31,
32].
This study explores the use of such architectures for the early detection of interturn faults in power-electronics-fed drive systems. Specifically, we propose a novel hybrid framework that combines ML and DL for two-level decision making, fault classification, and severity assessment.
The proposed method is a multi-wide-kernel convolutional neural network (MWK-CNN) designed for fault classification and severity estimation in IM drives. Unlike conventional CNNs, which typically employ small kernels (e.g., 3 × 3), the MWK-CNN uses larger convolutional filters to capture a broader range of contextual features in the input data [
33]. Wide kernels offer several advantages, including an increased receptive field, enhanced shape bias, reduced computational complexity, and improved overall model performance. Despite these advantages, wide-kernel CNNs have seen limited adoption in FD applications. For instance, large-kernel CNNs have been applied to one-dimensional vibration signals, and wide-kernel deep convolutional autoencoders have been developed for diagnosing faults in rotating machinery using vibration data [
33,
34].
Building on this foundational work, we present a multi-size wide-kernel CNN architecture specifically optimized for improving diagnostic accuracy and robustness in electric drive systems. The main contributions of this study are:
Development of a novel MWK-CNN architecture for FD in IM drives, incorporating multiple kernel sizes across convolutional layers.
Effective feature extraction from input signals through wide-kernel convolutions, enabling accurate fault detection.
Enhanced diagnostic accuracy through integration of diverse kernel sizes, outperforming conventional small-kernel CNNs.
Robustness and generalizability, demonstrated through evaluation on an IM drive dataset collected under a variety of operating conditions.
The remainder of this paper is structured as follows.
Section 2 presents an overview of electric drive systems and common fault types.
Section 3 reviews the fundamentals of CNNs.
Section 4 describes the proposed methodology in detail.
Section 5 provides a comparative analysis and in-depth discussion of results. Finally,
Section 6 summarizes the key findings and concludes the study.
2. Electrical Drive and Its Faults
Field-oriented (vector) control is a widely adopted strategy for regulating induction-motor drives, particularly in VSD systems [
35]. Among its variants, speed-sensorless vector control has gained particular prominence in IM drive applications [
36], offering high efficiency and precise speed regulation without relying on external sensors such as encoders. Additional benefits include rapid dynamic response, reduced system cost, enhanced reliability, and a more compact overall design [
37].
Electrical-drive faults can take many forms, including overcurrent, overvoltage, communication errors, and failures within the drive’s power electronics or the connected motor [
21]. Such faults commonly originate from power-supply disturbances, overload conditions, or malfunctions in the drive’s internal components or communication interfaces [
13].
Common types of electrical-drive faults include:
Overcurrent: Excessive current flow—typically caused by overloads, short circuits, or wiring faults—that can trip protective devices and damage power-stage components.
Overvoltage: Elevated DC-bus or phase voltages resulting from supply transients, regenerative events, or converter malfunctions.
Communication faults: Loss or corruption of communication between the drive and its controller or feedback devices (e.g., PLCs, encoders), often due to cable defects, configuration errors, or interface failures.
Power-electronics faults: Open-circuit or short-circuit failures in switching devices (e.g., IGBTs/MOSFETs), gate-driver faults, or DC-link component failures that degrade converter performance.
Motor faults: Electrical faults in the motor windings (open or short circuits, including interturn faults), insulation degradation, and phase imbalances that reduce torque capability and increase heating.
Mechanical faults: Bearing defects, rotor eccentricity, misalignment, and shaft-related issues—typically revealed through vibration signatures and changes in electrical parameters.
These fault classes often interact (for example, a power-electronics fault can induce thermal stress that accelerates insulation failure), which complicates diagnosis and motivates integrated electrical–mechanical monitoring strategies.
4. Proposed Fault Diagnosis Method
Early fault diagnosis in electrical drives (EDFD) is essential to avoid disruptions in industrial operations. Electrical drive faults may manifest as overcurrent, overvoltage, communication errors, or failures in the drive’s power electronics or connected motor. Such faults can result from power supply disturbances, overload conditions, or issues within the drive’s communication systems or internal components. If undetected, these faults can lead to complete machinery shutdowns and significant production losses. The primary objective of this work is to develop an MWK-CNN for effective EDFD. Electrical signals are collected under various fault conditions and operational settings for subsequent analysis.
CNN models have been widely applied to FD and have demonstrated strong performance in EDFD. However, conventional CNN architectures typically employ small convolutional kernels for feature extraction, limiting their ability to accurately identify and differentiate between various electrical faults. To address this limitation, the proposed MWK-CNN incorporates wide convolution kernels, which expand the receptive field and allow the model to capture both high- and low-frequency features in vibration signals.
This enhanced architecture improves the network’s capacity to learn temporal characteristics inherent in the signals, which are often time-dependent and periodic. As a result, the proposed model achieves more effective feature extraction and higher diagnostic accuracy. A key structural innovation is the use of large convolution kernels in the initial layer, which plays a crucial role in preserving temporal features and enhancing diagnostic performance. The overall workflow of the proposed method is illustrated in
Figure 1.
The first layer of the MWK-CNN employs multiple convolution kernels of different sizes, as shown in
Figure 2. This multi-kernel design enables the capture of features across multiple frequency bands. The primary motivation for using wide kernels is to enhance the extraction of short-term signal characteristics. Compared to standard CNNs, the wide-kernel architecture offers advantages for EDFD, including improved global feature extraction, reduced overfitting, and more effective learning of relevant patterns.
One of the main strengths of CNNs is their ability to automatically identify informative diagnostic indicators while filtering out irrelevant data. The complete architecture of the MWK-CNN is depicted in
Figure 2. The first convolutional layer uses three wide kernels with sizes 150, 75, and 50, respectively, each followed by max-pooling layers with pool sizes of 15, 10, and 5.
Because the first layer uses multiple kernel sizes, a concatenation layer merges their outputs before passing the data to subsequent layers. FCLs follow this concatenation stage. Both convolutional and fully connected layers use the ReLU activation function, while the final output layer applies the SoftMax function for classification. The Adam optimizer is used to train and fine-tune the MWK-CNN model.
6. Conclusions
This study addresses the critical challenge of accurate drive fault detection by proposing a multi-scale wide-kernel convolutional neural network (MSWK-CNN) capable of directly processing raw drive signals, eliminating the need for manual feature extraction. The proposed model was benchmarked against established deep learning architectures, including 1D-CNN, LSTM, and ANN, using a purpose-built data acquisition system that collected drive signal data under both normal and faulty operating conditions. Fault types considered included PTGF, PTPF, OLF, OVF, UVF, and NOM. The expansive kernel architecture enhances the model’s ability to capture both local and global signal characteristics, thereby improving its capacity to distinguish between healthy and faulty conditions. The MSK-CNN, trained and validated on a comprehensive dataset collected from diverse operating scenarios, demonstrates heightened sensitivity to subtle fault signatures and delivers superior diagnostic performance. Moreover, the proposed approach shows strong potential for future applications in detecting additional drive-related issues, such as inverter pulse failures and gate-level defects. Future research should focus on improving the interpretability of wide-kernel CNN models, as a deeper understanding of their decision-making processes is essential for fostering user confidence and ensuring safe, reliable deployment in critical industrial environments.