Lifelong Learning-Enabled Fractional Order-Convolutional Encoder Model for Open-Circuit Fault Diagnosis of Power Converters Under Multi-Conditions

Li, Tao; Wang, Enyu; Yang, Jun

doi:10.3390/s25061884

Open AccessArticle

Lifelong Learning-Enabled Fractional Order-Convolutional Encoder Model for Open-Circuit Fault Diagnosis of Power Converters Under Multi-Conditions

by

Tao Li

^1,2,3,*

,

Enyu Wang

⁴

and

Jun Yang

³

¹

College of Railway Transportation, Hunan University of Technology, Zhuzhou 412007, China

²

College of Mechanical and Vehicle Engineering, Hunan University, Changsha 410082, China

³

Zhuzhou Times New Material Technology Co., Ltd., Zhuzhou 412007, China

⁴

College of Electrical and Information Engineering, Hunan University of Technology, Zhuzhou 412007, China

^*

Author to whom correspondence should be addressed.

Sensors 2025, 25(6), 1884; https://doi.org/10.3390/s25061884

Submission received: 21 January 2025 / Revised: 24 February 2025 / Accepted: 12 March 2025 / Published: 18 March 2025

(This article belongs to the Special Issue Machine Learning-Assisted Advanced Sensing Technologies for Modern Power Converters)

Download

Browse Figures

Versions Notes

Abstract

Open-circuit (OC) faults in power converters are common issues in motor drive systems, significantly affecting the safe and stable operation of the system. Conventional models can accurately diagnose faults under a single operating condition. However, when conditions change, these models may fail to recognize new fault features, resulting in a decrease in diagnosis accuracy. To address this challenge, this paper proposes a lifelong learning-enabled fractional order-convolutional encoder model for open-circuit fault diagnosis of power converters under multi-conditions. Firstly, the model automatically extracts and identifies fault signal features using the convolutional module and the encoder module, respectively. Subsequently, the model’s iterative computational process is optimized by learning historical gradient information through fractional order, and enhancing the model’s ability to capture the long-term dependencies inherent in fault signals. Finally, a multilevel lifelong learning framework has been established to enable the model to continuously learn the fault features of power converter under multi-conditions, thereby avoiding catastrophic forgetting that can occur when the model learns different tasks. The proposed model effectively addresses the challenge of low fault diagnosis accuracy that occurs when the operating conditions of the power converter change, achieving a diagnosis accuracy of 96.89% across 85 fault categories under multi-conditions.

Keywords:

lifelong learning; power converter; open-circuit fault; fault diagnosis; fractional order

1. Introduction

The power converter, as a core component of motor drive systems, is widely used in fields such as electric vehicles, renewable energy generation, and rail transportation. During operation, the power converter often needs to switch frequently between the rectifier and inverter states, which may lead to faults under multi-conditions. Therefore, the ability to quickly and accurately diagnose faults in power converters across multi-conditions is essential for ensuring the safe operation of motor drive systems.

Power converter faults are primarily classified into short-circuit (SC) faults and open-circuit (OC) faults, which are manifested by the failure of internal power devices such as an insulated gate bipolar transistor (IGBT) [1]. The main causes of SC faults can be divided into two aspects. On one hand, abnormal Pulse Width Modulation (PWM) control signals may lead to SC faults. On the other hand, IGBT breakdown may occur under high-stress conditions such as over-voltage, over-current, or excessive junction temperature. When an SC fault occurs, protective devices within the power converter, such as circuit breakers or fast-acting fuses, are activated to isolate the failed component and shut down the power converter [2]. The primary cause of OC faults is IGBT thermal fatigue failure, in which failure of internal solder layers or bonding wires prevents the IGBT from conducting current [3]. Once an OC fault occurs, the output current of the power converter becomes distorted and unbalanced; this does not trigger protective devices but severely reduces the output power quality and may even lead to secondary faults [4]. This paper focuses on the diagnosis of OC faults in the power converter.

For OC faults, current diagnosis methods mainly include model-based methods [5,6,7], signal-based methods [8,9,10], and data-driven methods [11,12,13]. Model-based methods require the establishment of mathematical models that characterize the physical behavior of the actual system. By observing and calculating the residuals between the measured values of the actual system and the simulated values from the mathematical model, the current state of the system can be assessed. In reference [14], the current residual is calculated by comparing the actual current path with the estimated current path, and faults are identified based on characteristic residual patterns. In reference [15], the extended Kalman filter is adjusted by measuring currents on both the battery and capacitor sides until the residual falls below a set correction threshold, and fault detection and localization are achieved by determining whether the maximum correction count or correction values on the faulted side exceed the threshold. In reference [16], output current and its rate of change are used as fault detection variables, while the sum of phase current and neutral point voltage residuals is used for fault localization. In reference [17], a process of state augmentation and nonsingular coordinate transformation was designed for the observation system to address misdiagnosis issues arising from interactions between different fault types. Model-based methods generally offer high diagnosis accuracy; however, establishing mathematical models for complex systems is challenging. Furthermore, when system parameters change, the model needs to be redefined.

An IGBT OC fault leads to voltage oscillations at the load end of the power converter and introduces significant harmonics in the output current. Fault diagnosis methods based on observing these signal variations are referred to as signal-based methods. When an OC fault occurs in the IGBT, the current path in its corresponding bridge arm changes. Reference [18] proposes diagnosing inverter OC faults by utilizing changes in the current path under effective vector coordinates. Reference [19] designed a normalized cost function for detecting OC faults and used the current mean and phase angle for fault localization. In addition to current detection, voltage signals can also be used to diagnose IGBT OC faults. Reference [20] utilizes the similarity of capacitor voltages under fault and normal conditions in modular multilevel converters, employing a correlation coefficient for early-stage fault localization. Reference [21] performs fault diagnosis based on the characteristic that the common-mode voltage of an inverter remains equal during normal operation and reduces fault miss-detection rates by injecting active common-mode voltage. Reference [22] proposed a fault diagnosis method based on the dynamic characteristics of midpoint voltage and a fault-tolerant control strategy based on Complementary Switch Blocking, which mitigates fault impacts by blocking gate drive signals of the complementary switch. Reference [23] processes three-phase voltage signals using ensemble empirical mode decomposition, breaking them down into intrinsic mode functions and calculating their norm entropy to characterize fault signal statistics. However, in practical applications, current signals are susceptible to load effects, reducing diagnosis accuracy and increasing diagnosis time. Voltage-based methods often require additional sensors, raising operational and maintenance costs.

Data-driven methods build fault diagnosis models by training on large amounts of historical data to establish a mapping relationship between fault features and fault modes. Once the model detects abnormal data, it can quickly identify the fault type using previously learned knowledge, completing the diagnosis process. For online fault diagnosis applications, the model needs to respond rapidly to fault signals. Numerous studies have aimed to improve diagnosis speed and efficiency to meet real-time requirements. For IGBT OC faults in three-phase PWM inverters, Reference [24] designed an ensemble classifier based on the Extreme Learning Machine with a reliability mechanism and used a Random Vector Functional Link (RVFL) network to identify fault features, reducing diagnosis time. Building on this, reference [25] proposed an RVFL network based on the Fast Fourier Transform and Relief algorithm to prevent misdiagnosis due to similar fault features. Reference [26] introduced a diagnosis method for IGBT OC faults in Neutral Point Clamped (NPC) three-level inverters, using Independent Component Analysis and Joint Approximate Diagonalization algorithms to separate fault signals, allowing for more precise identification of complex fault patterns. Reference [27] proposed a fault detection and localization method based on the Entropy of Wavelet Packets feature extraction and Support Vector Machine (SVM) to diagnose IGBT OC faults in multilevel inverters. Reference [28] considered the multi-scale characteristics of fault signals and developed a fault diagnosis algorithm based on multi-scale Approximate Entropy, Dempster–Shafer evidence theory, and Deng entropy fusion, effectively addressing conflicts and uncertainties among features and enhancing the mutual benefits of features across different scales.

The aforementioned methods demonstrate rapid processing speeds and high accuracy in diagnosing OC faults, they still rely on feature extraction algorithms to derive various characteristics from the raw signals. The diagnosis efficacy of these methods is significantly constrained by the selection process of features based on individual expert experience. This human intervention not only adds to operational complexity but may also introduce the risk of misjudgment. In contrast, deep neural networks possess exceptional feature parsing and selection capabilities, allowing them to automatically identify and select optimal fault features based on internal signal correlations, thereby enhancing the accuracy and reliability of fault diagnosis [29,30]. Reference [31] proposed a deep convolutional network model based on the inception module, which can automatically extract and identify fault features in three-phase PWM converters, applicable in both inverter and rectifier states. Reference [32] eliminated redundant features and sampling points through correlation analysis, followed by wavelet transformation to further compress feature data, accelerating the training process of deep feedforward networks. Reference [33] combined neural network models with classification algorithms, achieving rapid fault feature identification along the time dimension using an improved Long Short-Term Memory (LSTM) network, while employing SVM to classify the outputs of the enhanced LSTM, thus addressing the interpretability issues of neural network outputs. Reference [34] introduced a wide residual network model based on incremental learning, which combines the automatic feature extraction capability of residual networks with the incremental learning ability of generalized learning systems, allowing for incremental learning of new data without retraining the model. To tackle the challenges of extracting fault features from NPC inverters under non-stationary conditions, reference [35] proposed a fault diagnosis method based on attention collaborative stacked LSTM (ASLSTM) networks, which extracts highly discriminative features from multi-source time series data, improving the stability of fault diagnosis for NPC inverters.

All of the aforementioned models diagnose OC faults in power converters by analyzing current or voltage signals. However, in practice, these signals are usually affected by the electromagnetic environment and produce burrs, spikes, and other disturbances, which may reduce the diagnostic accuracy of the models. Fractional order gradient descent is an optimization method based on fractional order calculus, which allows for the incorporation of historical information during the gradient update, thereby enhancing the model’s learning capability. Compared to traditional integer-order gradient descent, fractional order gradient descent can adjust the optimization path of the neural network model through non-integer-order gradients, improve global search capabilities, and thus enhance the model’s robustness against noise. Reference [36] proposed an optimizer based on fractional order momentum gradient descent, which enables the neural network model to better converge to the global optimal solution, and improves the fault diagnosis accuracy under small sample datasets. Reference [37] employed the long-term memory property of Caputo–Fabrizio fractional order derivatives to solve the local dependence and singularity problems of traditional integer orders in practical applications, and improved the extraction effect of weak fault signals. Reference [38] utilizes the fractional order chaotic system to map the original vibration signals into the three-dimensional chaotic space, constructs a 3D dynamic error phase map with obvious differentiation ability, and realizes the rapid classification of fault signals.

Although neural networks exhibit high accuracy in fault diagnosis, when the operating state of the power converter changes, the neural network model needs to be retrained on new fault features, which leads to a decrease in diagnosis accuracy. Therefore, this paper proposes a lifelong learning-enabled fractional order-convolutional encoder model that can continuously learn the open-circuit fault characteristics of the power converter under multi-conditions. This model maintains high diagnostic accuracy for all faults, highlighting its significant research implications and practical applications. The contributions of this paper are as follows:

1. A convolutional encoder model was constructed for learning and diagnosing OC faults in the power converter. The time series features of three-phase current fault signals and the relative positional relationships between each phase signal are automatically extracted by the convolutional module. The encoder module is utilized to identify and classify these features, enabling the automatic learning and diagnosis of fault samples.

2. Fractional order is utilized to optimize the convolutional encoder model. This model improves the optimization process of backpropagation by incorporating fractional order, allowing it to fully consider the historical gradient information when updating, and is conducive to capturing long-term dependencies in time-series data. Additionally, the smooth gradient update path reduces the oscillations during the optimization process, facilitating stable convergence to the global optimal solution and improving robustness against anomalous noise.

3. A multilevel lifelong learning framework is designed to enable the model to continuously learn from new fault samples. Limit the range of updates during which the model parameters learn new tasks by incorporating a resilient regularization penalty term into the loss function. In addition, a random small number of samples from previous tasks are inserted into the new fault samples and by learning the soft labels from earlier tasks to improve the stability and accuracy of the model under different tasks.

2. Fault Analysis

The structure of the motor drive system is shown in Figure 1, consisting mainly of a battery, power converter, and motor. When the system operates the motor, the power converter works in the inverter state, converting the electric energy from direct current (DC) to alternating current (AC) and delivering it to the motor. Conversely, when the system is in a braking state, the motor operates in generator mode, outputting three-phase AC to the power converter. At this time, the power converter works in a rectifier state, converting the three-phase AC into DC and storing it in the battery. Therefore, during the operation of the motor drive system, the power converter continuously alternates between rectifier and inverter state.

The circuit structure of the power converter is shown in Figure 2. The DC power source V_D, capacitor C₁, and resistor R₁ form an equivalent DC power source responsible for supplying and storing electrical energy. S_x and D_x (x = 1, 2, …, 6) represent IGBT and Fast Recovery Diode (FRD), respectively. The three-phase bridge circuit consists of six pairs of IGBT with anti-parallel FRD. The three-phase AC, i_k (k = a, b, c), flows through the LCL filter—comprising an inductor-capacitor-inductor configuration—before being supplied to the motor. The LCL filter comprises inductors L₁ and L₂, resistor R₂, and capacitor C₂. The six pairs of IGBT in the three-phase bridge circuit are turned on and off in a specific pattern by the PWM control system, enabling bidirectional energy conversion between AC and DC. The signal acquisition module captures the system’s three-phase currents using current sensors. In rectification mode, the sensor captures the power converter’s bridge arm current I₁ as the fault diagnosis signal, and in inverter mode, it captures the load-side current I₂. An IGBT OC fault causes them to remain off, independent of PWM control, leading to current distortions that vary with converter operating conditions and motor types, such as AC Induction Motors (ACIM) or Permanent Magnet Synchronous Machines (PMSMs) in electric vehicles. This study thus examines the approach for diagnosing IGBT OC faults in the power converter under multi-conditions and motors.

In practical applications, diagnosing single or double OC faults is more practical as the probability of multiple OC faults occurring simultaneously is very low. This helps to identify and locate the failed device in time, preventing the fault from spreading to other parts of the system, which could result in secondary failures or even system paralysis. Furthermore, the OC fault diagnosis approach proposed in this paper is not restricted to a specific circuit topology or a particular model of IGBT device. This is because the approach can automatically learn and recognize fault characteristics from the signal. And no matter which topology or device, the occurrence of an OC fault will produce distinct abnormal features in the current or voltage signal. Consequently, the approach is universal and can be extended to new power converters with redundant designs or more switching devices, thus providing theoretical support and technical guarantee to improve the safety and reliability of the overall system. Based on the multi-conditions of the power converter and various motors, four distinct operating tasks can be classified, as shown in Table 1.

The simulation circuit is constructed using the MATLAB/Simulink R2022b platform according to the circuit topology of Figure 2 to simulate the current waveforms of the power converter when OC failure occurs under different tasks, and the parameters of the simulation circuit are shown in Table 2. The MATLAB/Simulink R2022b software is developed by MathWorks Incorporated (Natick, MA, USA).

2.1. Fault Analysis of the Power Converter Working Under Inverter Condition

When the power converter operates in the inverter state, energy transfers from the DC to the AC side. During normal operation, the output currents i_a, i_b, and i_c are balanced three-phase currents. To examine the waveform changes, phase A current i_a is used as an example. In the first half of the i_a cycle, S₄ remains off while S₁, controlled by the PWM controller, alternates on and off in a set pattern, outputting a PWM square wave voltage with amplitude V_D and positive polarity to phase A. This PWM square wave voltage, acting on an inductive load, produces a sinusoidal current i_a in phase A, where i_a > 0. In the second half-cycle, the pattern is reversed, with S₁ off and S₄ controlled by PWM, resulting in a sinusoidal current i_a with negative polarity.

For Task A, with an ACIM, the three-phase current simulated waveform with an OC fault in S₁ is shown in Figure 3a. In the second cycle, when S₁ fails, it is effectively off, and with S₁ also off, the DC side voltage cannot reach the AC side through phase A, leading to i_a = 0 in the first half-cycle. The resulting unbalanced power among the three phases distorts the currents in the non-faulted phases i_b and i_c, causing harmonic generation and amplitude increase. If S₄ experiences an OC fault, there will be no current output in the second half of i_a. Similar current waveform changes occur in phases B and C with single IGBT OC faults.

Dual IGBT OC faults can occur in three configurations: (1) faults in the same-phase IGBTs, (2) faults in different-phase IGBTs in the same half-bridge, and (3) faults in different-phase IGBTs in different half-bridges. Figure 3b shows the three-phase current simulated waveform with OC faults in S₁ and S₄, representing same-phase faults. Since S1 and S₄ affect separate halves of the i_a cycle independently, the fault waveform appears as a superposition of two symmetrical single IGBT faults in phase A, resulting in zero current in phase A while currents i_b and i_c remain opposite.

Figure 3c illustrates the three-phase current simulated waveform when S₁ and S₃ experience faults, representing different-phase faults within the same half-bridge. In the three-phase bridge circuit, any two phases in the same half-bridge have a 120^◦ phase difference. Since each phase IGBT conducts over a 180^◦ interval, overlapping occurs when a dual OC fault happens in the same half-bridge. Based on the distinct distortions, the three-phase current waveform is divided into four segments. Segment I is affected only by S₁, and Segment III only by S₃, behaving as single IGBT faults. Segment II shows overlapping effects from S₁ and S₃ faults, with both phases A and B OC, and i_a = i_b = i_c = 0 due to the three-phase current vector sum equaling zero. Segment IV remains unaffected by faults.

S₁ and S₆ represent faults in different phases and half-bridges, with a 120^◦ phase overlap. Their dual fault three-phase current simulated waveform is shown in Figure 3d. Segments II and IV are influenced by individual IGBT faults, while Segment III remains fault-free. Segment I reflects overlapping effects from both S₁ and S₆ faults, resulting in i_a = 0 and i_b = −i_c. Due to the increased current magnitude in phases B and C, the rapid current drop induces intense oscillations and distortion.

For Task C, with a PMSM, a typical three-phase current simulated waveform under an OC fault is shown in Figure 4. Similarly to the previous analysis, a single IGBT OC fault results in no current in the affected phase for half of the cycle, while the currents in the other phases become distorted. Figure 4a illustrates the three-phase current simulated waveform with an OC fault in S₁. In cases of dual IGBT OC faults, the analysis depends on whether there is an overlap in the faulted phases. Non-overlapping segments behave as single IGBT faults, while overlapping segments follow the same analysis as Task A. However, due to the differences in motor type and control strategy, the current distortions resulting from IGBT faults vary compared to other tasks.

2.2. Fault Analysis of Power Converter Working Under Rectifier Condition

When the power converter operates in a rectifier state, energy flows from the AC to the DC side. For Task B, where the motor is an ACIM, a typical three-phase current simulated waveform under an OC fault is shown in Figure 5. When the rectifier is operating normally, symmetrical three-phase voltage at the AC side generates symmetrical three-phase current through the load inductance.

Taking A-phase current i_a as an example, the three-phase current simulated waveform under a single IGBT OC fault is analyzed. When i_a > 0, the current flows through D₁ to the DC side or returns to the AC side through S₄. When i_a < 0, the current flows back to the AC side via either S₁ or D₄. Thus, an OC fault in S₁ affects only the i_a < 0 part, as shown in Figure 5a, where i_a = 0 during this period due to the lack of return current through S₁. When the A-phase voltage reaches the lowest value among the three phases, D₄ conducts to carry the current back to the motor. In the latter half-cycle of i_a, two no-current intervals appear since D₄ turns off when other diodes are conducting. Due to the constant vector sum of the three-phase currents, some distortion also occurs in i_b and i_c.

The types of dual IGBT OC faults in a rectifier state are similar to those in an inverter state. Figure 5b shows the three-phase current simulated waveform for an OC fault in the A-phase IGBT, where only D₁ and D₄ conduct, causing four zero-current intervals in i_a over one cycle. As IGBTs in the same phase sequence do not conduct simultaneously, no overlap occurs in fault effects, so the other two-phase currents remain distorted but do not reach zero.

Figure 5c shows the case where upper-bridge IGBTs in A-phase and B-phase experience OC faults. Segments I and III are impacted by individual IGBT faults, similar to a single IGBT OC fault waveform. Segment II remains unaffected, operating normally. In Segment IV, both S₁ and S₃ OC are in OC, resulting in zero intervals for both i_a and i_b. Since i_c > 0, D₂ remains at positive voltage and cannot conduct, meaning D₄ and D₆ do not turn off simultaneously, allowing at least two phases to conduct current. This fault waveform can be seen as an overlap of the individual S₁ and S₃ faults.

Figure 5d presents the scenario of a simultaneous OC fault in upper-bridge IGBT S₁ of A-phase and lower-bridge IGBT S₆ of B-phase. Segment I is fault-free, and Segment II involves only the S₆ fault. In Segment III, i_a ≤ 0 and with S₁ in fault, A-phase current only flows through D₄. Likewise, with i_b ≥ 0 and S₆ in fault, B-phase current can only pass through D₃, preventing A- and B-phases from forming a current loop, meaning they do not conduct simultaneously during this stage. When i_c < 0, S₅ conducts while S₂ is off, forming a current loop between B- and C-phases through D₃ and S₅, resulting in i_a = 0 and i_b = −i_c. When i_c ≥ 0, S₂ conducts while S₅ is off, forming a loop between A- and B-phases through S₂ and D₄, making i_b = 0 and i_a = −i_c. Segment IV experiences only a single IGBT fault but with an increase in current amplitude across all phases.

For Task D, where the motor is a PMSM operating in a rectifier state, a typical OC fault three-phase current simulated waveform is shown in Figure 6. Unlike the previous analysis, an OC fault does not cause segments of the current to reach zero. Instead, it leads to an imbalance in the three-phase power. This imbalance disrupts the symmetry of the current waveform, which can significantly affect the performance of the PMSM and lead to irregularities in power flow within the system.

The above content analyzed the four typical types of faults in the power converter under different operational tasks. Extending this analysis to all phases of IGBTs results in a total of 85 distinct fault types, as summarized in Table 3. Unique labels are assigned to each type of fault and used for neural network model training.

3. Approach

A lifelong learning-enabled fractional order-convolutional encoder (LL-FO-CE) model is proposed, with a flowchart shown in Figure 7. The proposed model is capable of continuously learning the three-phase current signals of a power converter across four different task types and accurately diagnosing faults in all previously learned tasks.

A simulation circuit is built on the MATLAB/Simulink platform, and a single cycle of current signals is collected while the power converter is operating stably, denoted as I_original. To enhance the model’s generalization ability and robustness, data preprocessing is applied to the raw signals. Since OC faults in the power converter occur randomly and may appear at any point within a cycle, the original sample data are shifted along the time dimension to simulate current signals with faults occurring at different times. The data are cyclically shifted by a fixed step size, as shown in (1).

\begin{matrix} I_{1, s h i f t e d} = I_{o r i g i n a l} [x_{t}], (t \geq 1, t \in N^{*}) \\ I_{j, s h i f t e d} [x_{t}] = I_{j - 1} [x_{t + p}], (j \geq 2, j \in N^{*}) \end{matrix}

(1)

The vector I_j,shifted represents the j-th shifted sample, where x_t is the sample point and p is the shifting step size. The shifted sample obtained is then used as the base sample for the next shift, with this process repeated until all data have been cycled through, ultimately yielding a total of j groups of fault samples. However, in practical applications, current signals are often affected by noise during acquisition. To simulate this interference, a random Gaussian white noise is applied to each sample group, as shown in (2).

I_{j, n o i s e} = I_{j, s h i f t e d} + c \cdot \frac{1}{\sqrt{2 π} σ} \exp (- \frac{{(x - μ)}^{2}}{2 σ^{2}})

(2)

where μ is the mean of the Gaussian distribution, σ² is the variance, and c is the noise amplification factor. Setting μ = 0, σ² = 1 and c = 10, we obtain the noise-augmented fault sample I_j,noise. Given the large values in the original data, to prevent disproportionately large features from skewing the training results, the data are standardized using the Z-Score, as shown in (3).

I_{j} = \frac{I_{j, n o i s e} - μ_{I}}{σ_{I}}

(3)

where I_j represents the fault sample after data preprocessing, while μ_I is the mean of I_j,noise, and σ_I is the standard deviation of I_j,noise. Finally, by merging all the fault samples, we obtain the dataset I of three-phase current signals from the power converter under four different tasks, as shown in (4).

I = [I_{1}, I_{2}, \dots, I_{j}, \dots, I_{N}], (j = 1, 2, \dots, N)

(4)

The dataset I will be used for training and testing the LL-FO-CE model. The first part of the model is the convolutional module, which is designed to extract features from the data, as shown in (5) and (6) [39].

H_{c o n v} = C o n v 1 D (I)

(5)

H_{p o o l} = M a x (H_{c o n v})

(6)

where H_conv is the output of the convolutional layer, and H_pool is the output of the max-pooling layer. After the three-phase current signal samples are input into the model, they undergo multiple layers of 1D convolution and max-pooling to thoroughly extract the features of the fault samples. The hidden feature outputs from the convolutional module are then fed into the encoder module. First, the variables undergo positional encoding, as shown in (7)–(9) [40].

X = H_{p o o l} + P E

(7)

P E_{(p o s, 2 i)} = \sin (p o s / 10000^{2 i / d})

(8)

P E_{(p o s, 2 i + 1)} = \cos (p o s / 10000^{2 i / d})

(9)

where pos represents the position of each sample point in the sequence, d is the dimensionality of the position vector, which corresponds to the number of channels in the feature map output from the convolutional module, and i is the index of the dimension of the position vector. Then, the attention weights for the features are calculated using self-attention, as shown in (10) [41].

A t t e n t i o n (Q, K, V) = Softmax (\frac{Q \cdot K^{T}}{\sqrt{d_{k}}}) V

(10)

√d_k is the scaling factor, which helps prevent the vanishing gradients caused by excessively large products. Q, K, and V represent the Query, Key, and Value vectors in self-attention, as shown in (11)–(13).

Q = X \cdot W_{Q}

(11)

K = X \cdot W_{K}

(12)

V = X \cdot W_{V}

(13)

W_Q, W_K, and W_V are the weight parameter matrices, which are updated through model training. Then, by utilizing the multi-head attention mechanism, the model computes attention in parallel across different subspaces, as shown in (14).

M u l t i H e a d (Q, K, V) = C o n c a t (h e a d_{1}, h e a d_{2}, \dots, h e a d_{h}) W_{O}

(14)

The calculation formula for the i-th self-attention head is shown in (15).

h e a d_{i} = A t t e n t i o n (Q \times W_{Q i}, K \times W_{K i}, V \times W_{V i})

(15)

The output of each encoder layer will pass through a feedforward neural network consisting of two linear transformations and a nonlinear activation function, as shown in (16).

F F N (x) = \max (0, x W_{1} + b_{1}) W_{2} + b_{2}

(16)

W and b are the weight parameter matrix and bias parameter of the feedforward neural network, respectively, while max represents the activation function. The encoder module is composed of multiple identical encoders stacked together, with each layer including multi-head self-attention and a feedforward neural network. Thus, the output of a single-layer encoder can be expressed, as shown in (17) and (18).

Z = L a y e r N o r m (X + M u l t i H e a d (Q, K, V))

(17)

Z^{'} = L a y e r N o r m (Z + F F N (Z))

(18)

Z represents the output of the multi-head attention, Z′ denotes the output of the encoder, and LayerNorm refers to layer normalization. The output of the encoder module is then passed through a fully connected layer, mapping it to the dimensionality of the number of classes, as shown in (19).

H_{f c} = Z^{'} W_{f c} + b_{f c}

(19)

H_fc represents the hidden state output from the fully connected layer, while W_fc and b_fc denote the weight parameter matrix and the bias parameter vector, respectively. Finally, the hidden state is transformed into a probability distribution using the Softmax function, and the final classification result is determined through Argmax, as shown in (20) and (21).

P = S o f t m a x (H_{f c})

(20)

C l a s s = A r g m a x (P)

(21)

The equations above outline the fundamental architecture and the forward propagation process of the proposed model. In the backpropagation and parameter update phase of the model, the fractional order derivative is utilized to optimize the gradient descent process [36]. The Caputo fractional order gradient descent is given by (22).

{}^{C}D_{θ}^{α} L (θ) = \sum_{i = I}^{\infty} \frac{f^{(i)} (θ_{0})}{Γ (i + 1 - α)} {(θ - θ_{0})}^{(i - α)}

(22)

^CD is the Caputo fractional operator, L(θ) is the loss function, α is the order of the fractional derivative, and I is the smallest integer greater than α; thus, I − 1 < α < I. θ represents the neural network parameters, and Γ is the Gamma function. When I = 1 and 0 < α < 1, the expression is given by (23).

{}^{C}D_{θ_{K}}^{α} L (θ_{K}) = \sum_{i = 1}^{\infty} \frac{f^{(i)} (θ_{K - 1})}{Γ (i + 1 - α)} {(θ_{K} - θ_{K - 1})}^{(i - α)}

(23)

When I = 2, and 1 < α < 2, the expression becomes (24).

{}^{C}D_{θ_{K}}^{α} L (θ_{K}) = \sum_{i = 2}^{\infty} \frac{f^{(i)} (θ_{K - 1})}{Γ (i + 1 - α)} {(θ_{K} - θ_{K - 1})}^{(i - α)}

(24)

Therefore, the expression for the fractional order when 0 < α < 2 is given by (25).

{}^{C}D_{θ_{K}}^{α} L (θ_{K}) = \sum_{i = 0}^{\infty} \frac{f^{(i + 1)} (θ_{K - 1})}{Γ (i + 2 - α)} {(θ_{K} - θ_{K - 1})}^{(i + 1 - α)}

(25)

By applying the Taylor series expansion to (25) up to the first term, we obtain the fractional order loss function for the case when 0 < α < 2, as shown in (26).

{}^{C}D_{θ_{K}}^{α} L (θ_{K}) = \frac{f^{(1)} (θ_{K - 1})}{Γ (2 - α)} {|θ_{K} - θ_{K - 1} + δ|}^{1 - α}

(26)

δ is a small positive number introduced to prevent division by zero. Therefore, the parameter update expression based on the fractional order gradient descent is shown in (27).

θ_{K + 1} = θ_{K} - μ \frac{f^{(1)} (θ_{K - 1})}{Γ (2 - α)} {|θ_{K} - θ_{K - 1} + δ|}^{1 - α}

(27)

This method allows the neural network to update parameters using fractional order gradients during backpropagation, enabling the model to quickly find the global optimal solution and avoid getting trapped in local optima.

The above process allows the model to train optimally for fault diagnosis tasks in a single state of the power converter. However, when the state changes, the model will be unable to recognize new fault features and will need to be retrained with new fault samples. To enable the model to continue learning new fault samples, a multilevel lifelong learning framework has been designed. This framework includes randomly inserting samples from previous tasks into the new batch of training data, learning soft labels from previous tasks, and adding a regularization penalty term to the loss function.

First, a replay buffer containing randomly selected fault samples from previous tasks is constructed, as shown in (28) [42].

D_{A, r e} = [I_{j}], I_{j} \subseteq D_{A}, j ~ U (1, m)

(28)

I_j represents the fault samples from Task A, where j is a random number in the interval (1, m), and m is the total number of samples in Task A. This operation is performed at the beginning of each iteration cycle when the model is learning a new task, to update the samples in the replay buffer. The samples from D_A,re are combined with the current batch of training data D_B, as shown in (29).

D_{A B} = D_{A, r e} + D_{B}

(29)

D_AB contains samples from both the previous Task A and the current Task B, which are used for model training. To maintain consistency across the two different tasks, the model learns the soft labels from the previous task, as shown in (30) [43].

L_{K D} (θ) = (1 - β) L_{C E} (θ | y_{s} (x_{A B})) + β \cdot L_{K L} \{θ | σ [\frac{y_{s} (x_{A B})}{T}], σ [\frac{y_{T} (x_{A B})}{T}]\}

(30)

L_CE is the cross-entropy loss function, while L_KL denotes the Kullback–Leibler (KL) divergence loss function. The σ represents the Softmax function, and y_T and y_S refer to the model’s predictions on the previous task and the current task, respectively. β is a weight parameter, and T is the temperature coefficient. The loss L_KD consists of two parts: one is the hard target loss L_CE, which measures the discrepancy between the model’s predictions on the current task and the true labels. The other part is the soft target loss L_KL, which indicates the difference between the model’s output on the previous task and the soft labels. By adjusting the size of β, the model’s focus during the learning of new tasks can be controlled. A larger β places more emphasis on retaining previous knowledge, while a smaller β favors the learning of the new task. The temperature coefficient T is used to regulate the difficulty of learning the knowledge from previous tasks; a larger T facilitates the acquisition of knowledge from earlier tasks.

To mitigate the model’s forgetting speed regarding previous tasks, a regularization penalty term that measures the importance of the model parameters is added to the loss function. For Task D, the optimal parameters θ should maximize the conditional probability P(θ|D), which can be expressed using Bayes’ theorem as (31) [44].

\log P (θ | D) = \log \frac{P (D_{B} | θ D_{A}) P (θ D_{A})}{P (D_{A} D_{B})}

(31)

D_A and D_B are two distinct tasks that make up Task D. Since D_A and D_B are independent of each other, we obtain (32).

\log P (θ | D) = \log P (D_{B} | θ) + \log P (θ | D_{A}) - \log P (D_{B})

(32)

Under the given parameters, logP(D_B|θ) represents the loss function for Task D_B, denoted as −L_B(θ), and logP(D_B) is a constant. Thus, the optimization objective is expressed as (33).

\max \log P (θ | D) = \max (- L_{B} (θ) + \log P (θ | D_{A}))

(33)

Simplifying the right side of the equation yields (34).

\max \log P (θ | D) = \min (L_{B} (θ) - \log P (θ | D_{A}))

(34)

For the posterior probability P(θ|D_A), it can be approximated as a Gaussian distribution that conforms to the prior probability P(D_A|θ). Let f(θ) = log P(D_A|θ), and expand it up to the third term at the optimal parameter θ^*_A using the Taylor formula, as shown in (35).

f (θ) = f (θ_{A}^{*}) + f^{'} (θ_{A}^{*}) (θ - θ_{A}^{*}) + \frac{1}{2} f^{″} (θ_{A}^{*}) {(θ - θ_{A}^{*})}^{2} + o (θ_{A}^{*})

(35)

Since θ^*_A is the optimal solution for f(θ), it follows that f′(θ^*_A) = 0. Substituting the probability density function of f(θ) gives us (36).

f (θ) = \log \frac{1}{\sqrt{2 π} δ} - \frac{{(θ - μ)}^{2}}{2 δ^{2}} \approx f (θ_{A}^{*}) + \frac{1}{2} f^{″} (θ_{A}^{*}) {(θ - θ_{A}^{*})}^{2}

(36)

By solving, we obtain μ = θ^*_A and δ² = −1/f″(θ^*_A). Substituting these into (34) gives us the expression shown in (37).

\max \log P (θ | D) = \min (L_{B} (θ) - \log (1 / \sqrt{2 π} δ) - \frac{1}{2} f^{″} (θ_{A}^{*}) {(θ - θ_{A}^{*})}^{2})

(37)

log [1/(√2π)δ] is a constant. Based on the optimization objective, we can construct the loss function L_REG as shown in (38).

L_{R E G} (θ) = L_{B} (θ) - \frac{1}{2} f^{″} (θ_{A}^{*}) {(θ - θ_{A}^{*})}^{2}

(38)

where f″(θ^*_A) can be represented by the diagonal elements of the Fisher information matrix −F_A,i, as shown in (39) [45].

F_{A, i} = \frac{1}{m} {\sum_{t = 1}^{m} [\frac{\partial L (θ | y_{p, A} (t))}{\partial θ_{A, i}^{*}}]}^{2}

(39)

θ^*_A represents the network parameters of the neural network on Task D_A, y_p,A(t) is the predicted value on D_A, and m is the total amount of data in D_A. By calculating the partial derivative of the loss function of all predicted values with respect to the network parameters θ^*_A, the importance of weight F_A,i of the parameters for Task D_A can be obtained. The neural network will continue training on Task D_B, and the loss function will be constructed by incorporating F_A,i as shown in (40).

L_{R E G} (θ_{B, K + 1}) = L_{C E} (θ_{B, K}) - \frac{λ}{2} \sum_{i = 1}^{n} F_{A, i} {(θ_{B, i} - θ_{A, i}^{*})}^{2}

(40)

The loss function L_KD is combined with L_REG, and the parameters of the LL-FO-CE model are updated through fractional-order gradient descent during backpropagation, resulting in the parameter update expression as shown in (41).

\begin{array}{l} θ_{N, K + 1} = θ_{N, K} - μ {}^{C}D_{θ_{N, K}}^{α} L_{K D} (θ_{N, K}) + μ \frac{\partial L_{R E G} (θ_{N, K})}{\partial θ} \\ = θ_{N, K} - μ \frac{1}{Γ (2 - α)} [\frac{\partial L_{C E} (θ_{N, K - 1})}{\partial θ} - β \frac{\partial L_{C E} (θ_{N, K - 1})}{\partial θ} + β \frac{L_{K L} (θ_{N, K - 1})}{\partial θ}] {|θ_{K} - θ_{K - 1} + δ|}^{1 - α} \\ + μ \frac{\partial L_{C E} (θ_{N, K - 1})}{\partial θ} - \frac{μ λ}{m} \sum_{i = 1}^{n} \{\sum_{x = 1}^{m} {[\frac{\partial L_{C E} (θ | y_{N - 1} (x))}{\partial θ_{N - 1, i}^{*}}]}^{2}\} (θ_{N, i} - θ_{N - 1, i}^{*}) \frac{\partial θ_{N, i}}{\partial θ} \end{array}

(41)

4. Simulation

In this section, the proposed LL-FO-CE model will be verified in detail for its diagnostic performance of power converter OC faults under different tasks. Additionally, LL-FO-CE will be compared with four state-of-the-art (SOTA) models that have been validated for their accuracy in diagnosing OC faults in power converters: CNN-Transformer [46], Res-BiLSTM [47], ASLSTM [35], and TCN [48]. These models are innovative and demonstrate exceptional performance in OC fault diagnosis, as reported in their original publications.

Co-simulation experiments were conducted using Python and MATLAB/Simulink, specifically with Python version 3.9 and MATLAB/Simulink version R2022b. Python is an open source programming language developed and maintained by a global community of developers. A deep neural network is constructed using the PyTorch 2.3.0 framework, with CUDA employed to accelerate the training process. Simulation circuits are developed in MATLAB/Simulink R2022b to generate fault simulation data for training and testing the neural network model.

4.1. Fractional Order Design

According to Section 3, the fractional order significantly influences the speed of parameter updating and convergence in the neural network model. The order also determines the memory length and strength of the fractional order derivatives. An appropriate order can more effectively capture the long-term dependencies within the data, thereby enhancing the model’s diagnosis performance. However, the optimal order often varies across different types of tasks, necessitating exploration to identify the most suitable order for the model.

It is first necessary to determine the optimal number of training epochs for the model on different tasks. Let LL-FO-CE be continuously trained on four tasks, and the loss curves and validation set accuracy curves during training are shown in Figure 8. The upper half of the figure displays the model’s accuracy on the validation set, while the lower half shows the training loss. Curves of different colors represent the model’s accuracy or loss for each task. The stages indicate different training phases: in Stage I, the model is trained on Task A; in Stage II, it switches to Task B and continues training, and so on. In Stage I, when the epoch reaches 25, the accuracy converges to its maximum value, and the loss converges to its minimum. As the training stages progress, the optimal number of iterations required for the model to converge on each task gradually increases. This is because the model requires more training epochs to solidify its memory of the previous tasks. Upon discussion, the optimal number of iterations for the model in Stage II, Stage III, and Stage IV are found to be 40, 90, and 170, respectively.

The optimal training epochs above represent the best values for neural network models using traditional integer-order gradient descent. Building on this, we further investigate the optimal order for the model under fractional order gradient descent. Figure 9 shows the average accuracy curve on validation sets of previously learned tasks under different fractional orders, where order = 1 represents traditional integer-order gradient descent. During the first two stages, all orders converge to the highest accuracy. In Stage III, differences in convergence speed and maximum accuracy emerge across orders. In Stage IV, integer-order gradient descent requires 325 epochs to converge, with a maximum average accuracy of 92%. In contrast, fractional order gradient descent with an order of 1.2 converges by epoch 210, achieving a maximum average accuracy of 94% across all validation sets, a 2% improvement over integer-order descent. This indicates that with order = 1.2, fractional order gradient descent enables faster convergence and higher accuracy.

Combined with the above studies, the optimal hyperparameters for the training process of LL-FO-CE are shown in Table 4. The fundamental training parameters, such as epoch, learning rate, and batch size, are also applicable to the other SOTA models discussed in the paper.

Since fractional order gradient descent allows the model to find a globally optimal solution to the task, this means that the model still performs well when the task becomes more complex. The following discussion is carried out to verify this idea. Let LL-FO-CE and other SOTA models be trained separately on four tasks and subsequently tested on a dataset with random pulse noise signals applied. The accuracy results are presented in Figure 10. LL-FO-CE achieves the highest classification accuracy on the test set for each task, with an average accuracy of 95%. In contrast, the other four SOTA models exhibit lower average accuracies of 91.25%, 91.5%, 63.25%, and 65%, respectively. This demonstrates that fractional order enhances the model’s robustness to noisy data.

To further investigate the diagnostic performance of each model on the random pulse noise signal test set, the high-dimensional data output from the model’s hidden layer is downscaled and visualized using t-distributed stochastic neighbor embedding (t-SNE), as shown in Figure 11. Each subgraph corresponds to a specific task, and the points of different colors represent different categories of fault samples; their correspondences are indicated in the upper right corner of each subgraph. The distribution of color blocks for each category in the picture can reflect the diagnostic performance of the model on the corresponding fault samples of the category. If the data points of the same category are closely clustered together, with a clear separation between points of different categories, it indicates that the model can accurately identify and classify different types of samples, signifying high fault diagnosis accuracy. Figure 11a shows the t-SNE plots for each model in Task A. The LL-FO-CE achieves clear categorization of the data, exhibiting tight clustering within each category and significant separation between different categories. This suggests that the model effectively recognizes the features of various sample categories, leading to accurate diagnoses. The clustering results of the CNN-Transformer indicate that the distance between certain categories is smaller, and the boundaries are more ambiguous. This suggests that the model’s ability to differentiate the features of various categories is somewhat inadequate. The Res-BiLSTM can form clusters for samples in most categories; however, the tightness of the data within each category is relatively low, and there is some category overlap. Specifically, data from different categories are clustered in the same region, indicating that the model misdiagnoses some fault samples, which results in lower diagnostic accuracy. The ASLSTM combines a significant number of data points from different categories, resulting in indistinct category boundaries. This suggests that its diagnostic performance on the random pulse noise signal test set is poor, and its robustness is weak. The TCN distributes all data points along a continuous curve, indicating that it recognizes temporal relationships in the data but fails to identify other features, resulting in no obvious clustering structure. Overall, the LL-FO-CE demonstrates optimal performance across all tasks, exhibiting clear category boundaries and tight clustering. This indicates that the model can maintain strong diagnostic performance on the random pulse noise signal test set, further proving its robust anti-interference capability.

4.2. Performance Study of the Lifelong Learning Framework

To verify the lifelong learning capability of the proposed model, the following experiments were conducted in this study. The models were trained sequentially from Task A to Task D, and the accuracy of each model on the validation set for all learned tasks was evaluated at the end of each epoch, as shown in Figure 12. In Stage I, while training on Task A, all models except ASLSTM and TCN achieved the highest accuracy. In Stage II, when the models switched to Task B, all of them attained high accuracy on the validation set for Task B before the end of the training phase. However, for the validation set of Task A, the accuracy of CNN-Transformer and Res-BiLSTM dropped below 10% within just a few epochs, while only the LL-FO-CE maintained high accuracy on Task A. In Stage III, all models except LL-FO-CE forgot the knowledge of Task B and could only learn and remember features from Task C. In the final stage, all models achieved high accuracy on Task D, but only LL-FO-CE was able to maintain high accuracy across the other three tasks simultaneously. This indicates that neural network models typically struggle to retain memory for single tasks, and once they engage in continuous learning of different tasks, they tend to forget previously acquired knowledge. In contrast, the lifelong learning framework proposed in this paper enables the LL-FO-CE to maintain long-term memory of all learned knowledge, ensuring that it retains high accuracy on previous tasks and avoids the catastrophic forgetting that often occurs when learning different tasks.

4.3. Research on Fault Diagnosis Performance

A fault sample test set containing all 85 fault categories was constructed to validate the fault diagnosis performance. The LL-FO-CE model was trained continuously on four tasks alongside other SOTA models, and the diagnosis results for all fault categories were validated on the test set. The stacked plots of the diagnosis results and confusion matrix for each model are shown in Figure 13.

Figure 13b–f shows the confusion matrices of the diagnosis results of each model in the validation set, where the X-axis represents the true labels of the samples and the Y-axis represents the predicted labels of the models. Figure 13a shows a stacked plot of the confusion matrices, where the Z-axis of the plot represents the different models. It was shown that the diagnosis results of the other SOTA models on the first 64 category samples significantly deviated from the true categories and failed to accurately classify the faulty samples. These samples all belonged to Task A, Task B, and Task C, further suggesting that these SOTA models experienced catastrophic forgetting when learning different tasks consecutively, resulting in a significant decrease in diagnosis ability on previous tasks. In addition, these models judged the vast majority of fault samples with a predictive label of 1 as “no fault”, which shows that they lost the ability to recognize faults in the first 64 categories. For all 85 fault categories, the diagnosis average accuracies of CNN-Transformer, Res-BiLSTM, ASLSTM, and TCN are 25.83%, 25.87%, 25.85%, and 25.88%, respectively.

The diagnosis of fault samples by LL-FO-CE primarily focuses on the main diagonal of the confusion matrix, achieving an accuracy of 96.89%. This indicates that the proposed model exhibits accurate classification ability on all 85 fault categories. Although the model deviates from the true label on the classification of a few samples, almost all samples successfully identify the faults, i.e., the predicted labels are not equal to 1. The experimental results validate that LL-FO-CE is able to accurately diagnose OC faults of IGBTs of the power converter under multiple operating conditions and demonstrates high diagnosis accuracy.

Figure 14 illustrates the average diagnosis time and its standard deviation for the five models evaluated on the test set. The left axis represents the different model categories, while the bottom axis indicates the diagnosis time, defined as the computation time of each model. The rightmost side of each bar is labeled with the specific time value with the error bar indicating the standard deviation. As illustrated in the figure, LL-FO-CE exhibits the shortest diagnostic time, with an average of 306.43 ms and the smallest standard deviation of 4.2 ms. This indicates that its computation time is stable, minimally affected by input samples, and demonstrates strong robustness. In comparison, the average computation time and standard deviation for the CNN-Transformer are higher, at 320.32 ms and 6 ms, respectively. The average computation time for Res-BiLSTM is 452.66 ms, which is 47.7% higher than that of LL-FO-CE. Additionally, its standard deviation is relatively large at 9.5 ms, suggesting that the model is unstable. The average diagnostic time for ASLSTM is 442.89 ms, and the standard deviation is 4.44 ms. The average computation time and standard deviation of TCN are 484.72 ms and 21.13 ms, respectively, which are the maximum values among the five models, indicating that the model has not only high computational complexity but also poor stability. Overall, LL-FO-CE demonstrates the highest computational efficiency among the five models and outperforms the others in terms of stability, with high real-time performance and robustness, and is suitable for real-time fault diagnosis tasks.

5. Semi-Physical Experiments

To further verify the reliability of the proposed fault diagnosis method, a semi-physical virtual simulation system for power converter fault diagnosis has been established using the OPAL-RT OP4510 real-time simulator, a Tektronix (Beaverton, OR, USA) oscilloscope, and a desktop computer equipped with an AMD R5-7500F CPU, an RTX 4060 Ti GPU, and 32 GB of RAM. The CPU is manufactured by Advanced Micro Devices, Inc. (Santa Clara, CA, USA), while the GPU is produced by NVIDIA Corporation (Santa Clara, CA, USA). The experimental setup is shown in Figure 15. The OPAL-RT OP4510 is used to generate the control signals and run the simulation model, which operates on RT-LAB, the industrial-grade real-time simulation system developed by OPAL-RT (Montreal, QC, Canada). The oscilloscope monitors and collects the current signals generated by the OP4510 in real time, producing waveform files. The desktop computer is used to run the fault diagnosis program.

The workflow of the semi-physical virtual simulation experiment system is as follows: First, build and compile the simulation model file according to the RT-LAB standard on the computer. Next, transfer the compiled file to the OP4510 real-time simulation platform. After that, start the simulation model on the OP4510 and perform the corresponding control operations on the model. At the same time, an oscilloscope is used to monitor and collect the current signal, generating the corresponding current waveform file. During this process, the computer will run the fault diagnosis program synchronously to analyze the waveform data generated by the oscilloscope in depth, so as to realize the accurate detection of faults and the precise location of the failed device.

The real-time emulator OP4510 generates and controls PWM signals to drive a power converter emulation circuit. In this circuit, each IGBT device corresponds to a specific PWM signal. When all PWM signals are functioning normally, the power converter maintains a stable operating state. However, if a PWM signal is disconnected, the corresponding IGBT device will enter a shutdown state, thereby simulating an OC fault scenario of the power converter in real applications. The system adopts the double closed-loop control method based on the d-q rotating coordinate system, which is mainstream in the engineering field. The main parameter settings are detailed in Table 5 to ensure the accuracy and stability of the system control.

A validation set of three-phase current data, encompassing 85 fault categories, was obtained through the semi-physical virtual simulation system. Each fault category includes 60 sets of three-phase current signal samples, with eight typical fault waveforms illustrated in Figure 16. In comparison to the Simulink simulation waveforms, these waveforms exhibit noticeable burrs and noise interference, which may increase the difficulty of fault diagnosis.

Four evaluation metrics are used to assess the effectiveness of fault diagnosis: accuracy, precision, recall, and F1 score. Accuracy represents the number of correctly diagnosed samples as a proportion of the total number of samples, reflecting the overall effectiveness of the fault diagnosis results, as shown in (42).

A c c u r a c y = \frac{T P + T N}{N}

(42)

where TP is the number of samples correctly diagnosed as positive classes, which means that no faulty samples were correctly diagnosed. TN is the number of samples correctly diagnosed as negative classes, which means that faulty samples were correctly diagnosed.

The precision indicates the proportion of samples predicted by the model to belong to each category that actually do belong to that category. It is used to measure the accuracy of the model’s predictions for each category. The global precision is calculated by Macro-Average as shown in (43).

M a c r o P r e c i s i o n = \frac{1}{N} \sum_{i = 1}^{N} \frac{T P_{i}}{T P_{i} + F P_{i}}

(43)

where TP_i denotes the number of samples correctly predicted by the model to be in category i, FP_i denotes the number of samples incorrectly predicted by the model to be in category i, and N represents the total number of samples across all categories. Macro-Averaging is applicable when the number of samples is equal for all categories and their importance is considered uniform.

Recall indicates, for each category, the proportion of samples that correctly belong to that category and are correctly predicted by the model as being in that category. It is used to measure the model’s ability to recognize each category. The global recall is calculated by Macro-Average as in (44).

M a c r o R e c a l l = \frac{1}{N} \sum_{i = 1}^{N} \frac{T P_{i}}{T P_{i} + F N_{i}}

(44)

F1 Score is the harmonic mean of precision and recall, providing a more comprehensive assessment of the model’s performance metrics in terms of precision and recall. The global F1 Score is calculated by Macro-Average as in (45).

M a c r o F 1 s c o r e = \frac{1}{N} \sum_{i = 1}^{N} \frac{2 * \frac{T P_{i}}{T P_{i} + F P_{i}} * \frac{T P_{i}}{T P_{i} + F N_{i}}}{\frac{T P_{i}}{T P_{i} + F P_{i}} + \frac{T P_{i}}{T P_{i} + F N_{i}}} = \frac{2}{N} \sum_{i = 1}^{N} \frac{T P_{i}}{2 T P_{i} + F N_{i} + F P_{i}}

(45)

The models were permitted to make ten predictions on the validation set, and their fault diagnosis performance was evaluated using the aforementioned metrics. The results are presented in Figure 17, and Table 6 displays the average of accuracy, precision, recall, and F1 score for each model based on ten predictions across various tasks.

For Task A, Task B, and Task C, the accuracy of LL-FO-CE is 92.9%, 94.5%, and 87.9%, respectively, which shows the best performance among all models. The precision, recall, and F1 scores also represent the highest values across all models, indicating strong diagnosis accuracy and fault identification capability. In contrast, the other models struggled with fault recognition and diagnosis on the validation set for the first three tasks. This limitation arose because they could not maintain long-term memory of different tasks and gradually forgot the knowledge of previous tasks while continuously learning new tasks.

For Task D, the accuracy of LL-FO-CE, CNN-Transformer, Res-BiLSTM, ASLSTM, and TCN are 98.5%, 98%, 97.1%, 94.3%, and 92.9%, respectively. The precision scores are 98.6%, 98.1%, 98%, 92.1%, and 92.3%, respectively. The recall scores are 98.5%, 98%, 97.1%, 90.8%, and 92.4%. The F1 scores are 98.5%, 98.1%, 97.6%, 90.8%, and 92.2%, respectively. All models demonstrate improved performance on this task because Task D is the final task to be learned. However, the presence of disturbances in the fault sample waveforms results in a certain degree of degradation in the diagnosis performance of the other models. LL-FO-CE enhances the backpropagation optimization process through fractional order derivatives, which are highly robust, thereby maintaining superior diagnosis performance compared to other SOTA models when applied to actual fault waveforms with disturbances.

Figure 17 illustrates the overall average of 10 predictions made by each model across all tasks in the validation set. The different colored areas represent various metrics, while distinct colored bars indicate different models. For all 85 fault categories, LL-FO-CE demonstrates optimal performance across all metrics, achieving an average diagnosis accuracy of 93%. The other models, which can only perform better on Task D, exhibit performance averages ranging from 20% to 30% across all tasks. This figure visualizes the performance disparity between the proposed model and other SOTA models in the validation set, highlighting the advantages of the proposed model in diagnosing power converter faults under varying conditions.

6. Conclusions

A lifelong learning-enabled fractional order-convolutional encoder model is proposed for continuous learning as well as accurate diagnosis of OC faults in power converters under multi-conditions. The conclusions are as follows:

1. A convolutional encoder model is proposed for learning and diagnosing OC faults in the power converter. The model automatically extracts time series features from three-phase current fault signals and analyzes the relative positional relationships between each signal using a convolution module. It employs an encoder module to identify and classify these features, thereby enabling the automatic learning and diagnosis of fault samples.

2. The optimization process of the neural network model is improved by using the global search and smooth gradient properties of fractional order. The global search property enables the model to comprehensively consider historical gradient information during backpropagation, effectively exploits the long-term dependencies of fault samples. The smooth gradient property can effectively reduce the oscillation of the optimization path and accelerate the speed of the model convergence to the global optimal solution. Compared to integer-order models, the diagnosis accuracy improves by 2%, and the number of training epochs decreases by 115. It shows stronger robustness to abnormal noise signal samples, with an accuracy improvement of 2.75% compared to other models.

3. The designed multilevel lifelong learning framework equips the model with the ability to continuously learn various tasks. By incorporating loss function penalty terms, random replay, and soft labeling for previous tasks, the model is able to continue to learn new fault samples after training is completed and maintains stability and consistency across all fault classes. The study shows that the proposed model achieves 96.89% on the test set and 93% on the validation set after continuously learning fault samples from four different tasks of the power converter, which solves the problem of decreasing diagnosis accuracy of the conventional models when the operating state of the power converter changes, and is of great engineering practical significance to improve the operation safety and reliability of the motor drive system.

4. The lifelong learning-enabled fractional order-convolution encoder model proposed in this study has achieved significant results in diagnosing OC faults of power converters under multi-conditions. This approach has a wide range of applications in intelligent operation and maintenance systems for electric vehicles, renewable energy generation, rail transportation, etc. By facilitating real-time monitoring and early diagnosis of faults, it enables early warning and localization of faults, thus improving system safety and reliability while reducing maintenance costs. Future research will further focus on life prediction and reliability management technologies for power semiconductor devices based on Prognostics and Health Management. This will provide health monitoring and intelligent operation and maintenance support throughout the entire life cycle of power electronic systems.

Author Contributions

Conceptualization, T.L. and E.W.; methodology, T.L. and E.W.; validation, T.L. and E.W.; formal analysis, T.L.; investigation, E.W.; resources, E.W.; data curation, T.L. and E.W.; writing—original draft preparation, T.L. and E.W.; writing—review and editing, E.W.; Supervision, J.Y.; project administration, T.L. and J.Y.; funding acquisition, J.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (Nos. 62373142, 62173137, and 52172403), and the Natural Science Foundation of Hunan Province, China (No. 2024JJ7129).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare that this study received funding from the National Natural Science Foundation of China, and the Natural Science Foundation of Hunan Province, China. The funder was not involved in the study design, collection, analysis, interpretation of data, the writing of this article or the decision to submit it for publication. Authors Tao Li and Jun Yang were employed by the company Zhuzhou Times New Material Technology Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Moradzadeh, A.; Mohammadi-Ivatloo, B.; Pourhossein, K.; Anvari-Moghaddam, A. Data Mining Applications to Fault Diagnosis in Power Electronic Systems: A Systematic Review. IEEE Trans. Power Electron. 2022, 37, 6026–6050. [Google Scholar] [CrossRef]
Liang, J.P.; Zhang, K.; Al-Durra, A.; Muyeen, S.M.; Zhou, D.M. A state-of-the-art review on wind power converter fault diagnosis. Energy Rep. 2022, 8, 5341–5369. [Google Scholar] [CrossRef]
Li, W.Z.; Zhou, D.; Iannuzzo, F.; Hartmann, M.; Blaabjerg, F. Separation and Validation of Bond-Wire and Solder Layer Failure Modes in IGBT Modules. IEEE Trans. Ind. Appl. 2022, 58, 2324–2331. [Google Scholar] [CrossRef]
Xiao, Q.; Jin, Y.; Jia, H.J.; Tang, Y.; Cupertino, A.F.; Mu, Y.; Teodorescu, R.; Blaabjerg, F.; Pou, J. Review of Fault Diagnosis and Fault-Tolerant Control Methods of the Modular Multilevel Converter Under Submodule Failure. IEEE Trans. Power Electron. 2023, 38, 12059–12077. [Google Scholar] [CrossRef]
Reyes-Malanche, J.A.; Villalobos-Pina, F.J.; Cabal-Yepez, E.; Alvarez-Salas, R.; Rodriguez-Donate, C. Open-Circuit Fault Diagnosis in Power Inverters Through Currents Analysis in Time Domain. IEEE Trans. Instrum. Meas. 2021, 70, 3517512. [Google Scholar] [CrossRef]
Li, G.H.; Xu, S.; Sun, Z.Y.; Yao, C.X.; Ren, G.Z.; Ma, G.T. Open-Circuit Fault Diagnosis for Three-Level ANPC Inverter Based on Predictive Current Vector Residual. IEEE Trans. Ind. Appl. 2023, 59, 6837–6851. [Google Scholar] [CrossRef]
Li, Z.; Wheeler, P.; Watson, A.; Costabeber, A.; Wang, B.R.; Ren, Y.N.; Bai, Z.H.; Ma, H. A Fast Diagnosis Method for Both IGBT Faults and Current Sensor Faults in Grid-Tied Three-Phase Inverters With Two Current Sensors. IEEE Trans. Power Electron. 2020, 35, 5267–5278. [Google Scholar] [CrossRef]
Jiang, C.Y.; Liu, H.C.; Wheeler, P.; Wu, F.J.; Huo, J. An Open-Circuit Fault Detection Method of PMSM Fed by Dual Inverter With High Robustness. IEEE Trans. Energy Convers. 2023, 38, 1727–1737. [Google Scholar] [CrossRef]
Zhang, W.W.; He, Y.G. A Simple Open-Circuit Fault Diagnosis Method for Grid-Tied T-Type Three-Level Inverters With Various Power Factors Based on Instantaneous Current Distortion. IEEE J. Emerg. Sel. Top. Power Electron. 2023, 11, 1071–1085. [Google Scholar] [CrossRef]
Hu, Y.F.; Zhang, Z.; Sun, D.B.; Gu, C.J.; Li, Y.J. Fault Diagnosis of Full-Bridge Power Converter for SRMs Based on Modified Current Detection. IEEE J. Emerg. Sel. Top. Power Electron. 2024, 12, 1042–1053. [Google Scholar] [CrossRef]
Yan, H.; Xu, Y.X.; Cai, F.Y.; Zhang, H.; Zhao, W.D.; Gerada, C. PWM-VSI Fault Diagnosis for a PMSM Drive Based on the Fuzzy Logic Approach. IEEE Trans. Power Electron. 2019, 34, 759–768. [Google Scholar] [CrossRef]
Xing, Z.K.; He, Y.G.; Zhang, W.W. An Online Multiple Open-Switch Fault Diagnosis Method for T-Type Three-Level Inverters Based on Multimodal Deep Residual Filter Network. IEEE Trans. Ind. Electron. 2023, 70, 10669–10679. [Google Scholar] [CrossRef]
Yan, H.; Peng, Y.M.; Shang, W.J.; Kong, D.D. Open-circuit fault diagnosis in voltage source inverter for motor drive by using deep neural network. Eng. Appl. Artif. Intell. 2023, 120, 105866. [Google Scholar] [CrossRef]
Zhang, M.Y.; Zhang, Z.B.; Li, Z.; Chen, H.Y.; Zhou, D.H. A Unified Open-Circuit-Fault Diagnosis Method for Three-Level Neutral-Point-Clamped Power Converters. IEEE Trans. Power Electron. 2023, 38, 3834–3846. [Google Scholar] [CrossRef]
Ding, S.C.; Tang, D.W.; Hang, J.; Zhao, J.F.; Gui, S.N. Robust Open-Switch Fault Diagnosis of Bidirectional DC/DC Converters Based on Extended Kalman Filter With Multiple Corrections. IEEE Trans. Circuits Syst. I-Regul. Pap. 2024, 71, 4363–4374. [Google Scholar] [CrossRef]
Zhang, W.W.; He, Y.G.; Wang, X.; Chen, J.F. A Comprehensive Method for Online Switch Fault Diagnosis and Capacitor Condition Monitoring of Three-Level T-Type Inverters. IEEE Trans. Power Electron. 2023, 38, 10183–10195. [Google Scholar] [CrossRef]
Xu, S.Q.; Chen, X.Y.; Liu, F.; Wang, H.; Chai, Y.; Zheng, W.X.; Chen, H.T. A Novel Adaptive SMO-Based Simultaneous Diagnosis Method for IGBT Open-Circuit Faults and Current Sensor Incipient Faults of Inverters in PMSM Drives for Electric Vehicles. IEEE Trans. Instrum. Meas. 2023, 72, 3526915. [Google Scholar] [CrossRef]
Li, X.M.; Li, S.Z.; Chen, W.; Shi, T.N.; Xia, C.L. A Fast Diagnosis Strategy for Inverter Open-Circuit Faults Based on the Current Path of Brushless DC Motors. IEEE Trans. Power Electron. 2023, 38, 9311–9316. [Google Scholar] [CrossRef]
Huang, W.T.; Du, J.C.; Hua, W.; Lu, W.Z.; Bi, K.T.; Zhu, Y.X.; Fan, Q.G. Current-Based Open-Circuit Fault Diagnosis for PMSM Drives With Model Predictive Control. IEEE Trans. Power Electron. 2021, 36, 10695–10704. [Google Scholar] [CrossRef]
Zhou, D.H.; Qiu, H.; Yang, S.F.; Tang, Y. Submodule Voltage Similarity-Based Open-Circuit Fault Diagnosis for Modular Multilevel Converters. IEEE Trans. Power Electron. 2019, 34, 8008–8016. [Google Scholar] [CrossRef]
Cheng, Y.; Sun, Y.; Li, X.; Dan, H.; Lin, J.; Su, M. Active Common-Mode Voltage-Based Open-Switch Fault Diagnosis of Inverters in IM-Drive Systems. IEEE Trans. Ind. Electron. 2021, 68, 103–115. [Google Scholar] [CrossRef]
Song, C.C.; Sangwongwanich, A.; Yang, Y.H.; Blaabjerg, F. Open-Circuit Fault Diagnosis and Tolerant Control for 2/3-Level DAB Converters. IEEE Trans. Power Electron. 2023, 38, 5392–5410. [Google Scholar] [CrossRef]
Liang, J.P.; Zhang, K.; Al-Durra, A.; Zhou, D.M. A novel fault diagnostic method in power converters for wind power generation system. Appl. Energy 2020, 266, 114851. [Google Scholar] [CrossRef]
Gou, B.; Xu, Y.; Xia, Y.; Wilson, G.; Liu, S.Y. An Intelligent Time-Adaptive Data-Driven Method for Sensor Fault Diagnosis in Induction Motor Drive System. IEEE Trans. Ind. Electron. 2019, 66, 9817–9827. [Google Scholar] [CrossRef]
Gou, B.; Xu, Y.; Xia, Y.; Deng, Q.L.; Ge, X.L. An Online Data-Driven Method for Simultaneous Diagnosis of IGBT and Current Sensor Fault of Three-Phase PWM Inverter in Induction Motor Drives. IEEE Trans. Power Electron. 2020, 35, 13281–13294. [Google Scholar] [CrossRef]
Hu, H.L.; Feng, F.; Wang, T. Open-circuit fault diagnosis of NPC inverter IGBT based on independent component analysis and neural network. Energy Rep. 2020, 6, 134–143. [Google Scholar] [CrossRef]
Sarita, K.; Kumar, S.; Saket, R.K. OC fault diagnosis of multilevel inverter using SVM technique and detection algorithm. Comput. Electr. Eng. 2021, 96, 107481. [Google Scholar] [CrossRef]
Liang, J.P.; Zhang, K.; Al-Durra, A.; Zhou, D.M. A Multi-Information Fusion Algorithm to Fault Diagnosis of Power Converter in Wind Power Generation Systems. IEEE Trans. Ind. Inform. 2024, 20, 1167–1179. [Google Scholar] [CrossRef]
Zhao, S.; Chen, J.; Zhang, C.; He, Y. An online open circuit faults diagnosis method for converter using the lightweight two-channel deep network. Measurement 2024, 243, 116213. [Google Scholar] [CrossRef]
Li, D.; Li, C.; Yang, J.; Chen, Z.; Liu, X.; Wang, X.; Yang, J.; Li, T. Bayesian optimization-attention-feedforward neural network based train traction motor-gearbox coupled noise prediction. Measurement 2024, 238, 115323. [Google Scholar] [CrossRef]
Deng, X.; Wan, C.G.; Jiang, L.; Gao, G.; Huang, Y. Open-Switch Fault Diagnosis of Three-Phase PWM Converter Systems for Magnet Power Supply on EAST. IEEE Trans. Power Electron. 2023, 38, 1064–1078. [Google Scholar] [CrossRef]
Kou, L.; Liu, C.; Cai, G.W.; Zhang, Z. Fault Diagnosis for Power Electronics Converters based on Deep Feedforward Network and Wavelet Compression. Electr. Power Syst. Res. 2020, 185, 106370. [Google Scholar] [CrossRef]
Han, Y.M.; Qi, W.; Ding, N.; Geng, Z.Q. Short-Time Wavelet Entropy Integrating Improved LSTM for Fault Diagnosis of Modular Multilevel Converter. IEEE Trans. Cybern. 2022, 52, 7504–7512. [Google Scholar] [CrossRef]
Zhang, S.Q.; Wang, R.J.; Wang, L.B.; Si, Y.P.; Lin, A.H.; Wang, Y.C. Fault Diagnosis for Power Converters Based on Incremental Learning. IEEE Trans. Instrum. Meas. 2023, 72, 3512813. [Google Scholar] [CrossRef]
Si, Y.P.; Wang, R.J.; Zhang, S.Q.; Zhou, W.T.; Lin, A.H.; Wang, Y.C. Fault Diagnosis Based on Attention Collaborative LSTM Networks for NPC Three-Level Inverters. IEEE Trans. Instrum. Meas. 2022, 71, 3512416. [Google Scholar] [CrossRef]
Li, T.; Wu, X.; Luo, Z.; Chen, Y.; He, C.; Ding, R.; Zhang, C.; Yang, J. A Bearing Fault Diagnosis Method under Small Sample Conditions Based on the Fractional Order Siamese Deep Residual Shrinkage Network. Fractal Fract. 2024, 8, 134. [Google Scholar] [CrossRef]
Xu, X.; Li, B.; Qiao, Z.; Shi, P.; Shao, H.; Li, R. Caputo-Fabrizio fractional order derivative stochastic resonance enhanced by ADOF and its application in fault diagnosis of wind turbine drivetrain. Renew. Energy 2023, 219, 119398. [Google Scholar] [CrossRef]
Li, S.-Y.; Tam, L.-M.; Wu, S.-P.; Tsai, W.-L.; Hu, C.-W.; Cheng, L.-Y.; Xu, Y.-X.; Cheng, S.-C. The Performance Investigation of Smart Diagnosis for Bearings Using Mixed Chaotic Features with Fractional Order. Sensors 2023, 23, 3801. [Google Scholar] [CrossRef]
Alzubaidi, L.; Zhang, J.L.; Humaidi, A.J.; Al-Dujaili, A.; Duan, Y.; Al-Shamma, O.; Santamaría, J.; Fadhel, M.A.; Al-Amidie, M.; Farhan, L. Review of deep learning: Concepts, CNN architectures, challenges, applications, future directions. J. Big Data 2021, 8, 1–74. [Google Scholar] [CrossRef]
Chen, S.S.; Guo, W. Auto-Encoders in Deep Learning-A Review with New Perspectives. Mathematics 2023, 11, 1777. [Google Scholar] [CrossRef]
Zhong, H.Y.; Lv, Y.; Yuan, R.; Yang, D. Bearing fault diagnosis using transfer learning and self-attention ensemble lightweight convolutional neural network. Neurocomputing 2022, 501, 765–777. [Google Scholar] [CrossRef]
Wang, L.; Zhang, X.; Su, H.; Zhu, J. A Comprehensive Survey of Continual Learning: Theory, Method and Application. IEEE Trans. Pattern Anal. Mach. Intell. 2024, 46, 5362–5383. [Google Scholar] [CrossRef]
Gou, J.P.; Sun, L.Y.; Yu, B.S.; Wan, S.H.; Ou, W.H.; Yi, Z. Multilevel Attention-Based Sample Correlations for Knowledge Distillation. IEEE Trans. Ind. Inform. 2023, 19, 7099–7109. [Google Scholar] [CrossRef]
Hihn, H.; Braun, D.A. Online continual learning through unsupervised mutual information maximization. Neurocomputing 2024, 578, 127422. [Google Scholar] [CrossRef]
Pascanu, R.; Bengio, Y. Revisiting Natural Gradient for Deep Networks. Comput. Sci. 2014, 37, 1655–1658. [Google Scholar]
Jung, M.; Lee, J.; Kim, J. A lightweight CNN-transformer model for learning traveling salesman problems. Appl. Intell. 2024, 54, 7982–7993. [Google Scholar] [CrossRef]
Xie, J.L.; Shi, W.F.; Shi, Y.Q. Research on Fault Diagnosis of Six-Phase Propulsion Motor Drive Inverter for Marine Electric Propulsion System Based on Res-BiLSTM. Machines 2022, 10, 736. [Google Scholar] [CrossRef]
Gao, Y.T.; Wang, W.; Lin, Q.B.; Cai, F.H.; Chai, Q.Q. Fault Diagnosis for Power Converters Based on Optimized Temporal Convolutional Network. IEEE Trans. Instrum. Meas. 2021, 70, 1110. [Google Scholar] [CrossRef]

Figure 1. Motor drive system structure.

Figure 2. Power converter circuit diagram.

Figure 3. Three-phase current simulated waveform of IGBT OC fault under Task A, where I–IV represent different segments of the waveform. (a) S₁ OC fault. (b) S₁ S₄ OC fault. (c) S₁ S₃ OC fault. (d) S₁ S₆ OC fault.

Figure 4. Three-phase current simulated waveform of IGBT OC fault under Task B, where I–IV represent different segments of the waveform. (a) S₁ OC fault. (b) S₁ S₄ OC fault. (c) S₁ S₃ OC fault. (d) S₁ S₆ OC fault.

Figure 5. Three-phase current simulated waveform of IGBT OC fault under Task C, where I–IV represent different segments of the waveform. (a) S₁ OC fault. (b) S₁ S₄ OC fault. (c) S₁ S₃ OC fault. (d) S₁ S₆ OC fault.

Figure 6. Three-phase current simulated waveform of IGBT OC fault under Task D, where I–IV represent different segments of the waveform. (a) S₁ OC fault. (b) S₁ S₄ OC fault. (c) S₁ S₃ OC fault. (d) S₁ S₆ OC fault.

Figure 7. Lifelong learning-enabled fractional order-convolutional encoder model for open-circuit fault diagnosis of power converters under multi-conditions flow chat.

Figure 8. The optimal training epochs for the model across different tasks.

Figure 9. Optimal fractional order of derivative.

Figure 10. The fault diagnosis accuracy of each model on the random pulse noise signal dataset.

Figure 11. The t-SNE plots of each model on the random pulse noise signal dataset. (a) The t-SNE plot for each model on Task A. (b) The t-SNE plot for each model on Task B. (c) The t-SNE plot for each model on Task C. (d) The t-SNE plot for each model on Task D.

Figure 12. Accuracy of each model on the validation set during continuous learning of different tasks.

Figure 13. Stacked plots of diagnosis results and confusion matrices for all fault categories of each model. (a) Stacked confusion matrix of fault diagnosis for each model on the test set. (b) Confusion matrix of fault diagnosis for LL-FO-CE model on the test set. (c) Confusion matrix of fault diagnosis for CNN-Transformer model on the test set. (d) Confusion matrix of fault diagnosis for Res-BiLSTM model on the test set. (e) Confusion matrix of fault diagnosis for ASLSTM model on the test set. (f) Confusion matrix of fault diagnosis for TCN model on the test set.

Figure 14. Comparison of diagnosis time of each model in the test set.

Figure 15. Semi-physical virtual simulation system for power converter fault diagnosis.

Figure 16. Partial fault waveforms for semi-physical virtual simulation system. (a) Fault waveform of a three-phase circuit with fault label 2. (b) Fault waveform of a three-phase circuit with fault label 10. (c) Fault waveform of a three-phase circuit with fault label 23. (d) Fault waveform of a three-phase circuit with fault label 31. (e) Fault waveform of a three-phase circuit with fault label 44. (f) Fault waveform of a three-phase circuit with fault label 52. (g) Fault waveform of a three-phase circuit with fault label 65. (h) Fault waveform of a three-phase circuit with fault label 73.

Figure 17. Overall average of various models across 10 predictions under all tasks.

Table 1. Four operating tasks of the power converter.

Tasks	Power Converter Conditions	Type of Motor
Task A	Inverter	AC Induction Motor
Task B	Rectifier	AC Induction Motor
Task C	Inverter	PMSM
Task D	Rectifier	PMSM

Table 2. Component parameters of the simulation circuit.

Symbol	Explanation	Value
V_D	DC source	800 V
R₁	Bleeder resistor	10⁻⁶ Ω
R₂	Damping resistor	3 Ω
C₁	DC-link capacitor	5.6 × 10⁻³ F
C₂	Damping Capacitor	10⁻⁴ F
L₁	Converter-side inductor	8 × 10⁻⁴ H
L₂	Grid-side inductor	2 × 10⁻⁴ H

Table 3. Labeling of power converter OC faults under different tasks.

OC Fault Device		Normal	S₁	S₂	S₃	S₄	S₅	S₆	S₁·S₂	S₁·S₃	S₁·S₄	S₁·S₅
Fault Label	Task A	1	2	3	4	5	6	7	8	9	10	11
	Task B		23	24	25	26	27	28	29	30	31	32
	Task C		44	45	46	47	48	49	50	51	52	53
	Task D		65	66	67	68	69	70	71	72	73	74
OC Fault Device		S₁·S₆	S₂·S₃	S₂·S₄	S₂·S₅	S₂·S₆	S₃·S₄	S₃·S₅	S₃·S₆	S₄·S₅	S₄·S₆	S₅·S₆
Fault Label	Task A	12	13	14	15	16	17	18	19	20	21	22
	Task B	33	34	35	36	37	38	39	40	41	42	43
	Task C	54	55	56	57	58	59	60	61	62	63	64
	Task D	75	76	77	78	79	80	81	82	83	84	85

Table 4. Optimal hyperparameters.

Hyperparameter		Explanation	Best Value
Epoch	Stage I	Training epoch	25
	Stage II		40
	Stage III		90
	Stage IV		170
μ		Learning rate	10⁻⁴
α		Fractional order	1.2
β		Weight parameter in knowledge distillation	0.5
T		Temperature coefficient in knowledge distillation	5
Loss	L_CE	Loss function	Cross entropy
Loss	L_KL	Loss function	Kullback–Leibler divergence
λ		Regularization coefficient	0.05
Batch_size		Batch size	64
Replay_batch_size		Replay batch size	16

Table 5. Main parameters of double closed-loop control based on dq rotating coordinate system.

Parameters	Explanation	Value
K_p₁	Voltage outer ring scale factor	0.5
K_p₂	Current inner loop scaling factor	30
K_i₁	Voltage outer loop integration factor	210
K_i₂	Current inner loop integration factor	510
V_DC^*	DC Voltage Reference	800
i_q^*	Reactive current reference value	0

Table 6. The average accuracy, precision, recall, and f1 score of various models across ten predictions under four tasks.

Model	Task A			Task B					Task C				Task D
Model	Acc.	Pre.	Rec.	F1.	Acc.	Pre.	Rec.	F1.	Acc.	Pre.	Rec.	F1.	Acc.	Pre.	Rec.	F1.
LL-FO-CE	0.929	0.935	0.931	0.933	0.945	0.815	0.807	0.811	0.879	0.847	0.806	0.827	0.985	0.986	0.985	0.985
CNN-Transformer	0.043	0.008	0.025	0.012	0.044	0.024	0.023	0.024	0.234	0.184	0.224	0.154	0.970	0.971	0.970	0.971
Res-BiLSTM	0.045	0.002	0.031	0.003	0.044	0.022	0.023	0.023	0.044	0.002	0.045	0.004	0.971	0.980	0.971	0.976
ASLSTM	0.024	0.001	0.019	0.003	0.015	0.015	0.008	0.011	0.045	0.002	0.04	0.004	0.943	0.921	0.908	0.908
TCN	0.045	0.003	0.028	0.006	0.045	0.016	0.024	0.019	0.045	0.002	0.033	0.003	0.929	0.923	0.924	0.922

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Li, T.; Wang, E.; Yang, J. Lifelong Learning-Enabled Fractional Order-Convolutional Encoder Model for Open-Circuit Fault Diagnosis of Power Converters Under Multi-Conditions. Sensors 2025, 25, 1884. https://doi.org/10.3390/s25061884

AMA Style

Li T, Wang E, Yang J. Lifelong Learning-Enabled Fractional Order-Convolutional Encoder Model for Open-Circuit Fault Diagnosis of Power Converters Under Multi-Conditions. Sensors. 2025; 25(6):1884. https://doi.org/10.3390/s25061884

Chicago/Turabian Style

Li, Tao, Enyu Wang, and Jun Yang. 2025. "Lifelong Learning-Enabled Fractional Order-Convolutional Encoder Model for Open-Circuit Fault Diagnosis of Power Converters Under Multi-Conditions" Sensors 25, no. 6: 1884. https://doi.org/10.3390/s25061884

APA Style

Li, T., Wang, E., & Yang, J. (2025). Lifelong Learning-Enabled Fractional Order-Convolutional Encoder Model for Open-Circuit Fault Diagnosis of Power Converters Under Multi-Conditions. Sensors, 25(6), 1884. https://doi.org/10.3390/s25061884

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Lifelong Learning-Enabled Fractional Order-Convolutional Encoder Model for Open-Circuit Fault Diagnosis of Power Converters Under Multi-Conditions

Abstract

1. Introduction

2. Fault Analysis

2.1. Fault Analysis of the Power Converter Working Under Inverter Condition

2.2. Fault Analysis of Power Converter Working Under Rectifier Condition

3. Approach

4. Simulation

4.1. Fractional Order Design

4.2. Performance Study of the Lifelong Learning Framework

4.3. Research on Fault Diagnosis Performance

5. Semi-Physical Experiments

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI