1. Introduction
As critical components for DC-AC power conversion, inverters serve as the technological cornerstone, enabling precise control and the efficient energy conversion in modern power systems. The cascaded H-bridge multilevel inverter (CHMI), recognized as a high-performance multilevel converter, offers several advantages including good harmonic characteristics, reduced dv/dt, enhanced modularity and high efficiency [
1,
2]. Consequently, the CHMI has been widely adopted in critical domains, including renewable energy generation systems, electric drive applications, and charging stations [
3,
4,
5]. Compared to two-level inverters, the CHMI has more power semiconductor devices, posing a challenge to its reliable operation. The research effort presented in this work aims to apply automated machine learning techniques to learn circuit behavior and fault characteristics, thereby improving and accelerating inverter fault diagnosis.
Within inverter systems, insulated gate bipolar transistors (IGBTs) are widely employed due to their favorable electrical characteristics [
4]. However, silicon-based semiconductor materials have inherent physical limitations, and achieving nanosecond-level switching precision requires complex control strategies. These factors make IGBT modules particularly susceptible to various failure mechanisms [
5,
6]. Although the modular structure of the CHMI facilitates extension to higher output voltage levels, the probability of inverter failure increases significantly as the number of inverter levels rises [
7]. Two common types of IGBT failures are short-circuit faults (SCFs) and open-circuit faults (OCFs). SCFs can induce severe current distortion and transient thermal effects within an extremely short time (<10
) and have been effectively addressed through mature hardware protection mechanisms [
8,
9]. In practical applications, protection circuits are typically integrated into IGBT driver modules to convert SCFs into OCFs [
10,
11]. In contrast, OCFs manifest as distorted currents and voltages, which can severely affect system operation and may further damage the remaining devices. In CHMI systems, the integration of multiple power modules introduces additional challenges for accurate and rapid OCF diagnosis compared with conventional two-level inverters. Therefore, developing robust fault diagnosis strategies specifically targeting OCFs is essential for reliable localization and identification [
12].
Previous diagnosis strategies for inverter OCFs can be broadly categorized into model-based methods and data-driven methods [
13]. Model-based fault diagnosis methods have been extensively studied over the past decades and constitute a well-established research paradigm. Model-based methods are centered around constructing precise mathematical models and formulating explicit numerical expressions for complex systems [
14]. For instance, a mixed logical dynamic (MLD) model is proposed for diagnosing IGBT OCFs in a single-phase three-level neutral-point-clamped (NPC) inverter, where fault conditions are inferred through logical constraints embedded in the system model [
15]. Another representative line of research employs sliding mode observers to monitor circuit operating states, in which OCF occurrence is detected when the estimated circulating current exhibits a significant deviation from the measured current [
16]. Model-based fault diagnosis methods are characterized by high diagnostic speed and clear physical interpretability. However, their performance is usually sensitive to model-parameter accuracy and threshold selection. As power electronic systems become more complex, constructing accurate models and implementing reliable diagnostic logic become increasingly difficult, which may limit the applicability of traditional model-based methods [
17].
Driven by the rapid development of industrial digitalization and the exponential growth of hardware computing power, data-driven fault diagnosis methods have attracted significant research attention. These approaches analyze the relationship between measured voltage and current signals and inverter operating states. By establishing correlations between operational data and fault conditions through systematic data analysis, data-driven methods enable effective fault detection and localization. In power converter fault diagnosis, shallow machine-learning methods based on feature engineering are often combined with signal processing techniques. Moradzadeh et al. [
18] reviewed the application of data-driven methods in fault diagnosis of power electronics systems and evaluated some data mining techniques. Wang et al. [
19] performed CHMI fault diagnosis using only phase-voltage signals. In their approach, the Fast Fourier Transform (FFT) was employed for feature extraction, followed by Relative Principal Component Analysis (RPCA) to reduce the dimensionality of the feature space. Gomathy et al. [
20] applied Discrete Wavelet Transform (DWT) and Principal Component Analysis (PCA) to extract fault-related features from current signals. However, these methods are typically designed for specific converter topologies and heavily rely on manually engineered feature extraction procedures. Moreover, their diagnostic performance strongly depends on the selection of fault-sensitive signals, feature extraction strategies, dimensionality reduction methods, and classifier parameters. Feature engineering, which includes data preprocessing, feature extraction, and feature selection, remains a classic procedure that has achieved remarkable results [
21]. Deep learning methods integrate these stages into a unified framework that can be jointly optimized to maximize classification performance, thereby demonstrating powerful feature learning capabilities. Kiranyaz et al. [
22] proposed a real-time diagnostic framework for OCFs in power-switching devices using one-dimensional convolutional neural networks (1D-CNNs). Xing et al. [
23] achieved high efficiency and robustness using a deep residual filter network. However, the feature extraction capability of CNNs may be limited when using 1D feature representations. To better exploit the representation ability of CNNs, Wang et al. [
24] proposed a method that transformed three-phase current signals into input feature maps, achieving improved fault diagnosis accuracy compared to prior studies. Wen et al. [
25] attempted to use a special method to convert raw signal data into two-dimensional (2D) images and achieved high diagnostic accuracy using 2D-CNNs. Deng et al. [
26] proposed a diagnostic strategy based on sliding-time window to construct concise feature samples for modular multilevel converters (MMCs) in both time and frequency domains. Nevertheless, most existing deep learning-based methods still rely on manually designed network architectures, which heavily depend on expert experience and tedious trial-and-error tuning. This inevitably limits their adaptability to diverse fault patterns and practical deployment constraints.
For CHMI OCF diagnosis, the above limitation becomes more critical when highly similar fault modes are considered. In such cases, different fault modes may exhibit nearly identical global voltage waveforms, while the discriminative information is mainly reflected in weak and localized voltage signal distortions. Consequently, handcrafted time-domain, frequency-domain, or statistical features may be insufficient to stably capture these fine-grained local differences. To enhance feature discriminability, Liu et al. [
27] proposed a PCA-based principal component rearrangement (PCR) method. However, such PCA/PCR-based methods still rely on domain expertise for feature selection, component retention, and parameter tuning, which may limit their adaptability to diverse operating conditions and highly similar fault patterns. Some active diagnosis strategies improve the separability of similar fault modes by adjusting modulation schemes or operating states after fault detection [
28]. Although such methods enhance fault distinguishability, they introduce additional control complexity and degrade waveform quality, e.g., increasing total harmonic distortion (THD). Therefore, a diagnostic method that automatically extracts discriminative local fault features, reducing reliance on manual feature engineering design and expert intervention, while preserving the original modulation scheme is highly desirable.
Although deep learning methods have significantly improved diagnostic accuracy, their practical deployment in resource-constrained industrial environments remains challenging. In many existing studies, the proposed models are mainly evaluated through PC-based simulations or laboratory validations, which often lead to computationally intensive network architectures that are difficult to deploy in practical applications. Specialized AI hardware platforms, such as application-specific integrated circuits (ASICs) and field-programmable gate arrays (FPGAs), typically feature a relatively closed and less mature software ecosystem and also require specialized hardware design expertise [
29]. In contrast, edge computing platforms, such as the NVIDIA Jetson Orin Nano, provide a flexible and developer-friendly software ecosystem. It is compatible with mainstream deep learning frameworks and inference acceleration libraries [
30]. This allows researchers to focus on algorithm development while benefiting from low deployment cost, simplified implementation, and shortened development cycles [
31,
32]. Furthermore, the NVIDIA Jetson platform provides a mature workflow for model compression and optimization, facilitating the efficient deployment of lightweight fault diagnosis models. However, most existing deep-learning-based fault diagnosis models still rely on manually designed network architectures, where key structural choices, such as convolutional operations, network depth, channel width, and downsampling strategy, are usually determined by expert experience and iterative hyperparameter tuning. Such a manual design process may lead to suboptimal architectures for specific diagnostic tasks, especially when the discriminative fault information is weak and locally distributed. For CHMI fault diagnosis, inappropriate architecture design may either weaken fine-grained voltage distortions or introduce unnecessary computational overhead, which is unfavorable for edge deployment. This practical limitation motivates the use of neural architecture search (NAS). Liu et al. [
33] proposed a differentiable architecture search (DARTS), which relaxes the discrete architecture selection problem into a continuous optimization problem and enables gradient-based architecture optimization. Compared with reinforcement-learning-based and evolutionary-based NAS methods [
34,
35], DARTS substantially reduces computational costs while maintaining competitive search performance. For fault diagnosis tasks, this capability is particularly valuable, as it enables the automatic discovery of network architectures adapted to task-specific fault patterns without extensive manual intervention.
In this paper, a data-driven OCF diagnosis method for CHMI based on differentiable architecture search is proposed. The proposed framework automatically derives task-specific lightweight neural architectures for accurate inverter fault diagnosis while facilitating efficient deployment on edge platforms. The main contributions of this paper are summarized as follows:
- (1)
An edge-oriented intelligent fault diagnosis method for CHMI is proposed. The method leverages DARTS to automatically derive task-specific diagnostic models, thereby reducing reliance on manual network design and handcrafted feature engineering.
- (2)
A differentiable special cell search strategy is proposed to improve search efficiency and enable lightweight architecture design suitable for practical deployment.
- (3)
A practical model deployment and inference optimization scheme is established by integrating the searched model with TensorRT, enabling efficient inference on resource-constrained edge devices.
The remainder of this paper is organized as follows.
Section 2 analyzes the CHMI fault mechanism and clarifies the challenges of diagnosing similar fault modes.
Section 3 presents the proposed fault diagnosis method.
Section 4 provides experimental validation and performance evaluation under different operating conditions, including edge deployment experiments. Finally,
Section 5 provides a discussion of the experimental results and limitations of the proposed method, while
Section 6 concludes this paper and discusses future research directions.
5. Discussion and Limitations
This paper integrates neural architecture search (NAS) with edge deployment for inverter fault diagnosis. Different from conventional fault diagnosis methods that rely on manually designed feature extraction procedures or manual neural network architectures, the proposed PC-DARTS-SC framework automatically searches for a task-adaptive diagnostic architecture based on measured voltage signals. The experimental results show that the searched architecture is effective in distinguishing highly similar single-IGBT OCF modes under the investigated laboratory conditions. In particular, the proposed special cell search strategy avoids premature feature-map downsampling and helps preserve weak and localized voltage distortions in the reconstructed voltage samples. In addition, the deployment results on the Jetson Orin Nano platform indicate that the searched lightweight model can achieve low-latency inference on resource-constrained edge hardware.
Nevertheless, several limitations should be acknowledged. First, the experiments are conducted on a single-phase five-level CHMI prototype, controlled single-IGBT OCF cases, and two investigated RL load conditions. Therefore, the reported results mainly demonstrate the feasibility and effectiveness of the proposed method within the considered experimental domain. Broader generalization to different inverter configurations, hardware platforms, control strategies, and operating conditions has not been fully validated in this study and requires further investigation. Second, although the fault data are experimentally measured from a physical inverter prototype, the faults are emulated under controlled laboratory conditions by disabling the corresponding IGBT gate-drive signals. This setting enables safe, repeatable, and balanced data acquisition, but it cannot fully reproduce naturally developed faults, long-term device aging, sensor drift, load transients, or complex electromagnetic interference in practical applications.
Regarding the data splitting strategy, the adopted cycle-level split avoids direct overlap between samples because each sample corresponds to one complete fundamental voltage cycle and no overlapping sliding window is used. However, this cycle-level random split does not guarantee strict temporal independence between the training and test samples, since adjacent voltage cycles may still originate from the same continuous acquisition sequence and may exhibit temporal correlation. Therefore, the current results should be interpreted as diagnostic performance under the adopted cycle-level non-overlapping split strategy. A stricter time-ordered or cycle-block-based split will be further investigated in future work to assess the influence of temporal dependence on diagnostic performance. In addition, the proposed framework includes an offline architecture search stage, which introduces extra computational cost during model development. However, this search process is performed before deployment, and only the final compact diagnostic model is used for inference. Future work will further investigate cross-condition adaptation, validation on different inverter platforms, more diverse fault types, and field-measured fault data to improve the practical applicability of the proposed method.