1. Introduction
In the first half of 2025, 140,000 fires caused by electrical faults were reported across China, accounting for 25.4% of all reported fires during this period, the largest share of any fire cause [1]. The difficulty in accurately detecting arc faults makes them a persistent and major cause of electrical fires. Consequently, research into effective detection techniques is critically needed.
Traditional arc fault detection technologies primarily rely on the physical characteristics of arcs and time-frequency domain features. Zhao S. et al. [
2] extracted the electromagnetic radiation signals from steadily burning arcs in different DC systems to develop a fault detection method based on steady patterns in the frequency domain, such as the structural similarity index and 6 dB bandwidth bins. This approach can effectively avoid nuisance tripping and demonstrates good adaptability across different DC power systems. Xiong Qing et al. [
3] captured arc-induced high-frequency signals using parallel capacitors and employed a Rogowski coil to measure the amplitude of current pulses and differences in the integrated Fast Fourier Transform, enabling the detection and localization of series arc faults. Their method proved robust under various environmental and load conditions. Nashrulloh M. et al. [
4] investigated the current spectrum characteristics at different arc fault locations through PSIM simulation and MATLAB-based FFT analysis, achieving fault location by distinguishing harmonic components. However, this method is dependent on simulation environments and fixed load conditions. Kim et al. [
5] proposed detecting series AC arc faults using only voltage waveforms, identifying them by analyzing the unique symmetric energy profile formed by harmonics generated during arc ignition and extinction. Kavi et al. [
6] introduced a time-domain technique based on mathematical morphology, termed the Decomposed Open-Close Alternating Sequence (DOCAS), for large grid-connected photovoltaic systems. This method detects faults by correlating sustained random spikes in the algorithm’s output with the rate of change in DC arc current and voltage and utilizes the increased effective fault resistance for localization and noise suppression. For naval shipboard DC power systems with pulsed loads, Maqsood et al. [
7] adopted a clustering-based approach to extract unique feature vectors from Short-Time Fourier Transform (STFT) analysis, enabling the differentiation of load transients, shunt faults, and series arcing faults. Balamurugan et al. [
8] employed a computer-controlled mechatronic testbed to generate repeatable arc conditions and compared the effectiveness of Fast Fourier Transform (FFT) and STFT in analyzing arc voltage/current waveforms for PV arc fault detection. Cho et al. [
9] focused on optimizing frequency feature extraction for DC series arcs through FFT analysis by adjusting sampling frequencies and points, aiming to distinguish fault conditions from normal operations. He et al. [
10] proposed a dual-signal multi-timescale feature extraction framework guided by arcing physical characteristics, combined with a decision tree and criteria, achieving high detection accuracy in complex scenarios involving diverse loads, topologies, and arc-generating modes. Xiong et al. [
11] developed an accurate DC arc model incorporating steady-state impedance, high-frequency, and dynamic characteristics, and proposed a detection algorithm that integrates a K-line diagram with the spectrum integral difference in arc current. This approach can effectively discriminate between normal operation, arc faults, switching actions, and load mutations.
In recent years, significant advancements in artificial intelligence have provided new solutions for arc fault detection. Deep learning methods can automatically extract deeper features from raw signals, enabling more accurate fault identification. Numerous studies have focused on different approaches to data preprocessing, feature extraction, and model architecture. For instance, Y. Wang et al. [
12] proposed a hybrid detection method based on improved Mel-Frequency Cepstral Coefficients (MFCC) preprocessing and a lightweight neural network, achieving high recognition accuracy under various loads. Lu S. et al. [
13] addressed arc detection in photovoltaic systems using a Lightweight Transfer Convolutional Neural Network with Adversarial Data Augmentation (LTCNN-ADA), tackling challenges such as the discrepancy between source and target domain data and the scarcity of fault data in the target domain. Q. Yu et al. [
14] focused on low-voltage three-phase systems, proposing a fault arc phase selection method based on a global temporal convolutional network, which enhances feature extraction through an attention mechanism. However, such end-to-end intelligent diagnostic methods still commonly face issues including lack of model interpretability [
15], shortage of training data, and limited generalization capability due to models often being trained on single-load scenarios. To address these challenges, subsequent research has explored various improvements in feature enhancement, data representation, and model efficiency. Chu et al. [
16] proposed converting time-domain current signals into grayscale images and feeding them into a Long Short-Term Memory (LSTM) network for identification, achieving good performance in residential applications, though with reduced accuracy for thyristor-based loads. Yang et al. [
17] innovatively transformed current time series into visibility graphs and utilized a graph convolutional network for learning, improving detection robustness in environments with variable loads. Park et al. [
18] integrated artificial intelligence with Time-Frequency Domain Reflectometry (TFDR), employing denoising autoencoders and generative adversarial networks to enhance noise-resistant detection of series arc faults in DC grids. At the level of feature engineering, Dai et al. [
19] transformed signals into images via a relative position matrix, employing a mixed-attention residual network to detect singular features. Qu et al. [
20] proposed a multi-domain deep feature association framework, which integrates time-domain, frequency-domain, and wavelet packet energy features through a stacked neural network. Gong et al. [
21] devised a detection model that synergizes wavelet transform, eigenvalue decomposition, and a deep neural network, notable for its minimal input requirements and training efficiency. For multi-branch load circuits, Tian et al. [
22] introduced a feature enhancement method based on seasonal-trend decomposition and recursive least squares, effectively extracting fault features obscured by normal operating signals. Although these methods have achieved high accuracy under specific conditions, their generalizability, real-time performance, and interpretability in broader, more complex real-world scenarios require further verification and improvement.
The effectiveness of conventional machine learning algorithms in arc fault diagnosis is often constrained by their dependence on manually selected features [
23]. In contrast, prototype learning has emerged as a promising paradigm that integrates representation learning with case-based reasoning [
24]. It operates by learning representative prototypes (typical samples or patterns) for each category. During prediction, the model makes decisions by comparing the similarity between an input sample and all learned prototypes.
This paper proposes an arc fault detection method that combines a hybrid attention mechanism for extracting key arc features with the ideas of prototype learning. Experimental results demonstrate the effectiveness of the method.
The main contributions of this paper are as follows.
- (1)
An arc test platform covering 12 kinds of household loads is established. The proposed detection method achieves an accuracy of no less than 99% when an arc occurs in different load branches. After deployment to hardware, the method realizes arc fault detection under multi-load conditions.
- (2)
This paper proposes a hybrid attention mechanism to extract multi-dimensional arc features, facilitating the construction of a prototype set. Building on these features, we establish an arc fault detection model. The three-dimensional features serve as spatial coordinates, enabling the visualization of arc characteristics and thereby enhancing model interpretability.
- (3)
To achieve precise visualization of the prototype set, we propose a novel convex hull algorithm that iteratively approximates and refines the decision boundaries under different working conditions, thereby enhancing the representational accuracy of the prototype set.
3. Improved Attention Mechanism Fusion Feature Extraction Method
3.1. Prototype Learning
Prototype learning is a methodology rooted in metric learning. Its core idea is to learn a representative prototype for each class and perform classification or representation learning based on the distance between a query sample and each class prototype. This approach typically employs an embedding function that maps input data into a low-dimensional space, where samples cluster around their corresponding prototypes while prototypes of different classes are separated from each other. Owing to this mechanism, prototype learning can rapidly establish and update class representations with only a few samples, thereby effectively addressing classification problems under data scarcity. Moreover, it is structurally simple, computationally efficient, interpretable, and exhibits strong generalization capability.
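The nearest-prototype decision rule described above can be sketched in a few lines. This is an illustrative NumPy sketch under simple assumptions (class prototypes as mean embeddings, Euclidean distance); the function names are ours, not the paper's.

```python
import numpy as np

def build_prototypes(embeddings, labels):
    """Average the embedded samples of each class into one prototype vector."""
    classes = sorted(set(labels))
    return {c: np.mean([e for e, l in zip(embeddings, labels) if l == c], axis=0)
            for c in classes}

def classify(query, prototypes):
    """Assign the query to the class whose prototype is nearest (Euclidean)."""
    return min(prototypes, key=lambda c: np.linalg.norm(query - prototypes[c]))

# Toy example: two well-separated classes in a 2-D embedding space.
emb = np.array([[0.0, 0.1], [0.1, 0.0], [1.0, 1.1], [1.1, 1.0]])
lab = [0, 0, 1, 1]
protos = build_prototypes(emb, lab)
print(classify(np.array([0.05, 0.05]), protos))  # → 0 (nearest to class-0 prototype)
```

Because only the prototypes need to be stored and compared, class representations can be built or updated from a handful of samples, which is the property exploited under data scarcity.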
3.2. Multidimension Feature Extraction
In low-voltage AC systems, a series arc fault is characterized by a significant increase in line current harmonics and the emergence of a current-zero ‘flat shoulder’ in the time domain, alongside distinct high-frequency spectral signatures. To effectively construct a representative arc fault prototype and facilitate subsequent visualization, we select a tri-dimensional feature set encompassing the time domain, frequency domain, and the time derivative (rate of change). This multi-perspective approach leverages comprehensive signal information, thereby enhancing prototype representativeness. Time-domain features provide raw morphological information, offering the most intuitive depiction. Frequency-domain features reveal spectral composition, explaining the signal’s frequency structure. The time derivative captures the dynamic characteristics by describing the signal’s instantaneous rate of change. By integrating these three complementary dimensions within a unified feature space, the constructed prototype can characterize the target fault mode more comprehensively and intrinsically, mitigating the limitations inherent in any single-dimensional representation.
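The three complementary views of one sampled current cycle can be sketched as follows. This is an illustrative reduction, not the paper's exact pipeline: the FFT magnitude stands in for the frequency-domain features, and the first difference is a discrete stand-in for the rate of change di/dt.

```python
import numpy as np

def tri_domain_views(cycle):
    """Derive time-domain, frequency-domain, and rate-of-change views of a cycle."""
    time_view = np.asarray(cycle, dtype=float)   # raw morphology
    freq_view = np.abs(np.fft.rfft(time_view))   # spectral composition
    rate_view = np.diff(time_view)               # instantaneous rate of change
    return time_view, freq_view, rate_view

# One 50 Hz cycle sampled at 1000 points, matching the setup described later.
t = np.linspace(0.0, 0.02, 1000, endpoint=False)
cycle = np.sin(2 * np.pi * 50 * t)
tv, fv, rv = tri_domain_views(cycle)
print(tv.shape, fv.shape, rv.shape)
```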
3.3. Attention Mechanism
The attention mechanism in deep learning focuses on key information within the global context while ignoring secondary, irrelevant information, thereby improving the expressive ability and performance of the model [26], as shown in Figure 4.
The core of the attention mechanism is to compute weights for the Values according to the similarity between the Query and the Keys, and to generate the output by weighted summation. The specific calculation is given in Formula (1), where Q denotes the Query, K the Key, V the Value, and d_k the dimension of the key.
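As a concrete illustration, the scaled dot-product attention of Formula (1) can be written in a few lines of NumPy (shapes are toy examples):

```python
import numpy as np

def attention(Q, K, V):
    """softmax(Q K^T / sqrt(d_k)) V, as in Formula (1)."""
    d_k = K.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                 # Query-Key similarity
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # row-wise softmax
    return weights @ V                              # weighted sum of Values

Q = np.array([[1.0, 0.0]])
K = np.array([[1.0, 0.0], [0.0, 1.0]])
V = np.array([[10.0], [20.0]])
print(attention(Q, K, V))  # output leans toward the first Value
```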
By integrating multiple attention forms, the hybrid attention mechanism effectively strengthens the ability of the deep learning model to capture and select key features. Let the input feature map be X ∈ R^(H×W×C), where H, W, and C denote the height, width, and number of channels, respectively. The global information extraction of channel attention is shown in Equation (2).
Z_c denotes the global channel statistics vector, and i, j index the spatial positions. The channel weight calculation is expressed in Equation (3).
F_1 and F_2 are the weights of the fully connected layers, σ is the sigmoid function, δ is the Rectified Linear Unit (ReLU) activation function, and A_c is the channel attention weight. In spatial attention, spatial feature extraction is represented by Formulas (4) and (5).
M_avg and M_max represent the average feature map and the maximum feature map, respectively, where c is the channel index. The calculation of the spatial weight is shown in Equation (6).
In Equation (6), M_cat is the concatenation of the two feature maps, f is a k × k convolution operation (usually k = 7), and A_s is the spatial attention weight. The weight combination of the mixed attention is expressed in Equation (7).
In Equation (7), α and β are the learnable fusion weights, with α + β = 1.
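The channel and spatial attention pipeline of Equations (2) through (7) can be sketched as follows. This is a hedged NumPy illustration: F_1, F_2 are random stand-ins for learned weights, a simple mean filter stands in for the k × k convolution, and α, β are fixed rather than learned.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

rng = np.random.default_rng(0)
H, W, C, r = 4, 4, 8, 2
X = rng.standard_normal((H, W, C))

# Channel attention, Eqs (2)-(3): z = GAP(X); A_c = sigmoid(F2 · ReLU(F1 · z)).
z = X.mean(axis=(0, 1))                      # Eq (2): global channel statistics
F1 = rng.standard_normal((C // r, C))        # bottleneck weights (random stand-in)
F2 = rng.standard_normal((C, C // r))
A_c = sigmoid(F2 @ np.maximum(F1 @ z, 0.0))  # Eq (3), shape (C,)

# Spatial attention, Eqs (4)-(6): pool over channels, filter, sigmoid.
M_avg = X.mean(axis=2)                       # Eq (4)
M_max = X.max(axis=2)                        # Eq (5)
M_cat = np.stack([M_avg, M_max], axis=0)     # concatenated feature maps
A_s = sigmoid(M_cat.mean(axis=0))            # Eq (6) with a mean-filter stand-in

# Eq (7): fusion of the two attention weights, alpha + beta = 1 (fixed here).
alpha, beta = 0.6, 0.4
out = X * (alpha * A_c[None, None, :] + beta * A_s[:, :, None])
print(out.shape)
```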
The ordinary single-head attention mechanism is good at capturing global long-term dependencies within a sequence, but its computational complexity is high, it easily loses key positional information owing to the lack of a built-in position-awareness mechanism, and its weights may become unstable in the presence of noise. The Squeeze-and-Excitation Network (SENet), a lightweight and efficient channel attention mechanism, effectively improves the model's sensitivity to important channel features; however, it entirely ignores the spatial dimension, and its channel weights, generated by global pooling, easily become unstable under noise interference [27]. The Convolutional Block Attention Module (CBAM) is a hybrid attention mechanism that achieves more comprehensive feature enhancement by combining the channel and spatial dimensions [28]. However, its spatial attention module is redundant when processing one-dimensional signals, and its channel attention remains insufficient for modeling complex cross-channel nonlinear interactions [29].
3.4. TDDA Module
The one-dimensional nature of arc fault signals necessitates a model capable of precisely identifying and extracting the decisive regions for fault diagnosis. To construct a highly discriminative feature representation for low-voltage AC arc faults, this study begins with an analysis of the underlying physical characteristics: the fault signature manifests as millisecond-level high-frequency oscillations within the current waveform. Its energy distribution varies nonlinearly with load conditions and is frequently obscured by background noise. Conventional feature extraction methods face a fundamental trade-off between capturing transient details and maintaining adaptability across varying operational conditions. While fixed-threshold techniques are susceptible to missing subtle arcs, purely data-driven models often overfit in limited-data scenarios. To overcome these limitations, we propose a triadic attention fusion paradigm. This approach integrates three complementary components: a static unit matrix A0 encoding foundational physical constraints, a dynamic convolution weight A1 that adapts to load-induced feature drift, and a trainable offset matrix A2 dedicated to modeling nonlinear inter-channel interactions. Instantiating this paradigm, we present the Tri-Domain Dynamic Attention (TDDA) module, a lightweight, embeddable component designed for CNN architectures.
Based on the theoretical principles of attention mechanisms detailed in
Section 3.3, this work recombines the core design elements of global statistical aggregation, channel-wise reweighting, and spatial-sequence focusing as encapsulated in Equation (1) through Equation (7). The core design of the TDDA module is to decompose the hybrid attention mechanism into three dedicated yet cooperative matrices. The static identity matrix A0 establishes a stable prior for channel attention, providing a constant and robust initial reference state aligned with the concept formalized in Equation (3). The dynamic convolution weight A1, structured through a learnable bottleneck operation corresponding to Equation (8), emulates the input-adaptive weight generation of self-attention from Equation (1). It also incorporates the focused regional emphasis characteristic of the spatial attention mechanisms described in Equation (4) through Equation (6). The trainable bias matrix A2 introduces higher-order nonlinear interactions, a component often abstracted in standard attention formulations. This matrix functions as a parameter set optimized via gradient-based learning, serving to refine the combined weighting of A0 and A1. This fusion and refinement process is analogous to the weighting principle illustrated in Equation (7) and enhances the modeling of complex cross-channel dependencies. Through the synergistic operation of A0, A1, and A2, the TDDA module preserves the representational strength of the classical attention framework while specifically augmenting its stability, selectivity, and nonlinear modeling capacity for processing one-dimensional fault signals. The structure of the module is shown in
Figure 5. The fusion operation at the bottom of
Figure 5 implements the final step of the ternary attention mechanism. It computes a weighted sum of the static prior A0 and the dynamic weight A1, then adds the trainable offset A2 to generate the final attention weights for feature modulation. This design directly extends the core fusion principle outlined in Equation (7).
In this model, B represents the batch size, L the sequence length, and C the number of channels; the dimension of the input data is B × L × C. First, Global Average Pooling (GAP) compresses the sequence dimension and generates a channel-level statistical description vector; by computing the global average of each channel, the overall characteristics of that channel are captured. The pooled data is then transposed and given an additional dimension to meet the input requirements of the 1 × 1 convolution. The 1 × 1 convolution reduces the channel dimension to C/r (r is the preset compression ratio), and the dynamic weight expansion module then expands the result to obtain the symmetric matrix A1. The generation of A1 can be expressed by Formula (8), where U is given by Formula (9).
In Formulas (8) and (9), W1 is the learnable weight matrix, r is the compression ratio, and d = C/r is the scaling factor.
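The A1 generation path (GAP, bottleneck projection, symmetric expansion) can be sketched as follows. This is a hedged illustration of Formulas (8) and (9): the outer-product expansion and the random W1 are our illustrative assumptions, not the paper's exact operators.

```python
import numpy as np

rng = np.random.default_rng(1)
B, L, C, r = 2, 1000, 16, 8
d = C // r                               # scaling factor d = C/r
X = rng.standard_normal((B, L, C))       # batch of one-cycle signals

u = X.mean(axis=1)                       # GAP over the sequence: (B, C)
W1 = rng.standard_normal((d, C))         # learnable weight (random stand-in)
U = u @ W1.T                             # bottleneck projection: (B, d)
A1 = U[:, :, None] * U[:, None, :]       # symmetric expansion: (B, d, d)

print(A1.shape)
```

The outer product guarantees that each B × d × d slice of A1 is symmetric, matching the "symmetric matrix A1" described above.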
A1 participates in subsequent processing as a dynamic weight, A0 serves as the static identity matrix, and A2 acts as the offset matrix, as shown in Figure 6.
The matrix A2 is initialized as an all-zero matrix, with dimensions B × C/r × C/r consistent with A1. Its parameters are then optimized via backpropagation based on the model’s loss function. Through this process, A2 learns to represent nonlinear interactions across channels, which enhances the model’s adaptability to noise interference and nonlinear patterns and provides a corrective adjustment to the static and dynamic weights. This generation process for A2 is summarized by Equation (10).
Formula (10) gives the result of the t-th iteration of the gradient descent update, where η is the learning rate and L is the loss function. The accuracy and loss curves of the A2 training process are shown in Figure 7.
The synergistic design of the three matrices is tailored to precisely capture the distinctive physical characteristics of arc faults. The static identity matrix A0 provides a stable, physics-aligned prior, ensuring consistent attention to fundamental channel features that characterize the arc’s core energy distribution. This enhances model robustness against strong noise interference. The dynamic convolution weight A1 adaptively learns from the input signal to locate and amplify the key millisecond-level high-frequency oscillatory transients in the current waveform, which vary with load conditions. This enables dynamic focus on the most discriminative temporal regions for fault identification. The trainable offset matrix A2 is dedicated to modeling complex nonlinear interactions across channels—an essential aspect of arc behavior. By applying a learned nonlinear correction to the initial attention weights formed by A0 and A1, A2 refines the feature representation, allowing the model to better distinguish genuine arc signatures from background noise or load harmonics.
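The fusion of the three matrices described above can be sketched as follows. This is a minimal illustration of the step shown in Figure 5 and Equation (7): a weighted sum of the static prior A0 and the dynamic weight A1, plus the trainable offset A2 (shown here at its all-zero initialization, before training).

```python
import numpy as np

rng = np.random.default_rng(2)
B, d = 2, 4
A0 = np.broadcast_to(np.eye(d), (B, d, d))   # static identity prior
A1 = rng.standard_normal((B, d, d))          # dynamic, input-adaptive weight
A2 = np.zeros((B, d, d))                     # trainable offset, zero-initialized

alpha, beta = 0.5, 0.5                       # learnable in the model; fixed here
A = alpha * A0 + beta * A1 + A2              # final attention weights
print(A.shape)
```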
To illustrate the contribution of the different components of TDDA, we designed an ablation experiment using 7500 samples, comprising 3744 arc samples and 3756 non-arc samples. Based on the CNN model in Section 4.1, we successively add the A1 matrix, then A1 and A0 together, and finally the complete TDDA module, as shown in Table 2.
The TDDA module introduces three matrices: A1, A2, and A0. Among these, A1 is the core component for extracting key features from arc samples, while A2 is a trainable offset matrix that enables nonlinear channel interactions. The introduction of A1 and A2 accounts for the majority of the module’s parameter increase compared to the baseline. A0 provides a channel-attention prior, further refining the feature selection. Collectively, these components allow the TDDA-enhanced model to focus on more discriminative features, leading to a significant improvement in accuracy over the conventional CNN baseline. Ultimately, this design achieved an 8.51% increase in accuracy with only a 3.1% growth in model parameters.
By comparing the accuracy, parameter increment, and FLOPs increment of the proposed model with the TDDA module against other hybrid attention mechanisms, the effect of the TDDA module is further illustrated, as shown in Table 3. The reduction ratio of SE and CBAM is 16, the convolution kernel size of Efficient Channel Attention (ECA) is 3, and the reduction ratio of TDDA is 8. Compared with other hybrid attention mechanisms, TDDA adds a static identity matrix A0 to provide a channel-independence prior, which solves SENet's problem of weight instability under noise. For the one-dimensional signal scenario, the redundant spatial attention branch of CBAM is removed and its two-dimensional convolution is replaced with a lightweight one-dimensional convolution. By introducing the trainable offset matrix A2 into the channel attention, CBAM's shortcomings in cross-channel nonlinear modeling are addressed. Compared with ordinary attention mechanisms, TDDA uses a dilated convolution hierarchy instead of positional encoding to avoid the loss of absolute position information, and performs temporal compression through global average pooling, significantly reducing the computational complexity.
4. Arc Fault Detection Method Based on Fusion Prototype Learning Model
4.1. TDDA-CNN Prototype Learning Model
This paper proposes a backbone network architecture for multi-dimensional feature extraction, which is optimized for three types of derivative features of 50 Hz AC current signals: the time-domain waveform, the frequency-domain distribution, and the time-domain rate of change (di/dt). Based on data sampled at 100 kHz (1000 points per power frequency cycle), the network uses three parallel Tri-Domain Dynamic Attention-Convolutional Neural Network (TDDA-CNN) branches that take one-dimensional data as input, extract features through three convolution modules, and finally generate high-resolution prototype vectors through Dense-layer compression, yielding the basic characteristics of the prototype, as shown in Figure 8.
The features extracted from the backbone network are embedded into a three-dimensional feature space that jointly represents information from the time domain, the frequency domain, and their respective rates of change, thereby constructing the visual prototype. In the first convolution block, 1-Dimensional Convolution (Conv1D) is used to extract local features, ReLU activation introduces nonlinearity, and 1-Dimensional Max Pooling (MaxPool1D) downsampling retains the main features. In the second convolution block, Conv1D is used to expand the receptive field, ReLU introduces nonlinearity, TDDA module is used for channel attention calibration, noise suppression, and MaxPool1D is used for downsampling. In the third convolution block, Conv1D is used to further expand the convolution receptive field, ReLU introduces nonlinearity, TDDA is used for secondary calibration of deep features, and 1-Dimensional Global Average Pooling (GlobalAvgPool1D) is used to globally average along the sequence dimension to output global feature vectors. After the feature vectors of the three dimensions are generated, the feature fusion, dimension reduction and visualization operations are further performed to generate the prototype in the three-dimensional scene. The introduction of Gated Linear Unit (GLU) in the change rate branch can better capture the characteristics of the current change signal. The activation function ReLU enhances the nonlinear expression ability of arc oscillation characteristics in the time domain branch, strengthens the nonlinear relationship between frequency bands in the frequency domain branch, and realizes the gated control and characteristic nonlinear enhancement in the rate of change branch.
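The convolution blocks described above can be sketched in a deliberately simplified single-channel NumPy form. The kernel values and sizes are arbitrary illustrations, not the trained network's parameters, and the TDDA and GLU stages are omitted for brevity.

```python
import numpy as np

def conv1d(x, kernel):
    """Valid-mode 1-D convolution (cross-correlation) of x with kernel."""
    k = len(kernel)
    return np.array([np.dot(x[i:i + k], kernel) for i in range(len(x) - k + 1)])

def relu(x):
    return np.maximum(x, 0.0)

def maxpool1d(x, size=2):
    """Non-overlapping max pooling; trailing remainder is dropped."""
    n = len(x) // size
    return x[:n * size].reshape(n, size).max(axis=1)

def global_avg_pool(x):
    return float(x.mean())

x = np.sin(np.linspace(0, 4 * np.pi, 1000))            # toy 1000-point input
h = maxpool1d(relu(conv1d(x, np.array([1.0, -1.0]))))  # block 1: local features
h = maxpool1d(relu(conv1d(h, np.array([0.5, 0.5]))))   # block 2: wider receptive field
feat = global_avg_pool(relu(conv1d(h, np.array([1.0, 0.0, -1.0]))))  # block 3
print(type(feat))
```

In the real branches, each block carries many channels and the second and third blocks insert the TDDA module before pooling; this sketch only traces the shape flow from a 1000-point cycle down to a global feature value.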
The structure of each layer of the backbone network is shown in Table 4, where L is the sequence length and C is the number of channels; a 1000 × 1 input sample is used as an example.
Current data under both normal and arc fault conditions were collected for 12 types of household loads using the experimental platform described in
Section 2.1. These data were then used to train the backbone network. The sampling rate is 100 kHz, one cycle lasts 20 ms, and one cycle of data is used as one sample. The signals collected by the experimental platform were normalized, averaged, and processed by the Fourier transform to obtain the dataset, comprising both arc and arc-free data. The dataset was partitioned into training, validation, and test sets using a stratified random sampling approach. This method ensures that the class distribution across all subsets reflects the original proportion of arc-containing and arc-free samples. Specifically, samples were first grouped by label into two separate subsets. Each subset was then independently shuffled and randomly divided into proportions of 75%, 15%, and 10% to form the training, validation, and test splits, respectively. Finally, the corresponding splits from both categories were combined to create the final datasets, as shown in Table 5.
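The stratified split procedure can be sketched as follows (a hedged illustration; the function name and integer rounding of the 75/15/10 proportions are ours):

```python
import random

def stratified_split(samples, labels, seed=42):
    """Group by label, shuffle each group, slice 75/15/10, then merge."""
    random.seed(seed)
    train, val, test = [], [], []
    for c in set(labels):
        group = [s for s, l in zip(samples, labels) if l == c]
        random.shuffle(group)                # independent shuffle per class
        n = len(group)
        n_tr, n_va = int(0.75 * n), int(0.15 * n)
        train += group[:n_tr]
        val += group[n_tr:n_tr + n_va]
        test += group[n_tr + n_va:]
    return train, val, test

# Toy run: 100 samples, two balanced classes.
tr, va, te = stratified_split(list(range(100)), [i % 2 for i in range(100)])
print(len(tr), len(va), len(te))
```

Because the split is performed per class before merging, each subset preserves the original ratio of arc to arc-free samples.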
Training was performed in batches for a total of 10 epochs with a batch size of 32. Adam was selected as the optimizer, the learning rate was set to 0.0001, the random seed was fixed at 42, and cross entropy was used as the loss function. The accuracy and loss curves during training are shown in Figure 9.
The arc fault recognition model achieved an accuracy of 99.65%, with a precision of 99.75%, a recall of 99.41%, a False Positive Rate (FPR) of 0.174%, and a False Negative Rate (FNR) of 0.594%. The specific results are shown in
Table 6. These results were confirmed through repeated training runs to verify the model's generalization.
The confusion matrix presented in
Figure 10 categorizes the 12 tested loads into four types: resistive loads, power electronic loads, motor loads, and gas discharge loads. Specifically, the resistive load group comprises a water heater, a bathroom heater, an electric iron, and a water dispenser. The power electronic load group includes LED lights, a switching power supply, an induction cooker, and a microwave oven. Motor loads consist of a vacuum cleaner, a refrigerator, and a washing machine, while the gas discharge load is represented by a fluorescent lamp.
To evaluate the model’s robustness against typical household electromagnetic interference—such as equipment transients, background noise, and AC powerline disturbances—we conducted tests by injecting three types of noise: Gaussian white noise, impulse noise, and periodic noise [
30,
31]. The Signal-to-Noise Ratio (SNR) was varied from 30 dB to −5 dB to simulate conditions ranging from typical to harsh home environments. The detailed performance metrics under these noise conditions are summarized in
Table 7.
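The noise injection used in these tests can be sketched as follows for the Gaussian white noise case. This is an illustrative implementation of scaling noise to a prescribed SNR in dB; the impulse and periodic noise generators are analogous but omitted here.

```python
import numpy as np

def add_noise_snr(signal, snr_db, seed=0):
    """Add Gaussian white noise scaled so the SNR equals snr_db (in dB)."""
    rng = np.random.default_rng(seed)
    p_signal = np.mean(signal ** 2)
    p_noise = p_signal / (10.0 ** (snr_db / 10.0))   # target noise power
    noise = rng.standard_normal(len(signal)) * np.sqrt(p_noise)
    return signal + noise

t = np.linspace(0.0, 0.02, 1000, endpoint=False)
clean = np.sin(2 * np.pi * 50 * t)
noisy = add_noise_snr(clean, snr_db=0.0)  # 0 dB: equal signal and noise power
measured = 10 * np.log10(np.mean(clean ** 2) / np.mean((noisy - clean) ** 2))
print(round(measured, 1))  # close to the requested 0 dB
```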
The robustness evaluation across varying SNRs and noise types reveals a consistent performance trend. The model’s accuracy monotonically declines as the SNR decreases, with a more pronounced degradation observed below 5 dB. Notably, even under extreme noise conditions at 0 dB and −5 dB, the model maintains robust recognition accuracy. Among the three noise types, impulse noise causes the most significant performance drop due to its transient similarity to arc signatures, whereas periodic noise is relatively easier to suppress owing to its regular pattern.
The horizontal and vertical axes of the confusion matrix correspond to the predicted and actual categories, respectively, and the values on the main diagonal are the numbers of correctly identified samples. Because the gas discharge category contains only a single load type, the prototype-learning-based model has fewer samples available for feature extraction than for the other load types, so the arc fault detection accuracy for gas discharge loads is lower than that for the other load types.
To illustrate the contribution of each branch of the model, we designed ablation experiments comparing the accuracy and parameter counts of the time-domain, frequency-domain, and rate-of-change branches in turn, as shown in Table 8, where TIM, FRE, and DEL denote the time-domain, frequency-domain, and rate-of-change branches, respectively. It can be seen from Table 8 that, under the action of TDDA, even a single branch achieves high detection accuracy; among the branches, the frequency-domain branch shows the most significant improvement in accuracy.
4.2. Visualization of 3D Prototype Feature Set
Through the processing of arc data by the TDDA-CNN prototype learning model, the arc prototype feature set under the corresponding load can be obtained. The end of each branch is a 32-dimensional fully connected layer, which transforms the learned features into a 32-dimensional vector. For each sample i, we define a three-dimensional mapping function as shown in Equation (11).
The calculation of each point P is shown in Formula (12), where the three f terms in M denote the k-th elements of the time-domain, frequency-domain, and rate-of-change branch feature vectors, respectively.
For each class c, the generation of the prototype point set is shown in Equation (13), where Nc is the number of samples in class c.
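The mapping of Equations (11) through (13) can be sketched as follows. This is a hedged illustration: the choice of index k and the random branch features are stand-ins for the real 32-dimensional branch outputs.

```python
import numpy as np

def to_point(f_time, f_freq, f_rate, k=0):
    """Map a sample's three branch feature vectors to one 3-D point P_i."""
    return np.array([f_time[k], f_freq[k], f_rate[k]])

def prototype_set(samples, labels, cls, k=0):
    """Collect the 3-D points of all samples belonging to class cls."""
    return np.stack([to_point(*s, k=k)
                     for s, l in zip(samples, labels) if l == cls])

# Toy data: ten samples, each with three 32-dimensional branch vectors.
rng = np.random.default_rng(3)
samples = [tuple(rng.standard_normal(32) for _ in range(3)) for _ in range(10)]
labels = [0] * 5 + [1] * 5
P0 = prototype_set(samples, labels, 0)
print(P0.shape)  # five 3-D points for class 0
```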
To visualize the prototype features, we construct a three-dimensional coordinate system. Its axes correspond to the time-domain features, frequency-domain features, and the feature change rate, respectively. For each arc fault sample, the output vector from the fully connected layer of the TDDA-CNN model is mapped to a point in this space, forming a single prototype. By aggregating a large number of such prototypes generated under the same working condition, a cluster emerges. The processed visualization of this cluster constitutes the arc prototype feature set for that specific condition, as shown in
Figure 11.
In Figure 11, the arc fault prototype sets of different loads occupy different ranges and reach different maxima along each axis, yet each clusters within a bounded region, which lays the foundation for delimiting the arc prototype feature set under different working conditions.
4.3. Arc Fault Prototype Feature Set Correction
To enhance the detection accuracy, this paper proposes a corrective method that refines the arc prototype set by leveraging the non-arc prototype set. The procedure consists of three key steps. First, the non-arc prototype set is constructed from normal load data to serve as a reference for the negative class (normal operation) in the feature space. Second, an improved convex hull algorithm is employed to delineate the geometric boundaries for both the arc and non-arc prototype sets, respectively. Finally, the overlapping regions between these two boundaries are identified and removed from the arc prototype set. This yields a refined decision region for arc faults, which is more discriminative by explicitly excluding ambiguous zones prone to confusion with normal states.
4.3.1. Convex Hull Approximation Algorithm
The proposed arc fault detection method utilizes a prototype feature set derived from the TDDA-CNN model. For an input sample under unknown operating conditions, the model outputs a prototype point, which is then visualized as a set of coordinates in a three-dimensional feature space. A fault is detected if this point falls within the spatial region occupied by the arc fault prototype set.
However, a simple convex hull is often insufficient for precisely delineating the complex boundary of this region, potentially leading to misclassification. To address this, we design a convex hull approximation algorithm to more accurately model the boundary, particularly for the arc-free point set. The specific steps of this algorithm are described below.
First, n boundary points are randomly selected as the initial vertex set V. These points should lie at or near the boundary of the real prototype set so that the algorithm iterates from a reasonable starting point. In the tth iteration, each point in the vertex set V(t) is pre-expanded and then corrected. Finally, convergence is judged after the search space is updated.
Convex Hull Pre-Expansion and Boundary Correction
For each vertex vi(t), removing it from the current vertex set V(t) yields a reduced vertex set. A point v is then sought in the search space Fi(t) such that the convex hull formed by adding v to the reduced set has the largest volume. This newly found point, denoted vi(pre), is the pre-expansion point.
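The pre-expansion step can be sketched as a search over candidate points for the replacement that maximizes hull volume. The exhaustive loop over a finite candidate set is an illustrative simplification of the search over Fi(t).

```python
import numpy as np
from scipy.spatial import ConvexHull

def pre_expand_vertex(vertices, i, search_points):
    # Remove vertex i, then pick the candidate from the search space
    # whose inclusion maximizes the convex-hull volume. The winner is
    # the pre-expansion point v_i^(pre).
    reduced = np.delete(vertices, i, axis=0)
    best_v, best_vol = None, -np.inf
    for v in search_points:
        vol = ConvexHull(np.vstack([reduced, v])).volume
        if vol > best_vol:
            best_v, best_vol = v, vol
    return best_v
```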
Since the pre-expansion point vi(pre) may not lie on the boundary of the real prototype set, it must be corrected onto the boundary. For a point v, the loss function L(v) is defined as the classification confidence, and its gradient is given in Formula (14).
The iterative update formula is shown in Equation (15), where αt is the adaptive step size and Projn denotes projection along the normal vector.
The adaptive step size is shown in Equation (16), where η is the basic learning rate and Lmax is the boundary threshold.
Following these steps, iterative correction with vi(pre) as input yields the point vi(corr), which lies close to the real boundary.
Space Update and Convergence Judgment
After obtaining the correction point vi(corr), the search space associated with the vertex must be updated for subsequent iterations. First, the normal vector ni(t) at the correction point vi(corr) is calculated; it is perpendicular to the real boundary at this point and points outward from the convex hull. The search space Fi(t) is then updated as the intersection of the original search space and a half-space. This half-space is defined by the normal vector ni(t) and the point vi(corr) and retains only the points on or inside the boundary, thereby narrowing the search range.
After completing a round of iterations over all vertices, a new vertex set V(t+1) is obtained. The relative volume change between the new and old convex hulls is then computed. If this relative change is less than a preset threshold, the convex hull is considered to have converged and the iteration stops; otherwise, the next iteration proceeds.
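The half-space intersection and the volume-based convergence test might be sketched as follows, assuming the search space is represented by a finite set of candidate points.

```python
import numpy as np
from scipy.spatial import ConvexHull

def update_search_space(candidates, v_corr, n_vec):
    # Keep only candidates on the inner side of (or on) the boundary,
    # i.e., the half-space defined by the outward normal n_vec at v_corr.
    keep = (candidates - v_corr) @ n_vec <= 0
    return candidates[keep]

def has_converged(old_vertices, new_vertices, tol=1e-3):
    # Stop when the relative volume change between hulls falls below tol.
    v_old = ConvexHull(old_vertices).volume
    v_new = ConvexHull(new_vertices).volume
    return abs(v_new - v_old) / v_old < tol
```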
4.3.2. Convex Hull Correction Effect
By constructing convex hulls and selecting subsets of points in three-dimensional space as the boundaries of the resulting solids, the prototype feature set can be fitted intuitively, as shown in
Figure 12.
To address the potential spatial overlap between non-arc and arc prototype sets, which can degrade detection accuracy, we implement a correction step. As
Figure 13 illustrates, we first train the TDDA-CNN model exclusively on non-arc samples. Then, following the method in
Section 4.3.1, we identify the overlapping regions between the non-arc and arc prototype sets and excise them from the arc prototype set. This refinement of the arc prototype set sharpens the detector's overall discrimination capability.
As shown in
Figure 13, the yellow and blue regions represent the arc and non-arc prototype set regions, respectively, and the green region represents their overlap. After the non-arc data are processed by the TDDA-CNN prototype learning model, the generated non-arc prototype set either overlaps the arc prototype set or does not. In the non-overlapping case, the arc prototype set is unaffected. In the overlapping case, the overlap is removed from the arc prototype set, thereby correcting it.
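The overlap-removal step can be sketched by testing each arc prototype point against the convex hull of the non-arc set. Using the raw Delaunay hull here is a simplification of the corrected hull from Section 4.3.1.

```python
import numpy as np
from scipy.spatial import Delaunay

def remove_overlap(arc_points, non_arc_points):
    # Excise from the arc prototype set every point that falls inside
    # the convex hull of the non-arc prototype set.
    hull = Delaunay(non_arc_points)
    outside = hull.find_simplex(arc_points) < 0
    return arc_points[outside]
```

The surviving points define the corrected arc decision region.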
After the prototype sets of the different loads are corrected, the data to be detected are processed by the TDDA-CNN prototype learning model, yielding the arc fault detection accuracy before and after convex hull correction under each load. To further illustrate the effect of the correction, these accuracies are compared with those of the minimum bounding rectangle, ellipse fitting, and Gaussian mixture model methods, as shown in
Table 9. As shown in
Table 9, the improved convex hull algorithm achieves superior performance after correction under all loads compared with the other methods.
In this paper, the TDDA module, the CNN network, and prototype learning form a unified architecture that is closely coordinated and deepens layer by layer. As an attention mechanism embedded in each branch, the TDDA module dynamically strengthens the channel information most relevant to the arc state in the features extracted by the CNN, improving feature discrimination. On this basis, the CNN further fuses and abstracts the local patterns of each branch through multi-layer convolution and pooling, finally mapping the high-dimensional features of each branch into a 32-dimensional feature vector. These three 32-dimensional vectors are transformed into X, Y, and Z coordinates that together form a point in three-dimensional space, turning multivariate time-series features into a geometric representation that can be intuitively expressed and measured.
Based on this geometric representation, prototype learning constructs a self-evolving decision framework: points of similar samples are clustered in three-dimensional space to form the 'arc' and 'no arc' prototype point sets, which are continuously updated. New samples are classified by comparing their spatial distances to the prototype points, and their features in turn participate in the iterative correction of the prototype set. TDDA and the CNN thus jointly perform feature extraction and structuring, transforming the raw signal into a highly discriminative spatial point, while prototype learning provides interpretable continual learning and inference on top of it. The three stages progress from feature optimization through space construction to dynamic discrimination and are unified in an end-to-end arc fault detection system.
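The distance-based classification of new samples can be sketched as a nearest-prototype rule. The Euclidean metric and the dictionary layout of the prototype sets are assumptions made for this sketch.

```python
import numpy as np

def classify(point, prototypes):
    # prototypes: dict mapping a class label ("arc" / "no_arc") to an
    # (N, 3) array of its prototype points in the 3-D feature space.
    # Assign the label whose nearest prototype point is closest.
    best_label, best_d = None, np.inf
    for label, pts in prototypes.items():
        d = np.min(np.linalg.norm(pts - point, axis=1))
        if d < best_d:
            best_label, best_d = label, d
    return best_label
```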
5. Experimental Verification
To evaluate the performance of the proposed model, an intelligent circuit breaker running the proposed arc fault judgment algorithm is connected to the multi-load topology, as shown in
Figure 14. Load types include common household loads such as fluorescent lamps, LED lamps, switching power supplies, water dispensers, vacuum cleaners, refrigerators, microwave ovens, induction cookers, electric irons, and water heaters. These loads can be energized individually or in combination. On an energized load branch, a carbonized cable is used to simulate the kind of arc fault that can develop into an electrical fire in a real scenario, testing whether a circuit breaker deploying this method can detect the arc and cut off the line in time to prevent an electrical fire.
The model is deployed on an STM32H750 microcontroller (manufactured by STMicroelectronics, Geneva, Switzerland; sourced from Huai'an Shenbiao Intelligent Technology Co., Ltd., Huai'an, China). The single-inference latency of the optimized model on this platform is 7.9 ms, which meets the real-time requirements of the protection action. Under continuous operation, the CPU load is about 65-70%, and the model weights and peak Random Access Memory (RAM) occupancy are 412 KB and 89 KB, respectively. This resource usage is well below the chip's limits, reflecting the lightweight deployment. Under a 3.3 V supply, the average operating current of the system is 42 mA, showing good power consumption characteristics.
The breaking results of the circuit breaker under the scenario of single load arc are shown in
Figure 15. It can be seen from
Figure 15 that a circuit breaker deploying this method can operate shortly after the arc occurs, clearing the arc fault and protecting the equipment.
There may also be multiple loads working at the same time in the household scenario. In the case of arcing of multiple loads, the breaking results of the circuit breaker are shown in
Figure 16. It can be seen from
Figure 16 that a circuit breaker deploying this method likewise operates shortly after the arc occurs, clearing the arc fault and protecting the equipment.
To verify the reliability of the test results, multiple tests were carried out on single-load and multiple-load arcing scenarios in accordance with the requirements of the IEC 62606 standard [
25]. Each load type was tested three times, as shown in
Table 10. The effect under different loads is further illustrated by
Figure 17. In the tested household electricity scenarios, the circuit breaker using this method identified low-voltage AC arc faults reliably, and no missed detection or false tripping was observed.
Figure 17 presents the statistical response times across all tested load scenarios. This bar chart aggregates the 24 data points (8 scenarios × 3 trials) from
Table 10. It can be observed that the variation in detection time (indicated by the error bars) for each load is minimal. Furthermore, all measured action times fall within the required limits stipulated by the IEC 62606 standard for the corresponding test conditions. These results collectively demonstrate the high stability and reliability of the algorithm under various and complex real working conditions.
To systematically evaluate the limit performance and generalization capability of the proposed algorithm under unknown and complex working conditions, this section designs an advanced stress test that goes beyond the standard test specifications. The test aims to proactively explore the performance boundaries of the algorithm. With reference to the line combination configuration of the test platform described in
Section 2, multiple sets of random load and multi-line combination scenarios that were absent from the training phase are constructed, with their complexity sequentially increasing. This design simulates unpredictable extreme power usage combinations that may occur in actual household grids, thereby verifying the model’s robustness when confronted with out-of-distribution samples. Each condition was tested 100 times. The key performance indicators obtained from this stress test, including the action time, number of false trips, and number of missed detections, are presented in
Table 11 below.
The test results delineate the trend of algorithm performance with increasing system complexity. Firstly, the algorithm maintains a 100% correct action rate across most untrained random combination scenarios, demonstrating its strong generalization capability. However, under the most complex condition of a random four-line combination, a single missed detection occurred, concomitant with the widest action time range observed. These concurrent findings indicate that when the number and complexity of arc signals requiring concurrent processing reach a certain threshold, the system’s real-time computing resources and decision margin encounter their limits. This point, therefore, defines the performance boundary of the algorithm under the current deployment configuration. This discovery holds significant engineering value, as it clearly delineates the stable operational range of the algorithm and provides a quantitative basis for subsequent hardware selection and system capacity design.
Table 12 and
Figure 18 compare the performance of the proposed method with other low-voltage AC series arc fault detection algorithms. Our method shows superior performance in feature extraction, recognition method, load applicability, and accuracy. It is important to note that the “Number of loads” metric refers to the count of distinct load types tested, not the number operating concurrently. Specifically, within the IEC 62606 standard framework, our method achieves detection for the greatest variety of loads and the highest accuracy, which substantiates its effectiveness and advanced nature.
Figure 18 presents a radar chart comparing the performance of the proposed method and three existing approaches across two key dimensions: number of load types covered and detection accuracy. The radar chart is constructed with five concentric rings, where each ring represents an effectiveness level on a scale from 1 to 5, with 1 being the least effective and 5 being the most effective. As shown, the proposed method occupies the outermost region in the chart, signifying its superior overall performance. Specifically, it achieves the highest load coverage while maintaining a top-tier accuracy of 99.65%. This result demonstrates an effective balance between generalizability and precision.
The practical deployment of the proposed method should consider several potential limitations. First, under extreme electromagnetic interference—such as that generated by large motor drives or radio frequency devices—the acquired current signal quality may degrade, which could theoretically affect the stability of feature extraction and increase variance in response times. Second, while the hardware validation included a representative set of household loads, it does not encompass all possible appliance types or combinations. Therefore, the model’s ability to generalize to unseen loads, particularly those with novel topologies or operating principles, requires further verification. Furthermore, the tests validated performance for single arc faults during multi-load operation; however, more complex scenarios involving concurrent intermittent arcs at multiple locations were not evaluated. The system’s capability to reliably identify and discriminate such rare but theoretically possible fault conditions remains an area for future study. These limitations represent common engineering challenges and point toward specific directions for subsequent research and optimization.
6. Conclusions
This study tackles the critical challenge of balancing diagnostic accuracy with model interpretability in arc fault detection. We propose a novel low-voltage AC arc fault detection method based on a TDDA-CNN prototype learning framework. Its core innovation is the seamless integration of a hybrid attention module for enhanced feature extraction and a prototype learning mechanism for intrinsic interpretability. Experimental results show that our method achieves outstanding accuracy (exceeding 99% under single-load conditions) with robust generalization in multi-load scenarios. More importantly, it fundamentally addresses the ‘black box’ problem of conventional AI models by providing a three-dimensional visualizable prototype set and corrective guidance from an arc-free prototype, thereby offering transparent decision-making insights.
In summary, the principal contribution of this work is a unified framework that simultaneously delivers high accuracy and strong interpretability—a combination essential for building trustworthy AI systems in safety-critical applications like electrical fire prevention. The successful validation on an experimental prototype confirms its practical potential for reliable arc detection in complex household environments.
Future work will proceed along two main trajectories to transition this technology from laboratory validation to field deployment: first, to enhance deployment feasibility, we will focus on model compression and optimization for embedded systems. This includes implementing specific techniques such as structured pruning and post-training quantization to reduce model size and computational latency, enabling cost-effective hardware integration. Second, to ensure long-term robustness, we will expand the validation to assess the impact of critical environmental factors including temperature and humidity variations, as well as more diverse, unseen load types and concurrent fault scenarios.