Multi-Source Information Fusion for Degradation Assessment of Metal-Oxide Surge Arresters in Power Systems

Zhong, Dantian; Cai, Zhiyuan

doi:10.3390/en19102450

Open AccessArticle

Multi-Source Information Fusion for Degradation Assessment of Metal-Oxide Surge Arresters in Power Systems

by

Dantian Zhong

and

Zhiyuan Cai

^*

School of Electrical Engineering, Shenyang University of Technology, Shenyang 110178, China

^*

Author to whom correspondence should be addressed.

Energies 2026, 19(10), 2450; https://doi.org/10.3390/en19102450

Submission received: 11 February 2026 / Revised: 17 April 2026 / Accepted: 27 April 2026 / Published: 20 May 2026

Download

Browse Figures

Versions Notes

Abstract

As the scale of ultra-high-voltage (UHV) and extra-high-voltage (EHV) transmission networks continues to expand, the operational reliability of surge arresters has become increasingly important for power-system security. Based on equivalent degradation experiments conducted on a 1000 kV class UHV surge arrester, this study proposes a multi-source information fusion approach for degradation-state assessment. Leakage-current, UHF partial-discharge, voltage, and temperature-field data were jointly used to construct a hybrid framework integrating a multi-branch convolutional neural network (CNN) and a long short-term memory (LSTM) network. To improve model performance, the sparrow search algorithm (SSA) was introduced for hyperparameter optimization. Experimental results show that the proposed method achieved accuracies of 97.47% and 94.23% on the training and test sets, respectively, and was able to distinguish the normal condition from different degraded-section conditions under the laboratory-emulated equivalent degradation scenario considered in this study. These results indicate that multi-source information fusion combined with data-driven hyperparameter optimization is a feasible approach for laboratory-scale degradation assessment of surge arresters and provides a basis for further validation under more realistic service conditions.

Keywords:

surge arrester; degradation state assessment; multi-source data fusion; CNN; LSTM; SSA

1. Introduction

With the rapid development of ultra-high-voltage (UHV) and extra-high-voltage (EHV) transmission systems, increasingly stringent requirements have been imposed on the operational safety and equipment reliability of power grids. Metal-oxide surge arresters (MOSAs) are essential protective devices for limiting both lightning overvoltages and switching overvoltages, and their operating condition is therefore directly related to the safe and stable operation of transmission lines and substation equipment [1,2,3]. Under the combined effects of long-term power-frequency voltage stress, transient overvoltages, and environmental factors, the ZnO varistor blocks inside a surge arrester inevitably undergo aging and degradation. This degradation leads to an increase in leakage current and a reduction in energy absorption capability; in severe cases, it may even trigger catastrophic failures such as explosions [4,5,6]. Therefore, accurate condition assessment and early identification of surge-arrester degradation are of great engineering significance.

Conventional methods for surge-arrester condition assessment typically rely on a single indicator or a limited set of parameters, such as power-frequency leakage current, dc reference voltage, infrared thermography, and partial-discharge (PD) measurements [7,8,9]. Among these indicators, the resistive component of leakage current is widely regarded as a key parameter for characterizing the degradation level of ZnO varistor blocks; however, it is susceptible to external factors such as ambient humidity, surface contamination, and voltage fluctuations [10]. Infrared thermography can directly visualize surface thermal anomalies, but its sensitivity to localized internal degradation is limited [11]. Although PD detection can reflect insulation defects, it often suffers from noise contamination and difficulties in reliable pattern recognition [12]. As a result, a single diagnostic technique is generally insufficient to capture the complex degradation mechanisms of surge arresters, thereby limiting the reliability of assessment results.

In recent years, advances in sensing technologies and computational capability have stimulated growing interest in multi-source information fusion for surge-arrester condition assessment [13,14]. By jointly utilizing heterogeneous monitoring signals, such as leakage current, partial-discharge activity, voltage, and temperature, the operating condition of an arrester can be characterized from multiple perspectives, including electrical behavior, thermal behavior, and temporal dynamics, thereby improving the accuracy and stability of degradation identification [15]. In parallel, deep-learning approaches have demonstrated strong capabilities in automatic feature extraction and nonlinear modeling for power-equipment fault diagnosis. Convolutional neural networks (CNNs) are effective in capturing local signal features, whereas long short-term memory (LSTM) networks have clear advantages in modeling temporal dependencies. Their combination has been successfully applied to partial-discharge recognition and condition assessment of power equipment [16,17,18].

However, the performance of deep-learning models is highly dependent on the network architecture and the selection of hyperparameters, and conventional trial-and-error tuning based on experience often fails to achieve a globally optimal solution. To address this limitation, swarm-intelligence optimization algorithms have been introduced for model-parameter tuning. The sparrow search algorithm (SSA), owing to its fast convergence and strong global search capability, has shown promising potential in fault diagnosis and hyperparameter-optimization problems [19,20]. Therefore, integrating SSA with deep-learning models provides a promising approach for developing surge-arrester condition-assessment methods with high accuracy and strong robustness.

Building on the above background, this study conducts equivalent degradation experiments on UHV surge arresters to acquire multi-source operational data, and develops a condition-assessment model that combines a multi-branch CNN with an LSTM network. In addition, SSA is introduced to optimize the model hyperparameters, enabling accurate identification of equivalent degradation states in laboratory-emulated scenarios. The results provide theoretical foundations and technical support for online monitoring and condition-based maintenance of surge arresters.

2. Materials and Methods

The degradation of the 1000 kV class surge arrester was emulated by short-circuiting a portion of the ZnO varistor blocks. The experiment was conducted in an indoor high-voltage laboratory hall at an ambient temperature of 18.0 °C and a relative humidity of 51%, under an applied voltage of 638 kV. In this study, a gapless metal-oxide surge arrester (MOA), type Y20W1-828/1620W, together with the grading rings used for 1000 kV arresters, was employed as the test specimen. The complete arrester assembly consisted of five arrester sections of identical height connected in series, labeled from top to bottom as #1, #2, #3, #4, and #5. A double-layer grading-ring structure was installed at the top of the MOA. A photograph of the 1000 kV UHV AC surge arrester is shown in Figure 1.

DC and AC tests were then conducted on both the healthy (normal) section and the degraded section of the MOA; for the AC test, a continuous operating voltage of 128 kV was applied. The experimental data are summarized in Table 1 and Table 2.

By varying the location of the degraded section within the MOA (i.e., placing it at Section #1, #2, #3, #4, or #5) while keeping all other sections in a normal state, five defective specimens were prepared, denoted as #1–#5. Specimen #0 corresponds to the normal (healthy) MOA. The configurations are shown in Figure 2.

Under a continuous operating voltage of 638 kV, the total current and resistive current at the bottom terminal were measured for specimens #0–#5. The results are summarized in Table 3.

As shown in Table 1 and Table 2, after short-circuiting six ZnO varistor blocks in one section, the dc reference voltage at 1 mA (U1mA) decreases from 225 kV to 194 kV, corresponding to a reduction of approximately 13.8%, while the leakage current at 0.75U1mA increases from 15 μA to 100 μA. These changes indicate a pronounced electrical abnormality in the tested section. In this study, the condition created by short-circuiting six varistor blocks in one section is referred to as an equivalent fault condition. The reported 13.8% therefore refers specifically to the reduction in U1mA, rather than to a quantified degradation degree. Since this experimental treatment does not reproduce the full physicochemical aging process in service, it is used here as a laboratory-emulated equivalent fault condition for subsequent analysis and model validation. It should be noted that short-circuiting ZnO varistor blocks does not physically reproduce all field aging mechanisms, such as moisture ingress, thermal aging, or surface contamination. Instead, it is adopted here as an equivalent degradation emulation method that produces measurable changes in voltage distribution, leakage current, thermal behavior, and PD response. Therefore, the present experiment is intended to validate the sensitivity of the proposed multi-source fusion framework to representative degraded-state manifestations, rather than to reproduce the full physicochemical aging process.

Overall, under the continuous operating-voltage condition of the complete arrester, the changes in total current and resistive current caused by a single short-circuit-type degraded section are relatively small. As a result, the resulting abnormalities can be easily masked by on-site environmental disturbances and measurement conditions. This indicates that conventional single-parameter electrical indicators alone are insufficient for reliable identification of the equivalent degradation condition. Based on the equivalent degradation test specimens established in Section 2, in which #0 denotes the normal condition and #1–#5 represent different degraded-section locations, the following sections present a comparative analysis of multidimensional monitoring quantities. The objective is to extract more discriminative degradation features and to provide a basis for the subsequent fusion-based assessment model.

3. Degradation Characteristics Analysis of UHV Surge Arresters

This study sequentially analyzes the temperature characteristics, current characteristics, and PD signal characteristics of the surge arrester to establish the correspondence between degradation severity and multi-source monitoring quantities. The results provide a basis for feature design and validation of the subsequent multi-source information fusion assessment model.

3.1. Temperature Characteristic Analysis

Temperature is a key parameter for characterizing the energy dissipation and thermal stability of surge arresters. Degradation-induced changes in the voltage distribution across the varistor blocks and in leakage current directly affect the loss level and its spatial distribution, thereby altering the temperature rise, hotspot location, and degree of temperature uniformity. Accordingly, with specimen #0 taken as the reference, this section compares the temperature responses of specimens #1–#5 under identical energization conditions, with emphasis on temperature-rise magnitude, heating rate, and inter-section temperature-distribution characteristics.

3.1.1. Temperature Characteristics of a UHV Surge Arrester

To investigate the internal temperature evolution and spatial distribution of a UHV AC surge arrester under normal operating conditions, a steady-state temperature-rise experiment was carried out in an indoor high-voltage test hall. The ambient temperature was 18 °C and the relative humidity was 51%. During the experiment, a power-frequency reference voltage of 638 kV was applied to simulate the normal operating condition of the arrester. Under this condition, the total current was approximately 1800 μA, the resistive current was about 340 μA, and the energization ratio was approximately 71%, indicating that the arrester was operating in a representative normal state.

The internal temperature of the surge arrester was monitored in real time using temperature-sensing modules embedded in the potential-distribution probes. All measurement points were uniformly distributed along the axial direction of the arrester, and their locations coincided with those used for potential-distribution measurement. Temperature data at each point were transmitted remotely via wireless communication. Measurements were recorded at 1, 6, 12, 18, 23, 30, and 38 min after energization. The resulting internal temperature data for the normal arrester are summarized in Table 4.

As indicated in Table 4, under continuous application of the power-frequency reference voltage, the temperatures at all internal measurement points exhibit a gradual increase with time and then progressively stabilize, indicating a transition toward a quasi-steady thermal state.

Overall, at 38 min the temperature rise at each measurement point relative to the initial value generally falls within 1.5–3.0 °C, and no abrupt changes or localized abnormal heating are observed. This indicates that, under normal operating conditions, the internal energy dissipation of the surge arrester is low and its thermal stability is satisfactory. Figure 3 shows the external surface infrared thermogram of the arrester in the normal condition.

Figure 3 presents the infrared thermographic image of the surge arrester under the normal operating condition. As shown, the temperature field on the outer surface is generally continuous, without pronounced discontinuities or high-temperature hotspots. This observation indicates that, during the sustained application of the reference voltage, the external thermal state of the arrester remains stable and no localized abnormal overheating occurs. To further quantify the temporal evolution of the surface temperature and inter-section differences, representative regions on the outer surface of each arrester section were selected for infrared temperature measurements. The temperature statistics at different time instants are listed in Table 5.

As indicated in Table 5, under continuous power-frequency voltage stress, heat generated by internal losses is gradually transferred to the outer surface, leading to an overall temperature increase. Among the five sections, Section #3 exhibits the largest temperature rise of 2.7 °C, followed by Section #2 with 2.2 °C. Sections #5 and #4 show rises of 1.7 °C and 1.4 °C, respectively, whereas Section #1 shows the smallest rise of 0.7 °C. These results indicate that, under normal operating conditions, the surface temperature rise differs to some extent among sections; however, the overall temperature distribution remains relatively uniform, without forming distinct hot–cold regions.

3.1.2. Temperature Characteristics of a UHV Surge Arrester Under the Equivalent Fault Condition

To investigate the internal temperature evolution and spatial distribution of a UHV AC surge arrester under a representative degraded condition, a steady-state temperature-rise experiment was performed on the arrester in the fault state. The measured internal temperature distribution is summarized in Table 6.

As shown in Table 6, compared with the normal condition (#0), the overall internal temperature level of the surge arrester increases noticeably under the fault condition. For specimen #0, the mean temperature across all measurement points is 20.29 °C, with a maximum of 21.7 °C and a minimum of 19.2 °C. The temperature range (maximum–minimum) is 2.5 °C, and the temperature field exhibits a low degree of dispersion (standard deviation ≈ 0.67 °C), reflecting favorable thermal stability under the normal condition.

Under the fault condition, the mean temperature across all measurement points for specimens #1–#5 increases to 23.16–23.53 °C, corresponding to an average rise of approximately 2.87–3.24 °C relative to specimen #0. The corresponding maximum temperature increases to 25.3–28.0 °C, the temperature range expands to 5.0–7.9 °C, and the dispersion of the temperature field increases markedly, with standard deviations of approximately 1.45–2.30 °C. These results indicate that the fault condition not only raises the overall temperature level but also amplifies temperature differences among sections and measurement points, causing the internal temperature field to shift from an approximately uniform state to a distinctly non-uniform one.

While the internal temperature was being measured, an infrared thermal imager was used simultaneously to monitor the outer surface temperature of the surge arrester. The test voltage, ambient conditions, and measurement arrangement were kept consistent with those used in the internal-temperature experiment. Figure 3 shows the infrared thermographic image of the arrester surface, and Table 7 summarizes the maximum and minimum surface temperatures of each section in the steady-state stage.

Under the fault condition, the overall surface temperature level also increases. For the five defective-location cases (#1–#5), the average surface temperature is approximately 19.54–19.87 °C, representing an increase of about 0.99–1.32 °C relative to specimen #0 (18.55 °C). Meanwhile, the global temperature difference expands to 2.5–3.7 °C, and the surface temperature field changes from weakly non-uniform to distinctly non-uniform, with markedly enhanced hotspot visibility in the infrared thermograms.

As shown in Table 7, under the normal condition (#0), the surface temperature of the surge arrester is generally uniform. The maximum surface temperature occurs at Section #2 (19.3 °C), whereas the minimum is observed at Section #5 (18.2 °C), yielding a global temperature difference of only 1.1 °C. This indicates that no pronounced hotspot region is present on the outer surface in the normal state.

Overall, the temperature results indicate that degradation affects not only the overall thermal level of the surge arrester, but also the spatial distribution of the temperature field. Compared with the normal condition, the degraded cases exhibit higher mean temperature, larger temperature range, and stronger spatial non-uniformity. Moreover, the surface hotspot tends to shift toward the section associated with the degraded location, suggesting that temperature information can provide useful spatial cues for degradation localization.

3.2. Current Characteristic Analysis

Current characteristics directly reflect the conduction state and loss level of a surge arrester under continuous operating voltage, among which the resistive current component is more closely associated with heating. Under identical test conditions, this section compares the total current and resistive current of specimens #0 and #1–#5, with emphasis on changes in magnitude, phase/waveform differences, and the influence of the degraded-section location on the overall current response. In Table 8, the symbols “NaN” and the value 0.01 mA represent abnormal raw-data entries rather than valid current measurements. The NaN entries were caused by temporary sensor dropout during acquisition, whereas the 0.01 mA values were identified as invalid readings caused by measurement interruption. In total, 12 abnormal entries were observed in the raw current table, including 10 NaN values and 2 invalid 0.01 mA values. These entries were excluded during preprocessing before model training and evaluation, and the same rule was consistently applied to all data subsets. Accordingly, Table 8 is retained only to present the raw measurement situation, while the cleaned dataset used for subsequent analysis did not include these abnormal entries.

As shown in Table 8, under the normal condition (#0), the total current ranges from 1.30 mA to 2.10 mA, with an average of approximately 1.61 mA. In terms of the axial distribution, the current generally exhibits a pattern of being higher in the upper-to-middle sections and lower toward the bottom. This result suggests that, under continuous operating voltage, the combined effects of inter-section voltage sharing and distributed capacitance/stray parameters in a healthy arrester lead to an axial current gradient, which provides a baseline for subsequent comparisons with fault conditions.

Under the fault condition, the overall level of the total current increases and its distribution becomes more uniform. From the data for cases #1–#5, the mean current across all measurement points is approximately 1.61 mA in the normal condition (#0), whereas it increases to about 1.80–1.85 mA under the fault condition, corresponding to an average increment of roughly 0.17–0.24 mA. Meanwhile, the dispersion of the total current decreases markedly: the global range is about 0.80 mA in the normal case but typically drops to 0.29–0.39 mA in the degraded cases. These results indicate that the total-current distribution shifts from a “pronounced gradient” to a “relatively uniform” pattern when a fault is present.

3.3. PD Signal Characteristic Analysis

PD signals are important indicators of weak insulation regions and local electric-field distortion inside a surge arrester. Short-circuit-type degradation alters the voltage sharing of the varistor blocks and the distribution of local field strength, leading to differences in PD activity in terms of pulse occurrence, amplitude, and periodicity with respect to the power-frequency phase. Accordingly, using specimen #0 as the reference, this section compares the UHF PD responses of specimens #1–#5 with different degraded-section locations under identical energization conditions, focusing on the presence/absence of PD pulses and the time-/frequency-domain characteristics and phase-distribution patterns of representative single pulses.

To improve the reliability of PD identification, two synchronized UHF sensors (C2 and C4) were installed at fixed positions around the arrester specimen. In this study, no universal fixed UHF amplitude threshold was assumed. A pulse was regarded as valid only when its amplitude exceeded the measured background-noise floor and exhibited consistent pulse characteristics in the synchronized UHF channels. The two-channel recordings were further used for time-difference-of-arrival analysis to distinguish signals originating from the arrester specimen from external interference. If no synchronized pulses above the threshold were observed, the corresponding condition was categorized as showing no evident PD activity.

As shown in Figure 4 and Table 9, two synchronized UHF sensors (C2 and C4) were placed at fixed positions around the arrester specimen. Depending on the defective-section location, the sensor-to-arrester distances ranged from 2.0 to 3.5 m for C4 and from 5.0 to 8.5 m for C2. Under these configurations, evident PD activity was detected only for specimens #1 and #5, whereas no evident PD pulses were observed for specimens #0, #2, #3, and #4.

The effective operating bandwidth of the UHF sensors used in this study was 300–1500 MHz. Under the present acquisition configuration, the measured background-noise peak-to-peak levels were approximately 0.008 V for channel C2 and 0.009 V for channel C4. As a representative example, for the pulse file C4000 under specimen #1, the peak-to-peak amplitude recorded in C2 was 0.1373 V, while the synchronized pulse amplitude recorded in C4 was 0.094 V. Both values were clearly higher than the corresponding background-noise levels. In addition, the pulse in C2 arrived approximately 3 ns earlier than that in C4. Therefore, this event was accepted as a valid specimen-originated PD pulse according to the two-channel acceptance rule.

To investigate the PD characteristics of a UHV AC surge arrester under fault conditions, five short-circuit-defect specimens (#1–#5) were prepared by placing the degraded section at different locations within the MOA (Sections #1–#5), while specimen #0 corresponds to the normal MOA. A voltage of 638 kV was applied to all specimens. The PD signals detected when the degraded section was located at Sections #1, #2, #3, #4, and #5 are shown in Figure 5.

From Figure 5, it can be observed that specimens #0, #2, #3, and #4 exhibit no evident pulse signals in the data recorded by the UHF sensors C4 and C2. The UHF measurements consistently indicate that no PD activity is detected in these specimens.

Specimen #1 corresponds to the fault case where Section #1 is degraded. Using the single-pulse files C4000, C4008, and C4009, the time- and frequency-domain waveforms of representative individual pulses are plotted in Figure 6, Figure 7 and Figure 8, respectively.

Based on the UHF time-difference-of-arrival (TDOA) localization analysis, the UHF signals captured by the C2-channel sensor consistently arrive earlier than those recorded by the C4-channel sensor for all examined directions. This timing relationship indicates that the UHF pulses originate from the surge specimen. Clear pulse signals are observed in the measured data, and the UHF results confirm the presence of PD activity in the specimen. The pulses exhibit a certain periodicity with respect to the power-frequency cycle; according to the phase distribution and characteristic patterns, the discharges are preliminarily identified as insulation-type PD.

Specimen #5 corresponds to the fault case where Section #5 is degraded. Using the recorded single-pulse files, the time-domain waveforms of representative individual pulses are plotted in Figure 9.

Clear pulse signals are observed in the measured data. According to the UHF time-difference-of-arrival (TDOA) localization analysis, the signals detected by the C2 sensor consistently arrive earlier than those recorded by the C4 sensor for all examined directions, indicating that the UHF pulses originate from the surge arrester specimen. The pulses exhibit periodicity with respect to the power-frequency cycle, consistent with discharge behavior. Moreover, the phase distribution and spectral-pattern characteristics agree with those of floating discharges. The UHF results therefore confirm the presence of PD activity in the specimen, identified as floating discharge.

Under fault conditions, the three categories of monitoring quantities—temperature, current, and PD signals—exhibit distinct and complementary response characteristics. Temperature and current primarily reflect changes in the overall loss level and its distribution, whereas PD is more sensitive to localized insulation abnormalities and electric-field distortions; moreover, the presence or absence of PD shows a pronounced dependence on the location of the degraded section. A single type of signal is therefore often insufficient to meet the simultaneous requirements of identifying both “global variations” and “local anomalies.” It is thus necessary to jointly model and fuse multi-source information, including leakage current, temperature, PD, and voltage. On this basis, the next chapter will develop a deep-learning assessment model with multi-source information fusion and employ SSA to optimize key hyperparameters, enabling high-accuracy identification of both the degradation state and the degraded-section location of the surge arrester.

4. Degradation Assessment of Surge Arresters Based on a Multi-Source Information Fusion Model

4.1. Dataset Construction and Preprocessing

For reproducibility, the dataset construction and preprocessing procedures are described as follows. Six operating conditions were considered, including one normal condition (#0) and five degraded-location conditions (#1–#5), corresponding to the healthy arrester and the five possible locations of the degraded section, respectively. To characterize the degradation state of the surge arrester from multiple complementary perspectives, four types of data were collected, namely leakage-current signals, voltage signals, UHF PD signals, and temperature data.

The leakage-current and voltage signals were synchronously acquired at a sampling frequency of 100 kHz, with 4096 points recorded in each acquisition. Each record was segmented into 512-point windows with 50% overlap, and each window was treated as an input sample. The UHF PD signals were acquired separately at 1 GHz, with 2048 points per record, and were segmented into 256-point windows using the same overlap ratio. The temperature data were organized as 45 × 7 spatial–temporal tensors, where 45 denotes the internal temperature measurement points along the arrester and 7 denotes the acquisition instants. This representation preserves both the temporal evolution and the spatial distribution of the temperature field. For the PD modality, the UHF signals were not represented only by manually selected single-pulse descriptors. Instead, the raw UHF waveforms were segmented into fixed-length windows and directly used as inputs to the 1D-CNN branch, so that discriminative local patterns related to pulse occurrence, oscillation behavior, and time–frequency characteristics could be learned automatically during training.

To avoid bias caused by class imbalance, the final dataset was constructed to be as balanced as possible across the six operating conditions. Specifically, the dataset used for model training and evaluation contained a total of 200 samples in all, with the samples distributed as evenly as possible among the six classes. Before being fed into the network, all inputs were normalized to reduce the influence of scale differences among heterogeneous modalities and to improve training stability. For one-dimensional time-series data, normalization was performed on each sample sequence. The temperature tensors were normalized in matrix form before being fed into the corresponding network branch.

Particular attention was paid to preventing information leakage during dataset partitioning. Since overlapping windows extracted from the same acquisition sequence are highly similar, random splitting at the sample-window level may lead to overly optimistic results. For this reason, a specimen-level partition strategy was adopted. Specifically, all windowed samples generated from the same physical specimen under the same operating condition were assigned to only one subset, rather than being randomly distributed between the training and test sets. In this way, samples from the same specimen were prevented from appearing in different subsets, thereby enabling a stricter and more realistic evaluation of model generalization.

It should also be emphasized that no artificial noise augmentation was used in constructing the training, validation, or test datasets. All samples were obtained directly from experimentally measured signals through windowing and normalization, without introducing any synthetic perturbations into the original data. Therefore, the reported classification performance was not achieved by artificially increasing class separability, but was determined solely by the actual operating conditions of the arrester specimens.

To further examine the robustness of the proposed model to measurement disturbances, an additional noise-sensitivity analysis was performed on the test set. In this analysis, additive Gaussian white noise with different signal-to-noise ratio (SNR) levels was superimposed on the electrical signal inputs, while the specimen-level partition remained unchanged. The corresponding results are presented in Table 10.

The results show that the proposed SSA-optimized CNN–LSTM framework maintained relatively stable classification performance under moderate noise contamination, suggesting that the degradation features learned by the model were not overly sensitive to external disturbances. Since the added noise was introduced only for robustness evaluation and was applied in a class-independent manner, this procedure did not create artificial separability among classes.

4.2. Fundamental Principles of the Model

4.2.1. CNN Architecture

A standard CNN architecture typically begins with an input layer, followed by multiple convolutional and pooling layers stacked alternately in a hierarchical manner. The network then terminates with fully connected layers to perform feature-to-output mapping and generate the final predictions. The overall architecture is illustrated in Figure 10.

Depending on the dimensionality of the input, CNNs are generally categorized into one-dimensional (1D), two-dimensional (2D), and three-dimensional (3D) variants. Considering the differences in temporal and spatial representations of the sensing data in this study, the network branches are configured in a targeted manner. For 1D time-series signals—such as power-frequency leakage current, PD signals, and voltage waveforms—1D convolutional kernels along the time axis are employed for feature extraction. For 2D representational data, such as temperature-field distributions, 2D convolutional kernels are adopted to model spatial features.

In this study, convolutions are performed in same mode, as indicated by the red box in Figure 11. Specifically, symmetric boundary padding is applied to extend the edges of the input feature map (3 × 3), ensuring that, when a 3 × 3 convolution kernel is used, the output feature map remains exactly the same size as the input. This mode preserves effective extraction of edge features while maintaining continuity of feature propagation through parameter sharing, thereby providing the network with a stable hierarchical flow of spatial information.

As shown in Figure 11, the convolution operation follows (1):

H_{n} = f (H_{n - 1} \otimes w_{n} + b_{n})

(1)

where H_n denotes the output of the n-th convolutional layer; H_n−1 denotes the input to the n-th convolutional layer (which is also the output of the (n−1)-th layer); f is the activation function; w_n and b_n are the weights and bias of the n-th convolutional layer, respectively.

The pooling layer reduces the resolution of feature maps, thereby decreasing the number of parameters and computational complexity and alleviating overfitting. The pooling operation is illustrated in Figure 12.

The output layer maps the continuous feature representation produced by the fully connected module to a class-level probability distribution, and a Softmax classifier is employed to determine the diagnostic category. The Softmax function is given by (2):

p (y = m ∣ z) = \frac{e^{z_{m}}}{\sum_{k = 1}^{K} e^{z_{k}}}

(2)

where p(y = m|z) of the input z_m vector z with sequence size K, belonging to the probability of class m.

4.2.2. LSTM Architecture

LSTM achieves precise control of long-term temporal dependencies through a gating mechanism, and its core structure consists of the following four components, as shown in Figure 13.

By combining the gating results of the forget gate and the input gate, the cell state is adaptively updated, thereby continuously retaining the most critical memory information associated with long-term evolution during iteration. The cell state update at time t is given by (3):

\begin{matrix} f_{t} & = σ_{1} (W_{f} \cdot [h_{t - 1}, x_{t}] + b_{f}) \\ i_{t} & = σ_{1} (W_{i} \cdot [h_{t - 1}, x_{t}] + b_{i}) \\ {\tilde{C}}_{t} & = \tanh (W_{c} \cdot [h_{t - 1}, x_{t}] + b_{c}) \\ C_{t} & = f_{t} \otimes C_{t - 1} + i_{t} \otimes {\tilde{C}}_{t} \\ o_{t} & = σ_{1} (W_{o} \cdot [h_{t - 1}, x_{t}] + b_{o}) \\ h_{t} & = o_{t} \otimes \tanh ({\tilde{C}}_{t}) \end{matrix}

(3)

where

f_{t}

,

i_{t}

, and

o_{t}

denote the outputs of the forget gate, input gate, and output gate, respectively;

x_{t}

represents the input at time step

t

;

h_{t}

represents the output at time step

t

;

{\tilde{C}}_{t}

denotes the new candidate cell state vector; and

C_{t}

is the cell state at time step

t

.

W_{f}

,

W_{i}

,

W_{o}

,

b_{f}

,

b_{i}

, and

b_{o}

are the weight matrices and bias vectors associated with the gating structures, respectively, while

W_{c}

and

b_{c}

are the weight matrix and bias vector corresponding to the candidate cell state vector.

σ_{1}

and

t a n h

denote the activation functions.

4.2.3. Fundamental Principle of the SSA Algorithm

The sparrow search algorithm (SSA) is a swarm-intelligence optimization method that searches the solution space by simulating the division of labor and cooperative behavior of sparrows during foraging and anti-predation processes.

In the sparrow search algorithm, the sparrow population is represented by the following matrix:

X = [\begin{matrix} x_{1} \\ x_{2} \\ ⋮ \\ x_{n} \end{matrix}] = [\begin{matrix} x_{1, 1} & x_{1, 2} & \dots & x_{1, d} \\ x_{2, 1} & x_{2, 2} & \dots & x_{2, d} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ x_{n, 1} & x_{n, 2} & \dots & x_{n, d} \end{matrix}]

(4)

where historically best position is denoted as X_best. The function f(x) is defined as the food amount in the current region, which represents the fitness value of the objective function. Based on this setting, according to the specific position of each sparrow, the corresponding fitness value at that position can be accurately calculated.

X_{i}^{t + 1} = \{\begin{matrix} X_{i}^{t} \cdot e x p (\frac{- i}{α \cdot t_{m a x}}), & R_{2} < S_{T}, \\ X_{i}^{t} + q \cdot L, & R_{2} \geq S_{T}, \end{matrix}

(5)

Once danger occurs, the discoverers need to quickly lead the population to evacuate the current area; if there is no danger, the discoverers need to search for new food sources over a wider range. The update equation for the discoverers is:

X_{i}^{t + 1} = \{\begin{matrix} q \cdot e x p (\frac{X_{w}^{t} - X_{i}^{t}}{i^{2}}), & i > \frac{n}{2}, \\ X_{p}^{t} + |X_{i}^{t}− X_{p}^{t}| \cdot A^{+} \cdot L, & i \leq \frac{n}{2}, \end{matrix}

(6)

where t denotes the current iteration number, t_max is the maximum number of iterations in the entire process. α is a random value generated in an open interval. R₂ is the preset warning value; S_T is the safety threshold. q is a random number following a normal distribution; L is a row matrix whose elements are all 1.

When R₂ is less than S_T, the population is in a safe state and performs a wide-range search. Conversely, if R₂ is greater than or equal to S_T, it indicates the presence of predation threats, and the population, guided by the vigilant individuals, moves to a safer area to continue foraging.

The update equation for the followers is:

F = [\begin{matrix} f (x_{1}) \\ f (x_{2}) \\ ⋮ \\ f (x_{n}) \end{matrix}]

(7)

where

X_{p}^{t}

represents the best position of the discoverers, while

X_{w}^{t}

denotes the worst position in the sparrow population at the current time. The matrix A⁺ is a special one-row multidimensional matrix, in which each element takes a value of either 1 or −1. n/2 refers to the number of remaining sparrows in the population. When I > n/2, it means that, in the whole sparrow population, the fitness value corresponding to the i-th sparrow is relatively low; it has not successfully obtained food resources, and therefore needs to fly to other areas to search for food in order to survive.

The initial positions of the vigilant individuals are generated by a randomization method, ensuring that, in the early stage of algorithm iteration, they can be uniformly distributed in the solution space, thereby effectively fulfilling the functions of environmental monitoring and risk warning. The position update equation for the vigilant individuals is:

X_{i}^{t + 1} = \{\begin{matrix} X_{b}^{t} + β \cdot |X_{i}^{t} - X_{b}^{t}|, & f_{i} \neq f_{g}, \\ X_{i}^{t} + K \cdot (\frac{|X_{i}^{t} - X_{w}^{t}|}{(f_{i} - f_{w}) + ε}), & f_{i} = f_{g}, \end{matrix}

(8)

where

X_{b}^{t}

represents the global best position at this moment. β is a step-size control parameter, which plays a regulating role in the step length of the corresponding operation. β is a random number with a value range between (0, 1), and its randomness brings uncertainty factors to the system. f_i denotes the fitness value of the current individual in the population, and f_w and f_g correspond to the best and worst fitness values in the current population, respectively. To prevent the denominator from being zero, a constant ε is introduced.

When f_i ≠ f_g, the individual is mostly located at the edge of the population and faces a higher predation risk. When f_i = f_g, it indicates that the individual has perceived danger and tends to move toward the population center to reduce the risk.

The overall procedure of SSA is shown in Figure 14.

The power-frequency leakage-current sequences and the PD- and voltage-fluctuation feature sequences collected from surge arresters under different operating conditions are taken as inputs, and key features such as amplitude variations and temporal patterns are extracted using two 1D convolutional neural network layers, respectively. “Based on temperature signals” refers to using the internal temperature measurement points of the varistor blocks in each section at different operating stages together with the outer-surface temperature data as inputs, and adopting a structure composed of three convolutional layers and two pooling units to learn and extract deep representations of the spatial distribution characteristics of the temperature field and their temporal variations. Subsequently, the features extracted by the four branches are flattened into 1D vectors through a Flatten layer; after fusion, they are fed into an LSTM network, where the temporal dependence relationships among multi-source features are further modeled. Finally, the operating state of the surge arrester and the identification results of the degraded-section location are output through a fully connected layer and a Softmax layer. Based on the above analysis, the overall network architecture of the proposed multi-source information fusion degradation assessment algorithm is shown in Figure 15. Multi-source information fusion deterioration assessment process. is shown in Figure 16. Structural framework of degradation assessment model for multi-source information fusion is shown in Figure 17.

To improve reproducibility, the detailed implementation settings of the proposed multi-branch CNN–LSTM model are summarized in Table 10. The leakage-current, voltage, and PD branches each adopt two 1D convolutional layers followed by a max-pooling layer for local temporal feature extraction, whereas the temperature branch adopts three 2D convolutional layers and two max-pooling layers to capture the spatial–temporal characteristics of the temperature field. After branch-wise feature extraction, the resulting feature vectors are flattened and concatenated, and the fused features are then fed into the LSTM layer for temporal dependency modeling, followed by a fully connected layer and a Softmax classifier for final diagnosis.

As shown in Table 11, the leakage-current, voltage, and PD signals were treated as one-dimensional single-channel sequences, and their input dimensions were therefore expressed as

512 \times 1

or

256 \times 1

, where the last dimension denotes the number of channels. The temperature data were represented as a

45 \times 7 \times 1

tensor, where 45 denotes the spatial measurement points, 7 denotes the acquisition instants, and 1 denotes the single temperature channel. After branch-wise feature extraction, the resulting feature maps were flattened into one-dimensional vectors and concatenated to form the fused feature representation. This fused feature vector was then fed into the LSTM layer to model temporal dependency, and the final classification result was obtained through the fully connected layer and the Softmax classifier. Based on the above network architecture, SSA was further employed to optimize the key hyperparameters of the proposed model.

In this study, SSA was used to optimize the key hyperparameters of the proposed CNN–LSTM model, including the number of hidden neurons in the LSTM layer, batch size, initial learning rate, L2 regularization coefficient, and number of training epochs. To ensure reproducibility, the search ranges were set as follows: the number of hidden neurons was searched in the range of 16–64, the batch size in 8–32, the initial learning rate in 1 × 10⁻⁴ to 1 × 10⁻², the L2 regularization coefficient in 1 × 10⁻⁵ to 1 × 10⁻², and the number of training epochs in 30–80. The population size of SSA was set to 10, and the maximum number of iterations was set to 15. Under these settings, a total of 150 candidate hyperparameter combinations were evaluated during the optimization process. These search intervals were chosen empirically in view of the relatively small sample size, so as to control model complexity while maintaining sufficient optimization flexibility.

The fitness function was defined as the classification accuracy on the validation subset. For each candidate solution generated by SSA, the model was trained on the training subset and then evaluated on the validation subset, and the resulting fitness value was returned to guide the population update. The predefined search ranges of the hyperparameters and the final optimized configuration obtained after convergence are summarized in Table 12. After the optimization process converged, the final selected hyperparameters were determined as follows: the number of hidden neurons in the LSTM layer was 48, the batch size was 16, the initial learning rate was 2.5 × 10⁻³, the L2 regularization coefficient was 5.0 × 10⁻³, and the number of training epochs was 50.

If multiple candidate solutions yielded similar fitness values, the one with the lower validation loss and more stable convergence behavior was selected as the final configuration.

4.3. Degradation Assessment: Results and Discussion

After training with 20 iterations per round and a total of 1360 iterations, both the accuracy curve and the loss function gradually converge and become stable. The recognition accuracies of the training and test sets, together with the corresponding confusion matrices, are shown in Figure 18.

As shown in Figure 18, after simultaneously incorporating the power-frequency leakage current, PD, voltage, and temperature signals, the proposed assessment method achieves a pronounced performance improvement on both the training and test sets. The proposed method achieves an accuracy of 97.47% on the training set and 94.23% on the test set, with a gap of only 3.24 percentage points. This relatively small difference indicates that the model does not merely memorize the training samples but maintains satisfactory generalization capability on unseen data.

From the confusion matrices, most classes are identified correctly, and the misclassifications are mainly concentrated between Class 3 and Class 4. Specifically, several samples belonging to Class 3 are misclassified as Class 4 in both the training and test sets, while the remaining classes are almost perfectly recognized. This indicates that the proposed model has strong discriminative capability for most degradation locations, whereas the feature representations of these two adjacent classes still exhibit partial overlap. A possible reason for this phenomenon is that the degradation states corresponding to Class 3 and Class 4 may produce relatively similar responses in leakage current, PD activity, voltage fluctuation, and temperature distribution. When the degraded locations are adjacent or their severity levels are close, the inter-class differences become weaker, which increases the difficulty of boundary discrimination.

As can be seen, to further assess the classification behavior of training set and test set at the class level, the detailed precision, recall, F1-score, and accuracy values are reported in Table 13.

As reported in Table 13, most classes achieve 100% precision, recall, and F1-score. In contrast, Class 3 exhibits the lowest recall, while Class 4 shows the lowest precision, which is consistent with the confusion-matrix results and further confirms that the main classification ambiguity occurs between these two classes.

These results demonstrate that the proposed multimodal fusion framework can effectively identify the degradation location of surge arresters with high accuracy and stability, showing good potential for practical condition assessment and early fault warning in engineering applications.

4.4. Generalization Validation with Specimen-Level Data Split

Although the above experimental results demonstrate the effectiveness of the proposed method, the use of overlapping windows in sample construction may introduce a risk of information leakage if the dataset is randomly divided at the sample-window level. To address this issue, a stricter specimen-level data partition strategy was further adopted in this study. Specifically, all windowed samples generated from the same physical specimen under the same operating condition were assigned exclusively to one subset only. As a result, no samples originating from the same specimen were allowed to appear simultaneously in the training, validation, and test sets. In addition, stratified partitioning was performed across the six operating-condition classes to maintain class balance in each subset. Under this protocol, the obtained results provide a more reliable assessment of the proposed model’s generalization capability to unseen specimens, rather than reflecting performance inflated by the presence of highly similar adjacent windows extracted from the same acquisition sequence.

Figure 19 presents the confusion matrices of the proposed method under the specimen-level partition strategy. The model achieved an accuracy of 95.6% on the training set and 92.5% on the test set, indicating that the overall classification performance remained stable even after the elimination of potential leakage caused by sample-window-level random splitting. The relatively small gap between the training and test accuracies suggests that the proposed SSA-optimized CNN-LSTM framework maintained satisfactory robustness and did not exhibit obvious overfitting under the stricter validation protocol.

A closer inspection of the test confusion matrix further confirms the discriminative capability of the proposed model. Classes 2, 4, and 6 were all identified with a recall of 100.0%, whereas Classes 1, 3, and 5 achieved recalls of 80.0%, 85.7%, and 87.5%, respectively. Most misclassifications were concentrated between Classes 1 and 2, between Classes 3 and 4, and between Classes 5 and 2. These confusion patterns are physically reasonable, because the corresponding degradation states may share partially similar temporal or multimodal feature characteristics after specimen-level partitioning. Nevertheless, the diagonal dominance of the confusion matrix indicates that the extracted spatiotemporal features still provide effective class separation for most unseen specimens.

It should also be noted that the class-wise percentages fluctuated to some extent under the stricter specimen-level split, mainly because the number of available specimens in several classes was limited. Even so, the test-set results demonstrate that the proposed model preserved strong generalization capability across all six operating-condition classes. Therefore, the specimen-level validation results provide additional evidence that the proposed method is not merely memorizing highly similar overlapping windows, but is also capable of learning representative degradation features with practical value for unseen-sample diagnosis.

4.5. Baseline Comparisons and Ablation Study

To further clarify the respective roles of each data modality and the effect of SSA-based hyperparameter optimization, additional comparative experiments were added in study. Three groups of baseline methods were considered. First, single-modality models were established using only one type of input, namely leakage current, temperature, PD, or voltage. Second, a multi-source fusion model without SSA optimization was constructed, in which the same fusion architecture was retained but the hyperparameters were selected empirically. Third, a conventional machine-learning baseline was introduced using handcrafted features extracted from the multi-source data and classified by a traditional classifier.

To ensure a fair comparison, all methods were evaluated under the same preprocessing procedure, specimen-level data partition strategy, and class-balanced setting as those used for the proposed model. For the deep-learning baselines, the network backbone was kept as consistent as possible, and only the input branches were adjusted according to the modality under consideration. For the fusion model without SSA, the network structure remained unchanged, while the hyperparameters were determined manually rather than through SSA. For the conventional machine-learning baseline, representative statistical features were extracted from the current, voltage, PD, and temperature data and then used as inputs to the classifier.

As reported in Table 14, the comparative results show that all single-modality models performed worse than the multi-source fusion models, indicating that no individual signal can adequately characterize the degradation state on its own. The fusion model without SSA already outperformed the single-modality baselines, which confirms the advantage of combining heterogeneous monitoring information.

Since the main purpose of this analysis was to quantify the variability introduced by random data partitioning and SSA-based hyperparameter search, repeated-run experiments were conducted for the proposed SSA-CNN–LSTM model and the main fusion baseline CNN–LSTM without SSA, which constitute the most direct comparison for evaluating the contribution of SSA. Specifically, both models were trained and evaluated over five independent runs using different random seeds (1, 2, 3, 4, and 5) under the same specimen-level partition strategy. For each run, the corresponding seed controlled the data partition and model initialization, and for the proposed model it also controlled the initialization of the SSA population. For each seed, the specimen-level data partition and network initialization were controlled consistently, and for the proposed model the seed also governed the initialization of the SSA population. The values reported in Table 15 are the mean ± standard deviation over the five independent runs. The mean and standard deviation of accuracy and Macro-F1 across the five runs are summarized in Table 15.

As shown in Table 15, the proposed SSA-CNN–LSTM model achieved higher average accuracy and Macro-F1 than the CNN–LSTM fusion baseline without SSA across repeated runs, indicating that the observed performance gain is robust rather than being dependent on a single favorable run.

Overall, the results in Table 14 and Table 15 indicate that the performance gain of the proposed method is attributable not only to multi-source information fusion, but also to the more effective hyperparameter selection enabled by SSA, and that this gain remains stable across repeated runs with different random seeds.

5. Conclusions

Based on equivalent degradation experiments on a 1000 kV class UHV surge arrester, this study established a multi-source information fusion framework for degradation-state assessment using leakage current, PD, temperature-field information, and voltage signals. The results show that the proposed SSA-optimized CNN–LSTM model is able to distinguish the normal state from different degraded-section conditions with good accuracy under the laboratory conditions considered in this work. These results suggest that combining multi-source information with hyperparameter optimization is an effective way to improve degradation-state identification.

At the same time, some limitations of this study should be noted. First, the experimental verification was carried out on only one type of surge arrester under controlled laboratory conditions. Second, the degraded condition was simulated by short-circuiting part of the ZnO varistor blocks as an equivalent degradation mode, which cannot fully represent the full range of natural aging processes in service, such as moisture ingress, thermal aging, or surface contamination. Third, the dataset used in this study was obtained from laboratory experiments rather than long-term field operation, so the performance of the proposed method under practical operating conditions still needs to be further examined.

Future work will include validation on different arrester types and voltage classes, consideration of additional degradation mechanisms, and evaluation using field-measured data under more complex environmental and operating conditions. Therefore, the proposed method should be regarded as a promising approach validated under laboratory conditions, while further study is still needed before practical large-scale application.

Author Contributions

Conceptualization, D.Z. and Z.C.; methodology, D.Z.; software, D.Z.; validation, D.Z. and Z.C.; formal analysis, D.Z.; investigation, D.Z.; resources, D.Z.; data curation, D.Z.; writing—original draft preparation, D.Z.; writing—review and editing, D.Z.; visualization, D.Z.; supervision, D.Z.; project administration, D.Z.; funding acquisition, Z.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

Thomas, J.; Joseph, B.; Jose, P.J.; Maria, H.J.; Main, P.; Rahman, A.A.; Francis, B.; Ahmad, Z.; Thomas, S. Recent Advances in Cross-linked Polyethylene-based Nanocomposites for High Voltage Engineering Applications: A Critical Review. Ind. Eng. Chem. Res. 2019, 58, 20863–20879. [Google Scholar] [CrossRef]
Christodoulou, A.C.; Vita, V.; Mladenov, V.; Ekonomou, L. On the Computation of the Voltage Distribution along the Non-Linear Resistor of Gapless Metal Oxide Surge Arresters. Energies 2018, 11, 3046. [Google Scholar] [CrossRef]
Metwally, I.A.; Eladawy, M.; Feilat, E.A. Online condition monitoring of surge arresters based on third-harmonic analysis of leakage current. IEEE Trans. Dielectr. Electr. Insul. 2017, 24, 2274–2281. [Google Scholar] [CrossRef]
Paplinski, P.; Wankowicz, J. Application of Leakage Current Parameters for Technical Diagnostics of Surge Arresters. IEEE Trans. Dielectr. Electr. Insul. 2016, 23, 3458–3465. [Google Scholar] [CrossRef]
Khodsuz, M.; Teymourian, H.M.; Seyyedbarzegar, S. New criteria for metal oxide surge arrester condition monitoring based on leakage current analysis: Considering non-uniform pollution effect. IET Gener. Transm. Distrib. 2024, 18, 1072–1089. [Google Scholar] [CrossRef]
Bhoye, G.; Jain, I. Failure Analysis and Optimization of High Mechanical Strength Surge Arresters Using Finite Element Analysis. J. Fail. Anal. Prev. 2024, 24, 1230–1242. [Google Scholar] [CrossRef]
Munir, A.; Abdul-Malek, Z.; Arshad, R.N. Resistive component extraction of leakage current in metal oxide surge arrester: A hybrid method. Measurement 2020, 173, 108588. [Google Scholar] [CrossRef]
Araujo, B.V.S.; Rodrigues, G.A.; de Oliveira, J.H.P.; Xavier, G.V.R.; de Souza Lebre, U.D.E.; de Araujo, C.A.C.; Freire, E.O.; Neto, E.T.W.; Ferreira, T.V. Monitoring ZnO surge arresters using convolutional neural networks and image processing techniques combined with signal alignment. Measurement 2025, 248, 116889. [Google Scholar] [CrossRef]
Kim, J.; Song, I.; Kim, C.; Mun, J. Leakage Current Characteristics of 18kV ZnO Lightning Arresters by Accelerated Aging Test. Trans. Korean Inst. Electr. Eng. C 2002, 5, 62–67. [Google Scholar]
Zacarias, T.G.; Sant’Ana, W.C. A Bibliometric and Comprehensive Review on Condition Monitoring of Metal Oxide Surge Arresters. Sensors 2024, 24, 235. [Google Scholar]
Azam Kayser, M.S.; Othman, M.; Illias, H.A.; Latef, T.A.; Islam, M.T.; Ain, M.F. Ultra-high frequency printable antennas for partial discharge diagnostics in high voltage equipment. Alex. Eng. J. 2023, 64, 709–729. [Google Scholar] [CrossRef]
Zacarias, T.G.; Martins, R.; Xavier, C.E.; Castioni, J.C.O.; Sant’Ana, W.C.; Lambert-Torres, G.; Gama, B.R.; Areias, I.A.d.S.; Bonaldi, E.L.; Assuncao, F.D.O. Detection of Failures in Metal Oxide Surge Arresters Using Frequency Response Analysis. Sensors 2023, 23, 5633. [Google Scholar] [CrossRef] [PubMed]
Lin, L.; Li, J.; Wang, J.; Luo, Y.; Liu, Y. Arrester Fault Recognition Model Based on Thermal Imaging Images Using VMamba. Electronics 2025, 14, 4784. [Google Scholar] [CrossRef]
Lira, G.R.S.; Costa, E.G.; Ferreira, T.V. Metal-oxide surge arrester monitoring and diagnosis by self-organizing maps. Electr. Power Syst. Res. 2014, 108, 315–321. [Google Scholar] [CrossRef]
Da Silva, D.A.; Lahti, K.; Buontempo, R.C.; Filho, J.P. Aspects of moisture ingress in polymer housed surge arresters. Electr. Power Syst. Res. 2017, 148, 162–170. [Google Scholar] [CrossRef]
He, T.; Li, Y.; Zhang, Z.; Shen, P.; Zhang, Y. Leakage Current Sensor and Neural Network for MOA Monitoring. Comput. Intell. Neurosci. 2022, 2022, 6728900. [Google Scholar] [CrossRef] [PubMed]
Fu, Y.; Li, T.; Li, Y.; Hu, X.; Jiang, X.; Dong, Y.; Zhao, P.; Yu, C.; Wang, J. Research on Field Source Characteristics of Leakage Current of Arrester Based on TMR Sensor. Sensors 2023, 23, 3830. [Google Scholar] [CrossRef] [PubMed]
Lian, L. Improved sparrow search algorithm optimized extreme learning machine for ultra-short-term wind speed prediction. J. Atmos. Sol.-Terr. Phys. 2026, 278, 106693. [Google Scholar] [CrossRef]
Lu, Y.; Xue, Z.; Guo, J.; Zhang, C.; Liu, J.; Xiao, X.; Sun, J. Aging monitoring and fault positioning for zinc oxide surge arresters based on the fifth harmonic of the leakage current. Front. Phys. 2025, 131, 595410. [Google Scholar] [CrossRef]
Ding, L.; Hu, X.; Deng, J.; Zhu, W.; Hu, J. MambaSolar-Forcaster: A trustworthy photovoltaic ultra-short-term power forecasting method based on normalized optimization and multi-step forecasting mechanism. Array 2026, 29, 100640. [Google Scholar] [CrossRef]

Figure 1. Physical diagram of the 1000 kV UHV AC surge arrester.

Figure 2. Schematic diagram of degraded-section locations in the 1000 kV surge arrester.

Figure 3. Infrared thermogram of the arrester under normal operating conditions.

Figure 4. Experimental arrangement for multi-source measurements of the 1000 kV surge arrester.

Figure 5. UHF PD signals under different degraded-section conditions. (a) 0# Normal condition, (b) 1# Section #1 short-circuited, (c) 2# Section #2 short-circuited, (d) 3# Section #3 short-circuited, (e) 4# Section #4 short-circuited, (f) 5# Section #5 short-circuited.

Figure 6. Section 1 short circuit C4000 single-pulse waveform. (a) Time-domain peak-to-peak value: 0.1373 V. (b) Frequency domain: dominant frequency f1 = 345.9308 MHz.

Figure 7. Section 1 short circuit C4008 single-pulse waveform. (a) Time-domain peak-to-peak value: 0.1793. (b) Frequency domain: dominant frequency f1 = 389.6882 MHz.

Figure 8. Section 1 short circuit C4009 single-pulse waveform. (a) Time-domain peak-to-peak value: 0.1378 V. (b) Frequency domain: dominant frequency f1 = 345.7234 MHz.

Figure 9. Section 5 short-circuit single-pulse waveform.

Figure 10. Typical convolutional neural networks.

Figure 11. Same mode convolution process diagram.

Figure 12. Pooling process diagram.

Figure 13. The network structure of LSTM.

Figure 14. Sparrow algorithm flowchart.

Figure 15. Multi-source information fusion deterioration assessment algorithm network structure.

Figure 16. Multi-source information fusion deterioration assessment process.

Figure 17. Structural framework of degradation assessment model for multi-source information fusion.

Figure 18. Accuracy of degradation assessment in multi-source information fusion. (a) Accuracy on the training set. (b) Accuracy on the test set. (c) Training set confusion matrix. (d) Test set confusion matrix. (Note: Blue cells indicate correctly classified samples (diagonal elements), whereas orange cells indicate misclassified samples (off-diagonal elements)).

Figure 19. Confusion matrices of the proposed model under the specimen-level data partition strategy. (a) Training set confusion matrix. (b) Test set confusion matrix.

Table 1. DC test data of single MOA.

Specimen	U_1mA (kV)	I_0.75u (uA)
Normal section	225	15
Degraded section	194	100
Change rate	−13.8%	+5.6 times

Table 2. Single MOA AC test data.

Specimen	Total Current I_x (mA)	Peak Resistive Current I_rlp (mA)	Resistive Current Phase Angle ψ (°)
I_r_1p (mA)	1.80	0.22	85.0
Normal section	2.20	0.48	81.1
Degraded section	+22.2%	+1.2	−3.9°

Table 3. The whole MOA AC test data.

Specimen ID	Total Current I_x (mA)	Peak Resistive Current I_p (mA)	Resistive Current Phase Angle Ψ (°)
0#	1.9	0.25	84.7
1#	1.95	0.26	84.5
2#	1.96	0.26	84.5
3#	1.97	0.27	84.4
4#	1.99	0.28	84.4
5#	2.04	0.28	84.6

Table 4. Internal temperature test data of normal arrester.

Time		1 min	6 min	12 min	18 min	23 min	30 min	38 min
Temperature of Each Measuring Point in Section #1	45	16.8	16.7	16.7	17	17.1	17.3	17.7
	44	17.2	17.1	17.1	17.4	17.5	17.7	18.1
	43	16.8	16.8	17	17.2	17.3	17.4	17.8
	42	16.5	16.9	17.2	17.5	17.6	17.7	18.2
	41	16.7	16.7	16.9	17.2	17.2	17.3	18.2
	40	16.8	16.9	17.1	17.5	17.5	17.7	18.2
	39	16.6	16.8	17.1	17.5	17.5	17.7	18.3
	38	16.5	16.7	17	17.5	17.5	17.8	18.4
	37	15.8	16	16.3	17.3	17.4	17.6	18.2
Temperature of Each Measuring Point in Section #2	36	16.2	16.3	16.7	17.3	17.5	17.8	18.8
	35	16.2	16.4	16.8	17.4	17.6	17.9	19
	34	16.2	16.3	16.5	16.8	17	17.2	18.6
	33	16.2	16.5	16.8	17.3	17.4	17.6	18.5
	32	16.3	16.4	16.7	17.1	17.2	17.5	18.3
	31	16.3	16.3	16.5	16.9	17	17.2	18
	30	16.3	16.3	16.5	16.9	17	17.2	18
	29	16.2	16.3	16.7	17.1	17.2	16.7	18.2
	28	15.8	15.8	16	16.2	16.2	16.3	16.5
Temperature of Each Measuring Point in Section #3	27	15.7	15.7	15.9	16.2	16.2	16.3	16.5
	26	15.9	16.1	16.5	16.9	17.2	17.5	18.2
	25	16.3	16.4	16.6	16.9	17	17.1	17.7
	24	16.5	16.6	16.8	17.1	17.2	17.3	17.9
	23	16.3	16.4	16.6	16.9	17	17.1	17.7
	22	15.7	16.2	16.7	16.9	17.4	17.6	17.8
	21	16.5	16.2	16.6	16.9	17.2	17.3	17.8
	20	16.1	16.3	16.8	17.2	17.3	17.6	17.8
	19	15.7	15.3	15.9	16	16	16.1	16.2
Temperature of Each Measuring Point in Section #4	18	15.4	15.2	15.5	15.7	15.8	16	16.4
	17	15.3	15.4	15.7	16	16.1	16.2	16.8
	16	15.4	15.5	15.7	15.9	16	16.2	16.6
	15	15.4	15.5	15.7	15.9	16	16.2	16.6
	14	15.2	15.3	15.5	15.7	16.1	15.8	16.7
	13	15.5	15.7	15.9	16.2	16.2	16.3	16.7
	12	15.3	15.4	15.7	16	16	16.1	16.5
	11	15.2	15.3	15.5	15.8	15.8	15.9	16.3
	10	14.8	14.9	15	15.2	15.2	15.2	15.3
Temperature of Each Measuring Point in Section #5	9	15.3	15.4	15.6	15.8	15.9	16	16.2
	8	15.2	15.3	15.5	15.7	15.7	15.8	16.2
	7	15.2	15.2	15.4	15.6	15.7	15.7	16.1
	6	15.1	15.2	15.3	15.6	15.6	15.7	16.1
	5	15.1	15.2	15.3	15.5	15.6	15.7	16.1
	4	14.8	15.1	15.3	15.5	15.6	15.7	16.1
	3	14.9	15	15.2	15.4	15.5	15.5	16
	2	14.6	14.7	15	15.2	15.3	15.4	15.8
	1	14	14.1	14.2	14.3	14.3	14.3	14.5

Table 5. External test temperatures of different sections of the surge arrester at different times.

Time/Temperature	Section #1/°C	Section #2/°C	Section #3/°C	Section #4/°C	Section #5/°C
20:07	15.3	14.1	13.0	13.4	14.3
20:14	15.7	14.0	13.2	13.7	14.4
20:46	16	16.2	15.7	14.8	16
Difference	0.7	2.2	2.7	1.4	1.7

Table 6. Data on the internal temperatures of 6 deteriorated sections of the surge arrester.

Measurement Location	Temperature Measurement Data (°C)
Measurement Location	Measurement Point	0#	1#	2#	3#	4#	5#
Section #1	Measurement point 40	19.8	23.1	23.2	23.2	23.2	23.1
	Measurement point 39	20.5	24.5	24.4	24.4	24.3	24.3
	Measurement point 38	20.8	26.8	25.3	25.4	25.1	25.2
	Measurement point 37	20.7	27.6	25.5	25.3	25.2	25.2
	Measurement point 36	20.6	27.6	25.4	25.2	25.2	25.1
	Measurement point 35	20.8	27.7	25.5	25.4	25.3	25.3
	Measurement point 34	20.7	27.4	25	25.1	24.9	24.8
	Measurement point 33	20.2	23.4	22.8	22.4	22.3	22.4
Section #2	Measurement point 32	21	24.3	23.5	24.2	23.9	23.8
	Measurement point 31	21.5	25.3	26	25.3	25.1	24.9
	Measurement point 30	21.7	25.8	27.5	25.4	25.2	25.0
	Measurement point 29	21.4	25.9	27.9	25.6	25.5	25.2
	Measurement point 28	21.6	25.9	28	25.5	25.4	25.2
	Measurement point 27	21.6	25.7	27.9	25.2	25	25.0
	Measurement point 26	20.6	24.8	27	24.6	24.6	24.3
	Measurement point 25	19.6	21.7	23.4	21.8	21.8	21.8
Section #3	Measurement point 24	20.1	22.5	23.8	23.2	22.6	22.4
	Measurement point 23	20.4	23.6	24.4	25	23.8	23.7
	Measurement point 22	20.6	24	24.5	25.2	24.1	23.8
	Measurement point 21	20.5	24	24.7	25.3	24.3	24.0
	Measurement point 20	20.5	23.8	24.6	25.3	24	23.8
	Measurement point 19	20.6	23.6	24.6	25.1	23.8	23.5
	Measurement point 18	20.4	22.3	23.6	24.5	22.6	22.5
	Measurement point 17	19.4	21.1	21.2	21.7	21.1	21.1
Section #4	Measurement point 16	19.7	21.7	21.9	22.2	21.6	22.0
	Measurement point 15	20.3	21.9	22	22.5	22	22.1
	Measurement point 14	20	22.3	22.4	22.5	23.5	22.6
	Measurement point 13	20.2	22.2	22.3	22.4	23.6	22.5
	Measurement point 12	20.1	22.1	22.1	22.5	23.5	22.4
	Measurement point 11	19.9	22.3	22.2	22.4	23.4	22.5
	Measurement point 10	19.9	21.7	21.7	21.8	23.2	21.9
	Measurement point 9	19.4	20.3	20.3	20.4	21	20.3
Section #5	Measurement point 8	19.7	20.5	20.9	20.7	21	21.2
	Measurement point 7	19.7	20.7	20.9	21.0	21.2	22.0
	Measurement point 6	19.4	21	20.7	20.8	21.3	22.5
	Measurement point 5	19.7	21.1	21.1	21.2	21.2	22.4
	Measurement point 4	19.6	21.2	21.2	21.1	21.3	22.4
	Measurement point 3	19.6	20.7	20.8	20.9	21.2	22.4
	Measurement point 2	19.7	20.9	20.9	21.0	21.1	21.3
	Measurement point 1	19.2	20.0	20.1	20.1	20.3	20.4

Table 7. Test data of the highest and lowest temperatures of the different sections of the surge arrester.

Measurement Location	Measured Value	0#	1#	2#	3#	4#	5#
Section #1	Maximum	18.9	22.0	21.0	21.0	20.9	20.9
Section #1	Minimum	18.4	19.9	19.9	19.6	19.6	19.6
Section #2	Maximum	19.3	21.2	22.1	21.0	21.0	20.9
Section #2	Minimum	18.3	19.3	20.2	20.1	19.3	19.3
Section #3	Maximum	18.8	20.8	20.7	20.9	20.6	20.5
Section #3	Minimum	18.3	19.0	19.0	19.3	18.4	18.4
Section #4	Maximum	18.6	19.8	19.6	19.6	20.2	19.6
Section #4	Minimum	18.3	18.7	18.7	18.7	18.4	18.5
Section #5	Maximum	18.4	19.1	19.1	18.4	18.5	19.6
Section #5	Minimum	18.2	18.4	18.4	18.4	18.5	18.5

Table 8. Full current measurement data (mA) of the arrester under the equivalent fault condition.

Section	Measurement Point	0#	1#	2#	3#	4#	5#
Section #1	Measurement point 40	1.43	NaN	1.96	1.94	1.95	1.93
	Measurement point 39	1.53	NaN	1.93	1.9	1.92	1.9
	Measurement point 38	1.61	1.94	1.94	1.92	1.93	1.91
	Measurement point 37	1.70	1.95	1.95	1.93	1.94	1.92
	Measurement point 36	1.77	1.96	1.97	1.95	1.96	1.94
	Measurement point 35	1.65	1.98	1.99	1.97	1.98	1.96
	Measurement point 34	1.52	2	2.01	1.99	2	1.98
	Measurement point 33	1.44	2.02	2.03	2.01	2.02	2
Section #2	Measurement point 32	1.67	1.96	NaN	1.94	1.95	1.93
	Measurement point 31	1.85	1.94	NaN	1.93	1.94	1.92
	Measurement point 30	1.97	1.93	1.96	1.92	1.93	1.91
	Measurement point 29	2.10	1.93	1.94	1.92	1.93	1.91
	Measurement point 28	1.99	1.92	1.93	1.91	1.92	1.9
	Measurement point 27	1.87	1.91	1.92	1.9	1.91	1.89
	Measurement point 26	1.76	1.9	1.91	1.89	1.9	1.88
	Measurement point 25	1.67	1.89	1.9	1.88	1.89	1.87
Section #3	Measurement point 24	1.58	0.01	1.84	NaN	1.83	1.81
	Measurement point 23	1.65	0.01	1.83	NaN	1.82	1.8
	Measurement point 22	1.77	1.86	1.82	1.85	1.81	1.79
	Measurement point 21	1.85	1.86	1.81	1.85	1.8	1.78
	Measurement point 20	1.94	1.85	1.81	1.84	1.8	1.78
	Measurement point 19	1.98	1.85	1.8	1.84	1.79	1.77
	Measurement point 18	1.88	1.84	1.8	1.83	1.79	1.77
	Measurement point 17	1.76	1.83	1.79	1.82	1.78	1.76
Section #4	Measurement point 16	1.71	1.74	1.75	1.76	NaN	1.72
	Measurement point 15	1.66	1.73	1.74	1.75	NaN	1.71
	Measurement point 14	1.59	1.69	1.74	1.74	1.77	1.71
	Measurement point 13	1.51	1.69	1.73	1.74	1.76	1.7
	Measurement point 12	1.45	1.68	1.73	1.73	1.75	1.7
	Measurement point 11	1.40	1.67	1.72	1.72	1.75	1.69
	Measurement point 10	1.36	1.67	1.71	1.72	1.74	1.68
	Measurement point 9	1.30	1.66	1.7	1.71	1.73	1.67
Section #5	Measurement point 8	1.31	1.73	1.73	1.72	1.75	NaN
	Measurement point 7	1.31	1.74	1.74	1.73	1.76	NaN
	Measurement point 6	1.32	1.75	1.75	1.73	1.77	1.61
	Measurement point 5	1.32	1.75	1.75	1.74	1.77	1.63
	Measurement point 4	1.33	1.76	1.76	1.75	1.79	1.64
	Measurement point 3	1.34	1.77	1.77	1.75	1.8	1.66
	Measurement point 2	1.35	1.78	1.78	1.77	1.82	1.67
	Measurement point 1	1.37	1.79	1.8	1.79	1.84	1.69

Table 9. UHF PD detection results and sensor-to-arrester distances under different equivalent fault conditions.

Condition	Specimen Description	Applied Voltage (kV)	PD Result	C4 Distance (m)	C2 Distance (m)
#0	Normal specimen	638	No evident PD detected	2.0	5.0
#1	Section #1 short-circuited	638	PD detected; insulation-type	3.5	8.2
#2	Section #2 short-circuited	638	No evident PD detected	3.0	5.3
#3	Section #3 short-circuited	638	No evident PD detected	3.0	8.1
#4	Section #4 short-circuited	638	No evident PD detected	3.0	8.1
#5	Section #5 short-circuited	638	PD detected; floating discharge	3.0	8.5

Table 10. Noise sensitivity of the proposed model under different SNR levels.

SNR (dB)	Accuracy (%)	Macro-F1 (%)
Clean	97.44	97.43
30	95.87	96.57
25	94.56	96.21
20	94.33	95.67

Table 11. Network architecture of the proposed multi-branch CNN–LSTM model.

Module	Input Size	Architecture	Main Settings
Leakage-current branch	512 × 1	2 Conv1D + MaxPooling + Flatten	kernel = 3, stride = 1, filters = 32, 64
Voltage branch	512 × 1	2 Conv1D + MaxPooling + Flatten	kernel = 3, stride = 1, filters = 32, 64
PD branch	256 × 1	2 Conv1D + MaxPooling + Flatten	kernel = 3, stride = 1, filters = 32, 64
Temperature branch	45 × 7 × 1	3 Conv2D + 2 MaxPooling + Flatten	kernel = 3 × 3, stride = 1, filters = 32, 64, 128
Fusion layer	—	Feature concatenation	—
LSTM layer	—	LSTM	hidden units = 48
Output layer	—	Fully connected + Softmax	6 classes

Note: Same padding was used in all convolutional layers. ReLU was adopted as the activation function. The pooling operation denotes max-pooling. The features extracted from the four branches were flattened and concatenated before being fed into the LSTM layer. The number of hidden units in the LSTM layer was optimized by SSA and set to 48.

Table 12. SSA-based hyperparameter search ranges and optimized values for the proposed CNN–LSTM model.

Hyperparameter	Search Range	Optimized Value
LSTM hidden neurons	16–64	48
Batch size	8–32	16
Initial learning rate	10⁻⁴–10⁻²	2.5 × 10⁻³
L2 regularization coefficient	10⁻⁵–10⁻²	5.0 × 10⁻³
Number of epochs	30–80	50

Table 13. Classification performance metrics of the proposed multi-source fusion method on the training and test sets.

	Class	Precision (%)	Recall (%)	F1-Score (%)	Accuracy (%)	Overall Accuracy (%)
Training Set	1	100.00	100.00	100.00	100.00	97.47
	2	100.00	100.00	100.00	100.00
	3	100.00	85.19	92.00	97.47
	4	86.21	100.00	92.59	97.47
	5	100.00	100.00	100.00	100.00
	6	100.00	100.00	100.00	100.00
Test Set	1	100.00	100.00	100.00	100.00	94.23
	2	100.00	100.00	100.00	100.00
	3	100.00	72.73	84.21	94.23
	4	62.50	100.00	76.92	94.23
	5	100.00	100.00	100.00	100.00
	6	100.00	100.00	100.00	100.00

Table 14. Test-set performance comparison of the proposed method and baseline models.

Model	Input Modality	Accuracy (%)	Macro-F1 (%)
CNN–LSTM	Current only	89.34	88.56
CNN–LSTM	Temperature only	90.21	91.46
CNN–LSTM	PD only	90.78	90.48
CNN–LSTM	Voltage only	91.63	90.38
CNN–LSTM (without SSA)	Current + Temperature + PD + Voltage	93.46	92.89
Classical ML baseline	Handcrafted multi-source features	84.37	85.72
Proposed SSA-CNN–LSTM	Current + Temperature + PD + Voltage	94.23	93.67

Table 15. Test-set repeat-run performance of the proposed model and the main fusion baseline over multiple random seeds.

Model	Seeds	Accuracy (%)	Macro-F1 (%)
CNN–LSTM (without SSA)	1, 2, 3, 4, 5	93.46 ± 0.52	92.89 ± 0.48
Proposed SSA-CNN–LSTM	1, 2, 3, 4, 5	94.23 ± 0.41	93.67 ± 0.36

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhong, D.; Cai, Z. Multi-Source Information Fusion for Degradation Assessment of Metal-Oxide Surge Arresters in Power Systems. Energies 2026, 19, 2450. https://doi.org/10.3390/en19102450

AMA Style

Zhong D, Cai Z. Multi-Source Information Fusion for Degradation Assessment of Metal-Oxide Surge Arresters in Power Systems. Energies. 2026; 19(10):2450. https://doi.org/10.3390/en19102450

Chicago/Turabian Style

Zhong, Dantian, and Zhiyuan Cai. 2026. "Multi-Source Information Fusion for Degradation Assessment of Metal-Oxide Surge Arresters in Power Systems" Energies 19, no. 10: 2450. https://doi.org/10.3390/en19102450

APA Style

Zhong, D., & Cai, Z. (2026). Multi-Source Information Fusion for Degradation Assessment of Metal-Oxide Surge Arresters in Power Systems. Energies, 19(10), 2450. https://doi.org/10.3390/en19102450

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.

Article Menu

Multi-Source Information Fusion for Degradation Assessment of Metal-Oxide Surge Arresters in Power Systems

Abstract

1. Introduction

2. Materials and Methods

3. Degradation Characteristics Analysis of UHV Surge Arresters

3.1. Temperature Characteristic Analysis

3.1.1. Temperature Characteristics of a UHV Surge Arrester

3.1.2. Temperature Characteristics of a UHV Surge Arrester Under the Equivalent Fault Condition

3.2. Current Characteristic Analysis

3.3. PD Signal Characteristic Analysis

4. Degradation Assessment of Surge Arresters Based on a Multi-Source Information Fusion Model

4.1. Dataset Construction and Preprocessing

4.2. Fundamental Principles of the Model

4.2.1. CNN Architecture

4.2.2. LSTM Architecture

4.2.3. Fundamental Principle of the SSA Algorithm

4.3. Degradation Assessment: Results and Discussion

4.4. Generalization Validation with Specimen-Level Data Split

4.5. Baseline Comparisons and Ablation Study

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI