1. Introduction
The structure and operating conditions of power transformers are complex. During their manufacturing, transportation, installation, commissioning, and operation, typical insulation defects such as bubbles, burrs, air gaps, surface contamination, and poor contact are prone to occur [
1]. These defects can lead to localized electric field breakdown on the surface or inside the transformer, resulting in partial discharge (PD) phenomena [
2]. PD signal detection can enable early warning and accurate diagnosis of insulation faults in transformers [
3]. However, in practical engineering, the detected PD signals are often contaminated with significant noise interference, which severely affects the effectiveness and accuracy of diagnosis.
To effectively suppress background noise in PD signals, extensive research has been conducted by scholars both domestically and internationally in recent years. Huang et al. [
4] proposed empirical mode decomposition (EMD), which offers a novel adaptive decomposition technique for processing PD signals. However, this method is plagued by severe mode mixing and substantial noise interference. To address these issues, Jin et al. [
5] introduced a PD denoising method for switchgear based on adaptive EEMD, which significantly outperforms traditional wavelet algorithms and EMD methods. Nevertheless, EEMD still exhibits residual noise interference and high computational complexity.
To further improve the mode mixing and spurious components in the decomposition process, scholars have proposed complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN). Sun et al. [
6] employed the CEEMDAN with adaptive noise method, which effectively removed residual white noise interference but failed to fundamentally resolve the mode-mixing problem. For improved mode mixing, the variational mode decomposition (VMD) method proposed by Zosso et al. [
7] achieved effective analysis of complex noise signals by leveraging the characteristics of finite bandwidth and unique central frequency. Building on this, Xia et al. [
8] introduced an improved VMD method for PD signal denoising, yet practical applications still face challenges in parameter selection. Xiao et al. [
9] proposed a method to determine the optimal number of modes
N based on the number of peaks in the signal spectrum, which is primarily aimed at analyzing low-frequency oscillations in power systems and does not fully account for the impact of noise on signals. Therefore, it has limitations in PD signal processing.
Other methods include Gilles et al.’s [
10] proposed empirical wavelet transform (EWT) method, which presets boundary frequencies by analyzing the distribution of peaks in the signal spectrum and constructs orthogonal filter banks to decompose signals by segmenting the spectrum into different bands. Ma et al. [
11] combined the improved empirical wavelet transform (IEWT) with the Fourier spectrum segmentation method, which essentially solved the mode-mixing problem during PD signal decomposition but exhibited drawbacks such as low efficiency in selecting characteristic components and high computational load. Zheng et al.’s [
12] adaptive empirical Fourier decomposition (AEFD) successfully decomposes non-stationary signals, yet requires manual spectrum boundary setting, introducing practical implementation uncertainties.
To address these problems, an improved power spectrum segmentation (IPSK) method based on spectral kurtosis is proposed in this paper. By introducing the improved power spectrum segmentation technology, the narrowband interference and white noise components in multi-source complex PD signals can be effectively filtered out, which improves the problem of insufficient decomposition accuracy of traditional methods in processing multi-source composite PD signals. To verify the effectiveness of the method, this paper selects typical methods such as EWT, AEFD, VMD, and CEEMDAN for comparative analysis, and it proves the superiority of the IPSK method in extracting PD signals.
3. Simulation Analysis
In order to verify the effectiveness of IPSK, two sets of single exponential oscillation attenuation models and double exponential oscillation attenuation models with different oscillation frequencies are set up to construct multi-source PD signals [
15]. To enhance the complexity of the PD signals, two additional sets of attenuation functions were incorporated into one of the models. This signal combines a variety of PD characteristics, including different discharge types and intensities, to simulate the complex discharge conditions that may occur in the actual operation of the transformer. The formula is
In these equations, A represents the signal amplitude, are the decay coefficients, is the oscillation frequency, and is the time of discharge oscillation, which take values of 5 µs, 13 µs, 20 µs, and 30 µs, respectively.
In the simulation experiment, the sampling frequency was set to 20 MHz, and the number of sampling points was 4000. The specific parameters of the PD signal are listed in
Table 1, and the corresponding time-domain waveform is shown in
Figure 5a.
To simulate realistic field measurement conditions, Gaussian white noise with a signal-to-noise ratio (SNR) of −2 dB and narrowband interference were introduced into the generated multi-source complex PD signals. The narrowband interference consists of a combination of sinusoidal signals and amplitude-modulated (AM) signals, mathematically expressed as follows:
The specific time-domain waveform of the noisy multi-source PD signal is shown in
Figure 5b.
To obtain the optimal frequency band segmentation results using the IPSK method for simulated signals, it is necessary to determine the appropriate value of parameter
w. Following the same procedure described in
Section 2.3, we calculated the NPSK as shown in
Figure 6. The results demonstrate that the kurtosis reaches its maximum value when
w = 0.4. Further increasing
w beyond this value shows negligible changes in the NPSK results. Therefore, the optimal parameter
w was determined to be 0.4 for subsequent analysis.
The value of
w obtained is substituted into the threshold
, yielding the segmentation results for IPSK. These results are compared with the spectral segmentation performance of the EWT and AEFD methods. To ensure consistency in the comparison, the segmentation results of IPSK are transformed from the power spectrum to the frequency spectrum, as illustrated in
Figure 7a–c.
As shown in
Figure 7a, the EWT failed to segment the amplitude modulation signal, and it incorrectly divided the frequency bands of the two PD signals into two parts. This segmentation error resulted in significant mode mixing during subsequent time-domain analysis, severely compromising the accuracy of signal feature extraction.
Figure 7b demonstrates that although the AEFD method successfully separated the sinusoidal and amplitude-modulated signals, it committed segmentation errors when processing the second PD signal by incorrectly assigning its right portion to the amplitude-modulated signal’s frequency band. In contrast,
Figure 7c reveals that the IPSK method achieved non-continuous segmentation in the frequency domain, with each segmented band occupying a limited frequency range containing minimal noise. Most importantly, IPSK correctly segmented both PD signals. These comparative results clearly indicate that IPSK provides more rational spectral segmentation than both EWT and AEFD methods, demonstrating a superior performance in processing noisy signals.
In order to feel the spectrum segmentation effect of IPSK more directly, the time-domain waveform reconstruction of each frequency band was carried out, and the results are shown in
Figure 8. It can be seen from the diagram that the reconstructed four components correspond to the PD signal with a frequency of 1 MHz, the sinusoidal narrowband interference signal, the PD signal with a frequency of 10 MHz and the amplitude modulation signal. It can be seen from the diagram that each component is clear, and there is no modal mixing phenomenon, indicating that the method accurately decomposes the multi-source complex PD signal into two independent PD signals.
The segmentation performance of IPSK was comparatively evaluated against four conventional methods (EWT, AEFD, VMD, and CEEMDAN), with their decomposition results presented in
Figure 9a–d. For EWT, the number of modes is set to 4, and the local maxima segmentation mode is used. For AEFD, the initial boundary set is configured according to the spectrum of the original signal as [80, 300, 720], and the adaptive
ε-neighborhood segmentation mode is employed. For VMD, the number of modes is set to 4, and the penalty parameter α is set to 2000.
As shown in
Figure 9a,b, while both EWT and AEFD methods successfully isolated the 1 MHz PD signal in their first components, these components remained significantly contaminated by white noise. Moreover, the 10 MHz PD signal exhibited mode-mixing effects, being dispersed across the second and third components while simultaneously suffering from varying levels of narrowband interference, which substantially compromised signal feature identification.
Figure 9c demonstrates that VMD could relatively clearly extract the 10 MHz PD signal in its second component. However, the 1 MHz PD signal was completely obscured by narrowband interference in the first component, rendering it unrecognizable. The decomposition results of
Figure 9d CEEMDAN show that the fourth component is a 1 MHz PD signal, but it still contains white noise interference and has some distortion. The modal mixing of the other components is serious, and the 10 MHz PD signal cannot be effectively identified. In summary, compared with the EWT, VMD, CEEMDAN, and AEFD methods, IPSK achieves a superior decomposition accuracy while more effectively separating multi-source complex PD signals into two independent PD components.
To quantitatively compare the accuracy of the PD signals extracted by each method, three evaluation metrics are provided: signal-to-noise ratio (SNR), root-mean-square error (RMSE), and normalized correlation coefficient (NCC) [
16]. Their calculation formulas are as follows:
In these formulas,
represents the original signal, and
represents the denoised signal.
and
are the expectations of the signals before and after denoising, respectively.
and
are the standard deviations of the signals before and after denoising, respectively. A higher SNR indicates more effective information contained in the PD signal; an RMSE value closer to 0 means less distortion of the pulse after denoising; and an NCC value closer to 1 indicates higher similarity between the denoised signal and the original signal. The evaluation metrics for each method are shown in
Table 2.
The comparative results in
Table 2 demonstrate that the IPSK method exhibits significant advantages across all evaluation metrics. In terms of the signal-to-noise ratio, IPSK (8.941 dB) shows an approximate 80% improvement over the second-best-performing AEFD (4.973 dB), indicating a superior noise suppression capability. Regarding signal fidelity, its RMSE value (1.311 × 10
−4) is 27.6% lower than the worst-performing VMD, reflecting a higher signal reconstruction accuracy. Meanwhile, IPSK achieves an NCC value of 0.961, approaching the ideal value of 1 and significantly exceeding CEEMDAN’s 0.711, with virtually no waveform distortion observed.
These outstanding performance characteristics primarily stem from the IPSK method’s innovative incorporation of improved power spectrum segmentation technology based on spectral kurtosis. By precisely identifying and separating narrowband interference and white noise components in multi-source composite PD signals, it effectively addresses the insufficient decomposition accuracy of traditional methods when processing complex PD signals.
4. Analysis of Measured Signal
To validate the effectiveness of the proposed method for practical signal analysis, we established a PD detection experimental platform, as illustrated in
Figure 10 [
17] when adjusting the voltage on the test transformer through this platform. When the current flows through the discharge model, partial discharge will occur. To better simulate the complex electromagnetic environment encountered in actual working conditions, the experiment specifically introduced two typical interference models: corona discharge and surface discharge, in order to obtain multi-source composite PD signals that are representative of engineering applications.
Due to significant discrepancies between laboratory conditions and actual operational environments, it is difficult to achieve the noise standards of a transformer during actual operation. Therefore, a periodic sinusoidal narrowband interference signal is added to the experimental signal, which has an amplitude of 9 mV and a frequency of 40 MHz. At the same time, Gaussian white noise with a signal-to-noise ratio of −2 dB is added. The final collected noisy multi-source PD signal is shown in
Figure 11b.
To obtain the frequency band division results of the IPSK method for the actual measured signals, it is necessary to determine the value of the parameter
w. The specific procedure is the same as described in
Section 2.3. The resulting NPSK is shown in
Figure 12. From the figure, it can be observed that when
w = 0.4, the kurtosis reaches its maximum value. Further increasing
w beyond this point results in almost no change in the NPSK value. Therefore, the parameter
w is set to 0.4.
By substituting the obtained
w value into the threshold
, the power spectrum segmentation boundary of the actual measured signal can be obtained, and the EWT, AEFD, and IPSK are compared in terms of spectrum segmentation, as shown in
Figure 13.
Figure 13a reveals that EWT divides the spectrum into three segments. However, the first segment contains only white noise without any valid signal components, representing an invalid partition. While the second segment successfully extracts the first PD signal, and the third segment mixes the second PD signal with narrowband interference. This segmentation approach induces significant mode mixing during subsequent time-domain analysis, severely compromising the accuracy of signal feature extraction. As shown in
Figure 13b, AEFD similarly partitions the spectrum into three segments. The first segment correctly extracts the first PD signal, but the second segment contains minimal PD information with predominant noise, constituting another invalid partition. Like EWT, AEFD’s third segment combines the second PD signal with narrowband interference. In contrast,
Figure 13c demonstrates that IPSK achieves non-contiguous spectral segmentation with well-defined, limited bandwidth intervals containing substantially less noise. This approach effectively eliminates most noise interference while successfully separating both PD signals, demonstrating clear advantages over conventional segmentation methods.
The segmentation performance of IPSK was compared with the EWT, AEFD, VMD, and CEEMDAN methods, with their respective reconstructed time-domain waveforms for each frequency band presented in
Figure 14a–e. For EWT, the number of modes is set to 3, and the local maxima segmentation mode is used. For AEFD, the initial boundary set is configured according to the spectrum of the actual measured signal as [100, 270], and the adaptive ε-neighborhood segmentation mode is employed. For VMD, the number of modes is set to 3, and the penalty parameter
α is set to 2000.
As shown in
Figure 14a, the three components reconstructed by the IPSK method correspond to the corona PD signal, the surface PD signal, and the periodic sinusoidal narrowband interference signal. It can be observed from the figure that the boundaries of each component are clear, the features are distinct, and there is no modal mixing. This indicates that the method accurately decomposes the complex PD signals into two independent PD signals, achieving good results. From
Figure 14b EWT, it can be seen that the first component decomposed by EWT is an invalid component, containing only some modal mixing and white noise. The second component is the corona PD signal, but it is heavily contaminated by white noise. The surface PD signal is located in the third component, but it is completely submerged in narrowband interference. The decomposition results of AEFD in
Figure 14c are similar to those of EWT, both only identifying the corona PD signal and containing invalid components. In
Figure 14d VMD, it can be observed that the corona PD signals are dispersed into the first and second components, where there is narrowband interference and waveform distortion, and the surface PD signals cannot be identified, resulting in the loss of effective information. Regarding
Figure 14e CEEMDAN, the decomposition results indicate that the corona PD signals exhibit modal mixing in the first and second components, and the first component is severely affected by narrowband interference. Furthermore, the surface PD signals are not recognized in the CEEMDAN decomposition results. Thus, through the aforementioned experiments and analysis, the superiority of the IPSK method in processing actual measured signals is further confirmed.